Transmitting Television as Clusters of Frame-to-Frame Differences

Abstract
A number of redundancy reduction techniques are used in a coder that is about eight times more efficient than simple PCM. The coder is capable of transmitting Picturephone® signals at an average rate of one bit per picture-element (2 Megabits per second). When there is movement in the scene, most transmission time is devoted to the parts of the picture that change significantly. The data are generated irregularly but the data flow is smoothed prior to transmission in a buffer that holds about one frame of data. The redundancy reduction techniques used and the behavior of the coder are discussed both from an intuitive and from a statistical viewpoint. The positions of elements that change are signaled by addressing the first element of a run of changes and marking the end of the run with a special code word. The changes of luminance are transmitted as frame-to-frame differences using variable-length code words. When rapid motion makes the buffer more than a quarter full, only differences for every second element are transmitted, the values of the intervening changed elements being set equal to the average of their neighbors. If the buffer continues to fill, the threshold that determines which changes are significant is raised from 4/256 to 7/256 of the maximum signal value. When violent motion causes the buffer to fill completely, replenishment is stopped for about one frame while the buffer empties. Subsampling and raising the threshold are not objectionable because viewers rarely detect the small impairments introduced in moving images. Observers are critical, however, of small impairments in stationary scenes. Thus, to maintain high quality in stationary areas, the entire picture is forcibly updated every three seconds by transmitting 8-bit luminance values for three lines of every frame. A record of the coder's behaviour is available as a 16-milimeter movie film.