TELEVISION SIGNAL TRANSMISSION SYSTEM WITH TWO LEVELS OF SYMBOL ENCODING
The present invention generally relates to a system for digitally transmitting a video signal and particularly concerns a system for adaptively improving the signal to noise performance of a digitally transmitted video signal.
A number of systems have recently been proposed for digitally transmitting a wideband HDTV signal over a standard 6 MHz television channel. These systems typically initially compress a wideband video source signal and then transmit the compressed signal over a selected 6 MHz television channel at a fixed data rate. While various video compression techniques are known in the art, temporally oriented systems seem to be presently preferred. In such systems, the compressed video information may be transmitted in the form of motion vectors and difference signals, each motion vector identifying a previous portion of the video signal closely matching the current portion and the difference signal representing the difference between the two portions. Transmission of the motion vector and difference signals may be effected, for example, by encoding the binary data representing the two signals as a series of N-level symbols, each symbol thereby representing N/2 bits of binary data. Each symbol in a 4-level transmission system would therefore represent two bits of binary data. Digital video transmission systems of the foregoing type provide for extremely accurate reproduction of the transmitted image under ideal conditions and can even tolerate a certain degree of interference without appreciable performance degradation. However, if the level of interference encountered exceeds a threshold representing the signal-to-noise performance of the system, the ability to reproduce the televised image may
be entirely lost. This is in contrast to an analog transmission system in which performance is gradually degraded as a function of interference. In the HDTV digital transmission environment, one source of such interference is expected to be NTSC co-channel signals from nearby television service areas.
It is therefore a basic object of the present invention to provide an improved digital video signal vtransmission system.
It is a more specific object of the invention to provide a digital video signal transmission system capable of operating in a relatively noisy environment.
It is another object of the invention to provide a digital video signal transmission system which is operable at different levels of signal-to-noise performance.
It is yet a further object of the invention to provide a digital video signal transmission system in which first selected video information is transmitted at a first signal-to-noise ratio and other video information is transmitted at a second smaller signal-to-noise ratio.
It is still another object of the invention to adaptively vary the amount of video information transmitted at the first and second signal-to-noise ratios.
These and other objects and advantages of the invention will be apparent upon reading the following description in conjunction with the drawings, in which:
Figure 1 is a block diagram of a video signal encoder and transmitter constructed in accordance with the invention;
Figures 2A and 2B are respective graphs useful in explaining the operation of the encoder of Figure 1;
Figure 3A illustrates a technique for formatting video* data according to a preferred embodiment of the invention;
Figures 3B and 3C illustrate the manner in which different segments of the video data of Figure 3A are
encoded for transmission at different levels of signal-to-noise performance according to a preferred embodiment of the invention; and
Figure 4 is a block diagram of a video signal receiver constructed in accordance with the invention.
Figure 1 illustrates a video signal transmitter constructed in accordance with the invention. In its presently preferred embodiment, the transmitter •incorporates a temporally based video compression system and utilizes a suppressed carrier, vestigial sideband form of transmission. However, it is to be understood that the invention may be used with other types of video compression and transmission systems so that the details thereof should not be viewed as to impose any unnecessary limitations on the invention.
Referring therefore in more detail to Figure 1, a video source 10 provides a wideband digital video signal which may have a bandwidth of up to about 37 MHz. Although not necessarily limited thereto, source 10 may provide, for example, a progressively scanned video signal in the form of successive frames of binary video data having a vertical periodicity equal to the NTSC standard (i.e. about 59.4 Hz) and a horizontal periodicity equal to three times the NTSC standard (i.e. about 47.25 KHz). The video data is applied in the form of a stream of binary pixel values from video source 10 to a transform coder 12 which may, for example, implement a discrete cosine transform. More specifically, transform coder 12 processes the binary pixel values for providing a series of blocks of spectral transform coefficients for each frame of the video source signal, each block of coefficients corresponding to a respective spatial region of the video image. In a presently preferred embodiment of the invention, each block comprises an (8 X 8) array of coefficients, with 14,400 blocks representing an entire video frame.
The coefficient blocks developed by transform coder 12 are applied one at a time to a perceptual modeling system 14 which develops an output reflecting the perceptual nature of the corresponding portion of the video image. Perceptual modeling system 14, a preferred embodiment of which is more fully described in copending U.S. application Serial No. 653,560, filed February 8, 1991, for example, develops an output reflecting the degree to which the corresponding block of the video image is characterized by such perceptual characteristics as edge, texture, etc. A video compression system 16 is responsive to the output of perceptual model 14 for compressing the video signal developed at the output of source 10 in accordance with a selected compression algorithm. In a preferred embodiment of the invention, video compression is affected in the temporal domain wherein, for example, successive portions of the video image are analyzed to derive a respective motion vector and difference signal. The motion vector identifies a closely matching group of video pixels from the previous frame and the difference signal represents the difference therebetween. The difference signal is preferably variably quantized in response to the output of perceptual model 14 in the form of, for example, either a 0, 3, 5 or 7 average bit-length code and multiplexed with the motion vector to provide an output compressed binary video data signal.
The compressed signal developed at the output of compressor 16 is applied to the input of a variable rate buffer 18. Buffer 18 controls the flow of video data in the transmitter by providing a feedback signal to perceptual model 14 which reflects the fullness of the buffer. In particular, perceptual model 14 is responsive to the feedback signal for controlling compressor 16 in a manner so as to vary the degree of compression effected thereby as a function of the fullness of the buffer.
Thus, as the fullness of buffer 18 increases, the extent of compression may likewise be increased and as fullness decreases the extent of compression may also be decreased to provide a relatively continuous flow of video data through the transmitter. The effect of the foregoing is illustrated in Figure 2A which plots the degree of error introduced by compressor 16 as a function of buffer fullness. As shown in the graph, since the degree of compression varies directly as a function of buffer fullness, the error level likewise varies directly with buffer fullness. That is, increased compression provides an increased error level and decreased compression provides a decreased error level. In this regard, it has been determined that error levels produced when buffer fullness exceeds a first threshold Tl are excessive and that error levels produced when buffer fullness is less than a second threshold T2 are less than a normally expected range of errors. Error levels between buffer fullness thresholds Tl and T2 are within a normally expected range.
With further reference to Figure 1, the compressed video data is coupled by buffer 18 to data sorter and formatter unit 20. As described previously, the video data may comprise a combination of both motion vectors and variably quantized difference signals, although other classes of data may also be provided. Unit 20 sorts the various classes of data (e.g. motion vectors and difference signals) according to their relative importance and likewise ranks the data within each respective class. Thus, for example, the data applied to or developed in unit 20 may include various global control data necessary for proper operation of the receivers in the system. This class of data is therefore assigned the highest level of importance. The motion vectors, which are considered more important than the difference signals to the accurate reproduction of the televised image, are
assigned the next highest level of importance and the difference signals the lowest relative level of importance. Unit 20 next formats the data into a transmission frame, an exemplary embodiment of which is illustrated in Figure 3A, according to the foregoing classification as well as the inter-classification ranking of the data. The exemplary transmission frame shown in Figure 3A is substantially coextensive with an NTSC field .and comprises a plurality of data segments 22 each corresponding to an NTSC horizontal line.
In accordance with the invention, the most important class of data, i.e. the control data in the foregoing example, is grouped in one or more initial data' segments 24, the next most important class, i.e. the motion vectors, in a succeeding group 26 of data segments 22 and finally the difference signals in a final group 28 of data segments 22. As mentioned above, the data within each group 24, 26, 28 of data segments 22 is likewise ranked according to its relative importance. Thus, for example, the more important motion vectors and difference signals are provided in the initial data segments of their respective groups, while the motion vectors and difference signals of lesser importance are provided in the lower data segments of the groups. Also, it will be understood that the number of motion vectors and difference signals may vary from frame to frame so that the number of data segments comprising groups 26 and 28 may likewise vary. The formatted data frames developed by unit 20 are supplied to a level encoder 30 which is operated in response to an encoder 32. Encoder control 32 is in turn responsive to the buffer fullness signal developed by buffer 18. Level encoder 30 is operable for encoding each segment 22 of the transmission frame in the form of about 684 four or two level symbols, each four-level symbol representing two binary data bits and each two-level symbol representing one binary data bit. The output of
SUBSTITUTE SHEET
level encoder 30 thus comprises a successive series of data segments 22 each characterized by a fixed symbol rate of approximately 10.5 megasy bols per second and a data rate of either 21 megabits per second (for four-level symbols) or 10.5 megabits per second (for two-level symbols) . The foregoing is illustrated in Figures 3B and 3C which respectively represent a four-level data segment 22 and a two-level data segment 22. Referring to Figure , 3B, it will be seen that the data segment comprises 684 four-level symbols representing 171 bytes of data. Each of the data symbols, which are equally spaced by about 92 nanoseconds, assume one of the four exemplary levels 0, 1, 2 or 3 for providing a fixed symbol rate of about 10.5 megasymbols per second and a corresponding data rate of about 21 megabits per second. The data segment of Figure 3C also comprises 684 symbols but only represents about 85 bytes of data corresponding to a data rate of about 10.5 megabits per second. According to an important aspect of the invention, the levels characterizing the symbols of the data segment of Figure 3C are selected from among the levels used to encode the symbols of the data segment ofFigure 3B. In particular, the symbols of the data segment of Figure 3C are encoded using the levels 0 and 2 of Figure 3B. These levels were selected as representing the best compromise between maintaining a reasonably large difference between the levels while, at the same time, minimizing interference of the transmitted signal into a nearby co-channel.
It will be appreciated that the two-level symbols of the data segment of Figure 3C, while providing half the data rate of the data segment of Figure 3B, provide a noise improvement in relation thereto of about 9.5 db. According to the invention, this improvement in noise performance is advantageously utilized by encoding certain of the data comprising segments 22 as two-level symbols in accordance with Figure 3C and other of the data as
SUBSTITUTE SHEET
four-level symbols in accordance with Figure 3B. The division of encoding levels may be set on a fixed basis, may be derived on an adaptively varying basis, or may comprise a combination of both techniques.
In a preferred embodiment of the invention, level encoder 30 is controlled by encoder control 32 to always encode certain data segments 22 of each transmission frame as two-level symbols, other segments 22 as two-level . symbols on an adaptively varying basis and the remaining segments 22 as four-level symbols. In particular, referring to Figure 3A, a pair of partitions 34 and 36 are established for defining the symbol levels to be used for encoding the various data segments. Partition 34 is a fixed partition, while partition 36 is variable between fixed partition 34 and the end of the frame. Data segments above fixed partition 34 are always encoded as two-level symbols, data segments between fixed partition 34 and variable partition 36 as two-level symbols and data segments below variable partition 36 as four-level symbols. In accordance with a preferred embodiment of the invention, fixed partition 34 is selected to define the global control data of data segment group 24 such that this class of data is always transmitted as robust two-level symbols. Variable partition 36 is controlled as a function of the fullness of buffer 18 as illustrated by the nominal curve 40 of Figure 2B. That is, at levels of buffer fullness below fullness threshold T2 variable partition 36 is adjusted downwards to the bottom of the frame and at levels of buffer fullness above fullness threshold Tl variable partition 36 is adjusted upwards to fixed partition 34. For levels of buffer fullness between thresholds Tl and T2, the variable partition is established by encoder control 32 between these two extremes such that the transmission frames are encoded at an average data rate reflected by the portion of curve 40 between thresholds Tl and T2.
SUBSTITUTE SHEET
In accordance with the foregoing, as much video data as can be accommodated by the system, in particular the most important portions thereof, is encoded in a robust manner as relatively noise-immune two-level symbols, the remaining data being encoded with reduced noise performance as four-level symbols. Moreover, increases in buffer fullness are compensated for by increasing the data rate which results from raising ••variable partition 36 in the transmission frame. Likewise, decreases in buffer fullness are advantageously utilized by lowering the variable partition and thereby reducing the data rate.
In order to facilitate decoding of the two and four-level symbols in a receiver, encoder control 32 inserts a count in the first segment of each transmission frame identifying the number of segments in each frame comprising two-level symbols. Since the count is above fixed partition 34 it will always be encoded as two-level symbols. This count, together with the remaining encoded symbols are coupled to a transmitter 42 for transmission over a 6 MHz television channel. Although not necessarily limited thereto, transmitter 30 preferably comprises the vestigial sideband, suppressed carrier transmitter described in copending application Serial No. 611,236, filed November 9, 1990.
Nominal curve 40 of Figure 2B may be adjusted by the broadcaster in a tradeoff between the quality of the broadcast and the extent of geographical coverage provided thereby. For example, if the broadcaster desires to provide a higher quality broadcast, curve 40' of Figure 2B may be employed to control level encoder 30 instead of curve 40. Curve 40' results in a tendency to raise variable partition 36 whereby an increased number of data segments are encoded as four-level symbols to increase the average transmitted data rate. While this improves the quality of the broadcast image, the extent of geographical
SUBSTITUTE SHEET
coverage is reduced as a consequence of the reduction in the average signal-to-noise ratio of the transmitted signal. On the other hand, if the broadcaster desires to increase the geographical coverage of the broadcast, curve 40" of Figure 2B may be employed to control level encoder 30. Curve 40" results in a tendency to lower variable partition 36 whereby an increased number of data segments are encoded as two-level symbols resulting in a decreased » average transmitted data rate. This provides a more robust signal having increased geographical coverage but at a reduced quality level.
With reference now to Figure 4, the transmitted signal is received by a receiver including a tuner 50. Tuner 50 converts the received signal to an intermediate frequency (IF) signal which is applied to a synchronous detector 52 and to a frequency and phase lock loop (FPLL) 54. FPLL 54 locks to the applied IF signal and generates a continuous wave output for application to synchronous detector 52. FPLL 54 may also may also generate an automatic frequency control voltage for controlling the operation of tuner 50. Synchronous detector 52, in response to the output of FPLL 54, demodulates the received signal and applies the demodulated signal to the inputs of a data decoder 56 and a clock and sync recovery circuit 58. The demodulated count signal identifying the number of segments of each received frame encoded as two-level symbols is coupled to a second input of data decoder 56. Circuit 58 applies a clock signal to a third input of data decoder 56 and appropriate sync signals for controlling a display 60.
Data decoder 56 is responsive to the count and clock signals for decoding the demodulated segments of two and four-level symbols coupled thereto. The decoded signal is applied through a buffer 62 as a fixed rate binary signal to a video processor 64 which decompresses the received signal to provide a wideband video output
SUBSTITUTE S EET
signal. It will be understood that since at least the most important portions of the received video signal were transmitted and received as relatively noise-immune, two-level symbols, highly accurate reproduction of corresponding portions of the image is achieved. Depending on the level of noise encountered during transmission, the remaining four-level symbols may be reproduced with the same or less accuracy. Finally, the » decoded video data developed by processor 64 is applied through a D/A converter 66 to display 60 for reproducing the televised image.
In a further embodiment, it has been found advantageous to transmit the four-level symbols (see Fig. 3B) at the respective levels +4, +2, 0 and -2. Also, as an alternative to the transmission of the two-level symbols at levels selected from among the levels used to encode the four-level symbols as previously described (see Fig. 3C) , certain benefits may be realized by transmitting the two-level symbols at respective values each disposed substantially midway between a selected pair of adjacent values of the four-level symbols. In this case, the two-level symbols may be transmitted at the levels +3 and -1. Among other benefits and assuming that the transmitted video data is random in nature, this results in a transmission whose time average value is the same in both modes. That is, in the four-level mode the average transmission level is (+4+2+0-2)/4=l and in the two-level mode the average transmission level is (+3-1)/2=1. Of course, the receiver in this embodiment would be adapted for distinguishing the six levels instead of the four levels in the previous embodiment.
While particular embodiments of the present invention have been shown and described, it will be obvious to those skilled in the art that changes and modifications may be made without departing from the invention in its boarder aspects. Therefore, the aim in
SUBSTITUTE SHEET
the appended claims is to cover all such changes and modifications as fall within the true spirit and scope of the invention. The matter set forth in the foregoing descriptions and accompanying drawings is offered by way of illustration only and not as a limitation. The actual scope of the invention is intended to be defined in the following claims when viewed in their proper perspective based on the prior art.