WO2006062330A1 - Method and apparatus for encoding, transmitting, and decoding a video signal - Google Patents

Method and apparatus for encoding, transmitting, and decoding a video signal Download PDF

Info

Publication number
WO2006062330A1
WO2006062330A1 PCT/KR2005/004147 KR2005004147W WO2006062330A1 WO 2006062330 A1 WO2006062330 A1 WO 2006062330A1 KR 2005004147 W KR2005004147 W KR 2005004147W WO 2006062330 A1 WO2006062330 A1 WO 2006062330A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
sequence layer
layer
base
picture sequence
Prior art date
Application number
PCT/KR2005/004147
Other languages
English (en)
French (fr)
Inventor
Seung Wook Park
Ji Ho Park
Byeong Moon Jeon
Original Assignee
Lg Electronics Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020050049897A external-priority patent/KR20060063605A/ko
Application filed by Lg Electronics Inc. filed Critical Lg Electronics Inc.
Priority to CN200580041787.5A priority Critical patent/CN101103633B/zh
Priority to EP05819115A priority patent/EP1820352A4/en
Publication of WO2006062330A1 publication Critical patent/WO2006062330A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/53Multi-resolution motion estimation; Hierarchical motion estimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/65Transmission of management data between client and server
    • H04N21/658Transmission by the client directed to the server
    • H04N21/6587Control parameters, e.g. trick play commands, viewpoint selection
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/164Feedback from the receiver or from the transmission channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/179Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scene or a shot
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/187Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/36Scalability techniques involving formatting the layers as a function of picture distortion after decoding, e.g. signal-to-noise [SNR] scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234363Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution
    • H04N21/234372Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution for performing aspect ratio conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/23439Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/23805Controlling the feeding rate to the network, e.g. by controlling the video pump
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • H04N21/26208Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists the scheduling operation being performed under constraints
    • H04N21/26216Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists the scheduling operation being performed under constraints involving the channel capacity, e.g. network bandwidth
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/414Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41407Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/173Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309Transmission or handling of upstream communications
    • H04N7/17318Direct or substantially direct transmission and handling of requests

Definitions

  • the present invention relates to a method and apparatus for encoding and transmitting a video signal according to a scalable scheme, a method and apparatus for decoding such an encoded data stream, and the encoded data stream.
  • Scalable Video Codec is a method which encodes video into a sequence of pictures with the highest image quality while ensuring that part of the encoded picture sequence (specifically, a partial sequence of frames intermittently selected from the total sequence of frames) can be decoded to represent the video with a lower image quality.
  • Motion Compensated Temporal Filtering MCTF is an encoding scheme that has been suggested for use in the scalable video codec.
  • One solution to this problem is to hierarchically and additionally provide an auxiliary picture sequence for low bitrates, for example, a sequence of pictures that have a small screen size and/or a low frame rate.
  • One example is to encode and transmit 4CIF (Common Intermediate Format) , CIF, and QCIF (Quarter CIF) picture sequences of a video signal to a decoding apparatus as shown in Fig. 1.
  • Such picture sequences have redundancy since the same video signal source is encoded into the sequences.
  • one method entails inter-sequence prediction of video frames in a higher sequence from video frames in a lower sequence temporally coincident with the video frames in the higher sequence, so as to reduce the amount of coded information of the higher sequence, as illustrated in Fig. 1.
  • the original or base layer of a lower sequence may be used to predictively encode an original or base layer of a higher sequence.
  • the resulting encoded sequence is referred to as a base or original layer.
  • the quantization causes an information loss in the base or original layer.
  • the encoder 20 k of each sequence performs inverse quantization 201 k and inverse transformation 202 k to reconstitute the sequence prior to DCT and quantization.
  • a difference is then obtained between the actual sequence prior to DCT and quantization and the reconstituted sequence. This difference represents data lost during the DCT and quantization process.
  • the residual sequence layer data may undergo the same process to produce a higher level of SNR enhancement layer data, and higher levels of SNR enhancement layer data may also undergo the same process to obtain still higher levels of residual sequence layer data.
  • the various levels of SNR enhancement data will be collectively referred to as SNR enhancement layer data or residual sequence layer data.
  • the SNR enhancement layer data is provided such that the image quality can be gradually improved by increasing the decoding level of the SNR enhancement layer data, which is referred to as Fine Grained Scalability. Namely, the more levels of residual sequence layer data that are decoded and added to the associated base layer, the higher the quality of the resulting image. Because the number of levels of SNR enhancement layer data is controllable or selectable, these fine grain improvements in quality are scalable; hence the name, Fine Grained Scalability or FGS .
  • an extractor 22 transmits a stream selected depending on transmission channel bandwidth and the type of a sequence currently requested by the decoding apparatus. For example, as shown in Fig. 3, when the decoding apparatus currently requests a CIF sequence and the transmission channel bandwidth permits, data 301 of an SNR base layer of a QCIF sequence, data 302 of an SNR base layer of a CIF sequence, data 303 of an SNR enhancement layer of the QCIF sequence, and data 304 of an SNR enhancement layer of the CIF sequence are, in the named order, extracted in specific units from a storage unit 21 and a data stream including such extracted data is then transmitted.
  • the enhancement layers of the sequences are transmitted after all the base layers of the sequences are transmitted, and the sequences in each layer are transmitted in increasing order of their transfer rates. If the transmission channel bandwidth is reduced during transmission, the extractor 22 transmits only up to a transmittable bit, thereby failing to transmit the subsequent bitstream in each transmission segment 310. For example, in the case of Fig. 3, part of the data bitstream, starting from high-precision error compensation data part (i.e. , the LSB of the error compensation data) of the SNR enhancement layer of the CIF sequence, is not transmitted.
  • the above method which sequentially transmits sequences in increasing order of their transfer rates, may unnecessarily occupy the transmission channel due to transmission of unnecessary data, which is not used by the decoding apparatus.
  • the decoding apparatus decodes only the CIF sequence to display video to the user in the example of Fig. 3, the SNR enhancement layer data of the QCIF sequence is transmitted although the SNR enhancement layer data is not used while the SNR base layer data of the QCIF sequence is used for prediction of the SNR base layer frame of the CIF sequence.
  • the SNR enhancement layer data of the QCIF sequence is transmitted although it actually makes no contribution to improving the image quality, whereas the amount of transmitted data of the enhancement layer of the CIF sequence is reduced although it directly contributes to improving the image quality.
  • the present invention relates to a method and apparatus for encoding, transmitting and decoding a video signal.
  • a method of decoding a video signal at least a portion of a picture in a first picture sequence layer is decoded based on a second picture sequence layer if an indicator in the video signal indicates inter-layer prediction coding.
  • the second picture sequence layer may have a lower frame rate than the first picture sequence layer, may have a bitrate less than a bitrate of the first picture sequence layer, may have a picture resolution less than the first picture sequence layer, and/or may have a picture display size less than the first frame sequence.
  • the picture in the first picture sequence layer is a base picture, where a base picture has a base level of quality for the first picture sequence layer.
  • the decoding step may include improving the quality level of the decoded base picture using enhancement layer picture information associated with the base picture.
  • a value of the indicator greater than zero indicates inter-layer prediction coding for the base picture.
  • a portion of a picture in a first picture sequence layer is decoded based on at least a portion of a second picture sequence layer base picture in a second picture sequence layer and enhancement layer picture information associated with the second picture sequence layer base picture according to a quality level represented by an indicator in the video signal .
  • the second picture sequence layer base picture has a base level of quality for the second picture sequence layer, and the enhancement layer picture information associated with the second picture sequence layer base picture provides information to improve the quality level of the second picture sequence layer base picture.
  • the second picture sequence layer base picture may be decoded based on the enhancement layer picture information according to the quality level represented by the indicator to produced an enhanced picture, and the portion of the picture in the first picture sequence layer may be decoded based on the enhanced picture.
  • a decoder decodes at least a portion of a picture in a first picture sequence layer based on a second picture sequence layer if an indicator in the video signal indicates inter-layer prediction coding.
  • At least a portion of a picture in a first picture sequence layer is encoded based on a second picture sequence layer and an indicator in the video signal is set to indicate inter-layer prediction coding of the picture in the first picture sequence layer.
  • an encoder encodes at least a portion of a picture in a first picture sequence layer based on a second picture sequence layer and sets an indicator in the video signal to indicate inter-layer prediction coding of the picture in the first picture sequence layer.
  • a bitstream representing a video signal has a data structure, includes a first stream portion representing at least a portion of a picture in a first picture sequence layer encoded based on a second picture sequence layer, and includes an indicator to indicate inter-layer prediction coding of the picture in the first picture sequence layer.
  • Fig. 1 illustrates an example of sequences having different screen sizes and/or different frame rates into which a video signal is encoded through inter-sequence prediction
  • Fig. 2 is a block diagram of an apparatus for encoding the video signal into the sequences as shown in Fig. 1 and transmitting the sequences;
  • Fig. 3 illustrates a transmission format of data that the encoding apparatus of Fig. 2 extracts and transmits upon receiving a CIF sequence transmission request from a decoder
  • Fig. 4 is a block diagram of an apparatus for encoding a video signal into sequences having different screen sizes and/or different frame rates through inter-sequence prediction of the video signal according to an embodiment of the present invention
  • Fig. 5 illustrates sequences encoded by the apparatus of Pig. 4 and a transmission format of data that the apparatus of Pig. 4 ' extracts and transmits from the encoded sequences according to an embodiment of the present invention
  • Fig. 6 illustrates a transmission format of data extracted from the encoded sequences according to another embodiment of the present invention.
  • Fig. 7 is a block diagram of an apparatus for decoding a data stream encoded by the apparatus of Fig. 4.
  • Features, elements, and aspects of the invention that are referenced by the same numerals in different figures represent the same, equivalent, or similar features, elements, or aspects in accordance with one or more embodiments.
  • Fig. 4 is a block diagram of a video signal encoding apparatus to which an encoding and transmission method according to the present invention is applied.
  • the video signal encoding apparatus of Fig. 4 is similar in structure to the apparatus of Fig. 2. However, sequence encoders 40 k and an extractor 42 in the apparatus of Fig. 4 have different features from those of the apparatus of Fig. 2. The video signal encoding apparatus of Fig. 4 will now be described in detail, focusing on the sequence encoders 40 k and the extractor 42.
  • Each of the encoders 4O 2 and 4O 3 of lower picture sequences having different picture or display sizes (e.g., different resolution) and/or different frame rates provide not only data of an SNR base layer but also data of an SNR enhancement layer
  • each of the encoders 40i and 4O 2 of the higher sequences uses video frames reconstructed by using both the SNR base layer data and the associated SNR enhancement layer data of the lower sequence when performing inter-sequence prediction of video frames present in the sequence produced by each of the encoders 4O 1 and 4O 2 (S500) .
  • the level of the SNR enhancement layer data to be used for video reconstruction is determined and set in each encoder 40 k based on conditions, such as an image quality to be provided and a secured transmission channel capacity.
  • This level is then indicated by inserting a field or flag prediction_SNR_level inserted in headers (for example, slice or picture headers) of the encoded SNR base layer data stream so that the level value is transferred to a decoder.
  • the prediction_SNR_level value indicates the level value.
  • the prediction_SNR_level is also set in the extractor 42 of the encoding apparatus of Fig. 4. From each encoded data stream 501 stored in a storage unit 41, the extractor 42 extracts and transmits data for a picture sequence currently requested by a decoding apparatus.
  • Fig. 5 shows an arrangement of data units of each data stream segment transmitted when the decoding apparatus requests a CIF sequence and bandwidth required for the transmission channel is available.
  • the extractor 42 For CIF sequence transmission, the extractor 42 first arranges a data unit aa of the SNR base layer of its lower (i.e. , QCIF) sequence and subsequently arranges data units ab, ac, ad, ae and af up to the set prediction_SNR_level from among data units ab to ah of the SNR enhancement layer of the QCIF sequence. The extractor 42 subsequently arranges a data unit ba of the SNR base layer of the CIF sequence and data units bb, be, bd, be and bf up to the set prediction_SNR_level from among data units bb to bh of the SNR enhancement layer of the CIF sequence.
  • QCIF lower
  • the extractor 42 arranges remaining data units ag and ah of the SNR enhancement layer of the QCIF sequence, subsequent to the data units bb to bf, and arranges remaining data units bg and bh of the SNR enhancement layer of the CIF sequence, subsequent to the remaining data units ag and ah, and then transmits the arranged data stream.
  • the remaining data units ag and ah of the SNR enhancement layer of the QCIF sequence are not used when the video of the CIF sequence is presented.
  • the remaining data units ag and ah of the SNR enhancement layer of the QCIF sequence, which are not used in the prediction operation, are arranged and transmitted in the data stream when the transmission channel bandwidth permits because the user may view video of the QCIF sequence using a device having a low decoding capability such as a mobile phone after storing the data transmitted from the extractor 42.
  • the extractor 42 may arrange the remaining data units bg and bh of the SNR enhancement layer of the CIF sequence, subsequent to the data units bb to bf of the SNR enhancement layer of the CIF sequence up to the set prediction_SNR_level . Then the extractor 42 may arrange the remaining data units ag and ah of the SNR enhancement layer of the QCIF sequence at the end of the data stream, and transmit the arranged data stream.
  • part of the data stream starting from the SNR enhancement layer data of the QCIF sequence that makes no contribution to improving the image quality of currently decoded video, is not transmitted. If the channel conditions are further deteriorated, part of the data stream is not transmitted in the order from data that slightly increases the SNR of video to data that greatly increases the SNR of video. That is, the image quality of decoded video is resistant to variations in the channel conditions, as compared to the conventional transmission method.
  • the prediction_SNR_level is set to zero, SNR enhancement layer data of a lower sequence is not used for prediction of frames of a higher sequence, so that the SNR enhancement layer data of the lower sequence is not transmitted. Accordingly, a non-zero value of the prediction_SNR_level indicates that inter-layer prediction has taken place, while a zero value indicates no inter-layer prediction.
  • data of an SNR enhancement layer of a currently selected sequence is arranged and transmitted in a transmission segment and data of an SNR enhancement layer of a lower sequence is subsequently arranged and transmitted in the transmission segment .
  • FIG. 6 An example of such a case is illustrated in Fig. 6, in which a CIF sequence is selected so that data of an SNR enhancement layer of a QCIF sequence is not transmitted in a data stream. Even if the data of the SNR enhancement layer of the QCIF sequence is transmitted, it is arranged and transmitted at the end of the data stream (601) .
  • Fig. 7 is a block diagram of an embodiment of an apparatus for decoding a data stream encoded and transmitted by the apparatus of Fig. 4.
  • the decoding apparatus of Fig. 7 receives a plurality of sequences and decodes a higher sequence into a video signal, and includes a demuxer (or demultiplexer) 70, a main decoder 71, and a sub-decoder 72.
  • the demuxer 70 separates a received data stream into a data stream of a main sequence and a data stream of a sub-sequence.
  • the main decoder 71 converts the data stream of the separated main sequence (for example, a CIF sequence) back to an original video signal according to an MCTF scheme.
  • the sub-decoder 72 decodes the data stream of the separated sub-sequence (for example, a QCIF sequence) according to a specified scheme, for example, according to the MPEG4 or H.264 standard.
  • the main decoder 71 reads the prediction_SNR_level described above from a header of the input data stream and notifies the sub-decoder 72 of the prediction_SNR_level .
  • the notification of prediction_SNR_level between the decoders is not necessary in an embodiment where the prediction_SNR_level is recorded and transmitted in each of the sequences.
  • the sub-decoder 72 When decoding the received data stream of the sub-sequence, the sub-decoder 72 decodes SNR base layer data which may be included, together with the SNR enhancement layer, in the received data stream. Then, the sub-decoder 72 provides the main decoder 71 with frames that are decoded to improve the image quality of video using data up to the notified prediction_SNR_level, from among SNR enhancement layer data included in the received data stream of the sub-sequence.
  • the main decoder 71 decodes frames in the received main sequence, for which frames in the sub-sequence are used as their predictive images, into original video signals based on images predicted from frames provided from the sub-decoder 72 or, if needed, from scaled versions of these frames.
  • the decoding apparatus described above may be incorporated into a mobile communication terminal, a media player, or the like.
  • an apparatus and method for encoding and decoding a video signal performs inter-sequence prediction using video frames reconstructed by additionally using error compensation data (e.g. , SNR enhancement layer data or residual sequence layer data) , thereby improving the image quality relative to the amount of coded data.
  • error compensation data e.g. , SNR enhancement layer data or residual sequence layer data
  • the apparatus and method also arrange and transmit encoded data units sequentially starting from data units which greatly affect the image quality of a sequence that currently needs to be decoded, thereby making the image quality less sensitive to variations in the channel capacity. Also, the transfer rate may be reduced to more efficiently allocate the transmission channel .

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Engineering & Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
PCT/KR2005/004147 2004-12-06 2005-12-06 Method and apparatus for encoding, transmitting, and decoding a video signal WO2006062330A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN200580041787.5A CN101103633B (zh) 2004-12-06 2005-12-06 用于编码、传送及解码视频信号的方法和装置
EP05819115A EP1820352A4 (en) 2004-12-06 2005-12-06 METHOD AND APPARATUS FOR ENCODING, TRANSMITTING AND DECODING A VIDEO SIGNAL

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US63297304P 2004-12-06 2004-12-06
US60/632,973 2004-12-06
KR10-2005-0049897 2005-06-10
KR1020050049897A KR20060063605A (ko) 2004-12-06 2005-06-10 영상신호의 엔코딩과 그 전송, 그리고 디코딩을 위한 방법및 장치

Publications (1)

Publication Number Publication Date
WO2006062330A1 true WO2006062330A1 (en) 2006-06-15

Family

ID=36578119

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2005/004147 WO2006062330A1 (en) 2004-12-06 2005-12-06 Method and apparatus for encoding, transmitting, and decoding a video signal

Country Status (3)

Country Link
EP (1) EP1820352A4 (ko)
KR (1) KR100880639B1 (ko)
WO (1) WO2006062330A1 (ko)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2931025A1 (fr) * 2008-05-07 2009-11-13 Canon Kk Procede de determination d'attributs de priorite associes a des conteneurs de donnees, par exemple dans un flux video, procede de codage, programme d'ordinateur et dispositifs associes
US8243789B2 (en) 2007-01-25 2012-08-14 Sharp Laboratories Of America, Inc. Methods and systems for rate-adaptive transmission of video

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0818934A (ja) * 1994-06-30 1996-01-19 Matsushita Electric Ind Co Ltd 画像通信装置
JPH0846928A (ja) * 1994-07-27 1996-02-16 Matsushita Electric Ind Co Ltd 画像符号化前処理装置
JPH08289296A (ja) * 1995-04-10 1996-11-01 Oki Electric Ind Co Ltd 画像通信システムおよびその伝送方法
US6091775A (en) * 1997-04-17 2000-07-18 Sharp Kabushiki Kaisha Video-coding device and video-decoding device
JP2003125407A (ja) * 2001-10-15 2003-04-25 Matsushita Electric Ind Co Ltd 画像復号装置、画像復号方法および画像復号プログラム

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7463683B2 (en) * 2000-10-11 2008-12-09 Koninklijke Philips Electronics N.V. Method and apparatus for decoding spatially scaled fine granular encoded video signals

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0818934A (ja) * 1994-06-30 1996-01-19 Matsushita Electric Ind Co Ltd 画像通信装置
JPH0846928A (ja) * 1994-07-27 1996-02-16 Matsushita Electric Ind Co Ltd 画像符号化前処理装置
JPH08289296A (ja) * 1995-04-10 1996-11-01 Oki Electric Ind Co Ltd 画像通信システムおよびその伝送方法
US6091775A (en) * 1997-04-17 2000-07-18 Sharp Kabushiki Kaisha Video-coding device and video-decoding device
JP2003125407A (ja) * 2001-10-15 2003-04-25 Matsushita Electric Ind Co Ltd 画像復号装置、画像復号方法および画像復号プログラム

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1820352A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8243789B2 (en) 2007-01-25 2012-08-14 Sharp Laboratories Of America, Inc. Methods and systems for rate-adaptive transmission of video
FR2931025A1 (fr) * 2008-05-07 2009-11-13 Canon Kk Procede de determination d'attributs de priorite associes a des conteneurs de donnees, par exemple dans un flux video, procede de codage, programme d'ordinateur et dispositifs associes

Also Published As

Publication number Publication date
EP1820352A4 (en) 2011-07-27
KR20070090179A (ko) 2007-09-05
EP1820352A1 (en) 2007-08-22
KR100880639B1 (ko) 2009-01-30

Similar Documents

Publication Publication Date Title
US20060233246A1 (en) Method and apparatus for encoding, transmitting, and decoding a video signal
US8050326B2 (en) Method for providing and using information about inter-layer prediction for video signal
US7864849B2 (en) Method for scalably encoding and decoding video signal
US8532187B2 (en) Method and apparatus for scalably encoding/decoding video signal
US9338453B2 (en) Method and device for encoding/decoding video signals using base layer
US20060133482A1 (en) Method for scalably encoding and decoding video signal
US20090041130A1 (en) Method of transmitting picture information when encoding video signal and method of using the same when decoding video signal
WO2006126841A1 (en) Method for providing and using information about inter-layer prediction for video signal
US20060120454A1 (en) Method and apparatus for encoding/decoding video signal using motion vectors of pictures in base layer
WO2006104366A1 (en) Method for scalably encoding and decoding video signal
WO2006104365A1 (en) Method for scalably encoding and decoding video signal
US20080008241A1 (en) Method and apparatus for encoding/decoding a first frame sequence layer based on a second frame sequence layer
WO2006104364A1 (en) Method for scalably encoding and decoding video signal
KR20060063619A (ko) 영상 신호의 인코딩 및 디코딩 방법
US20070223573A1 (en) Method and apparatus for encoding/decoding a first frame sequence layer based on a second frame sequence layer
US20070280354A1 (en) Method and apparatus for encoding/decoding a first frame sequence layer based on a second frame sequence layer
WO2006062330A1 (en) Method and apparatus for encoding, transmitting, and decoding a video signal
WO2006104363A1 (en) Method for scalably encoding and decoding video signal
EP1933565A1 (en) Method and apparatus for encoding and/or decoding bit depth scalable video data using adaptive enhancement layer prediction

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KN KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2005819115

Country of ref document: EP

Ref document number: 1020077012666

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 200580041787.5

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2007125472

Country of ref document: RU

WWP Wipo information: published in national office

Ref document number: 2005819115

Country of ref document: EP