EP1323316A2 - Fine granular scalability with two-loop motion compensation - Google Patents
Info
- Publication number
- EP1323316A2 (EP01985317A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- motion
- differential
- frames
- enhancement layer
- frame residuals
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
- H04N19/34—Scalability techniques involving progressive bit-plane based encoding of the enhancement layer, e.g. fine granular scalability [FGS]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
- H04N19/31—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- The present invention relates to video coding, and more particularly to a scalable enhancement layer video coding scheme that employs motion compensation within the enhancement layer for bi-directional predicted frames (B-frames), or for both predicted frames and bi-directional predicted frames (P- and B-frames).
- Scalable enhancement layer video coding has been used for compressing video transmitted over computer networks having a varying bandwidth, such as the Internet.
- a current enhancement layer video coding scheme employing fine granular scalable coding techniques is shown in FIG. 1.
- The video coding scheme 10 includes a prediction-based base layer 11 coded at a bit rate R_BL, and an FGS enhancement layer 12 coded at a bit rate R_EL.
- the prediction-based base layer 11 includes intraframe coded I frames, interframe coded P frames which are temporally predicted from previous I- or P-frames using motion estimation-compensation, and interframe coded bi-directional B-frames which are temporally predicted from both previous and succeeding frames adjacent the B-frame using motion estimation-compensation.
- the use of predictive and/or interpolative coding i.e., motion estimation and corresponding compensation, in the base layer 11 reduces temporal redundancy therein.
- The enhancement layer 12 includes FGS enhancement layer I-, P-, and B-frames derived by subtracting their respective reconstructed base layer frames from the respective original frames (this subtraction can also take place in the motion-compensated domain).
- The FGS enhancement layer I-, P- and B-frames in the enhancement layer are not motion-compensated.
- The FGS residual is taken from frames at the same time-instance.
- the primary reason for this is to provide flexibility which allows truncation of each FGS enhancement layer frame individually depending on the available bandwidth at transmission time.
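- The per-frame truncation described above can be illustrated with a minimal bit-plane sketch. It assumes non-negative integer residual magnitudes and omits sign handling, the DCT and entropy coding; the function names `to_bit_planes` and `from_bit_planes` and the sample values are hypothetical and do not come from the patent.

```python
def to_bit_planes(magnitudes, num_planes):
    """Split non-negative integers into bit-planes, most significant plane first."""
    return [[(m >> p) & 1 for m in magnitudes]
            for p in range(num_planes - 1, -1, -1)]


def from_bit_planes(planes, num_planes, length):
    """Rebuild magnitudes from however many leading planes were received."""
    values = [0] * length
    for k, plane in enumerate(planes):
        shift = num_planes - 1 - k  # weight of the k-th received plane
        values = [v | (bit << shift) for v, bit in zip(values, plane)]
    return values


residual = [5, 3, 0, 6]  # hypothetical residual magnitudes of one frame
planes = to_bit_planes(residual, num_planes=3)

# Truncating the frame after two planes gives a coarser approximation;
# receiving all planes restores the magnitudes exactly.
print(from_bit_planes(planes[:2], 3, len(residual)))
print(from_bit_planes(planes, 3, len(residual)))
```

Because each frame is decomposed independently in this sketch, any frame can be cut after any plane without affecting the others, which is the flexibility the passage refers to.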
- FIG. 2 shows a block-diagram of a conventional FGS encoder for coding the base layer 11 and enhancement layer 12 of the video coding scheme of FIG. 1.
- The enhancement layer residual of frame i (FGSR(i)) equals MCR(i)-MCRQ(i), where MCR(i) is the motion-compensated residual of frame i, and MCRQ(i) is the motion-compensated residual of frame i after the quantization and the dequantization processes.
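- As a rough illustration of the relation FGSR(i) = MCR(i) - MCRQ(i), the sketch below applies a uniform quantizer directly to sample values. This is a simplifying assumption: the DCT stage is left out, and the helper names and the step size are hypothetical.

```python
def quantize_dequantize(values, step):
    """Uniform quantization followed by dequantization (stand-in for MCR -> MCRQ)."""
    return [step * round(v / step) for v in values]


def fgs_residual(mcr, step):
    """FGSR(i) = MCR(i) - MCRQ(i): what the base layer quantizer discarded."""
    mcrq = quantize_dequantize(mcr, step)
    return [a - b for a, b in zip(mcr, mcrq)]


mcr = [13, -7, 3, 0]  # hypothetical motion-compensated residual samples
print(quantize_dequantize(mcr, step=8))
print(fgs_residual(mcr, step=8))
```

The residual is bounded by half the quantizer step in this sketch, so a coarser base layer leaves more information for the enhancement layer to carry.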
- Although the current FGS enhancement layer video coding scheme 10 of FIG. 1 is very flexible, it has the disadvantage that its video image quality is relatively low compared with that of a non-scalable coder operating at the same transmission bit-rate.
- the decrease in image quality is not due to the fine granular scalable coding of the enhancement layer 12 but mainly due to the reduced exploitation of the temporal redundancy among the FGS residual frames within the enhancement layer 12.
- The FGS enhancement layer frames of the enhancement layer 12 are derived only from the motion-compensated residual of their respective base layer I-, P-, and B-frames; no FGS enhancement layer frames are used to predict other FGS enhancement layer frames in the enhancement layer 12 or other frames in the base layer 11.
- a scalable enhancement layer video coding scheme is needed that employs motion-compensation in the enhancement layer to improve image quality while preserving most of the flexibility and attractive characteristics typical to the current FGS video coding scheme.
- the present invention is directed to an enhancement layer video coding scheme, and in particular an FGS enhancement layer video coding scheme that employs motion compensation within the enhancement layer for predicted and bi-directional predicted frames.
- One aspect of the invention involves a method comprising the steps of: coding an uncoded video with a non-scalable codec to generate base layer frames; computing differential frame residuals from the uncoded video and the base layer frames, at least portions of certain ones of the differential frame residuals being operative as references; applying motion-compensation to the at least portions of the differential frame residuals that are operative as references to generate reference motion-compensated differential frame residuals; and subtracting the reference motion-compensated differential frame residuals from respective ones of the differential frame residuals to generate motion-predicted enhancement layer frames.
- Another aspect of the invention involves a method comprising the steps of: decoding a base layer stream to generate base layer video frames; decoding an enhancement layer stream to generate differential frame residuals, at least portions of certain ones of the differential frame residuals being operative as references; applying motion-compensation to the at least portions of the differential frame residuals operative as references to generate reference motion-compensated differential frame residuals; adding the reference motion-compensated differential frame residuals with respective ones of the differential frame residuals to generate motion-predicted enhancement layer frames; and combining the motion-predicted enhancement layer frames with respective ones of the base layer frames to generate an enhanced video.
- Still another aspect of the invention involves a memory medium for encoding video, which comprises code for non-scalable encoding an uncoded original video into base layer frames; code for computing differential frame residuals from the uncoded original video and the base layer frames, at least portions of certain ones of the differential frame residuals being operative as references; code for applying motion-compensation to the at least portions of the differential frame residuals that are operative as references to generate reference motion-compensated differential frame residuals; and code for subtracting the reference motion-compensated differential frame residuals from respective ones of the differential frame residuals to generate motion-predicted enhancement layer frames.
- a further aspect of the invention involves a memory medium for decoding a compressed video having a base layer stream and an enhancement layer stream, which comprises: code for decoding the base layer stream to generate base layer video frames; code for decoding the enhancement layer stream to generate differential frame residuals, at least portions of certain ones of the differential frame residuals being operative as references; code for applying motion-compensation to the at least portions of the differential frame residuals operative as references to generate reference motion-compensated differential frame residuals; code for adding the reference motion-compensated differential frame residuals with respective ones of the differential frame residuals to generate motion-predicted enhancement layer frames; and code for combining the motion-predicted enhancement layer frames with respective ones of the base layer frames to generate an enhanced video.
- Still a further aspect of the invention involves an apparatus for coding video, which comprises: means for non-scalable coding an uncoded original video to generate base layer frames; means for computing differential frame residuals from the uncoded original video and the base layer frames, at least portions of certain ones of the differential frame residuals being operative as references; means for applying motion-compensation to the at least portions of the differential frame residuals that are operative as references to generate reference motion-compensated differential frame residuals; and means for subtracting the reference motion-compensated differential frame residuals from respective ones of the differential frame residuals to generate motion-predicted enhancement layer frames.
- Still another aspect of the invention involves an apparatus for decoding a compressed video having a base layer stream and an enhancement layer stream, which comprises: means for decoding the base layer stream to generate base layer video frames; means for decoding the enhancement layer stream to generate differential frame residuals, at least portions of certain ones of the differential frame residuals being operative as references; means for applying motion-compensation to the at least portions of the differential frame residuals operative as references to generate reference motion-compensated differential frame residuals; means for adding the reference motion-compensated differential frame residuals with respective ones of the differential frame residuals to generate motion-predicted enhancement layer frames; and means for combining the motion-predicted enhancement layer frames with respective ones of the base layer frames to generate an enhanced video.
- FIG. 1 shows a current enhancement layer video coding scheme;
- FIG. 2 shows a block-diagram of a conventional encoder for coding the base layer and enhancement layer of the video coding scheme of FIG. 1;
- FIG. 3A shows an enhancement layer video coding scheme according to a first exemplary embodiment of the present invention;
- FIG. 3B shows an enhancement layer video coding scheme according to a second exemplary embodiment of the present invention;
- FIG. 4 shows a block-diagram of an encoder, according to an exemplary embodiment of the present invention, that may be used for generating the enhancement layer video coding scheme of FIG. 3A;
- FIG. 5 shows a block-diagram of an encoder, according to an exemplary embodiment of the present invention, that may be used for generating the enhancement layer video coding scheme of FIG. 3B;
- FIG. 6 shows a block-diagram of a decoder, according to an exemplary embodiment of the present invention, that may be used for decoding the compressed base layer and enhancement layer streams generated by the encoder of FIG. 4;
- FIG. 7 shows a block-diagram of a decoder, according to an exemplary embodiment of the present invention, that may be used for decoding the compressed base layer and enhancement layer streams generated by the encoder of FIG. 5; and
- FIG. 8 shows an exemplary embodiment of a system which may be used for implementing the principles of the present invention.
- FIG. 3A shows an enhancement layer video coding scheme 30 according to a first exemplary embodiment of the present invention.
- the video coding scheme 30 includes a prediction-based base layer 31 and a two-loop prediction-based enhancement layer 32.
- the prediction-based base layer 31 includes intraframe coded I frames, interframe coded predicted P-frames, and interframe coded bi-directional predicted B-frames, as in the conventional enhancement layer video scheme presented in FIG. 1.
- The base layer I-, P- and B-frames may be coded using conventional non-scalable frame-prediction coding techniques. (The base layer I-frames are of course not motion-predicted.)
- The two-loop prediction-based enhancement layer 32 includes non-motion-predicted enhancement layer I- and P-frames and motion-predicted enhancement layer B-frames.
- The non-motion-predicted enhancement layer I- and P-frames are derived conventionally by subtracting their respective reconstructed (decoded) base layer I- and P-frame residuals from their respective original base layer I- and P-frame residuals.
- The motion-predicted enhancement layer B-frames are each computed using: 1) motion-prediction from two temporally adjacent differential I- and P- or P- and P-frame residuals (a.k.a. enhancement layer frames), and 2) the differential B-frame residual obtained by subtracting the decoded base layer B-frame residual from the original base layer B-frame residual.
- The difference between 2) the differential B-frame residual and 1) the B-frame motion prediction obtained from the two temporally adjacent motion-compensated differential frame residuals provides a motion-predicted enhancement layer B-frame in the enhancement layer 32.
- Both the motion-predicted enhancement layer B-frames resulting from this process and the non-motion-predicted enhancement layer I- and P-frames may be coded with any suitable scalable codec, preferably a fine granular scalable (FGS) codec as shown in FIG. 3A.
- The video coding scheme 30 of the present invention improves the video image quality because it reduces temporal redundancy in the enhancement layer B-frames of the enhancement layer 32. Since the enhancement layer B-frames account for 66% of the total bit-rate budget for the enhancement layer 32 in an IBBP group of pictures (GOP) structure, the loss in image quality associated with performing motion compensation only for the enhancement layer B-frames is very limited for most video sequences. (In conventional enhancement layer video coding schemes, a popular rate-control approach allocates an equal number of bits to all enhancement layer I-, P-, and B-frames.)
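- The 66% figure can be checked with a short calculation. The sketch assumes equal bit allocation per enhancement layer frame, as in the parenthetical above, and assumes that the repeating period of an IBBP sequence is one anchor frame (I or P) plus two B-frames; both the period string "BBP" and the function name `b_frame_share` are illustrative choices, not taken from the patent.

```python
from fractions import Fraction


def b_frame_share(period):
    """Fraction of frames (and, with equal bits per frame, of bits) that are B-frames."""
    return Fraction(period.count("B"), len(period))


# Assumed steady-state period: two B-frames for every anchor (I or P) frame.
share = b_frame_share("BBP")
print(share, float(share))
```

Under these assumptions the share is two thirds, which is consistent with the roughly 66% stated in the text.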
- rate-control plays an important role for achieving good performance with the video coding scheme of the present invention.
- The packet-loss robustness of the above scheme is similar to that of the current enhancement layer coding scheme of FIG. 1: if an error occurs in a motion-predicted enhancement layer B-frame, this error will not propagate beyond the next received I- or P-frame. Two packet-loss scenarios can occur:
- FIG. 4 shows a block-diagram of an encoder 40, according to an exemplary embodiment of the present invention, that may be used for generating the enhancement layer video coding scheme of FIG. 3A.
- the encoder 40 includes a base layer encoder 41 and an enhancement layer encoder 42.
- the base layer encoder 41 is conventional and includes a motion estimator 43 that generates motion information (motion vectors and prediction modes) from the original video sequence and appropriate reference frame stored in memory 44.
- a first motion compensator 45 in a first motion compensation loop 62 processes the motion information and generates motion-compensated base layer reference frames (Ref(i)).
- a first subtractor 46 subtracts the motion-compensated base layer reference frames Ref(i) from the original video sequence to generate motion-compensated residuals of the base layer frames MCR(i).
- the motion-compensated residuals of the base layer frames MCR(i) are processed by a discrete cosine transform (DCT) encoder 47, a quantizer 48, and an entropy encoder 49 into a portion of a compressed base layer stream (base layer frames) from the original video sequence.
- DCT discrete cosine transform
- the motion information generated by the motion estimator 43 is also combined, via a multiplexer 50, with the portion of the base layer stream processed by the first subtractor 46, DCT encoder 47, quantizer 48 and entropy encoder 49.
- The quantized motion-compensated residuals of the base layer frames MCR(i) generated at the output of the quantizer 48 are dequantized by an inverse quantizer 51, and then inverse DCT transformed via an inverse DCT unit 52. This process generates quantized-and-dequantized versions of the motion-compensated residuals of the base layer frames, MCRQ(i), at the output of the inverse DCT unit 52.
- the quantized-and-dequantized motion-compensated residuals of the base layer frames MCRQ(i) and their respective motion-compensated base layer reference frames Ref(i) are summed in an adder 53 to generate new reference frames that are stored in the first frame memory 44 and used by the motion estimator 43 and motion compensator 45 for processing other frames.
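- The base layer loop described above can be sketched for a single frame as follows. This is a simplified illustration only: the DCT, inverse DCT, entropy coding and motion estimation are omitted, the quantizer is assumed to be uniform, and `code_base_frame` and its sample inputs are hypothetical.

```python
def code_base_frame(original, ref, step):
    """One pass of the base layer loop for one frame (transform stages omitted)."""
    mcr = [o - r for o, r in zip(original, ref)]    # first subtractor 46: MCR(i)
    levels = [round(v / step) for v in mcr]         # quantizer 48
    mcrq = [step * q for q in levels]               # inverse quantizer 51: MCRQ(i)
    new_ref = [r + m for r, m in zip(ref, mcrq)]    # adder 53: next reference frame
    return levels, mcr, mcrq, new_ref


original = [21, 11, 7]   # hypothetical original samples
ref = [10, 10, 10]       # hypothetical motion-compensated reference Ref(i)
levels, mcr, mcrq, new_ref = code_base_frame(original, ref, step=4)
print(levels, mcr, mcrq, new_ref)
```

The new reference is built from MCRQ(i) rather than MCR(i), so the encoder predicts from the same reconstructed frames that a decoder can rebuild from the transmitted levels.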
- The enhancement layer encoder 42, which preferably comprises an FGS enhancement layer encoder (as shown in FIG. 4), includes a second subtractor 54 that computes the difference between the motion-compensated residuals of the base layer frames MCR(i) and the quantized-and-dequantized motion-compensated residuals of the base layer frames MCRQ(i) to generate differential I-, P-, and B-frame residuals FGSR(i), which in the case of the I- and P-frame residuals, are the enhancement layer I- and P-frames.
- FGSR(i) differential I-, P-, and B-frame residuals
- A frame flow control device 55 is provided for enabling the differential I- and P-frame residuals to be processed conventionally while the differential B-frame residuals are processed with motion-compensation in the enhancement layer in accordance with the principles of the present invention.
- the frame flow control device 55 accomplishes this task by causing the data flow at the output of the second subtractor 54 to stream in a different manner in accordance with the type of frame that is outputted by the second subtractor 54.
- Differential I- and P-frame residuals generated at the output of the second subtractor 54 are routed by the frame flow control device 55 to an FGS encoder 61 (or like scalable encoder) for FGS coding using conventional DCT encoding followed by bit-plane DCT scanning and entropy encoding to generate a portion (non-motion-predicted enhancement layer I- and P-frames) of a compressed enhancement layer stream.
- The differential I- and P-frame residuals generated at the output of the second subtractor 54 are also routed to a second frame memory 58 where they are used later on for motion-compensation.
- The differential B-frame residuals generated at the output of the second subtractor 54 are routed by the frame flow control device 55 to a third subtractor 60 and the second frame memory 58.
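- The routing performed by the frame flow control device 55 can be summarized as a small lookup. The destination labels below are hypothetical names for the numbered blocks of FIG. 4, and the function only restates the routing given in the preceding paragraphs.

```python
def route(frame_type):
    """Destinations of a differential frame residual FGSR(i) in the FIG. 4 encoder."""
    if frame_type in ("I", "P"):
        # Coded directly, and kept as a reference for later motion compensation.
        return ["fgs_encoder_61", "frame_memory_58"]
    if frame_type == "B":
        # Sent to the third subtractor; the text also lists the second frame memory.
        return ["subtractor_60", "frame_memory_58"]
    raise ValueError(f"unknown frame type: {frame_type!r}")


for t in "IBBP":
    print(t, route(t))
```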
- a second motion compensator 59 in second motion compensation loop 63 reuses the motion information from the original video sequence (the output of the motion estimator 43 of the base layer encoder 41) and the differential I- and P-frame residuals stored in the second frame memory 58, which are used as references, to generate reference motion-compensated differential (I- and P- or P- and P-) frame residuals MCFGSR(i). Note that only a portion of each reference differential I- and P-frame residual e.g.
- the third subtractor 60 generates each motion-predicted enhancement layer B-frame MCFGS(i) by subtracting the reference motion-compensated differential (I- and P- or P- and P-) frame residual MCFGSR(i) from its respective differential B-frame residual FGSR(i).
- the frame flow control device 55 routes the motion-predicted enhancement layer B-frames MCFGS(i) to the FGS encoder 61 for FGS coding using conventional DCT encoding followed by bit-plane DCT scanning and entropy encoding where they are added to the compressed enhancement layer stream.
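- The B-frame path through the second motion compensator 59 and the third subtractor 60 can be sketched on one-dimensional frames. Several things here are assumptions made for illustration: motion compensation is reduced to a whole-frame integer shift, the two references are combined by averaging (the patent does not state how the prediction modes combine them), and all function names and sample values are hypothetical.

```python
def shift(frame, dx):
    """Toy 1-D motion compensation: displace samples by dx, padding with zeros."""
    n = len(frame)
    return [frame[i - dx] if 0 <= i - dx < n else 0 for i in range(n)]


def predict_b(ref_prev, ref_next, mv_prev, mv_next):
    """MCFGSR(i): assumed average of the two motion-compensated reference residuals."""
    a = shift(ref_prev, mv_prev)
    b = shift(ref_next, mv_next)
    return [(x + y) // 2 for x, y in zip(a, b)]


def encode_b(fgsr_b, mcfgsr):
    """Third subtractor 60: MCFGS(i) = FGSR(i) - MCFGSR(i)."""
    return [r - p for r, p in zip(fgsr_b, mcfgsr)]


ref_prev = [4, 2, 0, 0, 0]   # differential residual of the previous I- or P-frame
ref_next = [0, 0, 4, 2, 0]   # differential residual of the next P-frame
fgsr_b = [0, 5, 2, 0, 0]     # differential residual of the B-frame in between

mcfgsr = predict_b(ref_prev, ref_next, mv_prev=1, mv_next=-1)
mcfgs = encode_b(fgsr_b, mcfgsr)
print(mcfgsr, mcfgs)
```

In this example the motion-predicted frame has less energy than the differential B-frame residual it replaces, which is the temporal redundancy the second loop is meant to remove.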
- The base layer remains unchanged in the enhancement layer video coding scheme of FIG. 3A.
- The enhancement layer I- and P-frames are processed in substantially the same manner as in the current FGS video coding scheme of FIG. 1; therefore, these frames are not motion-predicted within the enhancement layer.
- MCRQ(i) is the motion-compensated residual of frame i after the quantization and the dequantization processes
- FGSR(i) is computed as in the current FGS video coding scheme of FIG. 1, i.e., FGSR(i) equals MCR(i)-MCRQ(i)
- MCFGSR(i) is the reference motion-compensated differential frame residual for frame i.
- The enhancement layer B-frame processing method of the present invention merely requires an additional motion-compensation loop in the enhancement layer for providing motion-predicted enhancement layer B-frames.
- FIG. 6 shows a block-diagram of a decoder 70, according to an exemplary embodiment of the present invention, that may be used for decoding the compressed base layer and enhancement layer streams generated by the encoder 40 of FIG. 4.
- the decoder 70 includes a base layer decoder 71 and an enhancement layer decoder 72.
- the base layer decoder 71 includes a demultiplexer 75 which receives the encoded base layer stream and demultiplexes the stream into first and second data streams 76 and 77.
- The first data stream 76, which includes motion information (motion vectors and motion prediction modes), is applied to a first motion compensator 78.
- the motion compensator 78 uses the motion information and base layer reference video frames stored in an associated base layer frame memory 79 to generate motion-predicted base layer P- and B-frames that are applied to a first input 81 of a first adder 80.
- the second data stream 77 is applied to a base layer variable length code decoder 83 for decoding, and to an inverse quantizer 84 for dequantizing.
- the dequantized code is applied to an inverse DCT decoder 85 where the dequantized code is transformed into base layer residual video I-, P- and B-frames which are applied to a second input 82 of the first adder 80.
- the base layer residual video frames and motion-predicted base layer frames generated by the motion compensator 78 are summed in the first adder 80 to generate base layer video I-, P-, and B-frames that are stored in the base layer frame memory 79 and optionally outputted as a base layer video.
- the enhancement layer decoder 72 includes an FGS bit-plane decoder 86 or like scalable decoder that decodes the compressed enhancement layer stream to generate at first and second outputs 73 and 74 the differential I-, P-, and B-frame residuals which are respectively applied to first and second frame flow control devices 87 and 91.
- the first and second frame flow control devices 87 and 91 enable the differential I- and P-frame residuals to be processed differently from the differential B-frame residuals by causing the data flow at the outputs 73 and 74 of the FGS bit-plane decoder 86 to stream in a different manner in accordance with the type of enhancement layer frame that is outputted by the decoder 86.
- The differential I- and P-frame residuals at the first output 73 of the FGS bit-plane decoder 86 are routed by the first frame flow control device 87 to an enhancement layer frame memory 88 where they are stored and used later on for motion compensation.
- The differential B-frame residuals at the first output 73 of the FGS bit-plane decoder 86 are routed by the first frame flow control device 87 to a second adder 92 and processed as will be explained further on.
- A second motion compensator 90 reuses the motion information received by the base layer decoder 71 and the differential I- and P-frame residuals stored in the enhancement layer frame memory 88 to generate reference motion-compensated differential (I- and P- or P- and P-) frame residuals, which are used for predicting enhancement layer B-frames.
- The second adder 92 sums each reference motion-compensated differential frame residual and its respective differential B-frame residual to generate an enhancement layer B-frame.
- The second frame flow control device 91 sequentially routes the enhancement layer I- and P-frames (the differential I- and P-frame residuals) at the second output 74 of the FGS bit-plane decoder 86 and the motion-predicted enhancement layer B-frames at the output 93 of the second adder 92 to a third adder 89.
- The third adder 89 sums the enhancement layer I-, P-, and B-frames together with their corresponding base layer I-, P-, and B-frames to generate an enhanced video.
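- The decoder-side additions can be sketched as a single function. The sketch assumes the reference motion-compensated residual has already been produced by the second motion compensator 90 and is passed in as `prediction`; the function name `reconstruct` and the sample values are hypothetical.

```python
def reconstruct(base, residual, frame_type, prediction=None):
    """Enhanced frame for the FIG. 6 decoder, given decoded base and enhancement data."""
    if frame_type == "B":
        # Second adder 92: undo the encoder-side subtraction of the prediction.
        enhancement = [r + p for r, p in zip(residual, prediction)]
    else:
        # I- and P-frames are not motion-predicted in this embodiment.
        enhancement = list(residual)
    # Third adder 89: add the enhancement layer frame to the base layer frame.
    return [b + e for b, e in zip(base, enhancement)]


base = [10, 20, 30]  # hypothetical decoded base layer samples
print(reconstruct(base, [1, 0, -1], "P"))
print(reconstruct(base, [0, 1, 0], "B", prediction=[0, 4, 2]))
```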
- FIG. 3B shows an enhancement layer video coding scheme 100 according to a second exemplary embodiment of the present invention.
- The video coding scheme 100 of the second embodiment is substantially identical to the first embodiment of FIG. 3A except that the enhancement layer P-frames in the two-loop prediction-based enhancement layer 132 are motion-predicted like the enhancement layer B-frames.
- Each motion-predicted enhancement layer P-frame is computed in a manner similar to the enhancement layer B-frames, i.e., each motion-predicted enhancement layer P-frame is computed using: 1) motion-prediction from a temporally adjacent differential I- or P-frame residual, and 2) the differential P-frame residual obtained by subtracting the decoded base layer P-frame residual from the original base layer P-frame residual. The difference between 2) the differential P-frame residual and 1) the P-frame motion prediction obtained from the temporally adjacent motion-compensated differential frame residual provides a motion-predicted enhancement layer P-frame in the enhancement layer 132.
- Both the motion-predicted enhancement layer P- and B-frames resulting from this process and the non-motion-predicted enhancement layer I-frames may be coded with any suitable scalable codec, preferably a fine granular scalable (FGS) codec as shown in FIG. 3B.
- FGS fine granular scalable
- the video coding scheme 100 of FIG. 3B provides further improvements in the video image quality. This is because the video coding scheme 100 reduces temporal redundancy in both the P- and B-frames of the enhancement layer 132.
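- The difference between the three schemes comes down to which enhancement layer frame types are motion-predicted and how many reference residuals each prediction uses. The table and function below restate this from the descriptions of FIGS. 1, 3A and 3B; the names `MOTION_PREDICTED` and `reference_count` are hypothetical.

```python
# Enhancement layer frame types that are motion-predicted in each scheme.
MOTION_PREDICTED = {
    "FIG. 1": frozenset(),             # conventional FGS: none
    "FIG. 3A": frozenset({"B"}),       # first embodiment: B-frames only
    "FIG. 3B": frozenset({"P", "B"}),  # second embodiment: P- and B-frames
}


def reference_count(scheme, frame_type):
    """Number of differential frame residuals used as references for one frame."""
    if frame_type not in MOTION_PREDICTED[scheme]:
        return 0
    # B-frames use two temporally adjacent references, P-frames use one.
    return 2 if frame_type == "B" else 1


for scheme in MOTION_PREDICTED:
    print(scheme, {t: reference_count(scheme, t) for t in "IPB"})
```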
- The video coding schemes of the present invention can be alternated with the current video coding scheme of FIG. 1 for the various portions of a video sequence or for various video sequences. Additionally, switching between all three video coding schemes, i.e., the current video coding scheme of FIG. 1 and the video coding schemes described in FIGS. 3A and 3B, can be done based on channel characteristics and can be performed at encoding or at transmission time. Further, the video coding schemes of the present invention achieve a large gain in coding efficiency with only a limited increase in complexity.
- FIG. 5 shows a block-diagram of an encoder 140, according to an exemplary embodiment of the present invention, that may be used for generating the enhancement layer video coding scheme of FIG. 3B.
- The encoder 140 of FIG. 5 is substantially identical to the encoder 40 of FIG. 4 (which is used for generating the enhancement layer video coding scheme of FIG. 3A), except that the frame flow control device 55 used in the encoder 40 is omitted.
- the frame flow control device is not necessary in this encoder 140 because the differential I-frame residuals are not processed with motion-compensation and thus, do not need to be routed differently from the differential P- and B-frame residuals in the enhancement layer encoder 142.
- The differential I-frame residuals generated at the output of the second subtractor 54 pass to an FGS encoder 61 for FGS coding using conventional DCT encoding followed by bit-plane DCT scanning and entropy encoding to generate a portion (non-motion-predicted enhancement layer I-frames) of a compressed enhancement layer stream.
- The differential I-frame residuals also pass to a second frame memory 58 along with the differential P-frame residuals where they are used later on for motion-compensation.
- the differential P- and B-frame residuals generated at the output of the second subtractor 54 are also passed to a third subtractor 60.
- a second motion compensator 59 in second motion compensation loop 63 reuses the motion information from the original video sequence (the output of the motion estimator 43 of the base layer encoder 41) and the differential I- and P- frame residuals stored in the second frame memory 58, which are used as references, to generate reference motion-compensated differential (I or P) frame residuals MCFGSR(i) for motion-predicting enhancement layer P-frames and reference (I- and P- or P- and P-) frame residuals MCFGSR(i) for motion-predicting enhancement layer B-frames.
- The third subtractor 60 generates each motion-predicted enhancement layer P- or B-frame MCFGS(i) by subtracting the reference motion-compensated differential (I or P) or (I- and P- or P- and P-) frame residual MCFGSR(i) from its respective differential P- or B-frame residual FGSR(i).
- the motion-predicted enhancement layer P- and B-frames MCFGS(i) then pass to the FGS encoder 61 for FGS coding using conventional DCT encoding followed by bit-plane DCT scanning and entropy encoding where they are added to the compressed enhancement layer stream.
- the base layer remains unchanged in the enhancement layer video coding scheme of FIG. 3B.
- The enhancement layer P- and B-frame processing method of the present invention merely requires an additional motion-compensation loop in the enhancement layer for providing motion-predicted enhancement layer P- and B-frames.
- FIG. 7 shows a block-diagram of a decoder 170, according to an exemplary embodiment of the present invention, that may be used for decoding the compressed base layer and enhancement layer streams generated by the encoder 140 of FIG. 5.
- The decoder 170 of FIG. 7 is substantially identical to the decoder 70 of FIG. 6, except that the frame flow control devices 87 and 91 used in the decoder 70 are omitted.
- The frame flow control devices are not necessary in this decoder 170 because the differential I-frame residuals are not processed with motion compensation and thus do not need to be routed differently from the decoded differential P- and B-frame residuals in the enhancement layer decoder 172.
- The differential I- and P-frame residuals at the first output 73 of the FGS bit-plane decoder 86 pass to the enhancement layer frame memory 88, where they are stored and used later for motion compensation.
- The differential P- and B-frame residuals at the second output 74 of the FGS bit-plane decoder 86 pass to a second adder 92.
- The differential I-frame residuals (enhancement layer I-frames hereinafter) at the second output 74 of the FGS bit-plane decoder 86 pass to a third adder 89, the purpose of which is explained below.
- The second motion compensator 90 reuses the motion information received by the base layer decoder 71 and the differential I- and P-frame residuals stored in the enhancement layer frame memory 88 to generate 1) reference motion-compensated differential (I- and P-, or P- and P-) frame residuals, which are used for predicting enhancement layer B-frames, and 2) reference motion-compensated differential (I- or P-) frame residuals, which are used for predicting enhancement layer P-frames.
- The second adder 92 sums the reference motion-compensated differential frame residuals with their respective differential B-frame or P-frame residuals to generate enhancement layer B- and P-frames.
- The third adder 89 sums the enhancement layer I-, P-, and B-frames together with their corresponding base layer I-, P-, and B-frames to generate an enhanced video.
- FIG. 8 shows an exemplary embodiment of a system 200 which may be used for implementing the principles of the present invention.
- The system 200 may represent a television, a set-top box, a desktop, laptop or palmtop computer, a personal digital assistant (PDA), a video/image storage device such as a video cassette recorder (VCR), a digital video recorder (DVR), a TiVO device, etc., as well as portions or combinations of these and other devices.
- The system 200 includes one or more video/image sources 201, one or more input/output devices 202, a processor 203 and a memory 204.
- The video/image source(s) 201 may represent, e.g., a television receiver, a VCR or other video/image storage device.
- The source(s) 201 may alternatively represent one or more network connections for receiving video from a server or servers over, e.g., a global computer communications network such as the Internet, a wide area network, a metropolitan area network, a local area network, a terrestrial broadcast system, a cable network, a satellite network, a wireless network, or a telephone network, as well as portions or combinations of these and other types of networks.
- The input/output devices 202, processor 203 and memory 204 may communicate over a communication medium 205.
- The communication medium 205 may represent, e.g., a bus, a communication network, one or more internal connections of a circuit, circuit card or other device, as well as portions and combinations of these and other communication media.
- Input video data from the source(s) 201 is processed in accordance with one or more software programs stored in memory 204 and executed by processor 203 in order to generate output video/images supplied to a display device 206.
- The coding and decoding employing the principles of the present invention may be implemented by computer readable code executed by the system.
- The code may be stored in the memory 204 or read/downloaded from a memory medium such as a CD-ROM or floppy disk.
- Hardware circuitry may be used in place of, or in combination with, software instructions to implement the invention.
- The elements shown in FIGS. 4-7 may also be implemented as discrete hardware elements.
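The enhancement layer data flow described above (second motion compensator, third subtractor 60 in the encoder, second adder 92 and third adder 89 in the decoder) can be illustrated with a minimal Python sketch. It is not the patented implementation: frames are toy 1-D lists of integers, motion compensation is reduced to a cyclic shift by a single global motion vector, two references for a B-frame are combined by integer averaging, FGS/DCT coding is omitted (treated as lossless), and encoder and decoder are assumed to hold identical reference residuals. All function names are hypothetical.

```python
from typing import List

Frame = List[int]  # toy 1-D "frame" of residual samples (assumption)


def motion_compensate(reference: Frame, shift: int) -> Frame:
    """Toy motion compensation: cyclic shift by one global motion vector."""
    n = len(reference)
    return [reference[(i - shift) % n] for i in range(n)]


def reference_residual(refs: List[Frame], shifts: List[int]) -> Frame:
    """Build MCFGSR(i) from one reference (P-frame) or two (B-frame).

    Averaging two compensated references is an assumption of this sketch.
    """
    compensated = [motion_compensate(r, s) for r, s in zip(refs, shifts)]
    return [sum(samples) // len(samples) for samples in zip(*compensated)]


def encode_enhancement(residual: Frame, refs: List[Frame], shifts: List[int]) -> Frame:
    """Encoder side (third subtractor): MCFGS(i) = residual(i) - MCFGSR(i)."""
    mcfgsr = reference_residual(refs, shifts)
    return [r - p for r, p in zip(residual, mcfgsr)]


def decode_enhancement(mcfgs: Frame, refs: List[Frame], shifts: List[int]) -> Frame:
    """Decoder side (second adder): residual(i) = MCFGS(i) + MCFGSR(i)."""
    mcfgsr = reference_residual(refs, shifts)
    return [m + p for m, p in zip(mcfgs, mcfgsr)]


def add_base_layer(base: Frame, enhancement: Frame) -> Frame:
    """Decoder side (third adder): enhanced frame = base layer + enhancement layer."""
    return [b + e for b, e in zip(base, enhancement)]


# Differential residuals for an I-, P- and B-frame (made-up sample values).
i_res = [4, 8, 6, 2]
p_res = [5, 7, 7, 1]
b_res = [5, 6, 5, 4]

# P-frame predicted from the I-frame residual; B-frame from the I and P residuals.
mcfgs_p = encode_enhancement(p_res, [i_res], [1])
mcfgs_b = encode_enhancement(b_res, [i_res, p_res], [1, -1])

# The decoder reuses the same motion information and references.
p_back = decode_enhancement(mcfgs_p, [i_res], [1])
b_back = decode_enhancement(mcfgs_b, [i_res, p_res], [1, -1])
```

Because the same motion information and the same references are used on both sides, the decoder's addition exactly undoes the encoder's subtraction in this lossless toy setting. With real FGS coding the enhancement layer may be truncated, so the reconstruction would only approximate the original residuals.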
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US23449900P | 2000-09-22 | 2000-09-22 | |
| US234499P | 2000-09-22 | ||
| US887743 | 2001-06-21 | ||
| US09/887,743 US6940905B2 (en) | 2000-09-22 | 2001-06-21 | Double-loop motion-compensation fine granular scalability |
| PCT/EP2001/010874 WO2002025954A2 (en) | 2000-09-22 | 2001-09-18 | Double-loop motion-compensation fine granular scalability |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP1323316A2 (de) | 2003-07-02 |
Family
ID=26928015
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP01985317A Ceased EP1323316A2 (de) | 2000-09-22 | 2001-09-18 | Double-loop motion-compensation fine granular scalability |
Country Status (7)
| Country | Link |
|---|---|
| EP (1) | EP1323316A2 (de) |
| JP (1) | JP2004509581A (de) |
| KR (1) | KR100860950B1 (de) |
| CN (1) | CN1254115C (de) |
| AU (1) | AU2002220558A1 (de) |
| MY (1) | MY126133A (de) |
| WO (1) | WO2002025954A2 (de) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1413142A2 (de) * | 2001-01-08 | 2004-04-28 | Siemens Aktiengesellschaft | Optimal SNR-scalable video coding |
| KR100621581B1 (ko) | 2004-07-15 | 2006-09-13 | 삼성전자주식회사 | Method and apparatus for predecoding and decoding a bitstream including a base layer |
| US20070201551A1 (en) * | 2006-01-09 | 2007-08-30 | Nokia Corporation | System and apparatus for low-complexity fine granularity scalable video coding with motion compensation |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9022326D0 (en) | 1990-10-15 | 1990-11-28 | British Telecomm | Signal coding |
| JPH04177992A (ja) * | 1990-11-09 | 1992-06-25 | Victor Co Of Japan Ltd | Hierarchical image coding apparatus |
| GB9206860D0 (en) | 1992-03-27 | 1992-05-13 | British Telecomm | Two-layer video coder |
| CA2126467A1 (en) | 1993-07-13 | 1995-01-14 | Barin Geoffry Haskell | Scalable encoding and decoding of high-resolution progressive video |
| JP3788823B2 (ja) | 1995-10-27 | 2006-06-21 | 株式会社東芝 | Moving picture coding apparatus and moving picture decoding apparatus |
| US5852565A (en) | 1996-01-30 | 1998-12-22 | Demografx | Temporal and resolution layering in advanced television |
| US6148026A (en) | 1997-01-08 | 2000-11-14 | At&T Corp. | Mesh node coding to enable object based functionalities within a motion compensated transform video coder |
2001
- 2001-09-18 EP EP01985317A patent/EP1323316A2/de not_active Ceased
- 2001-09-18 AU AU2002220558A patent/AU2002220558A1/en not_active Abandoned
- 2001-09-18 WO PCT/EP2001/010874 patent/WO2002025954A2/en not_active Ceased
- 2001-09-18 JP JP2002529033A patent/JP2004509581A/ja active Pending
- 2001-09-18 KR KR1020027006542A patent/KR100860950B1/ko not_active Expired - Fee Related
- 2001-09-18 CN CNB018038034A patent/CN1254115C/zh not_active Expired - Fee Related
- 2001-09-20 MY MYPI20014417A patent/MY126133A/en unknown
Non-Patent Citations (2)
| Title |
|---|
| See also references of WO0225954A3 * |
| WEIPING LI; FAN LING; XUEMIN CHEN: "Fine Granularity Scalability in MPEG-4 for Streaming Video", PROCEEDINGS OF ISCAS 2000, THE 2000 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, GENEVA, CH, 28-31 MAY 2000, vol. 1, May 2000 (2000-05-01), IEEE, Piscataway, NJ, USA, pages I-299 - I-302, XP010503194 * |
Also Published As
| Publication number | Publication date |
|---|---|
| KR20020056940A (ko) | 2002-07-10 |
| JP2004509581A (ja) | 2004-03-25 |
| MY126133A (en) | 2006-09-29 |
| WO2002025954A3 (en) | 2002-10-31 |
| CN1486574A (zh) | 2004-03-31 |
| CN1254115C (zh) | 2006-04-26 |
| WO2002025954A2 (en) | 2002-03-28 |
| AU2002220558A1 (en) | 2002-04-02 |
| KR100860950B1 (ko) | 2008-09-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6940905B2 (en) | | Double-loop motion-compensation fine granular scalability |
| US7042944B2 (en) | | Single-loop motion-compensation fine granular scalability |
| US20020037046A1 (en) | | Totally embedded FGS video coding with motion compensation |
| US6944222B2 (en) | | Efficiency FGST framework employing higher quality reference frames |
| US6639943B1 (en) | | Hybrid temporal-SNR fine granular scalability video coding |
| US6697426B1 (en) | | Reduction of layer-decoding complexity by reordering the transmission of enhancement layer frames |
| US20020118742A1 (en) | | Prediction structures for enhancement layer in fine granular scalability video coding |
| US20060291562A1 (en) | | Video coding method and apparatus using multi-layer based weighted prediction |
| EP1145561A1 (de) | | System and method for encoding and decoding the residual fine granular scalable video signal |
| US6944346B2 (en) | | Efficiency FGST framework employing higher quality reference frames |
| US6904092B2 (en) | | Minimizing drift in motion-compensation fine granular scalable structures |
| KR100860950B1 (ko) | | Double-loop motion-compensation fine granular scalability |
| US20050135478A1 (en) | | Reduction of layer-decoding complexity by reordering the transmission of enhancement layer frames |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
| | 17P | Request for examination filed | Effective date: 20030502 |
| | AK | Designated contracting states | Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
| | AX | Request for extension of the european patent | Extension state: AL LT LV MK RO SI |
| | 17Q | First examination report despatched | Effective date: 20050301 |
| | STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
| | 18R | Application refused | Effective date: 20060417 |