CN101663896A - Method and apparatus for encoding video data, method and apparatus for decoding encoded video data and encoded video signal - Google Patents

Publication number
CN101663896A
Authority
CN
China
Prior art keywords
data
enhancement
layer
color
color channel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 200880013169
Other languages
Chinese (zh)
Inventor
高永英
武宇文
英格·多塞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS

Abstract

For two or more versions of a video with different spatial, temporal or SNR resolution, scalability can be achieved by generating a base layer (BL) and an enhancement layer (EL). When a version of a video is available that has higher color bit depth than can be displayed, a common solution is tone mapping. A more efficient compression method is proposed for the case where the two or more versions with different color bit depth use different color encoding. The present invention is based on joint inter-layer prediction among the available color channels. Thus, color bit depth scalability can also be used where the two or more versions with different color bit depth use different color encoding. In this case the inter-layer prediction is a joint prediction based on all color components. Prediction may also include color space conversion and gamma correction.

Description

Method and apparatus for encoding video data, method and apparatus for decoding encoded video data and encoded video signal
Technical field
The present invention relates to digital video coding. More specifically, it relates to a method and an apparatus for encoding video data, a method and an apparatus for decoding encoded video data, and a correspondingly encoded video signal.
Background art
In recent years, digital images and videos with a bit depth higher than 8 bits have become increasingly desirable in many applications, such as medical image processing, digital cinema workflows in production and post-production, and home-theater-related applications. State-of-the-art image and video coding technologies are likewise advancing toward high-bit-depth coding. The JVT standardized high-bit-depth coding in H.264 with the Fidelity Range Extensions (FRExt), which support bit depths of up to 14 bits and chroma sampling of up to 4:4:4. Motion JPEG2000 (Part 3), on the other hand, supports up to 32 bits per component.
Color bit depth scalability is potentially very useful, given that conventional 8-bit and high-bit digital imaging systems will coexist in the market for a long time to come. There are several ways to handle the coexistence of 8-bit and high-bit video. The first solution is to provide only a high-bit coded bitstream and to let tone mapping methods produce the 8-bit representation for standard 8-bit display devices. The second solution is to provide a simulcast bitstream that includes an 8-bit coded bitstream. Which bitstream to decode is left to the decoder's preference. This means, for example, that a more capable decoder supporting the AVC High 10 profile can decode and output 10-bit video, whereas a conventional decoder can output only 8-bit video. Typically, the first solution cannot be compatible with 8-bit H.264/AVC decoders. The second solution is compatible with all current standards but requires more overhead. A scalable solution, however, can offer a good trade-off between bit reduction and backward standard compatibility. SVC (also known as the scalable extension of H.264/AVC) is considering support for bit depth scalability.
There has not yet been much research on methods for color bit depth scalability. Unlike spatial scalability, where spatial upsampling can be used to bridge different resolutions, the challenge here is that the additional information needed to go from the reconstructed low-bit picture to the original high-bit picture may be hard to encode. For 8-bit to 10-bit scalability, for example, the additional information can still be as large as 10 bits because of the quantization error introduced when the 8-bit picture is encoded. Inter-layer bit depth prediction is also not similar to FGS, which uses bit-plane scanning in the transform domain.
Furthermore, it is known that different color encodings may be used, which differ in color space, chromaticity coordinates and gamma correction (for example RGB, YCrCb, HSV, XYZ). Various conversion algorithms exist.
When a version of a video is available that has a higher color bit depth than can be displayed, a common solution is tone mapping, in which the high dynamic range is reduced to a lower color bit depth while contrast is preserved. When two or more versions of a video with different spatial, temporal or SNR resolution are available, scalability can be achieved by generating a base layer (BL) and an enhancement layer (EL) that is to be combined with the BL.
An inherent problem of the tone mapping approach, however, is that more data than necessary are transmitted. A more efficient compression method is needed for the case where the two or more versions with different color bit depth use different color encodings.
Summary of the invention
The present invention is based on the recognition that, in bit-depth-scalable video coding, it is usually advantageous to perform joint inter-layer prediction among the available color channels. According to the invention, color bit depth scalability can therefore also be used where the two or more versions with different color bit depth use different color encodings. In this case, the inter-layer prediction is a joint prediction based on all color components. The prediction may also include color space conversion and gamma correction.
According to one aspect of the invention, a method for encoding video data comprising base layer data and enhancement layer data, wherein the base layer and enhancement layer data comprise a plurality of color channels (for example Y, Cr, Cb or R, G, B) and have different bit depths, comprises the steps of: encoding the base layer data; predicting the enhancement layer data from the base layer data, separately for the color channels; and encoding the enhancement layer data based on said predicted enhancement layer data, separately for the color channels, wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels. For at least one enhancement layer color channel, the method further comprises the steps of: generating residual data, the residual data being the difference between the original enhancement layer color channel data and the predicted color channel data; encoding the original enhancement layer color channel data; encoding the residual data; selecting, for the at least one enhancement layer color channel, the encoded original enhancement layer color channel data, the residual data or the encoded residual data, the selection being independent of the selection for the other enhancement layer color channels; and providing the selected enhancement layer color channel data as enhancement layer output data, together with an indication of the coding mode selected for said enhancement layer color channel.
According to another aspect of the invention, a method for decoding encoded video data having BL data and EL data comprises the steps of: extracting the BL data and the EL data from the encoded video data, wherein the BL data and the EL data each comprise separate data for a plurality of color channels; extracting, for at least a first color channel of the enhancement layer, an indication of a coding mode; decoding the base layer data of the plurality of color channels; predicting the EL data based on the decoded base layer data, wherein, in at least one mode, each EL color channel is jointly predicted from all available BL color channels; decoding the EL data of the plurality of color channels, wherein a residual is obtained and wherein, at least for said first color channel, the indication is used to decode according to the indicated coding mode; and reconstructing the EL data of the plurality of color channels based on the predicted EL data and said residual.
According to a further aspect of the invention, an apparatus for encoding video data comprising base layer data and enhancement layer data, wherein the base layer data and the enhancement layer data comprise a plurality of color channels and the base layer and the enhancement layer have different bit depths, comprises: means for encoding the base layer; means for predicting the enhancement layer from the base layer, separately for the color channels; and means for encoding the enhancement layer based on said prediction, separately for the color channels (for example R, G, B), wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels. For at least one enhancement layer color channel, the apparatus further comprises: means for generating a residual, the residual being the difference between the original enhancement layer color channel image and the predicted color channel image; means for encoding the original enhancement layer color channel image; means for encoding the residual; means for selecting, for the at least one enhancement layer color channel, the encoded original enhancement layer color channel image, the residual or the encoded residual, the selection being independent of the selection for the other enhancement layer color channels; and means for providing the selected enhancement layer color channel data as enhancement layer output data and for providing an indication of the coding mode selected for said enhancement layer color channel.
According to a further aspect of the invention, an apparatus for decoding encoded video data having base layer data and enhancement layer data comprises: means for extracting the base layer data and the enhancement layer data from the encoded video data, wherein the base layer data and the enhancement layer data each comprise separate data for a plurality of color channels; means for extracting, for at least a first color channel of the enhancement layer, an indication of a coding mode; means for decoding the base layer data of the plurality of color channels; means for predicting the enhancement layer data based on the decoded base layer data, wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels; means for decoding the enhancement layer data of the plurality of color channels, wherein a residual is obtained and wherein, at least for said first color channel, the indication is used to decode according to the indicated coding mode; and means for reconstructing the enhancement layer data of the plurality of color channels based on the predicted enhancement layer data and said residual.
According to yet another aspect, an encoded video signal comprises base layer data and enhancement layer data, wherein the base layer data comprise a plurality of color channels in a first color encoding and the enhancement layer data comprise a plurality of color channels in a different, second color encoding, the base layer data and the enhancement layer data have different color bit depths, and the signal further comprises a coding mode indication which indicates, at least for a first enhancement layer color channel, whether the signal contains encoded residual data or encoded macroblock data.
A particular advantage of the proposed coding solution is that it conforms to the H.264/AVC standard and is compatible with the kinds of scalability supported in the scalable extension of H.264/AVC (SVC).
At least one implementation proposes an H.264/AVC-compatible color bit depth scalable coding solution in which the low-bit (usually 8-bit) and high-bit (for example 10-, 12- or 14-bit) sequences are encoded as the base layer and the enhancement layer, respectively. In one embodiment of the disclosed solution, inter-layer prediction between the low-bit BL and the high-bit EL is performed at macroblock (MB) level, in order to exploit the redundancy between the low-bit and high-bit representations of the same video. Moreover, the inter-layer color bit depth prediction is not performed independently for each color channel (for example Y, Cb or Cr). Instead, it is performed jointly: the predicted version of each channel of the enhancement layer MB is determined by joint inter-layer color bit depth prediction from all (usually three) color channels of the reconstructed co-located base layer MB.
Advantageous embodiments of the invention are disclosed in the claims, the following description and the drawings.
Brief description of the drawings
Exemplary embodiments of the invention are described with reference to the accompanying drawings, in which:
Fig. 1 shows a framework for color bit depth scalable coding;
Fig. 2 shows joint inter-layer prediction in intra coding;
Fig. 3 shows joint inter-layer prediction in inter coding; and
Fig. 4 shows adaptive inter-layer color bit depth prediction in inter coding.
Detailed description of embodiments
Without loss of generality, assume that there are two layers for color bit depth scalability: one layer is an 8-bit video sequence and the other is a 10-bit video sequence. For at least one implementation, the proposed framework for color bit depth scalable coding is shown in Fig. 1.
The scalable encoder Enc generates a bit-depth-scalable bitstream SBS in which the BL and EL coded pictures are multiplexed. The scalable decoder Dec can produce 8-bit video by decoding only the BL bitstream, or 10-bit video by decoding the whole scalable bitstream SBS. By providing multiple versions of the same visual content with different bit depths to different clients, the proposed color bit depth scalable coding achieves device adaptation.
It should be emphasized that the two input sequences, the 8-bit and the 10-bit video sequence, may differ in more than bit depth. The inter-layer prediction may therefore include, for example:
1) adjustment for different gamma corrections and different chromaticity coordinates, for example conversion from RGB color space (Rec. BT.601) to RGB color space (Rec. BT.709), or from RGB color space (Rec. BT.601) to a device-specific RGB color space;
2) color space conversion (including adjustment for different gamma corrections), for example conversion from XYZ color space to sRGB color space, from YCbCr color space (Rec. BT.709) to RGB color space (Rec. BT.709), or from YCbCr color space (Rec. BT.601) to YCbCr color space (Rec. BT.709);
3) chroma format conversion, for example YCbCr 4:2:0 to YCbCr 4:2:2, or YCbCr 4:2:0 to YCbCr 4:4:4;
4) color correction; and
5) combinations of the above.
Cases 1), 2) and 3) may involve nonlinear transforms, and in case 4) the relation between the two sequences under consideration may be as complicated as a look-up table (LUT). In addition, case 2) may involve processing across different color channels. For example, the conversion from YCbCr color space (Rec. BT.709) to RGB color space (Rec. BT.709) is modeled mathematically as a matrix operation, so that for each pixel the value of R (or G or B) is computed as a linear combination of the values of Y, Cb and Cr. At least one implementation proposes joint inter-layer prediction that includes processing across different color channels; this joint inter-layer prediction can be performed at picture level or at MB level.
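As a rough illustration of the matrix operation described above, the following Python sketch jointly predicts one 10-bit RGB enhancement layer pixel from all three channels of the co-located 8-bit YCbCr base layer pixel. The function name `predict_el_rgb`, the full-range sample assumption and the linear rescaling to the enhancement layer bit depth are assumptions made for illustration only. The coefficients are the usual Rec. BT.709 YCbCr-to-RGB values, and any gamma adjustment is omitted.

```python
def predict_el_rgb(y, cb, cr, bl_bits=8, el_bits=10):
    """Predict one EL RGB pixel from one BL YCbCr pixel (full range assumed)."""
    bl_max = (1 << bl_bits) - 1
    el_max = (1 << el_bits) - 1
    chroma_offset = 1 << (bl_bits - 1)  # 128 for 8-bit chroma

    # Normalize: luma to [0, 1], chroma to roughly [-0.5, 0.5].
    yn = y / bl_max
    cbn = (cb - chroma_offset) / bl_max
    crn = (cr - chroma_offset) / bl_max

    # Each output channel is a linear combination of Y, Cb and Cr (BT.709).
    r = yn + 1.5748 * crn
    g = yn - 0.1873 * cbn - 0.4681 * crn
    b = yn + 1.8556 * cbn

    def to_el(value):
        # Rescale to the EL bit depth and clip to the valid sample range.
        return min(el_max, max(0, round(value * el_max)))

    return to_el(r), to_el(g), to_el(b)
```

Because every output channel depends on Y and on at least one chroma channel, the prediction cannot be carried out channel by channel, which is the reason for the joint form of the inter-layer prediction.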
The following presents encoding and decoding methods that implement joint inter-layer color bit depth prediction. This section gives details of various implementations; such implementations may also be discussed in other sections. At least one implementation provides an AVC-compatible technical solution for joint inter-layer prediction that enables color bit depth scalability. Figs. 2 and 3 show the corresponding diagrams of color bit depth scalable encoders for intra and inter coding that include MB-level inter-layer color bit depth prediction. Without loss of generality, the inter-layer color bit depth prediction is assumed to include the conversion from YCbCr color space (Rec. BT.709) to RGB color space (Rec. BT.709). The decoding process is the inverse of the encoding process in both intra and inter coding.
With regard to Figs. 2 and 3, it should be noted that the three rate-distortion optimization (RDO) blocks RDO_r, RDO_g, RDO_b are independent of each other. That is, for each color channel it can be decided separately whether the enhancement layer is intra/inter coded directly without prediction, or whether prediction is performed and a residual is generated, and whether this residual is intra/inter coded directly or is transformed (T), quantized (Q) and entropy coded before the rate-distortion optimization decision. During RDO, the best trade-off between data rate and distortion is determined and the corresponding signal is selected. In the case of inter prediction, as shown in Fig. 3, the motion vectors 305r, 305g, 305b from the base layer MB can be used in the enhancement layer.
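The independent per-channel decision can be sketched as follows. The text does not specify the cost function, so this sketch assumes the common Lagrangian form J = D + lambda * R. The mode names, the function names and the numbers in `measurements` are hypothetical.

```python
def select_mode(candidates, lam):
    """Return the mode with the lowest cost J = D + lam * R.

    candidates maps a mode name to a (distortion, rate_in_bits) pair.
    """
    return min(candidates, key=lambda m: candidates[m][0] + lam * candidates[m][1])


def select_modes_per_channel(per_channel, lam):
    """Run one independent decision per enhancement layer color channel."""
    return {ch: select_mode(cands, lam) for ch, cands in per_channel.items()}


# Hypothetical measurements for one macroblock: (distortion, rate in bits).
measurements = {
    "R": {"el_coded": (10.0, 900), "residual": (0.0, 2000), "residual_coded": (12.0, 300)},
    "G": {"el_coded": (5.0, 200), "residual": (0.0, 1800), "residual_coded": (30.0, 150)},
    "B": {"el_coded": (8.0, 400), "residual": (0.0, 600), "residual_coded": (8.0, 100)},
}
modes = select_modes_per_channel(measurements, lam=0.01)
```

With these made-up figures each channel ends up with a different mode, which is only possible because no decision depends on the outcome for another channel.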
An indication of the selected coding type can be included in the syntax, for example in the MB type field.
Fig. 4 shows the use of an additional skip mode in each EL branch, so that the RDO has four inputs: a new mode, the so-called skip mode, is introduced to skip the EL residual signal. If the RDO selects skip mode, the EL contains no bits for the current MB. At the decoder, only the BL MB is decoded, and inter-layer color bit depth prediction is performed to obtain the reconstructed EL MB. In principle, intra-layer prediction works in the same way.
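On the decoder side, the effect of skip mode can be sketched as below. The function name, the string mode labels and the flat sample lists are illustrative assumptions, not syntax taken from the disclosure.

```python
def reconstruct_el_channel(mode, predicted, residual=None):
    """Reconstruct one EL color channel of a macroblock as a list of samples.

    predicted is the inter-layer prediction from the decoded BL macroblock.
    """
    if mode == "skip":
        # No EL bits were sent for this macroblock: use the prediction as is.
        return list(predicted)
    if mode == "residual":
        if residual is None or len(residual) != len(predicted):
            raise ValueError("residual mode needs one residual value per sample")
        # Add the decoded residual to the prediction, sample by sample.
        return [p + r for p, r in zip(predicted, residual)]
    raise ValueError(f"unknown mode: {mode}")
```

For example, `reconstruct_el_channel("skip", [512, 600])` simply returns the predicted samples, whereas residual mode adds the decoded differences to them.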
The following list gives a brief summary of various implementations. The list is not exhaustive; it merely provides a short description of a small number of the many possible implementations.
With reference to Figs. 2 and 3, a method for encoding video data comprising base layer data and enhancement layer data, wherein the base layer and enhancement layer data comprise a plurality of color channels (for example Y, Cr, Cb or R, G, B) and have different bit depths, comprises the following steps:
encoding 201y, 201cr, 201cb the base layer data; predicting the enhancement layer data from the base layer data, separately for the color channels; and encoding the enhancement layer data based on said predicted enhancement layer data, separately for the color channels (for example R, G, B),
wherein, in at least one mode, each enhancement layer color channel is jointly predicted 200 from all available base layer color channels, and wherein, for at least one (or some, or all) of the enhancement layer color channels, the method comprises the following further steps:
generating residual data R_Res, B_Res, G_Res, the residual data being the difference between the original enhancement layer color channel data R_EL, G_EL, B_EL and the predicted color channel data;
encoding 202r, 202g, 202b the original enhancement layer color channel data;
encoding 203r, 203g, 203b, 204r, 204g, 204b the residual data;
selecting RDO_r, RDO_g, RDO_b, for the at least one enhancement layer color channel, the encoded original enhancement layer color channel data, the residual data or the encoded residual data, the selection being independent of the selection for the other enhancement layer color channels; and
providing the selected enhancement layer color channel data as enhancement layer output data, and providing an indication of the coding mode selected for said enhancement layer color channel.
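The steps listed above can be sketched for a single enhancement layer color channel as follows. This is a simplified illustration: `encode` and `cost` are hypothetical callables standing in for the real codec and the rate-distortion measure, only two of the candidates are compared, and the toy encoder in the usage lines is not a real entropy coder.

```python
def encode_el_channel(original, predicted, encode, cost):
    """Return (mode, payload) for one enhancement layer color channel.

    original and predicted are sample lists of equal length.
    """
    # Residual = original EL channel data minus predicted channel data.
    residual = [o - p for o, p in zip(original, predicted)]
    # Encode both candidates, then keep the cheaper one.
    candidates = {
        "original": encode(original),
        "residual": encode(residual),
    }
    mode = min(candidates, key=lambda m: cost(candidates[m]))
    # The mode is returned with the payload so that it can be signaled.
    return mode, candidates[mode]


# Toy stand-ins: a text serializer as "encoder" and payload length as cost.
toy_encode = lambda samples: ",".join(map(str, samples)).encode()
mode, payload = encode_el_channel([512, 600, 700], [510, 601, 699], toy_encode, len)
```

Here the prediction is close to the original, so the residual is small and the residual candidate is selected; calling the function once per channel keeps the selections independent of each other.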
In one embodiment, the base layer and the enhancement layer use different color encodings (for example Y, Cr, Cb and R, G, B), and the inter-layer prediction 200 also includes color space conversion for intra and inter coding.
In one embodiment, the color space conversion comprises a conversion from YCbCr color space (Rec. BT.709) to RGB color space (Rec. BT.709).
In one embodiment, encoding the residual comprises entropy coding 204r, 204g, 204b.
In one embodiment, an additional coding mode for the enhancement layer color channel data comprises a skip mode 405 at macroblock level; in skip mode, the enhancement layer data contain no bits for the respective macroblock.
In one embodiment, in the selecting step RDO_r, RDO_g, RDO_b, the selection is based on minimizing data rate and distortion.
In one embodiment, the prediction 200 across different color channels is performed at picture level.
In one embodiment, the prediction across different color channels is performed at macroblock level.
In one embodiment, the method further comprises performing entropy coding EC_{Y,BL}, EC_{Cb,BL}, EC_{Cr,BL}, EC_{Y,EL}, EC_{Cb,EL}, EC_{Cr,EL} separately for each base layer and enhancement layer color channel.
According to another aspect of the invention, a method for decoding encoded video data having BL data and EL data comprises the following steps:
extracting the base layer data and the enhancement layer data from the encoded video data, wherein the base layer data and the enhancement layer data each comprise separate data for a plurality of color channels; extracting, for at least a first color channel of the enhancement layer, an indication of a coding mode; decoding the base layer data of the plurality of color channels; predicting the enhancement layer data based on the decoded base layer data, wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels; decoding the enhancement layer data of the plurality of color channels, wherein a residual is obtained and wherein, at least for said first color channel, the indication is used to decode according to the indicated coding mode; and reconstructing the enhancement layer data of the plurality of color channels based on the predicted enhancement layer data and said residual.
The following embodiments relate to the decoding method. In one embodiment, the base layer and the enhancement layer use different color encodings (for example Y, Cr, Cb or R, G, B), and the prediction step also includes color space conversion for intra and inter coding.
In one embodiment, the color space conversion comprises a conversion from YCbCr color space to RGB color space.
In one embodiment, decoding the residual comprises entropy decoding.
In one embodiment, an additional decoding mode for the enhancement layer color channels is used, comprising a skip mode at macroblock level, wherein, in skip mode, the enhancement layer data contain no bits for the respective macroblock.
In one embodiment, the prediction across different color channels is performed at picture level.
In one embodiment, the prediction across different color channels is performed at macroblock level.
In one embodiment, the method further comprises entropy decoding performed separately for each base layer and enhancement layer color channel.
According to another aspect, an apparatus for encoding video data comprising base layer data and enhancement layer data, wherein the base layer data and the enhancement layer data comprise a plurality of color channels (for example Y, Cr, Cb or R, G, B) and the base layer and the enhancement layer have different bit depths, comprises:
means 201y, 201cr, 201cb for encoding the base layer;
means 200 for predicting the enhancement layer from the base layer, separately for the color channels; and
means for encoding the enhancement layer based on said prediction, separately for the color channels R, G, B, wherein, in at least one mode, each enhancement layer color channel R, G, B is jointly predicted 200 from all available base layer color channels, and wherein, for at least one enhancement layer color channel, the apparatus further comprises:
means for generating a residual R_Res, B_Res, G_Res, the residual being the difference between the original enhancement layer color channel image R_EL, G_EL, B_EL and the predicted color channel image; means 202r, 202g, 202b for encoding the original enhancement layer color channel image; means 203r, 203g, 203b, 204r, 204g, 204b for encoding the residual; means RDO_r, RDO_g, RDO_b for selecting, for the at least one enhancement layer color channel, the encoded original enhancement layer color channel image, the residual or the encoded residual, the selection being independent of the selection for the other enhancement layer color channels; and means for providing the selected enhancement layer color channel data as enhancement layer output data and for providing an indication of the coding mode selected for said enhancement layer color channel.
The following embodiments relate to the apparatus for encoding video data.
In one embodiment, the base layer and the enhancement layer use different color encodings (Y, Cr, Cb and R, G, B), and the means 200 for performing inter-layer prediction further comprises means for performing color space conversion for intra and inter coding.
In one embodiment, the color space conversion comprises a conversion from YCbCr color space (Rec. BT.709) to RGB color space (Rec. BT.709).
In one embodiment, the means for encoding the residual comprises means 204r, 204g, 204b for performing entropy coding.
In one embodiment, the apparatus further comprises means 405 for performing, for the enhancement layer color channels, a skip mode at macroblock level as an additional coding mode, wherein, in skip mode, the enhancement layer contains no bits for the respective macroblock.
According to a further aspect of the invention, an apparatus for decoding encoded video data having base layer data and enhancement layer data comprises:
means for extracting the base layer data and the enhancement layer data from the encoded video data, wherein the base layer data and the enhancement layer data comprise separate data for a plurality of color channels; means for extracting, for at least a first color channel of the enhancement layer, an indication of a coding mode; means for decoding the base layer data of the plurality of color channels; means for predicting the enhancement layer data based on the decoded base layer data, wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels; means for decoding the enhancement layer data of the plurality of color channels, wherein a residual is obtained and wherein, at least for said first color channel, the indication is used to decode according to the indicated coding mode; and means for reconstructing the enhancement layer data of the plurality of color channels based on the predicted enhancement layer data and said residual.
The following embodiments relate to the apparatus for decoding encoded video data.
In one embodiment, the base layer and the enhancement layer use different color encodings, based on the Y, Cr, Cb color space and the R, G, B color space respectively, and the means for predicting further comprises means for performing color space conversion in the case of intra and inter coding.
In one embodiment, the means for performing color space conversion comprises means for performing a conversion from YCbCr color space to RGB color space.
In one embodiment, the means for decoding the residual comprises means for entropy decoding.
In one embodiment, the apparatus further comprises means for performing, for at least one enhancement layer color channel, skip mode decoding at macroblock level as an additional decoding mode, wherein, in skip mode, the enhancement layer data contain no bits for the respective macroblock.
In one embodiment, the means for performing the prediction across different color channels operates at picture level.
In one embodiment, the means for performing the prediction across different color channels operates at macroblock level.
In one embodiment, the apparatus further comprises means for performing entropy decoding separately for each base layer and enhancement layer color channel.
According to another aspect, an encoded video signal comprises base layer data and enhancement layer data, wherein the base layer data comprise a plurality of color channels in a first color encoding (for example Y, Cr, Cb) and the enhancement layer data comprise a plurality of color channels in a different, second color encoding (R, G, B), wherein the base layer data and the enhancement layer data have different color bit depths, and the signal further comprises a coding mode indication which indicates, at least for a first enhancement layer color channel, whether the signal contains encoded residual data or encoded macroblock data.
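One way to picture the content of such a signal for a single macroblock is the following container sketch. All class, field and enum names are hypothetical, and the layout is not the H.264/AVC or SVC bitstream syntax; it only records which pieces of information the signal is said to carry.

```python
from dataclasses import dataclass
from enum import Enum


class ELMode(Enum):
    MACROBLOCK = 0  # channel carries encoded macroblock data
    RESIDUAL = 1    # channel carries encoded residual data


@dataclass
class ScalableMacroblock:
    bl_bit_depth: int
    el_bit_depth: int
    bl_channels: dict  # e.g. {"Y": b"...", "Cb": b"...", "Cr": b"..."}
    el_channels: dict  # e.g. {"R": b"...", "G": b"...", "B": b"..."}
    el_modes: dict     # EL channel name -> ELMode

    def validate(self):
        if self.bl_bit_depth == self.el_bit_depth:
            raise ValueError("layers must have different color bit depths")
        # At least one EL channel must carry a coding mode indication,
        # and every indication must refer to an existing EL channel.
        if not self.el_modes or not set(self.el_modes) <= set(self.el_channels):
            raise ValueError("coding mode indication missing or misplaced")
        return True
```

A decoder would read the mode of each enhancement layer channel first and then interpret that channel's payload accordingly.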
According to one aspect, joint inter-layer prediction is performed by predicting each color channel of the enhancement layer MB from all (usually three) color channels of the reconstructed co-located base layer MB.
This disclosure describes various implementations. However, features and aspects of the described implementations may also be adapted for other implementations. For example, signaling may be implemented using a variety of techniques including, but not limited to, SPS syntax, other high-level syntax, non-high-level syntax, out-of-band information, and implicit signaling. In addition, various coding techniques may be used. Accordingly, although the implementations described herein may be described in a particular context, such descriptions should not be taken as limiting the features and concepts to such implementations or contexts.
The implementations described herein may be implemented as, for example, a method or process, an apparatus, or a software program. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method), the implementation or features discussed may also be implemented in other forms (for example, an apparatus or program). An apparatus may be implemented in, for example, appropriate hardware, software, and firmware. A method may be implemented in, for example, an apparatus such as a computer or other processing device. Additionally, a method may be implemented by instructions performed by a processing device or other apparatus, and such instructions may be stored on a computer-readable medium such as a CD, another computer-readable storage device, or an integrated circuit.
As will be evident to those skilled in the art, implementations may also produce a signal formatted to carry information that may be, for example, stored or transmitted. The information may include, for example, instructions for performing a method, or data produced by one of the described implementations. For example, a signal may be formatted to carry as data the values of a particular syntax (or even the syntax instructions themselves, if the syntax is being transmitted). Additionally, many implementations may be implemented in an encoder, a decoder, or both.
Further, other implementations are contemplated by this disclosure. For example, additional implementations may be created by combining, deleting, modifying, or supplementing various features of the disclosed implementations.
It will be understood that the present invention has been described purely by way of example, and modifications of detail can be made without departing from the scope of the invention.
Each feature disclosed in the description and (where appropriate) the claims and drawings may be provided independently or in any appropriate combination. Features may, where appropriate, be implemented in hardware, software, or a combination of the two. Connections may, where applicable, be implemented as wireless or wired connections, not necessarily direct or dedicated connections. Reference numerals appearing in the claims are by way of illustration only and shall have no limiting effect on the scope of the claims.

Claims (15)

1. A method for encoding video data comprising base layer (BL) data and enhancement layer (EL) data, wherein the base layer and enhancement layer data comprise a plurality of color channels (Y, Cr, Cb, R, G, B), and the base layer and enhancement layer data have different bit depths, the method comprising the steps of:
- encoding (201y, 201cr, 201cb) the base layer data;
- predicting (200) the enhancement layer data from the base layer data, separately for said color channels; and
- encoding the enhancement layer data, based on said predicted enhancement layer data, separately for said color channels (R, G, B);
wherein, in at least one mode, each enhancement layer color channel is jointly predicted (200) from all available base layer color channels, and the method comprises, for at least one enhancement layer color channel, the further steps of:
- generating residual data (R_res, B_res, G_res), the residual data being the difference between the original enhancement layer color channel (R_EL, G_EL, B_EL) and the predicted color channel data;
- encoding (202r, 202g, 202b) the original enhancement layer color channel data;
- encoding (203r, 203g, 203b, 204r, 204g, 204b) the residual data;
- selecting (RDO_r, RDO_g, RDO_b), for the at least one enhancement layer color channel, the encoded original enhancement layer color channel data, the residual data or the encoded residual data, wherein the selection is independent of the selection for the other enhancement layer color channels; and
- providing the selected enhancement layer color channel data as enhancement layer output data, and providing an indication of the selected coding mode that relates to said enhancement layer color channel.
2. The method according to claim 1, wherein the base layer and the enhancement layer use different color encodings (Y, Cr, Cb, R, G, B), and the inter-layer prediction (200) further comprises color space conversion for intra-frame and inter-frame coding.
3. The method according to claim 2, wherein the color space conversion comprises conversion from the YCbCr color space (Rec. BT.709) to the RGB color space (Rec. BT.709).
4. The method according to one of the preceding claims, wherein encoding the residual comprises entropy coding (204r, 204g, 204b).
5. The method according to one of the preceding claims, wherein an additional coding mode for the enhancement layer color channel data comprises a skip mode (405) on the macroblock level, and in skip mode the enhancement layer data comprise no bits for the respective macroblock.
6. The method according to one of the preceding claims, wherein, in the selecting step (RDO_r, RDO_g, RDO_b), the selection is based on minimizing data rate and distortion.
7. The method according to one of the preceding claims, wherein the prediction (200) across different color channels is performed on the picture level.
8. The method according to one of the preceding claims, wherein the prediction across different color channels is performed on the macroblock level.
9. The method according to one of the preceding claims, further comprising entropy coding (EC_Y,BL; EC_Cb,BL; EC_Cr,BL; EC_Y,EL; EC_Cb,EL; EC_Cr,EL) performed separately for each base layer and enhancement layer color channel.
10. A method for decoding encoded video data having BL data and EL data, comprising the steps of:
- extracting base layer data and enhancement layer data from the encoded video data, wherein the base layer data and the enhancement layer data each comprise separate data for a plurality of color channels;
- extracting, at least for a first color channel of the enhancement layer, an indication that indicates a coding mode;
- decoding the base layer data of the plurality of color channels;
- predicting the enhancement layer data based on the decoded base layer data, wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels;
- decoding the enhancement layer data of the plurality of color channels, wherein a residual is obtained and, at least for said first color channel, the decoding uses said indication and is performed according to the indicated coding mode; and
- reconstructing the enhancement layer data of the plurality of color channels based on the predicted enhancement layer data and said residual.
11. An apparatus for encoding video data comprising a base layer (BL) and an enhancement layer (EL), wherein the base layer and enhancement layer data comprise a plurality of color channels (Y, Cr, Cb, R, G, B), and the base layer and the enhancement layer have different bit depths, the apparatus comprising:
- means (201y, 201cr, 201cb) for encoding the base layer;
- means (200) for predicting the enhancement layer from the base layer, separately for the color channels; and
- means for encoding the enhancement layer, based on said predicted enhancement layer, separately for the color channels (R, G, B);
wherein, in at least one mode, each enhancement layer color channel (R, G, B) is jointly predicted (200) from all available base layer color channels, and the apparatus further comprises, for at least one enhancement layer color channel:
- means for generating a residual (R_res, B_res, G_res), the residual being the difference between the original enhancement layer color channel (R_EL, G_EL, B_EL) and the predicted color channel image;
- means (202r, 202g, 202b) for encoding the original enhancement layer color channel image;
- means (203r, 203g, 203b, 204r, 204g, 204b) for encoding the residual;
- means (RDO_r, RDO_g, RDO_b) for selecting, for the at least one enhancement layer color channel, the encoded original enhancement layer color channel image, the residual or the encoded residual, wherein the selection is independent of the selection for the other enhancement layer color channels; and
- means for providing the selected enhancement layer color channel data as enhancement layer output data and for providing an indication of the selected coding mode that relates to said enhancement layer color channel.
12. The apparatus according to the preceding claim, wherein the base layer and the enhancement layer use different color encodings (Y, Cr, Cb, R, G, B), and the means (200) for inter-layer prediction further comprises means for performing color space conversion for intra-frame and inter-frame coding.
13. An apparatus for decoding encoded video data having base layer and enhancement layer data, comprising:
- means for extracting base layer data and enhancement layer data from the encoded video data, wherein the base layer data and the enhancement layer data each comprise separate data for a plurality of color channels;
- means for extracting, at least for a first color channel of the enhancement layer, an indication that indicates a coding mode;
- means for decoding the base layer data of the plurality of color channels;
- means for predicting the enhancement layer data based on the decoded base layer data, wherein, in at least one mode, each enhancement layer color channel is jointly predicted from all available base layer color channels;
- means for decoding the enhancement layer data of the plurality of color channels, wherein a residual is obtained and, at least for said first color channel, the decoding uses said indication and is performed according to the indicated coding mode; and
- means for reconstructing the enhancement layer data of the plurality of color channels based on the predicted enhancement layer data and said residual.
14. The apparatus according to the preceding claim, wherein the base layer and the enhancement layer use different color encodings (Y, Cr, Cb, R, G, B), and the means for predicting further comprises means for performing color space conversion for intra-frame and inter-frame coding.
15. An encoded video signal comprising base layer (BL) and enhancement layer (EL) data, wherein the base layer data comprise a plurality of color channels (Y, Cr, Cb) of a first color encoding, and the enhancement layer data comprise a plurality of color channels (R, G, B) of a different second color encoding, wherein the base layer data and the enhancement layer data have different color bit depths, and the signal further comprises a coding mode indication that indicates, at least for a first enhancement layer color channel, whether it contains encoded residual data or encoded macroblock data.
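The per-channel mode selection of the encoding method and the mode-dependent reconstruction of the decoding method can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: the cost function J = D + lambda * R is a common rate-distortion formulation assumed here, the mode names "intra" and "residual" are hypothetical labels for the coding mode indication, and rate and distortion figures are supplied by the caller rather than measured by a real codec.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    mode: str          # hypothetical label, e.g. "intra" or "residual"
    rate_bits: float   # estimated bits needed for this candidate
    distortion: float  # estimated distortion of this candidate


def select_mode(candidates, lam):
    """Pick the candidate with the lowest assumed cost J = D + lam * R."""
    return min(candidates, key=lambda c: c.distortion + lam * c.rate_bits)


def select_per_channel(per_channel, lam):
    """Choose a mode for each enhancement-layer channel independently."""
    return {ch: select_mode(cands, lam).mode for ch, cands in per_channel.items()}


def reconstruct_channel(mode, predicted, payload, max_value):
    """Decoder side: add the residual to the prediction, or use the
    directly coded samples, depending on the signalled mode."""
    if mode == "residual":
        return [min(max(p + r, 0), max_value) for p, r in zip(predicted, payload)]
    return list(payload)
```

Because each channel is evaluated on its own, one channel may be sent as a residual against the joint prediction while another is coded directly, and the decoder follows the coding mode indication of each channel when reconstructing.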
CN 200880013169 2007-04-23 2008-04-09 Method and apparatus for encoding video data, method and apparatus for decoding encoded video data and encoded video signal Pending CN101663896A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US92578807P 2007-04-23 2007-04-23
US60/925,801 2007-04-23
US60/925,788 2007-04-23

Publications (1)

Publication Number Publication Date
CN101663896A true CN101663896A (en) 2010-03-03

Family

ID=41790676

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200880013169 Pending CN101663896A (en) 2007-04-23 2008-04-09 Method and apparatus for encoding video data, method and apparatus for decoding encoded video data and encoded video signal

Country Status (1)

Country Link
CN (1) CN101663896A (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103477621A (en) * 2011-01-28 2013-12-25 艾艾欧有限公司 Color conversion based on an HVS model
US9554142B2 (en) 2011-01-28 2017-01-24 Eye IO, LLC Encoding of video stream based on scene type
CN103477621B (en) * 2011-01-28 2016-09-28 艾艾欧有限公司 Color based on HVS pattern is changed
CN106878707B (en) * 2011-04-14 2018-05-08 杜比实验室特许公司 More color channel multiple regression predictors
CN106878707A (en) * 2011-04-14 2017-06-20 杜比实验室特许公司 Many color channel multiple regression predictors
US10021390B2 (en) 2011-04-14 2018-07-10 Dolby Laboratories Licensing Corporation Multiple color channel multiple regression predictor
US10237552B2 (en) 2011-04-14 2019-03-19 Dolby Laboratories Licensing Corporation Multiple color channel multiple regression predictor
CN103535038A (en) * 2011-05-16 2014-01-22 杜比实验室特许公司 Efficient architecture for layered vdr coding
US10298923B2 (en) 2011-05-16 2019-05-21 Dolby Laboratories Licensing Corporation Efficient architecture for layered VDR coding
CN105744277A (en) * 2011-11-04 2016-07-06 杜比实验室特许公司 Layer decomposition in hierarchical VDR coding
CN105744277B (en) * 2011-11-04 2019-12-24 杜比实验室特许公司 Layer decomposition in layered VDR coding
CN104322068A (en) * 2012-06-27 2015-01-28 英特尔公司 Cross-layer cross-channel residual prediction
US10536710B2 (en) 2012-06-27 2020-01-14 Intel Corporation Cross-layer cross-channel residual prediction
CN105025997A (en) * 2012-12-28 2015-11-04 高通股份有限公司 Inter-layer prediction using sample-adaptive adjustments for bit depth scalable video coding
CN105025997B (en) * 2012-12-28 2019-06-21 高通股份有限公司 The inter-layer prediction adaptively adjusted for bit depth scalable video coding using sample
WO2014106608A1 (en) * 2013-01-04 2014-07-10 Canon Kabushiki Kaisha Encoding and decoding methods and devices, and corresponding computer programs and computer readable media
CN105409217A (en) * 2014-06-19 2016-03-16 索尼公司 Image processing device, image processing method and computer-readable medium
CN105409217B (en) * 2014-06-19 2019-12-24 索尼公司 Image processing apparatus, image processing method, and computer readable medium
CN110913217A (en) * 2014-06-19 2020-03-24 Vid拓展公司 System and method for three-dimensional color mapping model parameter optimization
CN110913217B (en) * 2014-06-19 2022-01-25 Vid拓展公司 System and method for three-dimensional color mapping model parameter optimization
CN107257989A (en) * 2015-03-24 2017-10-17 英特尔公司 The palette compression of cluster
CN107690808A (en) * 2015-06-08 2018-02-13 高通股份有限公司 The derivation of colour gamut scalability parameter and table in scalable video coding
CN113329269A (en) * 2020-02-28 2021-08-31 杭州海康威视数字技术股份有限公司 Video encoding method, video decoding method, video encoding device, video decoding device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN101663896A (en) Method and apparatus for encoding video data, method and apparatus for decoding encoded video data and encoded video signal
US20100128786A1 (en) Method and apparatus for encoding video data, method and apparatus for decoding encoded video data and encoded video signal
KR102562874B1 (en) Color gamut adaptation by feedback channel
JP5383674B2 (en) Method and apparatus for encoding and / or decoding video data using enhancement layer residual prediction for bit depth scalability
JP5346198B2 (en) Enhancement layer residual prediction for bit depth scalability using hierarchical LUT
CN101622878B (en) Video encoding method and video decoding method for enabling bit depth scalability
US20230276061A1 (en) Scalable video coding system with parameter signaling
CN101601300B (en) Method and apparatus for encoding and/or decoding bit depth scalable video data using adaptive enhancement layer prediction
CN101558650B (en) Method and apparatus for encoding and/or decoding video data
CN111480339B (en) Method and apparatus for video coding, computer readable storage medium, device
KR101719971B1 (en) Deblocking chroma data for video coding
CN102067609B (en) Methods and apparatus for video coding and decoding with reduced bit-depth update mode and reduced chroma sampling update mode
US20050259729A1 (en) Video coding with quality scalability
US20100220789A1 (en) Combined spatial and bit-depth scalability
TW201625002A (en) QP derivation and offset for adaptive color transform in video coding
CN105230023A (en) The self adaptation of color space, color samples rate and/or bit-depth switches
CN110754085A (en) Color remapping for non-4: 4:4 format video content
CN105659610A (en) Image processing device and method
EP3272124B1 (en) Scalable video coding system with parameter signaling

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20100303

C20 Patent right or utility model deemed to be abandoned or is abandoned