US20230239498A1 - Bit depth variable for high precision data in weighted prediction syntax and semantics - Google Patents

Bit depth variable for high precision data in weighted prediction syntax and semantics Download PDF

Info

Publication number
US20230239498A1
US20230239498A1 US18/124,503 US202318124503A US2023239498A1 US 20230239498 A1 US20230239498 A1 US 20230239498A1 US 202318124503 A US202318124503 A US 202318124503A US 2023239498 A1 US2023239498 A1 US 2023239498A1
Authority
US
United States
Prior art keywords
weight
luma
chroma
component
delta
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US18/124,503
Inventor
Yue Yu
Limin Wang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Arris Enterprises LLC
Original Assignee
Arris Enterprises LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Arris Enterprises LLC filed Critical Arris Enterprises LLC
Priority to US18/124,503 priority Critical patent/US20230239498A1/en
Publication of US20230239498A1 publication Critical patent/US20230239498A1/en
Priority to US18/379,994 priority patent/US20240155151A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the High Efficiency Video Coding (“HEVC”) coding standard (also called H.265) is a coding standard promulgated by the ISO/IEC MPEG standardization organizations.
  • HEVC supports resolutions higher than “high definition,” which means pixels may be represented by a larger number of bits than the high definition pictures. For example, 4 K resolutions may include images that are 4,000 pixels wide compared to high definition images that are 1920 pixels wide.
  • Temporal motion prediction is an effective method to increase the coding efficiency and provides high compression.
  • HEVC uses a translational model for temporal motion prediction.
  • a prediction signal for a given current unit in a current picture is generated from a corresponding reference unit in a reference picture.
  • the coordinates of the reference unit are given by a motion vector that describes the translational motion along horizontal (x) and vertical (y) directions that would be added/subtracted to/from the coordinates of the current unit.
  • a decoder needs the motion vector to decode the compressed video.
  • HEVC may use single prediction using one reference pictures or bi-prediction using two reference pictures.
  • the pixels in reference units of the reference pictures are used as the prediction.
  • pixels of one of the reference units in bi-prediction may not yield the most accurate prediction.
  • HEVC may use weighted prediction when performing the motion estimation process. Weighted prediction may weight the pixels in one or both of the reference units used as the prediction differently.
  • Embodiments of the present invention provide a variable, BitDepth, that may be set at a value based on a number of bits used to represent pixels in pictures of a video.
  • the variable may be used in syntax elements in HEVC, such as the HEVC range extension, but other coding standards may be used.
  • HEVC High Efficiency Video Coding
  • the variable By using the variable, different resolutions for the video may be accommodated during the encoding and decoding process. For example, the number of pixels in the pictures may be represented by 8 bits, 10 bits, 12 bits, or another number of bits depending on the resolution.
  • Using the BitDepth variable in the syntax provides flexibility in the motion estimation and motion compensation process. For example, syntax elements used in the weighted prediction process may take into account different numbers of bits used to represent the pictures.
  • a method of an embodiment of the present invention includes the following steps: (1) setting a first value for a variable associated with a bit depth based on a number of bits associated with pixels of pictures in a video; (2) determining a first weighting factor for performing weighted prediction for a current unit of a current picture; (3) using the first weighting factor to weight pixels of a first reference unit of a first reference picture when performing motion compensation for the current unit; and (4) setting a second value for a weighted prediction syntax element associated with the weighting factor, wherein the second value for the weighted prediction syntax element is based on the first value.
  • FIG. 1 depicts a simplified system for encoding and decoding video according to one embodiment.
  • FIG. 2 depicts an example of the motion estimation and compensation process according to one embodiment.
  • FIG. 3 depicts a table of syntax elements that are used for weighted prediction according to one embodiment.
  • FIG. 4 depicts a simplified flowchart of a method for using the variable BitDepth in the encoding process according to one embodiment.
  • FIG. 5 depicts a simplified flowchart for decoding an encoded bitstream using the variable BitDepth according to one embodiment.
  • Particular embodiments provide a variable, BitDepth, that may be set at a value based on a number of bits used to represent pixels in pictures of a video.
  • the variable may be used in syntax elements in HEVC, such as the HEVC range extension, but other coding standards may be used.
  • HEVC High Efficiency Video Coding
  • the variable By using the variable, different resolutions for the video may be accommodated during the encoding and decoding process.
  • the number of pixels in the pictures may be represented by 8 bits, 10 bits, 12 bits, or another number of bits depending on the resolution.
  • Using the BitDepth variable in the syntax provides flexibility in the motion estimation and motion compensation process. For example, syntax elements used in the weighted prediction process may take into account different numbers of bits used to represent the pictures.
  • FIG. 1 depicts a simplified system 100 for encoding and decoding video according to one embodiment.
  • System 100 includes an encoder 102 and a decoder 104 .
  • Encoder 102 and decoder 104 may use a video coding standard to encode and decode video, such as HEVC.
  • encoder 102 and decoder 104 may use syntax elements from the HEVC range extension. Also, other elements of encoder 102 and decoder 104 may be appreciated.
  • Encoder 102 and decoder 104 perform temporal prediction through motion estimation and motion compensation.
  • Motion estimation is a process of determining a motion vector (MV) for a current unit of video.
  • the motion estimation process searches for a best match prediction for a current unit of video (e.g., a prediction unit (PU)) over reference pictures.
  • the best match prediction is described by the motion vector and associated reference picture ID.
  • a reference unit in a B picture may have up to two motion vectors that point to a previous reference unit in a previous picture and a subsequent reference unit in a subsequent reference picture in the picture order.
  • Motion compensation is then performed by subtracting a reference unit pointed to by the motion vector from the current unit of video to determine a residual error that can be encoded.
  • the two motion vectors point to two reference units, which can be combined to form a combined bi-directional reference unit.
  • the combined bi-directional reference unit can be subtracted from the current unit to determine the residual error.
  • encoder 102 and decoder 104 include motion estimation and compensation blocks 104 - 1 and 104 - 2 , respectively.
  • B and P pictures may exploit the temporal redundancy by using weighted prediction.
  • Weighted prediction applies weighting factors to one or both of the reference pictures.
  • the reference unit may be weighted before being used as a prediction for the current unit.
  • the reference units are weighted before combining the reference pictures into the combined bi-directional reference unit.
  • an average of the reference units from which the B picture is predicted may be used.
  • a weighted average (or other weighted calculation) of the reference units may be used to predict the current unit.
  • the pixels of reference units may be weighted by weighting factors before combining.
  • the following may be discussed with respect to B pictures, but the discussion will also apply to P pictures except that the combined bi-directional reference unit is replaced by one weighted reference unit.
  • the use of weighted prediction may be useful when certain conditions happen in the video, such as when one scene fades into another or where there is a gradual variation in luminance, such as when fading to or from black or in cross-fades.
  • the picture associated with the fading may not be as accurate to use as a prediction than a picture that does not include the fading.
  • Weighting the picture that does not include the fading higher may create a better prediction for the current unit. That is, the pixels of the reference unit that does not include the fading effect may have a higher correlation to the current unit.
  • the weighting may reduce the residual error and also reduce the bitrate.
  • the weighting factors may be provided for the luma and/or chroma components of reference units 202 . Also, the weighting factors may be different based on the reference list used. That is, the bi-prediction may select a reference picture from a list0 and a list1. Reference pictures in list0 may be weighted with a weighting factor w 0 and reference pictures in list1 may be weighted with a weighting factor w 1 . Also, only one of the reference pictures may be weighted.
  • the weighting factors may be information that adjusts the pixel values differently in the reference units. In one embodiment, the weighting factors may be percentages.
  • the weighted prediction process may use syntax elements that define parameters that encoder 102 and decoder 104 use to perform the weighted prediction. By setting the values of these parameters, a weighted prediction manager 106 - 1 in encoder 102 and a weighted prediction manager 106 - 2 in decoder 104 can perform the weighted prediction process. In a simple example, the weighting factors may weight pixels in different reference pictures differently. Then, weighted prediction managers 106 - 1 and 106 - 2 take the weighted average of the reference units to use as a combined bi-directional reference unit. Motion estimation and compensation blocks 104 - 1 and 104 - 2 then use this combined bi-directional reference unit in the motion compensation process for the current unit.
  • the syntax elements will be described in more detail below after describing the weighted prediction process in more detail.
  • FIG. 2 depicts an example of the motion estimation and compensation process according to one embodiment.
  • the video includes a number of pictures 200 - 1 - 200 - 5 .
  • a current picture is shown at 200 - 3 and includes a current unit of video 202 - 1 .
  • Current unit 202 - 1 may be bi-predicted using reference units from reference pictures in other pictures 200 , such as a previous picture 200 - 1 in the picture order and a subsequent picture 200 - 5 in the picture order.
  • Picture 200 - 1 includes a reference unit 202 - 2 and picture 200 - 5 includes a reference unit 202 - 3 , both of which can be used to predict current unit 202 - 1 .
  • the pixels of reference units 202 - 2 and 202 - 3 may be weighted differently.
  • the pixels of reference units may be weighted by the weighting factors.
  • the weighting factors may be percentages, such as the pixels of reference unit 202 - 2 may be weighted with a weighting factor w 0 of 0.25 and the pixels of reference unit 202 - 3 may be weighted with a weighting factor w 1 of 0.75. These weighting factors may then be used to calculate the pixel values used as the combined bi-directional reference unit for current unit 202 - 1 .
  • motion estimation and compensation block 104 - 1 can determine motion vectors that represent the location of reference units 202 - 2 and 202 - 3 with respect to current unit 202 - 1 . Then, motion estimation and compensation block 104 - 1 calculates a difference between the combined bi-directional reference unit and the current unit 202 - 1 as a residual error.
  • encoder 102 encodes the residual error, the motion vectors, and also the weighting factors used to determine the combined bi-directional reference unit.
  • Encoder 102 includes the encoded residual error, the motion vectors, and the weighting factors in an encoded bitstream that is sent to decoder 104 .
  • the term weighting factors is used for discussion purposes to represent information that is encoded that allows the weighted prediction process to be performed and the weighting factors to be determined. The syntax elements used to determine which information for the weighting factors that are encoded in the encoded bitstream are described in more detail below.
  • Decoder 104 receives the encoded bitstream and can reconstruct the pictures of the video. Decoder 104 may reconstruct reference units 202 - 2 and 202 - 3 from the encoded bitstream prior to decoding current unit 202 - 1 . Also, decoder 104 decodes the residual error for current unit 202 - 1 , the motion vectors for current unit 202 - 1 , and the weighting factors. Then, in decoder 104 , motion estimation and compensation block 104 - 2 may then use the residual error to reconstruct the current unit 202 - 1 . For example, motion estimation and compensation block 104 - 2 may use the motion vectors to locate reconstructed reference units 202 - 2 and 202 - 3 .
  • weighted prediction manager 106 - 2 applies the weighting factors to the reconstructed units 202 - 2 and 202 - 3 to form the reconstructed combined bi-directional reference unit.
  • the reconstructed residual error is then added to the reconstructed combined bi-directional predicted unit to form a reconstructed current unit.
  • syntax elements for weighted prediction may use a variable BitDepth in the weighted prediction process.
  • the variable BitDepth may represent a number of bits used to represent a pixel in a picture. The following will describe syntax elements that use the bit depth variable.
  • FIG. 3 depicts a table 300 of syntax elements that are used for weighted prediction according to one embodiment.
  • the combined bi-directional reference unit may be determined by applying the weighting factors in an averaging operation.
  • the weighting factor w 0 is multiplied by the previous reference unit 202 - 2 (e.g., the luma and chroma components of the pixels) and the weighting factor w 1 is multiplied by the subsequent reference unit 202 - 3 (e.g., the luma and chroma components of the pixels).
  • weighting factors e.g., normalization
  • the above example may not be exactly how encoder 102 and decoder 104 perform the calculation to determine the combined bi-directional reference unit.
  • Various methods may be used, but performing a division operation may be an expensive computational operation.
  • One method of deriving the weighting factors is to use bit shifting.
  • the weighting factors can be derived with a common denominator and the division is represented by a right shift of the combined weighted prediction of a number of bits based on a base 2 logarithm of the denominator. That is, the weighting factor, such as the luma weighting factor, may be bit shifted by the base 2 logarithm of the denominator.
  • the following syntax elements encode parameter values that can be used to perform the weighted prediction process.
  • the syntax elements in Table 300 of luma_log2_weight_denom at 302 , delta_chroma_log2_weight_denom at 304 , delta_luma_weight_l0[ i ] at 306 , delta_chroma_weight_l0[ i ][ j ] at 308 , delta­_luma_weight_l1[ i ] at 310 , and delta_chroma_weight_l1[ i ][ j ] at 312 may use a BitDepth variable for the number of bits that represent a pixel. This allows these syntax elements to be used for video that may represent pixels with a different number of bits, such as 8, 10, 12, 14, 16, etc., bits. The syntax elements can thus handle high precision data, which may represent pixels with more than 8 bits.
  • the syntax element luma_log2_weight_denom is the base 2 logarithm of the denominator for all luma weighting factors.
  • the value of luma_log2_weight_denom shall be in the range of 0 to BitDepth Y , inclusive, where BitDepth Y is the bit depth of the luma component of the reference picture. By including the range to be dependent on the variable BitDepth Y , the value can represent different numbers of bits for the pixels.
  • the syntax element luma_log2_weight_denom is used to calculate the luma weighting factor (e.g., the delta luma weighting factor of the syntax element delta_luma_weight_l0[ i ] and delta_luma_weight_l1[ i ] described below).
  • the syntax element delta_chroma_log2_weight_denom is the difference of the base 2 logarithm of the denominator for all chroma weighting factors.
  • the variable ChromaLog2WeightDenom which is the log 2 denominator of the chroma weighting factor, is derived to be equal to luma_log2_weight_denom + delta_chroma_log2_weight_denom, and the value shall be in the range of 0 to BitDepth c , inclusive, where the variable BitDepth c is the bit depth of the chroma component of the reference picture.
  • the chroma weighting factor value can represent different numbers of bits for the pixels.
  • the syntax element luma_weight_l0_flag[ i ] when equal to 1, specifies that weighting factors for the luma component of list 0 prediction using RefPicList0[ i ] are present.
  • the syntax element luma_weight_l0_flag[ i ] when equal to 0, specifies that these weighting factors are not present.
  • the syntax element chroma_weight_l0_flag[ i ] when equal to 1, specifies that weighting factors for the chroma prediction values of list 0 prediction using RefPicList0[ i ] are present.
  • the syntax element chroma_weight_l0_flag[ i ] when equal to 0, specifies that the chroma weighting factors are not present.
  • chroma_weight_l0_flag[ i ] when not present, it is inferred to be equal to 0.
  • the syntax element delta_luma_weight_l0[ i ] is the difference of the weighting factor applied to the luma prediction value for list 0 prediction using RefPicList0[ i ].
  • the variable LumaWeightL0[ i ] may be the luma weighting factor for list0 and is derived to be equal to (1 ⁇ luma_log2_weight_denom) + delta_luma_weight_l0[ i ]. That is, the luma weighting factor is a right shift of the variable luma_log2_weight_denom plus the value of delta_luma_weight_l0[ i ].
  • delta_luma_weight_l0_flag[ i ] When luma_weight_l0_flag[ i ] is equal to 1, the value of delta_luma_weight_l0[ i ] shall be in the range of –(1 ⁇ (BitDepth Y – 1)), (1 ⁇ (BitDepth Y – 1)) –1, inclusive, where BitDepth Y is the bit depth for the luma component of the reference picture. This sets the range of the weighting factor to be based on the variable BitDepth Y . When luma_weight_l0_flag[ i ] is equal to 0, LumaWeightL0[ i ] is inferred to be equal to 2 luma_log2_weight_denom .
  • the syntax element luma_offset_l0[ i ] is the additive offset applied to the luma prediction value for list 0 prediction using RefPicList0[ i ].
  • the value of luma_offset_l0[ i ] shall be in the range of -128 to 127, inclusive, or –(1 ⁇ (BitDepth Y – 1) to (1 ⁇ (BitDepth Y – 1) – 1, inclusive.
  • luma_weight_l0_flag[ i ] is equal to 0
  • luma_offset_l0[ i ] is inferred as equal to 0.
  • the syntax element delta_chroma_weight_l0[ i ][ j ] is the difference of the weighting factor applied to the chroma prediction values for list 0 prediction using RefPicList0[ i ] with j equal to 0 for Cb and j equal to 1 for Cr.
  • the variable ChromaWeightL0[ i ][ j ] may be the chroma weighting factor for list0 and is derived to be equal to (1 ⁇ ChromaLog2WeightDenom) + delta_chroma_weight_l0[ i ][ j ].
  • the chroma weighting factor is a right shift of the variable ChromaLog2WeightDenom plus the value of delta_chroma_weight_l0[ i ][ j ].
  • chroma_weight_l0_flag[ i ] is equal to 1
  • the value of delta_chroma_weight_l0[ i ][ j ] shall be in the range of –(1 ⁇ (BitDepth C –1)), (1 ⁇ (BitDepth C – 1)) –1, inclusive, where BitDepth C is the bit depth for the chroma component of the reference picture. This sets the range of the weighting factor to be based on the variable BitDepth C .
  • ChromaWeightL0[ i ][ j ] is inferred to be equal to 2 ChromaLog2WeightDenom.
  • the syntax element delta_chroma_offset_l0[ i ][ j ] is the difference of the additive offset applied to the chroma prediction values for list 0 prediction using RefPicList0[ i ] with j equal to 0 for Cb and j equal to 1 for Cr.
  • ChromaOffsetL0[ i ][ j ] is derived as follows:
  • ChromaOffsetL0[ i ][ j ] Clip3( -128, 127, ( delta_chroma_offset_l0[ i ][ j ] - (( 128 * ChromaWeight L0[ i ][ j ] ) >> ChromaLog2WeightDenom) + 128 ) )
  • delta_chroma_offset_10[ i ][ j ] shall be in the range of -512 to 511, inclusive.
  • variable ChromaOffsetL0[ i ][ j ] is derived as follows:
  • Clip3( -(1 ⁇ (BitDepth C - 1), (1 ⁇ (BitDepth C - 1) -1, (delta_chroma_offset_10[i]
  • delta_chroma_offset_10[ i ][ j ] shall be in the range of -(1 ⁇ (BitDepth C + 1) to (1 ⁇ (BitDepth C + 1) –1, inclusive.
  • ChromaOffsetL0[ i ][ j ] is inferred to be equal to 0.
  • the syntax elements luma_weight_l1_flag[ i ], chroma_weight_l1_flag[ i ], delta_luma_weight_l1[ i ], luma_offset_l1[ i ], delta_chroma_weight_l1[ i ][ j ], and delta_chroma_offset_l1[ i ][ j ] have the same semantics as luma_weight_10_flag[ i ], chroma_weight_l0_flag[ i ], delta_luma_weight_l0[ i ], luma_offset_l0[ i ], delta_chroma_weight_l0[ i ][ j ], and delta_chroma_offset_l0[ i ][ j ], respectively, with 10, L0, list 0, and List0 replaced by l1, L1, list 1, and List1, respectively. That is, these are the same syntax elements for list1 instead of
  • the above syntax elements luma_offset_l0, luma_offset_l1, ChromaOffsetL0, ChromaOffsetL1 are modified when the pixels of reference pictures may include higher precision, such as when the syntax element extended_precision_processing_flag is equal to 1.
  • encoder 102 may change the number of bits that is used for weighted prediction based on the number of bits that represent pixels in the pictures of the video.
  • encoder 102 knows the BitDepth for the pictures of a video and can set the variable BitDepth to the value of the number of bits that represent the pixels of pictures.
  • FIG. 4 depicts a simplified flowchart 400 of a method for using the variable BitDepth in the encoding process according to one embodiment.
  • encoder 102 determines a value for the variable BitDepth for the luma and chroma components. The value may be based on the precision of the video, such as a number of bits that represent the pixels in pictures of a video. Encoder 102 may determine the value based on analyzing characteristics of the video. Also, the number of bits that represent the pixels may be input or included in metadata associated with the video.
  • motion estimation and compensation block 104 - 1 determines when weighted prediction is enabled. For example, weighted prediction flags, luma_weight_l0_flag[ i] and chroma_weight_l0_flag[ i ], may be set to indicate that weighted prediction is enabled for the luma and chroma components.
  • weighted prediction manager 106 - 1 determines motion vectors for reference units 202 - 2 and 202 - 3 and weighting factors for the luma and chroma components for reference units 202 - 2 and 202 - 3 (e.g., list0 and list1).
  • the weighting factors may be assigned to the pictures (e.g., list0 and list1) that are being used as reference pictures for a current picture.
  • weighted prediction manager 106 - 1 then calculates the combined bi-directional reference unit to use in motion compensation based on the weighting factors and pixels of reference units 202 - 2 and 202 - 3 . As described above, the weighting factors weight pixels differently in reference units 202 - 2 and 202 - 3 .
  • motion estimation and compensation block 104 - 1 determines the residual error using current unit 202 - 1 and the combined bi-directional reference unit.
  • encoder 102 calculates the values for the applicable syntax elements used in the weighted prediction process. For example, the values for the syntax elements luma_log2_weight_denom, delta_chroma_log2_weight_denom, delta_luma_weight_l0[ i ], delta_chroma_weight_l0[ i ][ j ], delta_luma_weight_l1[ i ], and delta_chroma_weight_l1[ i ][ j ] using the variable BitDepth are calculated.
  • encoder 102 encodes the parameters for the syntax elements for the weighted prediction process in an encoded bitstream along with the motion vectors and residual error for current unit 202 - 1 . For example, encoder 102 encodes the values calculated for the syntax elements shown in Table 300 in FIG. 3 .
  • encoder 102 sends the encoded bitstream to decoder 104 .
  • FIG. 5 depicts a simplified flowchart 500 for decoding an encoded bitstream using the variable BitDepth according to one embodiment.
  • decoder 104 receives the encoded bitstream.
  • decoder 104 determines a current unit 202 - 1 in a current picture to decode.
  • decoder 104 decodes the encoded motion vectors from the encoded bitstream for current unit 202 - 1 and uses these motion vectors to locate the reconstructed reference units 202 - 2 and 202 - 3 .
  • decoder 104 determines the residual error for current unit 202 - 1 from the encoded bitstream.
  • weighted prediction manager 106 - 2 may determine the values for the syntax elements in Table 300 for weighted prediction. This may allow weighted prediction manager 106 - 2 to determine the weighting factors. At 512 , weighted prediction manager 106 - 2 may then apply the weighting factors to reference units 202 - 2 and 202 - 3 to determine the combined bi-directional reference unit. At 514 , motion estimation and compensation block 104 - 2 can then add the residual error for current unit 202 - 1 to the combined weighted prediction unit to form a reconstructed current unit.
  • particular embodiments use a variable BitDepth for syntax elements in the weighted prediction process.
  • the syntax elements luma_log2_weight_denom, delta_chroma_log2_weight_denom, delta_luma_weight_l0[ i ], delta_chroma_weight_l0[ i ][ j ], delta_luma_weight_l1[ i ], and delta_chroma_weight_l1[ i ][ j ] use the variable BitDepth.
  • Particular embodiments may be implemented in a non-transitory computer-readable storage medium for use by or in connection with the instruction execution system, apparatus, system, or machine.
  • the computer-readable storage medium contains instructions for controlling a computer system to perform a method described by particular embodiments.
  • the computer system may include one or more computing devices.
  • the instructions, when executed by one or more computer processors, may be configured to perform that which is described in particular embodiments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Particular embodiments provide a variable, BitDepth, that may be set at a value based on a number of bits used to represent pixels in pictures of a video. The variable may be used in syntax elements in HEVC, such as the HEVC range extension, but other coding standards may be used. By using the variable, different resolutions for the video may be accommodated during the encoding and decoding process. For example, the number of pixels in the pictures may be represented by 8 bits, 10 bits, 12 bits, or another number of bits depending on the resolution. Using the BitDepth variable in the syntax provides flexibility in the motion estimation and motion compensation process. For example, syntax elements used in the weighted prediction process may take into account different numbers of bits used to represent the pictures.

Description

    CROSS REFERENCES TO RELATED APPLICATIONS
  • The present disclosure claims priority to U.S. Provisional App. No. 61/900,332, entitled “Modification of Weighted Prediction Syntax and Semantics for HEVC Range Extension”, filed Nov. 5, 2013, the contents of which is incorporated herein by reference in its entirety.
  • BACKGROUND
  • The High Efficiency Video Coding (“HEVC”) coding standard (also called H.265) is a coding standard promulgated by the ISO/IEC MPEG standardization organizations. HEVC supports resolutions higher than “high definition,” which means pixels may be represented by a larger number of bits than the high definition pictures. For example, 4 K resolutions may include images that are 4,000 pixels wide compared to high definition images that are 1920 pixels wide.
  • Temporal motion prediction is an effective method to increase the coding efficiency and provides high compression. HEVC uses a translational model for temporal motion prediction. According to the translational model, a prediction signal for a given current unit in a current picture is generated from a corresponding reference unit in a reference picture. The coordinates of the reference unit are given by a motion vector that describes the translational motion along horizontal (x) and vertical (y) directions that would be added/subtracted to/from the coordinates of the current unit. A decoder needs the motion vector to decode the compressed video.
  • HEVC may use single prediction using one reference pictures or bi-prediction using two reference pictures. The pixels in reference units of the reference pictures are used as the prediction. In some conditions, such as when fading occurs, pixels of one of the reference units in bi-prediction may not yield the most accurate prediction. To compensate for this, HEVC may use weighted prediction when performing the motion estimation process. Weighted prediction may weight the pixels in one or both of the reference units used as the prediction differently.
  • SUMMARY
  • Embodiments of the present invention provide a variable, BitDepth, that may be set at a value based on a number of bits used to represent pixels in pictures of a video. The variable may be used in syntax elements in HEVC, such as the HEVC range extension, but other coding standards may be used. By using the variable, different resolutions for the video may be accommodated during the encoding and decoding process. For example, the number of pixels in the pictures may be represented by 8 bits, 10 bits, 12 bits, or another number of bits depending on the resolution. Using the BitDepth variable in the syntax provides flexibility in the motion estimation and motion compensation process. For example, syntax elements used in the weighted prediction process may take into account different numbers of bits used to represent the pictures.
  • More particularly, a method of an embodiment of the present invention includes the following steps: (1) setting a first value for a variable associated with a bit depth based on a number of bits associated with pixels of pictures in a video; (2) determining a first weighting factor for performing weighted prediction for a current unit of a current picture; (3) using the first weighting factor to weight pixels of a first reference unit of a first reference picture when performing motion compensation for the current unit; and (4) setting a second value for a weighted prediction syntax element associated with the weighting factor, wherein the second value for the weighted prediction syntax element is based on the first value.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 depicts a simplified system for encoding and decoding video according to one embodiment.
  • FIG. 2 depicts an example of the motion estimation and compensation process according to one embodiment.
  • FIG. 3 depicts a table of syntax elements that are used for weighted prediction according to one embodiment.
  • FIG. 4 depicts a simplified flowchart of a method for using the variable BitDepth in the encoding process according to one embodiment.
  • FIG. 5 depicts a simplified flowchart for decoding an encoded bitstream using the variable BitDepth according to one embodiment.
  • DETAILED DESCRIPTION
  • Described herein are techniques for performing weighted prediction. In the following description, for purposes of explanation, numerous examples and specific details are set forth in order to provide a thorough understanding of particular embodiments. Particular embodiments as defined by the claims may include some or all of the features in these examples alone or in combination with other features described below, and may further include modifications and equivalents of the features and concepts described herein.
  • Particular embodiments provide a variable, BitDepth, that may be set at a value based on a number of bits used to represent pixels in pictures of a video. The variable may be used in syntax elements in HEVC, such as the HEVC range extension, but other coding standards may be used. By using the variable, different resolutions for the video may be accommodated during the encoding and decoding process. For example, the number of pixels in the pictures may be represented by 8 bits, 10 bits, 12 bits, or another number of bits depending on the resolution. Using the BitDepth variable in the syntax provides flexibility in the motion estimation and motion compensation process. For example, syntax elements used in the weighted prediction process may take into account different numbers of bits used to represent the pictures.
  • FIG. 1 depicts a simplified system 100 for encoding and decoding video according to one embodiment. System 100 includes an encoder 102 and a decoder 104. Encoder 102 and decoder 104 may use a video coding standard to encode and decode video, such as HEVC. Specifically, encoder 102 and decoder 104 may use syntax elements from the HEVC range extension. Also, other elements of encoder 102 and decoder 104 may be appreciated.
  • Encoder 102 and decoder 104 perform temporal prediction through motion estimation and motion compensation. Motion estimation is a process of determining a motion vector (MV) for a current unit of video. For example, the motion estimation process searches for a best match prediction for a current unit of video (e.g., a prediction unit (PU)) over reference pictures. The best match prediction is described by the motion vector and associated reference picture ID. Also, a reference unit in a B picture may have up to two motion vectors that point to a previous reference unit in a previous picture and a subsequent reference unit in a subsequent reference picture in the picture order. Motion compensation is then performed by subtracting a reference unit pointed to by the motion vector from the current unit of video to determine a residual error that can be encoded. In the case of bi-prediction, the two motion vectors point to two reference units, which can be combined to form a combined bi-directional reference unit. The combined bi-directional reference unit can be subtracted from the current unit to determine the residual error.
  • To perform motion estimation and compensation, encoder 102 and decoder 104 include motion estimation and compensation blocks 104-1 and 104-2, respectively. In motion compensation, B and P pictures may exploit the temporal redundancy by using weighted prediction. Weighted prediction applies weighting factors to one or both of the reference pictures. In P pictures, the reference unit may be weighted before being used as a prediction for the current unit. In B pictures, the reference units are weighted before combining the reference pictures into the combined bi-directional reference unit. Conventionally, an average of the reference units from which the B picture is predicted may be used. However, using weighted prediction, a weighted average (or other weighted calculation) of the reference units may be used to predict the current unit. That is, the pixels of reference units may be weighted by weighting factors before combining. The following may be discussed with respect to B pictures, but the discussion will also apply to P pictures except that the combined bi-directional reference unit is replaced by one weighted reference unit. The use of weighted prediction may be useful when certain conditions happen in the video, such as when one scene fades into another or where there is a gradual variation in luminance, such as when fading to or from black or in cross-fades. For example, when fading occurs, the picture associated with the fading may not be as accurate to use as a prediction than a picture that does not include the fading. Weighting the picture that does not include the fading higher may create a better prediction for the current unit. That is, the pixels of the reference unit that does not include the fading effect may have a higher correlation to the current unit. The weighting may reduce the residual error and also reduce the bitrate.
  • The weighting factors may be provided for the luma and/or chroma components of reference units 202. Also, the weighting factors may be different based on the reference list used. That is, the bi-prediction may select a reference picture from a list0 and a list1. Reference pictures in list0 may be weighted with a weighting factor w0 and reference pictures in list1 may be weighted with a weighting factor w1. Also, only one of the reference pictures may be weighted. The weighting factors may be information that adjusts the pixel values differently in the reference units. In one embodiment, the weighting factors may be percentages.
  • The weighted prediction process may use syntax elements that define parameters that encoder 102 and decoder 104 use to perform the weighted prediction. By setting the values of these parameters, a weighted prediction manager 106-1 in encoder 102 and a weighted prediction manager 106-2 in decoder 104 can perform the weighted prediction process. In a simple example, the weighting factors may weight pixels in different reference pictures differently. Then, weighted prediction managers 106-1 and 106-2 take the weighted average of the reference units to use as a combined bi-directional reference unit. Motion estimation and compensation blocks 104-1 and 104-2 then use this combined bi-directional reference unit in the motion compensation process for the current unit. The syntax elements will be described in more detail below after describing the weighted prediction process in more detail.
  • FIG. 2 depicts an example of the motion estimation and compensation process according to one embodiment. The video includes a number of pictures 200-1 - 200-5. A current picture is shown at 200-3 and includes a current unit of video 202-1. Current unit 202-1 may be bi-predicted using reference units from reference pictures in other pictures 200, such as a previous picture 200-1 in the picture order and a subsequent picture 200-5 in the picture order. Picture 200-1 includes a reference unit 202-2 and picture 200-5 includes a reference unit 202-3, both of which can be used to predict current unit 202-1.
  • In weighted prediction, the pixels of reference units 202-2 and 202-3 may be weighted differently. For example, the pixels of reference units may be weighted by the weighting factors. In a simple example, the weighting factors may be percentages, such as the pixels of reference unit 202-2 may be weighted with a weighting factor w0 of 0.25 and the pixels of reference unit 202-3 may be weighted with a weighting factor w1 of 0.75. These weighting factors may then be used to calculate the pixel values used as the combined bi-directional reference unit for current unit 202-1.
  • Once the reference units are determined, motion estimation and compensation block 104-1 can determine motion vectors that represent the location of reference units 202-2 and 202-3 with respect to current unit 202-1. Then, motion estimation and compensation block 104-1 calculates a difference between the combined bi-directional reference unit and the current unit 202-1 as a residual error.
  • Once determining the residual error, encoder 102 encodes the residual error, the motion vectors, and also the weighting factors used to determine the combined bi-directional reference unit. Encoder 102 includes the encoded residual error, the motion vectors, and the weighting factors in an encoded bitstream that is sent to decoder 104. The term weighting factors is used for discussion purposes to represent information that is encoded that allows the weighted prediction process to be performed and the weighting factors to be determined. The syntax elements used to determine which information for the weighting factors that are encoded in the encoded bitstream are described in more detail below.
  • Decoder 104 receives the encoded bitstream and can reconstruct the pictures of the video. Decoder 104 may reconstruct reference units 202-2 and 202-3 from the encoded bitstream prior to decoding current unit 202-1. Also, decoder 104 decodes the residual error for current unit 202-1, the motion vectors for current unit 202-1, and the weighting factors. Then, in decoder 104, motion estimation and compensation block 104-2 may then use the residual error to reconstruct the current unit 202-1. For example, motion estimation and compensation block 104-2 may use the motion vectors to locate reconstructed reference units 202-2 and 202-3. Then, weighted prediction manager 106-2 applies the weighting factors to the reconstructed units 202-2 and 202-3 to form the reconstructed combined bi-directional reference unit. The reconstructed residual error is then added to the reconstructed combined bi-directional predicted unit to form a reconstructed current unit.
  • As mentioned above, syntax elements for weighted prediction may use a variable BitDepth in the weighted prediction process. The variable BitDepth may represent a number of bits used to represent a pixel in a picture. The following will describe syntax elements that use the bit depth variable.
  • FIG. 3 depicts a table 300 of syntax elements that are used for weighted prediction according to one embodiment. There are syntax elements for the luma and chroma components of the video, and also for list0 and list1. The combined bi-directional reference unit may be determined by applying the weighting factors in an averaging operation. In an example to illustrate the calculation, the weighting factor w0 is multiplied by the previous reference unit 202-2 (e.g., the luma and chroma components of the pixels) and the weighting factor w1 is multiplied by the subsequent reference unit 202-3 (e.g., the luma and chroma components of the pixels). Then, the two values are added together and divided by the added weighting factors (e.g., normalization). The above example may not be exactly how encoder 102 and decoder 104 perform the calculation to determine the combined bi-directional reference unit. Various methods may be used, but performing a division operation may be an expensive computational operation. One method of deriving the weighting factors is to use bit shifting. The weighting factors can be derived with a common denominator and the division is represented by a right shift of the combined weighted prediction of a number of bits based on a base 2 logarithm of the denominator. That is, the weighting factor, such as the luma weighting factor, may be bit shifted by the base 2 logarithm of the denominator.
  • The following syntax elements encode parameter values that can be used to perform the weighted prediction process. The syntax elements in Table 300 of luma_log2_weight_denom at 302, delta_chroma_log2_weight_denom at 304, delta_luma_weight_l0[ i ] at 306, delta_chroma_weight_l0[ i ][ j ] at 308, delta­_luma_weight_l1[ i ] at 310, and delta_chroma_weight_l1[ i ][ j ] at 312 may use a BitDepth variable for the number of bits that represent a pixel. This allows these syntax elements to be used for video that may represent pixels with a different number of bits, such as 8, 10, 12, 14, 16, etc., bits. The syntax elements can thus handle high precision data, which may represent pixels with more than 8 bits.
  • The syntax element luma_log2_weight_denom is the base 2 logarithm of the denominator for all luma weighting factors. The value of luma_log2_weight_denom shall be in the range of 0 to BitDepthY, inclusive, where BitDepthY is the bit depth of the luma component of the reference picture. By including the range to be dependent on the variable BitDepthY, the value can represent different numbers of bits for the pixels. The syntax element luma_log2_weight_denom is used to calculate the luma weighting factor (e.g., the delta luma weighting factor of the syntax element delta_luma_weight_l0[ i ] and delta_luma_weight_l1[ i ] described below).
  • The syntax element delta_chroma_log2_weight_denom is the difference of the base 2 logarithm of the denominator for all chroma weighting factors. The variable ChromaLog2WeightDenom, which is the log 2 denominator of the chroma weighting factor, is derived to be equal to luma_log2_weight_denom + delta_chroma_log2_weight_denom, and the value shall be in the range of 0 to BitDepthc, inclusive, where the variable BitDepthc is the bit depth of the chroma component of the reference picture. By including the range to be dependent on the variable BitDepthc, the chroma weighting factor value can represent different numbers of bits for the pixels.
  • The syntax element luma_weight_l0_flag[ i ], when equal to 1, specifies that weighting factors for the luma component of list 0 prediction using RefPicList0[ i ] are present. The syntax element luma_weight_l0_flag[ i ], when equal to 0, specifies that these weighting factors are not present.
  • The syntax element chroma_weight_l0_flag[ i ], when equal to 1, specifies that weighting factors for the chroma prediction values of list 0 prediction using RefPicList0[ i ] are present. The syntax element chroma_weight_l0_flag[ i ], when equal to 0, specifies that the chroma weighting factors are not present. When chroma_weight_l0_flag[ i ] is not present, it is inferred to be equal to 0.
  • The syntax element delta_luma_weight_l0[ i ] is the difference of the weighting factor applied to the luma prediction value for list 0 prediction using RefPicList0[ i ]. The variable LumaWeightL0[ i ] may be the luma weighting factor for list0 and is derived to be equal to (1 << luma_log2_weight_denom) + delta_luma_weight_l0[ i ]. That is, the luma weighting factor is a right shift of the variable luma_log2_weight_denom plus the value of delta_luma_weight_l0[ i ]. When luma_weight_l0_flag[ i ] is equal to 1, the value of delta_luma_weight_l0[ i ] shall be in the range of –(1 << (BitDepthY – 1)), (1 << (BitDepthY – 1)) –1, inclusive, where BitDepthY is the bit depth for the luma component of the reference picture. This sets the range of the weighting factor to be based on the variable BitDepthY. When luma_weight_l0_flag[ i ] is equal to 0, LumaWeightL0[ i ] is inferred to be equal to 2luma_log2_weight_denom.
  • The syntax element luma_offset_l0[ i ] is the additive offset applied to the luma prediction value for list 0 prediction using RefPicList0[ i ]. The value of luma_offset_l0[ i ] shall be in the range of -128 to 127, inclusive, or –(1 << (BitDepthY – 1) to (1 << (BitDepthY – 1) – 1, inclusive. When luma_weight_l0_flag[ i ] is equal to 0, luma_offset_l0[ i ] is inferred as equal to 0.
  • The syntax element delta_chroma_weight_l0[ i ][ j ] is the difference of the weighting factor applied to the chroma prediction values for list 0 prediction using RefPicList0[ i ] with j equal to 0 for Cb and j equal to 1 for Cr. The variable ChromaWeightL0[ i ][ j ] may be the chroma weighting factor for list0 and is derived to be equal to (1 << ChromaLog2WeightDenom) + delta_chroma_weight_l0[ i ][ j ]. That is, the chroma weighting factor is a right shift of the variable ChromaLog2WeightDenom plus the value of delta_chroma_weight_l0[ i ][ j ]. When chroma_weight_l0_flag[ i ] is equal to 1, the value of delta_chroma_weight_l0[ i ][ j ] shall be in the range of –(1 << (BitDepthC –1)), (1 << (BitDepthC – 1)) –1, inclusive, where BitDepthC is the bit depth for the chroma component of the reference picture. This sets the range of the weighting factor to be based on the variable BitDepthC. When chroma_weight_l0_flag[ i ] is equal to 0, ChromaWeightL0[ i ][ j ] is inferred to be equal to 2ChromaLog2WeightDenom.
  • The syntax element delta_chroma_offset_l0[ i ][ j ] is the difference of the additive offset applied to the chroma prediction values for list 0 prediction using RefPicList0[ i ] with j equal to 0 for Cb and j equal to 1 for Cr.
  • The variable ChromaOffsetL0[ i ][ j ] is derived as follows:
  •        ChromaOffsetL0[ i ][ j ]                                                    =
           Clip3( -128, 127, ( delta_chroma_offset_l0[ i ][ j ] - (( 128 * ChromaWeight
           L0[ i ][ j ] ) >> ChromaLog2WeightDenom) + 128 ) )
  • The value of delta_chroma_offset_10[ i ][ j ] shall be in the range of -512 to 511, inclusive. Also, variable ChromaOffsetL0[ i ][ j ] is derived as follows:
  •        ChromaOffsetL0[i][j| = Clip3( -(1 << (BitDepthC - 1), (1 << (BitDepthC
           - 1) -1, (delta_chroma_offset_10[i]|j] - (((1 << (BitDepthC - 1))
           * ChromaWeightL0[i][j] ) >> ChromaLog2WeightDenom) + (1                   <<
           (BitDepthC - 1))).
  • The value of delta_chroma_offset_10[ i ][ j ] shall be in the range of -(1 << (BitDepthC + 1) to (1 << (BitDepthC + 1) –1, inclusive. When chroma_weight_l0_flag[ i ] is equal to 0, ChromaOffsetL0[ i ][ j ] is inferred to be equal to 0.
  • The syntax elements luma_weight_l1_flag[ i ], chroma_weight_l1_flag[ i ], delta_luma_weight_l1[ i ], luma_offset_l1[ i ], delta_chroma_weight_l1[ i ][ j ], and delta_chroma_offset_l1[ i ][ j ] have the same semantics as luma_weight_10_flag[ i ], chroma_weight_l0_flag[ i ], delta_luma_weight_l0[ i ], luma_offset_l0[ i ], delta_chroma_weight_l0[ i ][ j ], and delta_chroma_offset_l0[ i ][ j ], respectively, with 10, L0, list 0, and List0 replaced by l1, L1, list 1, and List1, respectively. That is, these are the same syntax elements for list1 instead of list0.
  • The variable sumWeightL0Flags is derived to be equal to the sum of luma_weight_l0_flag[ i ] + 2 * chroma_weight_l0_flag[ i ], for i = 0..num_ref_ ldx_l0_active_ minus1. When slice_type is equal to B, the variable sumWeightL1_Flags is derived to be equal to the sum of luma_weight_l1_flag[ i ] + 2 * chroma_weight_l1_flag[ i ], for i = 0..num_ ref_ idx _l1_active _minus 1. It is a requirement of bitstream conformance that, when slice_type is equal to P, sunWeightL0Flags shall be less than or equal to 24, and when slice_type is equal to B, the sum of sunWeightL0Flags and sumWeightL1Flags shall be less than or equal to 24.
  • The above syntax elements luma_offset_l0, luma_offset_l1, ChromaOffsetL0, ChromaOffsetL1 are modified when the pixels of reference pictures may include higher precision, such as when the syntax element extended_precision_processing_flag is equal to 1.
  • Accordingly, by using the variable BitDepth for the luma and chroma components, encoder 102 may change the number of bits that is used for weighted prediction based on the number of bits that represent pixels in the pictures of the video. In one embodiment, encoder 102 knows the BitDepth for the pictures of a video and can set the variable BitDepth to the value of the number of bits that represent the pixels of pictures.
  • FIG. 4 depicts a simplified flowchart 400 of a method for using the variable BitDepth in the encoding process according to one embodiment. At 402, encoder 102 determines a value for the variable BitDepth for the luma and chroma components. The value may be based on the precision of the video, such as a number of bits that represent the pixels in pictures of a video. Encoder 102 may determine the value based on analyzing characteristics of the video. Also, the number of bits that represent the pixels may be input or included in metadata associated with the video.
  • At 404, motion estimation and compensation block 104-1 determines when weighted prediction is enabled. For example, weighted prediction flags, luma_weight_l0_flag[ i] and chroma_weight_l0_flag[ i ], may be set to indicate that weighted prediction is enabled for the luma and chroma components.
  • At 406, weighted prediction manager 106-1 determines motion vectors for reference units 202-2 and 202-3 and weighting factors for the luma and chroma components for reference units 202-2 and 202-3 (e.g., list0 and list1). The weighting factors may be assigned to the pictures (e.g., list0 and list1) that are being used as reference pictures for a current picture.
  • At 408, weighted prediction manager 106-1 then calculates the combined bi-directional reference unit to use in motion compensation based on the weighting factors and pixels of reference units 202-2 and 202-3. As described above, the weighting factors weight pixels differently in reference units 202-2 and 202-3. At 410, motion estimation and compensation block 104-1 determines the residual error using current unit 202-1 and the combined bi-directional reference unit.
  • At 412, encoder 102 calculates the values for the applicable syntax elements used in the weighted prediction process. For example, the values for the syntax elements luma_log2_weight_denom, delta_chroma_log2_weight_denom, delta_luma_weight_l0[ i ], delta_chroma_weight_l0[ i ][ j ], delta_luma_weight_l1[ i ], and delta_chroma_weight_l1[ i ][ j ] using the variable BitDepth are calculated.
  • At 414, encoder 102 encodes the parameters for the syntax elements for the weighted prediction process in an encoded bitstream along with the motion vectors and residual error for current unit 202-1. For example, encoder 102 encodes the values calculated for the syntax elements shown in Table 300 in FIG. 3 . At 416, encoder 102 sends the encoded bitstream to decoder 104.
  • FIG. 5 depicts a simplified flowchart 500 for decoding an encoded bitstream using the variable BitDepth according to one embodiment. At 502, decoder 104 receives the encoded bitstream. At 504, decoder 104 then determines a current unit 202-1 in a current picture to decode. At 506, decoder 104 decodes the encoded motion vectors from the encoded bitstream for current unit 202-1 and uses these motion vectors to locate the reconstructed reference units 202-2 and 202-3. Also, at 508, decoder 104 determines the residual error for current unit 202-1 from the encoded bitstream.
  • At 510, weighted prediction manager 106-2 may determine the values for the syntax elements in Table 300 for weighted prediction. This may allow weighted prediction manager 106-2 to determine the weighting factors. At 512, weighted prediction manager 106-2 may then apply the weighting factors to reference units 202-2 and 202-3 to determine the combined bi-directional reference unit. At 514, motion estimation and compensation block 104-2 can then add the residual error for current unit 202-1 to the combined weighted prediction unit to form a reconstructed current unit.
  • Accordingly, particular embodiments use a variable BitDepth for syntax elements in the weighted prediction process. For example, the syntax elements luma_log2_weight_denom, delta_chroma_log2_weight_denom, delta_luma_weight_l0[ i ], delta_chroma_weight_l0[ i ][ j ], delta_luma_weight_l1[ i ], and delta_chroma_weight_l1[ i ][ j ] use the variable BitDepth.
  • Particular embodiments may be implemented in a non-transitory computer-readable storage medium for use by or in connection with the instruction execution system, apparatus, system, or machine. The computer-readable storage medium contains instructions for controlling a computer system to perform a method described by particular embodiments. The computer system may include one or more computing devices. The instructions, when executed by one or more computer processors, may be configured to perform that which is described in particular embodiments.
  • As used in the description herein and throughout the claims that follow, “a”, “an”, and “the” includes plural references unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
  • The above description illustrates various embodiments along with examples of how aspects of particular embodiments may be implemented. The above examples and embodiments should not be deemed to be the only embodiments, and are presented to illustrate the flexibility and advantages of particular embodiments as defined by the following claims. Based on the above disclosure and the following claims, other arrangements, embodiments, implementations and equivalents may be employed without departing from the scope hereof as defined by the claims.

Claims (4)

1-20. (canceled)
21. A method for decoding a bitstream of video data, the method comprising:
identifying one or more weight flags signaled in the bitstream that indicates presence of weighting factors for at least one of a luma component and/or a chroma component;
determining a first weighting factor for performing weighted prediction for a current unit of a current picture the first weighting factor for weighting pixels of a first reference unit of a first reference picture when performing motion compensation for the current unit;
determining a second weighting factor for weighting pixels of a second reference unit of a second reference picture when performing motion compensation for the current unit,
wherein when weighting factors for a luma component is present:
determining from a signaled delta_luma_weight_10 syntax element a difference of the first weighting factor and the second weighting factor applied to a luma prediction value for list 0 prediction using a variable RefPicList0[i] for a first luma component, and
deriving a variable LumaWeightLO associated with the luma component weighting factors, wherein when the one or more weight flags indicates presence of the weighting factor for a luma component, LumaWeightLO is derived to be equal to (1<<luma_log2_weight_denom)+delta_luma_weight_10 in a range of –(1<<(BitDepthy–1)), (1<<(BitDepthy–1)–1), inclusive,
wherein luma_log2_weight_denom is a base 2 logarithm of a denominator for all luma weighting factors, and BitDepthy is a bit depth for the luma component of the respective reference picture; and
wherein when weighting factors for a chroma component is present:
determining from a delta_chroma_weight_10[i][j] syntax element a difference of the first weighting factor and the second weighting factor applied to a chroma prediction value for list 0 prediction using a variable RefPicList0[i] with j equal to 0 for Cb or j equal to 1 for Cr for a second component; and
deriving a variable ChromaWeightLO associated with the chroma component weighting factor, wherein when the one or more weight flags indicates presence of the weighting factor for a chroma component, ChromaWeightLO is derived to be equal to ((1<<(luma_log2_weight_denom+delta_chroma_log2_weight_denom))+delta_ch roma_weight_10, delta_chroma_weight_10 in a range of –(1<<(BitDepthc-1)), ((1<<BitDepthc-1)–1), inclusive,
wherein delta_chroma_log2_weight_denom is a difference of a base 2 logarithm of a denominator for all chroma weighting factors, and BitDepthc is a bit depth for the chroma component of the respective reference picture;
wherein the delta_chroma_weight_10[i][j] syntax element is within the range set by the ChromaWeightLO, and
wherein the second component comprises a chroma component of the first reference unit or the second reference unit;
wherein a variable ChromaArrayType is used at least in part to derive a pred_weight_table based upon if (ChromaArrayType != 0);
decoding said bitstream based upon said pred_weight_table.
22. One or more computer-readable memory or storage devices storing encoded data as part of a bitstream of video data, the encoded data being organized to facilitate decoding by a video decoder performing operations, the video decoder being implemented using memory and one or more processing units, the operations comprising:
identifying one or more weight flags signaled in the bitstream that indicates presence of weighting factors for at least one of a luma component and/or a chroma component;
determining a first weighting factor for performing weighted prediction for a current unit of a current picture the first weighting factor for weighting pixels of a first reference unit of a first reference picture when performing motion compensation for the current unit;
determining a second weighting factor for weighting pixels of a second reference unit of a second reference picture when performing motion compensation for the current unit,
wherein when weighting factors for a luma component is present:
determining from a signaled delta_luma_weight_10 syntax element a difference of the first weighting factor and the second weighting factor applied to a luma prediction value for list 0 prediction using a variable RefPicList0[i] for a first luma component, and
deriving a variable LumaWeightLO associated with the luma component weighting factors, wherein when the one or more weight flags indicates presence of the weighting factor for a luma component, LumaWeightLO is derived to be equal to (1<<luma_log2_weight_denom)+delta_luma_weight_10 in a range of –(1<<(BitDepthy-1)), (1<<(BitDepthy–1)–1), inclusive,
wherein luma_log2_weight_denom is a base 2 logarithm of a denominator for all luma weighting factors, and BitDepthy is a bit depth for the luma component of the respective reference picture; and
wherein when weighting factors for a chroma component is present:
determining from a delta_chroma_weight_10[i][j] syntax element a difference of the first weighting factor and the second weighting factor applied to a chroma prediction value for list 0 prediction using a variable RefPicList0[i] with j equal to 0 for Cb or j equal to 1 for Cr for a second component; and
deriving a variable ChromaWeightLO associated with the chroma component weighting factor, wherein when the one or more weight flags indicates presence of the weighting factor for a chroma component, ChromaWeightLO is derived to be equal to ((1<<(luma_log2_weight_denom+delta_chroma_log2_weight_denom))+delta_ch roma_weight_10, delta_chroma_weight_10 in a range of –(1<<(BitDepthc–1)), ((1<<BitDepthc-1)–1), inclusive,
wherein delta_chroma_log2_weight_denom is a difference of a base 2 logarithm of a denominator for all chroma weighting factors, and BitDepthc is a bit depth for the chroma component of the respective reference picture;
wherein the delta_chroma_weight_10[i][j] syntax element is within the range set by the ChromaWeightLO, and
wherein the second component comprises a chroma component of the first reference unit or the second reference unit;
wherein a variable ChromaArrayType is used at least in part to derive a pred_weight_table based upon if (ChromaArrayType != 0);
wherein said bitstream is suitable to be decoded based upon said pred_weight_table.
23. A computing device comprising:
a video encoder for video data configured to provide a bitstream and perform operations comprising:
providing one or more weight flags signaled in the bitstream that indicates presence of weighting factors for at least one of a luma component and/or a chroma component;
wherein a first weighting factor for performing weighted prediction for a current unit of a current picture the first weighting factor for weighting pixels of a first reference unit of a first reference picture when performing motion compensation for the current unit;
wherein a second weighting factor for weighting pixels of a second reference unit of a second reference picture when performing motion compensation for the current unit, wherein when weighting factors for a luma component is present:
providing from a signaled delta_luma_weight_10 syntax element a difference of the first weighting factor and the second weighting factor applied to a luma prediction value for list 0 prediction using a variable RefPicList0[i] for a first luma component, and
wherein a variable LumaWeightLO associated with the luma component weighting factors, wherein when the one or more weight flags indicates presence of the weighting factor for a luma component, LumaWeightLO is derived to be equal to (1<<luma_log2_weight_denom)+delta_luma_weight_10 in a range of –(1<<(BitDepthy-1)), (1<<(BitDepthy–1)–1), inclusive,
wherein luma_log2_weight_denom is a base 2 logarithm of a denominator for all luma weighting factors, and BitDepthy is a bit depth for the luma component of the respective reference picture; and
wherein when weighting factors for a chroma component is present:
providing from a delta_chroma_weight_10[i][j] syntax element a difference of the first weighting factor and the second weighting factor applied to a chroma prediction value for list 0 prediction using a variable RefPicList0[i] with j equal to 0 for Cb or j equal to 1 for Cr for a second component; and
wherein a variable ChromaWeightLO associated with the chroma component weighting factor, wherein when the one or more weight flags indicates presence of the weighting factor for a chroma component, ChromaWeightLO is derived to be equal to ((1<<(luma_log2_weight_denom+delta_chroma_log2_weight_denom))+delta_ch roma_weight_10, delta_chroma_weight_10 in a range of –(1<<(BitDepthc–1)), ((1<<BitDepthc-1)–1), inclusive,
wherein delta_chroma_log2_weight_denom is a difference of a base 2 logarithm of a denominator for all chroma weighting factors, and BitDepthc is a bit depth for the chroma component of the respective reference picture;
wherein the delta_chroma_weight_10[i][j] syntax element is within the range set by the ChromaWeightLO, and
wherein the second component comprises a chroma component of the first reference unit or the second reference unit;
wherein a variable ChromaArrayType is used at least in part to derive a pred_weight_table based upon if (ChromaArrayType != 0);
encoding said bitstream based upon said pred_weight_table.
US18/124,503 2013-11-05 2023-03-21 Bit depth variable for high precision data in weighted prediction syntax and semantics Abandoned US20230239498A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US18/124,503 US20230239498A1 (en) 2013-11-05 2023-03-21 Bit depth variable for high precision data in weighted prediction syntax and semantics
US18/379,994 US20240155151A1 (en) 2013-11-05 2023-10-13 Bit depth variable for high precision data in weighted prediction syntax and semantics

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US201361900332P 2013-11-05 2013-11-05
US14/533,369 US10432963B2 (en) 2013-11-05 2014-11-05 Bit depth variable for high precision data in weighted prediction syntax and semantics
US16/548,836 US11070840B2 (en) 2013-11-05 2019-08-22 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,100 US11470349B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/900,610 US11641484B2 (en) 2013-11-05 2022-08-31 Bit depth variable for high precision data in weighted prediction syntax and semantics
US18/124,503 US20230239498A1 (en) 2013-11-05 2023-03-21 Bit depth variable for high precision data in weighted prediction syntax and semantics

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US17/900,610 Continuation US11641484B2 (en) 2013-11-05 2022-08-31 Bit depth variable for high precision data in weighted prediction syntax and semantics

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/379,994 Continuation US20240155151A1 (en) 2013-11-05 2023-10-13 Bit depth variable for high precision data in weighted prediction syntax and semantics

Publications (1)

Publication Number Publication Date
US20230239498A1 true US20230239498A1 (en) 2023-07-27

Family

ID=53007029

Family Applications (10)

Application Number Title Priority Date Filing Date
US14/533,369 Active 2035-02-02 US10432963B2 (en) 2013-11-05 2014-11-05 Bit depth variable for high precision data in weighted prediction syntax and semantics
US16/548,836 Active US11070840B2 (en) 2013-11-05 2019-08-22 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,129 Active 2034-11-08 US11490117B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,117 Active 2034-11-08 US11490116B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,137 Active US11470350B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,100 Active US11470349B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,147 Active US11470351B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/900,610 Active US11641484B2 (en) 2013-11-05 2022-08-31 Bit depth variable for high precision data in weighted prediction syntax and semantics
US18/124,503 Abandoned US20230239498A1 (en) 2013-11-05 2023-03-21 Bit depth variable for high precision data in weighted prediction syntax and semantics
US18/379,994 Pending US20240155151A1 (en) 2013-11-05 2023-10-13 Bit depth variable for high precision data in weighted prediction syntax and semantics

Family Applications Before (8)

Application Number Title Priority Date Filing Date
US14/533,369 Active 2035-02-02 US10432963B2 (en) 2013-11-05 2014-11-05 Bit depth variable for high precision data in weighted prediction syntax and semantics
US16/548,836 Active US11070840B2 (en) 2013-11-05 2019-08-22 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,129 Active 2034-11-08 US11490117B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,117 Active 2034-11-08 US11490116B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,137 Active US11470350B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,100 Active US11470349B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/352,147 Active US11470351B2 (en) 2013-11-05 2021-06-18 Bit depth variable for high precision data in weighted prediction syntax and semantics
US17/900,610 Active US11641484B2 (en) 2013-11-05 2022-08-31 Bit depth variable for high precision data in weighted prediction syntax and semantics

Family Applications After (1)

Application Number Title Priority Date Filing Date
US18/379,994 Pending US20240155151A1 (en) 2013-11-05 2023-10-13 Bit depth variable for high precision data in weighted prediction syntax and semantics

Country Status (2)

Country Link
US (10) US10432963B2 (en)
WO (1) WO2015069734A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015069729A1 (en) 2013-11-05 2015-05-14 Arris Enterprises, Inc. Simplified processing of weighted prediction syntax and semantics using a bit depth variable for high precision data
EP3236657A1 (en) * 2016-04-21 2017-10-25 Ultra-D Coöperatief U.A. Dual mode depth estimator
WO2019244052A1 (en) 2018-06-19 2019-12-26 Beijing Bytedance Network Technology Co., Ltd. Different precisions for different reference list
CN110933427B (en) 2018-09-19 2023-05-12 北京字节跳动网络技术有限公司 Affine mode encoded mode dependent adaptive motion vector resolution
WO2020156516A1 (en) 2019-01-31 2020-08-06 Beijing Bytedance Network Technology Co., Ltd. Context for coding affine mode adaptive motion vector resolution
CN113424534A (en) * 2019-02-01 2021-09-21 北京字节跳动网络技术有限公司 Multiple syntax elements for adaptive motion vector resolution
CN113812151A (en) * 2019-03-11 2021-12-17 弗劳恩霍夫应用研究促进协会 Encoder and decoder, encoding method and decoding method with profile and level dependent encoding options
CN114663496B (en) * 2022-03-23 2022-10-18 北京科技大学 Monocular vision odometer method based on Kalman pose estimation network

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150098503A1 (en) * 2013-10-03 2015-04-09 Qualcomm Incorporated High precision explicit weighted prediction for video coding

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9883203B2 (en) 2011-11-18 2018-01-30 Qualcomm Incorporated Adaptive overlapped block motion compensation
US9503720B2 (en) * 2012-03-16 2016-11-22 Qualcomm Incorporated Motion vector coding and bi-prediction in HEVC and its extensions
SG11201400296RA (en) * 2012-06-27 2014-09-26 Toshiba Kk Encoding device, decoding device, encoding method, and decoding method
US9473779B2 (en) 2013-03-05 2016-10-18 Qualcomm Incorporated Parallel processing for video coding
WO2015069729A1 (en) 2013-11-05 2015-05-14 Arris Enterprises, Inc. Simplified processing of weighted prediction syntax and semantics using a bit depth variable for high precision data
US9918105B2 (en) 2014-10-07 2018-03-13 Qualcomm Incorporated Intra BC and inter unification
US10368107B2 (en) 2016-08-15 2019-07-30 Qualcomm Incorporated Intra video coding using a decoupled tree structure

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150098503A1 (en) * 2013-10-03 2015-04-09 Qualcomm Incorporated High precision explicit weighted prediction for video coding

Also Published As

Publication number Publication date
US11490116B2 (en) 2022-11-01
US20210314608A1 (en) 2021-10-07
US11641484B2 (en) 2023-05-02
US11070840B2 (en) 2021-07-20
US20210314605A1 (en) 2021-10-07
US20220417555A1 (en) 2022-12-29
US20210314609A1 (en) 2021-10-07
US20200059665A1 (en) 2020-02-20
US20150124882A1 (en) 2015-05-07
US11470351B2 (en) 2022-10-11
US11470349B2 (en) 2022-10-11
US20210314607A1 (en) 2021-10-07
US10432963B2 (en) 2019-10-01
US11470350B2 (en) 2022-10-11
US20210314606A1 (en) 2021-10-07
US20240155151A1 (en) 2024-05-09
WO2015069734A1 (en) 2015-05-14
US11490117B2 (en) 2022-11-01

Similar Documents

Publication Publication Date Title
US20230239498A1 (en) Bit depth variable for high precision data in weighted prediction syntax and semantics
US10631000B2 (en) Method and an apparatus for processing a video signal
US10271048B2 (en) Method of background residual prediction for video coding
KR20210027351A (en) Block size limit for DMVR
US20120230405A1 (en) Video coding methods and video encoders and decoders with localized weighted prediction
US10785475B2 (en) Method and apparatus of video coding with prediction offset
US20240146904A1 (en) Simplified processing of weighted prediction syntax and semantics using a bit depth variable for high precision data
US10432960B2 (en) Offset temporal motion vector predictor (TMVP)
EP3066833A1 (en) Simplified processing of weighted prediction syntax and semantics using a bit depth variable for high precision data

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE