WO2012033327A2 - Procédé de décodage d'image et appareil comprenant un procédé destiné à configurer une liste d'images de référence - Google Patents

Procédé de décodage d'image et appareil comprenant un procédé destiné à configurer une liste d'images de référence Download PDF

Info

Publication number
WO2012033327A2
WO2012033327A2 PCT/KR2011/006587 KR2011006587W WO2012033327A2 WO 2012033327 A2 WO2012033327 A2 WO 2012033327A2 KR 2011006587 W KR2011006587 W KR 2011006587W WO 2012033327 A2 WO2012033327 A2 WO 2012033327A2
Authority
WO
WIPO (PCT)
Prior art keywords
reference picture
list
slice
information
index
Prior art date
Application number
PCT/KR2011/006587
Other languages
English (en)
Korean (ko)
Other versions
WO2012033327A3 (fr
Inventor
임재현
전용준
박승욱
김정선
전병문
박준영
최영희
성재원
Original Assignee
엘지전자 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 엘지전자 주식회사 filed Critical 엘지전자 주식회사
Publication of WO2012033327A2 publication Critical patent/WO2012033327A2/fr
Publication of WO2012033327A3 publication Critical patent/WO2012033327A3/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the present invention relates to image processing, and more particularly, to a decoding method and apparatus including a method of constructing a reference picture list.
  • the demand for high resolution and high quality images such as high definition (HD) images and ultra high definition (UHD) images is increasing in various fields.
  • the higher the resolution and the higher quality of the image data the more information or bit rate is transmitted than the existing image data. Therefore, the image data can be transmitted by using a medium such as a conventional wired / wireless broadband line or by using a conventional storage medium.
  • the transmission cost and the storage cost are increased. High efficiency image compression techniques can be used to solve these problems.
  • the image compression technique includes an inter prediction technique that predicts pixel values included in a current picture from before and / or after a current picture, and predicts pixel values included in a current picture by using pixel information in the current picture.
  • inter prediction technique that predicts pixel values included in a current picture from before and / or after a current picture, and predicts pixel values included in a current picture by using pixel information in the current picture.
  • intra prediction technique an entropy encoding technique of allocating a short code to a high frequency of appearance and a long code to a low frequency of appearance.
  • a pixel value included in a current picture is predicted from a picture before and / or after a current picture, that is, a reference picture, information about a reference picture list may be generated and transmitted.
  • An object of the present invention is to provide an image decoding method and apparatus including a method of constructing a reference picture list.
  • Another object of the present invention is to provide an image decoding method and apparatus including a method of constructing a reference picture list capable of reducing the amount of information of the maximum number of reference pictures transmitted to a decoder.
  • Another technical problem of the present invention is to provide an image decoding method and apparatus including a combined list construction method capable of reducing the amount of information transmitted to a decoder.
  • Another object of the present invention is to provide an image decoding method and apparatus including a method of constructing a reference picture list capable of reducing the amount of information of inter prediction mode information transmitted to a decoder without generating a combine list.
  • Another technical problem of the present invention is to provide an image decoding method and apparatus including a method of constructing a reference picture list capable of reducing the amount of information transmitted to a decoder for a generalized P and B (GPB) slice.
  • a method of constructing a reference picture list capable of reducing the amount of information transmitted to a decoder for a generalized P and B (GPB) slice.
  • Another object of the present invention is to provide an image decoding method and apparatus capable of reducing the complexity of deriving a temporal motion information candidate for a low delay B slice.
  • Another object of the present invention is to provide an image decoding method and apparatus including a method of constructing a reference picture list capable of reducing the amount of information of reference picture index information transmitted to a decoder.
  • One embodiment of the present invention is a video decoding method.
  • the method includes receiving reference picture list information including first flag information, deriving maximum reference picture number information in the reference picture list based on the first flag information, and the maximum reference. Generating a reference picture list by using picture number information.
  • the first flag information indicates whether the number of first maximum reference pictures allowed for the current picture and the number of second maximum reference pictures allowed for the current slice are the same, and the first flag information indicates a sequence parameter. Sequence Parameter Set (SPS) information or Picture Parameter Set (PPS) information.
  • SPS Sequence Parameter Set
  • PPS Picture Parameter Set
  • the first flag information may indicate whether the maximum number of reference pictures allowed for all pictures and slices in the current sequence is the same.
  • the first flag information may be information obtained from sequence parameter set information.
  • the method may further include generating a reference picture list 0 and a reference picture list 1 for the current picture, from a temporal level of the reference picture included in the reference picture list 0 and the reference picture list 1 and the current picture. Determining a temporal distance of the memory; and inserting the reference picture based on the temporal level and the temporal distance to generate a combined list.
  • a reference picture having a small temporal distance is inserted first, a reference picture having a low temporal level is preferentially inserted among the reference pictures having the same temporal distance, and the temporal distance and the Reference pictures included in reference picture list 0 may be preferentially inserted among reference pictures having the same temporal level.
  • the method may further include receiving second flag information indicating whether a reference picture having a low temporal level is inserted first or always whether a reference picture included in reference picture list 0 is inserted first. .
  • the method may include generating a reference picture list 0 and a reference picture list 1 for a current picture, determining a quantization parameter of a reference picture included in the reference picture list 0 and the reference picture list 1, and a temporal distance from the current picture. And inserting the reference picture based on the quantization parameter (QP) and the temporal distance to generate a bind list.
  • QP quantization parameter
  • a reference picture having a small temporal distance is inserted first, and a reference picture having a low quantization parameter is preferentially inserted among the reference pictures having the same temporal distance, and the temporal distance and the quantization parameter are first inserted.
  • Reference pictures included in reference picture list 0 among the same reference pictures may be inserted first.
  • the method may include generating a reference picture list 0 and a reference picture list 1 for a current picture, scanning the reference pictures included in the reference picture list 0 and the reference picture list 1, and then scanning the current pictures to a current prediction target block in the current picture. Selecting a reference picture for the image and deriving motion information on the current prediction target block using the selected reference picture.
  • the reference picture of reference picture list 0 and the reference picture list 1 are referenced, starting from the reference picture of reference picture list 0, and then in descending order of the reference picture index.
  • the method includes receiving image information including information indicating whether a current slice is a Generalized P and B (GPB) slice from an encoder, and based on information indicating whether the current slice is a GPB slice, a reference picture list. Generating 0 and reference picture list1. In the generating of the reference picture list 0 and the reference picture list 1, if the current slice is a GPB slice, the reference picture list 1 is generated using the reference picture list 0 related information included in the image information.
  • the GPB slice is a slice in which the reference picture list 0 and the reference picture list 1 are the same.
  • the information indicating whether the current slice is a GPB slice is information indicated by a third flag indicating whether the current slice is a GPB slice, and the third flag is generated by an encoder and transmitted to the decoder. If the flag and the third flag indicates that the current slice is a GPB slice, the image information may not include reference picture list1 related information.
  • the third flag may be a flag generated by an encoder and transmitted to a decoder only when the current slice is a B picture.
  • the information indicating whether the current slice is a GPB slice is information indicated by a fourth flag indicating whether the reference picture list 0 and the reference picture list 1 are the same for the current slice, and the fourth flag. Is a flag generated by the encoder and transmitted to the decoder, and when the fourth flag indicates that the reference picture list 0 and the reference picture list 1 are the same for the current slice, the image information is related to the reference picture list1. It may not include.
  • the GPB slice is a slice type defined separately from an I slice, a P slice, and a B slice, and the B slice is a slice in which reference picture list 0 and reference picture list 1 are not the same, and whether the current slice is a GPB slice.
  • the indicating information is information indicated by the slice type of the GPB slice, and when the slice type of the current slice is a GPB slice, the image information may not include reference picture list1 related information.
  • Another embodiment of the present invention is a video decoding method.
  • the method includes receiving image information including information indicating whether a current slice is a low delay B slice, and based on information indicating whether the current slice is a low delay B slice, a current prediction target. Deriving a temporal motion information candidate by checking a reference picture list of a co-located block with respect to the block, and inter-screening using the temporal motion information candidate and the image information ) Performing prediction, wherein the low delay B slice is a slice having only a forward reference picture, the co-located block is a block located within a reference picture of the current prediction target block, and the current slice is a low delay B slice.
  • the same reference picture list and the list of reference pictures of the reference picture list of the co-located block in the current prediction block is first checked.
  • the method may further include receiving image information including fifth flag information indicating whether index L0 and index L1 indicate the same reference picture from the encoder with respect to the current prediction target block, and based on the fifth flag information. Generating an index L0 and the index L1.
  • the index L0 is a reference picture index of a reference picture referenced by the current prediction target block in reference picture list
  • the index L1 is a reference picture index of a reference picture referenced by the current prediction target block in reference picture list 1. to be.
  • the image information does not include the index L1 related information and is included in the image information.
  • the index L1 may be generated using the index L0 related information.
  • the index information included in the image information is included.
  • the index L1 is generated by adding 1 to the index value of the index L1 related information. If the index value of the index L1 is smaller than the index Co, the index value of the index L1 related information included in the image information is copied.
  • the index L1 may be generated, and the index Co may be an index value of the reference picture list 1 indicating the same picture as the reference picture indicated by the index L0.
  • FIG. 1 is a block diagram schematically illustrating an image encoding apparatus according to an embodiment of the present invention.
  • FIG. 2 is a conceptual diagram schematically illustrating a prediction unit according to an embodiment of the present invention.
  • FIG. 3 is a conceptual diagram schematically illustrating an example of a quad tree structure of a processing unit in a system to which the present invention is applied.
  • FIG. 4 is a block diagram schematically illustrating an image decoding apparatus according to an embodiment of the present invention.
  • FIG. 5 is a conceptual diagram schematically illustrating a predictor of an image decoding apparatus according to an embodiment of the present invention.
  • FIG. 6 is a conceptual diagram schematically illustrating an embodiment of a prediction method that may be used in a P picture and a B picture.
  • FIG. 7 is a conceptual diagram schematically illustrating a comparison between a conventional P picture, a conventional B picture, and a prediction method of a GPB.
  • FIG. 8 is a flowchart schematically illustrating a method for constructing a reference picture list in the encoder and transmitting reference picture list information according to an embodiment of the present invention.
  • FIG. 9 is a flowchart schematically illustrating a method of generating reference picture list by receiving reference picture list information in a decoder according to an embodiment of the present invention.
  • FIG. 10 is a conceptual diagram schematically illustrating an embodiment of a method of constructing a bind list.
  • FIG. 11 is a conceptual diagram schematically illustrating an embodiment of a hierarchical coding structure.
  • FIG. 12 is a conceptual diagram schematically showing an embodiment of a method of generating a combined list using a temporal level according to the present invention.
  • FIG. 13 is a flowchart illustrating an embodiment of a method for generating a combine list according to the present invention.
  • FIG. 14 is a conceptual diagram schematically illustrating a reference picture selection method according to an embodiment of the present invention.
  • 15 is a flowchart schematically illustrating a method for deriving motion information according to an embodiment of the present invention.
  • 16 is a conceptual diagram schematically illustrating an embodiment of a reference picture and a reference picture list for a GPB and low delay B picture.
  • 17 is a flowchart schematically illustrating a method of transmitting a reference picture list of an encoder according to an embodiment of the present invention.
  • FIG. 18 is a flowchart schematically illustrating a method of generating a reference picture list of a decoder according to an embodiment of the present invention.
  • 19 is a conceptual diagram schematically illustrating an embodiment of a picture in which a reference picture index of list 0 and a reference picture index of list 1 indicate the same reference picture.
  • 20 is a flowchart schematically illustrating a method of transmitting a reference picture index of an encoder according to an embodiment of the present invention.
  • 21 is a flowchart schematically illustrating a reference picture index generation method of a decoder according to an embodiment of the present invention.
  • each of the components in the drawings described in the present invention are shown independently for the convenience of the description of the different characteristic functions in the image encoding / decoding apparatus, each component is implemented by separate hardware or separate software It does not mean to be.
  • two or more of each configuration may be combined to form one configuration, or one configuration may be divided into a plurality of configurations.
  • Embodiments in which each configuration is integrated and / or separated are also included in the scope of the present invention without departing from the spirit of the present invention.
  • the components may not be essential components for performing essential functions in the present invention, but may be optional components for improving performance.
  • the present invention can be implemented including only the components essential for implementing the essentials of the present invention except for the components used for improving performance, and the structure including only the essential components except for the optional components used for improving performance. Also included within the scope of the present invention.
  • the image encoding apparatus 100 may include a picture splitter 105, a predictor 110, a transformer 115, a quantizer 120, a realigner 125, and an entropy encoder 130. , An inverse quantization unit 135, an inverse transform unit 140, a filter unit 145, and a memory 150.
  • the picture dividing unit 105 may divide the input picture into at least one processing unit.
  • the processing unit may be a prediction unit (PU), a transform unit (TU), or a coding unit (CU).
  • the predictor 110 includes an inter prediction unit for performing inter prediction and an intra prediction unit for performing intra prediction.
  • the prediction unit 110 generates a prediction block by performing prediction on the processing unit of the picture in the picture division unit 105.
  • the processing unit of the picture in the prediction unit 110 may be a coding unit, a transformation unit, or a prediction unit.
  • the processing unit in which the prediction is performed may differ from the processing unit in which the prediction method and the details are determined.
  • the method of prediction and the prediction mode are determined in units of prediction units, and the performance of prediction may be performed in units of transform units.
  • the residual value (residual block) between the generated prediction block and the original block is input to the converter 115.
  • prediction mode information and motion vector information used for prediction are encoded by the entropy encoder 130 together with the residual value and transmitted to the decoder.
  • the transform unit 115 performs a transform on the residual block in transform units and generates transform coefficients.
  • the transform unit in the transform unit 115 may be a transform unit and may have a quad tree structure. In this case, the size of the transform unit may be determined within a range of a predetermined maximum and minimum size.
  • the transform unit 115 may transform the residual block using a discrete cosine transform (DCT) and / or a discrete sine transform (DST).
  • DCT discrete cosine transform
  • DST discrete sine transform
  • the quantization unit 120 may generate quantization coefficients by quantizing the residual values transformed by the transformation unit 115.
  • the value calculated by the quantization unit 120 is provided to the inverse quantization unit 135 and the reordering unit 125.
  • the reordering unit 125 rearranges the quantization coefficients provided from the quantization unit 120. By rearranging the quantization coefficients, the efficiency of encoding in the entropy encoder 130 may be increased.
  • the reordering unit 125 may rearrange the quantization coefficients in the form of a two-dimensional block into a one-dimensional vector form through a coefficient scanning method.
  • the reordering unit 125 may increase the entropy coding efficiency of the entropy encoder 130 by changing the order of coefficient scanning based on probabilistic statistics of coefficients transmitted from the quantization unit.
  • the entropy encoder 130 may perform entropy encoding on the quantized coefficients rearranged by the reordering unit 125.
  • Entropy encoding may use, for example, an encoding method such as Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), or Context-Adaptive Binary Arithmetic Coding (CABAC).
  • CABAC Context-Adaptive Binary Arithmetic Coding
  • the entropy encoder 130 may include quantization coefficient information, block type information, prediction mode information, division unit information, prediction unit information, transmission unit information, and motion vector of the coding unit received from the reordering unit 125 and the prediction unit 110.
  • Various information such as information, reference picture information, interpolation information of a block, and filtering information can be encoded.
  • the inverse quantization unit 135 inverse quantizes the quantized values in the quantization unit 120, and the inverse transformer 140 inversely transforms the inverse quantized values in the inverse quantization unit 135.
  • the residual value generated by the inverse quantization unit 135 and the inverse transformer 140 may be combined with the prediction block predicted by the prediction unit 110 to generate a reconstructed block.
  • the filter unit 145 may apply a deblocking filter and / or an adaptive loop filter (ALF) to the reconstructed picture.
  • ALF adaptive loop filter
  • the deblocking filter may remove block distortion generated at the boundary between blocks in the reconstructed picture.
  • the adaptive loop filter may perform filtering based on a value obtained by comparing a reconstructed image with an original image after the block is filtered through a deblocking filter. ALF may be performed only when high efficiency is applied.
  • the filter unit 145 may not apply filtering to the reconstructed block used for inter prediction.
  • the memory 150 may store the reconstructed block or the picture calculated by the filter unit 145.
  • the reconstructed block or picture stored in the memory 150 may be provided to the predictor 110 that performs inter prediction.
  • a coding unit is a unit in which coding / decoding of a picture is performed and may be divided with a depth based on a quad tree structure.
  • the coding unit may have various sizes, such as 64x64, 32x32, 16x16, and 8x8.
  • the encoder may transmit information about a largest coding unit (LCU) and a minimum coding unit (SCU) to the decoder.
  • Information (depth information) regarding the number of splittable times together with information about the maximum coding unit and / or the minimum coding unit may be transmitted to the decoder.
  • Information on whether the coding unit is split based on the quad tree structure may be transmitted from the encoder to the decoder through flag information such as a split flag.
  • the predictor 200 may include an inter prediction unit 210 and an intra prediction unit 220.
  • the inter prediction unit 210 may generate a prediction block by performing prediction based on information of at least one of a previous picture or a subsequent picture of the current picture.
  • the intra prediction unit 220 may generate a prediction block by performing prediction based on pixel information in the current picture.
  • the inter prediction unit 210 selects a reference picture with respect to the prediction unit and selects a reference block having the same size as that of the prediction unit in units of integer pixel samples. Subsequently, the inter prediction unit 210 is most similar to the current prediction unit in sub-integer sample units such as 1/2 pixel sample unit and 1/4 pixel sample unit, so that the residual signal is minimized and the size of the motion vector to be encoded is also minimal. Generate a predictive block that can be.
  • the motion vector may be expressed in units of integer pixels or less. For example, the motion vector may be expressed in units of 1/4 pixels for luminance pixels and in units of 1/8 pixels for chrominance pixels.
  • Information about the index and the motion vector of the reference picture selected by the inter prediction unit 210 is encoded and transmitted to the decoder.
  • FIG. 3 is a conceptual diagram schematically illustrating an example of a quad tree structure of a processing unit in a system to which the present invention is applied.
  • the maximum coding unit (LCU) 300 may have a hierarchical structure composed of smaller coding units 310 through division, and the hierarchical structure of the coding unit may be based on size information, depth information, split flag information, and the like. Can be specified.
  • the size information of the maximum coding unit, the split depth information, and whether the current coding unit is split may be included in a sequence parameter set (SPS) on the bit stream and transmitted to the image decoder.
  • SPS sequence parameter set
  • the split flag of the coding unit may not be transmitted to the minimum coding unit.
  • inter prediction may be performed in units of prediction units.
  • intra prediction a prediction mode may be determined in units of prediction units, and prediction may be performed in units of prediction units.
  • a prediction mode may be determined in units of prediction units, and intra prediction may be performed in units of transformation units.
  • the prediction unit 320 may be 2N ⁇ 2N or N ⁇ N (N is an integer), and in case of inter prediction, the prediction unit 330 may be 2N ⁇ . 2N, 2N ⁇ N, N ⁇ 2N, or N ⁇ N.
  • N ⁇ N may be determined to be applied only to the minimum size coding unit or to be applied only to intra prediction.
  • N ⁇ mN, mN ⁇ N, 2N ⁇ mN, or mN ⁇ 2N (m ⁇ 1) may be further defined and used.
  • the image decoder 400 includes an entropy decoder 410, a reordering unit 415, an inverse quantizer 420, an inverse transform unit 425, a predictor 430, and a filter unit 435. And a memory 440.
  • the input bit stream may be decoded according to a procedure in which image information is processed by the image encoder.
  • VLC variable length coding
  • 'VLC' variable length coding
  • the entropy decoder 410 may also be identical to the VLC table used in the encoder. Entropy decoding can be performed by implementing a VLC table. Even when CABAC is used to perform entropy encoding in the image encoder, the entropy decoding unit 410 may perform entropy decoding using CABAC correspondingly.
  • Information for generating a prediction block among the information decoded by the entropy decoder 410 may be provided to the predictor 430, and a residual value of which entropy decoding is performed by the entropy decoder may be input to the reordering unit 415.
  • the reordering unit 415 may reorder the bit stream deentropy decoded by the entropy decoding unit 410 based on a method of reordering the image encoder.
  • the reordering unit 415 may reorder the coefficients expressed in the form of a one-dimensional vector by restoring the coefficients in the form of a two-dimensional block.
  • the reordering unit 415 may receive the information related to the coefficient scanning performed by the encoder and perform the rearrangement by performing a reverse scanning method based on the scanning order performed by the corresponding encoder.
  • the inverse quantization unit 420 may perform inverse quantization based on the quantization parameter provided by the encoder and the coefficient values of the rearranged block.
  • the inverse transformer 425 may perform inverse DCT and / or inverse DST on DCT and DST performed by the encoder of the encoder with respect to the quantization result performed by the image encoder.
  • the inverse transform may be performed based on a transmission unit determined by the encoder or a division unit of an image.
  • the DCT and / or DST may be selectively performed according to a plurality of pieces of information, such as a prediction method, a size and a prediction direction of the current block, and the inverse transformer 425 of the decoder is performed by the transformer of the encoder.
  • Inverse transformation may be performed based on the transformation information.
  • the prediction unit 430 may generate the prediction block based on the prediction block generation related information provided by the entropy decoding unit 410 and previously decoded blocks and / or picture information provided by the memory 440.
  • the reconstruction block may be generated using the prediction block generated by the predictor 430 and the residual block provided by the inverse transform unit 425.
  • the reconstructed block and / or picture may be provided to the filter unit 435.
  • the filter unit 435 applies deblocking filtering, sample adaptive offset (SAO), and / or adaptive loop filtering (ALF) to the reconstructed block and / or picture.
  • deblocking filtering sample adaptive offset (SAO)
  • ALF adaptive loop filtering
  • the memory 440 may store the reconstructed picture or block to use as a reference picture or reference block, and may provide the reconstructed picture to the output unit.
  • FIG. 5 is a conceptual diagram schematically illustrating a predictor of an image decoding apparatus according to an embodiment of the present invention.
  • the predictor 500 may include an intra prediction unit 510 and an inter prediction unit 520.
  • the intra prediction unit 510 may generate a prediction block based on pixel information in the current picture when the prediction mode for the prediction unit is an intra prediction mode (intra prediction mode).
  • the inter picture prediction unit 520 may provide information, for example, motion required for inter picture prediction of the current prediction unit provided by the image encoder when the prediction mode for the corresponding prediction unit is an inter prediction mode (inter prediction mode).
  • the inter-screen prediction of the current prediction unit may be performed based on information included in at least one of a previous picture or a subsequent picture of the current picture including the current prediction unit using information on a vector, a reference picture index, and the like.
  • the motion information may be derived in response to the skip flag, the merge flag, and the like of the coding unit received from the encoder.
  • a "picture” or a “picture” can represent the same meaning as a “picture” according to the configuration or expression of the invention, the “picture” may be described as a “picture” or a “picture”.
  • inter prediction and inter prediction have the same meaning
  • intra prediction and intra prediction have the same meaning.
  • the inter prediction unit may generate a prediction block by performing prediction on the current unit using information of at least one of a previous picture or a subsequent picture of the current picture, not the current picture. have.
  • a picture used for prediction is referred to as a reference picture.
  • the reference picture used for prediction of the current unit may be represented by using a reference picture index and a motion vector information indicating the reference picture.
  • the P picture and the B picture may be encoded in the inter prediction mode, and the I picture, the P picture, and the B picture will be described later.
  • 6 is a conceptual diagram schematically illustrating an embodiment of a prediction method that may be used in a P picture and a B picture. 6 includes a prediction method 610 for a P picture and a prediction method 620, 630, 640 for a B picture.
  • a block to be predicted currently is called a prediction target block
  • a block generated by the prediction is a prediction block
  • a block used for prediction of the prediction target block in a reference picture is referred to as a reference block.
  • POC Picture Order Count
  • An I picture is a picture that is encoded independently in the picture regardless of the picture before and after. Prediction in the time direction is not applied to the I picture, and only intra-picture information is used for the encoding process.
  • the P picture is a picture that can be encoded by unidirectional prediction between pictures using one reference picture.
  • FIG. 6 shows unidirectional prediction 610 in a P picture.
  • P picture requires one reference picture list, which is referred to as reference picture list0.
  • the inter prediction using the reference picture selected from the reference picture list 0 is called L0 prediction, and L0 prediction is mainly used for forward prediction.
  • intra prediction or L0 prediction may be performed.
  • prediction may be performed on a prediction target block using one reference block that exists in the past than the current picture in decoding order. Accordingly, as shown in FIG. 6 (610), prediction using a past reference picture in time order may be used to perform inter-screen forward prediction. In addition, prediction using a future reference picture in time order may be used to perform reverse prediction between screens. Therefore, the P picture uses only one piece of motion information (motion vector, reference picture index) for the block to be predicted in one direction.
  • motion vector, reference picture index motion vector, reference picture index
  • a B picture is a picture that can be encoded by forward, backward, or bidirectional prediction between pictures using one or more, for example, two reference pictures.
  • One or more reference pictures may be used in the B picture.
  • the B picture requires two reference picture lists, and the two reference picture lists are referred to as reference picture list 0 and reference picture list 1, respectively.
  • the inter prediction using the reference picture selected from the reference picture list 0 is called L0 prediction, and L0 prediction is mainly used for forward prediction.
  • the inter prediction using the reference picture selected from the reference picture list 1 is called L1 prediction, and L1 prediction is mainly used for backward prediction.
  • inter prediction using two reference pictures selected from reference picture list 0 and reference picture list 1 is called bi prediction.
  • intra prediction, L0 prediction, L1 prediction, or bi prediction using two pieces of motion information may be performed.
  • bi-prediction 620 in a B picture.
  • prediction may be performed on a prediction target block by using both a forward reference block in a past reference picture and a backward reference block in a future reference picture in time order. That is, bidirectional prediction using both the forward reference block and the backward reference block may be used to perform inter prediction.
  • FIG. 6 also shows forward prediction 630 using two past reference pictures and B backward prediction 640 using two future reference pictures in a B picture.
  • two past reference pictures may be used in time order, and inter prediction may be performed on the prediction target block.
  • two future reference pictures may be used in temporal order to perform inter prediction on the prediction block.
  • the B picture may use up to two pieces of motion information for the picture to be predicted in the forward and / or reverse direction. B pictures are delayed but are often used to achieve high performance.
  • the structure of the above-mentioned reference picture list will be described later.
  • FIG. 7 is a conceptual diagram schematically illustrating a comparison between a conventional P picture, a conventional B picture, and a prediction method of a GPB. Referring to FIG. 7, each picture is shown in the display order (POC) of the screen.
  • POC display order
  • a reference picture list may be configured of pictures used for inter prediction for the current picture.
  • the B picture requires two reference picture lists: reference picture list 0 and reference picture list 1.
  • a picture having the same reference picture list 0 and reference picture list 1 among the B pictures is referred to as Generalized P and B or Generalized B picture.
  • only forward prediction may be allowed in the GPB, in which case backward prediction may not be performed.
  • prediction may be performed using two or more pieces of motion information on a block to be predicted, similarly to a B picture. If only forward prediction is allowed in GPB, no delay due to backward prediction is involved. In GPB, low delay coding is possible while maintaining high coding performance.
  • the characteristics of the I picture, the P picture, and the B picture may be different for each slice in one picture, not in a picture unit.
  • a slice having a feature of an I picture, a slice having a feature of a P picture, and a slice having a feature of a B picture in a slice unit may be referred to as an I slice, a P slice, and a B slice, respectively.
  • GPBs may be referred to as GPB slices or Generalized B slices.
  • reference picture list 0 is referred to as list 0
  • reference picture list 1 is referred to as list 1.
  • the information about the list may be generated by the inter-prediction unit of the encoder and encoded to form a bitstream and transmitted to the decoder.
  • the transmitted information may be decoded and used to perform prediction in the inter prediction unit of the decoder.
  • the maximum number of reference pictures for constructing the reference picture list may be defined by syntax elements as included in [Table 1] below in one embodiment.
  • the maximum number of reference pictures allowed in List 0 and List 1 may be defined first.
  • the current slice is a P or B slice, whether or not to update the information about the maximum number of reference pictures of the PPS in the current slice using num_ref_idx_active_override_flag in the slice header (slice_header).
  • num_ref_idx_active_override_flag 1
  • the maximum number of reference pictures allowed in the current slice may be defined using num_ref_idx_l0_active_minus1 and num_ref_idx_l1_active_minus1 .
  • Each of the pieces of information may be included in the PPS and the slice header and transmitted from the encoder to the decoder through the bitstream.
  • the maximum number of reference pictures is determined based on the PPS and may be adaptively determined differently for each slice. However, it is not always allowed to have different numbers of reference pictures for slices belonging to the same picture. In this case, num_ref_idx_active_override_flag need not be transmitted for each slice header.
  • the maximum number of reference pictures may be fixed for a sequence unit by being extended in the PPS unit. In this case, if the maximum number of reference pictures is defined and transmitted in both the PPS and the slice header, unnecessary information duplication occurs.
  • Table 2 shows an embodiment of a method of newly defining the maximum number of reference pictures in the syntax in order to prevent waste of such information.
  • a flag called num_ref_idx_no_override_flag is added to the Sequence Parameter Set (SPS).
  • SPS Sequence Parameter Set
  • the condition for the flag is added to the slice header so that the information about the maximum reference picture in the current slice is updated only when the flag is zero. Therefore, when the flag is set, that is, 1, the maximum number of reference pictures defined in the PPS may be commonly applied to all slices in the same picture.
  • Table 3 shows another embodiment of a method of defining the maximum number of reference pictures in the syntax for constructing the reference picture list.
  • a flag called num_ref_idx_no_override_flag is added to a picture parameter set (PPS).
  • PPS picture parameter set
  • the flag is a flag indicating whether the maximum number of reference pictures defined in the PPS is applied to all slices in the picture as it is.
  • the condition for the flag is added to the slice header so that the information about the maximum reference picture in the current slice is updated only when the flag is zero. Therefore, if the flag is set, that is, 1, the maximum number of reference pictures defined in the PPS is applied to all slices in the same picture in common.
  • num_ref_idx_no_override_flag defined in the PPS replaces the role played by num_ref_idx_active_override_flag in the slice header. Therefore, when a new flag is defined in the PPS, the amount of information transmitted through the bitstream is the same as when a new flag is not defined in the PPS.
  • num_ref_idx_active_override_flag should be transmitted or signaled by the number of slices.
  • the new flag is defined in the PPS, since the newly defined flag needs to be transmitted only once, the amount of information transmitted through the bitstream may be reduced.
  • Table 4 shows another embodiment of a method of defining the maximum number of reference pictures in the syntax for constructing the reference picture list.
  • a flag called num_ref_idx_infer_flag is added to the Sequence Parameter Set (SPS).
  • SPS Sequence Parameter Set
  • the flag is a flag indicating whether the same maximum number of reference pictures is used for the entire sequence. If the flag is 1, the number of maximum reference pictures num_ref_idx_l0_minus1 and num_ref_idx_l0_minus1 is defined in the SPS.
  • the maximum number of reference pictures for each is transmitted only when the flag num_ref_idx_infer_flag added to the SPS is zero. Therefore, since the same maximum number of reference pictures for the entire sequence can be defined using the flags and syntax elements added to the SPS, the amount of information transmitted through the bitstream can be reduced.
  • FIG. 8 is a flowchart schematically illustrating a method of transmitting a reference picture list information by constructing a reference picture list in an encoder according to an embodiment of the present invention.
  • the encoder generates a reference picture list (S810).
  • a reference picture list information related to the reference picture list may be generated together, and the information related to the reference picture list may include the maximum number of reference pictures for the list 0 and / or the list 1.
  • the maximum number of reference pictures may be defined first in the PPS as shown in [Table 1]. If it is determined to update it in the current slice, the number of reference pictures finally allowed in the current slice can be determined. In addition, as shown in Table 2, for all slices in the same picture, a flag indicating whether the maximum number of reference pictures defined in the PPS is applied as it may be defined in the SPS.
  • a flag indicating whether the maximum number of reference pictures defined in the PPS is applied as it is may be defined in the PPS.
  • a flag indicating whether the same maximum number of reference pictures is used for the entire sequence may be defined in the SPS.
  • the coded reference picture list information forms a bitstream and is transmitted from the encoder to the decoder (S830).
  • the maximum number of reference pictures defined in the PPS may be commonly applied to all slices in the current picture. At this time, information about the maximum number of reference pictures allowed in each slice may not be transmitted.
  • the maximum reference picture number information when the newly defined flag is 1 in the SPS, the maximum reference picture number information is included and transmitted in the SPS, and the maximum reference picture number may be commonly applied to the entire sequence. In this case, information about the number of separate reference pictures may not be transmitted in each PPS or slice header.
  • FIG. 9 is a flowchart schematically illustrating a method of generating reference picture list by receiving reference picture list information in a decoder according to an embodiment of the present invention.
  • the decoder receives information about a reference picture list (S910).
  • the information related to the reference picture list may include the maximum number of reference pictures for List0 and / or List1.
  • the received maximum reference picture number information includes information that can be transmitted by the encoder of FIG. 8, and may differ from each other according to the above-described embodiments of Tables 1 to 4. For example, in the embodiment of Table 4, when the newly defined flag is 1 in the SPS, only the maximum reference picture number information for the entire sequence included in the SPS is received, and a separate picture for each picture or slice is received. The maximum reference picture number information of may not be received.
  • the decoder may generate a reference picture list by using the decoded reference picture list information (S930).
  • a B slice or a GPB slice may have three prediction modes as modes used for inter prediction.
  • the three modes are bi-prediction mode (Pred_BI) using two reference pictures from List0 and List1, and uni-prediction mode using one reference picture from List0, respectively. It is a single prediction mode (Pred_L1) using (Pred_L0) and one reference picture from List1.
  • Pred_BI bi-prediction mode
  • Pred_L1 single prediction mode
  • Pred_L0 single prediction mode
  • bi-prediction is referred to as bi-prediction.
  • each mode Pre_BI, Pred_L0, and Pred_L1 may be indicated by a syntax element called inter_pred_idc on the system.
  • inter_pred_idc a syntax element
  • the index of inter_pred_idc transmitted per PU may be replaced with a flag such as inter_pred_flag, and the amount of information and overhead transmitted Can be reduced. Therefore, a combined list can be used for this.
  • the combined list may also be referred to as a combined reference picture list or LC.
  • 10 is a conceptual diagram schematically illustrating an embodiment of a method of constructing a bind list. 10 schematically illustrates a method of generating a combined list from lists 0 and 1.
  • reference pictures of list 0 and list 1 may be alternately mapped or inserted into a bind list.
  • the pictures may be mapped or inserted in the bind list in the order of the distant pictures from the pictures close in time to the current picture.
  • the mapping or insertion into the combined list may be skipped.
  • the above-described combined list generation order is referred to as a default order.
  • syntax elements such as pic_from_list_0_flag and ref_idx_list_curr may be transmitted for more efficient optimal combine list construction. The meaning of each syntax element is mentioned later.
  • Three modes can be reduced to two modes.
  • the two types of modes are pair prediction mode Pred_BI using two reference pictures from list 0 and list 1 and single prediction mode Pred_LC using one reference picture from the combined list. Therefore, the amount of information transmitted may be reduced for the reasons described above.
  • each syntax may be defined as follows.
  • ref_pic_list_combination_flag 1
  • list 0 and list 1 are combined to generate a combined list for short prediction.
  • the flag is 0, list 0 and list 1 are the same, and list 0 is used as a bind list.
  • the list 0 and the list 1 are the same, and the GPB means the picture in which the list 0 and the list 1 are the same.
  • num_ref_idx_lc_active_minus2 +2 indicates the number of reference pictures selected from list 0 or list 1 in the bind list.
  • the value of the corresponding syntax may be at least 2, and thus minus2 may be applied.
  • ref_pic_list_modification_flag_lc When ref_pic_list_modification_flag_lc is 1, it indicates that there are syntax elements pic_from_list_0_flag and ref_idx_list_curr for specifying the mapping from the combine list to list 0 and list 1.
  • the flag When the flag is 0, the syntax element does not exist, and a bind list may be generated according to a predetermined default order or method.
  • pic_from_list_0_flag indicates whether the current reference picture added to the bind list is from list 0 or from list 1. If the flag is 1, the list 0 may be added. If the flag is 0, the current reference picture may be added from the list 1.
  • ref_idx_list_curr indicates a reference picture index of a picture in the current reference picture list (list 0 or list 1) added at the end of the combined list.
  • LC List Combined
  • LC generation methods may include using a predefined default order followed by truncation of the LC length or using other syntax that is not based on the associated syntax element in the slice header.
  • the LC may be generated by alternately inserting entries of List 0 and List 1 in ascending order starting from the smallest index of List 0 as described above with reference to FIG. 10.
  • the quantization step QP may be encoded without directly encoding the quantization step.
  • the encoded QP may be transmitted from the encoder to the decoder, and the decoder may derive the quantization step from the QP.
  • the quantization step may increase, and the signal-to-noise ratio (SNR) may decrease.
  • a hierarchical coding structure may be used.
  • FIG. 11 is a conceptual diagram schematically illustrating an embodiment of a hierarchical coding structure.
  • the hierarchical coding structure according to the embodiment of FIG. 11 has four hierarchical levels, and the number of hierarchical steps may not be limited to four.
  • the size of the GOP is 8 in FIG.
  • GOP Group Of Pictures
  • Reference pictures referred to by each picture in FIG. 11 may be selected differently from those shown in FIG. 11.
  • pictures of different hierarchical levels may have different temporal levels.
  • a picture with a lower temporal level can be coded using a smaller QP. This means that a picture with a lower temporal level can provide better coding efficiency than a picture with a higher temporal level.
  • a combined list generation may be considered considering temporal level and / or QP.
  • FIG. 12 is a conceptual diagram schematically showing an embodiment of a method of generating a combined list using a temporal level according to the present invention.
  • pictures are shown in POC, that is, display order or temporal order, and the size of the POC is eight.
  • POC display order or temporal order
  • the size of the POC is eight.
  • LC represents a bind list constructed according to a default order
  • LC (prop.) Represents a bind list constructed according to an embodiment of the present invention.
  • a reference picture having a lower temporal level may be preferentially inserted when an entry of List0 or List1 is inserted into the LC.
  • the following rule may be applied when generating a combine list.
  • a reference picture that is closer in time to the current picture can be inserted into the LC first. This means that a reference picture with a smaller POC difference is inserted first.
  • a picture having a lower temporal level may be inserted first.
  • entries from List0 and List1 have the same temporal distance and the same temporal level as the current picture, the reference picture from List0 may be inserted first.
  • the combine list consisting of List 0 and List 1 inserted alternately starting from List 0 in the default order is 2, 4, 0, 6 in order.
  • the list of LCs may have entries of 4, 2, 0, and 6 in order.
  • the picture 1220 having a POC value of 5 has a lower temporal level of picture 8 than picture 2. Therefore, according to an embodiment of the present invention, picture 8 may be inserted before picture 2 in the bind list.
  • the picture 4 of the reference picture index (refIdx) of List 1 is 1 is already inserted into the LC. Since it is the same as the picture 4 from the list 0, it may not be inserted.
  • the list of LCs may have entries of 8, 4, and 2 in order.
  • the picture 1240 having a POC value of 7 has a lower temporal level of picture 8 than picture 6. Therefore, according to an embodiment of the present invention, the picture number 8 may be inserted before the picture number 6 in the combined list.
  • a picture having a lower temporal level can be coded using a lower QP. Therefore, the bind level may be generated using the QP value instead of the temporal level.
  • the reference picture having a lower QP value is inserted first.
  • the following rule may be applied when generating a combine list.
  • a reference picture that is closer in time to the current picture can be inserted into the LC first. This means that a reference picture with a smaller POC difference is inserted first.
  • a picture having a lower QP value may be inserted first.
  • the reference picture from List0 may be inserted first.
  • a flag indicating whether a combine list generation method using a temporal level or a QP value is enabled may be provided.
  • a picture having a lower QP or lower temporal level may be preferentially inserted when the nth entry of List0 (RefPicList0) and List1 (RefPicList1) is inserted into the combined list.
  • the flag when the n-th entry of List0 (RefPicList0) and List1 (RefPicList1) is inserted into the combined list, the picture of List0 may be inserted before the picture of List1. That is, when the flag is 0, a bind list may be generated according to a default order.
  • the flag may be defined in an SPS, PPS or slice header.
  • the flag generated by the encoder may be encoded and transmitted to the decoder.
  • a bind list may be generated using a temporal level or QP or in a default order according to the information of the flag.
  • FIG. 13 is a flowchart illustrating an embodiment of a method for generating a combine list according to the present invention.
  • FIG. 13 shows an embodiment when the new flag described above in FIG. 12 is 1;
  • the encoder or the decoder generates a reference picture list 0 and a reference picture list 1 (S1310).
  • the encoder or decoder determines the temporal distance from the current picture and the temporal level of each picture with respect to the picture from the reference picture list 0 and the picture from the reference picture list 1 (S1320). In this case, if the new flag described above in FIG. 12 is used, the temporal distance and temporal level may be determined when the flag is 1. In addition, a QP value may be used instead of the temporal level.
  • the encoder or decoder generates an LC using the temporal distance from the current picture, temporal level information of each picture, or the temporal distance from the current picture, QP information (S1330).
  • a picture having a temporal distance from the current picture may be preferentially inserted into the combine list. If the picture of list 0 and the picture of list 1 have the same temporal distance from the current picture, the picture with lower temporal level and / or lower QP value may be preferentially inserted into the combine list. When the temporal distance and temporal level are the same or the temporal distance and the QP value are the same, the pictures of the list 0 may be preferentially inserted into the bind list.
  • inter_pred_idc When a combined list is generated, a value that an inter_pred_idc syntax element may have may be reduced from three (L0, L1, Bi) to two (Uni, Bi). Accordingly, inter_pred_idc may be replaced by inter_pred_flag indicating whether bi-pred or uni_pred is used, and the amount of information transmitted may be reduced.
  • inter_pred_flag indicating whether bi-pred or uni_pred is used
  • a new syntax element for the combined list needs to be defined, in addition to ref_idx_lX, mvd_lX, and mvp_idx_lX, which are syntax elements for List0 and List1.
  • the new syntax element added to prediction_unit () may include a reference picture index ref_idx_lc of LC, motion vector difference mvd_lc of LC, motion vector predictor index mvp_idx_lc of LC, and the like.
  • New syntax elements added to the slice header may include num_ref_idx_lc_active_munus1, ref_pic_list_modification_flag_lc, pic_from_list_0_flag, ref_idx_list_curr, and the like.
  • mapping process for mapping LC related syntax to list 0 or list 1 for prediction units on which uni-prediction is performed.
  • a mapping process from refIdxLX to ref_idx_lc, mvd_lX to mvd_lc, mvp_idx_lX to mvp_idx_lc and LcToLx to LX may be required.
  • x and X in Lx, lX or LX may be interpreted as 0 or 1, where L and l mean a list. This applies equally below.
  • the burden for the definition of the LC-related syntax, the mapping to the list 0 or the list 1, or the duplication of the process may occur.
  • ref_idx_lc may be unnecessary redundant to ref_idx_l0 and ref_idx_l1, mvd_lc to mvd_l0 and mvd_l1, and mvp_idx_lc to mvp_idx_l0 and mvp_idx_l1.
  • the slice header also contains syntax for modification of the bind list.
  • Such syntaxes allow the order of entries in the bind list to be optimized for coding efficiency. However, such syntaxes can also be unnecessary redundant.
  • num_ref_idx_lc_active_minus1 may substantially refer to num_ref_idx_l0_active_minus1 and num_ref_idx_l1_active_minus1
  • ref_pic_list_modification_flag_lc may be ref_pic_list_modification_flag_l_ and ref_pic_list_list_mod_list_mod_list_mod_list_list_mod_list_list_mod_list_list_mod_list_mod_list_mod_list_mod_list_mod_list_mod_list_mod_list_mod_lmod_lmod_ll
  • inter_pred_idc Since the combined list can reduce the number of bits for inter_pred_idc from 3 (L0, L1, Bi) to 2 (Uni, Bi), inter_pred_idc can be replaced with inter_pred_flag.
  • list 0 L0 can be utilized like LC.
  • List 0 related syntax that may be used instead of LC generation may include, for example, ref_idx_l0, mvd_l0, mvp_idx_l0, and the like.
  • a new reference picture selection process may be provided to reduce the number of bits required for transmission of inter prediction mode information (eg, inter_pred_flag) without generating a new list.
  • FIG. 14 is a conceptual diagram schematically illustrating a reference picture selection method according to an embodiment of the present invention.
  • FIG. 14 schematically illustrates a reference picture selection method in the case where List0 related syntax is used instead of the combine list without generating the combine list as described above.
  • pictures are shown in POC, that is, display order or temporal order.
  • a picture having a POC value of 7 represents the current picture.
  • a picture indicated by the reference picture index refIdxL0 and refIdxL1, respectively, of the reference pictures included in the reference picture list 0 and the reference picture list 1 is indicated. That is, 1420 of FIG. 14 illustrates a relationship between refIdxL0 and a POC and a relationship between refIdxL1 and a POC.
  • 1430 of FIG. 14 illustrates a method of selecting a reference picture, which will be described later.
  • the input may be a reference picture index refIdxLX and the output may be a two-dimensional array of luma samples, that is, the reference picture refPicLX.
  • the output of the process may be RefPicListX [refIdxLX].
  • RefPicListX means a reference picture list and refIdxLX means a reference picture index indicating a reference picture in the reference picture list.
  • X may mean 0 or 1 in some cases.
  • a new reference picture selection process provided according to an embodiment of the present invention is as follows.
  • variable idxMax is set as follows.
  • idxMax ref_idx_l0
  • LX is set to a variable that points to L0 or L1.
  • idxL0 and idxL1 are set to the indexes in the reference picture lists RefPicList0 and RefPicList1. idxL0 and idxL1 are both set to zero.
  • variable idxLX is set to zero.
  • RefPicList0 [refIdxL0] is the first reference picture to appear, that is, if there were no duplicate reference pictures before,
  • RefPicList1 [refIdxL1] is the first reference picture to appear, i.e. if there were no previous reference pictures,
  • RefPicList0 [refIdxL0] is output.
  • RefPicList1 [refIdxL1] is output.
  • a schematic concept of the reference picture selection process is shown at 1430 of the embodiment of FIG. 14.
  • reference pictures of the list 0 and the list 1 may be scanned in a certain order to select the reference picture.
  • scanning starts from the first entry of reference picture list 0 (RefPicList0).
  • the reference picture list 0 (RefPicList0) and the reference picture list 1 (RefPicList1) may be sequentially scanned in turn. Therefore, the order in which the reference picture is scanned may be “6, 8, 4, 6, 2, 4, 0, 2” based on the POC of the picture.
  • reference pictures are not scanned but skipped.
  • a reference picture having refIdxL1 of 1 and POC of 6 in reference picture list 1 is not scanned. This is because the reference picture is already scanned when refIdxL0 is 0 in the reference picture list 0. Therefore, referring to FIG. 14, the order in which the reference pictures are scanned is “6, 8, 4, 2, 0”.
  • the List0 related syntax may be used instead of the LC related syntax without generating a separate bind list. That is, ref_idx_l0 may be used instead of ref_idx_lc.
  • the entries of the list 0 and the list 1 may be used as the reference picture by the above-described reference picture selection process.
  • the reference picture selection process it is only necessary to indicate whether inter_pred_flag is pair prediction or short prediction among inter prediction modes without generating LC. Therefore, the reference picture selection process can provide the advantages of the combined list as it is.
  • a new motion vector and reference picture index derivation process is as follows.
  • X may mean 0 or 1 in some cases.
  • idxL0 and idxL1 are set to the indexes in the reference picture lists RefPicList0 and RefPicList1. idxL0 and idxL1 are both set to zero.
  • variable idxLC is set to zero.
  • RefPicList0 [refIdxL0] is the first reference picture to appear, that is, if there were no duplicate reference pictures before,
  • RefPicList1 [refIdxL1] is the first reference picture to appear, i.e. if there were no previous reference pictures,
  • mvd_l0 [xP] [yP] [1] mvd_l0 [xP] [yP] [1] (vertical component of the motion vector difference)
  • refIdxL0 -1
  • refIdxL1 idxL1
  • mvd_l1 [xP] [yP] [1] mvd_l0 [xP] [yP] [1] (vertical component of the motion vector difference)
  • inter_pred_flag is not the same as Pred_Uni, Pred_LX, and Pred_BI,
  • variable mvdLX can be derived as follows.
  • the luminance motion vector mvLX can be derived as follows.
  • mvLX [0] mvpLX [0] + mvdLX [0]
  • mvLX [1] mvpLX [1] + mvdLX [1]
  • the order or method in which the reference picture is scanned is the same as described in the embodiment of FIG.
  • List0 related syntax is used instead of LC related syntax without generating a separate bind list. That is, ref_idx_l0, mvd_l0, and mvp_idx_l0 may be used instead of ref_idx_lc, mvd_lc, and mvp_idx_lc, respectively. Therefore, motion information such as a motion vector and a reference picture index in the process may be derived from a copy from list 0, not a combined list.
  • the entries of the list 0 and the list 1 may be used as the reference picture by the above-described reference picture selection process.
  • the process can provide the benefits of the bind list as it is.
  • the motion information may include a motion vector, a reference picture index, and the like.
  • 15 may include the method of selecting a reference picture according to the embodiment of FIG. 14 described above, and in the embodiment of FIG. 15, a bind list may not be generated.
  • the encoder or the decoder generates a reference picture list 0 and a reference picture list 1 (S1510). In this case, the encoder or the decoder may not generate a separate list such as a combined list.
  • the encoder or the decoder scans the reference picture using the reference picture list 0 and the reference picture list 1 (S1520). At this time, the first entry of the reference picture list 0 may be scanned. In addition, the reference picture list 0 and the reference picture list 1 may be sequentially scanned in order of increasing the reference picture index. Also, a reference picture that has been previously scanned may be skipped.
  • the encoder or the decoder may derive motion information such as a motion vector and a reference picture index from the selected reference picture through a scan process (S1530).
  • the List0 related syntax may be used instead of the LC related syntax without generating the combine list as in the above-described motion vector and reference picture index derivation process.
  • both entries of List 0 and List 1 can be used as reference pictures, and only inter-pred_flag needs to indicate whether inter-pred_flag is bi-prediction or uni-prediction among inter prediction modes without generating LC. Therefore, the amount of information transmitted may be reduced, such as when a bind list is generated.
  • ref_pic_list_combination_flag When the combined list is generated, a flag called ref_pic_list_combination_flag may be used as described above.
  • ref_pic_list_combination_flag When ref_pic_list_combination_flag is 1, list 0 and list 1 are combined to generate a combined list for short prediction.
  • the flag When the flag is 0, it means that list 0 and list 1 are the same, and list 0 may be used as a bind list.
  • the flag when the flag is 0, it may mean GPB.
  • a picture having the same reference picture list 0 and reference picture list 1 among the B pictures is referred to as a generalized P and B or a generalized B picture.
  • only forward prediction may be allowed in the GPB.
  • FIG. 16 is a conceptual diagram schematically illustrating an embodiment of a reference picture and a reference picture list for a GPB and low delay B picture.
  • pictures are shown in the display order of POCs, that is, pictures from the left.
  • 1610 of FIG. 16 illustrates a case in which the current picture is a GPB, that is, a generalized B picture.
  • the reference picture indices of List 0 and List 1 may not be the same. However, the reference pictures of List 0 and List 1 are both reference pictures used for forward prediction.
  • a P picture having only a forward reference picture or a B picture having only a forward reference picture may be used for low delay coding.
  • a B picture having only a forward reference picture may be referred to as a low delay B picture, and may be referred to as a low delay B slice in a slice unit.
  • FIG. 1620 of FIG. 16 illustrates a case where the current picture is a low delay B picture.
  • a reference picture is generated including an initialization and a modification process.
  • the reference picture list 0 and the reference picture list 1 are always generated together for the B slice.
  • the slice header includes information transmitted for an inter picture (P picture, B picture).
  • information for list 1 is additionally included and transmitted.
  • Information for list 1 may include num_ref_idx_l1_active_minus1, collocated_from_l0_flag, ref_pic_list_modification_flag_l1, and the like.
  • num_ref_idx_l1_active_minus1 indicates the number of reference pictures present in list 1
  • collocated_from_l0_flag indicates whether a block having the same position used for inter prediction is obtained from list 0 or list 1.
  • ref_pic_list_modification_flag_l1 has the same meaning as ref_pic_list_modification_flag_lc described above, and only lc is a flag changed to l1.
  • the information for List 1 may also include switched interpolation filter with offset (SIFO) information indicating an interpolation filter and offset information to be used when reference pictures in List 1 are used.
  • SIFO switched interpolation filter with offset
  • list 0 and list 1 In a typical B picture, all information for list 0 and list 1 should be transmitted. However, in the GPB picture or the GPB slice, the reference picture list 0 and the reference picture list 1 are the same. Therefore, in GPB, list 0 does not need to be transmitted separately since list 0 has substantially all the information about the reference picture.
  • This duplication of information can be resolved by copying all entries of list 0 to list 1 without using list 1 in the case of GPB slices. That is, if the current slice is a B slice and List0 and List1 are the same, all entries of List0 can be copied to List1 in the same order.
  • the coding method of the slice header may be modified to remove duplication of information in the GPB, and a syntax element called generalized_b_slice may be provided.
  • generalized_b_slice indicates whether the current slice is a GPB slice.
  • a syntax element indicating whether or not it is a GPB is not limited to its name (generalized_b_slice), and a syntax element indicating whether it is a GPB under any other name is included in the spirit of the present invention.
  • the slice header may include information as shown in Table 6 below.
  • Table 7 below shows a modified slice header according to an embodiment of the present invention.
  • Table 8 below is an example of the SIFO parameter sifo_param () to which the present invention is not applied.
  • Table 8 also has the same problems as in Table 6.
  • Table 9 shows an embodiment of the SIFO parameter sifo_param () to which the present invention is applied.
  • information related to list 1 may be transmitted only when the current slice is not GPB using generalized_b_slice . Thus, duplication of unnecessary information for the GPB slice can be eliminated.
  • Table 10 below is an example of syntax to which the present invention is not applied.
  • Table 10 also has the same problems as in Table 6.
  • Table 11 below shows an embodiment of syntax to which the present invention is applied.
  • the information sifo_offset_l1 related to list 1 may be transmitted only when the current slice is not GPB using generalized_b_slice . Thus, duplication of unnecessary information for the GPB slice can be eliminated.
  • the method for preventing transmission of List 1 related information from a GPB slice is not limited to the embodiments of Tables 7, 9, and 11.
  • GPB refers to a slice in which List0 and List1 are the same among B slices, it may not be necessary to determine whether the current picture is a GPB in a P picture that does not need information about List1. Therefore, the generalized_b_slice flag may be transmitted only when the current slice is a B slice.
  • the encoder may not separately transmit the generalized_b_slice syntax to the decoder without transmitting List1 related information for the GPB slice.
  • the decoder may compare the reference picture of list 0 with the reference picture of list 1 to determine whether it is a GPB, and may derive the value of the syntax to 1 and use it. In this case, too, it may have an advantage that transmission of unnecessary information regarding the list 1 is prevented.
  • the amount of information transmitted using the same principle may be reduced. At this time, the amount of information transmitted with respect to the reference picture list can be reduced by half, and coding efficiency can be increased.
  • the list 1 related information when the current slice is a generalized B slice, the list 1 related information may be omitted without being transmitted because it is included in the slice header.
  • the omitted slice header information may be obtained by directly copying information of reference pictures having the same POC of List0. Therefore, the decoder can copy and use the omitted information from the list 0 related information.
  • the syntax below illustrates one embodiment of a method for deriving information about List1.
  • num_ref_idx_l1_active_munus1 num_ref_idx_l0_active_minus1;
  • sifo_info_l1 sifo_info_l0;
  • a merge skip method, a prediction unit merge method, an AMVP method, or the like may be used.
  • AMVP method motion prediction related information of one AMVP candidate block among two spatial AMVP candidate blocks and one temporal AMVP candidate block may be used as motion vector related information of the current prediction target block. Additionally, the difference between the motion vector value of the current prediction target block and the motion vector value of the selected AMVP candidate block may be used as the motion prediction related information of the current block.
  • a co-located block may be derived from a reference picture based on the current prediction target block.
  • both list 0 and list 1 of the same position block can be checked.
  • a motion vector that crosses the current picture among the motion vectors of the same location block may be selected for temporal motion vector scaling.
  • the motion vector of the same position block may not cross the current picture. In this case, it may be meaningless that the motion vector traversing the current picture is preferentially selected, and complexity may increase since both list 0 and list 1 should be checked.
  • a method of first checking a list identical to the reference picture list of the current picture among the reference picture lists of the same position block may be used. At this time, if there is no valid motion information in the checked list first, another list of the same position block may be checked.
  • the embodiment of the present invention is not limited to the above, and the list to be checked first may be selected in other ways.
  • Information on whether the picture is a low delay B picture may be included in a flag and transmitted from the encoder to the decoder.
  • the encoder may not transmit information about whether the encoder is a low delay B picture.
  • the decoder may compare the reference pictures of list 0 and list 1 to determine whether the picture is a low delay B picture.
  • ref_pic_list_combination_flag specifies whether reference picture list 0 and reference picture list 1 are the same or not. If List0 and List1 are equal, they correspond to GPB slices. Therefore, when the current slice is a GPB slice according to the flag information, transmission of List 1 related information may be skipped.
  • ref_pic_list_combination_flag may be signaled after List1 related syntax.
  • whether the current slice is a GPB slice cannot be determined, and therefore, transmission of the list1 related information cannot be skipped.
  • the signaling of the List1 related information in the GPB slice may be skipped.
  • ref_pic_list_identical_flag may be used instead of ref_pic_list_combination_flag as a flag name.
  • ref_pic_list_identical_flag When ref_pic_list_identical_flag is 0, list 0 and list 1 are combined to generate a combined list for short prediction.
  • the flag When the flag is 1, list 0 and list 1 are the same, and list 0 is used as a bind list.
  • the new flag may indicate whether List0 and List1 are the same. If the flag is 1, it means that the current slice is a GPB slice. However, the flag is not limited to the name and may be included in the spirit of the present invention as long as the syntax element indicates whether the list 0 and the list 1 are the same regardless of any other names.
  • Table 12 below shows an embodiment of a slice header configured using ref_pic_list_identical_flag according to the above-described method.
  • Table 13 below shows an embodiment of the ref_pic_list_modification () configuration in the slice header of Table 12.
  • Table 14 below shows an embodiment of the ref_pic_list_combination () configuration in the slice header of Table 12.
  • ref_pic_list_identical_flag is defined before List1 related syntax elements num_ref_idx_l1_active_minus1, collocated_from_l0_flag, and ref_pic_list_modification_flag_l1.
  • a condition is added to each List1 related syntax element so that List1 related syntax elements num_ref_idx_l1_active_minus1, collocated_from_l0_flag and ref_pic_list_modification_flag_l1 may be transmitted to the decoder only when ref_pic_list_identical_flag is 0.
  • List1 related syntax elements are not transmitted in the GPB slice. Therefore, transmission of unnecessary information in the GPB slice can be prevented.
  • the semantic of collocated_from_l0_flag may be changed by ref_pic_list_identical_flag.
  • collocated_from_l0_flag 1 means that the co-located picture is derived from list 0, otherwise it is derived from list 1.
  • collocated_from_l0_flag 1 means that the co-located picture is derived from list 0, otherwise it is derived from list 1. If collocated_from_l0_flag does not exist, collocated_from_l0_flag is assumed to be 1. Referring to Table 12, collocated_from_l0_flag is not present when the current slice is not a B slice or a GPB slice. In this case, since all the reference pictures can be derived from the list 0, the flag can be estimated as one.
  • a method of separately defining a GPB in a slice type may be used.
  • I slice In the slice type currently used, three kinds of slices are defined: I slice, P slice, and B slice. This can be confirmed in the following [Table 15].
  • Inter_pred_flag is used to specify whether bi-pred or uni-pred is used for the current prediction unit.
  • the slice type and the prediction mode or name corresponding to the inter_pred_flag value may be checked in the following [Table 16].
  • the prediction mode is estimated to be Pred_L0 if the slice type is P and Pred_BI if the slice type is B.
  • Table 17 shows an embodiment of a method of defining a slice type when a GPB is defined in the slice type.
  • Table 18 shows an embodiment of a method of defining inter_pred_flag when GPB is defined in a slice type.
  • the prediction mode is estimated to be Pred_L0 if the slice type is P and Pred_BI if the slice type is B or GPB.
  • the GPB slice means a slice in which the reference picture list 0 and the reference picture list 1 are the same.
  • the definition of the B slice may be changed by defining a separate slice type called GPB.
  • a conventional B slice refers to a slice decoded using intra prediction or inter prediction using at most two reference pictures.
  • GPB is defined as a separate slice type
  • a B slice is a slice that is decoded by using intra prediction or inter prediction using up to two reference pictures, and a slice in which reference picture list 0 and reference picture list 1 are not the same. Means. That is, in the definition of a new B slice, the GPB slice is not included in the B slice.
  • Table 19 below shows an embodiment of a slice header configured using a GPB slice type according to the present invention.
  • Table 20 below shows an embodiment of the ref_pic_list_modification () configuration in the slice header of Table 19.
  • the slice type is GPB as a condition for defining num_ref_idx_active_override_flag. This is because the newly defined B slice does not include the GPB slice. In addition, List0 related information is transmitted only in the B slice, not the GPB slice. Therefore, in the case of a GPB slice, unnecessary List1 related information may not be transmitted.
  • slice_type% 5! 3 means that the current slice is not an I slice.
  • 2 is used instead of 3 because the slice_type value of the I slice is 2, but 3 may be used instead of 2 since the slice_type value of the I slice is 3 in the newly defined slice type.
  • 17 is a flowchart schematically illustrating a method of transmitting a reference picture list of an encoder according to an embodiment of the present invention. 17 schematically illustrates the flow of a method of transmitting a reference picture list according to the embodiments of Tables 7, 9, 11, 14, and 17, and 20, described above. .
  • the encoder determines whether the current slice is GPB (S1710).
  • the encoder encodes and transmits information indicating whether the current slice is GPB (S1720).
  • the information indicating whether or not the GPB may be included in the slice header through generalized_b_slice and transmitted as in the embodiments of Tables 7, 9, and 11.
  • ref_pic_list_identical_flag having a modified ref_pic_list_combination_flag may be included in the slice header and transmitted.
  • the information indicating whether the GPB is a GPB slice is defined as a separate slice type as in the embodiments of Tables 17 to 20, and may be transmitted by being reflected in the syntax of the slice header.
  • the information related to the list 1 may not be transmitted. This is because List0 and List1 are the same in the GPB slice, so that List0 has substantially all the information about the reference picture.
  • the information related to the list 1 is not transmitted, redundancy of unnecessary information can be eliminated, and thus the coding efficiency can be improved by reducing the amount of transmitted information.
  • the encoder encodes and transmits information indicating whether the current slice is GPB and List 1 related information to the decoder (S1730).
  • FIG. 18 is a flowchart schematically illustrating a method of generating a reference picture list of a decoder according to an embodiment of the present invention.
  • FIG. 18 schematically illustrates a flow of a method of generating a reference picture list according to the embodiments of Tables 7, 9, 11, 14, and 17, and 20, described above. .
  • the decoder receives information indicating whether the current slice is GPB from the encoder (S1810).
  • the received information can be decrypted at the decoder.
  • the decoder determines whether the current slice is GPB based on the information received from the encoder (S1820).
  • generalized_b_slice is used as in the embodiments of Tables 7, 9, and 11, the decoder may determine whether the current slice is a GPB slice through generalized_b_slice.
  • the flag ref_pic_list_identical_flag is used as in the embodiments of Tables 12, 13, and 14, the decoder may determine whether the current slice is a GPB slice through ref_pic_list_identical_flag.
  • it may be determined whether the current slice is a GPB slice based on a separately defined GPB slice type.
  • the decoder derives the reference picture list1 related information from the reference picture list0 related information (S1830). As described above with reference to FIG. 17, in the case of a GPB slice, the encoder may not transmit List 1 related information in order to remove unnecessary redundancy. However, since List0 and List1 are the same in the GPB slice, List1 related information can be derived using information about List0.
  • the encoder transmits List 1 related information to the decoder.
  • the decoder decodes the information received from the encoder to derive list 1 related information (S1840).
  • the decoder may derive the reference picture list related information as described above, and generate the reference picture list using the reference picture list.
  • the reference picture index (refIdxL0) indicating the reference picture of the list 0 referenced by the prediction target block is referenced by the prediction target block.
  • the reference picture index refIdxL1 indicating the reference picture of List 1 indicates the same reference picture.
  • FIG. 19 is a conceptual diagram schematically illustrating an embodiment of a picture in which a reference picture index of list 0 and a reference picture index of list 1 indicate the same reference picture.
  • pictures are shown in display order, that is, in temporal order.
  • the prediction target block in the current picture has both a reference picture index refIdxL0 for indicating the reference picture of List0 and a reference picture index refIdxL1 for indicating the reference picture of List1.
  • the reference picture index for the reference picture referenced by the current prediction target block in list 0 is referred to as index L0
  • the reference picture index for the reference picture referred to by the current prediction target block in list 1 is referred to as index L1.
  • both the index L0 and the index L1 of the prediction target block in the current picture indicate the reference picture List0 [0] closest in temporal distance.
  • the index L1 may not be transmitted. In this case, however, the encoder needs to transmit information on whether the reference picture indicated by the index L0 and the index L1 are the same to the decoder.
  • a flag called ref_idx_l1_present_flag may be used to indicate information on whether the reference picture indicated by the index L0 and the index L1 is the same.
  • the flag is not bound to the name, and any flag may be included in the spirit of the present invention as long as the flag indicates information on whether the reference picture indicated by the index L0 and the index L1 is the same.
  • Table 21 shows a syntax structure related to the reference picture index of the prediction unit according to the embodiment of the present invention.
  • ref_idx_l1_present_flag if ref_idx_l1_present_flag is 1, ref_idx_l1 is present. If ref_idx_l1_present_flag is 0, it means that ref_idx_l1 is not present. If ref_idx_l1_present_flag does not exist, it is assumed to be 1. When ref_idx_l1_present_flag is 0, ref_idx_l1 is estimated to be a value indicating the same reference picture as the reference picture indicated by ref_idx_l0.
  • ref_idx_l1_present_flag is defined only when the prediction mode of the current unit is pair prediction and the current slice is GPB.
  • ref_idx_l1 is defined only when ref_idx_l1_present_flag is 1.
  • index L1 When ref_idx_l1_present_flag is 0, information about index L1 may not be transmitted, where index L0 and index L1 indicate the same reference picture. According to an embodiment of the present invention, since the transmission of index L1 (ref_idx_l1) information may be skipped when the index L0 and the index L1 indicate the same reference picture, the amount of information to be transmitted may be reduced.
  • the index L1 in the encoder may be derived by the following process.
  • the encoder uses this to derive the index L1, but otherwise, the encoder may derive the index L1 from the index L0 information. This is because the reference pictures indicated by index L0 and index L1 are the same when ref_idx_l1_present_flag is 0.
  • the index L1 information should be transmitted.
  • a method capable of reducing the amount of information or the number of bits necessary for the transmission of the index L1 information may be provided. This method may be provided when ref_pic_list_combination_flag is 1, that is, the current slice is a GPB slice, inter_pred_flag is Pred_BI, that is, the inter picture encoding mode is a bi-prediction mode, and ref_idx_l1_present_flag is present and a value of 1 is satisfied. The above condition may be satisfied when the reference pictures indicated by the list L0 and the list L1 are different.
  • a method for reducing the bit amount of index L1 information transmitted is described.
  • the encoder Since the reference pictures indicated by the list L0 and the list L1 are different from each other, the encoder generates the index L1 information by using the remaining indexes except the index indicating the same picture as the reference picture indicated by the index L0 among the indexes of the list 1. Can be. In this case, after subtracting 1 from the number of indexes of List 1 having a value higher than the index indicating the same picture as the reference picture indicated by the index L0, the index L1 information may be generated using the changed index value. In this case, the bit amount of the transmitted index L1 information may be reduced.
  • List 1 may indicate four reference pictures, and values of 0, 1, 2, and 3 are assigned to an index indicating each reference picture.
  • the index of the list 1 indicating the same picture as the reference picture indicated by the index L0 is 1.
  • the index L1 may have an index value of 0, 2 or 3, and the encoder may transmit the indicated index L1 information to the decoder as it is.
  • index L1 may have an index value of 0, 1 or 2 instead of 0, 2 or 3, and the amount of information or bits to be transmitted may be reduced.
  • the process for deriving the index L1 information on the decoder side may be performed as follows.
  • refIdxL1 means a reference picture index for List 1 derived from the decoder.
  • variable refIdxCo is an index value of List 1 indicating the same reference picture as the reference picture indicated by the index L0 (ref_idx_l0).
  • ref_idx_l1 information transmitted from the encoder may be copied to refIdxL1 as it is.
  • ref_idx_l1 information may be generated after 1 is subtracted from the index values. Therefore, the decoder can copy ref_idx_l1 plus 1 to refIdxL1.
  • the decoder can use the ref_idx_l1 information received from the encoder as it is. If the index value of List1 indicating the same picture as the reference picture indicated by index L0 is 1 and the index value of index L1 is 2 or 3, the index value of 1 or 2 subtracted from each index value is index L1 information. Can be used to generate Therefore, the decoder can derive the index L1 information by copying the ref_idx_l1 information received from the encoder plus refIdxL1.
  • index L1 When the index value of index L1 is larger than the index value of List1 indicating the same picture as the reference picture pointed to by index L0, a value obtained by subtracting 1 from the index value may be used to generate index L1 information, so ref_idx_l1 and refIdxCo There may be the same case. Even in this case, the decoder can derive the index L1 information by copying the ref_idx_l1 information received from the encoder plus refIdxL1.
  • the condition (if (! Ref_pic_list_combination_flag)) specifying whether List 0 and List 1 are the same in the syntax of [Table 21] may be modified or removed.
  • the GPB slice condition may be removed.
  • FIG. 20 is a flowchart schematically illustrating a method of transmitting a reference picture index of an encoder according to an embodiment of the present invention.
  • FIG. 20 schematically illustrates a flow of a reference picture index transmission method according to the above-described embodiments of FIGS. 19 and 21 below.
  • the inter prediction mode may be pair prediction.
  • the reference picture index transmission method according to the embodiment of FIG. 20 may also be applied when the inter prediction mode is pair prediction and the current slice is a GPB slice.
  • the encoder determines whether the reference picture index for the list 0 and the reference picture index for the list 1 indicate the same reference picture (S2010). In this case, as in the above-described embodiment, a flag called ref_idx_l1_present_flag may be used. According to an embodiment, the encoder may set the flag value to 0 when the reference picture index for list 0 and the reference picture index for list 1 indicate the same reference picture, and otherwise set the flag value to 1. have.
  • the encoder transmits a flag such as ref_idx_l1_present_flag to the decoder to inform the decoder of this information (S2020).
  • the encoder may not transmit the reference picture index for List1. This is because the index L1 can be derived using the information about the index L0 since the reference picture index for the list 0 and the reference picture index for the list 1 indicate the same reference picture. Since the transmission of the reference picture index for the list 1 is omitted, the amount of information transmitted may be reduced.
  • the encoder If the information indicated by the reference picture index for list 0 and the information indicated by the reference picture index for list 1 are not the same, the encoder not only flags indicating this information but also reference picture index for list 1 to the encoder. It transmits (S2030).
  • the encoder subtracts 1 from each index value of List 1 having a value higher than that of List 1 indicating the same picture as the reference picture indicated by index L0, and then generates index L1 information using the changed index value. It may be. In this case, the bit amount of the transmitted index L1 information may be reduced.
  • FIG. 21 is a flowchart schematically illustrating a reference picture index generation method of a decoder according to an embodiment of the present invention. 19 and Table 21 schematically illustrate the flow of a reference picture index generation method according to an embodiment.
  • the decoder receives from the encoder a flag indicating whether the reference picture index for the list 0 and the reference picture index for the list 1 indicate the same reference picture (S2110).
  • the received flag information can be decoded in the decoder.
  • the decoder determines whether the reference picture index for the list 0 and the reference picture index for the list 1 indicate the same reference picture based on the flag information received from the encoder (S2120). In this case, as in the embodiment of Table 19, a flag called ref_idx_l1_present_flag may be used. When the flag is 0, it may indicate that the index L0 and the index L1 indicate the same reference picture. Otherwise, the index L0 and the index L1 do not indicate the same reference picture.
  • the decoder When the index L0 and the index L1 indicate the same reference picture, the decoder derives the index L1 from the index L0 information (S2130). As described above with reference to FIG. 20, when the index L0 and the index L1 indicate the same reference picture, the encoder may not transmit the index L1 information in order to remove unnecessary duplication. However, since the index L0 and the index L1 indicate the same reference picture, the index L1 may be derived from the index L0 information.
  • the decoder decodes the information received from the encoder to derive the index L1 (S2140).
  • the encoder may generate index L1 information using the changed index value after subtracting 1 from each index value of List 1 having a value higher than that of List 1 indicating the same picture as the reference picture indicated by index L0. have. Therefore, when the index value of index L1 is larger than the index value of List 1 indicating the same picture as the reference picture indicated by index L0, the decoder derives index L1 by adding 1 to the index value of the index L1 information received from the encoder. can do.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Cette invention se rapporte à un procédé de décodage d'image. Des informations d'image, y compris des informations qui indiquent si une tranche actuelle est ou non une tranche GPB, sont reçues en provenance d'un codeur, et une liste d'images de référence 0 et une liste d'images de référence 1 sont générées sur la base des informations qui indiquent si une tranche actuelle est ou non une tranche GPB. Selon la présente invention, la quantité d'informations transmises d'un codeur vers un décodeur est réduite, et le rendement de codage est amélioré.
PCT/KR2011/006587 2010-09-08 2011-09-06 Procédé de décodage d'image et appareil comprenant un procédé destiné à configurer une liste d'images de référence WO2012033327A2 (fr)

Applications Claiming Priority (18)

Application Number Priority Date Filing Date Title
US38070810P 2010-09-08 2010-09-08
US61/380,708 2010-09-08
US201161477622P 2011-04-21 2011-04-21
US201161477625P 2011-04-21 2011-04-21
US61/477,625 2011-04-21
US61/477,622 2011-04-21
US201161478916P 2011-04-25 2011-04-25
US201161478921P 2011-04-25 2011-04-25
US201161478911P 2011-04-25 2011-04-25
US61/478,916 2011-04-25
US61/478,921 2011-04-25
US61/478,911 2011-04-25
US201161481723P 2011-05-02 2011-05-02
US61/481,723 2011-05-02
US201161481727P 2011-05-03 2011-05-03
US61/481,727 2011-05-03
US201161484681P 2011-05-11 2011-05-11
US61/484,681 2011-05-11

Publications (2)

Publication Number Publication Date
WO2012033327A2 true WO2012033327A2 (fr) 2012-03-15
WO2012033327A3 WO2012033327A3 (fr) 2012-06-14

Family

ID=45811065

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2011/006587 WO2012033327A2 (fr) 2010-09-08 2011-09-06 Procédé de décodage d'image et appareil comprenant un procédé destiné à configurer une liste d'images de référence

Country Status (1)

Country Link
WO (1) WO2012033327A2 (fr)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20150009412A (ko) * 2013-07-15 2015-01-26 주식회사 케이티 복수의 레이어를 지원하는 영상의 부호화 및 복호화 방법 및 이를 이용하는 장치
WO2015016536A1 (fr) * 2013-07-30 2015-02-05 주식회사 케이티 Procédé de codage et de décodage d'image supportant une pluralité de couches et appareil utilisant celui-ci
CN105453564A (zh) * 2013-07-30 2016-03-30 株式会社Kt 支持多个层的图像编码和解码方法以及使用该方法的装置
US9894369B2 (en) 2013-07-30 2018-02-13 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
CN109905704A (zh) * 2012-12-14 2019-06-18 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
WO2020056168A1 (fr) 2018-09-12 2020-03-19 Futurewei Technologies, Inc. Signalisation d'index pour structures de liste d'images de référence
CN111435989A (zh) * 2019-01-15 2020-07-21 华为技术有限公司 视频编码、解码方法、装置及计算机存储介质
WO2021091253A1 (fr) * 2019-11-05 2021-05-14 엘지전자 주식회사 Procédé et appareil de codage d'image/vidéo basé sur un type de tranche
CN113711596A (zh) * 2019-09-10 2021-11-26 株式会社 Xris 图像信号编码/解码方法及其装置
CN115485981A (zh) * 2020-05-21 2022-12-16 阿里巴巴集团控股有限公司 视频编码中处理参考图像的方法
CN116235490A (zh) * 2021-06-25 2023-06-06 腾讯美国有限责任公司 用于视频编码的方法和设备

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060068254A (ko) * 2004-12-16 2006-06-21 엘지전자 주식회사 비디오 부호화 방법, 복호화 방법 그리고, 복호화 장치
KR20080007086A (ko) * 2006-07-14 2008-01-17 엘지전자 주식회사 비디오 신호의 디코딩/인코딩 방법 및 장치
KR20080076870A (ko) * 2008-07-18 2008-08-20 엘지전자 주식회사 모션벡터 결정방법
KR20080082575A (ko) * 2008-08-04 2008-09-11 엘지전자 주식회사 레퍼런스 픽쳐 선택 방법

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060068254A (ko) * 2004-12-16 2006-06-21 엘지전자 주식회사 비디오 부호화 방법, 복호화 방법 그리고, 복호화 장치
KR20080007086A (ko) * 2006-07-14 2008-01-17 엘지전자 주식회사 비디오 신호의 디코딩/인코딩 방법 및 장치
KR20080076870A (ko) * 2008-07-18 2008-08-20 엘지전자 주식회사 모션벡터 결정방법
KR20080082575A (ko) * 2008-08-04 2008-09-11 엘지전자 주식회사 레퍼런스 픽쳐 선택 방법

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109905704A (zh) * 2012-12-14 2019-06-18 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109982078B (zh) * 2012-12-14 2023-03-28 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109905704B (zh) * 2012-12-14 2022-12-16 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109982077B (zh) * 2012-12-14 2022-12-13 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109982076B (zh) * 2012-12-14 2022-12-13 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109982078A (zh) * 2012-12-14 2019-07-05 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109982076A (zh) * 2012-12-14 2019-07-05 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
CN109982077A (zh) * 2012-12-14 2019-07-05 Lg 电子株式会社 编码视频的方法、解码视频的方法以及使用其的装置
KR102179125B1 (ko) * 2013-07-15 2020-11-16 주식회사 케이티 복수의 레이어를 지원하는 영상의 부호화 및 복호화 방법 및 이를 이용하는 장치
KR20150009412A (ko) * 2013-07-15 2015-01-26 주식회사 케이티 복수의 레이어를 지원하는 영상의 부호화 및 복호화 방법 및 이를 이용하는 장치
US10148952B2 (en) 2013-07-30 2018-12-04 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
WO2015016536A1 (fr) * 2013-07-30 2015-02-05 주식회사 케이티 Procédé de codage et de décodage d'image supportant une pluralité de couches et appareil utilisant celui-ci
US10200702B2 (en) 2013-07-30 2019-02-05 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
CN105453564B (zh) * 2013-07-30 2019-05-10 株式会社Kt 支持多个层的图像编码和解码方法以及使用该方法的装置
US10148949B2 (en) 2013-07-30 2018-12-04 Kt Corporation Scalable video signal encoding/decoding method and apparatus
US10148950B2 (en) 2013-07-30 2018-12-04 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
US9894369B2 (en) 2013-07-30 2018-02-13 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
US9894357B2 (en) 2013-07-30 2018-02-13 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
US10154259B2 (en) 2013-07-30 2018-12-11 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
CN105453564A (zh) * 2013-07-30 2016-03-30 株式会社Kt 支持多个层的图像编码和解码方法以及使用该方法的装置
US9769478B2 (en) 2013-07-30 2017-09-19 Kt Corporation Scalable video signal encoding/decoding method and apparatus
US9762910B2 (en) 2013-07-30 2017-09-12 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
US9762909B2 (en) 2013-07-30 2017-09-12 Kt Corporation Image encoding and decoding method supporting plurality of layers and apparatus using same
AU2019337644B2 (en) * 2018-09-12 2023-06-29 Huawei Technologies Co., Ltd. Index signaling for reference picture list structures
WO2020056168A1 (fr) 2018-09-12 2020-03-19 Futurewei Technologies, Inc. Signalisation d'index pour structures de liste d'images de référence
EP3847804A4 (fr) * 2018-09-12 2021-10-27 Huawei Technologies Co., Ltd. Signalisation d'index pour structures de liste d'images de référence
EP4300954A3 (fr) * 2018-09-12 2024-01-10 Huawei Technologies Co., Ltd. Signalisation candidate pour structures de liste d'images de référence
EP4164227A1 (fr) * 2018-09-12 2023-04-12 Huawei Technologies Co., Ltd. Codeur vidéo, décodeur vidéo et procédés correspondants
CN111435989A (zh) * 2019-01-15 2020-07-21 华为技术有限公司 视频编码、解码方法、装置及计算机存储介质
CN111435989B (zh) * 2019-01-15 2023-11-17 华为技术有限公司 视频编码、解码方法、装置及计算机存储介质
CN113711596A (zh) * 2019-09-10 2021-11-26 株式会社 Xris 图像信号编码/解码方法及其装置
CN114762350A (zh) * 2019-11-05 2022-07-15 Lg 电子株式会社 基于切片类型的图像/视频编译方法和设备
US11825096B2 (en) 2019-11-05 2023-11-21 Lg Electronics Inc. Slice type-based image/video coding method and apparatus
WO2021091253A1 (fr) * 2019-11-05 2021-05-14 엘지전자 주식회사 Procédé et appareil de codage d'image/vidéo basé sur un type de tranche
CN115485981A (zh) * 2020-05-21 2022-12-16 阿里巴巴集团控股有限公司 视频编码中处理参考图像的方法
EP4136840A4 (fr) * 2021-06-25 2023-10-11 Tencent America Llc Procédé et appareil de codage vidéo
CN116235490A (zh) * 2021-06-25 2023-06-06 腾讯美国有限责任公司 用于视频编码的方法和设备

Also Published As

Publication number Publication date
WO2012033327A3 (fr) 2012-06-14

Similar Documents

Publication Publication Date Title
US10652551B2 (en) Inter prediction method and apparatus therefor
EP3205103B1 (fr) Unification inter et intra copie de bloc
WO2012033327A2 (fr) Procédé de décodage d'image et appareil comprenant un procédé destiné à configurer une liste d'images de référence
US9451256B2 (en) Reusing parameter sets for video coding
KR102085395B1 (ko) 스케일러블 및 멀티-뷰 비디오 코딩에서의 진보된 잔차 예측
AU2013232041B2 (en) Motion vector coding and bi-prediction in HEVC and its extensions
KR102006531B1 (ko) Hevc 및 확장들에 대한 비디오 파라미터 세트
US9420286B2 (en) Temporal motion vector prediction in HEVC and its extensions
KR102381380B1 (ko) 영상 디코딩 방법 및 이를 이용하는 장치
CN107660341A (zh) 切片级帧内块复制及其它视频译码改进
CA2922408C (fr) Type de dependance inter-vues en mv-hevc
AU2013280848A1 (en) Random access and signaling of long-term reference pictures in video coding
WO2014052424A1 (fr) Codage vidéo échelonnable en hevc
AU2013272130A1 (en) Signaling data for long term reference pictures for video coding
KR102282331B1 (ko) 인터-레이어 신택스 예측 제어
KR20220025017A (ko) 비디오 코딩에서 참조 픽처 리샘플링을 위한 다수의 픽처 크기 및 적합성 윈도우의 핸들링
CN115191117A (zh) 视频编码中的参考图片列表和共位图片信令通知

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11823760

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11823760

Country of ref document: EP

Kind code of ref document: A2