WO2017065490A1 - Procédé destiné au codage/décodage d'une image et appareil s'y rapportant - Google Patents

Procédé destiné au codage/décodage d'une image et appareil s'y rapportant Download PDF

Info

Publication number
WO2017065490A1
WO2017065490A1 PCT/KR2016/011419 KR2016011419W WO2017065490A1 WO 2017065490 A1 WO2017065490 A1 WO 2017065490A1 KR 2016011419 W KR2016011419 W KR 2016011419W WO 2017065490 A1 WO2017065490 A1 WO 2017065490A1
Authority
WO
WIPO (PCT)
Prior art keywords
split
divided
unit
block
division
Prior art date
Application number
PCT/KR2016/011419
Other languages
English (en)
Korean (ko)
Inventor
손은용
허진
남정학
박승욱
예세훈
Original Assignee
엘지전자(주)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 엘지전자(주) filed Critical 엘지전자(주)
Publication of WO2017065490A1 publication Critical patent/WO2017065490A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/11Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/129Scanning of coding units, e.g. zig-zag scan of transform coefficients or flexible macroblock ordering [FMO]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding

Definitions

  • the present invention relates to a still image or moving image processing method, and more particularly, to a method for encoding / decoding a still image or moving image using a quantization unit (or block) structure and an apparatus supporting the same.
  • Compression coding refers to a series of signal processing techniques for transmitting digitized information through a communication line or for storing in a form suitable for a storage medium.
  • Media such as an image, an image, an audio, and the like may be a target of compression encoding.
  • a technique of performing compression encoding on an image is called video image compression.
  • Next-generation video content will be characterized by high spatial resolution, high frame rate and high dimensionality of scene representation. Processing such content would result in a tremendous increase in terms of memory storage, memory access rate, and processing power.
  • An object of the present invention is to propose a method of defining or dividing a quantization unit, which is a basic block structure, when quantizing / dequantizing an image.
  • An object of the present invention is to propose a method for signaling a partition structure of a quantization unit.
  • a method of decoding a video by a decoding apparatus includes parsing a QU division flag indicating whether a quantization unit (QU) is split from a current block, and activating the QU division flag.
  • the method may include deriving a split form of the QU and deriving a quantization parameter (QP) for each divided QU.
  • An aspect of the present invention provides a decoding apparatus for decoding an image, comprising: a QU division flag parser for parsing a QU division flag indicating whether a quantization unit (QU) is split from a current block, the QU division flag When is activated, it may include a QU division type derivation unit for deriving the division form of the QU and a QP derivation unit for deriving a quantization parameter (QP) for each divided QU.
  • a QU division flag parser for parsing a QU division flag indicating whether a quantization unit (QU) is split from a current block
  • the QU division flag When is activated, it may include a QU division type derivation unit for deriving the division form of the QU and a QP derivation unit for deriving a quantization parameter (QP) for each divided QU.
  • the QU may be divided in a quad-tree form from the current block.
  • the QU may be divided according to a QU division type selected from a plurality of predefined QU division types.
  • the partitioned form of the QU may be derived by parsing information on the QU partitioned mode indicating the partitioned form of the QU.
  • the split form of the QU may be derived according to an intra prediction mode applied to the current block.
  • a split form may be determined as 2N ⁇ N or 2N ⁇ nU.
  • a split form may be determined as N ⁇ 2N or nL ⁇ 2N.
  • the QP difference value is parsed, and the QP may be derived by summing a QP predictor derived from a QP of a neighboring QU and the parsed QP difference value.
  • the neighboring QU may be a QU neighboring the top of the QU or a QU neighboring to the left of the QU.
  • quantization can be flexibly performed by defining a basic unit for performing quantization / dequantization.
  • FIG. 1 is a schematic block diagram of an encoder in which encoding of a still image or video signal is performed according to an embodiment to which the present invention is applied.
  • FIG. 2 is a schematic block diagram of a decoder in which encoding of a still image or video signal is performed according to an embodiment to which the present invention is applied.
  • FIG. 3 is a diagram for describing a partition structure of a coding unit that may be applied to the present invention.
  • FIG. 4 is a diagram for explaining a prediction unit applicable to the present invention.
  • FIG. 5 is a diagram illustrating a structure in which a prediction unit, a transform unit, and a quantization unit are divided from a coding unit according to an embodiment of the present invention.
  • FIG. 6 is a diagram illustrating a structure in which a quantization unit is divided from a coding unit according to an embodiment of the present invention.
  • FIG. 7 is a diagram illustrating a structure in which a quantization unit is divided from a transform unit according to an embodiment of the present invention.
  • FIG. 8 is a diagram illustrating an image decoding method according to an embodiment of the present invention.
  • FIG. 9 is a diagram illustrating a quantization unit divider according to an embodiment of the present invention.
  • the term 'block' or 'unit' refers to a unit in which a process of encoding / decoding such as prediction, transformation, and / or quantization is performed, and may be configured as a multidimensional array of samples (or pixels, pixels).
  • 'Block' or 'unit' may mean a multi-dimensional array of samples for luma components, or may mean a multi-dimensional array of samples for chroma components.
  • the multi-dimensional arrangement of the sample for the luma component and the multi-dimensional arrangement of the sample for the chroma component may also be included.
  • 'block' or 'unit' refers to a coding block (CB) that represents an array of samples to be encoded / decoded, and a coding tree block composed of a plurality of coding blocks (CTB).
  • CB coding block
  • CB coding block
  • CB coding tree block composed of a plurality of coding blocks
  • PB Prediction Block
  • PU Prediction Unit
  • TB Transform Block
  • a 'block' or 'unit' is a syntax structure used in encoding / decoding an array of samples for a luma component and / or a chroma component. can be interpreted to include a sturcture.
  • the syntax structure refers to zero or more syntax elements existing in the bitstream in a specific order, and the syntax element refers to an element of data represented in the bitstream.
  • a 'block' or 'unit' includes a coding unit (CU) including a coding block (CB) and a syntax structure used for encoding the coding block (CB), and a plurality of coding units.
  • TUs transform units
  • the 'block' or 'unit' is not necessarily limited to an array of square or rectangular samples (or pixels or pixels), and polygonal samples having three or more vertices (or pixels or pixels). It can also mean an array of. In this case, it may also be referred to as a polygon block or a polygon unit.
  • FIG. 1 is a schematic block diagram of an encoder in which encoding of a still image or video signal is performed according to an embodiment to which the present invention is applied.
  • the encoder 100 may include an image divider 110, a subtractor 115, a transform unit 120, a quantizer 130, an inverse quantizer 140, an inverse transform unit 150, and a filtering unit. 160, a decoded picture buffer (DPB) 170, a predictor 180, and an entropy encoder 190.
  • the predictor 180 may include an inter predictor 181 and an intra predictor 182.
  • the image divider 110 divides an input video signal (or a picture or a frame) input to the encoder 100 into one or more blocks.
  • the subtractor 115 outputs a predicted signal (or a predicted block) output from the predictor 180 (that is, the inter predictor 181 or the intra predictor 182) in the input image signal. ) To generate a residual signal (or differential block). The generated difference signal (or difference block) is transmitted to the converter 120.
  • the transform unit 120 may convert a differential signal (or a differential block) into a transform scheme (eg, a discrete cosine transform (DCT), a discrete sine transform (DST), a graph-based transform (GBT), and a karhunen-loeve transform (KLT)). Etc.) to generate transform coefficients.
  • a transform scheme eg, a discrete cosine transform (DCT), a discrete sine transform (DST), a graph-based transform (GBT), and a karhunen-loeve transform (KLT)
  • the quantization unit 130 quantizes the transform coefficients and transmits the transform coefficients to the entropy encoding unit 190, and the entropy encoding unit 190 entropy codes the quantized signals and outputs them as bit streams.
  • the quantized signal output from the quantization unit 130 may be used to generate a prediction signal.
  • the quantized signal may recover the differential signal by applying inverse quantization and inverse transformation through an inverse quantization unit 140 and an inverse transformation unit 150 in a loop.
  • a reconstructed signal (or a reconstruction block) may be generated by adding the reconstructed difference signal to a prediction signal output from the inter predictor 181 or the intra predictor 182.
  • the filtering unit 160 applies filtering to the reconstruction signal and outputs it to the reproduction apparatus or transmits the decoded picture buffer to the decoding picture buffer 170.
  • the filtered signal transmitted to the decoded picture buffer 170 may be used as the reference picture in the inter prediction unit 181. As such, by using the filtered picture as a reference picture in the inter prediction mode, not only image quality but also encoding efficiency may be improved.
  • the decoded picture buffer 170 may store the filtered picture for use as a reference picture in the inter prediction unit 181.
  • the inter prediction unit 181 performs temporal prediction and / or spatial prediction to remove temporal redundancy and / or spatial redundancy with reference to a reconstructed picture.
  • the reference picture used to perform the prediction is a transformed signal that has been quantized and dequantized in units of blocks at the time of encoding / decoding in the previous time, blocking artifacts or ringing artifacts may exist. have.
  • the inter prediction unit 181 may interpolate the signals between pixels in sub-pixel units by applying a lowpass filter to solve performance degradation due to discontinuity or quantization of such signals.
  • the sub-pixels mean virtual pixels generated by applying an interpolation filter
  • the integer pixels mean actual pixels existing in the reconstructed picture.
  • the interpolation method linear interpolation, bi-linear interpolation, wiener filter, or the like may be applied.
  • the interpolation filter may be applied to a reconstructed picture to improve the precision of prediction.
  • the inter prediction unit 181 may generate an interpolation pixel by applying an interpolation filter to an integer pixel and perform prediction using an interpolated block composed of interpolated pixels.
  • the intra predictor 182 predicts the current block by referring to samples in the vicinity of the block to which the current encoding is to be performed.
  • the intra prediction unit 182 may perform the following process to perform intra prediction. First, reference samples necessary for generating a prediction signal may be prepared. The predicted signal (predicted block) may be generated using the prepared reference sample. Then, the prediction mode is encoded. In this case, the reference sample may be prepared through reference sample padding and / or reference sample filtering. Since the reference sample has been predicted and reconstructed, there may be a quantization error. Accordingly, the reference sample filtering process may be performed for each prediction mode used for intra prediction to reduce such an error.
  • the predicted signal (or predicted block) generated by the inter predictor 181 or the intra predictor 182 is used to generate a reconstruction signal (or reconstruction block) or a differential signal (or differential). Block).
  • FIG. 2 is a schematic block diagram of a decoder in which encoding of a still image or video signal is performed according to an embodiment to which the present invention is applied.
  • the decoder 200 includes an entropy decoding unit 210, an inverse quantization unit 220, an inverse transform unit 230, an adder 235, a filtering unit 240, and a decoded picture buffer (DPB).
  • Buffer Unit (250) the prediction unit 260 may be configured.
  • the predictor 260 may include an inter predictor 261 and an intra predictor 262.
  • the reconstructed video signal output through the decoder 200 may be reproduced through the reproducing apparatus.
  • the decoder 200 receives a signal (ie, a bit stream) output from the encoder 100 of FIG. 1, and the received signal is entropy decoded through the entropy decoding unit 210.
  • the inverse quantization unit 220 obtains a transform coefficient from the entropy decoded signal using the quantization step size information.
  • the inverse transform unit 230 applies an inverse transform scheme to inverse transform the transform coefficients to obtain a residual signal (or a differential block).
  • the adder 235 outputs the obtained difference signal (or difference block) from the predictor 260 (that is, the predicted signal (or prediction) output from the predictor 260 (that is, the inter predictor 261 or the intra predictor 262). By adding to the generated block), a reconstructed signal (or a restored block) is generated.
  • the filtering unit 240 applies filtering to the reconstructed signal (or the reconstructed block) and outputs the filtering to the reproduction device or transmits the decoded picture buffer unit 250 to the reproduction device.
  • the filtered signal transmitted to the decoded picture buffer unit 250 may be used as a reference picture in the inter predictor 261.
  • the embodiments described by the filtering unit 160, the inter prediction unit 181, and the intra prediction unit 182 of the encoder 100 are respectively the filtering unit 240, the inter prediction unit 261, and the decoder of the decoder. The same may be applied to the intra predictor 262.
  • a still image or video compression technique uses a block-based image compression method.
  • the block-based image compression method is a method of processing an image by dividing the image into specific block units, and may reduce memory usage and calculation amount.
  • FIG. 3 is a diagram for describing a partition structure of a coding unit that may be applied to the present invention.
  • the encoder splits one image (or picture) into units of a coding tree unit (CTU) in a rectangular shape.
  • CTU coding tree unit
  • one CTU is sequentially encoded according to a raster scan order.
  • the size of the CTU may be set to any one of 64 ⁇ 64, 32 ⁇ 32, and 16 ⁇ 16.
  • the encoder may select and use the size of the CTU according to the resolution of the input video or the characteristics of the input video.
  • the CTU includes a coding tree block (CTB) for luma components and a CTB for two chroma components corresponding thereto.
  • CTB coding tree block
  • One CTU may be divided into a quad-tree structure. That is, one CTU has a square shape and is divided into four units having a half horizontal size and a half vertical size to generate a coding unit (CU). have. This partitioning of the quad-tree structure can be performed recursively. That is, a CU is hierarchically divided into quad-tree structures from one CTU.
  • CU coding unit
  • the CU refers to a basic unit of coding in which an input image is processed, for example, intra / inter prediction is performed.
  • the CU includes a coding block (CB) for a luma component and a CB for two chroma components corresponding thereto.
  • CB coding block
  • the size of a CU may be set to any one of 64 ⁇ 64, 32 ⁇ 32, 16 ⁇ 16, and 8 ⁇ 8.
  • the root node of the quad-tree is associated with the CTU.
  • the quad-tree is split until it reaches a leaf node, which corresponds to a CU.
  • the CTU may not be divided according to the characteristics of the input image.
  • the CTU corresponds to a CU.
  • a node that is no longer divided ie, a leaf node
  • CU a node that is no longer divided
  • CU a node that is no longer divided
  • CU a node corresponding to nodes a, b, and j are divided once in the CTU and have a depth of one.
  • a node (ie, a leaf node) that is no longer divided in a lower node having a depth of 2 corresponds to a CU.
  • CU (c), CU (h) and CU (i) corresponding to nodes c, h and i are divided twice in the CTU and have a depth of two.
  • a node that is no longer partitioned (ie, a leaf node) in a lower node having a depth of 3 corresponds to a CU.
  • CU (d), CU (e), CU (f), and CU (g) corresponding to nodes d, e, f, and g are divided three times in the CTU, Has depth.
  • the maximum size or the minimum size of the CU may be determined according to characteristics (eg, resolution) of the video image or in consideration of encoding efficiency. Information about this or information capable of deriving the information may be included in the bitstream.
  • a CU having a maximum size may be referred to as a largest coding unit (LCU), and a CU having a minimum size may be referred to as a smallest coding unit (SCU).
  • LCU largest coding unit
  • SCU smallest coding unit
  • a CU having a tree structure may be hierarchically divided with predetermined maximum depth information (or maximum level information).
  • Each partitioned CU may have depth information. Since the depth information indicates the number and / or degree of division of the CU, the depth information may include information about the size of the CU.
  • the size of the SCU can be obtained by using the size and maximum depth information of the LCU. Or conversely, using the size of the SCU and the maximum depth information of the tree, the size of the LCU can be obtained.
  • information indicating whether the corresponding CU is split may be transmitted to the decoder.
  • This split mode is included in all CUs except the SCU. For example, if the flag indicating whether to split or not is '1', the CU is divided into 4 CUs again. If the flag indicating whether to split or not is '0', the CU is not divided further. Processing may be performed.
  • a CU is a basic unit of coding in which intra prediction or inter prediction is performed.
  • HEVC divides a CU into prediction units (PUs) in order to code an input image more effectively.
  • the PU is a basic unit for generating a prediction block, and may generate different prediction blocks in PU units within one CU. However, PUs belonging to one CU are not mixed with intra prediction and inter prediction, and PUs belonging to one CU are coded by the same prediction method (ie, intra prediction or inter prediction).
  • the PU is not divided into quad-tree structures, but is divided once in a predetermined form in one CU. This will be described with reference to the drawings below.
  • FIG. 4 is a diagram for explaining a prediction unit applicable to the present invention.
  • the PU is divided differently according to whether an intra prediction mode or an inter prediction mode is used as a coding mode of a CU to which the PU belongs.
  • FIG. 4A illustrates a PU when an intra prediction mode is used
  • FIG. 4B illustrates a PU when an inter prediction mode is used.
  • N ⁇ N type PU when divided into N ⁇ N type PU, one CU is divided into four PUs, and different prediction blocks are generated for each PU unit.
  • the division of the PU may be performed only when the size of the CB for the luminance component of the CU is the minimum size (that is, the CU is the SCU).
  • one CU has 8 PU types (ie, 2N ⁇ 2N). , N ⁇ N, 2N ⁇ N, N ⁇ 2N, nL ⁇ 2N, nR ⁇ 2N, 2N ⁇ nU, 2N ⁇ nD).
  • PU partitioning in the form of N ⁇ N may be performed only when the size of the CB for the luminance component of the CU is the minimum size (that is, the CU is the SCU).
  • AMP Asymmetric Motion Partition
  • 'n' means a 1/4 value of 2N.
  • AMP cannot be used when the CU to which the PU belongs is a CU of the minimum size.
  • an optimal partitioning structure of a coding unit (CU), a prediction unit (PU), and a transformation unit (TU) is subjected to the following process to perform a minimum rate-distortion. It can be determined based on the value. For example, looking at the optimal CU partitioning process in 64 ⁇ 64 CTU, rate-distortion cost can be calculated while partitioning from a 64 ⁇ 64 CU to an 8 ⁇ 8 CU.
  • the specific process is as follows.
  • the partition structure of the optimal PU and TU that generates the minimum rate-distortion value is determined by performing inter / intra prediction, transform / quantization, inverse quantization / inverse transform, and entropy encoding for a 64 ⁇ 64 CU.
  • the 32 ⁇ 32 CU is subdivided into four 16 ⁇ 16 CUs, and a partition structure of an optimal PU and TU that generates a minimum rate-distortion value for each 16 ⁇ 16 CU is determined.
  • 16 ⁇ 16 blocks by comparing the sum of the rate-distortion values of the 16 ⁇ 16 CUs calculated in 3) above with the rate-distortion values of the four 8 ⁇ 8 CUs calculated in 4) above. Determine the partition structure of the optimal CU within. This process is similarly performed for the remaining three 16 ⁇ 16 CUs.
  • a prediction mode is selected in units of PUs, and prediction and reconstruction are performed in units of actual TUs for the selected prediction mode.
  • the TU means a basic unit in which actual prediction and reconstruction are performed.
  • the TU includes a transform block (TB) for a luma component and a TB for two chroma components corresponding thereto.
  • TB transform block
  • the TUs are hierarchically divided into quad-tree structures from one CU to be coded.
  • the TU divided from the CU can be further divided into smaller lower TUs.
  • the size of the TU may be set to any one of 32 ⁇ 32, 16 ⁇ 16, 8 ⁇ 8, and 4 ⁇ 4.
  • a root node of the quad-tree is associated with a CU.
  • the quad-tree is split until it reaches a leaf node, which corresponds to a TU.
  • the CU may not be divided according to the characteristics of the input image.
  • the CU corresponds to a TU.
  • a node ie, a leaf node
  • TU (a), TU (b), and TU (j) corresponding to nodes a, b, and j are divided once in a CU and have a depth of 1.
  • FIG. 3B TU (a), TU (b), and TU (j) corresponding to nodes a, b, and j are divided once in a CU and have a depth of 1.
  • a node (ie, a leaf node) that is no longer divided in a lower node having a depth of 2 corresponds to a TU.
  • TU (c), TU (h), and TU (i) corresponding to nodes c, h, and i are divided twice in a CU and have a depth of two.
  • a node that is no longer partitioned (ie, a leaf node) in a lower node having a depth of 3 corresponds to a CU.
  • TU (d), TU (e), TU (f), and TU (g) corresponding to nodes d, e, f, and g are divided three times in a CU. Has depth.
  • a TU having a tree structure may be hierarchically divided with predetermined maximum depth information (or maximum level information). Each divided TU may have depth information. Since the depth information indicates the number and / or degree of division of the TU, it may include information about the size of the TU.
  • information indicating whether the corresponding TU is split may be delivered to the decoder.
  • This partitioning information is included in all TUs except the smallest TU. For example, if the value of the flag indicating whether to split is '1', the corresponding TU is divided into four TUs again. If the value of the flag indicating whether to split is '0', the corresponding TU is no longer divided.
  • various blocks are designed to be flexibly divided, so that the most suitable block division structure is selected under a characteristic of a video and a limited bit rate condition.
  • Examples of such blocks include a coding unit (CU) that is a basic unit of coding, a prediction unit (PU) that is a basic unit of prediction, and a transform unit (TU), which is a basic unit of transform.
  • CU coding unit
  • PU prediction unit
  • TU transform unit
  • Table 1 illustrates a block structure in intra prediction (ie, intra prediction).
  • Table 2 illustrates a block structure in inter prediction (ie, inter prediction).
  • the maximum size of the CU illustrated in Tables 1 and 2 is assumed to be 64x64 and the maximum depth of the CU is 4. Accordingly, a CU may have four sizes of 64 ⁇ 64, 32 ⁇ 32, 16 ⁇ 16, and 8 ⁇ 8, and PUs and TUs that can be selected for each CU size are shown in Tables 1 and 2. In this case, it is assumed that the maximum / minimum size of the TU is 32 ⁇ 32/4 ⁇ 4 and the maximum depth of the TU is 3.
  • the present invention defines a quantization unit (QU) structure that allows flexible quantization in addition to the existing blocks, and performs quantization / dequantization suitable for each situation using the defined QU. Suggest how to. Thus, higher coding efficiency can be achieved.
  • QU quantization unit
  • An embodiment of the present invention proposes a definition and division method for a quantization unit (QU), which is a basic unit for performing quantization and inverse quantization operations.
  • QU quantization unit
  • the QU according to the present invention may be defined as a unit (region) for performing the same quantization or inverse quantization operation and / or a unit (region) in which quantization or inverse quantization is performed using the same quantization step size. .
  • the QU may be divided into quad-trees based on the CU. This will be described with reference to the drawings below.
  • FIG. 5 is a diagram illustrating a structure in which a prediction unit, a transform unit, and a quantization unit are divided from a coding unit according to an embodiment of the present invention.
  • FIG. 5A illustrates a structure in which CUs of 1 to 13 are divided from 64 ⁇ 64 CTUs.
  • FIG. 5 (b) illustrates a structure in which a PU is divided from each CU partitioned from the CTU illustrated in FIG. 5 (a).
  • CU3, CU5, and CU13 are divided into PUs of various types.
  • CU3 is divided into 3_P1 and 3_P2 in a 2N ⁇ N divided form
  • CU5 is divided into 5_P1 and 5_P2 in an N ⁇ 2N divided form
  • CU13 is divided into 13_P1 and 13_P2 in an N ⁇ 2N divided form.
  • the remaining CUs that are not divided use the same size PU as the CU.
  • FIG. 5 (c) illustrates a structure in which a TU is divided from each CU partitioned from the CTU illustrated in FIG. 5 (a).
  • CU3, CU5, CU11, and CU13 are divided into TUs having various depths.
  • CU3 is divided into 3_T1, 3_T2, 3_T3, 3_T4 at depth 1
  • CU5 is divided into 5_T1, 5_T9, 5_T10 at depth 1, 5_T2, 5_T7, 5_T8 at depth 2, 5_T3, 5_T4 at depth 3,
  • 5_T5, 5_T6, CU11 is divided into 11_T1, 11_T2, 11_T3, and 11_T4 of depth 1
  • CU13 represents a state divided into 13_T1, 13_T2, 13_T3, and 13_T4 of depth 1.
  • CUs that are not divided use TUs having the same size as the CU.
  • FIG. 5 (d) illustrates a partition structure of QUs from each CU partitioned from the CTU illustrated in FIG. 5 (a) according to one embodiment of the present invention.
  • the QU may be divided from the CU in a quad-tree form as in the conventional CU and TU partitioning scheme.
  • CU3, CU5, and CU6 are divided into QUs having various depths.
  • CU3 is divided into 3_Q1, 3_Q2, 3_Q3, 3_Q4 at depth 1
  • CU5 is divided into 5_Q1, 5_Q6, 5_Q7 at depth 1, 5_Q2, 5_Q3, 5_Q4, 5_Q5 at depth 2
  • CU6 is 6_Q1, 6_Q2, at depth 1
  • the state divided into 6_Q7, 6_Q3, 6_Q4, 6_Q5, and 6_Q6 of depth 2 is shown.
  • the encoder may determine the optimal partition structure of the QU based on a minimum rate-distortion value as in the CU / TU partitioning scheme described above.
  • the QU may be split into a new form that is not previously defined based on the CU.
  • the partitioning structure of the QU may be partitioned into a rectangle (that is, a binary-tree) rather than a quad-tree, or various reflections reflecting characteristics of a prediction direction (according to an intra prediction mode).
  • a form of partitioning scheme may be used. This will be described with reference to the drawings below.
  • FIG. 6 is a diagram illustrating a structure in which a quantization unit is divided from a coding unit according to an embodiment of the present invention.
  • N represents the size of the width and height of the CU.
  • splitting forms such as 2N ⁇ 2N, N ⁇ N, N ⁇ 2N, 2N ⁇ N, nL ⁇ 2N, and 2N ⁇ nU according to existing PU splitting schemes (or QU can be split from the CU by using the split mode).
  • nR ⁇ 2N and 2N in the partition mode of the conventional PU are reflected by reflecting the tendency of 2-dimensional (2D) transformation in which non-zero transform coefficients are concentrated toward a low frequency.
  • XnD may not be used.
  • 6 (a) to 6 (f) illustrate a rectangular partition structure (for example, N ⁇ 2N, 2N ⁇ N, nL ⁇ 2N, 2N ⁇ nU) according to a conventional PU partitioning scheme.
  • the present invention is not limited thereto. That is, the QU may be divided from the CU in the form of a binary tree, without being limited to a rectangular partition structure according to a conventional PU partitioning scheme.
  • the QU may be split from the CU in a split form (or split mode) as shown in FIGS. 6 (g) to 6 (l).
  • Fig. 6 (g) is divided into N / 4 ⁇ N / 4 sized QUs and the rest
  • Fig. 6 (h) is divided into N / 2 ⁇ N / 2 sized QUs and the rest
  • Fig. 6 (i) The case of dividing into 3N / 4x3N / 4 sized QU and the rest is illustrated.
  • 6 (j) to 6 (l) are (0, N / 2-1) and (N) assuming that the coordinates of the top-left sample of the N ⁇ N CU are (0,0).
  • CU is divided into two QUs based on a line segment connecting / 2-1,0), and the CU is divided into two QUs based on a line segment connecting (0, N-1) and (N-1,0).
  • FIGS. 6 (l) to 6 (l) illustrate a split form reflecting a tendency of 2-dimensional (2D) transformation in which non-zero transform coefficients are concentrated toward a low frequency. Doing. According to the degree to which the non-zero transform coefficients are driven toward the low frequency, the division form of any one of FIGS. 6 (g) to 6 (i) may be determined by the encoder. Similarly, the division form of any one of FIGS. 6 (j) to 6 (l) may be determined by the encoder.
  • the split form of the QU may be determined according to a distribution characteristic of a non-zero transform coefficient of the transform coefficient according to the intra prediction direction. For example, if a vertical in-picture prediction in which non-zero transform coefficients are concentrated in the top row of a block is used for a CU or TU, FIG. 6 (d) or FIG. A division mode such as 6 (f) may be selected. In addition, when the intra-picture prediction in the horizontal direction in which a non-zero transform coefficient is concentrated in the column on the left side of the block is used for the CU or TU, FIG. 6 (c) or FIG. 6 ( The division mode as e) may be selected.
  • the most suitable division of FIGS. 6 (g) to 6 (l) is used.
  • the mode can be selected.
  • the most suitable split mode among FIGS. 6 (g) to 6 (l) may be selected by the encoder, or between the prediction directions in each screen and the split modes according to FIGS. 6 (g) to 6 (l).
  • Correspondence relationships may be predefined with certain rules. As such, when the corresponding relationship between the specific intra prediction mode and the splitting mode of the QU is predefined, there is an advantage that no separate signaling for the QU splitting mode from the encoder to the decoder is required.
  • All or part of the division structure of the various QUs illustrated in FIG. 6 correspond to division types of the QU (division modes of the QU), and the QU may be divided from the CU by any one division structure determined by the encoder. have. Also, as described above, when a corresponding relationship between a specific intra prediction mode and a split mode of the QU is predefined, the decoder may derive a split form of the QU split from the current CU according to the intra prediction mode.
  • FIG. 7 is a diagram illustrating a structure in which a quantization unit is divided from a transform unit according to an embodiment of the present invention.
  • FIG. 7 illustrates a structure in which a QU is divided from each of the TUs divided as shown in FIG. 5C, and the shaded portion in FIG. 7 illustrates a QU divided based on the TUs.
  • TU1, 3_T2, TU4, 5_T9, and TU12 are divided into various types of QUs.
  • TU1 is divided into 1_Q1 and 1_Q2 in the form of 2N ⁇ N division
  • 3_T2 is divided into 3_T2Q1 and 3_T2Q2 in the form of 2N ⁇ N division
  • TU4 is divided into 4_Q1 and 4_Q2 in the form of 2N ⁇ NU division
  • 5_T9 is nL ⁇ 2N.
  • the divided form is divided into 5_T9Q1 and 5_T9Q2, and TU12 exemplifies a state divided into 12_Q1, 12_Q2, 12_Q3, and 12_Q4 in an N ⁇ N divided form (or quad-tree form).
  • FIG. 5 (d) partitions the QU based on a CU, the QU can be partitioned regardless of the TU partition structure, whereas the QU partition is proposed in FIG.
  • the TU-based QU partitioning splits the QU based on each partitioned TU, causing a limitation that the QU cannot be larger than the TU.
  • a method of dividing a QU based on a CU or a TU is illustrated, but the present invention is not limited thereto, and the QU may be divided based on the CTU. Also in this case, a quad-tree shape may be applied as shown in FIG. 5 and a partitioned shape as shown in FIG. 6 may also be applied.
  • a method of decoding QU split information in a decoder is as follows.
  • FIG. 8 is a diagram illustrating an image decoding method according to an embodiment of the present invention.
  • the decoder parses a QU Split Flag for a current block (S801).
  • the QU Split Flag may indicate whether the QU is split from a reference block (eg, CTU, CU, TU, etc.) that is a reference (ie, started) of the QU split. That is, it may be indicated whether the current block is divided into QUs.
  • a reference block eg, CTU, CU, TU, etc.
  • the QU Split Flag may be transmitted in syntax for the reference block.
  • the decoder derives the QU Split Type (S802).
  • the decoder may perform Step S803 without performing Step S802.
  • the decoder may parse the QU Split Mode.
  • all or part of the split form of the QU illustrated in FIG. 6 is predefined as the QU split modes, and the QU split mode selected by the encoder among the predefined QU split modes is the split mode (QU Split Mode). ) May be sent.
  • a QU Split Mode indicating a split type (or type) of a QU may not be explicitly transmitted from the encoder to the decoder.
  • the decoder may determine the QU division type of the current block according to the intra prediction mode (that is, the prediction direction) applied to the current block. That is, the split form of the QU may be determined according to a distribution characteristic of a non-zero transform coefficient of the transform coefficient according to the intra prediction direction.
  • the decoder may The division mode as shown in FIG. 6D or 6F may be selected.
  • a horizontal intra prediction is used for the current block (CU or TU) in which a non-zero transform coefficient is concentrated in a column on the left side of the block, the decoder is described with reference to FIG. 6. (c) or the division mode shown in FIG. 6 (e) can be selected.
  • the most suitable division of FIGS. 6 (g) to 6 (l) is used.
  • the mode may be selected by the decoder.
  • the correspondence between the prediction direction in each screen and the split form according to FIGS. 6 (g) to 6 (l) may be predefined with a predetermined rule, and the decoder may be defined in the screen according to the predefined correspondence.
  • the QU division type may be determined according to the prediction direction.
  • the decoder derives a quantization parameter (QP) for each QU (S803).
  • the quantization parameter refers to a variable used in the decoding process for scaling of transform coefficient levels.
  • the decoder may parse the quantization parameter difference value QP_ ⁇ for each QU and add the parsed QP_ ⁇ and the quantization parameter predictor QP_pred to derive a QP.
  • the QP values of neighboring QUs may be used as the QP_pred for the current QU.
  • the neighbor QU used as QP_pred may be selected by the encoder and information about the neighbor QU used (ie, whether it is an upper neighbor QU or a left neighbor QU) may be transmitted to the decoder.
  • the QP value of the current picture may be used as QP_pred.
  • the QP value of the current QU may be set as in Equation 1 below.
  • Quantization and inverse quantization in each QU are performed using QP_QU derived from Equation (1).
  • FIG. 9 is a diagram illustrating a quantization unit divider according to an embodiment of the present invention.
  • the quantization unit divider implements the functions, processes, and / or methods proposed in FIGS. 5 to 8.
  • the quantization unit divider may be implemented as a decoder device in combination with all or a part of the components of the decoder illustrated in FIG. 2.
  • the quantization unit divider may include a QU split flag parser 901, a QU split form derivator 902, and a QP derivator 903.
  • the QU Split Flag parser 901 parses a QU Split Flag for the current block.
  • the QU Split Flag may indicate whether the QU is split from a reference block (eg, CTU, CU, TU, etc.) that is the basis of (i.e., starting from) the division of the QU. Can be. That is, it may be indicated whether the current block is divided into QUs.
  • a reference block eg, CTU, CU, TU, etc.
  • the QU Split Flag may be transmitted in syntax for the reference block.
  • the QU Split Type Derivation Unit 902 derives the QU Split Type.
  • the QU Split Type Derivation Unit 902 may not derive the QU Split Type. .
  • the QU split type derivation unit 902 may perform the QU split mode (QU).
  • QU QU split mode
  • the split mode can be parsed. For example, all or part of the split form of the QU illustrated in FIG. 6 is predefined as the QU split modes, and the QU split mode selected by the encoder among the predefined QU split modes is the split mode (QU Split Mode). ) May be sent.
  • a QU Split Mode indicating a split type (or type) of a QU may not be explicitly transmitted from the encoder to the decoder.
  • the QU division type deriving unit 902 may determine the QU division type of the current block according to the intra prediction mode (that is, the prediction direction) applied to the current block. That is, the split form of the QU may be determined according to a distribution characteristic of a non-zero transform coefficient of the transform coefficient according to the intra prediction direction.
  • the QU splitting form The derivation unit 902 may select a split mode as illustrated in FIG. 6D or 6F.
  • a QU division type derivation unit 902 may select a split mode as shown in FIG. 6 (c) or FIG. 6 (e).
  • the most suitable division of FIGS. 6 (g) to 6 (l) is used.
  • the mode may be selected by the QU division type derivation unit 902.
  • the correspondence relationship between the prediction direction in each screen and the split mode according to FIGS. 6 (g) to 6 (l) may be defined in advance with a predetermined rule, and the QU division type derivation unit 902 is predefined.
  • the QU splitting shape may be determined according to the prediction direction in the screen according to the correspondence.
  • the QP derivation unit 903 derives a quantization parameter (QP) for each QU.
  • QP_ ⁇ may be parsed for each QU, and QP_ ⁇ and quantization parameter predictor QP_pred may be summed as shown in Equation 1 to derive QP for each QU.
  • the QPs of neighboring QUs may be used as the QP predictor QP_pred for the current QU.
  • information about the neighbor QU used as QP_pred may be transmitted from the encoder.
  • the QP derivation unit 903 may parse information about the neighbor QU. Can be.
  • the QP value of the current picture may be used as QP_pred.
  • the QP value determined by the above method is transferred to the inverse quantization unit, and the inverse quantization unit may perform inverse quantization using the QP value.
  • FIGS. 8 and 9 a method of splitting a QU from any one of a CU, a TU, and a CTU (that is, a reference block) using a QU division mode determined by an encoder among a plurality of predefined QU division modes (FIG. 6).
  • a QU division mode determined by an encoder among a plurality of predefined QU division modes.
  • the partitioning of the existing CU / TU is performed. The same may be applied.
  • the decoder may determine whether to split the reference block into QUs having a depth of 1 according to the QU Split Flag value of the reference block. If the reference block is divided into QUs having a depth of 1, whether or not the reference block is divided into QUs having a depth of 2 may be determined according to a QU split flag value for each QU having a depth of 1. Similarly, it is possible to determine whether to divide the QU having a depth of 3 according to the QU Split Flag value for each QU of the divided depth 2. This process may be repeated until the maximum depth value (or minimum QU size value) for the QU split is reached.
  • each component or feature is to be considered optional unless stated otherwise.
  • Each component or feature may be embodied in a form that is not combined with other components or features. It is also possible to combine some of the components and / or features to form an embodiment of the invention.
  • the order of the operations described in the embodiments of the present invention may be changed. Some components or features of one embodiment may be included in another embodiment or may be replaced with corresponding components or features of another embodiment. It is obvious that the claims may be combined to form an embodiment by combining claims that do not have an explicit citation relationship in the claims or as new claims by post-application correction.
  • Embodiments according to the present invention may be implemented by various means, for example, hardware, firmware, software, or a combination thereof.
  • an embodiment of the present invention may include one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), FPGAs ( field programmable gate arrays), processors, controllers, microcontrollers, microprocessors, and the like.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGAs field programmable gate arrays
  • processors controllers, microcontrollers, microprocessors, and the like.
  • an embodiment of the present invention may be implemented in the form of a module, procedure, function, etc. that performs the functions or operations described above.
  • the software code may be stored in memory and driven by the processor.
  • the memory may be located inside or outside the processor, and may exchange data with the processor by various known means.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

La présente invention concerne un procédé destiné au codage/décodage d'une image et un appareil s'y rapportant. Plus précisément, un procédé destiné à un appareil de décodage décodant une image peut comprendre les étapes consistant : à analyser un drapeau de division d'une unité de quantification (QU) indiquant si oui ou non la QU est divisée à partir d'un bloc courant ; à dériver un format de division de la QU si le drapeau de division de la QU est activé ; et à dériver un paramètre de quantification (QP) pour chaque QU divisée.
PCT/KR2016/011419 2015-10-13 2016-10-12 Procédé destiné au codage/décodage d'une image et appareil s'y rapportant WO2017065490A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201562240539P 2015-10-13 2015-10-13
US62/240,539 2015-10-13

Publications (1)

Publication Number Publication Date
WO2017065490A1 true WO2017065490A1 (fr) 2017-04-20

Family

ID=58518381

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2016/011419 WO2017065490A1 (fr) 2015-10-13 2016-10-12 Procédé destiné au codage/décodage d'une image et appareil s'y rapportant

Country Status (1)

Country Link
WO (1) WO2017065490A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111345039A (zh) * 2017-11-16 2020-06-26 松下电器(美国)知识产权公司 编码装置、解码装置、编码方法和解码方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20120140365A (ko) * 2011-06-21 2012-12-31 인텔렉추얼디스커버리 주식회사 쿼드트리 구조 기반의 적응적 양자화 파라미터 부호화 및 복호화 방법 및 장치
US20130022108A1 (en) * 2011-06-30 2013-01-24 General Instrument Corporation Quantization parameter derivation from qp predictor
KR20130052722A (ko) * 2011-11-08 2013-05-23 삼성전자주식회사 변환블록 크기에 기초한 양자화 파라미터 결정 방법 및 그 장치
KR20140131606A (ko) * 2013-05-02 2014-11-14 삼성전자주식회사 HEVC(high efficiency video coding)에서 코딩 유닛에 대한 양자화 파라미터를 변화시키는 방법과 장치, 및 시스템
US20150172720A1 (en) * 2010-08-17 2015-06-18 Samsung Electronics Co., Ltd. Video encoding method and apparatus using transformation unit of variable tree structure, and video decoding method and apparatus

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150172720A1 (en) * 2010-08-17 2015-06-18 Samsung Electronics Co., Ltd. Video encoding method and apparatus using transformation unit of variable tree structure, and video decoding method and apparatus
KR20120140365A (ko) * 2011-06-21 2012-12-31 인텔렉추얼디스커버리 주식회사 쿼드트리 구조 기반의 적응적 양자화 파라미터 부호화 및 복호화 방법 및 장치
US20130022108A1 (en) * 2011-06-30 2013-01-24 General Instrument Corporation Quantization parameter derivation from qp predictor
KR20130052722A (ko) * 2011-11-08 2013-05-23 삼성전자주식회사 변환블록 크기에 기초한 양자화 파라미터 결정 방법 및 그 장치
KR20140131606A (ko) * 2013-05-02 2014-11-14 삼성전자주식회사 HEVC(high efficiency video coding)에서 코딩 유닛에 대한 양자화 파라미터를 변화시키는 방법과 장치, 및 시스템

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111345039A (zh) * 2017-11-16 2020-06-26 松下电器(美国)知识产权公司 编码装置、解码装置、编码方法和解码方法
CN111345039B (zh) * 2017-11-16 2024-04-02 松下电器(美国)知识产权公司 解码装置和解码方法

Similar Documents

Publication Publication Date Title
WO2018128322A1 (fr) Procédé de traitement d'image et dispositif associé
WO2018080135A1 (fr) Procédé et appareil de codage/décodage vidéo, et support d'enregistrement à flux binaire mémorisé
WO2018128323A1 (fr) Procédé et dispositif de codage/décodage de signal vidéo au moyen d'une transformée secondaire
WO2016204531A1 (fr) Procédé et dispositif permettant de réaliser un filtrage adaptatif selon une limite de bloc
WO2018044088A1 (fr) Procédé et dispositif de traitement d'un signal vidéo
WO2018236031A1 (fr) Procédé de traitement d'image basé sur un mode d'intraprédiction, et appareil associé
WO2018236028A1 (fr) Procédé de traitement d'image basé sur un mode d'intra-prédiction et appareil associé
WO2018008904A2 (fr) Procédé et appareil de traitement de signal vidéo
WO2013025065A2 (fr) Appareil de codage/décodage d'images et procédé par lequel une sélection de filtres par unités précises est appliquée
WO2018062880A1 (fr) Procédé de traitement d'image et appareil associé
WO2018212569A1 (fr) Procédé de traitement d'image basé sur un mode de prédiction intra et appareil associé
WO2018070713A1 (fr) Procédé et appareil pour dériver un mode de prédiction intra pour un composant de chrominance
WO2019117639A1 (fr) Procédé de codage d'image sur la base d'une transformée et son dispositif
WO2018056702A1 (fr) Procédé et appareil de traitement de signal vidéo
WO2018101687A1 (fr) Dispositif et procédé de codage/décodage d'image, et support d'enregistrement contenant un flux binaire
WO2016200115A1 (fr) Procédé et dispositif de filtrage de déblocage
WO2016137166A1 (fr) Procédé de traitement d'image basé sur un mode de prédiction intra, et dispositif associé
WO2018066809A1 (fr) Procédé et dispositif de division d'unité de codage de composante de chrominance
WO2016114583A1 (fr) Procédé et dispositif de traitement de signal vidéo
WO2016140439A1 (fr) Procédé et dispositif pour coder et décoder un signal vidéo par utilisation d'un filtre de prédiction amélioré
WO2020242145A1 (fr) Procédé et appareil de codage vidéo utilisant un ensemble adaptatif de paramètres
WO2016133356A1 (fr) Procédé et dispositif de codage/décodage de signaux vidéo en utilisant un ordre de balayage adaptatif
WO2017065592A1 (fr) Procédé et appareil de codage et de décodage de signal vidéo
WO2018128222A1 (fr) Procédé et appareil de décodage d'image dans un système de codage d'image
WO2017065422A1 (fr) Procédé permettant de traiter une image et appareil s'y rapportant

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16855707

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16855707

Country of ref document: EP

Kind code of ref document: A1