WO2018128511A1 - Dispositif et procédé de codage ou de décodage d'image - Google Patents

Dispositif et procédé de codage ou de décodage d'image Download PDF

Info

Publication number
WO2018128511A1
WO2018128511A1 PCT/KR2018/000380 KR2018000380W WO2018128511A1 WO 2018128511 A1 WO2018128511 A1 WO 2018128511A1 KR 2018000380 W KR2018000380 W KR 2018000380W WO 2018128511 A1 WO2018128511 A1 WO 2018128511A1
Authority
WO
WIPO (PCT)
Prior art keywords
block
current block
split
intra
intra mode
Prior art date
Application number
PCT/KR2018/000380
Other languages
English (en)
Korean (ko)
Inventor
임정연
이선영
손세훈
신재섭
Original Assignee
에스케이텔레콤 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020180002063A external-priority patent/KR102528387B1/ko
Application filed by 에스케이텔레콤 주식회사 filed Critical 에스케이텔레콤 주식회사
Priority to US16/476,847 priority Critical patent/US10904522B2/en
Priority to CN201880005740.0A priority patent/CN110169060B/zh
Publication of WO2018128511A1 publication Critical patent/WO2018128511A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/129Scanning of coding units, e.g. zig-zag scan of transform coefficients or flexible macroblock ordering [FMO]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/14Coding unit complexity, e.g. amount of activity or edge presence estimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction

Definitions

  • the present invention relates to image encoding or decoding for efficiently encoding an image. More specifically, it relates to intra prediction techniques that can be useful in blocks with different textures in the form of diagonal edges or diagonal shapes.
  • JEM Joint Exploration Model
  • JVET Joint Video Exploration Team
  • CU coding unit
  • QTBT recursive Quadtree plus Binarytree partition structure.
  • PU Prediction Unit
  • TU Transform Unit
  • HEVC High Efficiency Video Coding
  • 360-degree video (hereinafter referred to as '360 video') is a video taken from multiple directions with multiple cameras or omni-directional cameras, and the video in multiple directions can be compressed into one 2D video to compress and transmit video from multiple scenes. Stitching, the stitched image is compressed and transmitted to the decoding apparatus. After the decoding apparatus decodes the compressed image, the decoding apparatus maps and reproduces the image in 3D.
  • EPR Equirectangular Projection
  • the EPR format has the disadvantage of distorting the 3D sphere-shaped 360 image by increasing the pixels at the top and bottom of the image and severely distorting the image.
  • the EPR format also increases the data amount and the encoding throughput in the enlarged portion. Accordingly, various projection formats have been proposed that can replace the EPR format.
  • ISP Icosahedral projection
  • FIG. 1 (a) is a general native layout form, and (b) and (c) are two kinds of compact layouts. All three types illustrated in FIG. 1 include a lot of diagonal edges due to the characteristics of the icosahedron composed of triangular faces.
  • 2 is a compact layout of an Octahedron projection (OHP) map of 360 video. As shown in FIG. 2, the area where two faces in one block exist occupies a significant portion of the image.
  • OHP Octahedron projection
  • the present disclosure describes intra prediction techniques that can be useful in blocks with different textures in diagonal shapes or diagonal shapes.
  • intra prediction technique of the present disclosure consider adding a new intra mode referred to as a "split intra mode" to existing intra modes.
  • the prediction region is divided into two regions in the diagonal direction so that each region can be predicted in a separate directional or non-directional mode.
  • a method comprising: decoding syntax elements defining an intra mode and a residual block of a current block of image data from an encoded bitstream; Based on the decoded syntax elements defining the intra mode of the current block, the plurality of available intra modes including a plurality of directional modes, a plurality of non-directional modes, and at least one split intra mode; Determining an intra mode; Determining a prediction block for the current block based on the determined intra mode of the current block; And reconstructing the current block based on the prediction block and the residual block.
  • the divisional intra mode is a mode in which one of the plurality of directional modes and the plurality of non-directional modes is individually applied to each of two regions divided by dividing the current block diagonally.
  • an apparatus for decoding image data comprising a memory and one or more processors, the one or more processors are syntax that defines an intra mode and a residual block of a current block of image data from an encoded bitstream Decrypting the elements; Based on the decoded syntax elements defining the intra mode of the current block, the plurality of available intra modes including a plurality of directional modes, a plurality of non-directional modes, and at least one split intra mode; Determining an intra mode; Determining a prediction block for the current block based on the determined intra mode of the current block; And reconstruct the current block based on the prediction block and the residual block.
  • ISP Icosahedral projection
  • ODP Octahedron projection
  • FIG. 3 is a block diagram of an image encoding apparatus that may use the techniques of this disclosure.
  • FIG. 4 is a conceptual diagram of exemplary block partitioning using a QTBT structure.
  • FIG. 5 is a conceptual diagram illustrating a plurality of intra modes that can be used for intra prediction.
  • FIG. 6 is an exemplary diagram of neighboring blocks of a current block.
  • FIG 7 illustrates an image decoding apparatus that may use the techniques of the present disclosure.
  • FIG. 8 is a diagram illustrating a positional relationship between a current block and reference samples in intra prediction.
  • 9A and 9B are conceptual diagrams illustrating how reference samples are used for a current block in directional intra mode.
  • 10A and 10B are diagrams illustrating types of split intra mode applied to a square block.
  • 11A and 11B illustrate types of split intra mode applied to a rectangular block.
  • 12A and 12B are diagrams for describing respective reference pixels used for two regions of the current block in the split intra mode.
  • 13A, 13B, 14A, and 14B are conceptual diagrams illustrating some exemplary schemes in which reference samples are used for two regions in split intra mode
  • 15A to 15C are conceptual views illustrating a method of determining prediction pixel values on a diagonal line in a split intra mode of the split_right_down type.
  • 16 is a flowchart illustrating an example method for encoding image data according to one or more examples of the present invention.
  • 17 is a flowchart illustrating an example method for decoding image data in accordance with one or more examples of the present invention.
  • FIG. 3 is a block diagram of an image encoding apparatus that may use the techniques of this disclosure.
  • the image encoding apparatus includes a block divider 310, a predictor 320, a subtractor 330, a transformer 340, a quantizer 345, an encoder 350, an inverse quantizer 360, and an inverse transform unit ( 365, an adder 370, a filter unit 380, and a memory 390.
  • each component may be implemented as a hardware chip, or may be implemented in software and implemented so that the microprocessor executes a function of software corresponding to each component.
  • the block dividing unit 310 After dividing each picture constituting the image into a plurality of coding tree units (CTUs), the block dividing unit 310 recursively divides the CTUs using a tree structure.
  • a leaf node in the tree structure becomes a CU (coding unit) which is a basic unit of coding.
  • CU coding unit
  • QT QuadTree
  • QTBT QuadTree
  • BT binaryTree
  • BinaryTree BinaryTree
  • the CTU is first divided into a QT structure.
  • the leaf nodes of the QT may then be further partitioned by BT.
  • the partition information generated by the block divider 310 by dividing the CTU by the QTBT structure is encoded by the encoder 350 and transmitted to the image decoding apparatus.
  • a first flag (QT split flag, QT_split_flag) indicating whether a block of a corresponding node is split is encoded. If the first flag is 1, the block of the node is divided into four blocks of the same size. If the first flag is 1, the node is no longer divided by QT.
  • a second flag (BT split flag, BT_split_flag) indicating whether a block of the corresponding node is split is encoded.
  • BT there may be a plurality of partition types. For example, there may be two types of partitioning a block of a node horizontally into two blocks of the same size and a type of partitioning vertically. Alternatively, there may further be a type in which blocks of the corresponding node are further divided into two blocks having an asymmetric shape.
  • the asymmetric form may include a form in which a block of a node is divided into two rectangular blocks having a size ratio of 1: 3.
  • FIG. 4 is an exemplary diagram of block division using a QTBT structure.
  • 4A illustrates an example of dividing a block by a QTBT structure
  • FIG. 4B illustrates a tree structure.
  • the solid line indicates the division by the QT structure
  • the dotted line indicates the division by the BT structure.
  • the parenthesis indicates a layer of QT
  • the parenthesis indicates a layer of BT.
  • a number represents partition type information.
  • the block corresponding to the first node of layer 1 of the QT proceeds to BT.
  • BT there are two types of BT splitting a block of a node into two blocks of the same size horizontally and vertically.
  • BT_split_flag indicating that the split is performed by BT.
  • the partition type of the second block of (layer 1) divided from the root node of the BT is horizontal, the block corresponding to the root node of the BT is QT divided. That is, it is a case where it should be expressed by QT split information rather than BT split information.
  • the following information may be further encoded.
  • the information is encoded as header information of an image, for example, may be encoded by a sequence parameter set (SPS) or a picture parameter set (PPS).
  • SPS sequence parameter set
  • PPS picture parameter set
  • CTU size the top layer of QTBT, that is, the block size of the root node
  • MinQTSize the minimum block size of leaf nodes allowed in QT
  • MaxBTSize the maximum block size of the root node allowed by BT
  • MaxBTDepth the maximum depth allowed by BT
  • MinBTSize the minimum block size of leaf nodes allowed in BT
  • a block having the same size as MinBTSize in BT is no longer split and no split information (second flag, split type information) regarding BT is also encoded.
  • second flag, split type information the maximum or minimum block size that a loop or leaf node of the QT and BT can have at a high level such as a sequence parameter set (SPS) or a picture parameter set (PPS) can be defined to determine whether the CTU is divided or not.
  • SPS sequence parameter set
  • PPS picture parameter set
  • the amount of coding for the information indicating the partition type can be reduced.
  • a block corresponding to a CU to be encoded or decoded is called a 'current block'.
  • the prediction unit 320 predicts the current block and generates a prediction block.
  • the predictor 320 includes an intra predictor 322 and an inter predictor 324.
  • the intra predictor 322 predicts pixels in the current block by using pixels (reference pixels) positioned around the current block in the current picture including the current block. There are a plurality of intra prediction modes according to the prediction direction, and the peripheral pixels to be used and the equations are defined differently according to each prediction mode. In particular, the intra predictor 322 may determine an intra mode to be used to encode the current block. In some examples, intra prediction unit 322 may encode the current block using several intra modes and select an appropriate intra mode to use from the tested modes. For example, intra predictor 322 calculates rate distortion values using rate-distortion analysis for several tested intra modes, and intra with the best rate distortion characteristics of the tested modes. You can also select the mode.
  • the plurality of available intra modes may include two non-directional modes (planar mode and DC mode) and 65 directional modes.
  • the present disclosure proposes an additional intra mode suitable for intra prediction of blocks having diagonal edges.
  • the number of directional and non-directional modes, the number of all intra modes, etc. are merely exemplary, and various combinations are possible within the scope of the present invention.
  • the intra predictor 322 selects one intra mode from among a plurality of intra modes, and selects a current block by using a peripheral equation (which may be referred to as a reference pixel or a reference sample) determined by the selected intra mode and an expression. Predict. Information on the selected intra mode is encoded by the encoder 350 and transmitted to the image decoding apparatus.
  • the intra prediction unit 322 is used as an intra mode of the current block among the plurality of intra modes in order to efficiently encode intra mode information indicating which mode of the plurality of intra modes is used as the intra mode of the current block. Some of the more likely modes can be determined as the most probable mode (MPM).
  • mode information indicating whether the intra mode of the current block is selected from the MPM is generated and transmitted to the encoder 350.
  • the intra mode of the current block is selected from the MPMs
  • the first intra identification information for indicating which mode of the MPMs is selected as the intra mode of the current block is transmitted to the encoder.
  • the second intra identification information for indicating which mode other than the MPM is selected as the intra mode of the current block is transmitted to the encoder.
  • the present invention is not limited thereto, and the number of MPMs included in the MPM list may be selected within a range of 3 to 10.
  • the MPM list is constructed using the intra mode of neighboring blocks of the current block.
  • the neighboring block may be, for example, all or some of the left block L, the upper block A, the lower left block BL, the upper right block AR, and the upper left block AL of the current block. It may include.
  • the intra mode of these neighboring blocks is included in the MPM list.
  • the intra prediction mode of the valid blocks in the order of the left block (L), the top block (A), the bottom left block (BL), the top right block (AR), and the top left block (AL) is included in the MPM list
  • a candidate is formed by adding a planar mode and a DC mode to the intra modes of the blocks.
  • valid modes in the order of the left block (L), the top block (A), the planar mode, the DC mode, the bottom left block (BL), the top right block (AR), and the top left block (AL) may be added to the MPM list.
  • valid modes in the order of the left block (L), the top block (A), the planar mode, the bottom left block (BL), the top right block (AR), the top left block (AL), and the DC mode may be added to the MPM list. have.
  • the MPM list contains only different intra modes. That is, when a duplicated mode is present, only one of them is included in the MPM list.
  • the MPM may be derived by adding -1 or +1 to the directional modes in the list.
  • the number of modes that are insufficient in the order of vertical mode, horizontal mode, diagonal mode, etc. are added to the MPM list. You may.
  • the inter prediction unit 324 searches a block most similar to the current block in the reference picture encoded and decoded before the current picture, and generates a prediction block for the current block by using the searched block. A motion vector corresponding to a displacement between the current block in the current picture and the prediction block in the reference picture is generated.
  • the motion information including the information about the reference picture and the motion vector used to predict the current block is encoded by the encoder 350 and transmitted to the image decoding apparatus.
  • the subtractor 330 subtracts the prediction block generated by the intra predictor 322 or the inter predictor 324 from the current block to generate a residual block.
  • the transformer 340 converts the residual signal in the residual block having pixel values of the spatial domain into a transform coefficient of the frequency domain.
  • the transform unit 340 may convert the residual signals in the residual block using the size of the current block as a conversion unit, or divide the residual block into a plurality of smaller subblocks and convert the residual signals in a subblock-sized transform unit. You can also convert. There may be various ways of dividing the residual block into smaller subblocks. For example, it may be divided into sub-blocks of a predetermined same size, or a quadtree (QT) scheme may be used in which the residual block is a root node.
  • QT quadtree
  • the encoder 350 generates a bitstream by encoding the quantized transform coefficients by using an encoding method such as CABAC.
  • the encoder 350 encodes information such as CTU size, MinQTSize, MaxBTSize, MaxBTDepth, MinBTSize, QT split flag, BT split flag, split type, etc. related to block division, so that the image decoding apparatus is the same as the image encoding apparatus. Allows you to split blocks.
  • the encoder 350 encodes information about a prediction type indicating whether a current block is encoded by intra prediction or inter prediction, and encodes intra prediction information or inter prediction information according to the prediction type.
  • the inverse quantizer 360 inverse quantizes the quantized transform coefficients output from the quantizer 345 to generate transform coefficients.
  • the inverse transformer 365 restores the residual block by converting the transform coefficients output from the inverse quantizer 360 from the frequency domain to the spatial domain.
  • the adder 370 adds the reconstructed residual block and the predicted block generated by the predictor 320 to reconstruct the current block.
  • the pixels in the reconstructed current block are used as reference pixels when intra prediction of the next order of blocks.
  • the filter unit 380 deblocks and filters the boundary between the reconstructed blocks in order to remove blocking artifacts that occur due to encoding / decoding of blocks. When all the blocks in a picture are reconstructed, the reconstructed picture is used as a reference picture for inter prediction of a block in a picture to be encoded later.
  • FIG 7 illustrates an image decoding apparatus that may use the techniques of the present disclosure.
  • the image decoding apparatus includes a decoder 710, an inverse quantizer 720, an inverse transformer 730, a predictor 740, an adder 750, a filter 760, and a memory 770.
  • the image decoding apparatus may be implemented by each component as a hardware chip, or may be implemented by software and a microprocessor to execute a function of software corresponding to each component.
  • the decoder 710 decodes the bitstream received from the image encoding apparatus, extracts information related to block division, determines a current block to be decoded, and includes prediction information and residual signal information necessary for reconstructing the current block. Extract
  • the decoder 710 extracts information on the CTU size from a Sequence Parameter Set (SPS) or Picture Parameter Set (PPS) to determine the size of the CTU, and divides the picture into a CTU of the determined size.
  • the CTU is determined as the highest layer of the tree structure, that is, the root node, and the CTU is partitioned using the tree structure by extracting partition information about the CTU. For example, when splitting a CTU using a QTBT structure, first, a first flag (QT_split_flag) related to splitting of QT is extracted, and each node is divided into four nodes of a lower layer. For the node corresponding to the leaf node of the QT, the second flag BT_split_flag and the split type information related to the splitting of the BT are extracted to split the corresponding leaf node into the BT structure.
  • SPS Sequence Parameter Set
  • PPS Picture Parameter Set
  • the QT splitting flag QT_split_flag corresponding to the node of the highest layer of the QTBT structure is extracted. Since the value of the extracted QT split flag QT_split_flag is 1, the node of the highest layer is divided into four nodes of the lower layer (layer 1 of QT). Then, the QT splitting flag QT_split_flag for the first node of layer 1 is extracted. Since the extracted QT split flag (QT_split_flag) has a value of 0, the first node of layer 1 is no longer split into a QT structure.
  • BT which is the root node of BT.
  • the BT split flag (BT_split_flag) for the first node of (layer 1) split from the root node of BT is extracted. Since the BT split flag BT_split_flag is 1, the split type information of the block of the first node of (layer 1) is extracted. Since the partition type information of the block of the first node of (layer 1) is 1, the block of the first node of (layer 1) is vertically divided. Then, the BT split flag BT_split_flag of the second node of (layer 1) divided from the root node of BT is extracted. Since the BT split flag BT_split_flag is 0, it is no longer split by BT.
  • BT split flag (BT_split_flag 1) for the first node of (layer 1) partitioned from the root node of BT, and extracting the partition type information (0), the first node of (layer 1)
  • the blocks are divided horizontally.
  • the BT split flag BT_split_flag of the second node of (layer 1) divided from the root node of BT is extracted. If the BT split flag BT_split_flag is 1, the split type information is automatically set to 1 without extracting the split type information anymore, and the block of the second node of (layer 1) is vertically split.
  • the decoder 710 first extracts the QT split flag QT_split_flag repeatedly to split the CTU into a QT structure.
  • the BT split flag (BT_split_flag) is extracted for the leaf node of the QT, and when the BT split flag (BT_split_flag) indicates the split, split type information is extracted.
  • the decoder 710 may confirm that the CTU is divided into a structure as shown in FIG.
  • the decoder 710 extracts the information and uses the information when extracting the split information for QT and BT. Can reflect.
  • the decoder 710 does not extract the split information (QT split flag) about the QT of the block from the bitstream (that is, the QT split flag of the block does not exist in the bitstream), and automatically extracts the value. Set to zero.
  • the decoder 710 does not extract the BT partition flag for the leaf node having a block larger than MaxBTSize in QT, and automatically sets the BT partition flag to 0.
  • the depth of the node of the BT reaches MaxBTDepth, the block of the node is no longer split.
  • the BT partition flag of the corresponding node is not extracted from the bitstream, and its value is automatically set to zero. Also, in BT, blocks having the same size as MinBTSize are no longer split. Accordingly, the decoder 710 does not extract the BT partition flag of the block having the same size as MinBTSize from the bitstream, and automatically sets the value to zero.
  • the decoder 710 determines the current block (current block) to be decoded by splitting the tree structure, the decoder 710 extracts information about a prediction type indicating whether the current block is intra predicted or inter predicted.
  • the decoder 710 extracts syntax elements of intra prediction information (intra mode) of the current block. First, the decoder 710 extracts mode information (ie, an MPM flag) indicating whether an intra mode of a current block is selected from the MPMs. Also, in general, when the intra mode encoding information indicates that the intra mode of the current block is selected from the MPMs, first intra identification information for indicating which mode of the MPMs is selected as the intra mode of the current block is extracted; When the intra mode encoding information indicates that the intra mode of the current block is not selected among the MPMs, the second intra identification information for indicating which mode other than the MPM is selected as the intra mode of the current block is extracted. do.
  • mode information ie, an MPM flag
  • the decoder 710 extracts information on the quantized transform coefficients of the current block as information on the residual signal.
  • the inverse quantizer 720 inversely quantizes the quantized transform coefficients, and the inverse transformer 730 inversely transforms the inverse quantized transform coefficients from the frequency domain to the spatial domain to generate a residual block for the current block.
  • the predictor 740 includes an intra predictor 742 and an inter predictor 744.
  • the intra predictor 742 is activated when the intra prediction is the prediction type of the current block
  • the inter predictor 744 is activated when the intra prediction is the prediction type of the current block.
  • the intra predictor 742 determines the intra mode of the current block among the plurality of intra modes from the syntax element for the intra mode extracted from the decoder 710, and uses the reference pixels around the current block according to the intra mode. Predict the block.
  • the intra predictor 742 constructs an MPM list including a predetermined number of MPMs from neighboring blocks of the current block.
  • the method of constructing the MPM list is the same as that of the intra predictor 322 of FIG. 3.
  • the intra prediction unit 742 may determine the first intra identification information among the MPMs in the MPM list. Select the indicated MPM as the intra mode of the current block. On the other hand, if the mode information indicates that the intra mode of the current block is not selected from the MPM, the intra mode of the current block is determined among the remaining intra modes except the MPMs in the MPM list using the second intra identification information.
  • the inter predictor 744 determines motion information of the current block using a syntax element for the intra mode extracted from the decoder 710, and predicts the current block using the determined motion information.
  • the adder 750 reconstructs the current block by adding the residual block output from the inverse transformer and the prediction block output from the inter predictor or the intra predictor.
  • the pixels in the reconstructed current block are utilized as reference pixels in intra prediction of the block to be subsequently decoded.
  • the filter unit 760 deblocks and filters the boundary between the reconstructed blocks in order to remove blocking artifacts caused by block-by-block decoding, and stores them in the memory 390. When all the blocks in a picture are reconstructed, the reconstructed picture is used as a reference picture for inter prediction of a block in a picture to be decoded later.
  • the techniques disclosed below relate to intra coding.
  • the techniques of this disclosure may be performed by, for example, the image encoding apparatus and the image decoding apparatus shown and described with respect to FIGS. 3 and 7. That is, in one example, the intra prediction unit described with respect to FIG. 3 may perform certain techniques described below when performing intra prediction while encoding a block of image data. In another example, the intra prediction unit described with respect to FIG. 7 may perform certain techniques described below when performing intra prediction while decoding a block of video data.
  • FIGS. 9A and 9B are conceptual diagrams illustrating how reference samples are used for a current block in directional intra mode. As illustrated in FIGS. 9A and 9B, the predictive block of the current block is generated singularly according to the direction of the selected intra mode.
  • the present disclosure is directed to intra prediction techniques that can be useful in blocks with different textures in the form of diagonal edges or diagonal shapes.
  • the current block to be predicted is divided into two regions, and each region can be predicted individually according to the directional or non-directional mode illustrated in FIG. 5.
  • the new intra mode proposed in the present disclosure may be referred to as "split intra mode" in the following description.
  • split intra mode In split intra mode, a first type (referred to as “Split_right_down type”), in which two areas are separated by a diagonal line connecting the upper and lower right corners of the block, and two areas are divided by a diagonal line connecting the left and right ears of the block. There are two types (referred to as “Split_right_up type”). As shown in Figs. 10A, 10B, 11A and 11B, in the divided intra mode, the two regions are divided by a diagonal line crossing the current block. Two types for square blocks are illustrated in FIGS. 10A and 10B, and two types for rectangular blocks are illustrated in FIGS. 11A and 11B.
  • FIG. 12A is a diagram for describing respective reference pixels used for two regions of the current block in the split intra mode of the Split_right_down type.
  • the lower left region of the current block includes the left reference pixels R 0,1 , R 0,2 , (hatched in (a) of FIG. 12A).
  • R 0,3 , ... R 0,2N are used to determine predicted pixel values
  • the upper right region includes the upper reference pixels R 1,0 , R 2,0 , (hatched in (b) of FIG. 12A).
  • R 3,0 ... R 2N, 0 ) is used to determine prediction pixel values.
  • R 0,0 may or may not be used as the reference pixel for both regions.
  • the upper left region is the left and upper reference pixels ( ⁇ R 0, 0 , R 0 , 1 , R 0 , 2 , adjacent to the corresponding block (hatched in (a) of FIG. 12B).
  • the lower right region is the left and upper reference pixels R 0, N + 1 that are not adjacent to the corresponding block (hatched in (b) of FIG. 12B).
  • R 0, N + 2 , R 0 , N + 3 , ... R 0,2N and R N + 1, 0 , R N + 2, 0 , R N + 3,0 , ... R 2N, 0 ) is used to determine prediction pixel values.
  • R 0, N +1 and R N + 1,0 may or may not be used as reference pixels for both regions.
  • R 0, N +1 and R N + 1,0 may be used only to generate predicted pixel values on the diagonal. That is, the predicted pixel values on the diagonal may be determined from R 0, N + 1 and R N + 1,0 .
  • each mode to be used for prediction of two regions in split intra mode may be fixed. That is, one or two modes selected from the directional or non-directional modes illustrated in FIG. 5 may be assigned to each region as a default mode.
  • the default modes assigned to the two regions may have the same direction or reverse directions.
  • the default modes allocated to the two regions may be planar or DC mode, which is a non-directional mode.
  • the lower left region of the current block determines predictive pixel values in a DC mode using left reference pixels
  • the upper right region predicts pixel values in a DC mode using upper reference pixels. Can be determined.
  • the default modes assigned to the two regions may have the directionality illustrated in FIGS. 13A and 13B.
  • each mode to be used for prediction of the two regions in the split intra mode may be selected by the image encoding apparatus in consideration of compression efficiency, among the directional or non-directional modes illustrated in FIG. 5. .
  • FIGS. 14A and 14B are conceptual diagrams illustrating some example ways in which reference samples are used for two regions in split intra mode.
  • the two regions are predicted in opposite directions, and in the example of FIG. 13B, the predictions for the two regions have the same direction.
  • the predictions for the two regions have different directions from each other.
  • the prediction shown by the scheme illustrated in FIG. 13B is bidirectional. That is, the left reference pixels R 0,0 , R 0,1 , R 0,2 , R 0, 3 , ...
  • each pixel in the current block is predicted using a left reference pixel and an upper reference pixel value located on a straight line parallel to a diagonal line dividing the current block into two regions.
  • the prediction pixel and the two reference pixels used to predict the pixel value lie on a straight line parallel to the diagonal that divides the current block into two regions.
  • the prediction pixels P 1, 1 is predicted from the left reference pixels (R 0,2) and an upper reference pixel (R 2,0).
  • Predicted pixel P N , 1 (or P 1, N ) is predicted from left R 0, N +1 and top R N + 1,0 .
  • P N, N is predicted from the left R 0,2N and the upper R 2N, 0 .
  • weights according to the distance from the prediction pixel position to the reference pixel position may be given to the reference pixel values.
  • the pixel value of the prediction pixel P N , 1 may be determined to be closer to R N + 1,0 than to R 0, N + 1 .
  • the first method is to treat split_right_down and split_right_up as new intra modes and signal them. For example, if the two intra modes are added to a video standard having n intra modes, split_right_down may be in ( n +1) times and split_right_up may be in ( n +2) times. Or you can reverse the order. The added two modes are signaled in the same way as other existing modes.
  • the second method is to treat split_right_down and split_right_up as one split_intra_mode and signal it. For example, if one intra mode is added to a video standard having n intra modes, split_intra_mode may be ( n + 1) times. The added one mode is signaled in the same manner as other existing modes. Here, when the ( n + 1) mode is selected, the 1-bit flag is additionally used to signal whether split_right_down or split_right_up is used.
  • the third method is to treat split_right_down and split_right_up as one split_intra_mode and signal them. For example, if one intra mode is added to a video standard having n intra modes, split_intra_mode may be ( n + 1) times. The added one mode is signaled in the same manner as other existing modes. In this case, when the ( n + 1) mode is selected, whether split_right_down or split_right_up is derived is derived from values of reference pixels by the image decoding apparatus without signaling of a separate syntax element.
  • split_right_down was selected as the type of split intra mode of a given block, the lower left region of the current block would have a different texture than the lower right region, so that the reference pixels adjacent to the lower left side were also different from the reference pixels adjacent to the upper right region. Note that it is likely to have different values. Accordingly, the apparatus for decoding an image may infer that split_right_down is used if Equation 1 is satisfied, or infer that split_right_up is selected if it is not satisfied.
  • Equation 1 the left side is the left reference pixels R 0, 1 , R 0 , 2 , for the lower left region when split_right_down is applied.
  • R 0 , 3 ...
  • R 3,0 ... R 2N, 0 ) represents the difference between the mean values.
  • the right side is the left and upper reference pixels R 0, 0 , R 0 , 1 , R 0 , 2 , adjacent to the corresponding block for the upper left region when split_right_up is applied.
  • the left and upper reference pixels R 0 N + 1 , R 0 , N + 2 , not adjacent to the corresponding block for the lower right region and the average value of R N, 0 ) R 0 , N + 3 , ... R 0,2N & R N + 1, 0 , R N + 2, 0 , R N + 3,0 , ... R 2N, 0 ) represents the difference between the mean values.
  • 2N reference pixels are all considered in Equation 1, a similar scheme may be applied to N reference pixels selected from 2N to reduce the amount of computation. For example, a similar approach is applied for reference pixels selected one pixel across, such as "0, 2, 4, 6, 8, 10 " or "1, 3, 5, 7, 9 ". It may be.
  • split_right_up can be used instead of the existing DC mode.
  • split_right_up may be one of a non-directional mode (diagonal planar).
  • the intra mode may include two non-directional modes and 65 directional modes including the conventional (vertical / horizontal) planar mode and the diagonal planar mode.
  • the diagonal planar mode may replace the DC mode in every encoding / decoding process including the MPM setting process described with reference to FIGS. 3 to 7.
  • the split intra mode when a mode for two regions of the current block is selected by the image encoding apparatus from among a plurality of intra modes including directional and non-directional modes, additional signaling is required for each mode applied to the two regions. .
  • the default mode when the default mode is allocated to two regions for the split intra mode, only information on whether the split intra mode is applied to the current block is sufficient.
  • prediction pixel values placed on a diagonal line separating two regions may be determined using reference samples of either region. That is, the pixels on the diagonal may be included in either area and may be predicted in the same mode as the other pixels in the area. Alternatively, pixels on the diagonal can be determined using both reference samples in both regions.
  • FIG. 15A to 15C are conceptual views illustrating a method of determining prediction pixel values on a diagonal line in a split intra mode of the split_right_down type.
  • FIG. 15A is an example of how predictive pixel values on a diagonal line are determined using reference samples for the upper right region
  • FIG. 15B is an example of how predictive pixel values on diagonal line are determined using reference samples for the lower left region.
  • 15C is an example of how the prediction pixel values on the diagonal line are determined using both reference samples for the upper right region and reference samples for the lower left region.
  • segmented intra mode may be particularly useful for intra prediction of 360 images in which projection formats consisting of triangular faces may be used.
  • a 360 image is converted to a two-dimensional image based on Icosahedral projection (ISP) and Octahedron projection (OHP)
  • ISP Icosahedral projection
  • OHP Octahedron projection
  • the prediction pixels should also be filled using the lower right reference sample values. If the information about the original pixel values on the diagonal is unknown, the prediction pixels can be filled using the upper left and lower right reference sample values.
  • the image encoding apparatus and the image decoding apparatus may generate prediction pixel values on a diagonal line using reference samples of a predetermined region.
  • the apparatus for encoding an image may generate prediction pixel values on a diagonal line using reference samples of a specific (upper left or lower right) region and then explicitly signal information about the diagonal.
  • the split intra mode may be activated / deactivated by a flag indicating activation / deactivation of the split intra mode.
  • the split intra mode may be activated / deactivated by a flag indicating 360 video.
  • the split intra mode may be activated / deactivated by a syntax element that defines the projection format used for the two-dimensional representation of 360 video. As such, split intra mode may or may not be included in the plurality of available intra modes, as desired.
  • 16 is a flowchart illustrating an example method for encoding image data in accordance with one or more examples described above of the present invention.
  • the video encoding apparatus determines an intra mode for predicting a current block of video data among a plurality of available intra modes (S1610).
  • the plurality of available intra modes includes a plurality of directional modes, a plurality of non-directional modes, and at least one split intra mode.
  • the prediction block for the current block is predicted separately for each of the two regions (ie, the first region and the second region) divided by dividing the current block diagonally.
  • One of a plurality of directional modes and a plurality of non-directional modes is applied to the prediction for each region.
  • One directional or non-directional mode may be commonly applied to the two regions, and a mode different from the mode applied to the first region may be applied to the second region.
  • the mode applied to the first region and the mode applied to the second region may be predetermined among a plurality of directional or non-directional modes.
  • Each region of the split intra mode may be assigned one or two modes selected from a plurality of directional or non-directional modes as a default mode. Default modes assigned to the two regions may have the same directionality, reverse direction to each other, or both directions at the same time.
  • the default modes allocated to the two regions may be planar or DC mode, which is a non-directional mode.
  • each mode to be used for prediction of the two regions in the split intra mode may be selected by the image encoding apparatus in consideration of compression efficiency, among the directional or non-directional modes illustrated in FIG. 5. .
  • the plurality of available intra modes include two split intra modes, a first split intra mode in which two areas are separated by a diagonal line connecting the upper and lower right corners of the current block, and the two areas are current blocks. There may be a second split intra mode separated by a diagonal line connecting the lower left and upper right ears of.
  • the one split intra mode is a first type in which two areas are divided by a diagonal line connecting the upper left and the right lower end of the current block, and a second type in which the two areas are separated by a diagonal connecting the left and right ears of the current block. This may exist.
  • the image encoding apparatus determines a prediction block for the current block based on the determined intra mode of the current block (S1620).
  • the determined intra mode of the current block S1620.
  • split intra modes at least some of the reference pixels used to determine the prediction pixels for the first region are different from the reference pixels used to determine the prediction pixels for the second region.
  • the first split intra mode is determined to be the intra mode of the current block
  • at least some of the reference samples used to determine the prediction pixels for the first area are At least some of the reference samples selected on the left block L and the bottom left block BL of the current block, and used to determine the prediction pixels for the second region (the top right region) are the top block A of the current block.
  • the upper right block (AR) is the upper right block (AR).
  • the second split intra mode is determined to be the intra mode of the current block
  • at least some of the reference samples used to determine the prediction pixels for the first of the two regions are the current block.
  • At least some of the reference samples selected on the left block (L) and the top block (A) of and used to determine the prediction pixels for the second region (lower region) are the lower left block BL of the current block.
  • the upper right block AR is the upper right block AR.
  • the image encoding apparatus determines a residual block based on the current block and the prediction block (S1630).
  • the image encoding apparatus encodes syntax elements that define an intra mode and a residual block of the current block (S1640).
  • the image encoding apparatus may further encode syntax elements that define one of a plurality of directional modes and a plurality of non-directional modes applied to each of the two regions.
  • 17 is a flow chart illustrating an exemplary method of decoding image data in accordance with one or more examples described above of the present invention.
  • the image decoding apparatus decodes syntax elements that define an intra mode and a residual block for the current block of image data from the encoded bitstream (S1710).
  • the image decoding apparatus determines an intra mode for predicting a current block of image data, from among a plurality of available intra modes, based on the decoded syntax elements (S1720).
  • the plurality of available intra modes includes a plurality of directional modes, a plurality of non-directional modes, and at least one split intra mode.
  • the image decoding apparatus further decodes syntax elements defining one of a plurality of directional modes and a plurality of non-directional modes applied to each of two regions from the encoded bitstream. can do.
  • the image decoding apparatus determines a prediction block for the current block based on the determined intra mode of the current block (S1730).
  • the image decoding apparatus reconstructs the current block based on the determined prediction block and the residual block (S1740).

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Une technique d'intra-prédiction de la présente invention considère l'ajout d'un nouveau mode intra, appelé « intra-mode de division » à des intra-modes existants. Sur la base de l'intra-mode de division, une région de prédiction est divisée en diagonale en deux régions, et chaque région peut être prédite avec un mode directionnel ou non directionnel séparé. Des techniques qui peuvent être considérées lorsque la prédiction intra d'un bord diagonal qui peut exister dans un bloc à coder à travers celle-ci sont expliquées ici.
PCT/KR2018/000380 2017-01-09 2018-01-09 Dispositif et procédé de codage ou de décodage d'image WO2018128511A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US16/476,847 US10904522B2 (en) 2017-01-09 2018-01-09 Apparatus and method for video encoding or decoding using intra-predicting diagonal edges
CN201880005740.0A CN110169060B (zh) 2017-01-09 2018-01-09 用于对图像进行编码或解码的设备和方法

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2017-0003158 2017-01-09
KR20170003158 2017-01-09
KR10-2018-0002063 2018-01-08
KR1020180002063A KR102528387B1 (ko) 2017-01-09 2018-01-08 영상 부호화 또는 복호화하기 위한 장치 및 방법

Publications (1)

Publication Number Publication Date
WO2018128511A1 true WO2018128511A1 (fr) 2018-07-12

Family

ID=62790981

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2018/000380 WO2018128511A1 (fr) 2017-01-09 2018-01-09 Dispositif et procédé de codage ou de décodage d'image

Country Status (1)

Country Link
WO (1) WO2018128511A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2577056A (en) * 2018-09-11 2020-03-18 British Broadcasting Corp Bitstream decoder

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100571814B1 (ko) * 2002-09-30 2006-04-17 삼성전자주식회사 색상의 공간 예측 부호화를 이용한 영상 부호화 및 복호화방법 및 장치
KR20130005233A (ko) * 2011-07-05 2013-01-15 한국전자통신연구원 영상 정보 부호화 방법 및 복호화 방법
KR101292093B1 (ko) * 2010-07-02 2013-08-08 (주)휴맥스 화면내 예측 부호화를 위한 영상 부호화/복호화 장치 및 방법
KR20140005257A (ko) * 2011-01-21 2014-01-14 톰슨 라이센싱 기하학적 기반 인트라 예측을 위한 방법 및 장치
KR101629475B1 (ko) * 2009-09-23 2016-06-22 삼성전자주식회사 기하학 기반의 블록 분할 인트라 예측을 이용한 깊이영상 부호화 방법 및 장치

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100571814B1 (ko) * 2002-09-30 2006-04-17 삼성전자주식회사 색상의 공간 예측 부호화를 이용한 영상 부호화 및 복호화방법 및 장치
KR101629475B1 (ko) * 2009-09-23 2016-06-22 삼성전자주식회사 기하학 기반의 블록 분할 인트라 예측을 이용한 깊이영상 부호화 방법 및 장치
KR101292093B1 (ko) * 2010-07-02 2013-08-08 (주)휴맥스 화면내 예측 부호화를 위한 영상 부호화/복호화 장치 및 방법
KR20140005257A (ko) * 2011-01-21 2014-01-14 톰슨 라이센싱 기하학적 기반 인트라 예측을 위한 방법 및 장치
KR20130005233A (ko) * 2011-07-05 2013-01-15 한국전자통신연구원 영상 정보 부호화 방법 및 복호화 방법

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2577056A (en) * 2018-09-11 2020-03-18 British Broadcasting Corp Bitstream decoder
GB2577056B (en) * 2018-09-11 2022-12-14 British Broadcasting Corp Bitstream decoder

Similar Documents

Publication Publication Date Title
US20210092367A1 (en) Apparatus and method for video encoding or decoding
US20190289332A1 (en) Method, apparatus and system for de-blocking video data
WO2018128466A1 (fr) Dispositif et procédé de codage ou de décodage d'image
TW201729593A (zh) 視訊寫碼中具有非正方形預測區塊之照明補償
CN110169060B (zh) 用于对图像进行编码或解码的设备和方法
JP7379656B2 (ja) ビデオ信号処理方法及び装置
JP7197728B2 (ja) 変換インデックスを利用する映像コーディング
KR20190083948A (ko) 영상을 부호화 또는 복호화하는 방법 및 장치
WO2019198997A1 (fr) Procédé de codage d'image à base d'intraprédiction et appareil pour cela
WO2016056822A1 (fr) Procédé et dispositif de codage vidéo 3d
WO2020251260A1 (fr) Procédé et dispositif de traitement de signal vidéo utilisant un procédé de prédiction dpcm de blocs
WO2019194500A1 (fr) Procédé de codage d'images basé sur une prédication intra et dispositif associé
CN113016180A (zh) 用于视频编码和解码的虚拟管线
WO2019135628A1 (fr) Procédé et dispositif de codage ou de décodage d'image
JP2023025271A (ja) 映像コーディングにおけるマトリクスベースのイントラ予測のための変換
WO2019240425A1 (fr) Procédé d'inter-prédiction et dispositif de décodage d'image
JP2023033544A (ja) イントラ予測ベースの映像コーディングにおける変換
CN113875256A (zh) 用于视频编解码的方法和装置
WO2019059721A1 (fr) Codage et décodage d'image à l'aide d'une technique d'amélioration de résolution
WO2022114742A1 (fr) Appareil et procédé de codage et décodage vidéo
WO2014073877A1 (fr) Procédé et appareil pour traiter un signal vidéo multi-vues
WO2018128511A1 (fr) Dispositif et procédé de codage ou de décodage d'image
WO2018212582A1 (fr) Procédé et dispositif de codage ou de décodage en prédiction intra
WO2019135636A1 (fr) Procédé et appareil de codage/décodage d'images utilisant une corrélation dans ycbcr
CN113892262A (zh) 帧内模式候选配置方法和视频解码设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18736454

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18736454

Country of ref document: EP

Kind code of ref document: A1