WO2020004979A1 - 영상 부호화/복호화 방법 및 장치 - Google Patents
영상 부호화/복호화 방법 및 장치 Download PDFInfo
- Publication number
- WO2020004979A1 WO2020004979A1 PCT/KR2019/007821 KR2019007821W WO2020004979A1 WO 2020004979 A1 WO2020004979 A1 WO 2020004979A1 KR 2019007821 W KR2019007821 W KR 2019007821W WO 2020004979 A1 WO2020004979 A1 WO 2020004979A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- block
- partition
- merge candidate
- prediction
- motion
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 124
- 230000033001 locomotion Effects 0.000 claims abstract description 297
- 238000005192 partition Methods 0.000 claims abstract description 226
- 239000013598 vector Substances 0.000 claims description 116
- 230000002123 temporal effect Effects 0.000 claims description 34
- 230000002457 bidirectional effect Effects 0.000 claims description 23
- 238000000638 solvent extraction Methods 0.000 description 27
- 238000010586 diagram Methods 0.000 description 15
- 239000000872 buffer Substances 0.000 description 11
- 230000008569 process Effects 0.000 description 10
- 238000013139 quantization Methods 0.000 description 9
- 238000004891 communication Methods 0.000 description 7
- 238000001914 filtration Methods 0.000 description 7
- 230000011664 signaling Effects 0.000 description 7
- 208000037170 Delayed Emergence from Anesthesia Diseases 0.000 description 6
- 230000003044 adaptive effect Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 230000006835 compression Effects 0.000 description 4
- 238000007906 compression Methods 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000011218 segmentation Effects 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 239000011449 brick Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/537—Motion estimation other than block-based
- H04N19/543—Motion estimation other than block-based using regions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
- H04N19/139—Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
- H04N19/423—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
- H04N19/426—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements using memory downsizing methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention relates to a method and apparatus for image encoding / decoding.
- Video compression is largely composed of intra prediction (or intra prediction), inter prediction (or inter prediction), transform, quantization, entropy coding, and in-loop filter. Meanwhile, as the demand for high resolution video increases, the demand for stereoscopic video content also increases as a new video service. There is a discussion about a video compression technology for effectively providing high resolution and ultra high resolution stereoscopic image contents.
- An object of the present invention is to provide an image encoding / decoding method and apparatus having improved efficiency.
- Another object of the present invention is to provide a method and apparatus for adaptively constructing a merge candidate list of a current block.
- An object of the present invention is to provide a method and apparatus for limiting a prediction direction of a block to which diagonal motion division is applied.
- Another object of the present invention is to provide a weighted prediction method and apparatus for a block to which diagonal motion division is applied.
- Another object of the present invention is to provide a method and apparatus for storing motion information of a block to which diagonal motion division is applied.
- An image decoding method and apparatus construct a merge candidate list of a current block, derive motion information of the current block based on the merge candidate list and the merge candidate index, and based on the derived motion information. For example, inter prediction of the current block may be performed.
- the current block may be divided into a first partition and a second partition based on a diagonal motion division.
- At least one of the first partition and the second partition may have a triangular shape.
- the merge candidate index may be signaled with respect to at least one of the first partition and the second partition.
- the merge candidate index is signaled when a predetermined flag is a first value, and the flag indicates whether the current block performs motion compensation based on diagonal motion division. Can be represented.
- the value of the flag may be derived from the decoding apparatus based on a predetermined encoding parameter.
- the encoding parameter may include at least one of a slice type to which the current block belongs, an inter mode type of the current block, or a size of the current block.
- the merge candidate list may include at least one of a spatial merge candidate, a temporal merge candidate, a combined merge candidate, or a merge candidate having zero motion vectors.
- the motion information of the partition may be derived based on the motion information of the merge candidate specified by the merge candidate index.
- the partition when the merge candidate has motion information of bidirectional prediction, the partition may be limited to have only motion information of unidirectional prediction.
- the pixels of the current block may have predetermined weights m and n on pixels belonging to the reference block of the first partition and pixels belonging to the reference block of the second partition. Can be predicted by application.
- m and n are any one of 0, 1, 2, 4, 6, 7, 8, 16, 28, 31, or 32, and the sum of m and n May be any one of 2, 8, or 32.
- the weight may be determined based on a position of a pixel in the current block.
- the coding / decoding efficiency can be improved through block division of various forms.
- the present invention can improve the encoding / decoding efficiency of motion information by using an adaptive merge candidate list.
- the present invention can efficiently reduce the memory bandwidth by limiting the prediction direction of a block to which diagonal motion division is applied.
- the present invention can reduce artifacts near the partition boundary through weighted prediction of blocks to which diagonal motion partition is applied.
- the accessibility of the motion information and the reliability as reference information can be improved through the motion information storage method according to the present invention.
- a computer-readable recording medium for storing a bitstream generated by the video encoding method / apparatus according to the present invention can be provided.
- FIG. 1 is a block diagram illustrating a video encoding apparatus according to the present invention.
- FIG. 2 is a block diagram illustrating an image decoding apparatus according to the present invention.
- 3 is an embodiment of a block partitioning method of any form using one line segment.
- FIG. 4 is a diagram illustrating examples of various arbitrary block divisions that can be generated using square and rectangular block divisions and left and right diagonal segments.
- FIG. 5 shows a method of dividing a square or non-square block into a triangular form using two types of diagonal lines proposed by the present invention.
- FIG. 6 is a diagram illustrating a concept of a mask for segmentation in a circle or ellipse form among masks for mask-based motion prediction and compensation proposed in the present invention.
- FIG. 7 is a diagram illustrating a form in which encoding information of a corresponding block is stored when diagonal motion division is used.
- FIG. 8 illustrates a merge mode based inter prediction method according to an embodiment to which the present invention is applied.
- FIG. 11 is a diagram illustrating a concept of limited mask-based motion prediction and compensation as an embodiment to which the present invention is applied.
- FIGS. 12 and 13 are diagrams illustrating the concept of a method of performing prediction by dividing one coding block into two partitions using one straight line according to an embodiment to which the present invention is applied.
- 14 to 17 illustrate a weighted prediction method for a diagonal motion-divided current block according to an embodiment to which the present invention is applied.
- An image decoding method and apparatus construct a merge candidate list of a current block, derive motion information of the current block based on the merge candidate list and the merge candidate index, and based on the derived motion information. For example, inter prediction of the current block may be performed.
- the current block may be divided into a first partition and a second partition based on a diagonal motion division.
- At least one of the first partition and the second partition may have a triangular shape.
- the merge candidate index may be signaled with respect to at least one of the first partition and the second partition.
- the merge candidate index is signaled when a predetermined flag is a first value, and the flag indicates whether the current block performs motion compensation based on diagonal motion division. Can be represented.
- the value of the flag may be derived from the decoding apparatus based on a predetermined encoding parameter.
- the encoding parameter may include at least one of a slice type to which the current block belongs, an inter mode type of the current block, or a size of the current block.
- the merge candidate list may include at least one of a spatial merge candidate, a temporal merge candidate, a combined merge candidate, or a merge candidate having zero motion vectors.
- the motion information of the partition may be derived based on the motion information of the merge candidate specified by the merge candidate index.
- the partition when the merge candidate has motion information of bidirectional prediction, the partition may be limited to have only motion information of unidirectional prediction.
- the pixels of the current block may have predetermined weights m and n on pixels belonging to the reference block of the first partition and pixels belonging to the reference block of the second partition. Can be predicted by application.
- m and n are any one of 0, 1, 2, 4, 6, 7, 8, 16, 28, 31, or 32, and the sum of m and n May be any one of 2, 8, or 32.
- the weight may be determined based on a position of a pixel in the current block.
- first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
- some of the configuration of the apparatus or some of the steps of the method may be omitted.
- the order of some of the components of the apparatus or some of the steps of the method may be changed.
- other configurations or other steps may be inserted into part of the device or part of the steps of the method.
- each component shown in the embodiments of the present invention are shown independently to represent different characteristic functions, and do not mean that each component is composed of separate hardware or one software unit. That is, each component is described by listing each component for convenience of description, and at least two of the components may be combined to form one component, or one component may be divided into a plurality of components to perform a function. The integrated and separated embodiments of each of these components are also included within the scope of the present invention without departing from the spirit of the invention.
- the video decoding apparatus (Video Decoding Apparatus) to be described below is a civil security camera, civil security system, military security camera, military security system, personal computer (PC, Personal Computer), notebook computer, portable multimedia player (PMP, Portable MultimediaPlayer), It may be a device included in a server terminal such as a wireless communication terminal, a smart phone, a TV application server and a service server, and a communication modem for communicating with a user terminal such as various devices or a wired or wireless communication network. It may mean a variety of devices including a communication device such as an image, a memory for storing various programs and data for inter- or intra-prediction for decoding or decoding an image, a microprocessor for executing and operating a program, and the like. Can be.
- the image encoded in the bitstream by the encoder is real-time or non-real-time through the wired or wireless communication network, such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network or the like, cable, universal serial bus (USB, It can be transmitted to a video decoding apparatus through various communication interfaces such as a universal serial bus), decoded, reconstructed, and played back.
- the bitstream generated by the encoder may be stored in the memory.
- the memory may include both a volatile memory and a nonvolatile memory.
- a memory may be represented as a recording medium storing a bitstream.
- a video may be composed of a series of pictures, and each picture may be divided into a coding unit such as a block.
- a coding unit such as a block.
- FIG. 1 is a block diagram illustrating a video encoding apparatus according to the present invention.
- the image encoding apparatus 100 may include a picture splitter 110, a predictor 120 and 125, a transformer 130, a quantizer 135, a realigner 160, and an entropy encoder. 165, an inverse quantizer 140, an inverse transformer 145, a filter 150, and a memory 155.
- the picture dividing unit 110 may divide the input picture into at least one processing unit.
- the processing unit may be a prediction unit (PU), a transform unit (TU), or a coding unit (CU).
- a coding unit may be used as a unit for encoding or may be used as a unit for decoding.
- the prediction unit may be split in the form of at least one square or rectangle having the same size in one coding unit, or the prediction unit of any one of the prediction units split in one coding unit is different from one another. It may be divided to have a different shape and / or size than the unit.
- the intra prediction may be performed without splitting into a plurality of prediction units NxN.
- the predictors 120 and 125 may include an inter predictor 120 that performs inter prediction or inter prediction, and an intra predictor 125 that performs intra prediction or intra prediction. Whether to use inter prediction or intra prediction on the prediction unit may be determined, and specific information (eg, an intra prediction mode, a motion vector, a reference picture, etc.) according to each prediction method may be determined.
- the residual value (residual block) between the generated prediction block and the original block may be input to the transformer 130.
- prediction mode information and motion vector information used for prediction may be encoded by the entropy encoder 165 together with the residual value and transmitted to the decoder.
- a method of using a merge prediction region (MER) in consideration of parallel processing in merging motion information of adjacent blocks in a spatial or temporal manner is used. It can be applied to the predictors 120 and 125. That is, the present invention provides a parallel prediction region (PER: Parallel) for constructing spatially or temporally adjacent blocks of a current block in consideration of parallel processing in prediction techniques such as inter-screen prediction, intra-picture prediction, and inter-component prediction among video coding techniques.
- PER Parallel
- Estimation Region can be used.
- the inter prediction unit 120 may predict the prediction unit based on the information of at least one of the previous picture or the next picture of the current picture. In some cases, the inter prediction unit 120 may predict the prediction unit based on the information of the partial region in which the encoding is completed in the current picture. You can also predict units.
- the inter predictor 120 may include a reference picture interpolator, a motion predictor, and a motion compensator.
- the reference picture interpolator may receive reference picture information from the memory 155 and generate pixel information of an integer pixel or less in the reference picture.
- a DCT based 8-tap interpolation filter having different filter coefficients may be used to generate pixel information of integer pixels or less in units of 1/4 pixels.
- a DCT-based interpolation filter having different filter coefficients may be used to generate pixel information of an integer pixel or less in units of 1/8 pixels.
- the motion predictor may perform motion prediction based on the reference picture interpolated by the reference picture interpolator.
- various methods such as a full search-based block matching algorithm (FBMA), a three step search (TSS), and a new three-step search algorithm (NTS) may be used.
- the motion vector may have a motion vector value in units of 1/2 or 1/4 pixels based on the interpolated pixels.
- the motion prediction unit may predict the current prediction unit by using a different motion prediction method.
- various methods such as a skip method, a merge method, an advanced motion vector prediction (AMVP) method, an intra block copy method, and the like may be used.
- AMVP advanced motion vector prediction
- the intra predictor 125 may generate a prediction unit based on reference pixel information around the current block, which is pixel information in the current picture. If the neighboring block of the current prediction unit is a block that has performed inter prediction, and the reference pixel is a pixel that has performed inter prediction, the reference pixel of the block that has performed intra prediction around the reference pixel included in the block where the inter prediction has been performed Can be used as a substitute for information. That is, when the reference pixel is not available, the unavailable reference pixel information may be replaced with at least one reference pixel among the available reference pixels.
- a residual block may include a prediction unit performing prediction based on the prediction units generated by the prediction units 120 and 125 and residual information including residual information that is a difference from an original block of the prediction unit.
- the generated residual block may be input to the transformer 130.
- the transform unit 130 converts the residual block including residual information of the original block and the prediction unit generated by the prediction units 120 and 125 into a discrete cosine transform (DCT), a discrete sine transform (DST), and a KLT. You can convert using the same conversion method. Whether to apply DCT, DST, or KLT to transform the residual block may be determined based on intra prediction mode information of the prediction unit used to generate the residual block.
- DCT discrete cosine transform
- DST discrete sine transform
- KLT KLT
- the quantization unit 135 may quantize the values converted by the transformer 130 into the frequency domain.
- the quantization coefficient may change depending on the block or the importance of the image.
- the value calculated by the quantization unit 135 may be provided to the inverse quantization unit 140 and the reordering unit 160.
- the reordering unit 160 may reorder coefficient values with respect to the quantized residual value.
- the reordering unit 160 may change the two-dimensional block shape coefficients into a one-dimensional vector form through a coefficient scanning method. For example, the reordering unit 160 may scan from DC coefficients to coefficients in the high frequency region by using a Zig-Zag scan method and change them into one-dimensional vectors.
- a vertical scan that scans two-dimensional block shape coefficients in a column direction instead of a zig-zag scan may be used, and a horizontal scan that scans two-dimensional block shape coefficients in a row direction. That is, according to the size of the transform unit and the intra prediction mode, it is possible to determine which scan method among the zig-zag scan, the vertical scan, and the horizontal scan is used.
- the entropy encoder 165 may perform entropy encoding based on the values calculated by the reordering unit 160.
- Entropy coding may use various coding methods such as, for example, Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), and Context-Adaptive Binary Arithmetic Coding (CABAC).
- CABAC Context-Adaptive Binary Arithmetic Coding
- the entropy encoder 165 may encode residual value coefficient information of a coding unit from the reordering unit 160 and the predictors 120 and 125.
- the inverse quantizer 140 and the inverse transformer 145 inverse quantize the quantized values in the quantizer 135 and inversely transform the transformed values in the transformer 130.
- the residual value generated by the inverse quantizer 140 and the inverse transformer 145 is reconstructed by combining the prediction units predicted by the motion estimator, the motion compensator, and the intra predictor included in the predictors 120 and 125. You can create a Reconstructed Block.
- the filter unit 150 may include at least one of a deblocking filter, an offset correction unit, and an adaptive loop filter (ALF).
- the deblocking filter may remove block distortion caused by boundaries between blocks in the reconstructed picture.
- the offset correction unit may correct the offset with respect to the original image on a pixel-by-pixel basis for the deblocking image.
- the pixels included in the image are divided into a predetermined number of areas, and then, an area to be offset is determined, an offset is applied to the corresponding area, or offset considering the edge information of each pixel. You can use this method.
- Adaptive Loop Filtering ALF
- ALF Adaptive Loop Filtering
- the memory 155 may store the reconstructed block or picture calculated by the filter unit 150, and the stored reconstructed block or picture may be provided to the predictors 120 and 125 when performing inter prediction.
- FIG. 2 is a block diagram illustrating an image decoding apparatus according to the present invention.
- the image decoder 200 includes an entropy decoder 210, a reordering unit 215, an inverse quantizer 220, an inverse transformer 225, a predictor 230, 235, and a filter unit ( 240, a memory 245 may be included.
- the input bitstream may be decoded by a procedure opposite to that of the image encoder.
- the entropy decoder 210 may perform entropy decoding in a procedure opposite to that of the entropy encoding performed by the entropy encoder of the image encoder. For example, various methods such as Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), and Context-Adaptive Binary Arithmetic Coding (CABAC) may be applied to the method performed by the image encoder.
- various methods such as Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), and Context-Adaptive Binary Arithmetic Coding (CABAC) may be applied to the method performed by the image encoder.
- the entropy decoder 210 may decode information related to intra prediction and inter prediction performed by the encoder.
- the reordering unit 215 may reorder the entropy decoded bitstream by the entropy decoding unit 210 based on a method of rearranging the bitstream. Coefficients expressed in the form of a one-dimensional vector may be reconstructed by reconstructing the coefficients in a two-dimensional block form.
- the inverse quantization unit 220 may perform inverse quantization based on the quantization parameter provided by the encoder and the coefficient values of the rearranged block.
- the inverse transform unit 225 may perform an inverse transform, i.e., an inverse DCT, an inverse DST, and an inverse KLT, for a quantization result performed by the image encoder, that is, a DCT, DST, and KLT. Inverse transformation may be performed based on a transmission unit determined by the image encoder.
- the inverse transform unit 225 of the image decoder may selectively perform a transform scheme (eg, DCT, DST, KLT) according to a plurality of pieces of information such as a prediction method, a size of a current block, and a prediction direction.
- a transform scheme eg, DCT, DST, KLT
- the prediction units 230 and 235 may generate the prediction block based on the prediction block generation related information provided by the entropy decoder 210 and previously decoded blocks or picture information provided by the memory 245.
- the intra prediction when performing the intra prediction or the intra prediction in the same way as the operation of the image encoder, when the size of the prediction unit and the size of the transform unit are the same, the pixel existing on the left side and the pixel present on the upper left side
- intra prediction is performed on a prediction unit based on the pixel present at the top, but the size of the prediction unit and the size of the transformation unit are different when performing the intra prediction, the intra prediction is performed using a reference pixel based on the transformation unit. You can make predictions.
- intra prediction using NxN division may be used only for a minimum coding unit.
- the predictors 230 and 235 may include a prediction unit determiner, an inter predictor, and an intra predictor.
- the prediction unit determiner receives various information such as prediction unit information input from the entropy decoder 210, prediction mode information of the intra prediction method, and motion prediction related information of the inter prediction method, and distinguishes the prediction unit from the current coding unit, and predicts It may be determined whether the unit performs inter prediction or intra prediction.
- a merge prediction region (MER) is used in consideration of parallel processing.
- the method may be applied to the predictors 230 and 235. That is, the present invention provides a parallel prediction region (PER: Parallel) for constructing spatially or temporally adjacent blocks of a current block in consideration of parallel processing in prediction techniques such as inter-screen prediction, intra-picture prediction, and inter-component prediction among video coding techniques.
- PER Parallel
- the inter prediction unit 230 predicts the current prediction based on information included in at least one of a previous picture or a subsequent picture of the current picture including the current prediction unit by using information required for inter prediction of the current prediction unit provided by the image encoder. Inter prediction may be performed on a unit.
- a motion prediction method of a prediction unit included in a coding unit based on a coding unit includes a skip mode, a merge mode, an AMVP mode, and an intra block copy mode. It can be determined whether or not it is a method.
- the intra predictor 235 may generate a prediction block based on pixel information in the current picture.
- intra prediction may be performed based on intra prediction mode information of the prediction unit provided by the image encoder.
- the intra predictor 235 may include an adaptive intra smoothing (AIS) filter, a reference pixel interpolator, and a DC filter.
- the AIS filter is a part of filtering the reference pixel of the current block and determines whether to apply the filter according to the prediction mode of the current prediction unit.
- AIS filtering may be performed on the reference pixel of the current block by using the prediction mode and the AIS filter information of the prediction unit provided by the image encoder. If the prediction mode of the current block is a mode that does not perform AIS filtering, the AIS filter may not be applied.
- the reference pixel interpolator may generate a reference pixel having an integer value or less by interpolating the reference pixel. If the prediction mode of the current prediction unit is a prediction mode for generating a prediction block without interpolating the reference pixel, the reference pixel may not be interpolated.
- the DC filter may generate the prediction block through filtering when the prediction mode of the current block is the DC mode.
- the reconstructed block or picture may be provided to the filter unit 240.
- the filter unit 240 may include a deblocking filter, an offset correction unit, and an ALF.
- Information about whether a deblocking filter is applied to a corresponding block or picture, and when the deblocking filter is applied to the corresponding block or picture, may be provided with information about whether a strong filter or a weak filter is applied.
- the deblocking filter related information provided by the image encoder may be provided and the deblocking filtering of the corresponding block may be performed in the image decoder.
- the offset correction unit may perform offset correction on the reconstructed image based on the type of offset correction and offset value information applied to the image during encoding.
- the ALF may be applied to a coding unit based on ALF application information, ALF coefficient information, and the like provided from the encoder. Such ALF information may be provided included in a specific parameter set.
- the memory 245 may store the reconstructed picture or block to use as a reference picture or reference block, and may provide the reconstructed picture to the output unit.
- 3 is an embodiment of a block partitioning method of any form using one line segment.
- FIG. 3 illustrates an embodiment of dividing a block into two different arbitrary block types using one line segment.
- one block 300 may be configured as a square or non-square block.
- the square or non-square block may be recursively divided into various tree shapes, quad-tree block partition, binary-tree block partition, and ternary-tree. -tree) can be partitioned using block partitioning.
- One block shown in FIG. 3 may be divided into two different arbitrary block types using line segments having a specific angle and distance with respect to the origin of the block.
- an angle and a distance which are parameters for representing a line segment, based on the origin of the block, may be signaled in units of blocks.
- FIG. 4 is a diagram illustrating examples of various arbitrary block divisions that can be generated using square and rectangular block divisions and left and right diagonal segments.
- FIG. 4 illustrates an example in which a block may be divided into various arbitrary forms using only left or right diagonal line segments in a block division structure using various tree types.
- 400 blocks are divided into four blocks of 410, 420, 430, and 440 using the first quad-tree block partition.
- 410 blocks which are the first blocks among the blocks divided by the quad-tree block division, do not use additional subtree block division, and are divided into two triangular blocks using diagonal line segments proposed by the present invention.
- the block 410 indicates that the block is divided using a diagonal line connecting the upper right side and the lower left side of the block.
- 420 blocks which are the second blocks among the blocks divided by the quad-tree block division, are divided into 421, 422, 423, and 424 blocks using quad-tree block division as an additional lower tree block division.
- the 421 block which is the first lower block of the 420 block, shows an example of dividing into two triangular blocks using a diagonal line segment proposed in the present invention without using additional subtree block partitioning.
- block 421 indicates that a block is divided using a diagonal line connecting an upper left end and a lower right end of the block.
- the fourth block 424 of the lower blocks of the second block 420 is again divided into two non-square blocks 425 and 426 using binary-tree block partitioning.
- block 425 represents an embodiment divided into two triangular blocks using a diagonal line segment proposed by the present invention, and shows that the block is divided by using a diagonal line connecting the upper left and lower right ends.
- the 430 block which is the third block among the blocks divided by the quad-tree block partition, is divided into 431 and 432 blocks using binary-tree block partitioning as an additional lower tree block partition.
- the 432 block which is the second lower block of the 430 block, represents an embodiment divided into two triangular blocks using a diagonal line segment proposed by the present invention, without using an additional lower tree block partition.
- the block is divided using a diagonal line connecting the lower left corner with the lower left corner.
- the 440 block which is the fourth block among the blocks divided by the quad-tree block division, is divided into 441 and 441 blocks using binary-tree block division as an additional lower tree block division.
- 441 blocks which are the first sub-blocks of 440 blocks, are divided into 443 and 444 blocks using binary-tree block partitioning, and 443 blocks are divided into ternary-tree block partitioning. It is divided into 445, 446, and 447 blocks.
- 447 blocks among the blocks partitioned using the ternary-tree block partition are shown by dividing the block into two triangular blocks using a diagonal line connecting the upper left and lower right ends.
- the quad-tree block proposed in the present invention.
- various block partitioning structures such as a partitioning structure, a binary-tree block partitioning structure, and a ternary-tree block partitioning structure, any block partitioning form using various angles and distances shown in FIG. It is possible to express the block division form of.
- FIG. 5 shows a method of dividing a square or non-square block into a triangular form using two types of diagonal lines proposed by the present invention.
- FIG. 5A illustrates an example of dividing a square or non-square block into two triangle blocks by using a diagonal line connecting the upper left and lower right ends of the block.
- FIG. 5B illustrates an example of dividing a square or non-square block into two triangle blocks by using a diagonal line connecting a right upper end and a lower left end of the block.
- Two triangular blocks divided in (a) and (b) of FIG. 5 may be referred to as a first partition and a second partition according to a position.
- a diagonal line connecting an upper left end and a lower right end of a block may be expressed by Equation 1 using the upper left position of the block as an origin.
- Equation 1 w means the width of the block, h means the height of the block.
- a diagonal line connecting the upper right end and the lower left end of the block may be expressed by Equation 2 using the upper left position of the block as an origin.
- Equation 2 w means the width of the block, h means the height of the block.
- a flag indicating whether diagonal motion division is performed for one coding unit (CU) may be used. For example, when the flag is the first value, diagonal motion division may be performed. Otherwise, diagonal motion division may not be performed.
- the flag may be encoded and signaled by the encoding apparatus or may be derived from the decoding apparatus based on a predetermined encoding parameter.
- the encoding parameter may include a slice type, an inter mode type, a block size / type, a ratio of a width and a height of a block, and the like.
- the flag may be set to the first value only when the slice type to which the current block belongs is a B slice.
- the flag may be set to the first value only when the inter mode of the current block is the merge mode.
- the flag may be set to the first value only when at least one of the width or height of the current block is greater than or equal to a predetermined threshold size.
- the threshold size may be 4, 8, 16 or more.
- the flag may be set to the first value only when the number W * H of the samples belonging to the current block is greater than or equal to a predetermined threshold number.
- the threshold number may be 32, 64 or more.
- the flag may be set to the first value only when the ratio of the width and height of the current block is smaller than a predetermined threshold.
- the threshold value may be 4, 8 or more.
- direction information of the diagonal motion division may be signaled.
- the current block may be divided based on a diagonal line connecting the upper left and right lower ends, otherwise, the current block may be divided based on the diagonal line connecting the upper right and lower left ends.
- the information for the diagonal motion division may not be signaled according to the mode information of the current block.
- the mode information of the current block is an intra prediction mode, it is not signaled.
- the information for the diagonal motion division may not be signaled according to the size information of the current block.
- the size information of the current block may be defined as a width or height size, a ratio of width and height, a product of width and height, and the like. For example, if the width or height of the current block is less than 8, the current block may not use diagonal motion division.
- the diagonal motion division may not be used.
- This particular ratio means that if the ratio of width to height is greater than 1: 3 or 3: 1 (eg, 1: 4 or 4: 1, 1: 8 or 8: 1), then the current block does May not be used. In such a case, the diagonal motion partitioning information is not signaled and the syntax is not parsed.
- FIG. 6 is a diagram illustrating a concept of a mask for segmentation in a circle or ellipse form among masks for mask-based motion prediction and compensation proposed in the present invention.
- one block may be divided into two different regions using a mask including a part of a circle or an ellipse. It is shown.
- the boundary may be smoothed by adjusting the weight at the boundary of the division, and the weighted prediction method described below may be applied in the same or similar manner.
- a corresponding block in a reference picture is selected using motion information of a spatial or temporally adjacent block of the current block, and a shape of a mask is determined based on the pixel configuration of the corresponding block. You can decide.
- using the pixel configuration of the corresponding block may mean configuring a mask to detect an edge inside the block and divide the block based on the edge.
- FIG. 7 is a diagram illustrating a form in which encoding information of a corresponding block is stored when diagonal motion division is used.
- encoding information may be stored by dividing the block into at least one of a horizontal direction and a vertical direction, rather than a diagonally divided form. .
- the information in the encoding information of the corresponding block, the information may be stored in a diagonally divided form.
- many video coding methods and apparatuses do not store encoding information in units of pixels, but store encoding information such as motion information in units of blocks having a specific size, such as 4x4 blocks or 8x8 blocks.
- the encoded information stored as described above may be used as reference information of a block that is encoded / decoded after the current block.
- the first partition may be referenced based on the upper right position of the current block, and the second partition may be referenced based on the lower left position of the current block.
- the first partition may be referred to based on the upper left position of the current block, and the second partition may be referred to based on the lower right position of the current block.
- an additional motion vector storing process may be included in storing the motion vectors of the first partition and the second partition in the motion vector buffer.
- the process of extracting motion from the motion vector buffer may include a process of inferring a prediction direction.
- each partition may have only a motion vector in the L0 or L1 direction.
- the additional motion vector storing process may mean a process of constructing a motion vector for bidirectional prediction using the motion vector of each partition.
- the motion vector for bidirectional prediction includes a motion vector in the L0 direction and a motion vector in the L1 direction.
- a process of inferring the prediction direction of each partition from the signaling information for the motion vector prediction may be included.
- the first partition may be forced to the motion vector in the L0 direction and the second partition may be forced to the motion vector in the L1 direction and stored in the motion vector buffer.
- the first partition may be forced to the motion vector in the L1 direction and the second partition may be forced to the motion vector in the L0 direction and stored in the motion vector buffer.
- Forcing to the motion vector in the L0 direction means changing the motion vector in the L1 direction to the motion vector in the L0 direction when the motion vector of the corresponding partition is in the L0 direction.
- the change may include at least one of inverting a (x, y) coordinate value of a motion vector with respect to an origin or scaling using a predetermined coding parameter (eg, a reference picture index or POC information of a reference picture). It may include.
- a predetermined coding parameter eg, a reference picture index or POC information of a reference picture.
- the signaling information on the motion vector prediction may mean information on which motion vector prediction value mvp is used by the first partition and the second partition in which motion vector prediction list.
- the signaling information for the motion vector prediction may include information transmitted in an index form, and may mean information on which prediction candidates are used by the first partition and the second partition in the motion vector prediction candidates.
- the method may further include determining a partition type of the current block and a motion vector prediction candidate corresponding to both the first partition and the second partition using one index information.
- a method of using the motion vector prediction candidate as it is by using the signaled index information may be used, and further signaling a motion vector difference between the signaled index information and the corresponding motion vector prediction candidate. The method can also be used.
- a method of using the motion vector prediction candidate as it is may be referred to as a SKIP mode or a merge mode, and a method of additionally signaling a motion vector difference with the motion vector prediction candidate is a motion vector prediction mode (AMVP mode). May be referred to collectively.
- AMVP mode motion vector prediction mode
- the motion vector difference may be transmitted to the first partition and the second partition, respectively, or may be transmitted by configuring the first partition and the second partition as one bidirectional differential motion vector.
- one block may be divided into two blocks.
- the motion vector buffer corresponding to the current block may be divided into two parts in consideration of the size information of the current block, and the motion vectors of the first partition and the second partition may be stored in the divided partition. For example, by dividing the width and height of the motion vector buffer corresponding to the current block, the motion vector of the first partition may be stored in some buffers, and the motion vector of the second partition may be stored in some of the buffers.
- the current block when the current block is a 16x8 block, the current block may be divided into two 8x8 blocks, and the motion vector of the first partition and the motion vector of the second partition may be stored in corresponding motion vector buffers, respectively. .
- the sizes of N and M may be compared with respect to the NxM block. If N is large, N may be divided into two and the motion vector may be stored in a motion vector buffer corresponding to each partition. On the other hand, when M is large, M may be divided into two parts, and a motion vector may be stored in a motion vector buffer corresponding to each partition.
- the current block may consist of a plurality of n * m subblocks.
- the subblock may mean a unit for storing the motion vector.
- n and m may be 4 or 8.
- n and m may be fixed values pre-committed to the encoding / decoding apparatus, or may be determined depending on the size / type of the current block.
- the sub-blocks may be square or non-square.
- the current block includes a sub block belonging to only the first partition PART_0 (hereinafter referred to as a first region), a sub block belonging to only the second partition PART_1 (hereinafter referred to as a second region), and a sub located on a diagonal line. It may include a block (hereinafter referred to as a third region).
- the motion vector of the first partition may be stored in the first region
- the motion vector of the second partition may be stored in the second region.
- each partition may have only a motion vector of unidirectional prediction.
- motion vectors mvL0 and mvL1 of bidirectional prediction may be stored in the third region.
- the motion vector of the bidirectional prediction may be generated by a combination of the motion vector mv1 of the first partition and the motion vector mv2 of the second partition.
- a motion vector in the L0 direction among mv1 and mv2 may be assigned to mvL0
- a motion vector in the L1 direction among mv1 and mv2 may be assigned to mvL1.
- Example 1 when the prediction directions of the first partition and the second partition are the same, either of the motion vector mv1 of the first partition or the motion vector mv2 of the second partition is included in the third region. May optionally be stored.
- either the motion vector mv1 of the first partition or the motion vector mv2 of the second partition may be selectively stored according to the partition type of the current block. That is, when the current block is divided diagonally, the motion vector mv1 of the first partition is stored in the third region, otherwise, the motion vector mv2 of the second partition is stored in the third region. Can be. Conversely, when the current block is divided diagonally to the left, the motion vector mv2 of the second partition is stored in the third region, otherwise, the motion vector mv1 of the first partition is stored in the third region. Can be.
- only the motion vector mv1 of the first partition may be stored in the third area, or only the motion vector mv1 of the second partition may be stored.
- the motion vector to be stored in the third region may be determined in consideration of at least one of encoding information of the first partition and / or the second partition or the partition type of the current block.
- the encoding information may be a reference picture index, a merge candidate index, a size value of a vector, and the like.
- the motion vector of the partition corresponding to the minimum value among the reference picture index refIdx1 of the first partition and the reference picture index refIdx2 of the second partition may be selectively stored.
- a motion vector of a partition corresponding to the maximum value of two reference picture indices may be selectively stored.
- a process of storing a motion vector of either the first partition or the second partition according to the diagonal division direction This can be done.
- a process of selectively storing the encoded information using encoding information may be performed, such as case comparison between merge candidate indexes. have.
- a process of converting the prediction direction may be further performed.
- both the first partition and the second partition are L0 predictions
- one of two L0 motion vectors may be allocated to mvL0 and the other may be allocated to mvL1.
- both the first partition and the second partition are L1 prediction
- one of two L1 motion vectors may be allocated to mvL1 and the other may be allocated to mvL0.
- a motion vector for bidirectional prediction may be generated through the transformation of the prediction direction, and stored in the third region.
- the motion compensation method of the aforementioned diagonal motion division will be described.
- the following embodiment will be described on the premise that the current block is encoded in the merge mode, but this does not limit the type of the inter mode, and the same may be applied to the SKIP mode, the AMVP mode, the affine mode, and the like. .
- FIG. 8 illustrates a merge mode based inter prediction method according to an embodiment to which the present invention is applied.
- a merge candidate list of the current block may be configured (S800).
- the current block may be determined through at least one of the block division methods described with reference to FIGS. 3 to 6.
- the current block may have a square or non-square shape.
- the current block can be divided into two partitions through diagonal partitioning. At least one of the two partitions may be a triangular partition.
- the merge candidate list may include at least one of a spatial merge candidate or a temporal merge candidate of the current block.
- the motion information of the spatial merge candidate may be derived from the motion information of the spatial neighboring block of the current block.
- the spatial neighboring block is a block belonging to the same picture as the current block and may mean a block adjacent to the current block.
- the spatial neighboring block may include a block adjacent to at least one of a left side, an upper end, an upper right end, a lower left end, or an upper left end of the current block.
- the upper left neighboring block may be used only when at least one of the blocks adjacent to the left, the upper, the upper right and the lower left is not available.
- the motion information of the temporal merge candidate may be derived from the motion information of the temporal neighboring block of the current block.
- the temporal neighboring block is a block belonging to a picture different from the current block and may be defined as a block having the same position as the current block.
- the block of the same position is at least one of a block BR adjacent to the lower right corner of the current block, a block CTR including the position of the center sample of the current block, or a block TL including the position of the upper left sample of the current block. It can mean one.
- the block of the same position may mean a block including a position shifted by a predetermined disparity vector from the position of the upper left sample of the current block.
- the disparity vector may be determined based on any one of the motion vectors of the spatial neighboring block described above.
- the disparity vector may be set to the motion vector of the left neighboring block or may be set to the motion vector of the upper neighboring block.
- the disparity vector may be determined based on a combination of at least two of the motion vectors of the spatial neighboring block described above. The combination may include a calculation process such as a maximum value, a minimum value, a median value, a weighted average value, and the like.
- the disparity vector may be set to an intermediate value or an average value between the motion vector of the left neighboring block and the motion vector of the lower left neighboring block.
- the motion vector and the reference picture index of the temporal merge candidate may be respectively derived from the motion vector and the reference picture index of the temporal neighboring block described above.
- the motion vector of the temporal merge candidate is derived from the motion vector of the temporal neighboring block, and the reference picture index of the temporal merge candidate is set to a default value (eg, 0) pre-committed to the decoding apparatus regardless of the temporal neighboring block. Can be.
- a method of generating a merge candidate list based on a spatial / temporal merge candidate will be described in detail with reference to FIGS. 9 and 10.
- the merge candidate list may further include at least one of a combined merge candidate or a merge candidate having zero motion vectors.
- the combination merge candidate may be derived by combining n merge candidates belonging to the pre-generated merge candidate list.
- n may be an integer of 2, 3, 4 or more.
- the number n of merge candidates to be combined may be a fixed value pre-committed to the encoding / decoding apparatus, or may be encoded and signaled by the encoding apparatus. The signaling may be performed in at least one unit of a sequence, a picture, a slice, a tile, a sub-tile (brick), or a predetermined block.
- the number n of merge candidates to be combined may be variably determined based on the number of remaining merge candidates.
- the number of residual merge candidates may mean a difference between the maximum number of merge candidates included in the merge candidate list and the current number of merge candidates included in the merge candidate list.
- the maximum number may be a number pre-committed to the encoding / decoding apparatus, or may be encoded and signaled by the encoding apparatus.
- the current number may mean the number of merge candidates configured before adding the combined merge candidate. For example, when the number of remaining merge candidates is 1, two merge candidates are used, and when the number of remaining merge candidates is larger than 1, three or more merge candidates may be used.
- the n merge candidates may be determined in consideration of the prediction direction of each merge candidate in the merge candidate list. For example, among merge candidates included in the merge candidate list, only merge candidates that are bidirectional prediction may be selectively used, or only merge candidates that are unidirectional prediction may be selectively used.
- the combined merge candidate may be derived using both the spatial merge candidate and the temporal merge candidate, or may be derived using only either the spatial merge candidate or the temporal merge candidate.
- the combined merge candidate may be added after the spatial / temporal merge candidate in the merge candidate list. That is, the index of the combined merge candidate may be larger than the index of the spatial / temporal merge candidate.
- the combined merge candidate may be added between the spatial merge candidate and the temporal merge candidate in the merge candidate list. That is, the index of the combined merge candidate may be larger than the index of the spatial merge candidate and smaller than the index of the temporal merge candidate.
- the position of the combined merge candidate may be variably determined in consideration of the prediction direction of the combined merge candidate. Depending on whether the prediction direction of the combined merge candidate is bidirectional prediction, the positions of the combined merge candidates in the merge candidate list may be rearranged. For example, if the combined merge candidate is bidirectional prediction, an index smaller than the spatial or temporal merge candidate may be assigned, otherwise an index larger than the spatial or temporal merge candidate may be assigned.
- the motion information of the combined merge candidate may be derived by weighted average of the motion information of the first merge candidate and the second merge candidate.
- the reference picture index in the LX direction of the combined merge candidate may be derived as the reference picture index in the LX direction of either the first merge candidate or the second merge candidate.
- the reference picture index in the LX direction of the combined merge candidate may be derived using only the reference picture index in the LX direction of the first merge candidate.
- the first merge candidate may have a smaller index than the second merge candidate.
- the weighted weighted average is [1: 1], [1: 2], [1: 3], or [2: 3], but is not limited thereto.
- the weight may be pre-defined in the encoding / decoding apparatus or may be derived in the decoding apparatus. In this case, the weight may be derived in consideration of at least one of the distance between the current picture and the reference picture of the merge candidate or the prediction direction of the merge candidate.
- the motion information in the L0 direction of the combined merge candidate may be derived from the motion information in the L0 direction of the first merge candidate, and the motion information in the L1 direction may be derived from the motion information in the L1 direction of the second merge candidate.
- the motion information in the L0 direction of the combined merge candidate may be derived from the motion information in the L0 direction of the second merge candidate, and the motion information in the L1 direction may be derived from the motion information in the L1 direction of the first merge candidate.
- the above-described motion information includes at least one of a prediction direction flag, a reference picture index, or a motion vector, and may be similarly interpreted in the following embodiments.
- motion information of the current block may be derived from the merge candidate list (S810).
- a merge index of the current block may be signaled.
- the merge index may be information encoded to specify any one of a plurality of merge candidates included in the merge candidate list.
- the merge index may be signaled based on a flag indicating whether motion compensation based on diagonal motion division is performed. For example, if the flag indicates that motion compensation based on diagonal motion partitioning is to be performed (ie, the flag is a first value), the merge index is signaled, otherwise the merge index will not be signaled. Can be.
- the flag is as described with reference to FIG. 5, and a detailed description thereof will be omitted.
- the merge index may be signaled for each of the first partition and the second partition of the current block (Example 1).
- the motion information of the first partition may be derived using motion information of the merge candidate having the same index as the merge index mergeIdx1 of the signaled first partition.
- mergeIdx2 mergeIdx2
- one merge index mergeIdx may be signaled for the current block (Embodiment 2). That is, the first partition and the second partition belonging to the current block may share the signaled mergeIdx. Based on the motion information of the merge candidate specified by mergeIdx, motion information of the first and second partitions may be derived.
- the merge candidate when the merge candidate specified by mergeIdx is bidirectional prediction, the merge candidate may have motion information in the L0 direction and motion information in the L1 direction.
- the motion information of the first partition may be derived from one of the motion information in the L0 direction and the motion information in the L1 direction
- the motion information of the second partition may be derived from the other.
- the motion information of the first partition is derived to the motion information of the merge candidate specified by mergeIdx
- the motion information of the second partition is (mergeIdx + k It can be derived to the motion information of the merge candidate specified by.
- k may be an integer having an absolute value of 1, 2, 3, or more.
- the motion information of the first and second partitions may be derived based on the motion information of the pre-committed merge candidate in the merge candidate list (Embodiment 3).
- signaling of the merge index may be omitted.
- the pre-committed merge candidate may be a merge candidate with an index of zero.
- the motion information of the first and second partitions may be derived in consideration of whether a merge candidate having an index of 0 is bidirectional prediction, which is the same as that of the second embodiment.
- the pre-committed merge candidate may be a merge candidate having the smallest index among the merge candidates that are bidirectional predictions.
- the merge candidate that is bidirectional prediction may include at least one of a spatial / temporal merge candidate or a combined merge candidate.
- the motion information of the first and second partitions is derived based on the motion information of the merge candidate, which is bidirectional prediction, as described above in the second embodiment.
- the partition-specific motion information may be derived based on any one of the above-described embodiments 1 to 3, and the motion information for each partition may be derived based on a combination of at least two of the embodiments 1-3.
- the partition may be forced to perform only one-way prediction (Example 4).
- the motion information of the first partition is motion information in the L0 direction of the merge candidate corresponding to mergeIdx1. It can be derived using.
- the merge candidate corresponding to mergeIdx1 may not have the motion information in the L0 direction.
- motion information of the first partition may be derived using motion information of the merge candidate in the L1 direction.
- the merge index (mergeIdx1) of the first partition is an odd number (eg, 1, 3, 5, etc.)
- the motion information of the first partition is derived by using the motion information in the L1 direction of the merge candidate corresponding to mergeIdx1. Can be.
- the merge candidate corresponding to mergeIdx1 may not have the motion information in the L1 direction.
- motion information of the first partition may be derived using motion information in the L0 direction of the corresponding merge candidate.
- the merge index mergeIdx1 may be encoded and signaled by the encoding apparatus, may be derived based on the signaled merge index, or may be pre-committed to the decoding apparatus.
- the motion information of the second partition may be derived in the same manner as the above-described first partition, and a detailed description thereof will be omitted.
- the motion information of the first partition may be obtained by using the motion information in the L1 direction of the merge candidate corresponding to mergeIdx1. If not, the motion information of the first partition may be derived using motion information in the L0 direction of the merge candidate corresponding to mergeIdx1.
- the partition may be forced to perform only unidirectional prediction according to the position of the diagonal motion partitioned partition (Example 5).
- the first partition of the current block is forced to refer only to motion information in the L0 direction of the merge candidate specified by the merge index mergeIdx1, and the second partition is configured to refer to the merge candidate specified by the merge index mergeIdx2. It may be forced to refer to only motion information in the L1 direction.
- the merge candidate specified by mergeIdx1 does not have the motion information in the L0 direction (ie, L1 prediction)
- the motion information in the L1 direction of the merge candidate may be referred to.
- the merge candidate specified by mergeIdx2 does not have the motion information in the L1 direction (ie, L0 prediction)
- the motion information in the L0 direction of the merge candidate may be referred to.
- unidirectional prediction may be forced, which will be described in detail with reference to FIGS. 11 to 13.
- inter prediction of the current block may be performed using the derived motion information (S820).
- the prediction pixel of the current block is obtained using either the first reference block P P0 specified by the motion vector of the first partition or the second reference block P P1 specified by the motion vector of the second partition.
- the prediction pixel may be obtained by applying a predetermined weight to the first pixel in the first reference block and the second pixel in the second reference block.
- the first pixel and the second pixel may be at the same position as the prediction pixel.
- the prediction block P CUR of the current block according to the present invention may be obtained as in Equation 3 below.
- P P0 and P P1 mean prediction blocks predicted by different movements
- MASK P0 (x, y) and MASK P1 (x, y) represent weights at coordinates (x, y)
- MASK P0 ( The sum of x, y) and MASK P1 (x, y) should be equal to 2 shift .
- the offset may be 0 or 2 (shift-1) .
- the pixel P (x1, y1) positioned on the diagonal is predicted by weighting the pixel at the position (x1, y1) in the first partition and the pixel at the position (x1, y1) in the second partition.
- the peripheral pixel P (x2, y2) of the pixel P (x1, y1) is predicted by weighting the pixel at the position (x2, y2) in the first partition and the pixel at the position (x2, y2) in the second partition.
- the weighted prediction will be described in detail with reference to FIGS. 14 to 17.
- the current block 900 is a block partitioned using left diagonal motion partitioning, and may include a first partitioning block 910 and a second partitioning block 920.
- the first partition block 910 and the second partition block 920 may perform independent motion prediction and motion compensation, respectively, and generate one prediction block corresponding to the current block 900.
- the first partition block 910 and the second partition block 920 may use the merge candidate list generated based on the square or non-square current block as shown in FIG. 8.
- the first division block 910 and the second division block 920 have different movement characteristics, and therefore, the first division block 910 and the second division.
- Block 920 may use different motion prediction and merge candidates.
- the first partition block 910 may have low correlation with the left neighboring blocks A0 and A1 of the spatial neighboring blocks of the current block 900.
- A0 and A1 may be excluded.
- motion information of the temporal neighboring block C0 corresponding to the lower right position of the current block 900 and motion information of the temporal neighboring block C4 corresponding to the upper right position of the current block 900 may be used as additional merge candidates. Can be.
- merge candidates included in the merge candidate list are combined to form an additional merge candidate list, and merge candidates having zero motion vectors are also used. You can create a list.
- the second split block 920 may have a low correlation with the upper neighboring blocks B0 and B1 among the spatial neighboring blocks of the current block 900.
- B0 and B1 may be excluded.
- motion information of the temporal neighboring block C0 corresponding to the lower right position of the current block 900 and motion information of the temporal neighboring block C5 corresponding to the upper left position of the current block 900 may be used as additional merge candidates. Can be.
- merge candidates included in the merge candidate list are combined to form an additional merge candidate list, and merge candidates having zero motion vectors are also used. You can create a list.
- the current block 1000 is a block partitioned using right diagonal motion partitioning and may include a first partitioning block 1010 and a second partitioning block 1020.
- the first divided block 1010 and the second divided block 1020 may perform independent motion prediction and motion compensation, respectively, and generate one prediction block corresponding to the current block 1000.
- the first split block 1010 and the second split block 1020 may use the merge candidate list generated based on the square or non-square current block, as shown in FIG. 8.
- the first division block 1010 and the second division block 1020 have different movement characteristics, and thus, the first division block 1010 and the second division.
- Block 1020 may use different motion prediction and merge candidates.
- the first partition block 1010 may have low correlation with the lower left neighboring block A0 and the upper right neighboring block B0 among the spatial neighboring blocks of the current block 1100.
- the lower left and upper right neighboring blocks A0 and B0 may be excluded.
- the merge candidate may further include motion information of the temporal neighboring block C2 located in the upper left corner based on the center position of the current block 1000 and motion information of the temporal neighboring block C3 corresponding to the upper left position of the current block 1000. Can be used as
- merge candidates included in the merge candidate list are combined to form an additional merge candidate list, and merge candidates having zero motion vectors are also used. You can create a list.
- the second partition block 1020 is different from the left neighboring block A1, the upper neighboring block B1, and the upper left neighboring block B2 among the spatial neighboring blocks of the current block 1000. It can be seen that the correlation is low. In this case, in constructing the merge candidate list of the second division block 1020, A1, B1, and B2 may be excluded. In addition, the motion information of the temporal neighboring block C0 corresponding to the lower right end position of the current block 1000, the temporal neighboring block C1 corresponding to the center position of the current block 1000, and the lower right end of the current block 1000. The motion information of the temporal neighboring block C6 corresponding to the position may be used as an additional merge candidate.
- merge candidates included in the merge candidate list are combined to form an additional merge candidate list, and merge candidates having zero motion vectors are also used. You can create a list.
- FIG. 11 is a diagram illustrating a concept of limited mask-based motion prediction and compensation as an embodiment to which the present invention is applied.
- FIG. 11 illustrates a method of limiting prediction directions according to specific conditions in order to reduce memory bandwidth in performing mask-based motion prediction and compensation proposed by the present invention.
- the prediction direction of the current block may be limited.
- the threshold size may be 4, 8, 16 or more.
- the prediction direction of the current block may be limited.
- the predetermined inter mode may include at least one of a SKIP mode, a merge mode, an AMVP mode, or an affine mode.
- the inter mode may be restricted according to each partition of the current block. For example, when the first partition PART_0 is coded in the merge mode, the second partition PART_1 may be restricted from being coded in the merge mode. However, when the first partition PART_0 is not coded in the merge mode, the second partition PART_1 may be coded in the merge mode.
- the method of limiting the prediction direction may be applied.
- At least one of the two pieces of motion information may be motion information for performing unidirectional prediction.
- FIGS. 12 and 13 are diagrams illustrating the concept of a method of performing prediction by dividing one coding block into two partitions using one straight line according to an embodiment to which the present invention is applied.
- FIG. 12 is a diagram illustrating triangulation of one square or non-square block based on a straight line connecting an upper right and a lower left in an embodiment in which one coding block is divided into two partitions using one straight line.
- the present invention may include not only a triangular shape divided based on a straight line connecting the upper right and the lower left, but also a triangular shape divided based on a straight line connecting the upper left and the lower right.
- the present invention includes a case in which one coding block is divided into two in the vertical / horizontal direction, wherein the two divided partitions may have the same width or height.
- the present invention may include a case where one coding block is divided into two partitions based on an arbitrary straight line.
- the current block 1201 is divided based on a straight line connecting the upper right and lower left.
- each partition may perform bidirectional prediction or unidirectional prediction.
- the present invention proposes a method and apparatus for performing prediction with reference to reference pictures in different directions, in case of being forced to use only unidirectional prediction. That is, when the first partition performs the L0 prediction, the second partition may essentially perform the L1 prediction. Referring to FIG. 12, a first partition performs prediction with reference to a reference picture in the L0 direction, and a second partition performs prediction in performing a prediction with reference to a reference picture in the L1 direction.
- a first partition performs prediction with reference to a reference picture in the L1 direction
- a second partition performs prediction with reference to a reference picture in the L0 direction.
- first partition and the second partition refer to different reference picture lists.
- the present invention may also include performing prediction by referring to a reference picture list in which the first partition and the second partition are the same.
- 14 to 17 illustrate a weighted prediction method for a diagonal motion-divided current block according to an embodiment to which the present invention is applied.
- a weight 2 may be applied to a pixel position belonging to a first partition, and a weight 1 may be applied to a pixel position positioned on a diagonal line.
- a weight of 0 may be applied to the pixel position belonging to the second partition, the effect of not referring to the pixel of the corresponding position can be obtained.
- the weight 2 may be applied to the pixel position belonging to the second partition, and the weight 1 may be applied to the pixel position positioned on the diagonal. However, a weight of 0 may be applied to pixel positions belonging to the first partition.
- the two equally-positioned pixels to which the weight is applied may be summed and divided by the sum of the weights to obtain a final prediction pixel.
- the aforementioned method may be referred to as a mask-based motion prediction and compensation method, and may be extended to various types of masks.
- Equation 4 When the diagonal motion division shown in FIG. 14 is configured in the form of a mask, Equation 4 is obtained.
- the MASK P0 refers to the weight of the first partition
- the MASK P1 refers to the weight of the second partition.
- Equation 5 The equation for generating the final prediction block using the weight of Equation 4 is shown in Equation 5.
- P DMP (x, y) (P P0 (x, y) x MASK P0 (x, y) + P P1 (x, y) x MASK P1 (x, y)) >> shift
- P DMP denotes a final square or non-square prediction block obtained by using diagonal motion division
- P P0 (x, y) is a reference pixel corresponding to a position (x, y) in the first partition
- P P1 ( x, y) means reference pixels corresponding to (x, y) positions in the second partition, respectively.
- Shift is the final shift value according to the weight. Since the weight shown in FIG. 14 is 2, the shift value is 1.
- the sum of the weights for each position of the mask generated by the diagonal proposed by the present invention is not limited to 2, and may be an exponential power of 2, such as 4, 8, 16, 32, and the like.
- the adjacent region may include two pixels adjacent in the top or left direction and two pixels adjacent in the bottom or right direction from pixels positioned on a diagonal line.
- pixels P (x1, y1) (hereinafter, referred to as a first region) positioned diagonally may include pixels at a position (x1, y1) in a reference block of the first partition and (x1, y1) in a reference block of the second partition.
- y1) can be predicted by applying a weight of 4 to each pixel at position y1).
- the upper or left neighboring pixel P (x2, y2) (hereinafter referred to as the second region) of the first region applies a weight of 6 to the pixel at the position (x2, y2) in the reference block of the first partition, and It can be predicted by applying the weight 2 to the pixel at the position (x2, y2) in the reference block of the two partitions.
- the upper or left neighboring pixel P (x3, y3) (hereinafter referred to as the third region) of the second region applies a weight of 7 to the pixel at the position (x3, y3) in the reference block of the first partition, and It can be predicted by applying the weight 1 to the pixel at the position (x3, y3) in the reference block of the two partitions.
- the remaining pixels P (x4, y4) except for the first to third regions are applied with a weight of 8 to the pixel at the position (x4, y4) in the reference block of the first partition, and It can be predicted by applying a weight of 0 to the pixel at the position (x4, y4) in the reference block.
- the lower or right neighboring pixels P (x5, y5) (hereinafter referred to as a fourth region) of the first region apply weight 2 to the pixel at the position (x5, y5) in the reference block of the first partition. It can be predicted by applying the weight 6 to the pixel at the position (x5, y5) in the reference block of the second partition.
- the lower or right neighboring pixel P (x6, y6) (hereinafter referred to as a fifth region) of the fourth region applies a weight of 1 to the pixel at the position (x6, y6) in the reference block of the first partition, and It can be predicted by applying a weight of 7 to the pixel at the position (x6, y6) in the reference block of the two partitions.
- the remaining pixels P (x7, y7) except for the first region, the fourth region, and the fifth region apply a weight of 0 to the pixel at the position (x7, y7) in the reference block of the first partition, It can be predicted by applying a weight of 8 to the pixel at the position (x7, y7) in the reference block of the second partition.
- FIG. 16 illustrates an embodiment in which the sum of weights applied to each pixel is 32 and may be used when the block is sharper than in the embodiment in which the sum of the weights shown in FIG. 15 is 8.
- different weights may be applied to predetermined regions, which have been described in detail with reference to FIG. 15, and thus a detailed description thereof will be omitted.
- the weight is any one of [32: 0], [31: 1], [28: 4], [16:16], [4:28], [31: 1], or [0:32]. It can be selectively used according to the position of the pixel.
- the adjacent region may include one pixel adjacent in the top or left direction and one pixel adjacent in the bottom or right direction from the pixels positioned on the diagonal.
- different weights may be applied to predetermined regions, which have been described in detail with reference to FIG. 15, and thus a detailed description thereof will be omitted.
- the weight may be any one of [8: 0], [6: 2], [4: 4], [2: 6], or [0: 8], which may be selectively used according to the position of the pixel.
- various embodiments of the present disclosure may be implemented by hardware, firmware, software, or a combination thereof.
- one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), General Purpose It may be implemented by a general processor, a controller, a microcontroller, a microprocessor, and the like.
- scope of the disclosure include software or machine-executable instructions (eg, an operating system, an application, firmware, a program, etc.) to cause an operation in accordance with various embodiments of the method to be executed on an apparatus or a computer, and such software or Instructions, and the like, including non-transitory computer-readable media that are stored and executable on a device or computer.
- software or machine-executable instructions eg, an operating system, an application, firmware, a program, etc.
- the present invention can be used to encode / decode a video signal.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (10)
- 현재 블록의 머지 후보 리스트를 구성하는 단계;상기 머지 후보 리스트 및 머지 후보 인덱스를 기반으로, 상기 현재 블록의 움직임 정보를 유도하는 단계; 및상기 유도된 움직임 정보를 기반으로, 상기 현재 블록의 인터 예측을 수행하는 단계를 포함하되,상기 현재 블록은, 대각선 움직임 분할에 기초하여 제1 파티션과 제2 파티션으로 분할되고,상기 제1 파티션과 상기 제2 파티션 중 적어도 하나의 형태는 삼각형인, 영상 복호화 방법.
- 제1항에 있어서,상기 머지 후보 인덱스는, 상기 제1 파티션과 상기 제2 파티션 중 적어도 하나에 대해서 시그날링되는, 영상 복호화 방법.
- 제2항에 있어서,상기 머지 후보 인덱스는, 소정의 플래그가 제1 값인 경우에 시그날링되고,상기 플래그는, 상기 현재 블록이 대각선 움직임 분할에 기반한 움직임 보상을 수행하는지 여부를 나타내는, 영상 복호화 방법.
- 제3항에 있어서,상기 플래그의 값은, 소정의 부호화 파라미터에 기초하여 복호화 장치에서 유도되고,상기 부호화 파라미터는, 상기 현재 블록이 속한 슬라이스 타입, 상기 현재 블록의 인터 모드의 타입 또는 상기 현재 블록의 크기 중 적어도 하나를 포함하는, 영상 복호화 방법.
- 제1항에 있어서,상기 머지 후보 리스트는, 공간적 머지 후보, 시간적 머지 후보, 조합 머지 후보 또는 제로 움직임 벡터를 가진 머지 후보 중 적어도 하나를 포함하는, 영상 복호화 방법.
- 제1항에 있어서,상기 파티션의 움직임 정보는, 상기 머지 후보 인덱스에 의해 특정된 머지 후보의 움직임 정보에 기초하여 유도되는, 영상 복호화 방법.
- 제6항에 있어서,상기 머지 후보가 양방향 예측의 움직임 정보를 가지는 경우, 상기 파티션은 단방향 예측의 움직임 정보만을 가지도록 제한되는, 영상 복호화 방법.
- 제1항에 있어서,상기 현재 블록의 화소는, 상기 제1 파티션의 참조 블록에 속한 화소와 상기 제2 파티션의 참조 블록에 속한 화소에 소정의 가중치(m 및 n)를 적용하여 예측되는, 영상 복호화 방법.
- 제8항에 있어서,상기 m과 n은, 0, 1, 2, 4, 6, 7, 8, 16, 28, 31 또는 32 중 어느 하나이며,상기 m과 n의 합은, 2, 8 또는 32 중 어느 하나인, 영상 복호화 방법.
- 제9항에 있어서,상기 가중치는, 상기 현재 블록 내 화소의 위치에 기초하여 결정되는, 영상 복호화 방법.
Priority Applications (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA3141352A CA3141352A1 (en) | 2018-06-27 | 2019-06-27 | Video encoding/decoding method and apparatus |
KR1020217002752A KR20210016634A (ko) | 2018-06-27 | 2019-06-27 | 영상 부호화/복호화 방법 및 장치 |
CN202311708295.3A CN117499639A (zh) | 2018-06-27 | 2019-06-27 | 图像编码/解码方法和用于发送图像信息的数据的方法 |
CN201980043307.0A CN112385231B (zh) | 2018-06-27 | 2019-06-27 | 图像编码/解码方法和装置 |
CN202311703414.6A CN117499638A (zh) | 2018-06-27 | 2019-06-27 | 对图像进行编码/解码的方法和发送比特流的方法 |
US17/255,625 US11490077B2 (en) | 2018-06-27 | 2019-06-27 | Image encoding/decoding method and apparatus involving merge candidate list and triangular shape partitions |
CN202311703386.8A CN117615128A (zh) | 2018-06-27 | 2019-06-27 | 对图像进行编码/解码的方法和发送比特流的方法 |
CN202311703196.6A CN117499637A (zh) | 2018-06-27 | 2019-06-27 | 对图像进行编码/解码的方法和发送比特流的方法 |
US17/858,929 US20220394241A1 (en) | 2018-06-27 | 2022-07-06 | Image encoding/decoding method and apparatus involving merge candidate list and triangular shape partitions |
Applications Claiming Priority (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20180074255 | 2018-06-27 | ||
KR10-2018-0074255 | 2018-06-27 | ||
KR20180079891 | 2018-07-10 | ||
KR10-2018-0079891 | 2018-07-10 | ||
US201862697982P | 2018-07-13 | 2018-07-13 | |
US62/697,982 | 2018-07-13 | ||
KR20180082348 | 2018-07-16 | ||
KR10-2018-0082348 | 2018-07-16 | ||
KR10-2018-0120959 | 2018-10-11 | ||
KR20180120959 | 2018-10-11 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/255,625 A-371-Of-International US11490077B2 (en) | 2018-06-27 | 2019-06-27 | Image encoding/decoding method and apparatus involving merge candidate list and triangular shape partitions |
US17/858,929 Continuation US20220394241A1 (en) | 2018-06-27 | 2022-07-06 | Image encoding/decoding method and apparatus involving merge candidate list and triangular shape partitions |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020004979A1 true WO2020004979A1 (ko) | 2020-01-02 |
Family
ID=68987347
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2019/007821 WO2020004979A1 (ko) | 2018-06-27 | 2019-06-27 | 영상 부호화/복호화 방법 및 장치 |
Country Status (5)
Country | Link |
---|---|
US (2) | US11490077B2 (ko) |
KR (1) | KR20210016634A (ko) |
CN (5) | CN117499639A (ko) |
CA (1) | CA3141352A1 (ko) |
WO (1) | WO2020004979A1 (ko) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI526056B (zh) * | 2011-04-27 | 2016-03-11 | Jvc Kenwood Corp | A moving picture coding apparatus, a motion picture coding method, a motion picture coding program, a transmission apparatus, a transmission method, a transmission program, a video decoding apparatus, a video decoding method, a video decoding program, a reception device, a reception method, Receiving program |
WO2012147344A1 (ja) | 2011-04-27 | 2012-11-01 | 株式会社Jvcケンウッド | 動画像復号装置、動画像復号方法、及び動画像復号プログラム |
CA3141303A1 (en) * | 2018-05-30 | 2019-12-05 | Digitalinsights Inc. | Image encoding/decoding method and device |
US11695967B2 (en) | 2018-06-22 | 2023-07-04 | Op Solutions, Llc | Block level geometric partitioning |
CN112889289B (zh) * | 2018-10-10 | 2024-08-23 | 三星电子株式会社 | 通过使用运动矢量差分值对视频进行编码和解码的方法以及用于对运动信息进行编码和解码的设备 |
KR20210118154A (ko) | 2019-01-28 | 2021-09-29 | 오피 솔루션즈, 엘엘씨 | 적응형 개수의 영역들을 갖는 기하학적 파티셔닝에서의 인터 예측 |
BR112021014671A2 (pt) | 2019-01-28 | 2021-09-28 | Op Solutions, Llc | Transformada discreta de cosseno de formato adaptativo para particionamento geométrico com um número adaptativo de regiões |
WO2020184979A1 (ko) * | 2019-03-11 | 2020-09-17 | 주식회사 엑스리스 | 영상 신호 부호화/복호화 방법 및 이를 위한 장치 |
CN110312130B (zh) * | 2019-06-25 | 2021-10-15 | 浙江大华技术股份有限公司 | 基于三角模式的帧间预测、视频编码方法及设备 |
WO2023208131A1 (en) * | 2022-04-29 | 2023-11-02 | Mediatek Inc. | Efficient geometric partitioning mode video coding |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013070001A1 (ko) * | 2011-11-08 | 2013-05-16 | 한국전자통신연구원 | 후보 리스트 공유 방법 및 이러한 방법을 사용하는 장치 |
KR20140064944A (ko) * | 2011-09-09 | 2014-05-28 | 엘지전자 주식회사 | 인터 예측 방법 및 그 장치 |
KR20160143583A (ko) * | 2015-06-05 | 2016-12-14 | 인텔렉추얼디스커버리 주식회사 | 움직임 벡터 후보 선택 방법 및 이를 이용하는 영상 부호화/복호화 방법 |
WO2017183751A1 (ko) * | 2016-04-22 | 2017-10-26 | 엘지전자(주) | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2208350A2 (en) * | 2007-10-12 | 2010-07-21 | Thomson Licensing | Methods and apparatus for video encoding and decoding geometrically partitioned bi-predictive mode partitions |
KR102219985B1 (ko) * | 2010-05-04 | 2021-02-25 | 엘지전자 주식회사 | 비디오 신호의 처리 방법 및 장치 |
CN107835420B (zh) * | 2011-10-18 | 2021-05-14 | 株式会社Kt | 视频信号解码方法 |
US9451277B2 (en) * | 2012-02-08 | 2016-09-20 | Qualcomm Incorporated | Restriction of prediction units in B slices to uni-directional inter prediction |
US9591312B2 (en) * | 2012-04-17 | 2017-03-07 | Texas Instruments Incorporated | Memory bandwidth reduction for motion compensation in video coding |
CN106233725B (zh) * | 2014-03-31 | 2019-08-02 | 英迪股份有限公司 | 用于对图像进行解码的装置及其方法 |
CN107113424B (zh) * | 2014-11-18 | 2019-11-22 | 联发科技股份有限公司 | 以帧间预测模式编码的块的视频编码和解码方法 |
KR20170058838A (ko) * | 2015-11-19 | 2017-05-29 | 한국전자통신연구원 | 화면간 예측 향상을 위한 부호화/복호화 방법 및 장치 |
CN108353168B (zh) * | 2015-11-20 | 2023-04-21 | 韩国电子通信研究院 | 对图像进行编/解码的方法和编/解码图像的装置 |
WO2020184979A1 (ko) * | 2019-03-11 | 2020-09-17 | 주식회사 엑스리스 | 영상 신호 부호화/복호화 방법 및 이를 위한 장치 |
US12095984B2 (en) * | 2022-02-07 | 2024-09-17 | Tencent America LLC | Sub-block based constraint on bi-prediction for out-of-boundary conditions |
-
2019
- 2019-06-27 CN CN202311708295.3A patent/CN117499639A/zh active Pending
- 2019-06-27 CN CN202311703414.6A patent/CN117499638A/zh active Pending
- 2019-06-27 CN CN201980043307.0A patent/CN112385231B/zh active Active
- 2019-06-27 CN CN202311703386.8A patent/CN117615128A/zh active Pending
- 2019-06-27 US US17/255,625 patent/US11490077B2/en active Active
- 2019-06-27 CA CA3141352A patent/CA3141352A1/en active Pending
- 2019-06-27 WO PCT/KR2019/007821 patent/WO2020004979A1/ko active Application Filing
- 2019-06-27 CN CN202311703196.6A patent/CN117499637A/zh active Pending
- 2019-06-27 KR KR1020217002752A patent/KR20210016634A/ko not_active Application Discontinuation
-
2022
- 2022-07-06 US US17/858,929 patent/US20220394241A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20140064944A (ko) * | 2011-09-09 | 2014-05-28 | 엘지전자 주식회사 | 인터 예측 방법 및 그 장치 |
WO2013070001A1 (ko) * | 2011-11-08 | 2013-05-16 | 한국전자통신연구원 | 후보 리스트 공유 방법 및 이러한 방법을 사용하는 장치 |
KR20160143583A (ko) * | 2015-06-05 | 2016-12-14 | 인텔렉추얼디스커버리 주식회사 | 움직임 벡터 후보 선택 방법 및 이를 이용하는 영상 부호화/복호화 방법 |
WO2017183751A1 (ko) * | 2016-04-22 | 2017-10-26 | 엘지전자(주) | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
Non-Patent Citations (1)
Title |
---|
YONGJO AHN: "Diagonal motion partitions on top of QTBT block structure", JOINT VIDEO EXPLORATION TEAM (JVET) OF ITU-T SG 16 WP 3, 25 October 2017 (2017-10-25), Macao, CN, pages 1 - 6, Retrieved from the Internet <URL:http://phenix.int-evry.fr/jvet> * |
Also Published As
Publication number | Publication date |
---|---|
CN117499639A (zh) | 2024-02-02 |
CN117499638A (zh) | 2024-02-02 |
KR20210016634A (ko) | 2021-02-16 |
US20210274162A1 (en) | 2021-09-02 |
CN117615128A (zh) | 2024-02-27 |
CN112385231A (zh) | 2021-02-19 |
CN112385231B (zh) | 2024-01-02 |
CA3141352A1 (en) | 2020-01-02 |
US11490077B2 (en) | 2022-11-01 |
US20220394241A1 (en) | 2022-12-08 |
CN117499637A (zh) | 2024-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020004979A1 (ko) | 영상 부호화/복호화 방법 및 장치 | |
WO2018070790A1 (ko) | 영상의 부호화/복호화 방법 및 장치 | |
WO2017222325A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2020009419A1 (ko) | 병합 후보를 사용하는 비디오 코딩 방법 및 장치 | |
WO2018212577A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018066959A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018174593A1 (ko) | 적응적인 화소 분류 기준에 따른 인루프 필터링 방법 | |
WO2016175549A1 (ko) | 비디오 신호의 처리 방법 및 이를 위한 장치 | |
WO2018106047A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019194568A1 (ko) | 어파인 모델 기반의 영상 부호화/복호화 방법 및 장치 | |
WO2018008905A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018236028A1 (ko) | 인트라 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 | |
WO2013002557A2 (ko) | 움직임 정보의 부호화 방법 및 장치, 그 복호화 방법 및 장치 | |
WO2019078581A1 (ko) | 영상 부호화/복호화 방법, 장치 및 비트스트림을 저장한 기록 매체 | |
WO2011021838A2 (en) | Method and apparatus for encoding video, and method and apparatus for decoding video | |
WO2018062880A1 (ko) | 영상 처리 방법 및 이를 위한 장치 | |
WO2017082443A1 (ko) | 영상 코딩 시스템에서 임계값을 이용한 적응적 영상 예측 방법 및 장치 | |
WO2016085231A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2016159610A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018044089A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019231206A1 (ko) | 영상 부호화/복호화 방법 및 장치 | |
WO2016048092A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019182329A1 (ko) | 영상 복호화 방법/장치, 영상 부호화 방법/장치 및 비트스트림을 저장한 기록 매체 | |
WO2018182184A1 (ko) | 부호화 트리 유닛 및 부호화 유닛의 처리를 수행하는 영상 처리 방법, 그를 이용한 영상 복호화, 부호화 방법 및 그 장치 | |
WO2019245261A1 (ko) | 영상 부호화/복호화 방법 및 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19825092 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 20217002752 Country of ref document: KR Kind code of ref document: A |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19825092 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 3141352 Country of ref document: CA |