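WO2020256102A1 - Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program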
- Publication number: WO2020256102A1
- Application number: PCT/JP2020/024137
- Authority: WO (WIPO, PCT)
- Prior art keywords: block, prediction, motion vector, unit, merge
Classifications
- All classifications below fall under H (ELECTRICITY), H04 (ELECTRIC COMMUNICATION TECHNIQUE), H04N (PICTORIAL COMMUNICATION, e.g. TELEVISION), H04N19/00 (Methods or arrangements for coding, decoding, compressing or decompressing digital video signals).
- H04N19/103: Selection of coding mode or of prediction mode
- H04N19/105: Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
- H04N19/11: Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
- H04N19/119: Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
- H04N19/132: Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
- H04N19/139: Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
- H04N19/159: Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
- H04N19/176: Adaptive coding characterised by the coding unit, the unit being an image region, e.g. an object, the region being a block, e.g. a macroblock
- H04N19/423: Implementation details or hardware specially adapted for video compression or decompression, characterised by memory arrangements
- H04N19/513: Processing of motion vectors
- H04N19/52: Processing of motion vectors by encoding by predictive encoding
- H04N19/55: Motion estimation with spatial constraints, e.g. at image or region borders
- H04N19/593: Predictive coding involving spatial prediction techniques
- H04N19/70: Syntax aspects related to video coding, e.g. related to compression standards
Definitions
- The present invention relates to an image coding and decoding technique that divides an image into blocks and performs prediction.
- The image to be processed is divided into blocks, each of which is a set of a predetermined number of pixels, and processing is performed in units of blocks. Coding efficiency is improved by dividing the image into appropriate blocks and by appropriately selecting intra-picture prediction (intra prediction) and inter-picture prediction (inter prediction).
- Patent Document 1 discloses an intra-prediction technique for obtaining a predicted image using decoded pixels adjacent to a block to be encoded / decoded.
- Patent Document 1 uses only the decoded pixels adjacent to the block to be encoded / decoded for prediction, and the prediction efficiency is poor.
- Provided are: a block vector candidate derivation unit that derives block vector candidates for a processing target block in a processing target picture from the coding information stored in the coding information storage memory; a selection unit that selects a selected block vector from the block vector candidates; and a reference position correction unit that corrects the reference position of the reference block referred to by the selected block vector so that it refers to the inside of the referenceable area. Based on the reference position of the reference block, decoded pixels in the processing target picture are acquired from the decoded image memory as the predicted value of the processing target block.
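- The reference position correction described above can be sketched as follows. This is only an illustration: the text does not state how the correction is computed, so the sketch assumes a rectangular referenceable area and a simple clamp. The names `Rect` and `correct_reference_position` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Hypothetical rectangular referenceable area (an assumption of this sketch)."""
    x0: int  # left edge, inclusive
    y0: int  # top edge, inclusive
    x1: int  # right edge, exclusive
    y1: int  # bottom edge, exclusive


def correct_reference_position(block_x, block_y, width, height, bv, area):
    """Return the top-left corner of the reference block after correction.

    The position pointed to by the block vector `bv` is clamped so that the
    whole width x height reference block lies inside `area`. The area is
    assumed to be at least as large as the block.
    """
    ref_x = block_x + bv[0]
    ref_y = block_y + bv[1]
    ref_x = max(area.x0, min(ref_x, area.x1 - width))
    ref_y = max(area.y0, min(ref_y, area.y1 - height))
    return ref_x, ref_y


area = Rect(0, 0, 64, 64)
# A block vector that would point left of the area is pulled back inside it.
corrected = correct_reference_position(32, 32, 8, 8, (-40, -8), area)
# A block vector that already points inside the area is left unchanged.
unchanged = correct_reference_position(32, 32, 8, 8, (-8, -8), area)
```

- With such a correction, every selected block vector yields a usable reference block, so the decoder never reads pixels outside the referenceable area.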
- The image to be encoded / decoded is evenly divided into units of a predetermined size.
- This unit is defined as a tree block.
- The size of the tree block is 128 x 128 pixels, but the size is not limited to this, and any size may be set.
- The processing target tree block (the coding target in the coding process, the decoding target in the decoding process) is switched in raster scan order, that is, from left to right and from top to bottom.
- the inside of each tree block can be further recursively divided.
- The block obtained by recursively dividing the tree block, which becomes the target of coding / decoding, is defined as the coded block.
- tree blocks and coded blocks are collectively defined as blocks. Efficient coding is possible by performing appropriate block division.
- the size of the tree block may be a fixed value previously agreed between the coding device and the decoding device, or the size of the tree block determined by the coding device may be transmitted to the decoding device.
- the maximum size of the tree block is 128x128 pixels, and the minimum size of the tree block is 16x16 pixels.
- the maximum size of the coded block is 64x64 pixels, and the minimum size of the coded block is 4x4 pixels.
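- The raster scan order of tree blocks described above can be sketched as follows. The function name `tree_block_origins` is hypothetical, and the handling of pictures whose size is not a multiple of the tree block size (the last row or column simply starts inside the picture) is an assumption.

```python
def tree_block_origins(pic_width, pic_height, tree_block_size=128):
    """Yield the top-left corner of each tree block in raster scan order.

    Rows are visited top to bottom, and within a row tree blocks are
    visited left to right.
    """
    for y in range(0, pic_height, tree_block_size):
        for x in range(0, pic_width, tree_block_size):
            yield x, y


order = list(tree_block_origins(256, 256))
```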
- <Prediction mode> Intra prediction (MODE_INTRA), which predicts from the processed image signal of the processing target image, and inter prediction (MODE_INTER), which predicts from the image signal of a processed image, are switched for each processing target coded block.
- In the coding process, "processed" refers to an image, image signal, tree block, block, coded block, etc. obtained by decoding a signal whose coding has been completed. In the decoding process, it refers to an image, image signal, tree block, block, coded block, etc. whose decoding has been completed.
- the mode that distinguishes between the intra prediction (MODE_INTRA) and the inter prediction (MODE_INTER) is defined as the prediction mode (PredMode).
- the prediction mode (PredMode) has an intra prediction (MODE_INTRA) or an inter prediction (MODE_INTER) as a value.
- Intra block copy prediction is a process of encoding / decoding the processing target block by referring to decoded pixels in the processing target picture as predicted values. The displacement from the processing target block to the reference pixels is represented by a block vector. Since a block vector refers to the processing target picture, the reference picture is uniquely determined and no reference index is needed. The difference between a block vector and a motion vector is whether the referenced picture is the processing target picture or a processed picture.
- the block vector can be selected from 1-pixel accuracy or 4-pixel accuracy using the adaptive motion vector resolution (AMVR).
- In intra block copy, one of two modes can be selected: a predicted intra block copy mode and a merge intra block copy mode.
- the prediction intra-block copy mode is a mode in which the block vector of the block to be processed is determined from the prediction block vector derived from the processed information and the difference block vector.
- the prediction block vector is derived from the processed block adjacent to the processing target block and the index for identifying the prediction block vector.
- the index and difference block vector for specifying the predicted block vector are transmitted as a bit stream.
- the merge intra-block copy mode is a mode in which the intra-block copy prediction information of the processing target block is derived from the intra-block copy prediction information of the processed block adjacent to the processing target block without transmitting the differential motion vector.
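- The two intra block copy modes above can be sketched as follows. The function names are hypothetical. The `step` parameter models the 1-pixel or 4-pixel accuracy. That the difference is signalled in units of `step` is an assumption of this sketch, and any rounding of the predictor is omitted.

```python
def derive_block_vector(candidates, index, diff=(0, 0), step=1):
    """Predicted intra block copy mode.

    The block vector is the predicted block vector selected by `index`
    plus the transmitted difference block vector, scaled by `step`.
    """
    pred_x, pred_y = candidates[index]
    return pred_x + diff[0] * step, pred_y + diff[1] * step


def derive_merge_block_vector(candidates, index):
    """Merge intra block copy mode: the candidate is used as is, no difference is sent."""
    return candidates[index]


# Hypothetical candidates taken from processed neighbouring blocks.
candidates = [(-16, 0), (0, -8)]
bv_pred = derive_block_vector(candidates, 1, diff=(-2, 1), step=4)
bv_merge = derive_merge_block_vector(candidates, 0)
```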
- L0 prediction (pred_L0) is available for P-slices.
- Pred_L0: L0 prediction
- Pred_L1: L1 prediction
- Pred_BI: bi-prediction
- The L0 prediction (Pred_L0) is an inter-prediction that refers to a reference picture managed by L0.
- The L1 prediction (Pred_L1) is an inter-prediction that refers to a reference picture managed by L1.
- The bi-prediction (Pred_BI) is an inter-prediction in which both L0 prediction and L1 prediction are performed, referring to one reference picture managed in each of L0 and L1.
- Information that identifies L0 prediction, L1 prediction, and bi-prediction is defined as the inter-prediction mode. In the subsequent processing, constants and variables with the subscript LX are assumed to be processed for each of L0 and L1.
- the predicted motion vector mode is a mode in which an index for specifying a predicted motion vector, a differential motion vector, an inter-prediction mode, and a reference index are transmitted to determine inter-prediction information of a block to be processed.
- The predicted motion vector is derived from predicted motion vector candidates and from an index that identifies the predicted motion vector. The candidates are derived from a processed block adjacent to the processing target block, or from a block that belongs to a processed image and is located at or near the same position as the processing target block.
- <Merge mode> In merge mode, the inter-prediction information of the processing target block is derived, without transmitting a differential motion vector or a reference index, from the inter-prediction information of a processed block adjacent to the processing target block, or of a block that belongs to a processed image and is located at or near the same position as the processing target block.
- the processed block adjacent to the processing target block and the inter-prediction information of the processed block are defined as spatial merge candidates.
- A block that belongs to a processed image and is located at or near the same position as the processing target block, together with the inter-prediction information derived from the inter-prediction information of that block, is defined as a temporal merge candidate.
- Each merge candidate is registered in the merge candidate list, and the merge index identifies the merge candidate used in the prediction of the block to be processed.
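- The merge candidate list and merge index described above can be sketched as follows. The type `InterPredInfo`, the function name, the duplicate pruning, and the maximum list size of 6 are assumptions for illustration. The text only states that candidates are registered in a list and selected by the merge index.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InterPredInfo:
    """Hypothetical, simplified inter-prediction information."""
    mv: tuple     # motion vector (x, y)
    ref_idx: int  # reference index


def build_merge_candidate_list(spatial, temporal, max_candidates=6):
    """Register spatial candidates, then temporal candidates, in one list.

    `None` marks an unavailable neighbour. Duplicates are skipped so that
    every merge index identifies distinct inter-prediction information.
    """
    merge_list = []
    for cand in list(spatial) + list(temporal):
        if cand is None or cand in merge_list:
            continue
        merge_list.append(cand)
        if len(merge_list) == max_candidates:
            break
    return merge_list


a = InterPredInfo((1, 0), 0)
b = InterPredInfo((0, 2), 1)
t = InterPredInfo((3, 3), 0)
merge_list = build_merge_candidate_list([a, None, a, b], [t])
merge_idx = 1                      # value that would be decoded from the bit stream
selected = merge_list[merge_idx]   # inter-prediction information used for the block
```

- Because the encoder and decoder build the same list from already processed blocks, only the merge index needs to be transmitted.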
- FIG. 11 is a diagram illustrating a reference block referred to for deriving inter-prediction information in the predicted motion vector mode and the merge mode.
- A0, A1, A2, B0, B1, B2, and B3 are processed blocks adjacent to the processing target block.
- T0 is a block that belongs to a processed image and is located at or near the same position as the processing target block in the processing target image.
- A1 and A2 are blocks located on the left side of the processing target coding block and adjacent to the processing target coding block.
- B1 and B3 are blocks located above the processing target coding block and adjacent to the processing target coding block.
- A0, B0, and B2 are blocks located at the lower left, upper right, and upper left of the coded block to be processed, respectively.
- In affine transformation motion compensation, the coded block is divided into sub-blocks of a predetermined unit, and a motion vector is determined individually for each sub-block to perform motion compensation.
- The motion vector of each sub-block is derived based on one or more control points, which are derived from the inter-prediction information of a processed block adjacent to the processing target block, or of a block that belongs to a processed image and is located at or near the same position as the processing target block.
- The size of the sub-block is 4x4 pixels, but it is not limited to this, and the motion vector may be derived in pixel units.
- FIG. 14 shows an example of affine transformation motion compensation when there are two control points.
- Each of the two control points has two parameters, a horizontal component and a vertical component. Therefore, the affine transformation with two control points is called a 4-parameter affine transformation.
- CP1 and CP2 in FIG. 14 are control points.
- FIG. 15 shows an example of affine transformation motion compensation when there are three control points. In this case, each of the three control points has two parameters, a horizontal component and a vertical component. Therefore, the affine transformation with three control points is called a 6-parameter affine transformation.
- CP1, CP2, and CP3 in FIG. 15 are control points.
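- The derivation of sub-block motion vectors from two control points can be sketched as follows. The text does not give the formula or the positions of CP1 and CP2. This sketch assumes the commonly used 4-parameter model with CP1 at the top-left corner and CP2 at the top-right corner of the block, evaluated at each sub-block centre in floating point. The function name is hypothetical.

```python
def affine_subblock_mvs(cp1, cp2, width, height, sub=4):
    """Derive one motion vector per sub-block from two control point vectors.

    cp1 is the motion vector at the top-left corner and cp2 the motion
    vector at the top-right corner (an assumption of this sketch).
    """
    a = (cp2[0] - cp1[0]) / width   # scaling term
    b = (cp2[1] - cp1[1]) / width   # rotation term
    mvs = {}
    for y in range(0, height, sub):
        for x in range(0, width, sub):
            cx, cy = x + sub / 2, y + sub / 2   # sub-block centre
            mvs[(x, y)] = (a * cx - b * cy + cp1[0],
                           b * cx + a * cy + cp1[1])
    return mvs


# Equal control points describe pure translation: every sub-block gets the same vector.
translation = affine_subblock_mvs((2.0, 3.0), (2.0, 3.0), 8, 8)
# Different control points make the vector vary with the sub-block position.
zoom = affine_subblock_mvs((0.0, 0.0), (8.0, 0.0), 8, 8)
```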
- Affine transformation motion compensation can be used in both the predicted motion vector mode and the merge mode.
- The mode in which affine transformation motion compensation is applied in the predicted motion vector mode is defined as the sub-block predicted motion vector mode.
- The mode in which affine transformation motion compensation is applied in the merge mode is defined as the sub-block merge mode.
- the syntax for expressing the prediction mode of the coded block will be described with reference to FIGS. 12A, 12B, and 13.
- merge_flag is a flag indicating whether to use the merge intrablock copy mode or the predicted intrablock copy mode.
- merge_triangle_flag 1
- merge_triangle_split_dir direction the merge triangle indexes merge_triangle_idx0 and merge_triangle_idx1 for each of the two split partitions.
- merge_idx the merge index merge_idx is sent.
- FIG. 13 shows the values of each syntax element of inter-prediction and the corresponding prediction modes.
- The POC (picture order count) is a variable associated with the picture to be encoded, and is set to a value that increases by 1 in the output order of pictures. From POC values it is possible to determine whether two pictures are the same, to determine which picture precedes the other in output order, and to derive the distance between pictures. For example, if two pictures have the same POC value, they can be determined to be the same picture. If two pictures have different POC values, the picture with the smaller POC value can be determined to be output first, and the difference between the POCs of the two pictures indicates the distance between the pictures in the time axis direction.
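- The three uses of the POC described above can be written down directly. The function names are hypothetical, and returning the distance as an absolute value is a choice made for this sketch.

```python
def is_same_picture(poc_a, poc_b):
    """Two pictures with equal POC values are the same picture."""
    return poc_a == poc_b


def is_output_first(poc_a, poc_b):
    """True if picture A is output before picture B (smaller POC is output first)."""
    return poc_a < poc_b


def picture_distance(poc_a, poc_b):
    """Distance between two pictures along the time axis, in POC units."""
    return abs(poc_a - poc_b)


same = is_same_picture(8, 8)
first = is_output_first(4, 8)
distance = picture_distance(4, 8)
```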
- FIG. 1 is a block diagram of the image coding device 100 according to the first embodiment.
- the image coding device 100 of the embodiment includes a block division unit 101, an inter prediction unit 102, an intra prediction unit 103, a decoded image memory 104, a prediction method determination unit 105, a residual generation unit 106, and an orthogonal conversion / quantization unit 107.
- the block division unit 101 recursively divides the input image to generate a coded block.
- The block division unit 101 includes a 4-division unit that divides the block to be divided in both the horizontal and vertical directions, and a 2-3 division unit that divides the block to be divided in either the horizontal direction or the vertical direction.
- the block division unit 101 uses the generated coded block as a processing target coding block, and supplies the image signal of the processing target coded block to the inter prediction unit 102, the intra prediction unit 103, and the residual generation unit 106. Further, the block division unit 101 supplies information indicating the determined recursive division structure to the bit string coding unit 108. The detailed operation of the block dividing unit 101 will be described later.
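- Recursive division of a tree block can be sketched as follows. Only the 4-division is shown, and the 2-3 division is omitted. The split decision is passed in as a hypothetical callback `should_split`, because the text describes the actual decision elsewhere. The default minimum size of 4 follows the minimum coded block size stated earlier.

```python
def split_block(x, y, size, should_split, min_size=4):
    """Recursively apply 4-division and return the resulting coded blocks.

    Each returned block is a tuple (x, y, size). A block is divided into
    four equal quadrants while `should_split` asks for it and the block
    is still larger than `min_size`.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        blocks = []
        for dy in (0, half):
            for dx in (0, half):
                blocks.extend(
                    split_block(x + dx, y + dy, half, should_split, min_size))
        return blocks
    return [(x, y, size)]


# Split the 128x128 tree block once, then split only its top-left quadrant again.
blocks = split_block(
    0, 0, 128,
    lambda x, y, s: s == 128 or (s == 64 and x == 0 and y == 0))
```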
- the inter-prediction unit 102 performs inter-prediction of the coded block to be processed.
- the inter-prediction unit 102 derives a plurality of inter-prediction information candidates from the inter-prediction information stored in the coded information storage memory 111 and the decoded image signal stored in the decoded image memory 104.
- a suitable inter-prediction mode is selected from the plurality of derived candidates, and the selected inter-prediction mode and the prediction image signal corresponding to the selected inter-prediction mode are supplied to the prediction method determination unit 105.
- the detailed configuration and operation of the inter-prediction unit 102 will be described later.
- the intra prediction unit 103 performs intra prediction of the coded block to be processed.
- The intra prediction unit 103 refers to the decoded image signal stored in the decoded image memory 104 as reference pixels, and generates a predicted image signal by intra prediction based on coding information such as the intra prediction mode stored in the coding information storage memory 111.
- The intra prediction unit 103 selects a suitable intra prediction mode from a plurality of intra prediction modes, and supplies the selected intra prediction mode and the predicted image signal corresponding to it to the prediction method determination unit 105. The detailed configuration and operation of the intra prediction unit 103 will be described later.
- the decoded image memory 104 stores the decoded image generated by the decoded image signal superimposing unit 110.
- the decoded image memory 104 supplies the stored decoded image to the inter prediction unit 102 and the intra prediction unit 103.
- The prediction method determination unit 105 determines the optimal prediction mode by evaluating each of the intra prediction and the inter prediction using the coding information, the code amount of the residual, the amount of distortion between the predicted image signal and the image signal to be processed, and the like.
- the prediction method determination unit 105 supplies intra-prediction information such as the intra-prediction mode to the bit string coding unit 108 as coding information.
- The prediction method determination unit 105 supplies inter-prediction information such as the merge index and information indicating whether or not the sub-block merge mode is used (sub-block merge flag) to the bit string coding unit 108 as coding information.
- The prediction method determination unit 105 supplies inter-prediction information such as the inter-prediction mode, the predicted motion vector index, the reference indexes of L0 and L1, the differential motion vector, and information indicating whether or not the sub-block predicted motion vector mode is used (sub-block predicted motion vector flag) to the bit string coding unit 108 as coding information. Further, the prediction method determination unit 105 supplies the determined coding information to the coding information storage memory 111. The prediction method determination unit 105 supplies the predicted image signal to the residual generation unit 106 and the decoded image signal superimposition unit 110.
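- One common way to combine code amount and distortion into a single decision is a Lagrangian cost J = D + lambda * R. The text does not state that this particular cost is used, so the following is only a sketch under that assumption. The function name, the candidate values, and the lambda values are hypothetical.

```python
def choose_prediction_mode(candidates, lam):
    """Return the mode with the smallest cost D + lam * R.

    `candidates` maps a mode name to a (distortion, bits) pair.
    """
    return min(candidates,
               key=lambda mode: candidates[mode][0] + lam * candidates[mode][1])


costs = {"MODE_INTRA": (100, 40), "MODE_INTER": (120, 10)}
mode_high_lambda = choose_prediction_mode(costs, lam=1.0)  # bits weigh heavily
mode_low_lambda = choose_prediction_mode(costs, lam=0.1)   # distortion dominates
```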
- the residual generation unit 106 generates a residual by subtracting the predicted image signal from the image signal to be processed, and supplies the residual to the orthogonal transformation / quantization unit 107.
- The orthogonal transformation / quantization unit 107 performs orthogonal transformation and quantization on the residual according to the quantization parameter to generate the orthogonally transformed and quantized residual, and supplies the generated residual to the bit string coding unit 108 and the inverse quantization / inverse orthogonal transformation unit 109.
- In addition to information in units of sequences, pictures, slices, and coded blocks, the bit string coding unit 108 encodes, for each coded block, the coding information according to the prediction method determined by the prediction method determination unit 105. Specifically, the bit string coding unit 108 encodes the prediction mode PredMode for each coded block. When the prediction mode is inter prediction (MODE_INTER), the bit string coding unit 108 encodes coding information (inter-prediction information) according to a specified syntax (bit string syntax rules) to generate a first bit string. This information includes a flag for determining whether or not the merge mode is used, the sub-block merge flag, and the merge index in the case of merge mode. When the merge mode is not used, it includes the inter-prediction mode, the predicted motion vector index, information about the differential motion vector, and the sub-block predicted motion vector flag.
- The bit string coding unit 108 encodes a flag for determining whether or not intra block copy is used, according to the specified syntax.
- For intra block copy, coding information such as the merge index in the case of merge mode, and the predicted block vector index and the difference block vector when the merge mode is not used, is encoded according to the specified syntax.
- Coding information such as the intra-prediction mode is encoded according to the specified syntax.
- In this way, the first bit string is generated.
- The bit string coding unit 108 entropy-codes the orthogonally transformed and quantized residual according to the specified syntax to generate a second bit string.
- the bit string coding unit 108 multiplexes the first bit string and the second bit string according to the specified syntax, and outputs a bit stream.
- The inverse quantization / inverse orthogonal transformation unit 109 calculates the residual by applying inverse quantization and inverse orthogonal transformation to the orthogonally transformed and quantized residual supplied from the orthogonal transformation / quantization unit 107, and supplies the calculated residual to the decoded image signal superimposing unit 110.
- The decoded image signal superimposing unit 110 generates a decoded image by superimposing the predicted image signal chosen by the prediction method determination unit 105 and the residual output by the inverse quantization / inverse orthogonal transformation unit 109, and stores the decoded image in the decoded image memory 104.
- the decoded image signal superimposing unit 110 may store the decoded image in the decoded image memory 104 after performing a filtering process on the decoded image to reduce distortion such as block distortion due to coding.
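- The path from residual generation to the decoded image can be sketched as follows. For brevity the orthogonal transformation is omitted and quantization is modelled as division by a single hypothetical step size `qstep`. Both simplifications are assumptions of this sketch, and the function names are hypothetical.

```python
def encode_block(original, predicted, qstep):
    """Residual generation followed by quantization (transform omitted)."""
    residual = [o - p for o, p in zip(original, predicted)]
    return [round(r / qstep) for r in residual]


def reconstruct_block(levels, predicted, qstep):
    """Inverse quantization, then superimposition onto the predicted signal."""
    return [p + level * qstep for p, level in zip(predicted, levels)]


original = [101, 107, 98, 91]
predicted = [98, 98, 98, 98]
levels = encode_block(original, predicted, qstep=4)
decoded = reconstruct_block(levels, predicted, qstep=4)
```

- The decoded values differ slightly from the original because quantization is lossy. The encoder stores this decoded image, not the original, so that later predictions use the same pixels the decoder will have.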
- the coding information storage memory 111 stores coding information such as a prediction mode (inter prediction or intra prediction) determined by the prediction method determination unit 105.
- the coding information stored in the coding information storage memory 111 includes inter-prediction information such as a determined motion vector, a reference index of reference lists L0 and L1, and a history prediction motion vector candidate list.
- In addition to the above information, the coding information stored in the coding information storage memory 111 includes inter-prediction information such as the merge index and information indicating whether or not the sub-block merge mode is used (sub-block merge flag).
- In addition to the above information, the coding information stored in the coding information storage memory 111 includes inter-prediction information such as the inter-prediction mode, the predicted motion vector index, the differential motion vector, and information indicating whether or not the sub-block predicted motion vector mode is used (sub-block predicted motion vector flag).
- the coding information stored in the coding information storage memory 111 includes intra-prediction information such as the determined intra-prediction mode.
- FIG. 2 is a block diagram showing the configuration of an image decoding device according to an embodiment of the present invention, corresponding to the image coding device of FIG. 1.
- The image decoding device of the embodiment includes a bit string decoding unit 201, a block division unit 202, an inter prediction unit 203, an intra prediction unit 204, a coding information storage memory 205, an inverse quantization / inverse orthogonal transformation unit 206, a decoded image signal superimposing unit 207, and a decoded image memory 208.
- The coding information storage memory 205, the inverse quantization / inverse orthogonal transformation unit 206, the decoded image signal superimposing unit 207, and the decoded image memory 208 have functions corresponding to the coding information storage memory 111, the inverse quantization / inverse orthogonal transformation unit 109, the decoded image signal superimposing unit 110, and the decoded image memory 104 of the image coding device of FIG. 1, respectively.
- the bit stream supplied to the bit string decoding unit 201 is separated according to the specified syntax rules.
- The bit string decoding unit 201 decodes the separated first bit string to obtain information in units of sequences, pictures, slices, and coded blocks, and coding information in units of coded blocks. Specifically, the bit string decoding unit 201 decodes the prediction mode PredMode, which determines whether inter-prediction (MODE_INTER) or intra-prediction (MODE_INTRA) is performed for each coded block.
- When the prediction mode is inter-prediction (MODE_INTER), the bit string decoding unit 201 decodes, according to the specified syntax, coding information (inter-prediction information) such as a flag for determining whether or not the merge mode is used, the merge index and the sub-block merge flag in the case of the merge mode, and the inter-prediction mode, predicted motion vector index, differential motion vector, and sub-block predicted motion vector flag in the case of the predicted motion vector mode. The coding information (inter-prediction information) is supplied to the coding information storage memory 205 via the inter-prediction unit 203 and the block division unit 202.
- When the prediction mode is intra prediction (MODE_INTRA), the bit string decoding unit 201 decodes the flag for determining whether or not intra block copy is used.
- In the case of intra block copy, coding information (intra-prediction information) such as the merge index in the merge mode, and the predicted block vector index and the difference block vector in the predicted block vector mode, is decoded according to the specified syntax.
- Otherwise, coding information such as the intra-prediction mode is decoded according to the specified syntax.
- The coding information (intra-prediction information) is supplied to the coding information storage memory 205 via the inter prediction unit 203 or the intra prediction unit 204, and the block division unit 202.
- The bit string decoding unit 201 decodes the separated second bit string, calculates the orthogonally converted / quantized residual, and supplies it to the inverse quantization / inverse orthogonal conversion unit 206.
- When the prediction mode PredMode of the coded block to be processed is inter-prediction (MODE_INTER) and the predicted motion vector mode, the inter-prediction unit 203 derives a plurality of predicted motion vector candidates using the coding information of the already decoded image signal stored in the coding information storage memory 205, and registers the derived candidates in the predicted motion vector candidate list described later.
- the inter-prediction unit 203 selects a predicted motion vector according to the predicted motion vector index decoded and supplied by the bit string decoding unit 201 from a plurality of predicted motion vector candidates registered in the predicted motion vector candidate list.
- a motion vector is calculated from the differential motion vector decoded by the bit string decoding unit 201 and the selected predicted motion vector, and the calculated motion vector is stored in the coding information storage memory 205 together with other coding information.
- The coding information of the coded block supplied and stored here includes the prediction mode PredMode and the flags predFlagL0[xP][yP] and predFlagL1[xP][yP], which indicate whether L0 prediction and L1 prediction are used.
- xP and yP are indexes indicating the positions of the upper left pixels of the coded block in the picture.
- When PredMode is inter-prediction (MODE_INTER) and the inter-prediction mode is L0 prediction (Pred_L0), the flag predFlagL0 indicating whether to use L0 prediction is 1, and the flag predFlagL1 indicating whether to use L1 prediction is 0.
- When the inter-prediction mode is L1 prediction, predFlagL0 is 0 and predFlagL1 is 1.
- When both L0 prediction and L1 prediction are used, predFlagL0 and predFlagL1 are both 1.
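The relationship between the inter-prediction mode and the two flags can be sketched as a small lookup. This is only an illustration: the function name `pred_flags` and the label `PRED_BI` for the case where both lists are used are assumptions made for this example and do not come from the text.

```python
from typing import Tuple

# Labels for the inter-prediction modes. "Pred_L0" and "Pred_L1" appear in the
# text; "PRED_BI" is a hypothetical label for the case that uses both lists.
PRED_L0, PRED_L1, PRED_BI = "Pred_L0", "Pred_L1", "PRED_BI"


def pred_flags(inter_pred_mode: str) -> Tuple[int, int]:
    """Return (predFlagL0, predFlagL1) for the given inter-prediction mode."""
    table = {
        PRED_L0: (1, 0),  # only L0 prediction is used
        PRED_L1: (0, 1),  # only L1 prediction is used
        PRED_BI: (1, 1),  # both L0 and L1 prediction are used
    }
    return table[inter_pred_mode]
```

For example, `pred_flags(PRED_L1)` yields `(0, 1)`.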
- When the prediction mode PredMode of the coded block to be processed is inter-prediction (MODE_INTER) and the merge mode is set, merge candidates are derived.
- A plurality of merge candidates are derived and registered in the merge candidate list described later.
- The merge candidate corresponding to the merge index decoded and supplied by the bit string decoding unit 201 is selected from the plurality of merge candidates registered in the merge candidate list.
- The inter-prediction information of the selected merge candidate, such as the flags predFlagL0[xP][yP] and predFlagL1[xP][yP] indicating whether to use L0 prediction and L1 prediction, the reference indexes refIdxL0[xP][yP] and refIdxL1[xP][yP] of L0 and L1, and the motion vectors mvL0[xP][yP] and mvL1[xP][yP] of L0 and L1, is stored in the coding information storage memory 205.
- xP and yP are indexes indicating the positions of the upper left pixels of the coded block in the picture. The detailed configuration and operation of the inter-prediction unit 203 will be described later.
- the intra prediction unit 204 performs intra prediction when the prediction mode PredMode of the coded block to be processed is intra prediction (MODE_INTRA).
- the coding information decoded by the bit string decoding unit 201 includes an intra prediction mode.
- the intra prediction unit 204 generates a prediction image signal by intra prediction from the decoded image signal stored in the decoded image memory 208 according to the intra prediction mode included in the coding information decoded by the bit string decoding unit 201. Then, the generated predicted image signal is supplied to the decoded image signal superimposing unit 207. Since the intra prediction unit 204 corresponds to the intra prediction unit 103 of the image coding device 100, the same processing as that of the intra prediction unit 103 is performed.
- The inverse quantization / inverse orthogonal conversion unit 206 performs inverse orthogonal conversion and inverse quantization on the orthogonally converted / quantized residual decoded by the bit string decoding unit 201, and obtains the inversely orthogonally converted / inversely quantized residual.
- The decoded image signal superimposition unit 207 decodes the decoded image signal by superimposing the predicted image signal inter-predicted by the inter-prediction unit 203, or the predicted image signal intra-predicted by the intra prediction unit 204, on the residual that has been inversely quantized and inversely orthogonally converted by the inverse quantization / inverse orthogonal conversion unit 206, and stores the decoded image signal in the decoded image memory 208.
- The decoded image signal superimposition unit 207 may perform a filtering process on the decoded image to reduce block distortion due to coding, and then store it in the decoded image memory 208.
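As a minimal sketch of the superimposition described above, the following adds a residual block to a predicted block sample by sample. The function name `superimpose`, the `bit_depth` parameter, and the clipping to the valid sample range are assumptions for illustration; the text itself only states that the two signals are superimposed.

```python
from typing import List


def superimpose(pred: List[List[int]],
                resid: List[List[int]],
                bit_depth: int = 8) -> List[List[int]]:
    """Add the residual to the predicted block, sample by sample.

    Clipping to [0, 2**bit_depth - 1] is an assumption of this sketch.
    """
    max_val = (1 << bit_depth) - 1
    return [
        [min(max(p + r, 0), max_val) for p, r in zip(pred_row, resid_row)]
        for pred_row, resid_row in zip(pred, resid)
    ]
```

Any in-loop filtering would, under the same assumptions, run on the result of this step before it is written to the decoded image memory.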
- FIG. 3 is a flowchart showing an operation of dividing an image into tree blocks and further dividing each tree block.
- the input image is divided into tree blocks of a predetermined size (step S1001).
- Each tree block is scanned in a predetermined order, that is, in the order of raster scan (step S1002), and the inside of the tree block to be processed is divided (step S1003).
- FIG. 7 is a flowchart showing the detailed operation of the division process in step S1003. First, it is determined whether or not to divide the block to be processed into four (step S1101).
- If it is determined that the block to be processed is to be divided into four, the block is divided into four (step S1102).
- Each block obtained by dividing the block to be processed is scanned in the Z scan order, that is, in the order of upper left, upper right, lower left, and lower right (step S1103).
- FIG. 5 is an example of the Z scan order
- 601 of FIG. 6A is an example of dividing the processing target block into four. Numbers 0 to 3 of 601 in FIG. 6A indicate the order of processing.
- the division process of FIG. 7 is recursively executed for each block divided in step S1101 (step S1104).
- If it is determined that the block to be processed is not to be divided into four, the 2-3 division is performed (step S1105).
- FIG. 8 is a flowchart showing the detailed operation of the 2-3 division process of step S1105. First, it is determined whether or not the block to be processed is divided into 2-3, that is, whether or not to perform either 2-division or 3-division (step S1201).
- If it is not determined that the block to be processed is to be divided into 2-3, that is, if it is determined that it is not to be divided, the division is terminated (step S1211). That is, a block produced by the recursive division process is not divided any further.
- If it is determined that the block to be processed is to be divided into 2-3, it is further determined whether or not to divide the block into two (step S1202).
- If it is determined that the block is to be divided into two, it is determined whether or not to divide the block into upper and lower parts (vertical direction) (step S1203), and based on the result, the block is divided into two parts, upper and lower (vertical direction) (step S1204), or into two parts, left and right (horizontal direction) (step S1205).
- As a result of step S1204, the block is divided into upper and lower parts (vertical direction) as shown in 602 of FIG. 6B, and as a result of step S1205, the block is divided into left and right parts (horizontal direction) as shown in 604 of FIG. 6D.
- If it is not determined in step S1202 that the block to be processed is to be divided into two, that is, if it is determined that it is to be divided into three, it is determined whether or not to divide the block into upper, middle, and lower parts (vertical direction) (step S1206). Based on the result, the block is divided into three parts, upper, middle, and lower (vertical direction) (step S1207), or into three parts, left, middle, and right (horizontal direction) (step S1208). As a result of step S1207, the block is divided as shown in 603 of FIG. 6C, and as a result of step S1208, the block is divided as shown in 605 of FIG. 6E.
- After executing any one of steps S1204, S1205, S1207, and S1208, each block obtained by dividing the block to be processed is scanned in order from left to right and from top to bottom (step S1209). Numbers 0 to 2 of 602 to 605 in FIGS. 6B to 6E indicate the order of processing. For each of the divided blocks, the 2-3 division process of FIG. 8 is recursively executed (step S1210).
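The recursive division of FIG. 7 and FIG. 8 can be sketched as follows. This is an illustrative reading of the steps, not the embodiment's implementation: the mode names, the `decide` callback (which stands in for the encoder's decision or the decoded division information), and the 1:2:1 ratio used for the three-way division are all assumptions of this example.

```python
from typing import Callable, List, Tuple

Block = Tuple[int, int, int, int]  # (x, y, width, height)


def split(block: Block, mode: str) -> List[Block]:
    """Return child blocks in processing order for one division step."""
    x, y, w, h = block
    if mode == "QUAD":  # Z scan order: upper left, upper right, lower left, lower right
        hw, hh = w // 2, h // 2
        return [(x, y, hw, hh), (x + hw, y, hw, hh),
                (x, y + hh, hw, hh), (x + hw, y + hh, hw, hh)]
    if mode == "BI_UPPER_LOWER":
        return [(x, y, w, h // 2), (x, y + h // 2, w, h // 2)]
    if mode == "BI_LEFT_RIGHT":
        return [(x, y, w // 2, h), (x + w // 2, y, w // 2, h)]
    if mode == "TRI_UPPER_MIDDLE_LOWER":  # assumed 1:2:1 ratio
        q = h // 4
        return [(x, y, w, q), (x, y + q, w, 2 * q), (x, y + 3 * q, w, q)]
    if mode == "TRI_LEFT_MIDDLE_RIGHT":  # assumed 1:2:1 ratio
        q = w // 4
        return [(x, y, q, h), (x + q, y, 2 * q, h), (x + 3 * q, y, q, h)]
    raise ValueError(f"unknown mode: {mode}")


def divide(block: Block,
           decide: Callable[[Block, bool], str],
           quad_allowed: bool = True) -> List[Block]:
    """Recursively divide a block and return the undivided leaf blocks.

    Children of a four-way division re-enter the FIG. 7 process; children of a
    2-3 division only re-enter the 2-3 process of FIG. 8 (quad_allowed=False).
    """
    mode = decide(block, quad_allowed)
    if mode == "NONE":
        return [block]
    if mode == "QUAD" and not quad_allowed:
        raise ValueError("four-way division is not allowed below a 2-3 division")
    leaves: List[Block] = []
    for child in split(block, mode):
        leaves.extend(divide(child, decide, quad_allowed=(mode == "QUAD")))
    return leaves
```

A caller supplies `decide` to return one of the mode names or `"NONE"`; on the decoding side it would, under these assumptions, simply read the division information from the bit string.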
- the recursive block division described here may limit the necessity of division depending on the number of divisions, the size of the block to be processed, and the like.
- The information that limits the necessity of division may be realized in a configuration in which no information is transmitted, by agreeing on it in advance between the coding device and the decoding device, or in a configuration in which the coding device determines the limiting information, records it in the bit string, and thereby transmits it to the decoding device.
- each block after division is called the child block.
- the block division unit 202 divides the tree block by the same processing procedure as the block division unit 101 of the image coding apparatus 100.
- The block division unit 101 of the image coding apparatus 100 applies an optimization method, such as estimation of the optimum shape by image recognition or rate-distortion optimization, to determine the optimum block division shape, whereas the block division unit 202 of the image decoding apparatus 200 differs in that it determines the block division shape by decoding the block division information recorded in the bit string.
- FIG. 9 shows the syntax (syntax rule of the bit string) regarding the block division of the first embodiment.
- coding_quadtree represents the syntax of the block quadtree processing.
- multi_type_tree represents the syntax for dividing a block into two or three.
- mtt_split_vertical is a flag indicating whether to divide vertically or horizontally.
- mtt_split_binary is a flag determining whether to divide into two or three.
- mtt_split_vertical = 1 indicates division in the vertical direction.
- mtt_split_vertical = 0 indicates division in the horizontal direction.
- mtt_split_binary = 1 indicates division into two.
- mtt_split_binary = 0 indicates division into three.
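The meaning of the two flags can be summarized in a small helper. The function name `mtt_split_type` and the returned string labels are hypothetical and only serve this illustration; the flag semantics follow the list above.

```python
def mtt_split_type(mtt_split_vertical: int, mtt_split_binary: int) -> str:
    """Map the two decoded flags to a description of the 2-3 division."""
    direction = "vertical" if mtt_split_vertical == 1 else "horizontal"
    parts = 2 if mtt_split_binary == 1 else 3
    return f"{direction}-{parts}"
```

For instance, `mtt_split_type(1, 0)` describes a three-way division in the vertical direction.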
- FIG. 40 is a diagram showing a detailed configuration of the intra prediction unit 103 of the image coding device 100 of FIG. 1.
- The normal intra prediction unit 351 generates a prediction image signal by normal intra prediction from decoded pixels adjacent to the coded block to be processed, selects a suitable intra prediction mode from a plurality of intra prediction modes, and supplies the selected intra prediction mode and the predicted image signal corresponding to it to the prediction method determination unit 105.
- FIGS. 10A and 10B show examples of intra-prediction.
- FIG. 10A shows the correspondence between the prediction direction of the normal intra-prediction and the intra-prediction mode number.
- the intra prediction mode 50 generates an intra prediction image by copying pixels in the vertical direction.
- the intra prediction mode 1 is a DC mode, and is a mode in which all the pixel values of the processing target block are set as the average value of the reference pixels.
- the intra prediction mode 0 is a Planar mode, which is a mode for creating a two-dimensional intra prediction image from reference pixels in the vertical and horizontal directions.
- FIG. 10B is an example of generating an intra prediction image in the case of the intra prediction mode 40. For each pixel of the block to be processed, the value of the reference pixel in the direction indicated by the intra prediction mode is copied. When the reference pixel in the intra prediction mode is not an integer position, the reference pixel value is determined by interpolation from the reference pixel values at the surrounding integer positions.
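Two of the modes described above can be sketched in a few lines. This is a simplified illustration under stated assumptions: the reference pixels are taken to be one row above (`top`) and one column to the left (`left`) of a square block, the DC average is rounded to the nearest integer, and the function names are hypothetical. The actual reference pixel set and rounding used by the embodiment are not specified in this passage.

```python
from typing import List


def predict_dc(top: List[int], left: List[int], size: int) -> List[List[int]]:
    """DC mode: fill the block with the average of the reference pixels."""
    refs = top[:size] + left[:size]  # assumed reference set
    dc = (sum(refs) + len(refs) // 2) // len(refs)  # rounded integer average
    return [[dc] * size for _ in range(size)]


def predict_vertical(top: List[int], size: int) -> List[List[int]]:
    """Vertical mode: copy the reference row above the block into every row."""
    return [list(top[:size]) for _ in range(size)]
```

Angular modes such as the one in FIG. 10B would additionally need the interpolation between integer reference positions that the text mentions, which this sketch leaves out.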
- The intra-block copy prediction unit 352 acquires a decoded area of the same image signal as the coded block to be processed from the decoded image memory 104, generates a prediction image signal by the intra-block copy processing, and supplies it to the prediction method determination unit 105. The detailed configuration and processing of the intra-block copy prediction unit 352 will be described later.
- FIG. 41 is a diagram showing a detailed configuration of the intra prediction unit 204 of the image decoding device 200 of FIG. 2.
- The normal intra prediction unit 361 generates a prediction image signal by normal intra prediction from decoded pixels adjacent to the coded block to be processed, selects a suitable intra prediction mode from a plurality of intra prediction modes, and obtains the selected intra prediction mode and the predicted image signal corresponding to it.
- This predicted image signal is supplied to the decoded image signal superimposing unit 207 via the switch 364. Since the processing of the normal intra prediction unit 361 of FIG. 41 corresponds to the normal intra prediction unit 351 of FIG. 40, detailed description thereof will be omitted.
- the intra-block copy prediction unit 362 acquires a decoded area of the same image signal as the coded block to be processed from the decoded image memory 208, and obtains the predicted image signal by the intra-block copy processing. This predicted image signal is supplied to the decoded image signal superimposing unit 207 via the switch 364. The detailed configuration and processing of the intra-block copy prediction unit 362 will be described later.
- The inter-prediction method according to the embodiment is carried out by the inter-prediction unit 102 of the image coding device of FIG. 1 and the inter-prediction unit 203 of the image decoding device of FIG. 2.
- The inter-prediction method according to the embodiment will be described with reference to the drawings.
- The inter-prediction method is performed in units of coded blocks in both the coding and decoding processes.
- FIG. 16 is a diagram showing a detailed configuration of the inter-prediction unit 102 of the image coding apparatus of FIG. 1.
- the normal predicted motion vector mode derivation unit 301 derives a plurality of normal predicted motion vector candidates, selects a predicted motion vector, and calculates a differential motion vector between the selected predicted motion vector and the detected motion vector.
- the detected inter-prediction mode, reference index, motion vector, and calculated difference motion vector become the inter-prediction information of the normal prediction motion vector mode.
- This inter-prediction information is supplied to the inter-prediction mode determination unit 305.
- the detailed configuration and processing of the normal prediction motion vector mode derivation unit 301 will be described later.
- the normal merge mode derivation unit 302 derives a plurality of normal merge candidates, selects the normal merge candidates, and obtains the inter-prediction information of the normal merge mode. This inter-prediction information is supplied to the inter-prediction mode determination unit 305. The detailed configuration and processing of the normal merge mode derivation unit 302 will be described later.
- The sub-block prediction motion vector mode derivation unit 303 derives a plurality of sub-block prediction motion vector candidates, selects a sub-block prediction motion vector, and calculates a differential motion vector between the selected sub-block prediction motion vector and the detected motion vector.
- the detected inter-prediction mode, reference index, motion vector, and calculated difference motion vector become the inter-prediction information of the sub-block prediction motion vector mode. This inter-prediction information is supplied to the inter-prediction mode determination unit 305.
- the sub-block merge mode derivation unit 304 derives a plurality of sub-block merge candidates, selects the sub-block merge candidates, and obtains the inter-prediction information of the sub-block merge mode. This inter-prediction information is supplied to the inter-prediction mode determination unit 305.
- The inter-prediction mode determination unit 305 determines the inter-prediction information based on the inter-prediction information supplied from the normal prediction motion vector mode derivation unit 301, the normal merge mode derivation unit 302, the sub-block prediction motion vector mode derivation unit 303, and the sub-block merge mode derivation unit 304. The inter-prediction mode determination unit 305 supplies the inter-prediction information according to the determination result to the motion compensation prediction unit 306.
- the motion compensation prediction unit 306 performs inter-prediction for the reference image signal stored in the decoded image memory 104 based on the determined inter-prediction information. The detailed configuration and processing of the motion compensation prediction unit 306 will be described later.
- FIG. 22 is a diagram showing a detailed configuration of the inter-prediction unit 203 of the image decoding apparatus of FIG. 2.
- The normal predicted motion vector mode derivation unit 401 derives a plurality of normal predicted motion vector candidates, selects a predicted motion vector, calculates the sum of the selected predicted motion vector and the decoded differential motion vector, and uses it as the motion vector.
- the decoded inter-prediction mode, reference index, and motion vector are the inter-prediction information of the normal prediction motion vector mode. This inter-prediction information is supplied to the motion compensation prediction unit 406 via the switch 408. The detailed configuration and processing of the normal predicted motion vector mode derivation unit 401 will be described later.
- the normal merge mode derivation unit 402 derives a plurality of normal merge candidates, selects the normal merge candidates, and obtains the inter-prediction information of the normal merge mode. This inter-prediction information is supplied to the motion compensation prediction unit 406 via the switch 408. The detailed configuration and processing of the normal merge mode derivation unit 402 will be described later.
- The sub-block prediction motion vector mode derivation unit 403 derives a plurality of sub-block prediction motion vector candidates, selects a sub-block prediction motion vector, calculates the sum of the selected sub-block prediction motion vector and the decoded differential motion vector, and uses it as the motion vector.
- the decoded inter-prediction mode, reference index, and motion vector become the inter-prediction information of the sub-block prediction motion vector mode. This inter-prediction information is supplied to the motion compensation prediction unit 406 via the switch 408.
- the sub-block merge mode derivation unit 404 derives a plurality of sub-block merge candidates, selects the sub-block merge candidates, and obtains the inter-prediction information of the sub-block merge mode. This inter-prediction information is supplied to the motion compensation prediction unit 406 via the switch 408.
- the motion compensation prediction unit 406 performs inter-prediction for the reference image signal stored in the decoded image memory 208 based on the determined inter-prediction information.
- the detailed configuration and processing of the motion compensation prediction unit 406 are the same as those of the motion compensation prediction unit 306 on the coding side.
- The normal prediction motion vector mode derivation unit 301 of FIG. 17 includes a spatial prediction motion vector candidate derivation unit 321, a time prediction motion vector candidate derivation unit 322, a history prediction motion vector candidate derivation unit 323, a prediction motion vector candidate replenishment unit 325, a normal motion vector detection unit 326, a predicted motion vector candidate selection unit 327, and a motion vector subtraction unit 328.
- The normal prediction motion vector mode derivation unit 401 of FIG. 23 includes a spatial prediction motion vector candidate derivation unit 421, a time prediction motion vector candidate derivation unit 422, a history prediction motion vector candidate derivation unit 423, a prediction motion vector candidate replenishment unit 425, a predicted motion vector candidate selection unit 426, and a motion vector addition unit 427.
- FIG. 19 is a flowchart showing the normal predicted motion vector mode derivation procedure performed by the normal prediction motion vector mode derivation unit 301 on the coding side.
- FIG. 25 is a flowchart showing the normal predicted motion vector mode derivation procedure performed by the normal prediction motion vector mode derivation unit 401 on the decoding side.
- the normal motion vector detection unit 326 detects the normal motion vector for each inter-prediction mode and reference index (step S100 in FIG. 19).
- the difference motion vector of the motion vector used in the inter-prediction of the normal prediction motion vector mode is calculated for each of L0 and L1, respectively (steps S101 to S106 in FIG. 19).
- When the prediction mode PredMode of the block to be processed is inter-prediction (MODE_INTER) and the inter-prediction mode is L0 prediction (Pred_L0), the predicted motion vector candidate list mvpListL0 of L0 is calculated, the predicted motion vector mvpL0 is selected, and the differential motion vector mvdL0 of the motion vector mvL0 of L0 is calculated.
- When the inter-prediction mode of the block to be processed is L1 prediction (Pred_L1), the predicted motion vector candidate list mvpListL1 of L1 is calculated, the predicted motion vector mvpL1 is selected, and the differential motion vector mvdL1 of the motion vector mvL1 of L1 is calculated.
- When both L0 prediction and L1 prediction are performed, the predicted motion vector candidate list mvpListL0 of L0 is calculated, the predicted motion vector mvpL0 of L0 is selected, and the differential motion vector mvdL0 of the motion vector mvL0 of L0 is calculated; likewise, the predicted motion vector candidate list mvpListL1 of L1 is calculated, the predicted motion vector mvpL1 of L1 is selected, and the differential motion vector mvdL1 of the motion vector mvL1 of L1 is calculated.
- In the following, L0 and L1 are represented as a common LX.
- In the process of calculating the differential motion vector of L0, X of LX is 0, and in the process of calculating the differential motion vector of L1, X of LX is 1.
- The list other than LX is represented as LY.
- When the LX motion vector mvLX is used (step S102: YES in FIG. 19), the LX predicted motion vector candidates are calculated and the LX predicted motion vector candidate list mvpListLX is constructed (step S103 in FIG. 19). The spatial prediction motion vector candidate derivation unit 321, the time prediction motion vector candidate derivation unit 322, the history prediction motion vector candidate derivation unit 323, and the prediction motion vector candidate replenishment unit 325 in the normal prediction motion vector mode derivation unit 301 derive a plurality of predicted motion vector candidates and construct the predicted motion vector candidate list mvpListLX. The detailed processing procedure of step S103 of FIG. 19 will be described later with reference to the flowchart of FIG. 20.
- the predicted motion vector candidate selection unit 327 selects the LX predicted motion vector mvpLX from the LX predicted motion vector candidate list mvpListLX (step S104 in FIG. 19).
- One element of the predicted motion vector candidate list mvpListLX (the i-th element counting from 0) is represented as mvpListLX[i].
- For each candidate mvpListLX[i] stored in the predicted motion vector candidate list mvpListLX, the differential motion vector, which is the difference between the motion vector mvLX and the candidate mvpListLX[i], is calculated.
- The code amount required to encode each of these differential motion vectors is calculated for each element (predicted motion vector candidate) of the predicted motion vector candidate list mvpListLX.
- The predicted motion vector candidate mvpListLX[i] with the minimum code amount is selected as the predicted motion vector mvpLX, and its index i is obtained.
- When several candidates share the minimum code amount, the candidate mvpListLX[i] whose index i in the predicted motion vector candidate list mvpListLX is smallest is selected as the optimal predicted motion vector mvpLX, and its index i is obtained.
- The motion vector subtraction unit 328 subtracts the selected LX predicted motion vector mvpLX from the LX motion vector mvLX, and calculates the differential motion vector mvdLX of LX as mvdLX = mvLX - mvpLX (step S105 in FIG. 19).
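The coding-side selection and subtraction (steps S104 and S105) can be sketched as below. The code amount is approximated here by the bit length of a signed exponential-Golomb code per component, which is an assumption chosen only to make the example concrete; the text does not say how the code amount is measured. The function names are likewise hypothetical.

```python
from typing import List, Tuple

MV = Tuple[int, int]  # (horizontal, vertical) components


def se_golomb_bits(v: int) -> int:
    """Bit length of a signed exp-Golomb code; a stand-in for the code amount."""
    code_num = 2 * v - 1 if v > 0 else -2 * v
    return 2 * (code_num + 1).bit_length() - 1


def mvd_cost(mvd: MV) -> int:
    return se_golomb_bits(mvd[0]) + se_golomb_bits(mvd[1])


def select_mvp(mv_lx: MV, mvp_list_lx: List[MV]) -> Tuple[int, MV, MV]:
    """Return (index i, mvpLX, mvdLX) with the minimum assumed code amount.

    min() keeps the first minimum, so ties go to the smaller index i.
    """
    def cost(i: int) -> int:
        cand = mvp_list_lx[i]
        return mvd_cost((mv_lx[0] - cand[0], mv_lx[1] - cand[1]))

    idx = min(range(len(mvp_list_lx)), key=cost)
    mvp_lx = mvp_list_lx[idx]
    mvd_lx = (mv_lx[0] - mvp_lx[0], mv_lx[1] - mvp_lx[1])  # mvdLX = mvLX - mvpLX
    return idx, mvp_lx, mvd_lx
```

With `mv_lx = (5, -3)` and candidates `[(0, 0), (4, -3), (6, -3)]`, the second and third candidates have equal cost under this measure, and the smaller index is chosen.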
- On the decoding side (FIG. 25), when the inter-prediction mode of the block to be processed is L0 prediction (Pred_L0), the predicted motion vector candidate list mvpListL0 of L0 is calculated, the predicted motion vector mvpL0 is selected, and the motion vector mvL0 of L0 is calculated.
- When the inter-prediction mode of the block to be processed is L1 prediction (Pred_L1), the predicted motion vector candidate list mvpListL1 of L1 is calculated, the predicted motion vector mvpL1 is selected, and the motion vector mvL1 of L1 is calculated.
- When both L0 prediction and L1 prediction are performed, the predicted motion vector candidate list mvpListL0 of L0 is calculated, the predicted motion vector mvpL0 of L0 is selected, and the motion vector mvL0 of L0 is calculated; likewise, the predicted motion vector candidate list mvpListL1 of L1 is calculated, the predicted motion vector mvpL1 of L1 is selected, and the motion vector mvL1 of L1 is calculated.
- L0 and L1 are represented as a common LX.
- LX represents an inter-prediction mode used for inter-prediction of the coded block to be processed.
- X is 0 in the process of calculating the motion vector of L0, and X is 1 in the process of calculating the motion vector of L1.
- the other reference list is represented as LY.
- When the LX motion vector mvLX is used (step S202: YES in FIG. 25), the LX predicted motion vector candidates are calculated and the LX predicted motion vector candidate list mvpListLX is constructed (step S203 in FIG. 25). The spatial prediction motion vector candidate derivation unit 421, the time prediction motion vector candidate derivation unit 422, the history prediction motion vector candidate derivation unit 423, and the prediction motion vector candidate replenishment unit 425 in the normal prediction motion vector mode derivation unit 401 calculate a plurality of predicted motion vector candidates and construct the predicted motion vector candidate list mvpListLX. The detailed processing procedure of step S203 of FIG. 25 will be described later with reference to the flowchart of FIG. 20.
- The predicted motion vector candidate selection unit 426 takes out, from the predicted motion vector candidate list mvpListLX, the predicted motion vector candidate mvpListLX[mvpIdxLX] corresponding to the index mvpIdxLX of the predicted motion vector decoded and supplied by the bit string decoding unit 201, as the selected predicted motion vector mvpLX (step S204 in FIG. 25).
- The motion vector addition unit 427 adds the LX differential motion vector mvdLX, decoded and supplied by the bit string decoding unit 201, and the LX predicted motion vector mvpLX, and calculates the motion vector mvLX of LX as mvLX = mvpLX + mvdLX (step S205 in FIG. 25).
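The decoding-side counterpart (steps S204 and S205) reduces to an index lookup and an addition. A minimal sketch, with the hypothetical function name `reconstruct_mv` and motion vectors represented as integer pairs:

```python
from typing import List, Tuple

MV = Tuple[int, int]  # (horizontal, vertical) components


def reconstruct_mv(mvp_list_lx: List[MV], mvp_idx_lx: int, mvd_lx: MV) -> MV:
    """Pick mvpListLX[mvpIdxLX] and add the decoded differential motion vector."""
    mvp_lx = mvp_list_lx[mvp_idx_lx]
    return (mvp_lx[0] + mvd_lx[0], mvp_lx[1] + mvd_lx[1])  # mvLX = mvpLX + mvdLX
```

Because the coding side computes mvdLX = mvLX - mvpLX from the same candidate list, this addition recovers the original motion vector as long as both sides build identical lists.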
- FIG. 20 is a flowchart showing the processing procedure of the normal predicted motion vector mode derivation process, which has a function common to the normal predicted motion vector mode derivation unit 301 of the image coding device and the normal predicted motion vector mode derivation unit 401 of the image decoding device according to the embodiment of the present invention.
- the normal prediction motion vector mode derivation unit 301 and the normal prediction motion vector mode derivation unit 401 include a prediction motion vector candidate list mvpListLX.
- The predicted motion vector candidate list mvpListLX has a list structure, and is provided with a storage area that stores, as elements, the predicted motion vector index indicating the location inside the predicted motion vector candidate list and the predicted motion vector candidate corresponding to the index. The predicted motion vector index starts from 0, and the predicted motion vector candidates are stored in the storage area of the predicted motion vector candidate list mvpListLX.
- the predicted motion vector candidate list mvpListLX can register at least two predicted motion vector candidates (inter-prediction information). Further, 0 is set in the variable numCurrMvpCand indicating the number of predicted motion vector candidates registered in the predicted motion vector candidate list mvpListLX.
- The spatial prediction motion vector candidate derivation units 321 and 421 derive a predicted motion vector candidate from the blocks adjacent to the left side. In this process, the predicted motion vector mvLXA is derived by referring to the inter-prediction information of the block adjacent to the left side (A0 or A1 in FIG. 11), that is, the flag indicating whether or not the predicted motion vector candidate can be used, the motion vector, the reference index, and the like, and the derived mvLXA is added to the predicted motion vector candidate list mvpListLX (step S301 in FIG. 20).
- Here, X is 0 in the case of L0 prediction and 1 in the case of L1 prediction (the same applies hereinafter).
- The spatial prediction motion vector candidate derivation units 321 and 421 derive a predicted motion vector candidate from the blocks adjacent to the upper side. In this process, the predicted motion vector mvLXB is derived by referring to the inter-prediction information of the block adjacent to the upper side, that is, the flag indicating whether or not the predicted motion vector candidate can be used, the motion vector, the reference index, and the like, and if the derived mvLXA and mvLXB are not equal, mvLXB is added to the predicted motion vector candidate list mvpListLX (step S302 in FIG. 20).
- The processing of steps S301 and S302 derives, among other values, the reference index refIdxN (N indicates A or B; the same applies hereinafter).
- The time prediction motion vector candidate derivation units 322 and 422 derive a predicted motion vector candidate from a block in a picture whose time is different from that of the current processing target picture. In this process, the flag availableFlagLXCol indicating whether the predicted motion vector candidate of the coded block of the picture at a different time is available, the motion vector mvLXCol, the reference index refIdxCol, and the reference list listCol are derived, and mvLXCol is added to the predicted motion vector candidate list mvpListLX (step S303 in FIG. 20).
- The processing of the time prediction motion vector candidate derivation units 322 and 422 can be omitted in units of sequence (SPS), picture (PPS), or slice.
- The history prediction motion vector candidate derivation units 323 and 423 add the history prediction motion vector candidates registered in the history prediction motion vector candidate list HmvpCandList to the predicted motion vector candidate list mvpListLX (step S304 in FIG. 20). The details of the registration processing procedure in step S304 will be described later with reference to a flowchart.
- The predicted motion vector candidate supplementing units 325 and 425 add predicted motion vector candidates having a predetermined value such as (0,0) until the predicted motion vector candidate list mvpListLX is full (step S305 in FIG. 20).
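- The order of steps S301 to S305 can be sketched as follows. This is an illustrative outline only: the function name `build_mvp_list`, the use of `None` for an unavailable candidate, and the capacity checks before the time and history candidates are assumptions. The capacity of two follows the maximum number of elements stated for mvpListLX, and the history candidates are passed in as motion vectors that are assumed to have been selected already.

```python
from typing import List, Optional, Tuple

MotionVector = Tuple[int, int]
MAX_MVP_CAND = 2  # maximum number of elements of mvpListLX


def build_mvp_list(mv_a: Optional[MotionVector],
                   mv_b: Optional[MotionVector],
                   mv_col: Optional[MotionVector],
                   history_mvs: List[MotionVector]) -> List[MotionVector]:
    """Build mvpListLX in the order spatial A, spatial B, time, history, fill."""
    mvp_list: List[MotionVector] = []
    # Step S301: candidate from the left neighbours (A0 or A1).
    if mv_a is not None:
        mvp_list.append(mv_a)
    # Step S302: candidate from the upper neighbours, only if it differs from mvLXA.
    if mv_b is not None and mv_b != mv_a:
        mvp_list.append(mv_b)
    # Step S303: time candidate (capacity check is an assumption).
    if mv_col is not None and len(mvp_list) < MAX_MVP_CAND:
        mvp_list.append(mv_col)
    # Step S304: history candidates, while room remains.
    for mv in history_mvs:
        if len(mvp_list) >= MAX_MVP_CAND:
            break
        mvp_list.append(mv)
    # Step S305: fill with (0, 0) until the list is full.
    while len(mvp_list) < MAX_MVP_CAND:
        mvp_list.append((0, 0))
    return mvp_list


if __name__ == "__main__":
    print(build_mvp_list((1, 1), (1, 1), None, []))
```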
- The normal merge mode derivation unit 302 of FIG. 18 includes a spatial merge candidate derivation unit 341, a time merge candidate derivation unit 342, an average merge candidate derivation unit 344, a history merge candidate derivation unit 345, a merge candidate replenishment unit 346, and a merge candidate selection unit 347.
- The normal merge mode derivation unit 402 of FIG. 24 includes a spatial merge candidate derivation unit 441, a time merge candidate derivation unit 442, an average merge candidate derivation unit 444, a history merge candidate derivation unit 445, a merge candidate replenishment unit 446, and a merge candidate selection unit 447.
- FIG. 21 is a flowchart describing the procedure of the normal merge mode derivation process, which has a function common to the normal merge mode derivation unit 302 of the image coding device and the normal merge mode derivation unit 402 of the image decoding device according to the embodiment of the present invention.
- The normal merge mode derivation unit 302 and the normal merge mode derivation unit 402 each include a merge candidate list mergeCandList.
- The merge candidate list mergeCandList has a list structure and is provided with a storage area that stores, as elements, a merge index indicating a location inside the merge candidate list and the merge candidate corresponding to that index. The merge index starts from 0, and the merge candidates are stored in the storage area of the merge candidate list mergeCandList.
- The merge candidate with merge index i registered in the merge candidate list mergeCandList is represented by mergeCandList[i].
- The merge candidate list mergeCandList can register at least six merge candidates (inter-prediction information). In addition, the variable numCurrMergeCand, which indicates the number of merge candidates registered in the merge candidate list mergeCandList, is set to 0.
- The spatial merge candidate derivation unit 341 and the spatial merge candidate derivation unit 441 use the coding information stored in the coding information storage memory 111 of the image coding device or the coding information storage memory 205 of the image decoding device to derive spatial merge candidates from the blocks adjacent to the processing target block (B1, A1, B0, A0, and B2 in FIG. 11) in the order of B1, A1, B0, A0, B2, and register the derived spatial merge candidates in the merge candidate list mergeCandList (step S401 in FIG. 21).
- Here, N is defined as indicating any of B1, A1, B0, A0, B2, or the time merge candidate Col.
- The flag availableFlagN indicating whether the inter-prediction information of block N can be used as a spatial merge candidate, the reference index refIdxL0N of L0 and the reference index refIdxL1N of L1 of spatial merge candidate N, the L0 prediction flag indicating whether L0 prediction is performed, and related information are derived.
- However, since the merge candidate is derived without referring to the inter-prediction information of blocks included in the coded block to be processed, spatial merge candidates that use the inter-prediction information of blocks included in the coded block to be processed are not derived.
- The time merge candidate derivation unit 342 and the time merge candidate derivation unit 442 derive time merge candidates from pictures at different times and register the derived time merge candidates in the merge candidate list mergeCandList (step S402 in FIG. 21).
- The flag availableFlagCol indicating whether the time merge candidate is available, the L0 prediction flag predFlagL0Col indicating whether L0 prediction of the time merge candidate is performed, the L1 prediction flag predFlagL1Col indicating whether L1 prediction is performed, the motion vector mvL0Col of L0, and the motion vector mvL1Col of L1 are derived.
- The processing of the time merge candidate derivation unit 342 and the time merge candidate derivation unit 442 can be omitted in units of sequence (SPS), picture (PPS), or slice.
- The history merge candidate derivation unit 345 and the history merge candidate derivation unit 445 register the history prediction motion vector candidates registered in the history prediction motion vector candidate list HmvpCandList in the merge candidate list mergeCandList (step S403 in FIG. 21). If the number of merge candidates numCurrMergeCand registered in the merge candidate list mergeCandList is smaller than the maximum number of merge candidates MaxNumMergeCand, history merge candidates are derived, with numCurrMergeCand limited to MaxNumMergeCand as the upper limit, and registered in the merge candidate list mergeCandList.
- The average merge candidate derivation unit 344 and the average merge candidate derivation unit 444 derive the average merge candidate from the merge candidate list mergeCandList and add the derived average merge candidate to the merge candidate list mergeCandList (step S404 in FIG. 21). If the number of merge candidates numCurrMergeCand registered in the merge candidate list mergeCandList is smaller than the maximum number of merge candidates MaxNumMergeCand, the average merge candidate is derived, with numCurrMergeCand limited to MaxNumMergeCand as the upper limit, and registered in the merge candidate list mergeCandList.
- The average merge candidate is a merge candidate having a motion vector obtained by averaging, for each of L0 prediction and L1 prediction, the motion vectors of the first merge candidate and the second merge candidate registered in the merge candidate list mergeCandList.
- If the number of merge candidates numCurrMergeCand registered in the merge candidate list mergeCandList is smaller than the maximum number of merge candidates MaxNumMergeCand, the merge candidate replenishment unit 346 and the merge candidate replenishment unit 446 derive additional merge candidates, with numCurrMergeCand limited to MaxNumMergeCand as the upper limit, and register them in the merge candidate list mergeCandList (step S405 in FIG. 21).
- Merge candidates whose motion vector has a value of (0,0) and whose prediction mode is L0 prediction (Pred_L0) are added.
- Merge candidates whose motion vector has a value of (0,0) and whose prediction mode is bi-prediction (Pred_BI) are added.
- The reference index used when adding a merge candidate is different from the reference indexes already added.
- The merge candidate selection unit 347 and the merge candidate selection unit 447 select a merge candidate from the merge candidates registered in the merge candidate list mergeCandList.
- The merge candidate selection unit 347 on the coding side selects the merge candidate by calculating the code amount and the distortion amount, and supplies the merge index indicating the selected merge candidate and the inter-prediction information of the merge candidate to the motion compensation prediction unit 306 via the inter-prediction mode determination unit 305.
- The merge candidate selection unit 447 on the decoding side selects the merge candidate based on the decoded merge index and supplies the selected merge candidate to the motion compensation prediction unit 406.
- FIG. 26 is a flowchart illustrating a procedure for initializing and updating the history prediction motion vector candidate list.
- The history prediction motion vector candidate list HmvpCandList is updated in the coding information storage memory 111 and the coding information storage memory 205.
- Alternatively, a history prediction motion vector candidate list update unit may be provided in the inter prediction unit 102 and the inter prediction unit 203 to update the history prediction motion vector candidate list HmvpCandList.
- On the coding side, the history prediction motion vector candidate list HmvpCandList is updated when the normal prediction motion vector mode or the normal merge mode is selected by the prediction method determination unit 105.
- On the decoding side, the history prediction motion vector candidate list HmvpCandList is updated when the prediction information decoded by the bit string decoding unit 201 is in the normal prediction motion vector mode or the normal merge mode.
- The inter-prediction information used when performing inter-prediction in the normal prediction motion vector mode or the normal merge mode is registered in the history prediction motion vector candidate list HmvpCandList as the inter-prediction information candidate hMvpCand.
- The inter-prediction information candidate hMvpCand includes the reference index refIdxL0 of L0 and the reference index refIdxL1 of L1, the L0 prediction flag predFlagL0 indicating whether L0 prediction is performed, the L1 prediction flag predFlagL1 indicating whether L1 prediction is performed, the motion vector mvL0 of L0, and the motion vector mvL1 of L1.
- If, among the elements (that is, the inter-prediction information) registered in the history prediction motion vector candidate list HmvpCandList provided in the coding information storage memory 111 on the coding side and the coding information storage memory 205 on the decoding side, there is inter-prediction information with the same value as the inter-prediction information candidate hMvpCand, that element is deleted from the history prediction motion vector candidate list HmvpCandList. On the other hand, if there is no inter-prediction information with the same value as the inter-prediction information candidate hMvpCand, the first element of the history prediction motion vector candidate list HmvpCandList is deleted, and the inter-prediction information candidate hMvpCand is added to the end of the history prediction motion vector candidate list HmvpCandList.
- The number of elements of the history prediction motion vector candidate list HmvpCandList provided in the coding information storage memory 111 on the coding side and the coding information storage memory 205 on the decoding side of the present invention is 6.
- The history prediction motion vector candidate list HmvpCandList is initialized for each slice (step S2101 in FIG. 26). At the beginning of the slice, all elements of the history prediction motion vector candidate list HmvpCandList are emptied, and the value of NumHmvpCand, the number of history prediction motion vector candidates registered in the history prediction motion vector candidate list HmvpCandList (the current number of candidates), is set to 0.
- Although the initialization of the history prediction motion vector candidate list HmvpCandList is performed in slice units (at the first coded block of a slice), it may instead be performed in picture units, tile units, or tree block line units.
- A FALSE value is set in the flag identicalCandExist indicating whether or not an identical candidate exists, and 0 is set in the removal target index removeIdx indicating the candidate to be deleted (step S2103 in FIG. 26).
- It is determined whether or not the inter-prediction information candidate hMvpCand to be registered exists (step S2104 in FIG. 26).
- When the prediction method determination unit 105 on the coding side determines the normal prediction motion vector mode or the normal merge mode, or when the bit string decoding unit 201 on the decoding side decodes the mode as the normal prediction motion vector mode or the normal merge mode, the inter-prediction information is set as the inter-prediction information candidate hMvpCand to be registered.
- When the prediction method determination unit 105 on the coding side determines the intra prediction mode, the subblock prediction motion vector mode, or the subblock merge mode, or when the bit string decoding unit 201 on the decoding side decodes the mode as the intra prediction mode, the subblock prediction motion vector mode, or the subblock merge mode, the inter-prediction information candidate hMvpCand to be registered does not exist (step S2104: NO in FIG. 26). If the inter-prediction information candidate hMvpCand to be registered exists, the processing of step S2105 and subsequent steps is performed (step S2104: YES in FIG. 26).
- FIG. 27 is a flowchart of the identical element confirmation processing procedure. When the value of the number of history prediction motion vector candidates NumHmvpCand is 0 (step S2121: NO in FIG. 27), the history prediction motion vector candidate list HmvpCandList is empty and no identical candidate exists, so steps S2122 to S2125 in FIG. 27 are skipped and the identical element confirmation processing procedure ends.
- When the value of the number of history prediction motion vector candidates NumHmvpCand is larger than 0 (step S2121: YES in FIG. 27), the process of step S2123 is repeated for the history prediction motion vector index hMvpIdx from 0 to NumHmvpCand-1 (steps S2122 to S2125 in FIG. 27). First, HmvpCandList[hMvpIdx], the hMvpIdx-th element of the history prediction motion vector candidate list counting from 0, is compared with the inter-prediction information candidate hMvpCand to determine whether they are the same (step S2123 in FIG. 27).
- If they are the same (step S2123: YES in FIG. 27), a TRUE value is set in the flag identicalCandExist indicating whether or not an identical candidate exists, and the removal target index removeIdx indicating the position of the element to be deleted is set to the current value of the history prediction motion vector index hMvpIdx.
- FIG. 28 is a flowchart of the element shift and addition processing procedure of the history prediction motion vector candidate list HmvpCandList in step S2106.
- First, it is determined whether to remove an element stored in the history prediction motion vector candidate list HmvpCandList and then add a new element, or to add a new element without removing any element. Specifically, it is determined whether the flag identicalCandExist indicating whether or not an identical candidate exists is TRUE, or whether NumHmvpCand is 6 (step S2141 in FIG. 28).
- If either condition is satisfied (step S2141: YES in FIG. 28), a new element is added after removing an element stored in the history prediction motion vector candidate list HmvpCandList.
- The initial value of the index i is set to removeIdx + 1, and the element shift process of step S2143 is repeated from this initial value to NumHmvpCand (steps S2142 to S2144 in FIG. 28).
- In the element shift process, the element is shifted forward (step S2143 in FIG. 28).
- After the element shift process is completed, the inter-prediction information candidate hMvpCand is added to HmvpCandList[NumHmvpCand-1], the (NumHmvpCand-1)-th element counting from 0, which corresponds to the end of the history prediction motion vector candidate list (step S2145 in FIG. 28), and the element shift and addition process of the history prediction motion vector candidate list HmvpCandList ends.
- On the other hand, if neither condition is satisfied, that is, the flag identicalCandExist is not TRUE and NumHmvpCand is not 6 (step S2141: NO in FIG. 28), the inter-prediction information candidate hMvpCand is added to the end of the history prediction motion vector candidate list without removing any element stored in the history prediction motion vector candidate list HmvpCandList (step S2146 in FIG. 28).
- Here, the end of the history prediction motion vector candidate list is HmvpCandList[NumHmvpCand], the NumHmvpCand-th element counting from 0.
- NumHmvpCand is then incremented by 1, and the element shift and addition process of the history prediction motion vector candidate list HmvpCandList ends.
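- The identical element confirmation of FIG. 27 and the element shift and addition of FIG. 28 can be illustrated together with a short Python sketch. The function name `update_hmvp_list` is hypothetical, candidates are assumed to be comparable with `==`, and a Python list stands in for the fixed storage area, so deleting one entry plays the role of shifting the later elements forward one by one.

```python
from typing import Hashable, List

MAX_HMVP_CAND = 6  # number of elements of HmvpCandList


def update_hmvp_list(hmvp_list: List[Hashable], cand: Hashable) -> None:
    """Register cand in the history list in place, newest element last."""
    # FIG. 27: look for an element identical to the new candidate.
    identical_cand_exist = False
    remove_idx = 0  # defaults to the first (oldest) element
    for idx, elem in enumerate(hmvp_list):
        if elem == cand:
            identical_cand_exist = True
            remove_idx = idx
            break

    # FIG. 28, step S2141: remove one element if a duplicate exists
    # or if the list already holds six elements.
    if identical_cand_exist or len(hmvp_list) == MAX_HMVP_CAND:
        # Equivalent to shifting every later element forward by one.
        del hmvp_list[remove_idx]

    # Steps S2145 / S2146: the new candidate always goes to the end.
    hmvp_list.append(cand)


if __name__ == "__main__":
    history: List[str] = []
    for name in ["HMVP0", "HMVP1", "HMVP2", "HMVP1"]:
        update_hmvp_list(history, name)
    print(history)
```

- With this arrangement the most recently used inter-prediction information is always at the end of the list, which is why the derivation procedures read the list from index NumHmvpCand - 1 downward.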
- FIG. 31 is a diagram illustrating an example of the update process of the history prediction motion vector candidate list.
- In this example, the element HMVP2 is deleted from the history prediction motion vector candidate list HmvpCandList, the subsequent elements HMVP3 to HMVP5 are shifted (copied) forward one by one, and a new element is added to the end of the history prediction motion vector candidate list HmvpCandList (FIG. 31B), which completes the update of the history prediction motion vector candidate list HmvpCandList (FIG. 31C).
- FIG. 29 is a flowchart illustrating a history prediction motion vector candidate derivation processing procedure.
- When the current number of predicted motion vector candidates numCurrMvpCand is less than 2, which is the maximum number of elements in the predicted motion vector candidate list mvpListLX, and the value of the number of history prediction motion vector candidates NumHmvpCand is greater than 0 (step S2201: YES in FIG. 29), the processes of steps S2202 to S2209 of FIG. 29 are performed.
- Steps S2203 to S2208 of FIG. 29 are repeated for the index i from 1 up to the smaller of 4 and the number of history prediction motion vector candidates numCheckedHMVPCand (steps S2202 to S2209 of FIG. 29).
- If the current number of predicted motion vector candidates numCurrMvpCand is 2 or more, which is the maximum number of elements of the predicted motion vector candidate list mvpListLX (step S2203: NO in FIG. 29), the processing of steps S2204 to S2209 in FIG. 29 is omitted and the history prediction motion vector candidate derivation processing procedure ends.
- If the current number of predicted motion vector candidates numCurrMvpCand is smaller than 2, which is the maximum number of elements of the predicted motion vector candidate list mvpListLX (step S2203: YES in FIG. 29), the processes from step S2204 in FIG. 29 onward are performed.
- Steps S2205 to S2207 are performed for Y equal to 0 and 1 (L0 and L1), respectively (steps S2204 to S2208 in FIG. 29).
- When the condition of step S2205 is not satisfied, the processing of steps S2206 to S2209 in FIG. 29 is omitted and the history prediction motion vector candidate derivation processing procedure ends. When it is satisfied (step S2205: YES in FIG. 29), the processes from step S2206 in FIG. 29 onward are performed.
- In steps S2206 and S2207 of FIG. 29, the LY motion vector of the history prediction motion vector candidate HmvpCandList[NumHmvpCand - i] is added to mvpListLX[numCurrMvpCand], the numCurrMvpCand-th element counting from 0 of the predicted motion vector candidate list, and the current number of predicted motion vector candidates numCurrMvpCand is incremented by 1. When the candidate is not added, the additional processing of step S2207 is skipped.
- The above processing of steps S2205 to S2207 of FIG. 29 is performed for both L0 and L1 (steps S2204 to S2208 of FIG. 29).
- After that, the processes from step S2203 onward are performed again for the next index i (steps S2202 to S2209 in FIG. 29).
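- The loop structure of FIG. 29 can be sketched as below. Several points are assumptions and are marked as such in the comments: the condition under which an LY motion vector is added is not legible in the text above, so the sketch assumes that LY prediction must be in use and that its reference index must equal the reference index `ref_idx_lx` of the motion vector being coded or decoded. The class `HmvpCand`, the function name, and the use of the list length in place of numCheckedHMVPCand are also hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

MotionVector = Tuple[int, int]
MAX_MVP_CAND = 2      # maximum number of elements of mvpListLX
MAX_CHECKED_HMVP = 4  # upper bound on the index i


@dataclass(frozen=True)
class HmvpCand:
    pred_flag: Tuple[bool, bool]           # (predFlagL0, predFlagL1)
    ref_idx: Tuple[int, int]               # (refIdxL0, refIdxL1)
    mv: Tuple[MotionVector, MotionVector]  # (mvL0, mvL1)


def derive_history_mvp(mvp_list: List[MotionVector],
                       hmvp_list: List[HmvpCand],
                       ref_idx_lx: int) -> List[MotionVector]:
    """Append history candidates to mvp_list, newest first, until it is full."""
    num_hmvp = len(hmvp_list)
    # Step S2201: nothing to do if the list is full or the history is empty.
    if len(mvp_list) >= MAX_MVP_CAND or num_hmvp == 0:
        return mvp_list
    # Steps S2202 to S2209: i runs from 1 to min(4, number of candidates).
    for i in range(1, min(MAX_CHECKED_HMVP, num_hmvp) + 1):
        if len(mvp_list) >= MAX_MVP_CAND:       # step S2203
            break
        cand = hmvp_list[num_hmvp - i]
        for y in (0, 1):                        # steps S2204 to S2208
            if len(mvp_list) >= MAX_MVP_CAND:   # step S2205 (assumed check)
                return mvp_list
            # Assumed condition for adding the LY motion vector.
            if cand.pred_flag[y] and cand.ref_idx[y] == ref_idx_lx:
                mvp_list.append(cand.mv[y])     # add and count the candidate
    return mvp_list


if __name__ == "__main__":
    older = HmvpCand((True, False), (0, -1), ((1, 1), (0, 0)))
    newer = HmvpCand((True, True), (1, 0), ((2, 2), (3, 3)))
    print(derive_history_mvp([], [older, newer], ref_idx_lx=0))
```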
- Next, the procedure for deriving history merge candidates from the history merge candidate list HmvpCandList, which is the processing procedure of step S404 of FIG. 21 and is common to the history merge candidate derivation unit 345 of the normal merge mode derivation unit 302 on the coding side and the history merge candidate derivation unit 445 of the normal merge mode derivation unit 402 on the decoding side, will be described in detail.
- FIG. 30 is a flowchart illustrating a history merge candidate derivation processing procedure.
- First, the initialization process is performed (step S2301 in FIG. 30).
- The initial value of the index hMvpIdx is set to 1, and the additional processing from step S2303 to step S2310 in FIG. 30 is repeated from this initial value to NumHmvpCand (steps S2302 to S2311 in FIG. 30). If the number of elements numCurrMergeCand registered in the current merge candidate list is not less than or equal to (the maximum number of merge candidates MaxNumMergeCand - 1), merge candidates have been added to all elements in the merge candidate list, so this history merge candidate derivation process ends (step S2303: NO in FIG. 30).
- If the number of elements numCurrMergeCand registered in the current merge candidate list is less than or equal to (the maximum number of merge candidates MaxNumMergeCand - 1), the processing of step S2304 and subsequent steps is performed. A FALSE value is set in sameMotion (step S2304 in FIG. 30). Subsequently, the initial value of the index i is set to 0, and the processes of steps S2306 and S2307 of FIG. 30 are performed from this initial value to numOrigMergeCand-1 (steps S2305 to S2308 of FIG. 30).
- When sameMotion is FALSE (step S2309: YES in FIG. 30), HmvpCandList[NumHmvpCand - hMvpIdx], the (NumHmvpCand - hMvpIdx)-th element counting from 0 of the history prediction motion vector candidate list, does not exist in mergeCandList, so this element is added to mergeCandList[numCurrMergeCand], the numCurrMergeCand-th element of the merge candidate list, and numCurrMergeCand is incremented by 1 (step S2310 in FIG. 30).
- The index hMvpIdx is incremented by 1 (step S2302 in FIG. 30), and the iterative processing of steps S2302 to S2311 in FIG. 30 is repeated.
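- A compact Python sketch of the loop of FIG. 30 follows. It assumes that the initialization of step S2301 records the number of merge candidates present before history candidates are added (numOrigMergeCand), and that steps S2306 and S2307 compare the history candidate with mergeCandList[i] and set sameMotion to TRUE on a match; neither detail is spelled out in the text above. The function name is hypothetical and candidates are assumed to be comparable with `==`.

```python
from typing import Hashable, List


def derive_history_merge(merge_list: List[Hashable],
                         hmvp_list: List[Hashable],
                         max_num_merge_cand: int) -> List[Hashable]:
    """Append history candidates to merge_list, newest first, skipping duplicates."""
    num_orig_merge_cand = len(merge_list)  # assumed content of step S2301
    num_hmvp = len(hmvp_list)
    # Steps S2302 to S2311: hMvpIdx runs from 1 to NumHmvpCand.
    for hmvp_idx in range(1, num_hmvp + 1):
        # Step S2303: stop unless numCurrMergeCand <= MaxNumMergeCand - 1.
        if len(merge_list) > max_num_merge_cand - 1:
            break
        cand = hmvp_list[num_hmvp - hmvp_idx]
        same_motion = False                        # step S2304
        for i in range(num_orig_merge_cand):       # steps S2305 to S2308
            if merge_list[i] == cand:              # assumed comparison
                same_motion = True
        if not same_motion:                        # step S2309
            merge_list.append(cand)                # step S2310
    return merge_list


if __name__ == "__main__":
    print(derive_history_merge(["a", "b"], ["x", "b", "y"], 4))
```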
- Next, the method of deriving the average merge candidate, which is the processing procedure of step S403 in FIG. 21 and is common to the average merge candidate derivation unit 344 of the normal merge mode derivation unit 302 on the coding side and the average merge candidate derivation unit 444 of the normal merge mode derivation unit 402 on the decoding side, will be described in detail.
- FIG. 38 is a flowchart illustrating a procedure for deriving the average merge candidate.
- First, the initialization process is performed (step S1301 in FIG. 38).
- Subsequently, the processing is repeated over the indexes i and j of the merge candidate list (steps S1302 to S1303 in FIG. 38). If the number of elements numCurrMergeCand registered in the current merge candidate list is not less than or equal to (the maximum number of merge candidates MaxNumMergeCand - 1), merge candidates have been added to all elements in the merge candidate list, so this average merge candidate derivation process ends (step S1304 in FIG. 38). If the number of elements numCurrMergeCand registered in the current merge candidate list is less than or equal to (the maximum number of merge candidates MaxNumMergeCand - 1), the processing of step S1305 and subsequent steps is performed.
- It is determined whether or not the i-th motion information mergeCandList[i] of the merge candidate list and the j-th motion information mergeCandList[j] of the merge candidate list are both invalid (step S1305 in FIG. 38). If both are invalid, the process moves on to the next element without deriving the average merge candidate of mergeCandList[i] and mergeCandList[j]. If mergeCandList[i] and mergeCandList[j] are not both invalid, the following processing is repeated with X as 0 and 1 (steps S1306 to S1314 in FIG. 38).
- It is determined whether the LX prediction of mergeCandList[i] is valid (step S1307 in FIG. 38). If the LX predictions of both mergeCandList[i] and mergeCandList[j] are valid (step S1308 in FIG. 38), an average merge candidate of the LX prediction is derived that has the LX prediction motion vector obtained by averaging the LX prediction motion vectors of mergeCandList[i] and mergeCandList[j] and the LX prediction reference index of mergeCandList[i]. It is set to the LX prediction of averageCand, and the LX prediction of averageCand is enabled (step S1309 in FIG. 38).
- If the LX prediction of mergeCandList[j] is not valid in step S1308, an average merge candidate of the LX prediction having the LX prediction motion vector and reference index of mergeCandList[i] is derived and set to the LX prediction of averageCand, and the LX prediction of averageCand is enabled (step S1310 in FIG. 38). If the LX prediction of mergeCandList[i] is not valid in step S1307 of FIG. 38, it is determined whether or not the LX prediction of mergeCandList[j] is valid (step S1311 of FIG. 38).
- If the LX prediction of mergeCandList[j] is valid, that is, if the LX prediction of mergeCandList[i] is invalid and the LX prediction of mergeCandList[j] is valid, an average merge candidate of the LX prediction having the LX prediction motion vector and reference index of mergeCandList[j] is derived and set to the LX prediction of averageCand, and the LX prediction of averageCand is enabled (step S1312 in FIG. 38). If the LX prediction of mergeCandList[j] is also not valid in step S1311, no LX prediction is derived for averageCand.
- The average merge candidate averageCand of L0 prediction, L1 prediction, or BI prediction generated as described above is added to mergeCandList[numCurrMergeCand], the numCurrMergeCand-th element of the merge candidate list, and numCurrMergeCand is incremented by 1 (step S1315 in FIG. 38). This completes the process of deriving the average merge candidate.
- The average merge candidate is obtained by averaging the horizontal components and the vertical components of the motion vectors separately.
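- The per-list decision described for FIG. 38 can be illustrated as follows. The class `MergeCand`, the use of `None` for an LX prediction that is not valid, and the rounding rule (floor division by 2) are assumptions for this sketch; the text above states only that the horizontal and vertical components are averaged separately and that the reference index of mergeCandList[i] is used when both LX predictions are valid.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

MotionVector = Tuple[int, int]


@dataclass
class MergeCand:
    mv: List[Optional[MotionVector]]  # [mvL0, mvL1]; None means LX is not valid
    ref_idx: List[int]                # [refIdxL0, refIdxL1]; -1 when not valid


def average_candidate(ci: MergeCand, cj: MergeCand) -> Optional[MergeCand]:
    """Derive averageCand from two merge candidates, or None if both are invalid."""
    mv: List[Optional[MotionVector]] = [None, None]
    ref_idx = [-1, -1]
    for x in (0, 1):  # X = 0 (L0) and X = 1 (L1)
        a, b = ci.mv[x], cj.mv[x]
        if a is not None and b is not None:
            # Both valid: average each component, keep the reference index of i.
            mv[x] = ((a[0] + b[0]) // 2, (a[1] + b[1]) // 2)  # rounding assumed
            ref_idx[x] = ci.ref_idx[x]
        elif a is not None:
            # Only i valid: copy its motion vector and reference index.
            mv[x], ref_idx[x] = a, ci.ref_idx[x]
        elif b is not None:
            # Only j valid: copy its motion vector and reference index.
            mv[x], ref_idx[x] = b, cj.ref_idx[x]
    if mv[0] is None and mv[1] is None:
        return None  # both candidates invalid: nothing is derived
    return MergeCand(mv, ref_idx)


if __name__ == "__main__":
    first = MergeCand([(4, 2), None], [0, -1])
    second = MergeCand([(2, 6), (8, 8)], [1, 2])
    print(average_candidate(first, second))
```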
- The motion compensation prediction unit 306 acquires the position and size of the block currently subject to prediction processing in coding. Further, the motion compensation prediction unit 306 acquires the inter-prediction information from the inter-prediction mode determination unit 305. The reference index and motion vector are derived from the acquired inter-prediction information, and a prediction signal is generated after acquiring, from the reference picture specified by the reference index in the decoded image memory 104, the image signal at the position moved by the motion vector from the same position as the image signal of the prediction block.
- When the inter-prediction mode is prediction from a single reference picture, such as L0 prediction or L1 prediction, the prediction signal acquired from one reference picture is used as the motion compensation prediction signal.
- When the inter-prediction mode is prediction from two reference pictures, such as BI prediction, the weighted average of the prediction signals acquired from the two reference pictures is used as the motion compensation prediction signal.
- The motion compensation prediction signal is supplied to the prediction method determination unit 105. Here, the ratio of the weighted average of bi-prediction is 1:1, but the weighted average may be performed using other ratios.
- For example, the weighting ratio may be increased as the distance between the picture to be predicted and the reference picture becomes smaller. Further, the weighting ratio may be calculated by using a correspondence table between combinations of picture intervals and weighting ratios.
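- The weighted average of the two prediction signals can be illustrated with a small Python sketch. The function names, the integer rounding, and the particular distance-based rule (each weight equal to the picture distance of the other reference picture, so that the nearer picture weighs more) are assumptions chosen for illustration; the text above only states that the default ratio is 1:1 and that other ratios, including distance-dependent ones, may be used.

```python
from typing import List, Tuple


def bi_predict(pred_l0: List[int], pred_l1: List[int],
               w0: int = 1, w1: int = 1) -> List[int]:
    """Weighted average of two prediction signals, rounded to nearest integer."""
    total = w0 + w1
    return [(w0 * a + w1 * b + total // 2) // total
            for a, b in zip(pred_l0, pred_l1)]


def distance_weights(cur_time: int, ref0_time: int, ref1_time: int) -> Tuple[int, int]:
    """Hypothetical rule: the reference picture closer in time gets the larger weight."""
    d0 = abs(cur_time - ref0_time)
    d1 = abs(cur_time - ref1_time)
    if d0 + d1 == 0:
        return (1, 1)  # both references coincide with the current picture
    return (d1, d0)


if __name__ == "__main__":
    # Default 1:1 ratio.
    print(bi_predict([100, 50], [110, 60]))
    # Distance-based ratio: the L0 reference is one picture away, L1 is four away.
    w0, w1 = distance_weights(4, 3, 8)
    print(bi_predict([100], [50], w0, w1))
```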
- The motion compensation prediction unit 406 has the same function as the motion compensation prediction unit 306 on the coding side.
- The motion compensation prediction unit 406 acquires the inter-prediction information from the normal prediction motion vector mode derivation unit 401, the normal merge mode derivation unit 402, the subblock prediction motion vector mode derivation unit 403, and the subblock merge mode derivation unit 404 via the switch 408.
- The motion compensation prediction unit 406 supplies the obtained motion compensation prediction signal to the decoded image signal superimposition unit 207.
- <About inter-prediction mode> The process of making a prediction from a single reference picture is defined as uni-prediction. In the case of uni-prediction, the prediction uses either one of the two reference pictures registered in the reference lists L0 and L1, that is, L0 prediction or L1 prediction.
- FIG. 32 shows a case of uni-prediction in which the reference picture of L0 (RefL0Pic) is at a time before the processing target picture (CurPic).
- FIG. 33 shows a case of uni-prediction in which the reference picture of the L0 prediction is at a time after the processing target picture.
- Similarly, the reference picture of the L0 prediction in FIGS. 32 and 33 can be replaced with the reference picture of the L1 prediction (RefL1Pic) to perform uni-prediction.
- FIG. 34 shows a case of bi-prediction in which the reference picture of the L0 prediction is at a time before the processing target picture and the reference picture of the L1 prediction is at a time after the processing target picture.
- FIG. 35 shows a case of bi-prediction in which the reference picture of the L0 prediction and the reference picture of the L1 prediction are both at a time before the processing target picture.
- FIG. 36 shows a case of bi-prediction in which the reference picture of the L0 prediction and the reference picture of the L1 prediction are both at a time after the processing target picture.
- L0 prediction and L1 prediction may be performed using the same reference picture. Whether the motion compensation prediction is performed by uni-prediction or bi-prediction is determined based on, for example, information (for example, a flag) indicating whether or not to use L0 prediction and whether or not to use L1 prediction.
- <About reference index> In the embodiment of the present invention, in order to improve the accuracy of motion compensation prediction, the optimum reference picture can be selected from a plurality of reference pictures in motion compensation prediction. Therefore, the reference picture used in the motion compensation prediction is indicated by a reference index, and the reference index is encoded in the bit stream together with the differential motion vector.
- When the inter prediction information from the normal prediction motion vector mode derivation unit 301 is selected in the inter prediction mode determination unit 305, the motion compensation prediction unit 306 acquires this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the prediction method determination unit 105.
- Similarly, as shown in the inter prediction unit 203 on the decoding side of FIG. 22, when the switch 408 is connected to the normal prediction motion vector mode derivation unit 401 in the decoding process, the motion compensation prediction unit 406 acquires the inter prediction information from the normal prediction motion vector mode derivation unit 401, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the decoded image signal superimposing unit 207.
- When the inter prediction information from the normal merge mode derivation unit 302 is selected in the inter prediction mode determination unit 305, the motion compensation prediction unit 306 acquires this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the prediction method determination unit 105.
- Similarly, as shown in the inter prediction unit 203 on the decoding side of FIG. 22, when the switch 408 is connected to the normal merge mode derivation unit 402 in the decoding process, the motion compensation prediction unit 406 acquires the inter prediction information from the normal merge mode derivation unit 402, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the decoded image signal superimposing unit 207.
- <Motion compensation processing based on subblock prediction motion vector mode> As shown in the inter prediction unit 102 on the coding side of FIG. 16, when the inter prediction information from the subblock prediction motion vector mode derivation unit 303 is selected in the inter prediction mode determination unit 305, the motion compensation prediction unit 306 acquires this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the prediction method determination unit 105.
- Similarly, as shown in the inter prediction unit 203 on the decoding side of FIG. 22, when the switch 408 is connected to the subblock prediction motion vector mode derivation unit 403 in the decoding process, the motion compensation prediction unit 406 acquires the inter prediction information from the subblock prediction motion vector mode derivation unit 403, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the decoded image signal superimposing unit 207.
- When the inter prediction information from the subblock merge mode derivation unit 304 is selected in the inter prediction mode determination unit 305, the motion compensation prediction unit 306 acquires this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the prediction method determination unit 105.
- Similarly, as shown in the inter prediction unit 203 on the decoding side of FIG. 22, when the switch 408 is connected to the subblock merge mode derivation unit 404 in the decoding process, the motion compensation prediction unit 406 acquires the inter prediction information from the subblock merge mode derivation unit 404, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion compensation prediction signal. The generated motion compensation prediction signal is supplied to the decoded image signal superimposing unit 207.
- Motion compensation by the affine model can be used depending on the following flags.
- In the coding process, these flags are set according to the inter prediction conditions determined by the inter prediction mode determination unit 305, and are encoded in the bit stream.
- sps_affine_enabled_flag indicates whether or not motion compensation by the affine model can be used in inter prediction. If sps_affine_enabled_flag is 0, motion compensation by the affine model is disabled for the whole sequence, and inter_affine_flag and cu_affine_type_flag are not transmitted in the CU (coded block) syntax of the coded video sequence. If sps_affine_enabled_flag is 1, motion compensation by the affine model can be used in the coded video sequence.
- sps_affine_type_flag indicates whether or not motion compensation by the 6-parameter affine model can be used in inter prediction. If sps_affine_type_flag is 0, motion compensation by the 6-parameter affine model is disabled, and cu_affine_type_flag is not transmitted in the CU syntax of the coded video sequence. If sps_affine_type_flag is 1, motion compensation by the 6-parameter affine model can be used in the coded video sequence. If sps_affine_type_flag does not exist, it shall be 0.
- inter_affine_flag: when decoding a P or B slice, if inter_affine_flag is 1 in the CU currently being processed, motion compensation by the affine model is used to generate the motion compensation prediction signal of that CU. If inter_affine_flag is 0, the affine model is not used for the CU currently being processed. If inter_affine_flag does not exist, it shall be 0.
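- The presence rules for these flags can be summarized in a short sketch. The function names below are hypothetical and only restate the conditions given above; the real CU syntax may impose further conditions that are not described here.

```python
def infer_flag(value):
    """A flag that does not exist in the bit stream is treated as 0."""
    return 0 if value is None else value


def cu_affine_flags_transmittable(sps_affine_enabled_flag, sps_affine_type_flag=None):
    """Return which CU-level affine flags can be transmitted (hypothetical helper).

    Only the sequence-level conditions stated in the text are modeled.
    """
    enabled = infer_flag(sps_affine_enabled_flag) == 1
    six_param = infer_flag(sps_affine_type_flag) == 1
    return {
        # Not transmitted when affine motion compensation is disabled.
        "inter_affine_flag": enabled,
        # Not transmitted when affine is disabled or the 6-parameter model is disabled.
        "cu_affine_type_flag": enabled and six_param,
    }
```

For example, with sps_affine_enabled_flag equal to 1 and sps_affine_type_flag absent, only inter_affine_flag can appear in the CU syntax.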
- In motion compensation by the affine model, the motion compensation prediction signal is generated in subblock units using the reference index and motion vector to be processed.
- The 4-parameter affine model is a mode in which the motion vector of each subblock is derived from four parameters, namely the horizontal and vertical components of the motion vectors of two control points, and motion compensation is performed for each subblock.
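- As an illustration, the commonly used form of the 4-parameter affine model can be sketched as follows. This is a minimal sketch under assumptions that the text does not state: the two control points are taken to be the upper-left and upper-right corners of the block, the motion vector is evaluated at the center of each subblock, the subblock size is 4x4, and floating-point arithmetic is used instead of the fixed-point arithmetic of an actual codec. All function names are hypothetical.

```python
def affine4_mv(cp0, cp1, width, x, y):
    """Motion vector at position (x, y) relative to the upper-left corner.

    cp0: (mvx, mvy) at the upper-left control point (assumed position).
    cp1: (mvx, mvy) at the upper-right control point (assumed position).
    width: block width, i.e. the distance between the two control points.
    """
    a = (cp1[0] - cp0[0]) / width  # scaling term
    b = (cp1[1] - cp0[1]) / width  # rotation term
    return (a * x - b * y + cp0[0], b * x + a * y + cp0[1])


def subblock_mvs(cp0, cp1, width, height, sb=4):
    """Derive one motion vector per sb x sb subblock, sampled at its center."""
    mvs = {}
    for sy in range(0, height, sb):
        for sx in range(0, width, sb):
            mvs[(sx, sy)] = affine4_mv(cp0, cp1, width, sx + sb / 2, sy + sb / 2)
    return mvs
```

When both control points carry the same motion vector, every subblock receives that vector, which reduces to ordinary translational motion compensation.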
- FIG. 39A is an example in which the effective reference area is determined using the coded tree block as the unit of the intra-block copy reference block.
- Reference numerals 500, 501, 502, 503, and 504 in FIG. 39A are coded tree blocks, and 504 is the coded tree block to be processed.
- Reference numeral 505 is the coded block to be processed.
- The processing order of the coded tree blocks is 500, 501, 502, 503, 504.
- The three coded tree blocks 501, 502, and 503 processed immediately before the coded tree block 504, which includes the coded block 505 to be processed, are set as the effective reference area of the coded block 505.
- Coded tree blocks processed before the coded tree block 501, and all areas included in the coded tree block 504 that contains the coded block 505 to be processed, regardless of whether or not their processing is completed before the coded block 505, are invalid reference areas.
- FIG. 39B is an example in which the effective reference area is determined using a unit obtained by dividing the coded tree block into four as the intra-block copy reference block.
- Reference numerals 515 and 516 in FIG. 39B are coded tree blocks, and 516 is the coded tree block to be processed.
- The coded tree block 515 is divided into 506, 507, 508, and 509, and the coded tree block 516 is divided into 510, 511, 512, and 513.
- Reference numeral 514 is the coded block to be processed.
- The processing order of the intra-block copy reference blocks is 506, 507, 508, 509, 510, 511, 512, 513.
- The three intra-block copy reference blocks 508, 509, and 510 processed immediately before the intra-block copy reference block 511, which includes the coded block 514 to be processed, are set as the effective reference area of the coded block 514.
- Intra-block copy reference blocks processed before the intra-block copy reference block 508, and all areas included in the intra-block copy reference block 511 that contains the coded block 514 to be processed, regardless of whether or not their processing is completed before the coded block 514, are invalid reference areas.
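- The rule used in both FIG. 39A and FIG. 39B, where the effective reference area consists of the three intra-block copy reference blocks processed immediately before the one containing the target block, can be sketched as below. The function name and the list representation of the processing order are assumptions made for illustration.

```python
def effective_reference_blocks(processing_order, current, count=3):
    """Return the reference blocks that form the effective reference area.

    processing_order: reference block identifiers in processing order.
    current: identifier of the reference block containing the target block.
    count: number of immediately preceding blocks that remain referenceable.
    The current block itself and anything older are treated as invalid.
    """
    idx = processing_order.index(current)
    return processing_order[max(0, idx - count):idx]


# FIG. 39A: coded tree block units, target in 504
print(effective_reference_blocks([500, 501, 502, 503, 504], 504))
# FIG. 39B: quarter coded tree block units, target in 511
print(effective_reference_blocks([506, 507, 508, 509, 510, 511, 512, 513], 511))
```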
- The block vector detection unit 375 detects the block vector mvL (step S4500 in FIG. 44). Subsequently, the IBC space block vector candidate derivation unit 371, the IBC history block vector candidate derivation unit 372, the IBC prediction block vector candidate replenishment unit 373, the IBC prediction block vector candidate selection unit 376, and the block vector subtraction unit 378 calculate the difference block vector of the block vector used in the prediction block vector mode (steps S4501 to S4503 in FIG. 44).
- First, prediction block vector candidates are calculated and the prediction block vector candidate list mvpListL is constructed (step S4501 in FIG. 44). The IBC space block vector candidate derivation unit 371, the IBC history block vector candidate derivation unit 372, and the IBC prediction block vector candidate replenishment unit 373 in the intra block copy prediction unit 352 derive multiple prediction block vector candidates and construct the prediction block vector candidate list mvpListL. The detailed processing procedure of step S4501 of FIG. 44 will be described later using the flowchart of FIG. 47.
- The IBC prediction block vector candidate selection unit 376 selects the prediction block vector mvpL from the prediction block vector candidate list mvpListL (step S4502 in FIG. 44). It calculates the difference block vectors, each being the difference between the block vector mvL and a prediction block vector candidate mvpListL[i] stored in the prediction block vector candidate list mvpListL, and calculates, for each element of mvpListL, the code amount produced when that difference block vector is encoded.
- The prediction block vector candidate mvpListL[i] with the minimum code amount is selected as the prediction block vector mvpL, and its index i is acquired. When multiple prediction block vector candidates in mvpListL give the minimum generated code amount, the candidate mvpListL[i] with the smallest index i is selected as the optimal prediction block vector mvpL, and its index i is acquired.
- The block vector subtraction unit 378 subtracts the selected prediction block vector mvpL from the block vector mvL,
- mvdL = mvL - mvpL
- to calculate the difference block vector mvdL (step S4503 in FIG. 44).
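- Steps S4502 and S4503 can be sketched as follows. The text does not specify how the code amount of a difference block vector is measured, so this sketch assumes the bit length of a signed 0th-order Exp-Golomb code per component; that cost function and all function names are hypothetical.

```python
def se_bits(v):
    """Bit length of v as a signed 0th-order Exp-Golomb code (assumed cost)."""
    code_num = 2 * abs(v) - (1 if v > 0 else 0)
    return 2 * ((code_num + 1).bit_length() - 1) + 1


def select_prediction_block_vector(mv, mvp_list):
    """Pick the candidate with the smallest code amount and return (i, mvd).

    The strict '<' comparison keeps the smallest index when several
    candidates give the same minimum code amount.
    """
    best_i, best_cost = None, None
    for i, mvp in enumerate(mvp_list):
        cost = se_bits(mv[0] - mvp[0]) + se_bits(mv[1] - mvp[1])
        if best_cost is None or cost < best_cost:
            best_i, best_cost = i, cost
    mvp = mvp_list[best_i]
    mvd = (mv[0] - mvp[0], mv[1] - mvp[1])  # mvdL = mvL - mvpL
    return best_i, mvd
```

With mvL = (-64, 0) and candidates [(0, 0), (-64, 0), (-64, 0)], candidates 1 and 2 tie at the minimum cost, so index 1 is chosen and mvdL is (0, 0).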
- The prediction block vector mode processing procedure on the decoding side will be described with reference to FIG. 45.
- The IBC space block vector candidate derivation unit 471, the IBC history block vector candidate derivation unit 472, and the IBC prediction block vector replenishment unit 473 calculate the block vector to be used in the prediction block vector mode (steps S4600 to S4602 in FIG. 45). Specifically, the prediction block vector candidate list mvpListL is calculated, the prediction block vector mvpL is selected, and the block vector mvL is calculated.
- Prediction block vector candidates are calculated to construct the prediction block vector candidate list mvpListL (step S4601 in FIG. 45). The IBC space block vector candidate derivation unit 471, the IBC history block vector candidate derivation unit 472, and the IBC prediction block vector replenishment unit 473 in the intra block copy prediction unit 362 calculate multiple prediction block vector candidates and construct the prediction block vector candidate list mvpListL. A detailed description of the processing procedure of step S4601 of FIG. 45 is omitted.
- FIG. 47 is a flowchart showing the processing procedure of the prediction intra-block copy mode derivation process, which has a function common to the intra-block copy prediction unit 352 of the moving image coding device and the intra-block copy prediction unit 362 of the moving image decoding device according to the embodiment of the present invention.
- The intra-block copy prediction unit 352 and the intra-block copy prediction unit 362 each include a prediction block vector candidate list mvpListL.
- The prediction block vector candidate list mvpListL has a list structure, with a prediction block vector index indicating the location inside the list and a storage area that stores, as elements, the prediction block vector candidates corresponding to the index.
- Prediction block vector index numbers start from 0, and the prediction block vector candidates are stored in the storage area of the prediction block vector candidate list mvpListL.
- The prediction block vector candidate list mvpListL can register three prediction block vector candidates. The variable numCurrMvpIbcCand, which indicates the number of prediction block vector candidates registered in mvpListL, is set to 0.
- The IBC spatial block vector candidate derivation units 371 and 471 derive prediction block vector candidates from the blocks adjacent to the left side (step S4801 in FIG. 47).
- The flag availableFlagLA, which indicates whether the prediction block vector candidate of the block adjacent to the left side (A0 or A1) is available, and the block vector mvLA are derived, and mvLA is added to the prediction block vector candidate list mvpListL.
- The IBC spatial block vector candidate derivation units 371 and 471 then derive prediction block vector candidates from the blocks adjacent to the upper side (B0, B1, or B2) (step S4802 in FIG. 47).
- The flag availableFlagLB, which indicates whether the prediction block vector candidate of the block adjacent to the upper side is available, and the block vector mvLB are derived, and if mvLA and mvLB are not equal, mvLB is added to the prediction block vector candidate list mvpListL.
- The processing of steps S4801 and S4802 of FIG. 47 is common except for the positions and number of the adjacent blocks referred to. It derives the flag availableFlagLN, which indicates whether the prediction block vector candidate of the coded block can be used, and the block vector mvLN (N is A or B; the same applies below).
- The IBC history block vector candidate derivation units 372 and 472 add the history block vector candidates registered in the history block vector candidate list HmvpIbcCandList to the prediction block vector candidate list mvpListL.
- This registration process operates in the same way as the corresponding process for history prediction motion vector candidates, with the motion vector read as the block vector, the reference index list read as L0, and the history prediction motion vector candidate list HmvpCandList read as the history block vector candidate list HmvpIbcCandList, so its description is omitted.
- The IBC prediction block vector replenishment units 373 and 473 add block vectors having a predetermined value, such as (0, 0), until the prediction block vector candidate list mvpListL is full (S4804 in FIG. 47).
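- The list construction of FIG. 47 can be sketched as follows. The spatial derivation itself is not modeled: the left and upper candidates are passed in as arguments, with None standing for an unavailable candidate. The order in which history candidates are visited is an assumption, and the function name is hypothetical.

```python
MAX_MVP_IBC_CANDS = 3  # mvpListL can register three candidates


def build_mvp_list(mv_la, mv_lb, hmvp_ibc_cand_list, max_cands=MAX_MVP_IBC_CANDS):
    """Build mvpListL from spatial, history, and padding candidates.

    mv_la / mv_lb: block vectors of the left / upper neighbors, or None
    when availableFlagLA / availableFlagLB would be 0.
    hmvp_ibc_cand_list: history block vector candidates (assumed order).
    """
    mvp_list = []
    if mv_la is not None:                     # S4801: left neighbor
        mvp_list.append(mv_la)
    if mv_lb is not None and mv_lb != mv_la:  # S4802: upper neighbor, skip duplicate
        mvp_list.append(mv_lb)
    for cand in hmvp_ibc_cand_list:           # history candidates
        if len(mvp_list) >= max_cands:
            break
        mvp_list.append(cand)
    while len(mvp_list) < max_cands:          # S4804: pad with (0, 0)
        mvp_list.append((0, 0))
    return mvp_list
```

Because the list is always padded to its full size, the decoder can resolve any index it receives into a valid candidate.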
- The intra-block copy prediction unit 352 of FIG. 42 includes an IBC space block vector candidate derivation unit 371, an IBC history block vector candidate derivation unit 372, an IBC block vector replenishment unit 373, a reference position correction unit 380, a reference area boundary correction unit 381, an IBC merge candidate selection unit 374, and an IBC prediction mode determination unit 377.
- The intra-block copy prediction unit 362 of FIG. 43 includes an IBC space block vector candidate derivation unit 471, an IBC history block vector candidate derivation unit 472, an IBC block vector replenishment unit 473, an IBC merge candidate selection unit 474, a reference position correction unit 480, a reference area boundary correction unit 481, and a block copy unit 477.
- FIG. 46 is a flowchart explaining the procedure of the merge intra-block copy mode derivation process, which has a function common to the intra-block copy prediction unit 352 of the moving image coding device and the intra-block copy prediction unit 362 of the moving image decoding device according to the embodiment of the present invention.
- The intra-block copy prediction unit 352 and the intra-block copy prediction unit 362 each include a merge intra-block copy candidate list mergeIbcCandList.
- The merge intra-block copy candidate list mergeIbcCandList has a list structure, with a merge index indicating the location inside the list and a storage area that stores, as elements, the merge intra-block copy candidates corresponding to the index.
- Merge index numbers start from 0, and the merge intra-block copy candidates are stored in the storage area of the merge intra-block copy candidate list mergeIbcCandList.
- The merge candidate with merge index i registered in the merge intra-block copy candidate list mergeIbcCandList is represented by mergeIbcCandList[i].
- The merge intra-block copy candidate list mergeIbcCandList can register at least three merge intra-block copy candidates. The variable numCurrMergeIbcCand, which indicates the number of merge intra-block copy candidates registered in mergeIbcCandList, is set to 0.
- Spatial merge candidates are derived from the coding information stored in the coding information storage memory 111 of the moving image coding device or the coding information storage memory 205 of the moving image decoding device, and the derived spatial merge candidates are registered in the merge intra-block copy candidate list mergeIbcCandList (step S4701 in FIG. 46).
- Here, N is defined to indicate one of the spatial merge candidates A and B.
- The flag availableFlagN, which indicates whether the intra-block copy prediction information of block N can be used as the spatial block vector merge candidate N, and the block vector mvL are derived.
- When block vector merge candidates are derived without referring to other coded blocks included in the block that contains the coded block to be processed, spatial block vector merge candidates included in that block are not derived.
- The IBC history block vector candidate derivation unit 372 and the IBC history block vector candidate derivation unit 472 add the history prediction block vector candidates registered in the history prediction block vector candidate list HmvpIbcCandList to the merge intra-block copy candidate list mergeIbcCandList (step S4702 in FIG. 46).
- However, if a block vector already added to mergeIbcCandList and the block vector of a history prediction block vector candidate have the same value, that candidate is not added to mergeIbcCandList.
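- The duplicate check in step S4702 can be sketched as below. The maximum list size is passed as a parameter because the text only states that at least three candidates can be registered; the visiting order of the history list and the function name are assumptions.

```python
def add_history_merge_candidates(merge_ibc_cand_list, hmvp_ibc_cand_list, max_cands):
    """Append history candidates to mergeIbcCandList, skipping duplicates.

    A history candidate whose block vector equals one already in the list
    is not added. The input list is left unmodified.
    """
    result = list(merge_ibc_cand_list)
    for bv in hmvp_ibc_cand_list:
        if len(result) >= max_cands:
            break
        if bv not in result:
            result.append(bv)
    return result
```

For instance, if the list already holds (-16, 0) and the history list starts with (-16, 0), that first history entry is skipped and the following distinct entries are appended.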
- The IBC merge candidate selection unit 374 and the IBC merge candidate selection unit 474 select one of the intra-block merge candidates registered in the merge intra-block copy candidate list mergeIbcCandList (step S4704 in FIG. 46).
- The IBC merge candidate selection unit 374 selects a merge candidate by acquiring the decoded image at the reference position from the decoded image memory 104 and calculating the code amount and the distortion amount, and supplies a merge index indicating the selected intra-block merge candidate to the IBC prediction mode determination unit 377.
- The IBC prediction mode determination unit 377 decides whether or not to use the merge mode by calculating the code amount and the distortion amount, and supplies the result to the prediction method determination unit 105.
- The IBC merge candidate selection unit 474 on the decoding side selects an intra-block merge candidate based on the decoded merge index, and supplies the selected intra-block merge candidate to the reference position correction unit 480.
- The reference position correction unit 380 and the reference position correction unit 480 perform a process of correcting the reference position for the intra-block merge candidate (step S4705 in FIG. 46). Details of the processing of the reference position correction unit 380 and the reference position correction unit 480 will be described later.
- The reference area boundary correction unit 381 and the reference area boundary correction unit 481 perform a process of correcting the reference area boundary for the intra-block merge candidate (step S4706 in FIG. 46). Details of the processing of the reference area boundary correction unit 381 and the reference area boundary correction unit 481 will be described later.
- The block copy unit 477 acquires the decoded image at the reference position from the decoded image memory 208 and supplies it to the decoded image signal superimposing unit 207.
- The block copy unit 477 copies the luminance component and the color difference component.
- The block vector mvL above is the luminance block vector.
- When the color difference format is 4:2:0, the color difference block vector mvC is
- mvC = (mvL >> (3 + 2)) * 32
- This equation is applied to each of the x and y components of mvC.
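- The color difference block vector equation can be checked with a small sketch. The function name is hypothetical; Python's >> on negative integers is an arithmetic (floor) shift, which is assumed here to match the intended behavior.

```python
def chroma_block_vector(mv_l):
    """Apply mvC = (mvL >> (3 + 2)) * 32 to the x and y components."""
    return tuple((c >> (3 + 2)) * 32 for c in mv_l)


print(chroma_block_vector((100, -100)))
```

The shift followed by the multiplication rounds each component down to a multiple of 32.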
- FIG. 48 is a flowchart illustrating the processing of the reference position correction unit 380 and the reference position correction unit 480. Here, it is assumed that the unit of the intra-block copy reference block is the coded tree block (CTU), and that its size is not 128x128 pixels.
- The position of the coded block to be processed is (xCb, yCb), the block vector is (mvL[0], mvL[1]), the width of the coded block to be processed is cbWidth, and its height is cbHeight.
- It is determined whether the size of the CTU is 128x128 pixels (S6002). Since the size is not 128x128 pixels (S6002: NO), the upper left and lower right positions of the referenceable area are calculated (S6003).
- The size of the CTU is represented by CtbLog2SizeY.
- FIG. 49 is a diagram showing how the reference position is corrected.
- In FIG. 49, 6001 indicates the coding tree block to be processed, 6002 indicates the coding block to be processed, and 6003 indicates the referenceable area.
- As in S6001, xRefBR = xRefTL + cbWidth - 1, so xRefBR is also corrected as xRefTL is corrected.
- To correct the reference position, the block vector mvL[0] may be corrected:
- mvL[0] = (xAvlTL - xCb) << 4
- This gives xRefTL = xAvlTL, so the reference position can be corrected.
- Assume that some block vectors in the block vector candidate list constructed by the intra-block copy prediction unit 352 point outside the referenceable area. If the reference position is not corrected, those block vectors cannot be used as candidates for the IBC merge mode because the positions they point to cannot be referenced. In contrast, when the reference position is corrected as in the present invention, all the block vectors of the constructed block vector candidate list point inside the referenceable area. Therefore, all the block vectors can be referenced, and all of them can be candidates for the IBC merge mode. As a result, the IBC merge candidate selection unit 374 can select the optimum prediction mode from the IBC merge mode candidates corresponding to all the block vectors, so that the coding efficiency is improved.
- When the reference position is not corrected, the coding side operates so that a merge index indicating an IBC merge mode that uses those block vectors is not encoded. However, due to a malfunction or the like, such a merge index may be encoded into the bitstream. Alternatively, part of the bitstream may be lost due to packet loss or the like, and the decoding result may become such a merge index.
- In such cases, the decoding result may differ depending on the decoding device, or the decoding process may stop.
- In contrast, when the reference position is corrected as in the present invention, all the block vectors of the constructed block vector candidate list point inside the referenceable area. Therefore, even if such an incomplete bitstream is decoded, the reference position is corrected to the inside of the referenceable area and can be referenced. Correcting the reference position in this way guarantees the memory access range. As a result, the decoding result is the same regardless of the decoding device and the decoding process can continue, so that the robustness of the decoding device is improved.
- The reference position correction targets the luminance block vector.
- The color difference block vector is calculated from the luminance block vector. That is, if the luminance block vector is corrected, the color difference block vector is also corrected, so the reference position does not need to be corrected again for the color difference. Compared with the case where the block vector is not corrected, in which whether reference is possible must be determined for both the luminance and the color difference, the amount of processing can be reduced.
- The corrected block vector is stored in the coding information storage memory 111 or the coding information storage memory 205 as the block vector of the coded block to be processed. That is, the corrected reference position and the position pointed to by the block vector are the same.
- A deblocking filter process may be performed when the decoding result is saved in the decoded image memory. In this filtering process, the strength of the filter is controlled by the difference between the block vectors of the two blocks facing the block boundary. Compared with the case where the block vector is not corrected, in which the corrected reference position and the position pointed to by the block vector differ, the filter strength is more appropriate, so that the coding efficiency can be improved.
- As in S6001, yRefBR = yRefTL + cbHeight - 1, so yRefBR is also corrected as yRefTL is corrected.
- To correct the reference position, the block vector mvL[1] may be corrected:
- mvL[1] = (yAvlTL - yCb) << 4
- This gives yRefTL = yAvlTL, so the reference position can be corrected.
- As in S6001, xRefBR = xRefTL + cbWidth - 1, that is,
- xRefTL = xRefBR - (cbWidth - 1)
- so xRefTL is also corrected as xRefBR is corrected.
- To correct the reference position, the block vector mvL[0] may be corrected.
- As in S6001, yRefBR = yRefTL + cbHeight - 1, that is,
- yRefTL = yRefBR - (cbHeight - 1)
- so yRefTL is also corrected as yRefBR is corrected.
- To correct the reference position, the block vector mvL[1] may be corrected:
- mvL[1] = (yAvlBR - (yCb + cbHeight - 1)) << 4
- This gives yRefBR = yAvlBR, so the reference position can be corrected.
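- The four corrections for a rectangular referenceable area can be sketched together as follows. Several points are assumptions rather than statements of the text: the reference position is taken to be xRefTL = xCb + (mvL[0] >> 4) and yRefTL = yCb + (mvL[1] >> 4), inferred from the << 4 in the correction equations; the equation for correcting mvL[0] at the right edge is written by analogy with the one given for mvL[1] at the lower edge; and the function name and argument layout are hypothetical.

```python
def correct_block_vector(x_cb, y_cb, cb_width, cb_height, mv_l, avl_tl, avl_br):
    """Clamp a block vector so the reference block lies in the referenceable area.

    avl_tl = (xAvlTL, yAvlTL) and avl_br = (xAvlBR, yAvlBR) are the upper-left
    and lower-right positions of a rectangular referenceable area.
    Returns the corrected (mvL[0], mvL[1]).
    """
    mv = list(mv_l)
    x_avl_tl, y_avl_tl = avl_tl
    x_avl_br, y_avl_br = avl_br

    # Left edge: xRefTL < xAvlTL -> mvL[0] = (xAvlTL - xCb) << 4
    if x_cb + (mv[0] >> 4) < x_avl_tl:
        mv[0] = (x_avl_tl - x_cb) << 4
    # Upper edge: yRefTL < yAvlTL -> mvL[1] = (yAvlTL - yCb) << 4
    if y_cb + (mv[1] >> 4) < y_avl_tl:
        mv[1] = (y_avl_tl - y_cb) << 4
    # Right edge: xRefBR > xAvlBR (equation assumed by analogy with the y case)
    if x_cb + (mv[0] >> 4) + cb_width - 1 > x_avl_br:
        mv[0] = (x_avl_br - (x_cb + cb_width - 1)) << 4
    # Lower edge: yRefBR > yAvlBR -> mvL[1] = (yAvlBR - (yCb + cbHeight - 1)) << 4
    if y_cb + (mv[1] >> 4) + cb_height - 1 > y_avl_br:
        mv[1] = (y_avl_br - (y_cb + cb_height - 1)) << 4
    return tuple(mv)
```

Since only the block vector is modified, the reference position derived from the corrected vector and the stored block vector stay consistent, which matches the point made above about storing the corrected block vector.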
- The reference block r1 is located at 6051.
- The reference position in the x direction is corrected in the same way as when the reference block is r2.
- The reference position in the y direction is corrected in the same way as when the reference block is r4.
- As a result, the reference block r1 is located at 6052, which is inside the referenceable area.
- In this way, each reference block is located inside the referenceable area.
- FIG. 50 is a diagram for explaining the upper left and lower right positions when the referenceable area is rectangular.
- the coded tree block 6101 to be processed is divided into four, and the coded block 6102 to be processed is located at the upper left of the division.
- The referenceable area has an inverted L shape, shown as the shaded area 6103.
- FIG. 51 is a flowchart illustrating a process of correcting the reference position in a portion where the referenceable area is not rectangular.
- FIG. 52A is a diagram showing how the reference position is corrected in S6024 and S6025.
- blk_idx = 0.
- The reference block r1 is located at 6201.
- The upper left of the reference block is located in a region smaller than X2 (x direction of 6112) and Y1 (y direction of 6111) (S6022: YES).
- The difference between the reference block and the referenceable area in the x direction is smaller than the difference between the reference block and the referenceable area in the y direction (S6023: YES).
- As in S6001, xRefBR = xRefTL + cbWidth - 1, so xRefBR is also corrected as xRefTL is corrected.
- To correct the reference position, the block vector mvL[0] may be corrected:
- mvL[0] = (xAvlTL + NL - xCb) << 4
- This gives xRefTL = xAvlTL + NL, so the reference position can be corrected.
- As in S6001, yRefBR = yRefTL + cbHeight - 1, so yRefBR is also corrected as yRefTL is corrected.
- To correct the reference position, the block vector mvL[1] may be corrected:
- mvL[1] = (yAvlTL + 64 - yCb) << 4
- This gives yRefTL = yAvlTL + 64, so the reference position can be corrected.
- The reference block r3 is located at 6205.
- The difference between the reference block and the referenceable area in the x direction is smaller than the difference between the reference block and the referenceable area in the y direction (S6023: YES). Therefore, the reference block moves to 6206 when the reference position in the x direction is corrected in the same manner as for the reference block r1 (S6024). At this point, the reference block is still outside the referenceable area. However, the reference position in the y direction is corrected by the processing of S6006 and S6007 described later, so the reference block ends up inside the referenceable area.
- FIG. 52B is a diagram showing how the reference position is corrected in S6029 and S6030.
- As in S6001, xRefBR = xRefTL + cbWidth - 1, that is,
- xRefTL = xRefBR - (cbWidth - 1)
- so xRefTL is also corrected as xRefBR is corrected.
- To correct the reference position, the block vector mvL[0] may be corrected:
- mvL[0] = (xAvlBR - NR - (xCb + cbWidth - 1)) << 4
- This gives xRefBR = xAvlBR - NR, so the reference position can be corrected.
- To correct the reference position, the block vector mvL[1] may be corrected:
- mvL[1] = (yAvlBR - 64 - (yCb + cbHeight - 1)) << 4
- This gives yRefBR = yAvlBR - 64, so the reference position can be corrected.
- The reference block r3 is located at 6215.
- The difference between the reference block and the referenceable area in the x direction is not smaller than the difference between the reference block and the referenceable area in the y direction (S6028: NO). Therefore, the reference block moves to 6216 when the reference position in the y direction is corrected in the same manner as for the reference block r2 (S6030). At this point, the reference block is still outside the referenceable area. However, the reference position in the x direction is corrected by the processing of S6008 and S6009 described later, so the reference block ends up inside the referenceable area.
- When the processing of S6025 is performed, the processing of S6006 and S6007 may be omitted; when the processing of S6029 is performed, the processing of S6008 and S6009 may be omitted; and when the processing of S6030 is performed, the processing of S6010 and S6011 may be omitted.
- Step S6023 may be omitted, and the configuration may be such that step S6024 is always executed or such that step S6025 is always executed.
- Step S6028 may be omitted, and the configuration may be such that step S6029 is always executed or such that step S6030 is always executed.
- In this way, the reference position can be corrected by a simple process.
- The reference position is corrected using the processes of S6012, S6013, and S6004 to S6011. Instead, as shown in FIG. 53, the correction can also be realized by a process (S6101) that decomposes the referenceable area into two and corrects the reference position for each.
- One referenceable area (6301) is designated referenceable area A,
- and the other referenceable area (6302) is designated referenceable area B.
- FIG. 55 is a flowchart illustrating a process (S6101) of decomposing the referenceable area into two and correcting each reference position.
- In S6101, processes that are the same as in FIG. 48 are assigned the same step numbers, and their description is omitted.
- The upper-left and lower-right positions of referenceable area A are calculated (S6111).
- The reference position can be corrected so that the reference can be made. Further, decomposing the referenceable area into two and correcting each reference position simplifies the processing and reduces the amount of calculation.
- One referenceable area (6301) is designated referenceable area A,
- and the other referenceable area (6302) is designated referenceable area B.
- Referenceable areas A and B may be exchanged, so that one referenceable area (6301) is treated as referenceable area B and the other referenceable area (6302) is treated as referenceable area A.
- The processing is switched accordingly. The switch may instead be made according to whether the intra block copy reference block is a unit obtained by dividing the coding tree block into four, or according to whether the size of the CTU is larger than the maximum size of the coding block.
- The bitstream output by the image coding device has a specific data format so that it can be decoded in accordance with the coding method used in the embodiment.
- The bitstream may be recorded and provided on a computer-readable recording medium such as an HDD, SSD, flash memory, or optical disc, or may be provided from a server over a wired or wireless network. The image decoding device corresponding to this image coding device can therefore decode a bitstream of this specific data format regardless of how it is provided.
- The bitstream may be converted into a data format suited to the transmission form of the communication path and then transmitted.
- In that case, a transmitting device and a receiving device are provided. The transmitting device converts the bitstream output by the image coding device into encoded data in a data format suited to the transmission form of the communication path and transmits it to the network. The receiving device receives the encoded data from the network, restores the bitstream, and supplies it to the image decoding device.
- The transmitting device includes a memory that buffers the bitstream output by the image coding device, a packet processing unit that packetizes the bitstream, and a transmitting unit that transmits the packetized encoded data via the network.
- The receiving device includes a receiving unit that receives the packetized encoded data via the network, a memory that buffers the received encoded data, and a packet processing unit that packet-processes the encoded data to generate a bitstream and provides it to the image decoding device.
- A relay device that supplies the encoded data to the receiving device may also be provided.
- The relay device includes a receiving unit that receives the packetized encoded data transmitted by the transmitting device, a memory that buffers the received encoded data, and a transmitting unit that transmits the packetized encoded data to the network. The relay device may further include a received-packet processing unit that packet-processes the packetized encoded data to generate a bitstream, a recording medium that stores the bitstream, and a transmission-packet processing unit that packetizes the bitstream.
- The device may be configured as a display device.
- The display unit reads the decoded image signal generated by the decoded image signal superimposition unit 207 and stored in the decoded image memory 208, and displays it on the screen.
- The device may be configured as an image pickup device.
- The image pickup unit inputs the captured image signal to the block division unit 101.
- FIG. 37 shows an example of the hardware configuration of the coding / decoding device of this embodiment.
- The coding / decoding device includes the configuration of the image coding device and the image decoding device according to the embodiment of the present invention.
- The coding / decoding device 9000 includes a CPU 9001, a codec IC 9002, an I/O interface 9003, a memory 9004, an optical disc drive 9005, a network interface 9006, and a video interface 9009, and the parts are connected by a bus 9010.
- The image coding unit 9007 and the image decoding unit 9008 are typically implemented as the codec IC 9002.
- The image coding process of the image coding device according to the embodiment of the present invention is executed by the image coding unit 9007, and the image decoding process of the image decoding device according to the embodiment of the present invention is executed by the image decoding unit 9008.
- The I/O interface 9003 is realized by, for example, a USB interface, and is connected to an external keyboard 9104, mouse 9105, or the like.
- The CPU 9001 controls the coding / decoding device 9000 so that it executes the operation desired by the user, based on the user operation input via the I/O interface 9003.
- User operations using the keyboard 9104, mouse 9105, and the like include selecting which function to execute (coding or decoding), setting the coding quality, and specifying the bitstream input / output destination and the image input / output destination.
- When the user desires to play back an image recorded on the disc recording medium 9100, the optical disc drive 9005 reads a bitstream from the inserted disc recording medium 9100 and sends the read bitstream via the bus 9010 to the image decoding unit 9008 of the codec IC 9002.
- The image decoding unit 9008 executes the image decoding process of the image decoding device according to the embodiment of the present invention on the input bitstream, and sends the decoded image to the external monitor 9103 via the video interface 9009.
- The coding / decoding device 9000 has a network interface 9006 and can be connected to an external distribution server 9106 or a mobile terminal 9107 via the network 9101.
- In that case, instead of the bitstream being read from the inserted disc recording medium 9100, the network interface 9006 acquires the bitstream from the network 9101. Further, when the user desires to play back an image recorded in the memory 9004, the image decoding process of the image decoding device according to the embodiment of the present invention is executed on the bitstream recorded in the memory 9004.
- The video interface 9009 inputs an image from the camera 9102 and sends it via the bus 9010 to the image coding unit 9007 of the codec IC 9002.
- The image coding unit 9007 executes the image coding process of the image coding device according to the embodiment of the present invention on the image input via the video interface 9009 to create a bitstream. The bitstream is then sent to the memory 9004 via the bus 9010. If the user desires to record the bitstream on the disc recording medium 9100 instead of in the memory 9004, the optical disc drive 9005 writes the bitstream to the inserted disc recording medium 9100.
- Such a hardware configuration is realized, for example, by replacing the codec IC 9002 with an image coding unit 9007 or an image decoding unit 9008, respectively.
- The above processing related to coding and decoding may be realized as a transmission, storage, and reception device using hardware, or may be realized by firmware stored in a ROM (read-only memory), a flash memory, or the like, or by software running on a computer or the like.
- The firmware program and software program may be recorded and provided on a recording medium readable by a computer or the like, may be provided from a server over a wired or wireless network, or may be provided as a data broadcast of terrestrial or satellite digital broadcasting.
- The present invention can be used in image coding and decoding techniques that divide an image into blocks and perform prediction.
- 100 image coding device, 101 block division unit, 102 inter prediction unit, 103 intra prediction unit, 104 decoded image memory, 105 prediction method determination unit, 106 residual generation unit, 107 orthogonal transform / quantization unit, 108 bit string coding unit, 109 inverse quantization / inverse orthogonal transform unit, 110 decoded image signal superimposition unit, 111 coding information storage memory, 200 image decoding device, 201 bit string decoding unit, 202 block division unit, 203 inter prediction unit, 204 intra prediction unit, 205 coding information storage memory, 206 inverse quantization / inverse orthogonal transform unit, 207 decoded image signal superimposition unit, 208 decoded image memory.
Description
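In the embodiment, the image to be coded or decoded is divided evenly into units of a predetermined size. This unit is defined as a tree block. In FIG. 4 the tree block size is 128x128 pixels, but the tree block size is not limited to this and any size may be set. The tree block to be processed (the coding target in the coding process and the decoding target in the decoding process) is switched in raster-scan order, that is, from left to right and from top to bottom. The interior of each tree block can be further divided recursively. A block that is the target of coding or decoding after the tree block has been recursively divided is defined as a coding block. Tree blocks and coding blocks are collectively defined as blocks. Appropriate block division enables efficient coding. The tree block size may be a fixed value agreed in advance between the coding device and the decoding device, or the coding device may determine the tree block size and transmit it to the decoding device. Here, the maximum tree block size is 128x128 pixels and the minimum tree block size is 16x16 pixels. The maximum coding block size is 64x64 pixels and the minimum coding block size is 4x4 pixels.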
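As a rough illustration of the raster-scan order described above, the following Python sketch enumerates tree block origins for a picture. The function name `tree_block_origins` and the example picture size are assumptions made for illustration; only the 128x128 tree block size and the left-to-right, top-to-bottom order come from the text.

```python
def tree_block_origins(pic_width, pic_height, tb_size=128):
    """Yield the (x, y) top-left corner of each tree block in raster-scan order.

    Hypothetical helper: the description only states that tree blocks are
    visited left to right, then top to bottom.
    """
    for y in range(0, pic_height, tb_size):      # top to bottom
        for x in range(0, pic_width, tb_size):   # left to right
            yield (x, y)


# A 256x256 picture contains four 128x128 tree blocks.
order = list(tree_block_origins(256, 256))
```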
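For each coding block to be processed, switching is performed between intra prediction (MODE_INTRA), which predicts from the processed image signal of the picture being processed, and inter prediction (MODE_INTER), which predicts from the image signal of a processed picture.
The term "processed" is used, in the coding process, for a picture, image signal, tree block, block, coding block, or the like obtained by decoding a signal whose coding has been completed, and, in the decoding process, for a picture, image signal, tree block, block, coding block, or the like whose decoding has been completed.
The mode that distinguishes intra prediction (MODE_INTRA) from inter prediction (MODE_INTER) is defined as the prediction mode (PredMode). The prediction mode (PredMode) takes intra prediction (MODE_INTRA) or inter prediction (MODE_INTER) as its value.
Intra block copy prediction is a process that codes or decodes the block to be processed by referring to decoded pixels in the picture being processed as prediction values. The distance from the block to be processed to the referenced pixels is expressed by a block vector. Because a block vector refers to the picture being processed, the reference picture is uniquely determined and no reference index is needed. A block vector differs from a motion vector in whether the referenced picture is the picture being processed or a processed picture. Using adaptive motion vector resolution (AMVR), either 1-pixel or 4-pixel precision can be selected for a block vector.
Intra block copy offers two selectable modes: the predictive intra block copy mode and the merge intra block copy mode.
The predictive intra block copy mode determines the block vector of the block to be processed from a block vector predictor derived from processed information and a block vector difference. The block vector predictor is derived from processed blocks adjacent to the block to be processed and an index for identifying the block vector predictor. The index for identifying the block vector predictor and the block vector difference are transmitted in the bitstream.
The merge intra block copy mode derives the intra block copy prediction information of the block to be processed from the intra block copy prediction information of processed blocks adjacent to it, without transmitting a motion vector difference.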
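In inter prediction, which predicts from the image signal of a processed picture, a plurality of processed pictures can be used as reference pictures. To manage the plurality of reference pictures, two reference lists, L0 (reference list 0) and L1 (reference list 1), are defined, and a reference picture is identified in each by a reference index. In a P slice, L0 prediction (Pred_L0) is available. In a B slice, L0 prediction (Pred_L0), L1 prediction (Pred_L1), and bi-prediction (Pred_BI) are available. L0 prediction (Pred_L0) is inter prediction that refers to a reference picture managed in L0, and L1 prediction (Pred_L1) is inter prediction that refers to a reference picture managed in L1. Bi-prediction (Pred_BI) is inter prediction in which both L0 prediction and L1 prediction are performed, referring to one reference picture managed in each of L0 and L1. The information that identifies L0 prediction, L1 prediction, or bi-prediction is defined as the inter prediction mode. In the processing described below, constants and variables whose names carry the suffix LX are assumed to be processed separately for L0 and L1.
The motion vector predictor mode transmits an index for identifying the motion vector predictor, a motion vector difference, the inter prediction mode, and a reference index, and thereby determines the inter prediction information of the block to be processed. The motion vector predictor is derived from an index for identifying it and from motion vector predictor candidates, which are in turn derived from processed blocks adjacent to the block to be processed or from a block that belongs to a processed picture and is located at or near the same position as the block to be processed.
The merge mode derives the inter prediction information of the block to be processed from the inter prediction information of processed blocks adjacent to it, or of a block that belongs to a processed picture and is located at or near the same position as the block to be processed, without transmitting a motion vector difference or a reference index.
FIG. 11 illustrates the reference blocks referred to in order to derive inter prediction information in the motion vector predictor mode and the merge mode. A0, A1, A2, B0, B1, B2, and B3 are processed blocks adjacent to the block to be processed. T0 is a block that belongs to a processed picture and is located at or near the same position as the block to be processed in the picture being processed.
Affine transform motion compensation divides a coding block into sub-blocks of a predetermined unit and performs motion compensation by determining a motion vector individually for each sub-block. The motion vector of each sub-block is derived on the basis of one or more control points, which are derived from the inter prediction information of processed blocks adjacent to the block to be processed, or of a block that belongs to a processed picture and is located at or near the same position as the block to be processed. In this embodiment the sub-block size is 4x4 pixels, but the sub-block size is not limited to this, and motion vectors may be derived per pixel.
FIG. 15 shows an example of affine transform motion compensation with three control points. In this case, each of the three control points has two parameters, a horizontal component and a vertical component. An affine transform with three control points is therefore called a six-parameter affine transform. CP1, CP2, and CP3 in FIG. 15 are the control points.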
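The syntax for expressing the prediction mode of a coding block is described with reference to FIGS. 12A, 12B, and 13. pred_mode_flag in FIG. 12A is a flag indicating whether the prediction is inter prediction. If pred_mode_flag is 0, inter prediction is used; if pred_mode_flag is 1, intra prediction is used. In the case of intra prediction, pred_mode_ibc_flag, a flag indicating whether the prediction is intra block copy prediction, is sent. In the case of intra block copy prediction (pred_mode_ibc_flag=1), merge_flag is sent. merge_flag is a flag indicating whether the merge intra block copy mode or the predictive intra block copy mode is used. In the merge intra block copy mode (merge_flag=1), the merge index merge_idx is sent. If the prediction is not intra block copy prediction (pred_mode_ibc_flag=0), normal intra prediction is used and the normal intra prediction information intra_pred_mode is sent.
In the case of inter prediction, merge_flag is sent. merge_flag is a flag indicating whether the merge mode or the motion vector predictor mode is used. In the motion vector predictor mode (merge_flag=0), the flag inter_affine_flag, which indicates whether the sub-block motion vector predictor mode is applied, is sent. If the sub-block motion vector predictor mode is applied (inter_affine_flag=1), cu_affine_type_flag is sent. cu_affine_type_flag is a flag for determining the number of control points in the sub-block motion vector predictor mode.
In the merge mode (merge_flag=1), on the other hand, merge_subblock_flag in FIG. 12B is sent. merge_subblock_flag is a flag indicating whether the sub-block merge mode is applied. In the sub-block merge mode (merge_subblock_flag=1), the merge index merge_subblock_idx is sent. If the sub-block merge mode is not used (merge_subblock_flag=0), the flag merge_triangle_flag, which indicates whether the triangle merge mode is applied, is sent. If the triangle merge mode is applied (merge_triangle_flag=1), the block split direction merge_triangle_split_dir and the merge triangle indices merge_triangle_idx0 and merge_triangle_idx1, one for each of the two partitions, are sent. If the triangle merge mode is not applied (merge_triangle_flag=0), the merge index merge_idx is sent.
FIG. 13 shows the values of the inter prediction syntax elements and the corresponding prediction modes. merge_flag=0, inter_affine_flag=0 corresponds to the normal motion vector predictor mode (Inter Pred Mode). merge_flag=0, inter_affine_flag=1 corresponds to the sub-block motion vector predictor mode (Inter Affine Mode). merge_flag=1, merge_subblock_flag=0, merge_triangle_flag=0 corresponds to the normal merge mode (Merge Mode). merge_flag=1, merge_subblock_flag=0, merge_triangle_flag=1 corresponds to the triangle merge mode (Triangle Merge Mode). merge_flag=1, merge_subblock_flag=1 corresponds to the sub-block merge mode (Affine Merge Mode).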
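The flag-to-mode correspondence listed for FIG. 13 can be sketched as a small lookup function. This is a minimal Python illustration, not the decoder's actual parsing logic; the function name `inter_prediction_mode` is hypothetical, and the default of 0 for flags that are not sent is an assumption.

```python
def inter_prediction_mode(merge_flag, inter_affine_flag=0,
                          merge_subblock_flag=0, merge_triangle_flag=0):
    """Return the mode name that the description associates with the flags.

    Flags that are not transmitted for a given branch are assumed to be 0.
    """
    if merge_flag == 0:
        # Motion vector predictor branch: only inter_affine_flag matters.
        return "Inter Affine Mode" if inter_affine_flag == 1 else "Inter Pred Mode"
    if merge_subblock_flag == 1:
        return "Affine Merge Mode"
    # Merge branch without sub-blocks: triangle or normal merge.
    return "Triangle Merge Mode" if merge_triangle_flag == 1 else "Merge Mode"
```

For example, `inter_prediction_mode(1, merge_subblock_flag=0, merge_triangle_flag=1)` yields the triangle merge mode.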
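POC (Picture Order Count) is a variable associated with a picture to be coded, and is set to a value that increases by 1 in accordance with the output order of the pictures. From the POC value it is possible to determine whether two pictures are the same picture, to determine the order of pictures in output order, and to derive the distance between pictures. For example, if two pictures have the same POC value, they can be judged to be the same picture. If two pictures have different POC values, the picture with the smaller POC value can be judged to be the one that is output first, and the difference between the POC values of the two pictures indicates the distance between the pictures along the time axis.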
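The three uses of POC named above can be written as one-line helpers. This is only an illustrative Python sketch; the function names are hypothetical and the POC values are made-up examples.

```python
def same_picture(poc_a, poc_b):
    """Two pictures with equal POC are judged to be the same picture."""
    return poc_a == poc_b


def output_first(poc_a, poc_b):
    """Return the POC of the picture that is output first (the smaller POC)."""
    return min(poc_a, poc_b)


def picture_distance(poc_a, poc_b):
    """Signed POC difference, read as the temporal distance between pictures."""
    return poc_a - poc_b
```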
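The image coding device 100 and the image decoding device 200 according to the first embodiment of the present invention are described below.
The intra prediction method according to the embodiment is carried out in the intra prediction unit 103 of the image coding device 100 in FIG. 1 and in the intra prediction unit 204 of the image decoding device 200 in FIG. 2.
The intra prediction method according to the embodiment is described with reference to the drawings. The intra prediction method is carried out per coding block in both the coding and the decoding processes.
<Description of the intra prediction unit 103 on the coding side>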
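FIG. 40 shows the detailed configuration of the intra prediction unit 103 of the image coding device 100 in FIG. 1. The normal intra prediction unit 351 generates predicted image signals by normal intra prediction from decoded pixels adjacent to the coding block to be processed, selects a suitable intra prediction mode from among a plurality of intra prediction modes, and supplies the selected intra prediction mode and the corresponding predicted image signal to the prediction method determination unit 105. FIGS. 10A and 10B show examples of intra prediction. FIG. 10A shows the correspondence between the prediction directions of normal intra prediction and the intra prediction mode numbers. For example, intra prediction mode 50 generates an intra prediction image by copying pixels in the vertical direction. Intra prediction mode 1 is the DC mode, in which all pixel values of the block to be processed are set to the average value of the reference pixels. Intra prediction mode 0 is the Planar mode, in which a two-dimensional intra prediction image is created from reference pixels in the vertical and horizontal directions. FIG. 10B shows an example of generating an intra prediction image for intra prediction mode 40. For each pixel of the block to be processed, the value of the reference pixel in the direction indicated by the intra prediction mode is copied. When the reference pixel of the intra prediction mode is not at an integer position, the reference pixel value is determined by interpolation from the reference pixel values at the surrounding integer positions.
The intra block copy prediction unit 352 obtains from the decoded image memory 104 the decoded area of the same image signal as the coding block to be processed, generates a predicted image signal by intra block copy processing, and supplies it to the prediction method determination unit 105. The detailed configuration and processing of the intra block copy prediction unit 352 are described later.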
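<Description of the intra prediction unit 204 on the decoding side>
FIG. 41 shows the detailed configuration of the intra prediction unit 204 of the image decoding device 200 in FIG. 2.
The normal intra prediction unit 361 generates predicted image signals by normal intra prediction from decoded pixels adjacent to the coding block to be processed, selects a suitable intra prediction mode from among a plurality of intra prediction modes, and obtains the selected intra prediction mode and the corresponding predicted image signal. This predicted image signal is supplied to the decoded image signal superimposition unit 207 via the switch 364. Because the processing of the normal intra prediction unit 361 in FIG. 41 corresponds to that of the normal intra prediction unit 351 in FIG. 40, a detailed description is omitted.
The intra block copy prediction unit 362 obtains from the decoded image memory 208 the decoded area of the same image signal as the coding block to be processed, and obtains a predicted image signal by intra block copy processing. This predicted image signal is supplied to the decoded image signal superimposition unit 207 via the switch 364. The detailed configuration and processing of the intra block copy prediction unit 362 are described later.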
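The inter prediction method according to the embodiment is carried out in the inter prediction unit 102 of the image coding device in FIG. 1 and in the inter prediction unit 203 of the image decoding device in FIG. 2.
FIG. 16 shows the detailed configuration of the inter prediction unit 102 of the image coding device in FIG. 1. The normal motion vector predictor mode derivation unit 301 derives a plurality of normal motion vector predictor candidates, selects a motion vector predictor, and calculates the motion vector difference between the selected motion vector predictor and the detected motion vector. The detected inter prediction mode, reference index, and motion vector, together with the calculated motion vector difference, constitute the inter prediction information of the normal motion vector predictor mode. This inter prediction information is supplied to the inter prediction mode determination unit 305. The detailed configuration and processing of the normal motion vector predictor mode derivation unit 301 are described later.
FIG. 22 shows the detailed configuration of the inter prediction unit 203 of the image decoding device in FIG. 2.
The normal motion vector predictor mode derivation unit 301 in FIG. 17 includes a spatial motion vector predictor candidate derivation unit 321, a temporal motion vector predictor candidate derivation unit 322, a history-based motion vector predictor candidate derivation unit 323, a motion vector predictor candidate replenishment unit 325, a normal motion vector detection unit 326, a motion vector predictor candidate selection unit 327, and a motion vector subtraction unit 328.
The normal motion vector predictor mode derivation procedure on the coding side is described with reference to FIG. 19. In the description of the procedure in FIG. 19, the word "normal" shown in FIG. 19 may be omitted.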
mvdLX = mvLX - mvpLX
The motion vector difference mvdLX of LX is calculated by the above expression (step S105 in FIG. 19).
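Next, the normal motion vector predictor mode processing procedure on the decoding side is described with reference to FIG. 25. On the decoding side, the spatial motion vector predictor candidate derivation unit 421, the temporal motion vector predictor candidate derivation unit 422, the history-based motion vector predictor candidate derivation unit 423, and the motion vector predictor candidate replenishment unit 425 calculate, for each of L0 and L1, the motion vector used in inter prediction in the normal motion vector predictor mode (steps S201 to S206 in FIG. 25). Specifically, when the prediction mode PredMode of the block to be processed is inter prediction (MODE_INTER) and its inter prediction mode is L0 prediction (Pred_L0), the L0 motion vector predictor candidate list mvpListL0 is calculated, the motion vector predictor mvpL0 is selected, and the L0 motion vector mvL0 is calculated. When the inter prediction mode of the block to be processed is L1 prediction (Pred_L1), the L1 motion vector predictor candidate list mvpListL1 is calculated, the motion vector predictor mvpL1 is selected, and the L1 motion vector mvL1 is calculated. When the inter prediction mode of the block to be processed is bi-prediction (Pred_BI), both L0 prediction and L1 prediction are performed: mvpListL0 is calculated, mvpL0 is selected, and mvL0 is calculated, and likewise mvpListL1 is calculated, mvpL1 is selected, and mvL1 is calculated.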
mvLX = mvpLX + mvdLX
The motion vector mvLX of LX is calculated by the above expression (step S205 in FIG. 25).
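The coding-side subtraction (mvdLX = mvLX - mvpLX) and the decoding-side addition (mvLX = mvpLX + mvdLX) are inverses of each other. A minimal Python sketch of that round trip follows; representing a vector as an `(x, y)` tuple and the function names are assumptions made for illustration.

```python
def mv_difference(mv, mvp):
    """Coding side: mvdLX = mvLX - mvpLX, applied per component."""
    return (mv[0] - mvp[0], mv[1] - mvp[1])


def mv_reconstruct(mvp, mvd):
    """Decoding side: mvLX = mvpLX + mvdLX, applied per component."""
    return (mvp[0] + mvd[0], mvp[1] + mvd[1])


# Example values are arbitrary.
mv, mvp = (37, -12), (32, -16)
mvd = mv_difference(mv, mvp)           # what would be sent in the bitstream
restored = mv_reconstruct(mvp, mvd)    # what the decoder recovers
```

The same pair of operations applies to block vectors (mvdL = mvL - mvpL and mvL = mvpL + mvdL) in the intra block copy case.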
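FIG. 20 is a flowchart showing the normal motion vector predictor mode derivation procedure, a function common to the normal motion vector predictor mode derivation unit 301 of the image coding device and the normal motion vector predictor mode derivation unit 401 of the image decoding device according to the embodiment of the present invention.
The normal merge mode derivation unit 302 in FIG. 18 includes a spatial merge candidate derivation unit 341, a temporal merge candidate derivation unit 342, an average merge candidate derivation unit 344, a history-based merge candidate derivation unit 345, a merge candidate replenishment unit 346, and a merge candidate selection unit 347.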
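When the number of merge candidates numCurrMergeCand registered in the merge candidate list mergeCandList is smaller than the maximum number of merge candidates MaxNumMergeCand, history-based merge candidates are derived and registered in mergeCandList, with MaxNumMergeCand as the upper limit on numCurrMergeCand.
When the number of merge candidates numCurrMergeCand registered in the merge candidate list mergeCandList is smaller than the maximum number of merge candidates MaxNumMergeCand, average merge candidates are derived and registered in mergeCandList, with MaxNumMergeCand as the upper limit on numCurrMergeCand.
Here, an average merge candidate is a new merge candidate whose motion vector is obtained by averaging, separately for L0 prediction and L1 prediction, the motion vectors of the first and second merge candidates registered in the merge candidate list mergeCandList.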
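Next, the method of initializing and updating the history-based motion vector predictor candidate list HmvpCandList, held in the coding information storage memory 111 on the coding side and the coding information storage memory 205 on the decoding side, is described in detail. FIG. 26 is a flowchart illustrating the procedure for initializing and updating the history-based motion vector predictor candidate list.
Next, the method of deriving history-based motion vector predictor candidates from the history-based motion vector predictor candidate list HmvpCandList is described in detail. This is the processing of step S304 in FIG. 20 and is common to the history-based motion vector predictor candidate derivation unit 323 of the normal motion vector predictor mode derivation unit 301 on the coding side and the history-based motion vector predictor candidate derivation unit 423 of the normal motion vector predictor mode derivation unit 401 on the decoding side. FIG. 29 is a flowchart illustrating the history-based motion vector predictor candidate derivation procedure.
Next, the method of deriving history-based merge candidates from the history-based merge candidate list HmvpCandList is described in detail. This is the processing of step S404 in FIG. 21 and is common to the history-based merge candidate derivation unit 345 of the normal merge mode derivation unit 302 on the coding side and the history-based merge candidate derivation unit 445 of the normal merge mode derivation unit 402 on the decoding side. FIG. 30 is a flowchart illustrating the history-based merge candidate derivation procedure.
The derivation of history-based merge candidates is complete when all elements of the history-based motion vector predictor candidate list have been checked or when merge candidates have been added to all elements of the merge candidate list.
Next, the method of deriving average merge candidates is described in detail. This is the processing of step S403 in FIG. 21 and is common to the average merge candidate derivation unit 344 of the normal merge mode derivation unit 302 on the coding side and the average merge candidate derivation unit 444 of the normal merge mode derivation unit 402 on the decoding side. FIG. 38 is a flowchart illustrating the average merge candidate derivation procedure.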
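The motion compensation prediction unit 306 obtains the position and size of the block currently subject to prediction processing in coding. It also obtains inter prediction information from the inter prediction mode determination unit 305. It derives the reference index and motion vector from the obtained inter prediction information and, within the reference picture in the decoded image memory 104 identified by the reference index, obtains the image signal at the position displaced by the motion vector from the position of the image signal of the prediction block, and then generates a prediction signal.
Prediction from a single reference picture is defined as uni-prediction. In uni-prediction, L0 prediction or L1 prediction is performed, using one of the two reference pictures registered in reference lists L0 and L1.
In the embodiment of the present invention, to improve the accuracy of motion-compensated prediction, the optimum reference picture can be selected from among a plurality of reference pictures in motion-compensated prediction. For this purpose, the reference picture used in motion-compensated prediction is identified by a reference index, and the reference index is coded into the bitstream together with the motion vector difference.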
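As also shown for the inter prediction unit 102 on the coding side in FIG. 16, when the inter prediction mode determination unit 305 selects the inter prediction information from the normal motion vector predictor mode derivation unit 301, the motion compensation prediction unit 306 obtains this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion-compensated prediction signal. The generated motion-compensated prediction signal is supplied to the prediction method determination unit 105.
As also shown for the inter prediction unit 102 on the coding side in FIG. 16, when the inter prediction mode determination unit 305 selects the inter prediction information from the normal merge mode derivation unit 302, the motion compensation prediction unit 306 obtains this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion-compensated prediction signal. The generated motion-compensated prediction signal is supplied to the prediction method determination unit 105.
As also shown for the inter prediction unit 102 on the coding side in FIG. 16, when the inter prediction mode determination unit 305 selects the inter prediction information from the sub-block motion vector predictor mode derivation unit 303, the motion compensation prediction unit 306 obtains this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion-compensated prediction signal. The generated motion-compensated prediction signal is supplied to the prediction method determination unit 105.
As also shown for the inter prediction unit 102 on the coding side in FIG. 16, when the inter prediction mode determination unit 305 selects the inter prediction information from the sub-block merge mode derivation unit 304, the motion compensation prediction unit 306 obtains this inter prediction information from the inter prediction mode determination unit 305, derives the inter prediction mode, reference index, and motion vector of the block currently being processed, and generates a motion-compensated prediction signal. The generated motion-compensated prediction signal is supplied to the prediction method determination unit 105.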
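In the normal motion vector predictor mode and the normal merge mode, motion compensation based on an affine model can be used on the basis of the following flags. The flags reflect the inter prediction conditions determined by the inter prediction mode determination unit 305 in the coding process and are coded into the bitstream. In the decoding process, whether to perform motion compensation based on the affine model is determined from these flags in the bitstream.
The valid reference area for intra block copy is described with reference to FIG. 39. FIG. 39A shows an example in which the valid reference area is determined using the coding tree block as the intra block copy reference block. 500, 501, 502, 503, and 504 in FIG. 39A are coding tree blocks, and 504 is the coding tree block being processed. 505 is the coding block being processed. The coding tree blocks are processed in the order 500, 501, 502, 503, 504. In this case, the three coding tree blocks 501, 502, and 503, processed immediately before coding tree block 504 containing coding block 505, form the valid reference area for coding block 505. Coding tree blocks processed before coding tree block 501 are treated as an invalid reference area, as is the whole of coding tree block 504 containing coding block 505, regardless of whether processing of a given part was completed before coding block 505.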
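The FIG. 39A rule (the three coding tree blocks processed immediately before the current one are valid; everything earlier, and the current coding tree block itself, is invalid) can be sketched as a slice over the processing order. This is an illustrative Python sketch; the function name and the use of the reference numerals 500 to 504 as list elements are assumptions made for the example.

```python
def valid_reference_ctbs(processing_order, current, count=3):
    """Return the coding tree blocks that form the valid reference area.

    processing_order: coding tree blocks in the order they are processed.
    current: the coding tree block containing the block being processed.
    count: number of immediately preceding blocks that are valid (3 here).
    """
    i = processing_order.index(current)
    # The current block itself is excluded; only the `count` predecessors remain.
    return processing_order[max(0, i - count):i]


area = valid_reference_ctbs([500, 501, 502, 503, 504], 504)
```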
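The predictive intra block copy processing procedure on the coding side is described with reference to FIG. 44.
First, the block vector detection unit 375 detects the block vector mvL (step S4500 in FIG. 44). Next, the IBC spatial block vector candidate derivation unit 371, the IBC history-based block vector predictor candidate derivation unit 372, the IBC block vector predictor candidate replenishment unit 373, the IBC block vector predictor candidate selection unit 376, and the block vector subtraction unit 378 calculate the block vector difference of the block vector used in the block vector predictor mode (steps S4501 to S4503 in FIG. 44).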
mvdL = mvL - mvpL
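The block vector difference mvdL is calculated by the above expression (step S4503 in FIG. 44).
Next, the block vector predictor mode processing procedure on the decoding side is described with reference to FIG. 45. On the decoding side, the IBC spatial block vector predictor candidate derivation unit 471, the IBC history-based block vector candidate derivation unit 472, and the IBC block vector predictor replenishment unit 473 calculate the block vector used in the block vector predictor mode (steps S4600 to S4602 in FIG. 45). Specifically, the block vector predictor candidate list mvpListL is calculated, the block vector predictor mvpL is selected, and the block vector mvL is calculated.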
mvL = mvpL + mvdL
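The block vector mvL is calculated by the above expression (step S4602 in FIG. 45).
FIG. 47 is a flowchart showing the predictive intra block copy mode derivation procedure, a function common to the intra block copy prediction unit 352 of the moving picture coding device and the intra block copy prediction unit 362 of the moving picture decoding device according to the embodiment of the present invention.
The intra block copy prediction unit 352 in FIG. 42 includes an IBC spatial block vector candidate derivation unit 371, an IBC history-based block vector candidate derivation unit 372, an IBC block vector replenishment unit 373, a reference position correction unit 380, a reference area boundary correction unit 381, an IBC merge candidate selection unit 374, and an IBC prediction mode determination unit 377.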
mvC = ( ( mvL >> ( 3 + 2 ) ) * 32 )
is obtained. The above expression is applied to each of the x and y components of mvC.
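The arithmetic of the mvC expression can be checked with a short Python sketch. The surrounding text shown here does not say what mvC is used for, so this only illustrates the computation; the function name is hypothetical. Python's `>>` on negative integers is an arithmetic (flooring) shift, which is assumed to match the intended behavior.

```python
def derive_mvc(mvL):
    """Apply mvC = (mvL >> (3 + 2)) * 32 to each of the x and y components.

    The shift discards the low 5 bits, so each component becomes a multiple
    of 32, rounded toward negative infinity.
    """
    return tuple((component >> (3 + 2)) * 32 for component in mvL)
```

With `mvL = (100, -100)` the x component is rounded down to 96 and the y component down to -128.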
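FIG. 48 is a flowchart illustrating the processing of the reference position correction unit 380 and the reference position correction unit 480. Assume here that the unit of the intra block copy reference block is the coding tree block (CTU) and that its size is not 128x128 pixels.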
( xRefTL, yRefTL ) = ( xCb + ( mvL[ 0 ] >> 4 ), yCb + ( mvL[ 1 ] >> 4 ) )
( xRefBR, yRefBR ) = ( xRefTL + cbWidth - 1, yRefTL + cbHeight - 1 )
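The positions are given by the above expressions. Here, (xCb, yCb) is the position of the coding block being processed, (mvL[0], mvL[1]) is the block vector, cbWidth is the width of the coding block being processed, and cbHeight is its height.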
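The two expressions for (xRefTL, yRefTL) and (xRefBR, yRefBR) translate directly into code. The following Python sketch assumes integer inputs and relies on Python's flooring `>>` for negative block vector components; the function name is hypothetical.

```python
def reference_block_corners(xCb, yCb, mvL, cbWidth, cbHeight):
    """Return ((xRefTL, yRefTL), (xRefBR, yRefBR)) for a block vector mvL.

    The block vector is shifted right by 4 before being added to the
    position of the coding block, as in the expressions above.
    """
    xRefTL = xCb + (mvL[0] >> 4)
    yRefTL = yCb + (mvL[1] >> 4)
    xRefBR = xRefTL + cbWidth - 1
    yRefBR = yRefTL + cbHeight - 1
    return (xRefTL, yRefTL), (xRefBR, yRefBR)


# A 16x8 block at (64, 32) with a block vector of -32 pixels horizontally.
corners = reference_block_corners(64, 32, (-32 << 4, 0), 16, 8)
```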
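Next, it is determined whether the CTU size is 128x128 pixels (S6002). Since the size is not 128x128 pixels here (S6002: NO), the upper-left and lower-right positions of the referenceable area are calculated (S6003). With the upper left of the referenceable area denoted ( xAvlTL, yAvlTL ) and the lower right denoted ( xAvlBR, yAvlBR ),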
NL = Min( 1, 7 - CtbLog2SizeY ) - ( 1 << ((7 - CtbLog2SizeY) << 1) )
( xAvlTL, yAvlTL ) = ( ((xCb >> CtbLog2SizeY) + NL) << CtbLog2SizeY,
(yCb >> CtbLog2SizeY) << CtbLog2SizeY )
( xAvlBR, yAvlBR ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY) - 1,
(((yCb >> CtbLog2SizeY) + 1) << CtbLog2SizeY) - 1 )
are obtained. Here, the CTU size is represented by CtbLog2SizeY.
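The S6003 expressions can be tried out with the Python sketch below. It assumes that CtbLog2SizeY is the base-2 logarithm of the CTU width (it is used as a shift amount above) and does not clip against picture boundaries, which the expressions shown here do not address. The function name is hypothetical.

```python
def referenceable_area(xCb, yCb, CtbLog2SizeY):
    """Return ((xAvlTL, yAvlTL), (xAvlBR, yAvlBR)) for a CTU that is not 128x128."""
    # Number of CTUs to the left that are referenceable, as a negative offset.
    NL = min(1, 7 - CtbLog2SizeY) - (1 << ((7 - CtbLog2SizeY) << 1))
    ctb_x = xCb >> CtbLog2SizeY   # CTU column of the current block
    ctb_y = yCb >> CtbLog2SizeY   # CTU row of the current block
    top_left = ((ctb_x + NL) << CtbLog2SizeY, ctb_y << CtbLog2SizeY)
    bottom_right = ((ctb_x << CtbLog2SizeY) - 1,
                    ((ctb_y + 1) << CtbLog2SizeY) - 1)
    return top_left, bottom_right
```

For a 64x64 CTU (CtbLog2SizeY = 6), NL evaluates to -3, so the area spans the three CTUs to the left of the current one within the same CTU row.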
mvL[0] = (xAvlTL - xCb) << 4
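The block vector is corrected as above. This gives xRefTL = xAvlTL, so the reference position is corrected.
mvL[1] = (yAvlTL - yCb) << 4
The block vector is corrected as above. This gives yRefTL = yAvlTL, so the reference position is corrected.
mvL[0] = (xAvlBR - (xCb + cbWidth - 1)) << 4
The block vector is corrected as above. This gives xRefBR = xAvlBR, so the reference position is corrected.
mvL[1] = (yAvlBR - (yCb + cbHeight - 1)) << 4
The block vector is corrected as above. This gives yRefBR = yAvlBR, so the reference position is corrected.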
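The four corrections above can be combined into one clamping routine. The conditions under which each correction is applied are not part of the text shown here, so the sketch below assumes that a correction is applied whenever the corresponding corner of the reference block lies outside the referenceable area, and that the upper-left checks run before the lower-right checks. The function name is hypothetical.

```python
def correct_block_vector(xCb, yCb, cbWidth, cbHeight, mvL, avl_tl, avl_br):
    """Clamp mvL so that the reference block lies inside the referenceable area.

    avl_tl = (xAvlTL, yAvlTL), avl_br = (xAvlBR, yAvlBR).
    Assumed conditions: each component is corrected only when the matching
    corner is outside the area.
    """
    mv = list(mvL)
    xAvlTL, yAvlTL = avl_tl
    xAvlBR, yAvlBR = avl_br
    if xCb + (mv[0] >> 4) < xAvlTL:                      # left edge
        mv[0] = (xAvlTL - xCb) << 4
    if yCb + (mv[1] >> 4) < yAvlTL:                      # top edge
        mv[1] = (yAvlTL - yCb) << 4
    if xCb + (mv[0] >> 4) + cbWidth - 1 > xAvlBR:        # right edge
        mv[0] = (xAvlBR - (xCb + cbWidth - 1)) << 4
    if yCb + (mv[1] >> 4) + cbHeight - 1 > yAvlBR:       # bottom edge
        mv[1] = (yAvlBR - (yCb + cbHeight - 1)) << 4
    return tuple(mv)
```

For a 16x16 block at (200, 70) and an area from (0, 64) to (191, 127), a zero block vector is pulled left to (-384, 0), so the lower-right corner of the reference block lands on x = 191.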
offset[4] = {0, 64, 128, 128}
NL = -offset[3 - blk_idx], NR = offset[blk_idx]
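( xAvlTL, yAvlTL ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY) + NL,
(yCb >> CtbLog2SizeY) << CtbLog2SizeY )
( xAvlBR, yAvlBR ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY) - 1 + NR,
(((yCb >> CtbLog2SizeY) + 1) << CtbLog2SizeY) - 1 )
are obtained. Here, blk_idx is an index indicating the position of the coding block being processed. Of the four parts into which the coding tree block being processed is divided, blk_idx = 0 when the coding block being processed is located in the upper left. Similarly, blk_idx is 1, 2, or 3 when the coding block being processed is located in the upper right, lower left, or lower right, respectively. FIG. 50A shows the case blk_idx = 0. Similarly, FIGS. 50B to 50D show the cases blk_idx = 1 to 3, respectively.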
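The offset table and blk_idx convention can be exercised with the Python sketch below. It assumes a 128x128 CTU (CtbLog2SizeY = 7) split into four 64x64 quadrants, and it adds NL and NR after the shifts, which is the reading assumed here for the expressions above. The helper `quadrant_index` is hypothetical; the text only defines the meaning of blk_idx, not how it is computed.

```python
OFFSET = (0, 64, 128, 128)


def quadrant_index(xCb, yCb):
    """Hypothetical helper: 0 upper left, 1 upper right, 2 lower left, 3 lower right."""
    right = (xCb & 127) >= 64
    lower = (yCb & 127) >= 64
    return 2 * int(lower) + int(right)


def referenceable_area_128(xCb, yCb, blk_idx, CtbLog2SizeY=7):
    """Return ((xAvlTL, yAvlTL), (xAvlBR, yAvlBR)) using NL and NR from OFFSET."""
    NL = -OFFSET[3 - blk_idx]
    NR = OFFSET[blk_idx]
    ctb_x0 = (xCb >> CtbLog2SizeY) << CtbLog2SizeY   # left edge of current CTU
    ctb_y0 = (yCb >> CtbLog2SizeY) << CtbLog2SizeY   # top edge of current CTU
    top_left = (ctb_x0 + NL, ctb_y0)
    bottom_right = (ctb_x0 - 1 + NR,
                    (((yCb >> CtbLog2SizeY) + 1) << CtbLog2SizeY) - 1)
    return top_left, bottom_right
```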
offset[4] = {64, 128, 64, 0}, NL = offset[blk_idx]
(X1, Y1) = (xAvlTL, yAvlTL + 64)
(X2, Y2) = (xAvlTL + NL, yAvlTL)
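are obtained.
mvL[0] = (xAvlTL + NL - xCb) << 4
The block vector is corrected as above. This gives xRefTL = xAvlTL + NL, so the reference position is corrected.
mvL[1] = (yAvlTL + 64 - yCb) << 4
The block vector is corrected as above. This gives yRefTL = yAvlTL + 64, so the reference position is corrected.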
offset[4] = {0, 64, 128, 64}, NR = offset[blk_idx]
(X3, Y3) = (xAvlBR, yAvlBR - 64)
(X4, Y4) = (xAvlBR - NR, yAvlBR)
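are obtained.
mvL[0] = (xAvlBR - NR - (xCb + cbWidth - 1)) << 4
The block vector is corrected as above. This gives xRefBR = xAvlBR, so the reference position is corrected.
mvL[1] = (yAvlBR - 64 - (yCb + cbHeight - 1)) << 4
The block vector is corrected as above. This gives yRefBR = yAvlBR, so the reference position is corrected.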
xOffsetTL[4] = {-128, -128, -64, 0}, yOffsetTL[4] = {64, 64, 64, 0}
xOffsetBR[4] = {0, 0, 0, 128}, yOffsetBR[4] = {128, 128, 128, 64}
( xAvlTL, yAvlTL ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY)
+ xOffsetTL[blk_idx],
((yCb >> CtbLog2SizeY) << CtbLog2SizeY) + yOffsetTL[blk_idx] )
( xAvlBR, yAvlBR ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY) - 1
+ xOffsetBR[blk_idx],
(((yCb >> CtbLog2SizeY) + 1) << CtbLog2SizeY) - 1 + yOffsetBR[blk_idx] )
are obtained.
out_xRefTL = xRefTL < xAvlTL
out_yRefTL = yRefTL < yAvlTL
out_xRefBR = xRefBR > xAvlBR
out_yRefBR = yRefBR > yAvlBR
are calculated as above (S6112).
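The four out-of-area flags of S6112 are plain comparisons between the reference block corners and the corners of the referenceable area. A minimal Python sketch follows; returning the flags as a dictionary and the function name are choices made for illustration.

```python
def outside_flags(ref_tl, ref_br, avl_tl, avl_br):
    """Compute out_xRefTL, out_yRefTL, out_xRefBR, out_yRefBR.

    ref_tl = (xRefTL, yRefTL), ref_br = (xRefBR, yRefBR),
    avl_tl = (xAvlTL, yAvlTL), avl_br = (xAvlBR, yAvlBR).
    """
    return {
        "out_xRefTL": ref_tl[0] < avl_tl[0],   # left of the area
        "out_yRefTL": ref_tl[1] < avl_tl[1],   # above the area
        "out_xRefBR": ref_br[0] > avl_br[0],   # right of the area
        "out_yRefBR": ref_br[1] > avl_br[1],   # below the area
    }


flags = outside_flags((-5, 70), (10, 85), (0, 64), (191, 127))
```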
xOffsetTL[4] = {-64, 0, 0, 0}, yOffsetTL[4] = {0, 0, 0, 0}
xOffsetBR[4] = {0, 64, 128, 64}, yOffsetBR[4] = {128, 64, 64, 128}
( xAvlTL, yAvlTL ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY)
+ xOffsetTL[blk_idx],
((yCb >> CtbLog2SizeY) << CtbLog2SizeY) + yOffsetTL[blk_idx] )
( xAvlBR, yAvlBR ) = ( ((xCb >> CtbLog2SizeY) << CtbLog2SizeY) - 1
+ xOffsetBR[blk_idx],
(((yCb >> CtbLog2SizeY) + 1) << CtbLog2SizeY) - 1 + yOffsetBR[blk_idx] )
are obtained.
Claims (6)
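- An image coding device comprising: a block vector candidate derivation unit that derives block vector candidates for a block to be processed in a picture to be processed from coding information stored in a coding information storage memory;
a selection unit that selects a selected block vector from the block vector candidates; and
a reference position correction unit that, for a reference block referred to by the selected block vector, corrects the reference position of the reference block so that the inside of a referenceable area is referred to,
wherein decoded pixels in the picture to be processed are obtained from a decoded image memory as prediction values of the block to be processed, on the basis of the reference position of the reference block.
- An image coding method comprising: a block vector candidate derivation step of deriving block vector candidates for a block to be processed in a picture to be processed from coding information stored in a coding information storage memory;
a selection step of selecting a selected block vector from the block vector candidates; and
a reference position correction step of, for a reference block referred to by the selected block vector, correcting the reference position of the reference block so that the inside of a referenceable area is referred to,
wherein decoded pixels in the picture to be processed are obtained from a decoded image memory as prediction values of the block to be processed, on the basis of the reference position of the reference block.
- An image coding program comprising: a block vector candidate derivation step of deriving block vector candidates for a block to be processed in a picture to be processed from coding information stored in a coding information storage memory;
a selection step of selecting a selected block vector from the block vector candidates; and
a reference position correction step of, for a reference block referred to by the selected block vector, correcting the reference position of the reference block so that the inside of a referenceable area is referred to,
wherein decoded pixels in the picture to be processed are obtained from a decoded image memory as prediction values of the block to be processed, on the basis of the reference position of the reference block.
- An image decoding device comprising: a block vector candidate derivation unit that derives block vector candidates for a block to be processed in a picture to be processed from coding information stored in a coding information storage memory;
a selection unit that selects a selected block vector from the block vector candidates; and
a reference position correction unit that, for a reference block referred to by the selected block vector, corrects the reference position of the reference block so that the inside of a referenceable area is referred to,
wherein decoded pixels in the picture to be processed are obtained from a decoded image memory as prediction values of the block to be processed, on the basis of the reference position of the reference block.
- An image decoding method comprising: a block vector candidate derivation step of deriving block vector candidates for a block to be processed in a picture to be processed from coding information stored in a coding information storage memory;
a selection step of selecting a selected block vector from the block vector candidates; and
a reference position correction step of, for a reference block referred to by the selected block vector, correcting the reference position of the reference block so that the inside of a referenceable area is referred to,
wherein decoded pixels in the picture to be processed are obtained from a decoded image memory as prediction values of the block to be processed, on the basis of the reference position of the reference block.
- An image decoding program comprising: a block vector candidate derivation step of deriving block vector candidates for a block to be processed in a picture to be processed from coding information stored in a coding information storage memory;
a selection step of selecting a selected block vector from the block vector candidates; and
a reference position correction step of, for a reference block referred to by the selected block vector, correcting the reference position of the reference block so that the inside of a referenceable area is referred to,
wherein decoded pixels in the picture to be processed are obtained from a decoded image memory as prediction values of the block to be processed, on the basis of the reference position of the reference block.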
Priority Applications (13)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2021526917A JP6958768B2 (ja) | 2019-06-20 | 2020-06-19 | 動画像符号化装置、動画像符号化方法、及び動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム |
CN202410940782.0A CN118694969A (zh) | 2019-06-20 | 2020-06-19 | 图像编码装置和方法、以及图像解码装置和方法 |
MX2021015089A MX2021015089A (es) | 2019-06-20 | 2020-06-19 | Dispositivo de codificacion de video, metodo de codificacion de video, programa de codificacion de video, dispositivo de decodificacion de video, metodo de decodificacion de video y programa de decodificacion de video. |
CN202410940728.6A CN118694968A (zh) | 2019-06-20 | 2020-06-19 | 图像编码装置和方法、以及图像解码装置和方法 |
CN202080031476.5A CN113767622B (zh) | 2019-06-20 | 2020-06-19 | 动图像编码装置和方法、以及动图像解码装置和方法 |
CN202410940888.0A CN118694970A (zh) | 2019-06-20 | 2020-06-19 | 图像编码装置和方法、以及图像解码装置和方法 |
KR1020217030959A KR20210128482A (ko) | 2019-06-20 | 2020-06-19 | 동화상 부호화 장치, 동화상 부호화 방법 및, 동화상 부호화 프로그램, 동화상 복호 장치, 동화상 복호 방법 및 동화상 복호 프로그램 |
EP20826811.0A EP3989545A4 (en) | 2019-06-20 | 2020-06-19 | VIDEO CODING DEVICE, VIDEO CODING METHOD, VIDEO CODING PROGRAM, VIDEO DECODING DEVICE, VIDEO DECODING METHOD AND VIDEO DECODING PROGRAM |
BR112021025852-3A BR112021025852B1 (pt) | 2019-06-20 | 2020-06-19 | Dispositivo de codificação de imagem, método de codificação de imagem, dispositivo de decodificação de imagem e método de decodificação de imagem |
CN202211171879.7A CN115442594A (zh) | 2019-06-20 | 2020-06-19 | 动图像编码装置和方法、以及动图像解码装置和方法 |
JP2021165349A JP7287433B2 (ja) | 2019-06-20 | 2021-10-07 | 動画像符号化装置、動画像符号化方法、及び動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム |
US17/552,389 US11936849B2 (en) | 2019-06-20 | 2021-12-16 | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program |
US18/435,795 US20240179300A1 (en) | 2019-06-20 | 2024-02-07 | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2019-114395 | 2019-06-20 | ||
JP2019114395 | 2019-06-20 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/552,389 Continuation US11936849B2 (en) | 2019-06-20 | 2021-12-16 | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020256102A1 true WO2020256102A1 (ja) | 2020-12-24 |
Family
ID=74040853
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2020/024137 WO2020256102A1 (ja) | 2019-06-20 | 2020-06-19 | 動画像符号化装置、動画像符号化方法、及び動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム |
Country Status (8)
Country | Link |
---|---|
US (2) | US11936849B2 (ja) |
EP (1) | EP3989545A4 (ja) |
JP (4) | JP6958768B2 (ja) |
KR (1) | KR20210128482A (ja) |
CN (5) | CN118694968A (ja) |
BR (1) | BR112021025852B1 (ja) |
MX (1) | MX2021015089A (ja) |
WO (1) | WO2020256102A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPWO2020262503A1 (ja) * | 2019-06-25 | 2020-12-30 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118694968A (zh) * | 2019-06-20 | 2024-09-24 | Jvc建伍株式会社 | 图像编码装置和方法、以及图像解码装置和方法 |
JP7063416B2 (ja) * | 2019-06-21 | 2022-05-09 | 株式会社Jvcケンウッド | 動画像符号化装置、動画像符号化方法、及び動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム |
CN116805968A (zh) * | 2022-03-16 | 2023-09-26 | 腾讯科技(深圳)有限公司 | 视频编解码方法、装置、计算机可读介质及电子设备 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2009246975A (ja) | 2008-03-28 | 2009-10-22 | Samsung Electronics Co Ltd | 画像のイントラ予測モード情報の符号化及び/又は復号化方法及び装置 |
JP2017130938A (ja) * | 2014-07-07 | 2017-07-27 | 寰發股▲ふん▼有限公司HFI Innovation Inc. | イントラブロックコピー検索と補償範囲の方法 |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101605220B1 (ko) * | 2009-01-15 | 2016-03-21 | 르네사스 일렉트로닉스 가부시키가이샤 | 화상 처리 장치, 디코드 방법, 프레임 내 복호 장치, 프레임 내 복호 방법, 및 프레임 내 부호화 장치 |
JP5857244B2 (ja) * | 2011-03-07 | 2016-02-10 | パナソニックIpマネジメント株式会社 | 動き補償装置、動画像符号化装置、動画像復号装置、動き補償方法、プログラム、及び集積回路 |
HUE066386T2 (hu) * | 2011-05-31 | 2024-07-28 | Jvckenwood Corp | Mozgókép-kódoló eszköz, mozgókép-kódoló eljárás és mozgókép-kódoló program, valamint mozgókép-dekódoló eszköz, mozgókép-dekódoló eljárás és mozgókép-dekódoló program |
JP5821542B2 (ja) * | 2011-11-07 | 2015-11-24 | 富士通株式会社 | 動画像符号化装置、および動画像復号装置 |
JP2013102260A (ja) * | 2011-11-07 | 2013-05-23 | Jvc Kenwood Corp | 動画像復号装置、動画像復号方法及び動画像復号プログラム |
AU2013228045A1 (en) * | 2013-09-13 | 2015-04-02 | Canon Kabushiki Kaisha | Method, apparatus and system for encoding and decoding video data |
JP2015173404A (ja) * | 2014-03-12 | 2015-10-01 | 富士通株式会社 | 動画像符号化装置、動画像符号化方法及び動画像符号化用コンピュータプログラム |
US9756354B2 (en) | 2014-03-17 | 2017-09-05 | Qualcomm Incorporated | Block vector predictor for intra block copying |
WO2015169200A1 (en) | 2014-05-06 | 2015-11-12 | Mediatek Singapore Pte. Ltd. | Method of block vector prediction for intra block copy mode coding |
US9877043B2 (en) * | 2014-06-19 | 2018-01-23 | Vid Scale Inc. | Methods and systems for intra block copy coding with block vector derivation |
WO2016051362A1 (en) | 2014-10-03 | 2016-04-07 | Nokia Technologies Oy | Method and equipment for encoding and decoding an intra block copy vector |
US9918105B2 (en) * | 2014-10-07 | 2018-03-13 | Qualcomm Incorporated | Intra BC and inter unification |
CN106797475B (zh) * | 2014-10-08 | 2019-12-10 | 联发科技股份有限公司 | Video coding method |
US9854237B2 (en) * | 2014-10-14 | 2017-12-26 | Qualcomm Incorporated | AMVP and merge candidate list derivation for intra BC and inter prediction unification |
US10200713B2 (en) * | 2015-05-11 | 2019-02-05 | Qualcomm Incorporated | Search region determination for inter coding within a particular picture of video data |
US20190068967A1 (en) * | 2016-02-03 | 2019-02-28 | Sharp Kabushiki Kaisha | Predicted-image generation device, video decoding device, and video coding device |
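WO2018097078A1 (ja) * | 2016-11-22 | 2018-05-31 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | Encoding device, decoding device, encoding method, and decoding method |
TWI692690B (zh) * | 2017-12-05 | 2020-05-01 | 慧榮科技股份有限公司 | Method for accessing a flash memory module, and associated flash memory controller and electronic device |
CN118694968A (zh) * | 2019-06-20 | 2024-09-24 | Jvc建伍株式会社 | Image encoding device and method, and image decoding device and method |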
2020
- 2020-06-19: CN, application CN202410940728.6A, publication CN118694968A (zh), status: Active (Pending)
- 2020-06-19: MX, application MX2021015089A, publication MX2021015089A (es), status: Unknown
- 2020-06-19: EP, application EP20826811.0A, publication EP3989545A4 (en), status: Active (Pending)
- 2020-06-19: CN, application CN202080031476.5A, publication CN113767622B (zh), status: Active
- 2020-06-19: BR, application BR112021025852-3A, publication BR112021025852B1 (pt), status: Active (IP Right Grant)
- 2020-06-19: CN, application CN202410940782.0A, publication CN118694969A (zh), status: Active (Pending)
- 2020-06-19: CN, application CN202211171879.7A, publication CN115442594A (zh), status: Active (Pending)
- 2020-06-19: WO, application PCT/JP2020/024137, publication WO2020256102A1 (ja), status: Active (Application Filing)
- 2020-06-19: CN, application CN202410940888.0A, publication CN118694970A (zh), status: Active (Pending)
- 2020-06-19: KR, application KR1020217030959A, publication KR20210128482A (ko), status: Not active (Application Discontinuation)
- 2020-06-19: JP, application JP2021526917A, publication JP6958768B2 (ja), status: Active
2021
- 2021-10-07: JP, application JP2021165349A, publication JP7287433B2 (ja), status: Active
- 2021-12-16: US, application US17/552,389, publication US11936849B2 (en), status: Active
2023
- 2023-05-25: JP, application JP2023086339A, publication JP7521650B2 (ja), status: Active
2024
- 2024-02-07: US, application US18/435,795, publication US20240179300A1 (en), status: Active (Pending)
- 2024-02-29: JP, application JP2024029842A, publication JP2024051142A (ja), status: Active (Pending)
Non-Patent Citations (2)
Title |
---|
PANG, CHAO ET AL.: "Non-CE2: Zero merging candidates derivation for Intra BC/Inter signalling unification", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG 16 WP 3 AND ISO/IEC JTC 1/SC 29/WG 11 20TH MEETING, 10 February 2015 (2015-02-10), Geneva, CH, pages 1 - 5, XP030241323 * |
XU, JUN ET AL.: "On intra block copying in RExt", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG 16 WP 3 AND ISO/IEC JTC 1/SC 29/WG 11 15TH MEETING, 24 October 2013 (2013-10-24), Geneva, CH, pages 1 - 12, XP030238679 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPWO2020262503A1 (ja) * | 2019-06-25 | 2020-12-30 | ||
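JP7164035B2 (ja) | 2019-06-25 | 2022-11-01 | 株式会社Jvcケンウッド | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program |
JP2022191434A (ja) * | 2019-06-25 | 2022-12-27 | 株式会社Jvcケンウッド | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program |
JP7375890B2 (ja) | 2019-06-25 | 2023-11-08 | 株式会社Jvcケンウッド | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program |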
Also Published As
Publication number | Publication date |
---|---|
US11936849B2 (en) | 2024-03-19 |
JP7287433B2 (ja) | 2023-06-06 |
BR112021025852A2 (pt) | 2022-02-08 |
MX2021015089A (es) | 2022-01-18 |
JPWO2020256102A1 (ja) | 2021-10-21 |
BR112021025852B1 (pt) | 2023-05-16 |
CN118694968A (zh) | 2024-09-24 |
JP6958768B2 (ja) | 2021-11-02 |
JP2024051142A (ja) | 2024-04-10 |
CN113767622B (zh) | 2024-07-26 |
JP7521650B2 (ja) | 2024-07-24 |
US20220109829A1 (en) | 2022-04-07 |
JP2023099774A (ja) | 2023-07-13 |
CN118694969A (zh) | 2024-09-24 |
CN113767622A (zh) | 2021-12-07 |
EP3989545A4 (en) | 2022-08-17 |
CN115442594A (zh) | 2022-12-06 |
EP3989545A1 (en) | 2022-04-27 |
JP2022023855A (ja) | 2022-02-08 |
KR20210128482A (ko) | 2021-10-26 |
US20240179300A1 (en) | 2024-05-30 |
CN118694970A (zh) | 2024-09-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
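WO2020256102A1 (ja) | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program | |
JP7375890B2 (ja) | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program | |
JP6911912B2 (ja) | Image coding device, image coding method, image coding program, image decoding device, image decoding method, and image decoding program | |
WO2020256105A1 (ja) | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program | |
JP2020167663A (ja) | Moving picture decoding device, moving picture decoding method, moving picture decoding program, moving picture coding device, moving picture coding method, and moving picture coding program | |
WO2020262506A1 (ja) | Image coding device, image coding method, image coding program, image decoding device, image decoding method, and image decoding program | |
JP7060773B2 (ja) | Image decoding device, image decoding method, and image decoding program | |
JP6801830B1 (ja) | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program | |
JP6763469B1 (ja) | Image coding device, image coding method, image coding program, image decoding device, image decoding method, and image decoding program | |
WO2020262505A1 (ja) | Image coding device, image coding method, image coding program, image decoding device, image decoding method, and image decoding program | |
JP7354610B2 (ja) | Image coding device, image coding method, and image coding program | |
WO2020179828A1 (ja) | Moving picture coding device, moving picture coding method, moving picture coding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program | |
JP2021002750A (ja) | Image coding device, image coding method, and image coding program | |
JP2021002704A (ja) | Image coding device, image coding method, and image coding program | |
JP2021002749A (ja) | Image decoding device, image decoding method, and image decoding program | |
JP2021002748A (ja) | Image coding device, image coding method, and image coding program | |
JP2021002752A (ja) | Image decoding device, image decoding method, and image decoding program | |
JP2021002751A (ja) | Image coding device, image coding method, and image coding program | |
JP2021057870A (ja) | Image coding device, image coding method, image coding program, image decoding device, image decoding method, and image decoding program |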
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 20826811; Country of ref document: EP; Kind code of ref document: A1 |
| DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (PCT application filed from 20040101) | |
| ENP | Entry into the national phase | Ref document number: 2021526917; Country of ref document: JP; Kind code of ref document: A |
| ENP | Entry into the national phase | Ref document number: 20217030959; Country of ref document: KR; Kind code of ref document: A |
| NENP | Non-entry into the national phase | Ref country code: DE |
| REG | Reference to national code | Ref country code: BR; Ref legal event code: B01A; Ref document number: 112021025852; Country of ref document: BR |
| WWE | WIPO information: entry into national phase | Ref document number: 2020826811; Country of ref document: EP |
| ENP | Entry into the national phase | Ref document number: 2020826811; Country of ref document: EP; Effective date: 20220120 |
| ENP | Entry into the national phase | Ref document number: 112021025852; Country of ref document: BR; Kind code of ref document: A2; Effective date: 20211220 |