WO2021057579A1 - 编解码方法、装置及设备 - Google Patents
编解码方法、装置及设备 Download PDFInfo
- Publication number
- WO2021057579A1 WO2021057579A1 PCT/CN2020/115657 CN2020115657W WO2021057579A1 WO 2021057579 A1 WO2021057579 A1 WO 2021057579A1 CN 2020115657 W CN2020115657 W CN 2020115657W WO 2021057579 A1 WO2021057579 A1 WO 2021057579A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- current image
- image block
- pixel
- motion vector
- frame
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 84
- 239000013598 vector Substances 0.000 claims abstract description 425
- 230000002457 bidirectional effect Effects 0.000 claims description 70
- 230000007774 longterm Effects 0.000 claims description 63
- 230000003287 optical effect Effects 0.000 claims description 40
- 230000004913 activation Effects 0.000 claims description 23
- 230000004927 fusion Effects 0.000 claims description 21
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 80
- 238000010586 diagram Methods 0.000 description 17
- 238000006073 displacement reaction Methods 0.000 description 13
- 230000002123 temporal effect Effects 0.000 description 9
- 230000008569 process Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 2
- 238000005192 partition Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/11—Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
- H04N19/139—Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/527—Global motion vector estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/577—Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- This application relates to video coding and decoding technologies, and in particular to a coding and decoding method, device, and equipment.
- an encoding and decoding method including:
- an encoding and decoding method including:
- the current image block when the motion vector adjustment mode is enabled for the current image block, the current image block simultaneously satisfies the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the control information is to allow the current image block to enable the motion vector adjustment mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the size of the current image block satisfies the limited conditions
- an encoding and decoding method including:
- the control information is to allow the current image block to enable the motion vector adjustment mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the weighted weights of the two reference frames of the current image block are the same;
- the size of the current image block satisfies the limited conditions
- the length and width of the two reference frames of the current image block are respectively the same as the length and width of the frame to which the current image block belongs;
- an encoding and decoding device including:
- the determining unit is used to determine whether the current image block meets the enabling condition of the bidirectional optical flow mode
- the processing unit is used to allow the bidirectional optical flow mode to be enabled if it is; if not, refuse to enable the bidirectional optical flow mode;
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not long-term Reference frame.
- a coding and decoding device including:
- the determining unit is used to determine whether the current image block satisfies the activation condition of the motion vector adjustment mode
- a coding and decoding device including:
- the processing unit is configured to allow the motion vector adjustment mode to be enabled if it is yes; if not, refuse to enable the motion vector adjustment mode;
- the current image block when the motion vector adjustment mode is enabled for the current image block, the current image block simultaneously satisfies the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the weighted weights of the two reference frames of the current image block are the same;
- the size of the current image block satisfies the limited conditions
- a coding and decoding device including:
- the determining unit is used to determine whether the current image block satisfies the activation condition of the motion vector adjustment mode
- the processing unit is configured to allow the motion vector adjustment mode to be enabled if it is yes; if not, refuse to enable the motion vector adjustment mode;
- the current image block when the motion vector adjustment mode is enabled for the current image block, the current image block simultaneously meets the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the control information is to allow the current image block to enable the motion vector adjustment mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the weighted weights of the two reference frames of the current image block are the same;
- the size of the current image block satisfies the limited conditions
- the length and width of the two reference frames of the current image block are respectively the same as the length and width of the frame to which the current image block belongs;
- the two reference frames of the current image block are not long-term reference frames.
- an encoding terminal device including a processor and a machine-readable storage medium, the machine-readable storage medium stores machine-executable instructions that can be executed by the processor, and The processor is configured to execute machine-executable instructions to implement the encoding and decoding method described in any one of the first aspect to the fourth aspect.
- a decoding end device including a processor and a machine-readable storage medium, the machine-readable storage medium stores machine-executable instructions that can be executed by the processor, and The processor is configured to execute machine-executable instructions to implement the encoding and decoding method described in any one of the first aspect to the fourth aspect.
- the predicted compensation value of each pixel of the current image block is determined, and then , Based on the predicted value of each pixel of the current image block and the predicted compensation value, the final predicted value of each pixel of the current image block is determined.
- the prediction compensation adjustment does not need to be limited to the image block using the bidirectional prediction mode, and is not limited to the sub-block An image block in which the motion vector is the same as the motion vector of each pixel in the corresponding sub-block extends the application range of the prediction compensation adjustment.
- FIGS. 1A to 1B are schematic diagrams of block division shown in an exemplary embodiment of the present application.
- Fig. 2 is a schematic diagram showing an 8-plug interpolation according to an exemplary embodiment of the present application
- Fig. 4 is a schematic diagram showing a motion vector of a control point of an affine motion mode according to an exemplary embodiment of the present application
- Fig. 5A is a schematic diagram of an entire pixel block according to an exemplary embodiment of the present application.
- FIG. 5B is a schematic diagram of filling 1 row/column of pixels according to an exemplary embodiment of the present application.
- FIG. 5C is a schematic diagram of filling 2 rows/columns of pixels according to an exemplary embodiment of the present application.
- FIG. 5D is a schematic diagram of filling 2 rows/columns of pixels according to another exemplary embodiment of the present application.
- Fig. 6 is a schematic flowchart of a coding and decoding method shown in an exemplary embodiment of the present application
- FIGS. 7A to 7E are schematic diagrams of filling 1 row/column of pixels according to an exemplary embodiment of the present application.
- Fig. 8 is a schematic flowchart of a method for selecting candidate motion vectors according to an exemplary embodiment of the present application
- Fig. 9 is a schematic flowchart of a method for selecting candidate motion vectors according to an exemplary embodiment of the present application.
- FIG. 11 is a schematic flowchart of a method for selecting a prediction mode according to an exemplary embodiment of the present application.
- FIG. 13 is a schematic diagram showing the hardware structure of an encoding terminal device according to an exemplary embodiment of the present application.
- a coding tree unit (Coding Tree Unit, CTU for short) is recursively divided into CU (Coding Unit) using a quadtree.
- CU Coding Unit
- the CU may be further divided into two or four prediction units (Prediction Unit, PU for short), and the same prediction information is used in the same PU.
- Prediction Unit PU for short
- a CU can be further divided into multiple transform units (Transform Units, TU for short).
- Transform Units Transform Units, TU for short.
- the current image block in this application is a PU.
- VVC Versatile Video Coding
- a partition structure that mixes binary tree/tri fork tree/quad tree replaces the original partition mode, cancels the original distinction between the concepts of CU, PU, and TU, and supports a more flexible way of partitioning CU.
- the CU can be divided into a square or a rectangle.
- the CTU first divides the quadtree, and then the leaf nodes of the quadtree can be further divided into binary trees and ternary trees.
- FIG. 1A there are five types of CU divisions, namely, quadtree division, horizontal binary tree division, vertical binary tree division, horizontal trinomial tree division, and vertical trinomial tree division. It can be any combination of the above five division types. Different division methods can be seen from the above, so that the shape of each PU is different, such as rectangles and squares of different sizes.
- Prediction signal refers to the pixel value derived from the pixel that has been coded and decoded.
- the residual is obtained from the difference between the original pixel and the predicted pixel, and then the residual transformation quantization and coefficient coding are performed.
- Motion Vector In inter-frame coding, MV is used to represent the relative displacement between the current coding block and the best matching block in the reference image. Each divided block (may be called a sub-block) has a corresponding motion vector that needs to be transmitted to the decoding end. If the MV of each sub-block is independently coded and transmitted, especially if it is divided into small-sized sub-blocks, it needs to consume a lot of bits. In order to reduce the number of bits used to encode the MV, the spatial correlation between adjacent image blocks is used in video encoding to predict the MV of the current block to be encoded based on the MV of the adjacent encoded block, and then encode the prediction difference . This can effectively reduce the number of bits representing MV.
- the MV of the adjacent encoded block is generally used to predict the MV of the current image block, and then the MV prediction value (Motion Vector Prediction, referred to as MVP) and the real motion vector are predicted.
- MVP Motion Vector Prediction
- the difference between the estimates, that is, the Motion Vector Difference (MVD for short) is coded, thereby effectively reducing the number of MV coded bits.
- Motion compensation refers to the process of obtaining all predicted pixel values of the current image block through interpolation (or copying).
- BDOF mode (Bi-directional Optical Flow, bi-directional optical flow mode): It can also be referred to as BIO mode, which refers to a mode in which motion compensation values are adjusted based on the motion compensation values of two reference frames and based on the optical flow method.
- the predicted value of each pixel of the current image block may be determined based on the motion information of the current image block.
- an 8*8 image block it can be divided into 4 4*4 sub-blocks, and the predicted value of each pixel in each 4*4 sub-block is determined respectively.
- Step S320 Determine the offset vector of each pixel of the current image block.
- the final predicted value of each pixel of the current image block can be determined based on the predicted value of each pixel of the current image block and the predicted compensation value.
- the predicted compensation value of each pixel of the current image block is determined, and then , Based on the predicted value of each pixel of the current image block and the predicted compensation value, the final predicted value of each pixel of the current image block is determined.
- the prediction compensation adjustment does not need to be limited to the image block using the bidirectional prediction mode, and is not limited to the sub-block An image block in which the motion vector is the same as the motion vector of each pixel in the corresponding sub-block extends the application range of the prediction compensation adjustment.
- determining the predicted value of each pixel of the current image block may include:
- the gradient value of each pixel of the current image block in that direction is determined.
- determining the offset vector of each pixel of the current image block may include:
- determining the final predicted value of each pixel of the current image block based on the predicted value of each pixel of the current image block and the predicted compensation value may include:
- the final predicted value of each pixel of the current image block in this direction is determined.
- the unidirectional predicted value of each pixel of the current image block may be determined.
- the forward prediction value of each pixel of the current image block can be determined; if the current image block adopts the backward prediction mode, the backward prediction of each pixel point of the current image block can be determined value.
- the following takes the unidirectional prediction mode adopted by the current image block as the forward prediction mode as an example.
- the forward gradient value of each pixel of the current image block can be determined based on the forward predicted value of each pixel of the current image block; on the other hand, The forward offset vector of each pixel of the current image block can be determined.
- the forward prediction compensation value of each pixel of the current image block can be determined based on the forward gradient value and offset vector of each pixel of the current image block, and the forward prediction value of each pixel of the current image block and the prediction compensation can be determined. Value to determine the final predicted value of each pixel in the current image block.
- determining the predicted value of each pixel of the current image block may include:
- the forward and backward gradient values of each pixel of the current image block are determined.
- determining the final predicted value of each pixel of the current image block based on the predicted value of each pixel of the current image block and the predicted compensation value may include:
- the final predicted value of each pixel of the current image block is determined.
- the current image block adopts the bidirectional prediction mode
- the final predicted value of each pixel of the current image block is determined.
- the method for determining the forward final prediction value or the backward final prediction value of each pixel of the current image block can refer to the implementation of the current image block using the unidirectional prediction mode.
- the forward and backward final predicted values of each pixel of the current image block can be used to determine the final predicted value of each pixel of the current image block. The final predicted value.
- determining the offset vector of each pixel of the current image block may include:
- the offset vector of the remaining pixels in the sub-block is determined.
- the offset of any pixel in the sub-block (herein referred to as the designated pixel) may be determined first.
- the shift vector and further, based on the shift vector of the designated pixel in the sub-block, the shift vector of the remaining pixels in the sub-block is determined.
- the foregoing determination of the offset vector of the specified pixel of the sub-block may include:
- the offset vector of the designated pixel in the sub-block is determined.
- the offset of the specified pixel in the sub-block from the center position of the sub-block and the affine parameter can be used to determine the value of the specified pixel in the sub-block. Offset vector.
- the foregoing determination of the offset vector of the specified pixel in the sub-block based on the offset of the specified pixel in the sub-block from the center position of the sub-block and the affine parameter may include:
- the second affine parameter determines the horizontal component of the offset vector of the specified pixel in the sub-block
- the fourth affine parameter determines the vertical component of the offset vector of the designated pixel in the sub-block.
- the first affine parameter is the same as the fourth affine parameter, and both are the ratio of the first value to the width of the sub-block, and the second affine parameter is the same as the third affine parameter.
- the parameters are opposite, and the third affine parameter is the ratio of the second value to the width of the sub-block.
- the first affine parameter is the ratio of the first value to the width of the sub-block
- the second affine parameter is the ratio of the third value to the height of the sub-block
- the third affine parameter is the first The ratio of the two values to the width of the sub-block
- the fourth affine parameter is the ratio of the fourth value to the height of the sub-block.
- the first value is the difference between the horizontal component of the motion vector of the upper right control point of the sub-block and the horizontal component of the motion vector of the upper left control point
- the second value is the vertical component of the motion vector of the upper right control point of the sub-block and the upper left control point
- the third value is the difference between the horizontal component of the motion vector of the lower left control point of the sub-block and the horizontal component of the motion vector of the upper left control point
- the fourth value is the lower left control point of the sub-block The difference between the vertical component of the motion vector and the vertical component of the motion vector of the upper left control point.
- the motion vectors of the upper left control point, the upper right control point, and the lower left control point are (v 0x , v 0y ), (v 1x , v 1y ), and (v 2x , v 2y ), respectively
- the first value is v 1x -v 0x
- the second value is v 1y -v 0y
- the third value is v 2x -v 0x
- the fourth value is v 2y -v 0y
- the first affine parameter to the fourth affine parameter (assuming c to f respectively) under the parametric affine model can be as shown in formulas 1 and 2:
- determining the offset vector of the designated pixel in the sub-block can be achieved by the following formula:
- ⁇ v x (x, y) is the horizontal component of the offset vector of the specified pixel in the sub-block
- ⁇ v y (x, y) is the vertical component of the offset vector of the specified pixel in the sub-block
- ( ⁇ x, ⁇ y) is the offset of the designated pixel from the center position of the sub-block.
- determining the offset vectors of the remaining pixels in the sub-block based on the offset vectors of the designated pixels in the sub-block may include:
- any pixel in the row of the specified pixel in the sub-block based on the horizontal component of the offset vector of the pixel and the second affine parameter, determine the value of the remaining pixels in the sub-block in the same column as the pixel.
- the horizontal component of the offset vector and, based on the vertical component of the offset vector of the pixel and the fourth affine parameter, determine the vertical component of the offset vector of the remaining pixels in the same column as the pixel in the sub-block.
- the offset vector of the specified pixel in the sub-block is determined, the offset vector of the sub-block, the first affine parameter, and the third affine
- the parameters respectively determine the offset vectors of the remaining pixels in the same row as the designated pixel in the sub-block.
- the offset vector of each pixel in the row of the specified pixel in the sub-block is determined, for any pixel in the row of the specified pixel in the sub-block, it can be based on the offset vector of the pixel, the second The affine parameter and the fourth affine parameter determine the offset vector of the remaining pixels in the same column as the pixel in the sub-block.
- the horizontal component of the offset vector of the pixel may be the horizontal component of the offset vector of the adjacent pixel (if any) to the left of the pixel and the first The sum of affine parameters, or the difference between the horizontal component of the offset vector of the adjacent pixel (if any) to the right of the pixel and the first affine parameter;
- the vertical component of the offset vector of the pixel can be the The sum of the vertical component of the offset vector of the adjacent pixel on the left side of the pixel (if any) and the third affine parameter, or the vertical of the offset vector of the adjacent pixel on the right of the pixel (if any) The difference between the component and the third affine parameter.
- the horizontal component of the offset vector of the pixel may be the horizontal component of the offset vector of the adjacent pixel (if any) above the pixel and the second affine parameter
- the vertical component of the offset vector of the pixel point can be the offset vector above the pixel point
- the sum of the vertical component of the offset vector of the adjacent pixel (if present) and the fourth affine parameter, or the vertical component of the offset vector of the adjacent pixel (if present) below the pixel and the fourth affine The difference between the parameters.
- determining the gradient value of each pixel of the current image block based on the predicted value of each pixel of the current image block may include:
- N rows/columns refer to N rows and N columns
- the gradient value of each pixel in the sub-block is determined.
- N is a positive integer
- pixels are integers
- the foregoing filling of N rows/columns of full pixels on the upper, lower, left, and right edges of the sub-block may include:
- the pixel values of the nearest N rows/columns of the whole pixel points around the whole pixel block are respectively used as the filling values of the upper, lower, left, and right edges of the sub-block.
- the nearest equal-size whole pixel block (a block composed of whole pixels) in the reference frame to the sub-block may be determined first.
- the sub-block is a 4*4 sub-block
- the 4*4 sub-block whole pixel block closest to the sub-block in the reference frame can be determined, and the schematic diagram thereof may be as shown in FIG. 5A.
- the above-mentioned filling of N rows/columns of full pixels on the upper, lower, left, and right edges of the sub-block may include:
- the finally determined filling point may be as shown in FIG. 5D.
- the above-mentioned filling of N rows/columns of full pixels on the upper, lower, left, and right edges of the sub-block may include:
- the sub-pixel vertical component of the predicted value of the pixel in the sub-block is greater than half a pixel, then fill the upper edge of the sub-block with the nearest neighbor integer pixel above, and fill the lower edge of the sub-block with the next nearest integer pixel below. pixel;
- the sub-pixel vertical component of the predicted value of the pixel in the sub-block is less than half a pixel, then fill the upper edge of the sub-block with the upper next nearest integer pixel, and fill the lower edge of the sub-block with the lower nearest neighbor integer pixel;
- the left edge of the sub-block is filled with the nearest whole pixel on the left, and the right edge of the sub-block is filled with the next Neighboring pixels;
- the sub-pixel horizontal component of the predicted value of the pixel in the sub-block is less than half a pixel, then fill the left edge of the sub-block with the next nearest whole pixel on the left, and fill the right edge of the sub-block with the nearest one on the right Adjacent to the whole pixel.
- the determination of the horizontal component of the gradient value of a pixel is taken as an example. Considering that when determining the horizontal component of the gradient value of a pixel, it is necessary to use a pixel whose left side and the relative displacement of the pixel is one pixel. The pixel value of a point, and the pixel value of a pixel where the relative displacement between the right side and the pixel point is one pixel. In addition, when the sub-block is filled with pixels, the filled rows/columns are all whole pixels. Therefore, when the predicted value of the pixel in the sub-block is non-integer, the filled whole pixel is the same as the edge of the sub-block. The relative displacement of the pixel point will not be one pixel (the relative displacement between a non-integer pixel and an entire pixel is a non-integer pixel).
- the sub-block In order to reduce the error of the determined gradient value of the pixel and improve the accuracy of the determined gradient value of the pixel, when filling the upper, lower, left, and right edges of the sub-block with 1 row/column of whole pixels, it can be based on the sub-block Whether the vertical and horizontal components of the sub-pixels of the predicted value of the pixel (that is, the part other than the whole pixel, if the pixel value is 5.4, then the sub-pixel is 0.4) is greater than half a pixel (that is, 0.5 pixel), to select the filling The position of the whole pixel.
- the sub-pixel of the predicted value of the pixel in the sub-block refers to the sub-pixel of the corresponding position of the predicted value of the pixel in the sub-block in the reference frame.
- the relative displacement between the next nearest whole pixel below the pixel and the pixel is closest to one pixel.
- the relative displacement between the nearest integer pixel below the pixel and the pixel is 0.2
- the next nearest pixel below the pixel is The relative displacement of the pixel point and the pixel point
- the relative displacement of the remaining whole pixel points under the pixel point and the pixel point are all greater than 1.2.
- the relative displacement of the next nearest whole pixel point under the pixel point and the pixel point The displacement is closest to one pixel.
- the pixel is filled with the next nearest whole pixel.
- the sub-pixel vertical component of the predicted value of the pixel in the sub-block is less than half a pixel, then the upper edge of the sub-block is filled with the next nearest whole pixel, and the lower edge of the sub-block is filled with the lower Nearest whole pixel;
- the left edge of the sub-block is filled with the nearest whole pixel on the left, and the right edge of the sub-block is filled with the next Neighboring pixels;
- the processing can be performed according to the case that the sub-pixel horizontal component of the predicted value of the pixel is greater than half a pixel, or according to the predicted value of the pixel If the sub-pixel horizontal component is less than half a pixel, it will be processed.
- the processing can be performed according to the case that the sub-pixel vertical component of the predicted value of the pixel is greater than half a pixel, or according to the sub-pixel of the predicted value of the pixel.
- the case where the vertical component of the pixel is smaller than half a pixel is processed.
- filling 1 row/column of pixels on the upper, lower, left, and right edges of the sub-block respectively includes:
- the filling area formed by the rows/columns filled in the upper, lower, left, and right edges of the sub-block can be used The whole pixel points except the 4 corner points are filled.
- the foregoing determination of the gradient value of each pixel in the sub-block based on the predicted value of each pixel of the sub-block and the pixel value of the filled whole pixel may include:
- the horizontal component of the gradient value of the pixel may be determined based on the pixel value of the pixel on the left and the pixel value of the pixel on the right, and based on the pixel The pixel value of the upper pixel and the pixel value of the lower pixel.
- the gradient value of the pixel can be determined by the following formula:
- g x (i, j) is the horizontal component of the gradient value of the pixel in the i-th column and j-th row in the sub-block
- g y (i, j) is the i-th column and j-th row in the sub-block
- I(i,j) is the pixel value of the pixel in the i-th column and j-th row in the sub-block.
- the pixel value in the sub-block has the predicted value; the filled whole pixel has the original value; for the decoding end, the pixel value in the sub-block is It is the predicted value; the pixel value of the filled whole pixel is the reconstructed value.
- the method may further include:
- the prediction compensation adjustment enabling condition may include that the current image block adopts a specified prediction mode, and the motion vector of the sub-block in the specified prediction mode is not completely the same as the motion vector of each pixel in the sub-block.
- the predictive compensation adjustment enabling condition can be set to the specified prediction mode for the current image block.
- the predicted value of each pixel of the current image block is determined, it may be determined based on the prediction mode of the current image block whether to enable the prediction compensation adjustment solution provided in the embodiment of the present application.
- the current image block is predicted and compensated according to the method described in the foregoing embodiment.
- the designated prediction module includes an affine motion mode.
- the aforementioned predictive compensation adjustment enabling condition may further include: the currently predicted component is a luminance component, that is, the luminance component of the image block that adopts the specified prediction mode is subjected to predictive compensation adjustment according to the predictive compensation adjustment solution provided in this embodiment of the application. .
- the aforementioned prediction compensation adjustment enabling condition may further include: the currently predicted component is a chrominance component, that is, the chrominance component of an image block using a specified prediction mode is predicted according to the prediction compensation adjustment solution provided in this embodiment of the application. Compensation adjustment.
- FIG. 6 is a schematic flowchart of an encoding and decoding method provided by an embodiment of this application. Taking the decoding process and adopting the Affine mode as an example, as shown in FIG. 6, the encoding and decoding method may include:
- Step S600 Determine the predicted value of each pixel of the decoded block.
- the decoding block is divided into 4*4 sub-blocks, and each sub-block is decoded separately.
- the predicted value of each pixel in the sub-block can be determined based on the Affine mode.
- a pixel with a coordinate (i, j) in a sub-block its pixel value is denoted as I(i, j).
- Step S610 Determine whether the decoded block satisfies the predictive compensation adjustment enabling condition, if yes, go to step S620; otherwise, end the current process.
- the predicted value of each pixel of the decoded block is determined, it can be determined whether the decoded block satisfies the prediction compensation adjustment activation condition.
- the prediction compensation adjustment may not be performed according to the prediction compensation adjustment scheme provided in the embodiment of the present application.
- the predicted value of each pixel of the decoded block determined in step S600 may be changed It is determined as the final predicted value of each pixel; or, the prediction compensation adjustment scheme can be carried out according to other strategies, and the specific implementation is not limited.
- Step S620 For any sub-block, determine the gradient value of each pixel of the sub-block based on the predicted value of each pixel of the sub-block.
- Step S630 Determine the offset vector of each pixel of the sub-block.
- Step S640 Determine the prediction compensation value of each pixel of the sub-block based on the gradient value and the offset vector of each pixel of the sub-block.
- Step S650 Determine the final predicted value of each pixel of the decoded block based on the predicted value of each pixel of the decoded block and the predicted compensation value.
- the forward and backward prediction values and prediction compensation values of each pixel of the decoding block can be determined according to the methods described in the above steps S600 to S640, and then the final prediction value of each pixel of the decoding block can be obtained. Predictive value.
- the current image block adopts the affine motion mode.
- the enabling condition of the encoding and decoding method flow included in the foregoing steps S600 to S650 may include:
- the currently predicted component is the luminance component.
- the current image block adopts affine motion mode
- the currently predicted component is the chrominance component.
- the gradient value of the pixel can be determined based on formulas 4 and 5 (the horizontal component of the gradient value of the pixel can be described as: mm_gradx, the gradient of the pixel The vertical component can be described as: mm_grady).
- filling 1 row/column of full pixels on the upper, lower, left, and right edges of the 4*4 sub-blocks may include the following implementation manners:
- the whole pixel filled in the upper, lower, left, and right edges of the 4*4 sub-block may be selected.
- the upper edge of the 4*4 sub-block is filled with the nearest neighboring whole pixel above, for the 4*4
- the bottom edge of the sub-block is filled with the next-neighbor integer pixels below
- the left edge of the 4*4 sub-block is filled with the left-most neighboring integer pixels
- the right edge of the 4*4 sub-block is filled with the next-neighbor integer pixels on the right point.
- the upper edge of the 4*4 sub-block is filled with the next nearest whole pixel, and the 4*4 sub-block is filled with The bottom edge of the block is filled with the nearest neighboring integer pixels below, the left edge of the 4*4 sub-block is filled with the left next neighboring integer pixels, and the right edge of the 4*4 sub-block is filled with the nearest neighboring integer pixel on the right .
- the upper edge of the 4*4 sub-block is filled with the next nearest whole pixel, for 4*
- the bottom edge of the 4 sub-block is filled with the nearest neighbor integer pixel below
- the left edge of the 4*4 sub-block is filled with the left nearest neighbor integer
- the right edge of the 4*4 sub-block is filled with the next nearest neighbor on the right pixel.
- the triangle is the predicted value of each pixel in the 4*4 sub-block
- the circle is the reconstructed value of the entire pixel in the reference frame
- the shaded circle is the one selected for filling. The reconstructed value of the whole pixel in the reference frame.
- the processing can be performed according to the case that the sub-pixel horizontal component of the predicted value of the pixel is greater than half a pixel, or according to the prediction of the pixel The case where the sub-pixel horizontal component of the value is less than half a pixel is processed.
- the processing can be performed according to the case that the sub-pixel vertical component of the predicted value of the pixel is greater than half a pixel, or according to the sub-pixel of the predicted value of the pixel.
- the case where the vertical component of the pixel is smaller than half a pixel is processed.
- FIG. 7E a schematic diagram of filling the top, bottom, left, and right edges of the 4*4 sub-block with 1 row/column may be as shown in FIG. 7E.
- determining the offset vector of each pixel of the sub-block may include:
- pixel As the first pixel as an example.
- the offset between the first pixel in the 4*4 sub-block and the center position of the 4*4 sub-block can be determined (Assuming ( ⁇ x, ⁇ y)).
- the offset vector of the first pixel of the 4*4 sub-block can be determined by the following formula:
- ⁇ v x (w, 0) is the horizontal component of the offset vector of the pixel in the w+1 column and row 1 in the 4*4 sub-block
- ⁇ v y (w, 0) is the 4*4 sub-block
- width is the width of the current image block
- height is the height of the current image block
- MAX_CU_DEPTH is the width or depth of the maximum image block
- the default is 7 (that is, the width or height of the corresponding image block is 128).
- c, d, e, and f can be determined by the following formulas:
- the offset vector of each pixel may be enlarged.
- the offset vector of the remaining pixels in the first row of the 4*4 sub-block can be determined by the following formula:
- the offset vectors of the remaining pixels in the 4*4 sub-block can be determined by the following formula:
- the offset vector of each pixel enlarged by N1 times can be reduced by N2 times.
- N2 may be the same as N1 or different from N1.
- the determined offset vector can be reduced by the following formula:
- nOffset 1 ⁇ (MAX_CU_DEPTH-1) (16)
- mvx is the horizontal component of a pixel offset vector
- mvy is the vertical component of the pixel offset vector
- ⁇ means left shift
- bitdlimit max(6, bitdepth-6), bitdpth is the bit depth, that is, the bit width required for the brightness value, generally 10 or 8.
- bdlimit max(5, bitdepth-7).
- the offset vector of each pixel in the sub-block can be determined by the following formula:
- the prediction and compensation value of each pixel of the sub-block can be determined by the following formula:
- ⁇ I (i, j) is the prediction compensation value of the pixel in the i-th column and j-th row in the sub-block
- g x (i, j) is the value of the pixel in the i-th column and j-th row in the sub-block
- the horizontal component of the gradient value, g y (i, j) is the vertical component of the gradient value of the pixel in the i-th column and the j-th row in the sub-block
- ⁇ v x (i, j) is the i-th column in the sub-block.
- the horizontal component of the offset vector of the pixel in the j row, ⁇ v y (i, j) is the vertical component of the offset vector of the pixel in the i-th column and the j-th row in the sub-block.
- unidirectional (forward or backward) prediction compensation adjustment can be realized by the following formula:
- mm_dI3 (mm_src+mm_dI2)*mm_w (24)
- the decoded block adopts a bidirectional prediction mode
- the forward and backward prediction compensation adjustments are completed respectively, and then the prediction compensation adjustment of the decoded block is implemented.
- the forward prediction compensation adjustment can be realized by the following formula:
- mm_dI01 mvx0*mm_gradx0+mvy0*mm_grady0 (25)
- mm_dI03 (mm_src0+mm_dI02)*mm_w0 (27)
- mm_dI02 is the forward prediction compensation value of a certain pixel
- mm_dI03 is the final predicted value of the pixel in the forward direction
- mvx0 is the horizontal component of the offset vector in the forward direction of the pixel
- mvy0 is the forward direction of the pixel.
- the vertical component of the offset vector of the pixel mm_gradx0 is the horizontal component of the gradient value of the pixel in the forward direction
- mm_grady0 is the vertical component of the gradient value of the pixel in the forward direction
- mm_src0 is the predicted value of the pixel in the forward direction
- mm_w0 is the The forward weight value of the pixel.
- mm_dI13 (mm_src1+mm_dI12)*mm_w1 (30)
- mm_dI12 is the backward prediction compensation value of a certain pixel
- mm_dI13 is the final prediction value of the pixel in the backward direction
- mvx1 is the horizontal component of the backward offset vector of the pixel
- mvy1 is the backward of the pixel.
- the vertical component of the offset vector of the pixel mm_gradx1 is the horizontal component of the backward gradient value of the pixel
- mm_grady1 is the vertical component of the backward gradient value of the pixel
- mm_src1 is the backward predicted value of the pixel
- mm_w1 is the The weight value of the pixel in the backward direction.
- the weighting process is completed (the sum of the forward final predicted value and the backward final predicted value is divided by the forward weight value and the backward weight value The sum of the two), the final predicted value of each pixel of the decoded block is obtained.
- Step S800 Determine whether the current image block meets the time domain reference frame use condition. If yes, go to step S810; otherwise, go to step S820.
- step S810 the motion vector of the time-domain reference frame is allowed to be used for encoding/decoding of the current image block.
- Step S820 Refuse to use the motion vector of the time-domain reference frame for encoding/decoding of the current image block.
- the probability of the motion vector of the time domain reference frame being finally selected is low, in order to improve the coding and decoding performance, avoid the situation where the motion vector of the time domain reference frame is not applicable.
- the condition of whether to allow the motion vector of the time-domain reference frame to be used (herein referred to as the time-domain reference frame use condition) can be preset.
- the motion vector of the time domain reference frame is allowed to be used for the encoding/decoding of the current image block, or , Refuse to use the motion vector of the time-domain reference frame for encoding/decoding of the current image block.
- the above-mentioned time-domain reference frame usage conditions may include:
- the size of the reference frame is the same as the size of the frame to which the current image block belongs.
- the reference frame size can be changed
- the size of the frame to which the current image block belongs is the same as the time domain reference frame use condition, that is, when the size of the reference frame is the same as the size of the frame to which the current image block belongs, the motion vector of the time domain reference frame is allowed to be used for the current image block
- the size of the reference frame is different from the size of the frame to which the current image block belongs, refuse to use the motion vector of the time domain reference frame for encoding/decoding of the current image block.
- time-domain reference frame usage conditions may further include:
- the reference frame is not a long-term reference frame.
- the long-term reference frame is relative to the reference frame adjacent to the frame to which the current image block belongs. Generally speaking, the long-term reference frame is farther from the frame to which the current image block belongs.
- the reference frame can also be selected
- the frame is not a long-term reference frame and is used as a time-domain reference frame. That is, when the size of the reference frame is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame, the motion vector of the time-domain reference frame is allowed to be used.
- the motion vector of the time domain reference frame is rejected and used for the current image block Encoding/decoding.
- the aforementioned allowable use of the motion vector of the time-domain reference frame, after being used for encoding/decoding of the current image block may further include:
- the motion vector of the time-domain reference frame is allowed to be scaled and used for the encoding/decoding of the current image block
- the motion vector of the time-domain reference frame used for scaling is rejected for encoding/decoding of the current image block.
- the probability that the motion vector of the scaled time-domain reference frame is finally selected is low, therefore, in order to improve the coding and decoding performance, avoid applying the scaled time-domain reference frame
- the motion vector of the scaled time domain reference frame is used as the candidate motion vector, and the conditions for whether to allow the use of the motion vector of the scaled time domain reference frame (in this article, the time domain reference frame motion Conditions for use of vector scaling).
- time-domain reference frame motion vector scaling usage conditions may include:
- the reference frame size can be the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame, as the time domain reference frame motion vector scaling use condition.
- the motion vector of the time-domain reference frame after scaling is allowed to be used for encoding/decoding of the current image block;
- the motion vector of the scaled time-domain reference frame is rejected for encoding/decoding of the current image block.
- the method further includes:
- the first type of prediction mode includes modes involving temporal motion vectors, such as TMVP mode, BDOF mode, DMVR mode, etc.;
- the second type of prediction mode includes modes involving scaling of temporal motion vectors, such as SMVD mode.
- the second type prediction mode may also be allowed to be used.
- the foregoing determining whether the time domain reference frame motion vector scaling usage condition is satisfied it may further include:
- the first type prediction mode and the second type prediction mode are allowed to be used;
- the second type of prediction mode is rejected.
- the first type prediction mode may also be rejected.
- FIG. 9 is a schematic flowchart of a method for selecting a candidate motion vector according to an embodiment of this application.
- the method for selecting a candidate motion vector may include the following steps:
- Step S900 Determine whether the current image block satisfies the time domain reference frame motion vector scaling use condition. If yes, go to step S910; otherwise, go to step S920.
- Step S910 Allow zooming to use the motion vector of the time-domain reference frame for encoding/decoding of the current image block.
- the motion vector of the scaled time-domain reference frame considering that the motion vector of the scaled time-domain reference frame has a low probability of being finally selected under certain specific circumstances, in order to improve the coding and decoding performance, it is avoided that the scaled time-domain is not applicable.
- the motion vector of the reference frame the motion vector of the scaled time-domain reference frame is used as the candidate motion vector, and the conditions for whether to allow the use of the motion vector of the scaled time-domain reference frame (referred to as Time domain reference frame motion vector scaling conditions).
- time-domain reference frame motion vector scaling usage conditions may include:
- the size of the reference frame is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame.
- the reference frame size can be the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame, as the time domain reference frame motion vector scaling use condition.
- the motion vector of the time-domain reference frame after scaling is allowed to be used for encoding/decoding of the current image block;
- the motion vector of the scaled time-domain reference frame is rejected for encoding/decoding of the current image block.
- the foregoing determining whether the time domain reference frame motion vector scaling usage condition is satisfied it may further include:
- the first type prediction mode and the second type prediction mode are allowed to be used;
- the second type of prediction mode is rejected.
- the first type of prediction mode includes modes involving temporal motion vectors, such as TMVP mode, BDOF mode, DMVR mode, etc.;
- the second type of prediction mode includes modes involving scaling of temporal motion vectors, such as SMVD mode.
- the first type prediction mode may also be rejected.
- FIG. 10 is a schematic flowchart of a method for selecting a prediction mode according to an embodiment of this application. As shown in FIG. 10, the method for selecting a prediction mode may include the following steps:
- Step S1000 Determine whether the current image block meets the specified prediction mode activation condition. If yes, go to step S1010; otherwise, go to step S1020.
- Step S1010 allowing the designated prediction mode to be enabled.
- Step S1020 Refuse to enable the designated pre-stored mode.
- the use of a specified prediction mode will reduce the codec performance. Therefore, in order to improve the codec performance, avoid using the specified prediction mode as a candidate when the specified prediction mode is not applicable.
- the prediction mode it is possible to pre-set the conditions of whether to allow the specified prediction mode to be enabled (referred to as the specified prediction mode use condition in this article).
- the above-mentioned designated prediction mode includes a first type of prediction mode.
- the first type of prediction mode includes modes involving temporal motion vectors, such as TMVP mode, BDOF mode, DMVR mode, and so on.
- the above specified prediction mode usage conditions may include:
- the size of the reference frame is the same as the size of the frame to which the current image block belongs.
- the size of the reference frame is the same as the specified prediction mode. That is, when the size of the reference frame is the same as the size of the frame to which the current image block belongs, the specified prediction mode is allowed to be enabled; when the size of the reference frame is the same as that of the current image block. When the frame size is different, refuse to enable the specified prediction mode.
- the above specified prediction mode usage conditions may also include:
- the reference frame is not a long-term reference frame.
- the reference frame can also be The long-term reference frame is used as a condition for the specified prediction mode, that is, when the size of the reference frame is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame, the specified prediction mode is allowed to be enabled; when the size of the reference frame is the same as the current image When the size of the frame to which the block belongs is different, or/and the reference frame is a long-term reference frame, refuse to enable the specified prediction mode.
- the foregoing designated prediction mode includes a first type prediction mode and a second type prediction mode.
- the second type of prediction mode includes a mode that involves scaling a temporal motion vector, such as the SMVD mode.
- the above specified prediction mode usage conditions may include:
- the reference frame size can be the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame, as the specified prediction mode use condition.
- the specified prediction mode is allowed; when the size of the reference frame is different from the size of the frame to which the current image block belongs, or/and the reference frame When it is a long-term reference frame, refuse to enable the specified prediction mode.
- the above specified prediction mode usage conditions may include:
- the reference The frame size is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame, as the use condition of the specified prediction mode.
- the specified prediction mode is allowed to be enabled; when the size of the reference frame is different from the size of the frame to which the current image block belongs, or/and the reference frame When it is a long-term reference frame, refuse to enable the specified prediction mode.
- Step S1100 Determine whether the current image block meets the DMVR mode activation condition. If yes, go to step S1110; otherwise, go to step S1120.
- Step S1120 Refuse to enable the DMVR mode.
- the DMVR mode enabling condition in this article considering that the efficiency of decoding in DMVR mode is low in some specific situations, in order to improve the codec performance and avoid enabling the DMVR mode when the DMVR mode is not applicable, it can be preset Whether to allow the DMVR mode to be enabled (referred to as the DMVR mode enabling condition in this article).
- the current image block when the current image block is decoded, it can be determined whether the current image block meets the DMVR mode activation condition; if so, the DMVR mode is allowed to be activated; otherwise, the DMVR mode is refused to be activated.
- the current mode is regular merge/skip mode.
- the candidate motion information list includes: spatial neighboring block candidate motion information, temporal neighboring block candidate motion information, spatial non-neighboring block candidate motion information, motion information obtained by combining existing motion information, or/and, by default Sports information and so on.
- the DMVR mode when it is determined that the current image block is in the regular merge/skip mode, the DMVR mode is allowed to be enabled; otherwise, the DMVR mode is refused to be enabled.
- the aforementioned DMVR mode activation conditions may include:
- the switch for controlling the DMVR mode at the sequence level is the first value, and the switch for controlling the DMVR mode at the frame level is the first value;
- the current mode is regular merge/skip mode
- the current image block adopts bidirectional prediction mode.
- One display order of the two reference frames is before the frame to which the current image block belongs, and the other display order is after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs ;
- the current image block size meets the limited conditions
- the length and width of the two reference frames are respectively the same as the length and width of the frame to which the current image block belongs.
- the above-mentioned first value is 1.
- the size of the current image block meets the restriction conditions, including: the width of the current image block is greater than or equal to 8, the height of the current image block is greater than or equal to 8, and the area of the current image block is greater than or equal to 128.
- the predicted compensation value of each pixel of the current image block is determined, and further, based on the current
- the prediction value and prediction compensation value of each pixel of the image block are used to determine the final prediction value of each pixel of the current image block.
- the prediction compensation adjustment does not need to be limited to the image block using the bidirectional prediction mode, and is not limited to the motion vector and the motion vector of each sub-block.
- the image block in which the motion vector of each pixel in the corresponding sub-block is the same extends the application range of the prediction compensation adjustment.
- the current image block meets the following conditions at the same time: the current mode of the current image block is the traditional fusion mode or the traditional skip mode; the sequence-level control motion vector adjustment mode switch is the first selection Value; the switch of the frame-level control motion vector adjustment mode is the first value; the current image block adopts the bidirectional prediction mode, and the two reference frames are displayed in a sequence before the frame of the current image block, and the other display sequence is located in the frame of the current image block After that, the distance between the two reference frames and the frame to which the current image block belongs is equal; the weights of the two reference frames of the current image block are the same; the size of the current image block meets the limited conditions; the length and width of the two reference frames of the current image block They are the same as the length and width of the frame to which the current image block belongs; the two reference frames of the current image block are not long-term reference frames.
- the sequence-level control motion vector adjustment mode switch is the first value, which means that the sequence-level control allows the current image block to enable the motion vector adjustment mode;
- the frame-level control motion vector adjustment mode switch is the first value, which means the frame-level control Allow the current image block to enable the motion vector adjustment mode.
- FIG. 12 is a schematic structural diagram of an encoding and decoding apparatus provided by an embodiment of this application.
- the encoding and decoding apparatus may include:
- the first determining unit 1210 is configured to determine the predicted value of each pixel of the current image block
- the second determining unit 1220 is configured to determine the gradient value of each pixel of the current image block based on the predicted value of each pixel of the current image block;
- the third determining unit 1230 is configured to determine the offset vector of each pixel of the current image block
- the fourth determining unit 1240 is configured to determine the prediction compensation value of each pixel of the current image block based on the gradient value and the offset vector of each pixel of the current image block;
- the fifth determining unit 1250 is configured to determine the final predicted value of each pixel of the current image block based on the predicted value of each pixel of the current image block and the predicted compensation value.
- the first determining unit 1210 is specifically configured to determine the unidirectional predicted value of each pixel of the current image block
- the second determining unit 1220 is specifically configured to determine the gradient value of each pixel of the current image block in this direction based on the unidirectional predicted value of each pixel of the current image block;
- the fifth determining unit 1250 is specifically configured to determine the final predicted value of each pixel of the current image block in the direction based on the predicted value of each pixel of the current image block in the direction and the predicted compensation value.
- the first determining unit 1210 is specifically configured to respectively determine the forward and backward predicted values of each pixel of the current image block;
- the second determining unit 1220 is specifically configured to determine the forward and backward gradient values of each pixel of the current image block based on the forward and backward predicted values of each pixel of the current image block;
- the third determining unit, 1230 is specifically configured to respectively determine the forward and backward offset vectors of each pixel of the current image block;
- the fourth determining unit 1240 is specifically configured to determine the forward and backward predictions of each pixel of the current image block based on the forward and backward gradient values and offset vectors of each pixel of the current image block. Compensation value
- the fifth determining unit 1250 is specifically configured to determine the final forward and backward values of each pixel of the current image block based on the forward and backward predicted values and the predicted compensation value of each pixel of the current image block. Predicted value; based on the forward and backward final predicted values of each pixel of the current image block, determine the final predicted value of each pixel of the current image block.
- the third determining unit 1230 is specifically configured to:
- the offset vector of the remaining pixels in the sub-block is determined.
- the third determining unit 1230 is specifically configured to determine the sub-block based on the offset between the designated pixel in the sub-block and the center position of the sub-block, and an affine parameter The offset vector of the specified pixel as described in.
- the third determining unit 1230 is specifically configured to:
- the affine parameter and the fourth affine parameter determine the vertical component of the offset vector of the designated pixel in the sub-block.
- the first affine parameter is the same as the fourth affine parameter, and both are the ratio of the first value to the width of the sub-block.
- the second affine parameter is opposite to the third affine parameter, and the third affine parameter is the ratio of the second value to the width of the sub-block;
- the first value is the difference between the horizontal component of the motion vector of the upper-right control point of the sub-block and the horizontal component of the motion vector of the upper-left control point
- the second value is the difference of the motion vector of the upper-right control point of the sub-block. The difference between the vertical component and the vertical component of the motion vector of the upper left control point.
- the first affine parameter is the ratio of the first value to the width of the sub-block
- the second affine parameter is the third value and the width of the sub-block.
- the ratio of the height of the sub-block, the third affine parameter is the ratio of the second value to the width of the sub-block, and the fourth affine parameter is the ratio of the fourth value to the height of the sub-block;
- the first value is the difference between the horizontal component of the motion vector of the upper-right control point of the sub-block and the horizontal component of the motion vector of the upper-left control point
- the second value is the difference of the motion vector of the upper-right control point of the sub-block.
- the third value is the difference between the horizontal component of the motion vector of the lower left control point and the horizontal component of the motion vector of the upper left control point of the sub-block.
- the fourth value is the difference between the vertical component of the motion vector of the lower left control point and the vertical component of the motion vector of the upper left control point of the sub-block.
- the gradient value of each pixel in the sub-block is determined.
- the pixel values of the adjacent N rows/columns of the entire pixel points around the entire pixel block are respectively used as the filling values of the upper, lower, left, and right edges of the sub-block.
- the second determining unit 1220 is specifically configured to:
- the second determining unit 1220 is specifically configured to:
- the sub-pixel vertical component of the predicted value of the pixel in the sub-block is greater than half a pixel, then fill the upper edge of the sub-block with the nearest neighbor integer pixel above, and fill the lower edge of the sub-block with the next nearest integer pixel below. pixel;
- the sub-pixel vertical component of the predicted value of the pixel in the sub-block is less than half a pixel, then fill the upper edge of the sub-block with the upper next nearest integer pixel, and fill the lower edge of the sub-block with the lower nearest neighbor integer pixel;
- the left edge of the sub-block is filled with the nearest whole pixel on the left, and the right edge of the sub-block is filled with the next Neighboring pixels;
- the sub-pixel horizontal component of the predicted value of the pixel in the sub-block is less than half a pixel, then fill the left edge of the sub-block with the next nearest whole pixel on the left, and fill the right edge of the sub-block with the nearest one on the right Adjacent to the whole pixel.
- the second determining unit 1220 is specifically configured to:
- any pixel in the sub-block determine the horizontal component of the gradient value of the pixel based on the pixel value of the N adjacent pixels on the left side of the pixel and the pixel values of the N adjacent pixels on the right side , And determine the vertical component of the gradient value of the pixel according to the pixel values of the N adjacent pixels above the pixel and the pixel values of the N adjacent pixels below the pixel.
- the second determining unit 1220 is further configured to:
- the predictive compensation adjustment enabling condition includes that the current image block adopts a specified prediction mode, and the motion vector of the sub-block in the specified prediction mode is not exactly the same as the motion vector of each pixel in the sub-block. .
- the predictive compensation adjustment activation condition further includes: the currently predicted component is a luminance component.
- the predictive compensation adjustment activation condition further includes: the currently predicted component is a chrominance component.
- the specified prediction mode includes an affine motion mode.
- An embodiment of the present application proposes another encoding and decoding device, and the device includes:
- the determining unit is used to determine whether the current image block meets the enabling condition of the bidirectional optical flow mode
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not long-term Reference frame.
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is different from the size of the frame to which the current image block belongs, or/and,
- the reference frame is a long-term reference frame.
- the device further includes:
- the prediction unit is used to determine the final predicted value of the current image block if the bidirectional optical flow mode is allowed to be enabled: determine the predicted value of each pixel of the current image block; based on the predicted value of each pixel of the current image block, Determine the gradient value of each pixel of the current image block; determine the offset vector of each pixel of the current image block; determine the current image block based on the gradient value of each pixel of the current image block and the offset vector The predicted compensation value of each pixel; based on the predicted value of each pixel of the current image block and the predicted compensation value, the final predicted value of each pixel of the current image block is determined.
- the prediction unit fills the upper, lower, left, and right edges of the sub-block with N rows/columns of whole pixel points respectively, it is specifically used to: determine the same size whole pixel block closest to the sub-block in the reference frame; The pixel values of N rows/columns of whole pixel points adjacent to the block are respectively used as the filling values of the upper, lower, left, and right edges of the sub-block.
- An embodiment of the present application proposes another encoding and decoding device, and the device includes:
- the determining unit is used to determine whether the current image block satisfies the activation condition of the motion vector adjustment mode
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not long-term Reference frame.
- An embodiment of the present application proposes another encoding and decoding device, and the device includes:
- the determining unit is used to determine whether the current image block satisfies the activation condition of the motion vector adjustment mode
- the processing unit is configured to allow the motion vector adjustment mode to be enabled if it is yes; if not, refuse to enable the motion vector adjustment mode;
- the current image block when the motion vector adjustment mode is enabled for the current image block, the current image block simultaneously satisfies the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the weighted weights of the two reference frames of the current image block are the same;
- the size of the current image block satisfies the limited conditions
- the length and width of the two reference frames of the current image block are respectively the same as the length and width of the frame to which the current image block belongs.
- the current image block does not satisfy any one of the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the control information is to allow the current image block to enable the motion vector adjustment mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the weighted weights of the two reference frames of the current image block are the same;
- control information is allowing the current image block to enable the motion vector adjustment mode, including:
- the sequence level controls the switch of the motion vector adjustment mode as the first value
- the frame-level switch for controlling the motion vector adjustment mode is the first value
- the sequence-level control motion vector adjustment mode switch is the first value, which means that the sequence-level control allows the current image block to enable the motion vector adjustment mode;
- the frame-level control motion vector adjustment mode switch is the first value, which means the frame-level control Allow the current image block to enable the motion vector adjustment mode.
- the size of the current image block satisfies the restriction conditions, including: the width of the current image block is greater than or equal to 8, the height of the current image block is greater than or equal to 8, and the area of the current image block is greater than or equal to 128.
- An embodiment of the present application proposes another encoding and decoding device, and the device includes:
- the determining unit is used to determine whether the current image block satisfies the activation condition of the motion vector adjustment mode
- the processing unit is configured to allow the motion vector adjustment mode to be enabled if it is yes; if not, refuse to enable the motion vector adjustment mode;
- the current image block when the motion vector adjustment mode is enabled for the current image block, the current image block simultaneously satisfies the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the control information is to allow the current image block to enable the motion vector adjustment mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the weighted weights of the two reference frames of the current image block are the same;
- the size of the current image block satisfies the limited conditions
- the length and width of the two reference frames of the current image block are respectively the same as the length and width of the frame to which the current image block belongs;
- the two reference frames of the current image block are not long-term reference frames.
- the current image block does not satisfy any one of the following conditions:
- the current mode of the current image block is the traditional fusion mode or the traditional skip mode
- the control information is to allow the current image block to enable the motion vector adjustment mode
- the current image block adopts the bidirectional prediction mode, one display order of the two reference frames is located before the frame to which the current image block belongs, and the other display order is located after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal;
- the weighted weights of the two reference frames of the current image block are the same;
- the size of the current image block satisfies the limited conditions
- the length and width of the two reference frames of the current image block are respectively the same as the length and width of the frame to which the current image block belongs;
- the two reference frames of the current image block are not long-term reference frames.
- the exemplary control information is allowing the current image block to enable the motion vector adjustment mode, including:
- the sequence level controls the switch of the motion vector adjustment mode as the first value
- the sequence-level control motion vector adjustment mode switch is the first value, which means that the sequence-level control allows the current image block to enable the motion vector adjustment mode;
- the frame-level control motion vector adjustment mode switch is the first value, which means the frame-level control Allow the current image block to enable the motion vector adjustment mode.
- the size of the current image block satisfies the restriction conditions, including: the width of the current image block is greater than or equal to 8, the height of the current image block is greater than or equal to 8, and the area of the current image block is greater than or equal to 128.
- an encoding terminal device which includes a processor and a machine-readable storage medium, the machine-readable storage medium stores machine-executable instructions that can be executed by the processor, and the processor is used to execute The machine can execute instructions to implement the following steps:
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame; or,
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame; or,
- the current image block satisfies the activation condition of the motion vector adjustment mode; if yes, allow the motion vector adjustment mode to be enabled; if not, refuse to enable the motion vector adjustment mode; wherein, when the current image block enables the motion vector adjustment mode, The current image block satisfies the following conditions at the same time: the current mode of the current image block is the traditional fusion mode or the traditional skip mode; the control information is to allow the current image block to enable the motion vector adjustment mode; the current image block adopts the bidirectional prediction mode, two references One display order of frames is before the frame to which the current image block belongs, and the other display order is after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal; the weighted weight of the two reference frames of the current image block Same; the size of the current image block meets the limited conditions; the length and width of the two reference frames of the current image block are the same as the length and width of the frame to which the current image block belongs; or,
- the current image block satisfies the activation condition of the motion vector adjustment mode; if yes, allow the motion vector adjustment mode to be enabled; if not, refuse to enable the motion vector adjustment mode; wherein, when the current image block enables the motion vector adjustment mode, The current image block satisfies the following conditions at the same time: the current mode of the current image block is the traditional fusion mode or the traditional skip mode; the control information is to allow the current image block to enable the motion vector adjustment mode; the current image block adopts the bidirectional prediction mode, two references One display order of frames is before the frame to which the current image block belongs, and the other display order is after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal; the weighted weight of the two reference frames of the current image block Same; the size of the current image block satisfies the defined condition; the length and width of the two reference frames of the current image block are the same as the length and width of the frame to which the current image block belongs;
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame; or,
- the conditions satisfied by the current image block include: the size of the reference frame of the current image block is the same as the size of the frame to which the current image block belongs, and the reference frame is not a long-term reference frame; or,
- the current image block satisfies the activation condition of the motion vector adjustment mode; if yes, allow the motion vector adjustment mode to be enabled; if not, refuse to enable the motion vector adjustment mode; wherein, when the current image block enables the motion vector adjustment mode, The current image block satisfies the following conditions at the same time: the current mode of the current image block is the traditional fusion mode or the traditional skip mode; the control information is to allow the current image block to enable the motion vector adjustment mode; the current image block adopts the bidirectional prediction mode, two references One display order of frames is before the frame to which the current image block belongs, and the other display order is after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal; the weighted weight of the two reference frames of the current image block Same; the size of the current image block meets the limited conditions; the length and width of the two reference frames of the current image block are the same as the length and width of the frame to which the current image block belongs; or,
- the current image block satisfies the activation condition of the motion vector adjustment mode; if yes, allow the motion vector adjustment mode to be enabled; if not, refuse to enable the motion vector adjustment mode; wherein, when the current image block enables the motion vector adjustment mode, The current image block satisfies the following conditions at the same time: the current mode of the current image block is the traditional fusion mode or the traditional skip mode; the control information is to allow the current image block to enable the motion vector adjustment mode; the current image block adopts the bidirectional prediction mode, two references One display order of frames is before the frame to which the current image block belongs, and the other display order is after the frame to which the current image block belongs, and the distance between the two reference frames and the frame to which the current image block belongs is equal; the weighted weight of the two reference frames of the current image block Same; the size of the current image block satisfies the defined condition; the length and width of the two reference frames of the current image block are the same as the length and width of the frame to which the current image block belongs;
- the encoding terminal device may include a processor 1301 and a machine-readable storage medium 1302 storing machine-executable instructions.
- the processor 1301 and the machine-readable storage medium 1302 may communicate via a system bus 1303.
- the processor 1301 can execute the codec method described above.
- the machine-readable storage medium 1302 mentioned herein may be any electronic, magnetic, optical, or other physical storage device, and may contain or store information, such as executable instructions, data, and so on.
- the machine-readable storage medium can be: RAM (Radom Access Memory), volatile memory, non-volatile memory, flash memory, storage drive (such as hard drive), solid state drive, any type of storage disk (Such as CD, DVD, etc.), or similar storage media, or a combination of them.
- FIG. 14 is a schematic diagram of the hardware structure of a decoding end device provided by an embodiment of this application.
- the decoding end device may include a processor 1401 and a machine-readable storage medium 1402 storing machine-executable instructions.
- the processor 1401 and the machine-readable storage medium 1402 may communicate via a system bus 1403. Moreover, by reading and executing the machine executable instructions corresponding to the codec control logic in the machine-readable storage medium 1402, the processor 1401 can execute the codec method described above.
- the machine-readable storage medium 1402 mentioned herein may be any electronic, magnetic, optical, or other physical storage device, and may contain or store information, such as executable instructions, data, and so on.
- the machine-readable storage medium can be: RAM, volatile memory, non-volatile memory, flash memory, storage drives (such as hard drives), solid state drives, any type of storage disks (such as optical discs, DVDs, etc.), or the like Storage media, or a combination of them.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (34)
- 一种编解码方法,其特征在于,所述方法包括:确定当前图像块是否满足双向光流模式的启用条件;若是,则允许启用双向光流模式;若否,则拒绝启用双向光流模式;其中,当所述当前图像块启用双向光流模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸相同,且所述参考帧不是长期参考帧。
- 根据权利要求1所述的方法,其特征在于,当所述当前图像块不启用双向光流模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸不同,或/和,所述参考帧为长期参考帧。
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:若允许启用双向光流模式,则通过如下步骤确定当前图像块的最终预测值:确定当前图像块各像素点的预测值;基于所述当前图像块各像素点的预测值,确定所述当前图像块各像素点的梯度值;确定所述当前图像块各像素点的偏移矢量;基于所述当前图像块各像素点的梯度值以及偏移矢量,确定所述当前图像块各像素点的预测补偿值;基于所述当前图像块各像素点的预测值以及预测补偿值,确定所述当前图像块各像素点的最终预测值。
- 根据权利要求3所述的方法,其特征在于,所述基于所述当前图像块各像素点的预测值,确定所述当前图像块各像素点的梯度值,包括:对于所述当前图像块的任一子块,对该子块的上下左右边缘分别填充N行/列整像素点;N为正整数;基于该子块各像素点的预测值,以及所填充的整像素点的像素值,确定该子块中各像素点的梯度值。
- 根据权利要求4所述的方法,其特征在于,所述对该子块的上下左右边缘分别填充N行/列整像素点,包括:确定参考帧中与该子块最近的等尺寸整像素块;将该整像素块周围相邻的N行/列整像素点的像素值分别作为该子块的上下左右边缘的填充值。
- 根据权利要求5所述的方法,其特征在于,所述对该子块的上下左右边缘分别填充N行/列整像素点,包括:分别对所述整像素块正上方、正下方、正左侧以及正右侧的N行/列的整像素点进行填充。
- 一种编解码方法,其特征在于,所述方法包括:确定当前图像块是否满足运动矢量调整模式的启用条件;若是,则允许启用运动矢量调整模式;若否,则拒绝启用运动矢量调整模式;其中,当所述当前图像块启用运动矢量调整模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸相同,且所述参考帧不是长期参考帧。
- 根据权利要求7所述的方法,其特征在于,当所述当前图像块不启用运动矢量调整模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸不同,或/和,所述参考帧为长期参考帧。
- 一种编解码方法,其特征在于,所述方法包括:确定当前图像块是否满足运动矢量调整模式的启用条件;若是,则允许启用运动矢量调整模式;若否,则拒绝启用运动矢量调整模式;其中,当所述当前图像块启用运动矢量调整模式时,所述当前图像块同时满足以下条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同。
- 根据权利要求9所述的方法,其特征在于,当所述当前图像块不启用运动矢量调整模式时,所述当前图像块不满足以下条件中的任意一个条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同。
- 根据权利要求9或10所述的方法,其特征在于,所述控制信息为允许当前图像块启用运动矢量调整模式,包括:序列级控制运动矢量调整模式的开关为第一取值;帧级控制运动矢量调整模式的开关为第一取值;其中,序列级控制运动矢量调整模式的开关为第一取值,表示序列级控制允许当前图像块启用运动矢量调整模式;帧级控制运动矢量调整模式的开关为第一取值,表示帧级控制允许当前图像块启用运动矢量调整模式。
- 根据权利要求9或10所述的方法,其特征在于,所述当前图像块的尺寸满足限定条件,包括:当前图像块的宽大于等于8,当前图像块的高大于等于8,且当前 图像块的面积大于等于128。
- 一种编解码方法,其特征在于,所述方法包括:确定当前图像块是否满足运动矢量调整模式的启用条件;若是,则允许启用运动矢量调整模式;若否,则拒绝启用运动矢量调整模式;其中,当所述当前图像块启用运动矢量调整模式时,所述当前图像块同时满足以下条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同;所述当前图像块的两个参考帧不是长期参考帧。
- 根据权利要求13所述的方法,其特征在于,当所述当前图像块不启用运动矢量调整模式时,所述当前图像块不满足以下条件中的任意一个条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同;所述当前图像块的两个参考帧不是长期参考帧。
- 根据权利要求13或14所述的方法,其特征在于,所述控制信息为允许当前图像块启用运动矢量调整模式,包括:序列级控制运动矢量调整模式的开关为第一取值;帧级控制运动矢量调整模式的开关为第一取值;其中,序列级控制运动矢量调整模式的开关为第一取值,表示序列级控制允许当前图像块启用运动矢量调整模式;帧级控制运动矢量调整模式的开关为第一取值,表示帧级控制允许当前图像块启用运动矢量调整模式。
- 根据权利要求13或14所述的方法,其特征在于,所述当前图像块的尺寸满足限定条件,包括:当前图像块的宽大于等于8,当前图像块的高大于等于8,且当前图像块的面积大于等于128。
- 一种编解码装置,其特征在于,所述装置包括:确定单元,用于确定当前图像块是否满足双向光流模式的启用条件;处理单元,用于若是,则允许启用双向光流模式;若否,则拒绝启用双向光流模式;其中,当所述当前图像块启用双向光流模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸相同,且所述参考帧不是长期参考帧。
- 根据权利要求17所述的装置,其特征在于,当所述当前图像块不启用双向光流模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸不同,或/和,所述参考帧为长期参考帧。
- 根据权利要求17所述的装置,其特征在于,所述装置还包括:预测单元,用于若允许启用双向光流模式,则通过如下步骤确定当前图像块的最终预测值:确定当前图像块各像素点的预测值;基于所述当前图像块各像素点的预测值,确定所述当前图像块各像素点的梯度值;确定所述当前图像块各像素点的偏移矢量;基于所述当前图像块各像素点的梯度值以及偏移矢量,确定所述当前图像块各像素点的预测补偿值;基于所述当前图像块各像素点的预测值以及预测补偿值,确定所述当前图像块各像素点的最终预测值。
- 根据权利要求19所述的装置,其特征在于,所述预测单元基于所述当前图像块各像素点的预测值,确定所述当前图像块各像素点的梯度值时具体用于:对于所述当前图像块的任一子块,对该子块的上下左右边缘分别填充N行/列整像素点;N为正整数;基于该子块各像素点的预测值,以及所填充的整像素点的像素值,确定该子块中各像素点的梯度值。
- 根据权利要求20所述的装置,其特征在于,所述预测单元对该子块的上下左右边缘分别填充N行/列整像素点时具体用于:确定参考帧中与该子块最近的等尺寸整像素块;将该整像素块周围相邻的N行/列整像素点的像素值分别作为该子块的上下左右边缘的填充值。
- 根据权利要求20所述的装置,其特征在于,所述预测单元对该子块的上下左右边缘分别填充N行/列整像素点时具体用于:分别对所述整像素块正上方、正下方、正左侧以及正右侧的N行/列的整像素点进行填充。
- 一种编解码装置,其特征在于,所述装置包括:确定单元,用于确定当前图像块是否满足运动矢量调整模式的启用条件;处理单元,用于若是,则允许启用运动矢量调整模式;若否,则拒绝启用运动矢量调整模式;其中,当所述当前图像块启用运动矢量调整模式时,所述当前图像块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸相同,且所述参考帧不是长期参考帧。
- 根据权利要求23所述的装置,其特征在于,当所述当前图像块不启用运动矢量调整模式时,所述当前图像 块满足的条件包括:当前图像块的参考帧的尺寸与当前图像块所属帧的尺寸不同,或/和,所述参考帧为长期参考帧。
- 一种编解码装置,其特征在于,所述装置包括:确定单元,用于确定当前图像块是否满足运动矢量调整模式的启用条件;处理单元,用于若是,则允许启用运动矢量调整模式;若否,则拒绝启用运动矢量调整模式;其中,当所述当前图像块启用运动矢量调整模式时,所述当前图像块同时满足以下条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同。
- 根据权利要求25所述的装置,其特征在于,当所述当前图像块不启用运动矢量调整模式时,所述当前图像块不满足以下条件中的任意一个条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同。
- 根据权利要求25或26所述的装置,其特征在于,所述控制信息为允许当前图像块启用运动矢量调整模式,包括:序列级控制运动矢量调整模式的开关为第一取值;帧级控制运动矢量调整模式的开关为第一取值;其中,序列级控制运动矢量调整模式的开关为第一取值,表示序列级控制允许当前图像块启用运动矢量调整模式;帧级控制运动矢量调整模式的开关为第一取值,表示帧级控制允许当前图像块启用运动矢量调整模式。
- 根据权利要求25或26所述的装置,其特征在于,所述当前图像块的尺寸满足限定条件,包括:当前图像块的宽大于等于8,当前图像块的高大于等于8,且当前图像块的面积大于等于128。
- 一种编解码装置,其特征在于,所述装置包括:确定单元,用于确定当前图像块是否满足运动矢量调整模式的启用条件;处理单元,用于若是,则允许启用运动矢量调整模式;若否,则拒绝启用运动矢量调整模式;其中,当所述当前图像块启用运动矢量调整模式时,所述当前图像块同时满足以下条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同;所述当前图像块的两个参考帧不是长期参考帧。
- 根据权利要求29所述的装置,其特征在于,当所述当前图像块不启用运动矢量调整模式时,所述当前图像块不满足以下条件中的任意一个条件:当前图像块的当前模式为传统融合模式或者传统跳过模式;控制信息为允许当前图像块启用运动矢量调整模式;当前图像块采用双向预测模式,两个参考帧一个显示顺序位于当前图像块所属帧之前,另一个显示顺序位于当前图像块所属帧之后,且两个参考帧与当前图像块所属帧的距离相等;当前图像块的两个参考帧的加权权重相同;当前图像块的尺寸满足限定条件;当前图像块的两个参考帧的长宽分别与当前图像块所属帧的长宽相同;所述当前图像块的两个参考帧不是长期参考帧。
- 根据权利要求29或30所述的装置,其特征在于,所述控制信息为允许当前图像块启用运动矢量调整模式,包括:序列级控制运动矢量调整模式的开关为第一取值;帧级控制运动矢量调整模式的开关为第一取值;其中,序列级控制运动矢量调整模式的开关为第一取值,表示序列级控制允许当前图像块启用运动矢量调整模式;帧级控制运动矢量调整模式的开关为第一取值,表示帧级控制允许当前图像块启用运动矢量调整模式。
- 根据权利要求29或30所述的装置,其特征在于,所述当前图像块的尺寸满足限定条件,包括:当前图像块的宽大于等于8,当前图像块的高大于等于8,且当前图像块的面积大于等于128。
- 一种编码端设备,其特征在于,包括处理器和机器可读存储介质,所述机器可读存储介质存储有能够被所 述处理器执行的机器可执行指令,所述处理器用于执行机器可执行指令,以实现权利要求1-16任一所述的编解码方法。
- 一种解码端设备,其特征在于,包括处理器和机器可读存储介质,所述机器可读存储介质存储有能够被所述处理器执行的机器可执行指令,所述处理器用于执行机器可执行指令,以实现权利要求1-16任一所述的编解码方法。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910901352.7A CN112543331B (zh) | 2019-09-23 | 2019-09-23 | 编解码方法、装置及设备 |
CN201910901352.7 | 2019-09-23 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021057579A1 true WO2021057579A1 (zh) | 2021-04-01 |
Family
ID=70220966
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/115646 WO2021057578A1 (zh) | 2019-09-23 | 2020-09-16 | 编解码方法、装置及设备 |
PCT/CN2020/115657 WO2021057579A1 (zh) | 2019-09-23 | 2020-09-16 | 编解码方法、装置及设备 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/115646 WO2021057578A1 (zh) | 2019-09-23 | 2020-09-16 | 编解码方法、装置及设备 |
Country Status (10)
Country | Link |
---|---|
US (1) | US20220295094A1 (zh) |
EP (1) | EP4037318A4 (zh) |
JP (2) | JP2022548515A (zh) |
KR (1) | KR20220038791A (zh) |
CN (22) | CN113596457A (zh) |
AU (1) | AU2020354107B2 (zh) |
BR (1) | BR112022004621A2 (zh) |
CA (1) | CA3149785A1 (zh) |
MX (1) | MX2022003168A (zh) |
WO (2) | WO2021057578A1 (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113596457A (zh) * | 2019-09-23 | 2021-11-02 | 杭州海康威视数字技术股份有限公司 | 编解码方法方法、装置及设备 |
CN112804534B (zh) * | 2019-11-14 | 2022-03-01 | 杭州海康威视数字技术股份有限公司 | 一种编解码方法、装置及其设备 |
CN111901590B (zh) * | 2020-06-29 | 2023-04-18 | 北京大学 | 一种用于帧间预测的细化运动矢量存储方法及装置 |
CN114339223B (zh) * | 2021-02-23 | 2023-03-31 | 杭州海康威视数字技术股份有限公司 | 解码方法、装置、设备及机器可读存储介质 |
CN113808157B (zh) * | 2021-11-18 | 2022-02-22 | 腾讯科技(深圳)有限公司 | 图像处理方法、装置、及计算机设备 |
CN114900691B (zh) * | 2022-07-14 | 2022-10-28 | 浙江大华技术股份有限公司 | 编码方法、编码器及计算机可读存储介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019010156A1 (en) * | 2017-07-03 | 2019-01-10 | Vid Scale, Inc. | MOTION COMPENSATION PREDICTION BASED ON A BIDIRECTIONAL OPTICAL FLOW |
US20190132606A1 (en) * | 2017-11-02 | 2019-05-02 | Mediatek Inc. | Method and apparatus for video coding |
US20190158873A1 (en) * | 2017-11-20 | 2019-05-23 | Google Llc | Motion field-based reference frame rendering for motion compensated prediction in video coding |
CN111031317A (zh) * | 2019-09-23 | 2020-04-17 | 杭州海康威视数字技术股份有限公司 | 编解码方法方法、装置及设备 |
Family Cites Families (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3850709T2 (de) * | 1987-06-09 | 1994-11-24 | Sony Corp | Reduzierung der Bewegungsvektoren in Fernsehbildern. |
EP0973336A2 (en) * | 1998-07-15 | 2000-01-19 | Sony Corporation | Motion vector detecting, picture encoding and recording method and apparatus |
CN100581264C (zh) * | 2006-01-21 | 2010-01-13 | 宇龙计算机通信科技(深圳)有限公司 | 确定视频块帧内或帧间编码的方法 |
CN100471277C (zh) * | 2007-02-13 | 2009-03-18 | 武汉大学 | 一种快速实现双帧参考视频流的时域可伸缩编码的方法 |
JP2009089332A (ja) * | 2007-10-03 | 2009-04-23 | Sony Corp | 動き予測方法及び動き予測装置 |
CA2707706C (en) * | 2007-12-05 | 2017-07-11 | Onlive, Inc. | Tile-based system and method for compressing video |
RU2010127312A (ru) * | 2007-12-05 | 2012-01-10 | Онлайв, Инк. (Us) | Система и способ сжатия интерактивного потокового видео |
CN101527854A (zh) * | 2009-01-20 | 2009-09-09 | 源见科技(苏州)有限公司 | 帧间模式选择方法 |
CN103907346B (zh) * | 2011-10-11 | 2017-05-24 | 联发科技股份有限公司 | 运动矢量预测子及视差矢量导出方法及其装置 |
EP2805506A4 (en) * | 2012-01-20 | 2015-09-16 | Nokia Technologies Oy | VIDEO ENCODING METHOD AND APPARATUS, COMPUTER PROGRAM PRODUCT, CORRESPONDING SYSTEM AND MODULE |
US10200709B2 (en) * | 2012-03-16 | 2019-02-05 | Qualcomm Incorporated | High-level syntax extensions for high efficiency video coding |
US10721487B2 (en) * | 2012-04-17 | 2020-07-21 | Texas Instruments Incorporated | Converting a bi-predicted merging candidate in a merging candidate list for a prediction unit of a picture |
US20130343459A1 (en) * | 2012-06-22 | 2013-12-26 | Nokia Corporation | Method and apparatus for video coding |
JP2015525547A (ja) * | 2012-07-02 | 2015-09-03 | サムスン エレクトロニクス カンパニー リミテッド | ビデオ符号化またはビデオ復号化のための動きベクトル予測方法及びその装置 |
KR101967398B1 (ko) * | 2013-07-09 | 2019-04-09 | 노키아 테크놀로지스 오와이 | 모션 정보를 시그널링하기 위한 구문을 수반하는 비디오 코딩을 위한 방법 및 장치 |
WO2015169230A1 (en) * | 2014-05-06 | 2015-11-12 | Mediatek Inc. | Video processing method for determining position of reference block of resized reference frame and related video processing apparatus |
CN104333756B (zh) * | 2014-11-19 | 2017-10-24 | 西安电子科技大学 | 基于时域相关性的hevc预测模式快速选择方法 |
CN104853191B (zh) * | 2015-05-06 | 2017-09-05 | 宁波大学 | 一种hevc的快速编码方法 |
CN108965869B (zh) * | 2015-08-29 | 2023-09-12 | 华为技术有限公司 | 图像预测的方法及设备 |
CN107925775A (zh) * | 2015-09-02 | 2018-04-17 | 联发科技股份有限公司 | 基于双向预测光流技术的视频编解码的运动补偿方法及装置 |
CN105898296A (zh) * | 2015-12-24 | 2016-08-24 | 乐视云计算有限公司 | 视频编码帧选择方法及装置 |
KR20180129860A (ko) * | 2016-04-25 | 2018-12-05 | 엘지전자 주식회사 | 영상 코딩 시스템에서 인터 예측 방법 및 장치 |
CN106484614B (zh) * | 2016-09-29 | 2019-01-22 | 厦门美图之家科技有限公司 | 一种核对图片处理效果的方法、装置及移动终端 |
US10631002B2 (en) * | 2016-09-30 | 2020-04-21 | Qualcomm Incorporated | Frame rate up-conversion coding mode |
US10341659B2 (en) * | 2016-10-05 | 2019-07-02 | Qualcomm Incorporated | Systems and methods of switching interpolation filters |
EP3560202A4 (en) * | 2016-12-27 | 2020-07-29 | MediaTek Inc. | METHOD AND DEVICE FOR BILATERAL TEMPLATE MV REFINING FOR VIDEO CODING |
US10523964B2 (en) * | 2017-03-13 | 2019-12-31 | Qualcomm Incorporated | Inter prediction refinement based on bi-directional optical flow (BIO) |
US10701390B2 (en) * | 2017-03-14 | 2020-06-30 | Qualcomm Incorporated | Affine motion information derivation |
US10491917B2 (en) * | 2017-03-22 | 2019-11-26 | Qualcomm Incorporated | Decoder-side motion vector derivation |
TW201902223A (zh) * | 2017-03-24 | 2019-01-01 | 聯發科技股份有限公司 | 視頻編碼中重疊分塊運動補償的雙向光流的方法和裝置 |
WO2018193968A1 (ja) * | 2017-04-19 | 2018-10-25 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 符号化装置、復号装置、符号化方法及び復号方法 |
CN117097891A (zh) * | 2017-06-05 | 2023-11-21 | 松下电器(美国)知识产权公司 | 解码装置、编码装置、存储介质、解码方法和编码方法 |
US10291925B2 (en) * | 2017-07-28 | 2019-05-14 | Intel Corporation | Techniques for hardware video encoding |
US11284107B2 (en) * | 2017-08-22 | 2022-03-22 | Google Llc | Co-located reference frame interpolation using optical flow estimation |
WO2019065329A1 (ja) * | 2017-09-27 | 2019-04-04 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 符号化装置、復号装置、符号化方法、及び復号方法 |
WO2019110120A1 (en) * | 2017-12-08 | 2019-06-13 | Huawei Technologies Co., Ltd. | Template matching function for bi-predictive mv refinement |
US11202079B2 (en) * | 2018-02-05 | 2021-12-14 | Tencent America LLC | Method and apparatus for video decoding of an affine model in an intra block copy mode |
US11523109B2 (en) * | 2019-05-02 | 2022-12-06 | Tencent America LLC | Method and apparatus for improvements of affine prof |
CN111050168B (zh) * | 2019-12-27 | 2021-07-13 | 浙江大华技术股份有限公司 | 仿射预测方法及其相关装置 |
-
2019
- 2019-09-23 CN CN202111010773.4A patent/CN113596457A/zh not_active Withdrawn
- 2019-09-23 CN CN202111010795.0A patent/CN113630603A/zh not_active Withdrawn
- 2019-09-23 CN CN202111014068.1A patent/CN113596463A/zh not_active Withdrawn
- 2019-09-23 CN CN202111012147.9A patent/CN113596460A/zh not_active Withdrawn
- 2019-09-23 CN CN202111012295.0A patent/CN113784137A/zh not_active Withdrawn
- 2019-09-23 CN CN202111014082.1A patent/CN113596465A/zh not_active Withdrawn
- 2019-09-23 CN CN201910901352.7A patent/CN112543331B/zh active Active
- 2019-09-23 CN CN202111010798.4A patent/CN113596458A/zh not_active Withdrawn
- 2019-09-23 CN CN202111012188.8A patent/CN113784135A/zh not_active Withdrawn
- 2019-09-23 CN CN202111012039.1A patent/CN113784134B/zh active Active
- 2019-09-23 CN CN202111010730.6A patent/CN113596455A/zh not_active Withdrawn
- 2019-09-23 CN CN202111014013.0A patent/CN113784138A/zh not_active Withdrawn
- 2019-09-23 CN CN202111014014.5A patent/CN113596461A/zh not_active Withdrawn
- 2019-09-23 CN CN202111010752.2A patent/CN113596456B/zh active Active
- 2019-09-23 CN CN202111014067.7A patent/CN113596462B/zh active Active
- 2019-09-23 CN CN202111011707.9A patent/CN113596459A/zh not_active Withdrawn
- 2019-09-23 CN CN202111014069.6A patent/CN113596464A/zh not_active Withdrawn
- 2019-09-23 CN CN202111012014.1A patent/CN113784133B/zh active Active
- 2019-09-23 CN CN201911228558.4A patent/CN111031317B/zh active Active
- 2019-09-23 CN CN202111012193.9A patent/CN113784136B/zh active Active
- 2019-09-23 CN CN202111010742.9A patent/CN113747165A/zh not_active Withdrawn
- 2019-09-23 CN CN202111012146.4A patent/CN113676735A/zh not_active Withdrawn
-
2020
- 2020-09-16 WO PCT/CN2020/115646 patent/WO2021057578A1/zh active Application Filing
- 2020-09-16 AU AU2020354107A patent/AU2020354107B2/en active Active
- 2020-09-16 EP EP20869844.9A patent/EP4037318A4/en active Pending
- 2020-09-16 WO PCT/CN2020/115657 patent/WO2021057579A1/zh active Application Filing
- 2020-09-16 KR KR1020227007235A patent/KR20220038791A/ko not_active Application Discontinuation
- 2020-09-16 MX MX2022003168A patent/MX2022003168A/es unknown
- 2020-09-16 CA CA3149785A patent/CA3149785A1/en active Pending
- 2020-09-16 US US17/753,341 patent/US20220295094A1/en active Pending
- 2020-09-16 BR BR112022004621A patent/BR112022004621A2/pt unknown
- 2020-09-16 JP JP2022513845A patent/JP2022548515A/ja active Pending
-
2024
- 2024-02-28 JP JP2024028586A patent/JP2024063099A/ja active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019010156A1 (en) * | 2017-07-03 | 2019-01-10 | Vid Scale, Inc. | MOTION COMPENSATION PREDICTION BASED ON A BIDIRECTIONAL OPTICAL FLOW |
US20190132606A1 (en) * | 2017-11-02 | 2019-05-02 | Mediatek Inc. | Method and apparatus for video coding |
US20190158873A1 (en) * | 2017-11-20 | 2019-05-23 | Google Llc | Motion field-based reference frame rendering for motion compensated prediction in video coding |
CN111031317A (zh) * | 2019-09-23 | 2020-04-17 | 杭州海康威视数字技术股份有限公司 | 编解码方法方法、装置及设备 |
Non-Patent Citations (4)
Title |
---|
C.-Y. CHEN, T.-D. CHUANG, C.-C. CHEN, C.-Y. LAI, Z.-Y. LIN, C.-W. HSU, Y.-W. HUANG, S.-M. LEI (MEDIATEK): "CE9-related: Simplification of cascading DMVR and BDOF processes", 14. JVET MEETING; 20190319 - 20190327; GENEVA; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), no. JVET-N0097-v1, 13 March 2019 (2019-03-13), Geneva CH, pages 1 - 3, XP030202822 * |
H. LIU (BYTEDANC), L. ZHANG (BYTEDANCE), K. ZHANG (BYTEDANCE), H.-C. CHUANG, J. XU (BYTEDANCE): "Non-CE9: Restriction on motion vector refinement in BDOF", 15. JVET MEETING; 20190703 - 20190712; GOTHENBURG; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), no. JVET-O0573, 5 July 2019 (2019-07-05), Gothenburg SE, pages 1 - 18, XP030219943 * |
J. LUO, Y. HE (INTERDIGITAL): "CE4-2.1: Prediction refinement with optical flow for affine mode", 15. JVET MEETING; 20190703 - 20190712; GOTHENBURG; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), no. JVET-O0070, 26 June 2019 (2019-06-26), Geneva CH, pages 1 - 4, XP030218579 * |
X. LI (TENCENT), G. LI (TENCENT), X. XU (TENCENT), S. LIU (TENCENT): "Non-CE4: On prediction refinement with optical flow", 127. MPEG MEETING; 20190708 - 20190712; GOTHENBURG; (MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11), no. JVET-O0553, 26 June 2019 (2019-06-26), Gothenburg SE, pages 1 - 23, XP030222176 * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021057579A1 (zh) | 编解码方法、装置及设备 | |
JP2023126940A (ja) | コーデック方法及び装置、符号化デバイス及び復号化デバイス | |
BR122020002133B1 (pt) | Aparelho e método de gerar candidatos de preditores de vetor de movimento | |
WO2021093589A1 (zh) | 一种编解码方法、装置及其设备 | |
RU2797358C1 (ru) | Способ, устройство и аппарат для кодирования и декодирования |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20868755 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20868755 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 23/05/2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20868755 Country of ref document: EP Kind code of ref document: A1 |