WO2019219066A1

WO2019219066A1 - Coding and decoding methods and devices

Info

Publication number: WO2019219066A1
Application number: PCT/CN2019/087316
Authority: WO
Inventors: 余全合; 安基程; 郑建铧; 林永兵; 王力强; 牛犇犇; 魏紫威; 何芸
Original assignee: 华为技术有限公司; 清华大学
Priority date: 2018-05-16
Filing date: 2019-05-16
Publication date: 2019-11-21

Abstract

The present application relates to the field of video coding, decoding and compression. Disclosed in the embodiments of the present application are coding and decoding methods and devices, for use in solving the problem in an existing division mode of being unable to satisfy relatively complex texture requirements. The decoding method specifically comprises: parsing a code stream, and determining a basic division mode of a current image block to be decoded and a target derivative mode of a sub-image block in the current image block to be decoded; dividing the current image block to be decoded into N sub-image blocks according to the target derivative mode, N being an integer greater than or equal to 2; deriving at least two adjacent sub-image blocks in the N sub-image blocks into a derivative image block according to the target derivative mode; and decoding the derivative image block.

Description

Codec method and device

This application is required to be submitted to the State Intellectual Property Office on May 16, 2018, the application number is 201810470411.5, the Chinese patent application whose invention name is "video codec method and device", and submitted to the State Intellectual Property Office on June 7, 2018. The priority of the Chinese Patent Application No. 201100581662.0, entitled "A Codec Method and Apparatus", the entire disclosure of which is incorporated herein by reference.

Technical field

The embodiments of the present invention relate to the field of video encoding and decoding and compression, and in particular, to a codec method and apparatus.

Background technique

The basic processing unit of video codec is an image block, which is obtained by dividing a frame/image by the encoding end. The fineness and diversity of image block division affect the texture and contour features of the image to some extent.

At present, the image block division method includes a QuadTree plus Binary Tree (QTBT) division method, a Triple Tree (TT) division method, and an Asymmetric Binary Tree (ABT) division manner. The QTBT division method is: dividing a coding tree unit (CTU) into four sub-image blocks according to a Quad-Tree (QT) division manner, and for a sub-image block that needs further division, The sub-image block is divided according to a QT/Binary Tree (BT) division manner. In this division mode, an image block can only be divided into a plurality of sub-image blocks corresponding to the center. The TT division method is: dividing a certain image block horizontally into three sub-image blocks of central symmetry or non-central symmetry, or vertically dividing an image block into three sub-image blocks of central symmetry or non-central symmetry. The ABT is divided into two sub-image blocks that are horizontally divided into central symmetric or non-central symmetric, or vertically divided into two sub-image blocks that are centrally symmetric or non-centralally symmetric.

Both the TT partitioning method and the ABT partitioning method can be combined with the QTBT partitioning method, but the image blocks obtained by the combined partitioning method cannot satisfy the more complicated texture requirements.

Summary of the invention

The embodiment of the present invention provides a codec method and apparatus, which can solve the problem that the existing division mode cannot meet more complicated texture requirements.

To achieve the above objective, the embodiment of the present application adopts the following technical solutions:

In a first aspect, a decoding method is provided. After obtaining a code stream, the decoding device parses the code stream to determine a basic division mode of the current image block to be decoded and a target derivative mode of the sub image block in the current image block to be decoded; And the decoding device divides the current image block to be decoded into N (N is an integer greater than or equal to 2) sub-image blocks according to the basic division manner, and at least according to the derivative manner indicated by the target derivative mode, at least N of the N sub-image blocks Two adjacent sub-image blocks are derived as a first derived image block; finally, the decoding device decodes the first derived image block. Here, the target derivation mode is one of the candidate derivation mode sets, and the candidate derivation mode set includes M preset derivation modes, and the preset derivation mode is used to indicate a derivation mode, and the derivation mode is used to determine any of the current image blocks to be decoded. Two adjacent sub-image blocks are derived as one derived image block, and M is an integer greater than or equal to zero.

It can be seen that the decoding apparatus determines the specific division manner of the image block to be decoded according to the basic division manner and the target derivative mode. In the decoding method provided by the present application, the combination of the division mode and the derivative mode identifies a plurality of different division results, and the image block division manner is more enriched than the prior art, and can be adapted to various texture characteristics. The video image improves the image quality of the compressed video image, more effectively reduces the code stream load bit overhead of the image block division, and further improves the compression efficiency; can regularly divide the division manner of different image blocks, and reduce the code of the division manner of different image blocks. Stream grammar complexity.

In a second aspect, a decoding method is provided. After obtaining a code stream, a decoding device parses the code stream to determine a basic division manner of an image block to be decoded and a feature parameter (at least a height and a width of a current image block to be decoded). a), in a case that the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold, the decoding device re-analyzes the code stream to determine a target derivative mode of the sub-image block in the current image block to be decoded; The device divides the current image block to be decoded into N (N is an integer greater than or equal to 2) sub-image blocks according to the basic division manner, and at least two of the N sub-image blocks according to the derivative manner indicated by the target derivative mode. The adjacent sub-image block is derived as a first derived image block; finally, the decoding device decodes the first derived image block. Here, the target derivation mode is one of the candidate derivation mode sets, and the candidate derivation mode set includes M preset derivation modes, and the preset derivation mode is used to indicate a derivation mode, and the derivation mode is used to determine any of the current image blocks to be decoded. Two adjacent sub-image blocks are derived as one derived image block, and M is an integer greater than or equal to zero.

The decoding device determines the target derivative mode of the sub-image block in the current image block to be decoded only if the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold. Compared to the first aspect described above, the decoding apparatus of the second aspect has higher decoding efficiency.

Optionally, in a possible implementation manner of the foregoing first aspect or the second aspect, the method that “the decoding device parses the code stream and determines the basic division manner of the current image block to be decoded” is: the decoding device from the code stream Obtaining a division identifier of the current image block to be decoded, and determining a basic division manner according to the division identifier of the current image block to be decoded. Correspondingly, the method for determining, by the decoding device, the target derivative mode of the current image block to be decoded is: the decoding device acquires the derived identifier of the current image block to be decoded from the code stream, and determines according to the derivative identifier of the current image block to be decoded. Target derivative model.

In a third aspect, a decoding apparatus is provided, the decoding apparatus comprising a parsing unit, a dividing unit, a deriving unit, and a decoding unit.

Specifically, the parsing unit is configured to parse the code stream, determine a basic division manner of the current image block to be decoded, and a target derivation mode of the sub image block in the current image block to be decoded, and the target derivation mode is one of the candidate derivation mode sets. The candidate derivative mode set includes M preset derivative modes, and the preset derivative mode is used to indicate a derivative mode, and the derivative mode is used to derivate any two adjacent sub-image blocks in the current image block to be decoded into a derivative image block, M Is an integer greater than or equal to 0. The dividing unit is configured to divide the current image block to be decoded into N sub-image blocks according to the basic division manner determined by the parsing unit, where N is an integer greater than or equal to 2. The above-mentioned deriving unit is configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks obtained by the dividing unit into the first derivative image block according to the derivative manner of the target derivative mode indication determined by the parsing unit. The decoding unit is configured to decode the first derived image block obtained by the derivative unit.

In a fourth aspect, a decoding apparatus is provided, the decoding apparatus comprising a parsing unit, a judging unit, a dividing unit, a deriving unit, and a decoding unit.

Specifically, the parsing unit is configured to parse the code stream to determine a basic division manner of the current image block to be decoded and a feature parameter (at least one of width and height) of the current image block to be decoded. The determining unit is configured to determine whether a feature parameter of the current graphics block to be decoded is less than or equal to a first preset threshold. The parsing unit is further configured to parse the code stream to determine a target derivation mode of the sub-image block in the current image block to be decoded, the target derivation mode is one of the candidate derivation mode sets, and the candidate derivation mode set includes M preset derivatives. The mode, the preset derivative mode is used to indicate a derivative mode, and the derivative mode is used to derivate any two adjacent sub-image blocks in the current image block to be decoded into one derivative image block, where M is an integer greater than or equal to 0. The dividing unit is configured to divide the current image block to be decoded into N sub-image blocks according to the basic division manner determined by the parsing unit, where N is an integer greater than or equal to 2. The above-mentioned deriving unit is configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks obtained by the dividing unit into the first derivative image block according to the derivative manner of the target derivative mode indication determined by the parsing unit. The decoding unit is configured to decode the first derived image block obtained by the derivative unit.

Optionally, in a possible implementation manner of the foregoing third aspect or the fourth aspect, the parsing unit is configured to: obtain, from the code stream, a partition identifier of the current image block to be decoded; and according to the current image block to be decoded. The division identifier determines the basic division manner; the derived identifier of the current image block to be decoded is obtained from the code stream; and the target derivative mode is determined according to the derived identifier of the current image block to be decoded.

In a fifth aspect, a decoding device is provided, the decoding device comprising: one or more processors, a memory, and a communication interface. The memory, communication interface is coupled to one or more processors; the memory is for storing computer program code, the computer program code comprising instructions, when the one or more processors execute the instructions, the decoding device performs the first aspect, the second The decoding method described in terms of aspect or any of its possible implementations.

In a sixth aspect, a video decoder is provided, including a non-volatile storage medium storing a executable program, and a non-volatile storage medium, the non-volatile storage The medium is coupled and the executable program is executed to implement the decoding method as described in the first aspect, the second aspect, or any one of the possible implementations described above.

In a seventh aspect, a decoder is provided, the decoder comprising the decoding apparatus and the reconstruction module in the third aspect or the fourth aspect, wherein the reconstruction module is configured to determine a current according to the predicted pixel obtained by the decoding apparatus The reconstructed pixel value of the image block.

In an eighth aspect, a computer readable storage medium is provided, wherein the computer readable storage medium stores instructions, when the instructions are executed on the decoding device of the third aspect or the fourth aspect, The decoding device performs the decoding method as described in the first aspect, the second aspect, or any of the possible implementations thereof.

A ninth aspect, a computer program product comprising instructions, when the computer program product is run on the decoding device of the above third aspect or the fourth aspect, causing the decoding device to perform the first aspect, A decoding method as described in two aspects or any of its possible implementations.

In the present application, the name of the above decoding device is not limited to the device or the function module itself. In actual implementation, these devices or functional modules may appear under other names. As long as the functions of the respective devices or functional modules are similar to the present application, they are within the scope of the claims and their equivalents.

For a detailed description of the third aspect to the ninth aspect and various implementations thereof, reference may be made to the detailed description in the first aspect, the second aspect, and various implementations thereof; and the third aspect to the ninth aspect and For the beneficial effects of the various implementations, reference may be made to the benefit analysis in the first aspect, the second aspect, or various implementations thereof, or the details are not described herein.

A tenth aspect provides an encoding method. First, an encoding apparatus determines a basic division manner of a current image block to be encoded and a target derivative mode of a sub image block in an image block to be currently encoded, and the target derivative mode is one of the candidate derivative mode sets. The candidate derivative mode set includes M preset derivative modes, and the preset derivative mode is used to indicate a derivative mode, and the derivative mode is used to derive any two adjacent sub-image blocks in the current image block to be coded into a derivative image block. M is an integer greater than or equal to 0; then, the encoding apparatus divides the current image block to be encoded into N (N is an integer greater than or equal to 2) sub-image blocks according to the basic division manner, and derives according to the target derivative mode indication. In a manner, at least two adjacent sub-image blocks of the N sub-image blocks are derived as a first derived image block; in a case where the first derivation image block is not divided, the encoding device encodes the derived image block to obtain a code stream. The code stream includes a division identifier of the current image block to be encoded and a derivative identifier of the current image block to be encoded, and the current image block to be encoded The division identifier is used to indicate a basic division manner, and the derived identifier of the current image block to be encoded is used to indicate the target derivative mode.

The coding device combines the division mode and the derivative mode to identify a plurality of different division results, enriches the division manner of the image block, can adapt to the video images of various texture characteristics, and can improve the image quality of the compressed video image; The code stream load bit overhead of the image block division is effectively reduced, and the compression efficiency is further improved; the division manner of different image blocks can be regularized, and the code stream syntax complexity of the division manner of different image blocks is reduced.

In an eleventh aspect, an encoding method is provided. First, an encoding apparatus determines a basic division manner of an image block to be currently encoded and a feature parameter (at least one of width and height) of an image block to be currently encoded, and is currently in an image to be encoded. If the feature parameter of the block is less than or equal to the first preset threshold, determining a target derivative mode of the sub image block in the current image block to be encoded, the target derivative mode is one of the candidate derivative mode sets, and the candidate derivative mode set includes M preset derivation mode, the preset derivation mode is used to indicate a derivative mode, and the derivation mode is used to derivate any two adjacent sub-image blocks in the current image block to be derived into a derivative image block, where M is greater than or equal to 0. An integer; then, the encoding apparatus divides the current image block to be encoded into N (N is an integer greater than or equal to 2) sub-image blocks according to a basic division manner, and sets N sub-image blocks according to a derivative manner indicated by the target derivative mode. At least two adjacent sub-image blocks in the derivation are derived as a first derivation image block; in the case of not dividing the first derivation image block, The device encodes the derived image block to obtain a code stream, where the code stream includes a division identifier of the current image block to be encoded and a derivative identifier of the current image block to be coded, and the division identifier of the current image block to be coded is used to indicate the basic division mode, and the current The derived identifier of the image block to be encoded is used to indicate the target derived mode.

The encoding device determines the target derivative mode of the sub-image block in the current image block to be encoded only if the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold. Compared with the above tenth aspect, the coding apparatus of the eleventh aspect has higher coding efficiency.

Optionally, in a possible implementation manner of the foregoing tenth or eleventh aspect, the method for determining, by the encoding device, the basic division manner of the current image block to be encoded, is: the encoding device determines the current image to be encoded Corresponding at least two candidate division manners, and calculating a rate distortion cost of each of the at least two candidate division manners, so that the coding apparatus determines the candidate division manner corresponding to the minimum rate distortion cost as the basic division manner . The method for determining, by the encoding device, the target derivative mode of the current image block to be encoded, is: the encoding device calculates a rate distortion cost of each of the preset derivative modes in the candidate derivative mode set, and presets a minimum rate distortion cost corresponding to the preset The derivative mode is determined as the target derivative mode, and the candidate derivative mode set includes M preset derivative modes. When the basic division mode is horizontal N or vertical N, 0≤M≤2 ^N-1 ; when the basic division mode is horizontal N When dividing or dividing N points,

M is an integer,

Is an integer.

According to a twelfth aspect, there is provided an encoding apparatus comprising a determining unit, a dividing unit, a deriving unit, a judging unit, and an encoding unit.

Specifically, the determining unit is configured to determine a basic division manner of the current image block to be encoded and a target derivative mode of the sub image block in the current image block to be encoded, and the target derivative mode is one of the candidate derivative mode sets, and the candidate derivative mode The set includes M preset derivation modes, and the preset derivation mode is used to indicate a derivation mode, and the derivation mode is used to derivate any two adjacent sub-image blocks in the current image block to be derived into one derivation image block, where M is greater than or equal to An integer of 0. The dividing unit is configured to divide the current image block to be encoded into N sub-image blocks according to the basic division manner determined by the determining unit, and N is an integer greater than or equal to 2. The above-mentioned deriving unit is configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks obtained by the dividing unit into the first derivative image block according to the derivative manner of the target derivative mode indication determined by the determining unit. The determining unit is configured to determine whether to divide the first derivative image block obtained by the derivative unit. The coding unit is configured to: when the determining unit determines that the first derived image block obtained by the deriving unit is not divided, encode the first derived image block to obtain a code stream, where the code stream includes the current image block to be encoded. Deriving the identifier and the derived identifier of the current image block to be encoded, the partition identifier of the current image block to be encoded is used to indicate the basic partition mode, and the derived identifier of the current image block to be encoded is used to indicate the target derivative mode.

In a thirteenth aspect, an encoding apparatus is provided, the encoding apparatus comprising a determining unit, a dividing unit, a deriving unit, a judging unit, and an encoding unit.

Specifically, the determining unit is configured to determine a basic division manner of the current image block to be encoded and a feature parameter (at least one of a width and a height) of the current image block to be encoded. The dividing unit is configured to divide the current image block to be encoded into N sub-image blocks according to the basic dividing manner determined by the determining unit, and N is an integer greater than or equal to 2. The determining unit is configured to determine whether a feature parameter of the current image block to be encoded is less than or equal to a first preset threshold. The determining unit is further configured to: when the determining unit determines that the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold, determine a target derivative mode of the sub image block in the current image block to be encoded; target derivative mode As one of the candidate derivative mode sets, the candidate derivative mode set includes M preset derivative modes, and the preset derivative mode is used to indicate a derivative mode, and the derivative mode is used to use any two adjacent sub-images in the current image block to be encoded. The block is derived as a derived image block, and M is an integer greater than or equal to zero. The above-mentioned deriving unit is configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks into the first derivative image block according to the derivative manner of the target derivative mode indication determined by the determining unit. The determining unit is further configured to determine whether to divide the first derivative image block obtained by the derivative unit. The coding unit is configured to: when the determining unit determines that the first derivative image block is not divided, encode the derived image block to obtain a code stream, where the code stream includes the division identifier of the current image block to be encoded and the current image to be encoded. The derived identifier of the block, the partition identifier of the current image block to be encoded is used to indicate the basic partition mode, and the derived identifier of the current image block to be encoded is used to indicate the target derivative mode.

Optionally, in a possible implementation manner of the foregoing twelfth or thirteenth aspect, the determining unit is specifically configured to: determine at least two candidate division manners corresponding to the current image to be encoded; calculate at least two The rate-distortion cost of each of the candidate partitioning modes; the candidate partitioning mode corresponding to the smallest rate-distortion cost is determined as the basic partitioning mode.

Optionally, in another possible implementation manner of the foregoing twelfth or thirteenth aspect, the determining unit is specifically configured to: calculate a rate distortion cost of each preset derivative mode in the candidate derived mode set. Wherein, when the basic division mode is horizontal N or vertical N, 0 ≤ ^M ≤ 2 ^N-1 ; when the basic division is horizontal and vertical N,

It is an integer; the preset derivative mode corresponding to the minimum rate distortion cost is determined as the target derivative mode.

In a fourteenth aspect, an encoding device is provided, the encoding device comprising: one or more processors, a memory, and a communication interface. The memory, communication interface is coupled to one or more processors; the memory is for storing computer program code, the computer program code comprising instructions, when the one or more processors execute the instructions, the encoding device performs the tenth aspect, tenth An encoding method as described on the one hand or any of its possible implementations.

According to a fifteenth aspect, a video encoder is provided, including a nonvolatile storage medium and a central processing unit, the nonvolatile storage medium storing an executable program, the central processing unit and the nonvolatile The storage medium is coupled and the executable program is executed to implement the encoding method as described in the tenth aspect, the eleventh aspect, or any one of the possible implementations.

According to a sixteenth aspect, there is provided an encoder comprising the encoding apparatus of the twelfth aspect or the thirteenth aspect.

A seventeenth aspect, a computer readable storage medium having instructions stored in an operation on the encoding device of the twelfth aspect or the thirteenth aspect, The encoding device is caused to perform the encoding method as described in the tenth aspect, the eleventh aspect, or any one of the possible implementations.

An eighteenth aspect, a computer program product comprising instructions for causing said encoding device to perform a tenth operation as described above when said computer program product is run on said encoding device of said twelfth or thirteenth aspect An encoding method as described in the aspect, the eleventh aspect or any one of its possible implementations.

In the present application, the names of the above-mentioned encoding devices are not limited to the devices or the functional modules themselves. In actual implementation, these devices or functional modules may appear under other names. As long as the functions of the respective devices or functional modules are similar to the present application, they are within the scope of the claims and their equivalents.

For a detailed description of the twelfth aspect to the eighteenth aspect and various implementations thereof, reference may be made to the tenth aspect, the eleventh aspect or the detailed description in various implementations thereof; and, the twelfth aspect For the beneficial effects of the eighteenth aspect and various implementations thereof, reference may be made to the benefit analysis in the tenth aspect, the eleventh aspect, or various implementations thereof, or the details are not described herein.

Optionally, in any one of the foregoing possible implementation manners, the basic division manner is horizontal N, vertical N, or horizontal N.

Here, the horizontal N points, the vertical N points, and the horizontal vertical N points may all be used to represent the average division of the current image block to be decoded/the current image block to be encoded, and may also represent the non-average division of the current image block to be decoded/the current image block to be encoded. .

Optionally, in any one of the foregoing possible implementation manners, in the case that the basic division mode is horizontal N minutes, the width of each of the plurality of sub image blocks is the current The widths of the image blocks to be decoded are equal, and the height of each sub-image block is equal to 1/N of the height of the current image block to be decoded. In the case that the basic division mode is vertical N minutes, the height of each of the plurality of sub-image blocks is equal to the height of the current image block to be decoded, and the width of each sub-image block is equal to the width of the current image block to be decoded. 1/N.

Optionally, in any one of the foregoing possible implementation manners, in the case that the basic division mode is horizontal and vertical N minutes, the width of each of the plurality of sub image blocks is equal to The width of the current image block to be decoded

The height of each sub-image block is equal to the height of the current image block to be decoded.

Is an integer.

Optionally, in any one of the foregoing possible implementation manners, the target derivative mode is one of a candidate derivative mode set, and the candidate derivative mode set includes M preset derivative modes, where when the base is divided into a horizontal mode or vertical N-N division, 0≤M≤2 ^N-1; when the base is divided into a horizontal mode or vertical N-N division,

M is an integer.

The maximum value of M is related to the number of dividing lines used to divide the image block. When the basic division mode is horizontal N or vertical N, the number of division lines dividing the image block is N-1, and correspondingly, the maximum value of M is 2 ^N-1 . When the basic division mode is horizontal and vertical N minutes, the number of division lines dividing the image block is

Correspondingly, the maximum value of M is

Of course, the set of candidate derivative patterns may also refer to including some of the preset derivative patterns.

Optionally, in any one of the foregoing possible implementation manners, in the case that N=4, M=4, and the basic division manner is horizontal quarter, the candidate derivative mode set includes the first a preset derivative mode, a second preset derivative mode, a third preset derivative mode, and a fourth preset derivative mode; wherein the first preset derivative mode is a first sub-image block, a second sub-image block, and a third The sub-image block is derived as a derivative image block; the second preset derivation mode is that the third sub-image block and the fourth sub-image block are derived as one derived image block; and the third preset derivation mode is the second sub-image block and the third The sub-image block is derived as a derivative image block; the fourth preset derivation mode is that the first sub-image block and the second sub-image block are derived as one derived image block, and the third sub-image block and the fourth sub-image block are derived as another The derived image block; the first sub-image block, the second sub-image block, the third sub-image block, and the fourth sub-image block are sequentially arranged in the order from top to bottom in the currently to-be-decoded image block. For the first preset derivative mode, reference may be made to D in FIG. 12, the second preset derivative mode may refer to C in FIG. 12, and the third preset derivative mode may refer to B in FIG. 12, and the fourth preset derivative mode. Reference may be made to F in FIG.

In the case that N=4, M=4, and the basic division mode is vertical quarter, the candidate derivative mode set includes a fifth preset derivative mode, a sixth preset derivative mode, a seventh preset derivative mode, and an eighth preset. a derivative mode; wherein the fifth preset derivative mode is a fifth sub-image block, a sixth sub-image block, and a seventh sub-image block are derived as one derivative image block; the sixth preset derivation mode is a sixth sub-image block, The seventh sub-image block and the eighth image block are derived as one derived image block; the seventh preset derivation mode is that the sixth sub-image block and the seventh sub-image block are derived as one derivative image block; the eighth preset derivation mode is fifth The sub image block, the sixth sub image block, the seventh sub image block, and the eighth sub image block are all kept independent; the fifth sub image block, the sixth sub image block, the seventh sub image block, and the eighth sub image block are currently The image blocks to be decoded are sequentially arranged in order from left to right. For the fifth preset derivative mode, reference may be made to D' in FIG. 13, and the sixth preset derivative mode may refer to E' in FIG. 13, and the seventh preset derivative mode may refer to B' in FIG. 13, the eighth pre- For the derivative mode, refer to G' in FIG.

In the case that N=4, M=4, and the basic division mode is horizontal and vertical quarters, the candidate derivative mode set includes a ninth preset derivative mode, a tenth preset derivative mode, an eleventh preset derivative mode, and a tenth a second preset derivative mode; wherein the ninth preset derivative mode is a ninth sub-image block and a tenth sub-image block are derived as one derived image block; and the tenth preset derivation mode is a ninth sub-image block and a tenth sub-image The block is derived as a derivative image block, and the eleventh sub-image block and the twelfth image block are derived as another derivation image block; the eleventh preset derivation mode is derived from the tenth sub-image block and the eleventh sub-image block. a derived image block, the ninth sub-image block and the twelfth image block are derived as another derived image block; the twelfth preset derivation mode is that the ninth sub-image block and the twelfth sub-image block are derived as one derivative image block The ninth sub-image block, the tenth sub-image block, the eleventh sub-image block, and the twelfth sub-image block are sequentially arranged in a clockwise direction in the currently to-be-decoded image block. For the ninth preset derivative mode, reference may be made to A′′ in FIG. 14 , and the tenth preset derivative mode may refer to E′′ in FIG. 14 , and the eleventh preset derivative mode may refer to F′′ in FIG. 14 , For the two preset derivation modes, reference may be made to B" in FIG.

Optionally, in any one of the foregoing possible implementation manners, the candidate derivative mode set includes a preset number of preset derivative modes (such as four types) in the candidate derivative mode set. Each of the preset derivation modes is represented by a fixed length code. Thus, compared with the prior art, the codec method provided by the present application can identify the division of the image block by using a simple identifier, thereby effectively improving the coding and decoding efficiency. .

Optionally, in any one of the foregoing possible implementation manners, the division identifier of the current image block to be decoded or the division identifier of the current image block to be encoded is represented by a variable length code, and the current image to be decoded The derived identifier of the block or the derived identifier of the current image block to be encoded is represented by a fixed length code.

These and other aspects of the present application will be more apparent from the following description.

DRAWINGS

FIG. 1 is a schematic structural diagram of different division manners provided by an embodiment of the present application;

2 is a schematic structural diagram of dividing a prediction unit by using different division manners according to an embodiment of the present application;

FIG. 3 is a schematic diagram of a division and identification of a QT division manner according to an embodiment of the present disclosure;

4 is a schematic diagram of dividing an image block by a QTBT division mode according to an embodiment of the present application;

FIG. 5 is a first schematic diagram of a method for marking a mark in a QT-ABT/TT division manner according to an embodiment of the present application; FIG.

6 is a second schematic diagram of a method for marking a mark in a QT-ABT/TT division mode according to an embodiment of the present application;

FIG. 7 is a schematic structural diagram of a video codec system according to an embodiment of the present application;

FIG. 8 is a schematic structural diagram of a video encoder according to an embodiment of the present application;

FIG. 9 is a schematic structural diagram of a video decoder according to an embodiment of the present application;

FIG. 10 is a schematic structural diagram of a codec device according to an embodiment of the present application;

FIG. 11 is a schematic flowchart 1 of a decoding method according to an embodiment of the present application;

FIG. 11B is a second schematic flowchart of a decoding method according to an embodiment of the present disclosure;

FIG. 12 is a schematic structural diagram 1 of a derivative mode according to an embodiment of the present disclosure;

FIG. 13 is a schematic structural diagram 2 of a derivative mode according to an embodiment of the present disclosure;

FIG. 14 is a schematic structural diagram 3 of a derivative mode according to an embodiment of the present disclosure;

15A is a schematic flowchart 1 of an encoding method according to an embodiment of the present application;

FIG. 15B is a second schematic flowchart of an encoding method according to an embodiment of the present disclosure;

16 is a schematic flowchart of a method for determining a basic division manner and a target derivative mode according to an embodiment of the present application;

17A is a schematic diagram 1 of a division structure of an image block in the embodiment of the present application;

17B is a second schematic structural diagram of an image block according to an embodiment of the present application;

FIG. 18 is a third schematic structural diagram of an image block according to an embodiment of the present application; FIG.

19 is a schematic diagram 4 of a division structure of an image block in an embodiment of the present application;

20 is a schematic diagram 5 of a division structure of an image block in the embodiment of the present application;

21 is a schematic diagram 6 of a division structure of an image block in the embodiment of the present application;

FIG. 22 is a schematic structural diagram 1 of a decoding apparatus according to an embodiment of the present application;

FIG. 23 is a second schematic structural diagram of a decoding apparatus according to an embodiment of the present application;

24 is a schematic structural diagram 1 of an encoding apparatus according to an embodiment of the present application;

FIG. 25 is a second schematic structural diagram of an encoding apparatus according to an embodiment of the present application.

Detailed ways

The terms "first", "second", "third", and "fourth" and the like in the specification and claims of the present application and the above drawings are used to distinguish different objects, and are not intended to limit the specific order.

In the embodiments of the present application, the words "exemplary" or "such as" are used to mean an example, illustration, or illustration. Any embodiment or design described as "exemplary" or "for example" in the embodiments of the present application should not be construed as preferred or advantageous over other embodiments or designs. Rather, the use of the words "exemplary" or "such as" is intended to present the concepts in a particular manner.

A complete image in a video is often referred to as a "frame," and a video composed of many frames in chronological order is also referred to as a video sequence. The video sequence has a series of redundant information such as spatial redundancy, temporal redundancy, visual redundancy, information entropy redundancy, structural redundancy, knowledge redundancy, and importance redundancy. In order to remove the redundant information in the video sequence and reduce the amount of data representing the video, a video coding (Video Coding) technique is proposed to reduce the storage space and save the transmission bandwidth. Video coding technology is also known as video compression technology.

As far as the current state of technology development is concerned, video coding technologies mainly include intra prediction, inter prediction, transform quantization, entropy coding, and deblocking filtering.

In the internationally accepted range, video compression coding standards, such as: MPEG-2 and MPEG-4 Part 10 Advanced Video Coding (AVC), developed by Motion Picture Experts Group (MPEG), H.263, H.264, and H.265 High Efficiency Video Coding Standard (HEVC) developed by the International Telecommunication Uion-Telecommunication Standardization Sector (ITU-T) There are four main compression coding methods: chroma sampling, predictive coding, transform coding, and quantization coding.

Chromatic Sampling: Make full use of the visual and psychological characteristics of the human eye, and in the underlying data representation, try to minimize the amount of data described by a single element.

Predictive coding: The data information of the previously encoded frame is used to predict the frame currently to be encoded. The encoding end obtains a predicted value by prediction, and there is a certain residual value between the predicted value and the actual value. If the prediction is more suitable, the closer the predicted value is to the actual value, the smaller the residual value, so that the encoding end encodes the residual value to greatly reduce the amount of data. When decoding, the decoder reconstructs the original image by using the residual value plus the predicted value. In the mainstream coding standard, predictive coding is divided into two basic types: intra prediction and inter prediction.

Transform coding: instead of directly encoding the original spatial domain information, the information sample value is converted from the current domain to another artificial domain (usually called the transform domain) according to some form of transformation function, and then The information is compressed and encoded in the distribution characteristics of the transform domain.

Quantization coding: Forcibly adjust the input value with a large dynamic range to a smaller output value.

In the coding algorithm based on the hybrid coding architecture, the above compression coding methods are used in combination, and the basic processing unit of these compression coding is an image block, which is obtained by dividing the frame/image by the encoding end.

For image partitioning, in order to more flexibly represent video content, HEVC defines a CTU, a Coding Unit (CU), a Prediction Unit (PU), and a Transform Unit (TU). The CTU, CU, PU, and TU are all image blocks.

CTU: An image consists of multiple CTUs. One CTU usually corresponds to a square image area. As shown in (a) of Figure 1, the image 10 consists of multiple CTUs (including CTU A, CTU B, CTU C, etc. ).

The coding information corresponding to a certain CTU includes luminance values and/or chrominance values of pixels in a square image region corresponding to the CTU. Furthermore, the coding information corresponding to a certain CTU may further include syntax elements indicating how to divide the CTU into at least one CU, and a method of decoding each CU to obtain a reconstructed image.

An image area corresponding to one CTU may include 64×64, 128×128, or 256×256 pixels. In one example, a CTU of 64x64 pixels contains a rectangular pixel lattice of 64 columns of 64 pixels per column, each pixel containing a luminance component and/or a chrominance component.

It can be understood that the CTU can also correspond to a rectangular image area or an image area of other shapes. The image area corresponding to one CTU may also be an image area in which the number of pixels in the horizontal direction is different from the number of pixels in the vertical direction, for example, including 64 × 128 pixels.

CU: generally corresponds to a rectangular area of A×B, A is the width of the rectangle, and B is the height of the rectangle. The width referred to in the embodiment of the present application refers to the two-dimensional Cartesian coordinate system XoY shown in FIG. 1 . The length in the X-axis direction (horizontal direction), the height refers to the length in the Y-axis direction (vertical direction) in the two-dimensional Cartesian coordinate system XoY shown in FIG. Here, the values of A and B may be the same or different. The values of A and B are usually an integer power of 2, for example: 256, 128, 64, 32, 16, 8, or 4.

PU: is the basic unit of intra prediction and inter prediction.

The division of the intra prediction unit includes a 2N×2N division scheme (shown as A in FIG. 2) and an N×N division scheme (shown as B in FIG. 2). The division of the inter prediction unit includes a 2N×2N division manner (as shown by C in FIG. 2), an N×N division manner (shown as D in FIG. 2), and an N×2N division manner (as shown in FIG. 2). E)), 2N×N division mode (shown as F in Fig. 2), 2N×nD division mode (shown as G in Fig. 2), 2N×nU division mode (such as H in Fig. 2) Show), nL × 2N division mode (shown as I in Figure 2), nR × 2N division mode (shown as J in Figure 2). Where N is any positive integer, n=x×N, and 0≤x≤1.

The 2N×2N division method is not to divide the image block; the N×N division method is to divide the image block into four large sub-image blocks; the N×2N division method is to divide the image block into two equal parts. Sub-image block; 2N×N division mode is to divide the image block into two sub-image blocks of upper and lower; 2N×nD division is to divide the image block into upper and lower sub-image blocks, and the image division line is opposite to the image block. The vertical bisector moves down n, where D indicates that the image dividing line is shifted downward relative to the vertical bisector of the image block; 2N×nU is divided into two sub-image blocks, and the image dividing line is opposite to the image. The vertical bisector of the block is shifted by n, where U represents the image dividing line moving relative to the vertical bisector of the image block; nL×2N is divided into two sub-image blocks, and the image dividing line is opposite to the image dividing line. The vertical bisector of the image block is shifted to the left by n, where L indicates that the image dividing line is shifted to the left by the vertical bisector of the image block; nR×2N is divided into two sub-image blocks, and the image dividing line is relatively The Perpendicular bisector of the right image block n, wherein, R represents an image division line of the image relative to the vertical bisector of the right block.

TU: Processes the residuals of the original image block and the predicted image block.

Generally, the division manners for dividing an image block include: Quad-Tree (QT) division, Binary Tree (BT) division, and TT division.

QT division method: divide one image block into four sub-image blocks. Optionally, the sub-image block may be further divided, or the sub-image block may not be divided.

In an example, the encoding end divides the CTU into four sub-image blocks by using a QT division manner; for each sub-image block, the encoding end may not be divided (in this case, each sub-image block is a CU), and the sub-image may also be The block is divided into four sub-image blocks of the next level. As shown in (b) of FIG. 1, the encoding end divides CTU A into four sub-image blocks a, b, c, and d.

BT division method: divide one image block into two sub-image blocks. Optionally, the sub-image block may be further divided, or the sub-image block may not be divided. The BT division method can include:

(1), horizontal two points

The image block is divided into upper and lower sub-image blocks of the same size. As shown in (c) of FIG. 1, the image block b is divided using a horizontal binary division method, and sub-image blocks e and sub-image blocks f of equal size are generated.

(2), vertical two points (or vertical two points)

The image block is divided into two sub-image blocks of the same size, left and right. As shown in (d) of FIG. 1, the image block d is divided by the vertical binary division method, and sub-image blocks g and sub-image blocks h of equal size are generated.

TT division method: divide one image block into three sub-image blocks. Optionally, the sub-image block may be further divided, or the sub-image block may not be divided. The TT division method can include:

(1), horizontal three points

The image block is divided into upper, middle and lower sub-image blocks.

In one example, the heights of the upper, middle, and lower sub-image blocks are respectively 1/4, 1/2, and 1/4 of the height of the image block. As shown in (e) of FIG. 1, the image block c is divided by a horizontal three-division division method, and the sub-image block j, the sub-image block k, and the sub-image block m are generated, and the height of the sub-image block j is high for the image block c. 1/4, the height of the sub-image block k is 1/2 of the height of the image block c, and the height of the sub-image block m is 1/4 of the height of the image block c.

In another example, the heights of the upper, middle, and lower sub-image blocks are respectively 1/3, 1/3, and 1/3 of the height of the image block, that is, the image block is in accordance with the two-dimensional Cartesian coordinate system shown in FIG. The direction of the X-axis in XoY is equally divided into three sub-image blocks. As shown in (f) of FIG. 1, the image block c is divided by a horizontal three-division division method, and the sub-image block j, the sub-image block k, and the sub-image block m are generated, and the height of the sub-image block j and the height of the sub-image block k are high. The height of the sub-image block m is 1/3 of the height of the image block c.

(2), vertical three points (or vertical three points)

The image block is divided into three sub-image blocks: left, center, and right.

In one example, the widths of the left, middle, and right sub-image blocks are respectively 1/4, 1/2, and 1/4 of the width of the image block. As shown in (g) of FIG. 1, the image block c is divided by a vertical three-division division method, and the sub-image block p, the sub-image block q, and the sub-image block x are generated, and the width of the sub-image block p is the width of the image block c. 1/4, the width of the sub-image block q is 1/2 of the width of the image block c, and the width of the sub-image block x is 1/4 of the width of the image block c.

In another example, the heights of the left, middle, and right sub-image blocks are respectively 1/3, 1/3, and 1/3 of the image block, that is, the image block is in accordance with the two-dimensional Cartesian coordinate system shown in FIG. The direction of the X-axis in XoY is equally divided into three sub-image blocks. As shown in (h) of FIG. 1, the image block c is divided by a vertical three-division division method, and the sub-image block p, the sub-image block q, and the sub-image block x are generated, and the width of the sub-image block p and the width of the sub-image block q are wide. The width of the sub-image block x is 1/3 of the width of the image block c.

As can be seen from the above description, after dividing an image block, the encoding end also needs to determine whether to continue dividing the sub-image block. Whether to divide a certain image block is indicated in the code stream by a division flag bit (such as split_cu_flag) corresponding to the image block. The level of the image block in the QT partition structure (referred to as the QT level) indirectly represents the number of QT partitions.

Exemplarily, the division flag bit of a certain image block is represented by split_cu_flag, split_cu_flag=0 indicates that the node is no longer divided, and split_cu_flag=1 indicates that the node is continuously divided. As shown in FIG. 3, in the case where the value of the split_cu_flag of the image block indicated by the 64×64 CTU is 1, the CTU is divided into four 32×32 sub-image blocks, and the four 32×32 sub-images are divided. The blocks are a sub-picture block A1, a sub-picture block A2, a sub-picture block A3, and a sub-picture block A4, respectively. Each of the four 32×32 sub-image blocks may be further divided or not divided according to its corresponding split_cu_flag. If the value of the split_cu_flag of the sub-picture block A1 is 1, the sub-picture block A1 is further divided, and the sub-picture block A1 is divided into four 16×16 nodes, and the four 16×16 sub-image blocks are respectively sub-image blocks. B1, sub-picture block B2, sub-picture block B3, and sub-picture block B4. And so on, until all sub-image blocks are no longer divided. In FIG. 3, the QT level of the CTU is 0, and the QT levels of the sub-image block A1, the sub-image block A2, the sub-image block A3, and the sub-image block A4 are all 1, the sub-image block B1, the sub-image block B2, and the sub-image block. Both B3 and sub-picture block B4 have a QT level of 2.

At present, the Joint Video Exploration Team (JVET) proposes a QTBT division method in which the QT division method and the BT division method are cascaded. Specifically, the CTU is divided into four sub-image blocks according to the QT division manner, and for the sub-image blocks that need to be further divided, the sub-image block is further divided according to the QT/BT division manner.

As shown in (a) of FIG. 4, each node represents an image block, and a node is a child node of the previous node, and correspondingly, the image block represented by the node is an image represented by the previous node. A sub-image block of the block. In (a) of FIG. 4, the solid line indicates division using the QT division method, and the broken line indicates division using the BT division method, and each of the nodes A to M corresponds to one CU. In the BT structure, 10 represents a vertical dichotomy and 11 represents a horizontal dichotomy. (b) in FIG. 4 shows a CTU divided according to the division manner shown in (a) of FIG.

As can be seen from Fig. 4, after dividing an image block by QTBT division, only the centrally symmetric sub-image block can be obtained. For non-central symmetric partitioning, the QTBT partitioning method cannot be satisfied. In addition, the QTBT partitioning method can only cover a small number of modes. For an image region with a more complex texture or a finer texture, the encoding end needs to perform deeper BT partitioning or QT partitioning.

In order to solve this problem, a QT-ABT/TT partitioning method is formed on the basis of the above QTBT, that is, the CTU is divided into four sub-image blocks according to the QT partitioning manner, for the sub-image block that needs further division. Then, the sub-image block is divided according to the QT/ABT/TT division method.

For the QT-ABT/TT division mode, the coding end can use the identifier bit indication method shown in FIG. 5 or FIG. 6 to write a specific division manner into the code stream.

Each branch in Figure 5 represents the manner in which the corresponding levels are divided. In the first level, 1 indicates the QT division mode, and 0 indicates the BT division method/TT division method/non-division. In the second level, 1 indicates the BT division method/TT division method, and 0 indicates no division. In the third hierarchy, 1 indicates the horizontal division mode, and 0 indicates the vertical division mode. Under the branch with the flag of level 3 being 1, the 1 indicates the BT division mode (ie, the horizontal dichotomy), and 0 indicates the TT division mode (ie, the horizontal three-point). Under the branch of the third level with the

identifier

0, 0 indicates the TT division mode (ie, vertical three points), and 1 indicates the BT division mode (ie, vertical dichotomy). Further, under the branch of the third level, the BT division mode indicates whether the symmetrical division manner is adopted by the three identifiers of 0, 10, and 11.

The qt in Fig. 6 indicates that the QT division mode is adopted, and the no qt indicates that the QT division mode is not employed. Correspondingly, no qt means that the BT division mode may not be divided or further adopted. If the BT division mode is further adopted, it is necessary to identify in the code stream whether a parallel division method or a vertical division manner is adopted. If the horizontal division method is adopted, the further identification is 1/4 division, or 1/8 division, or 1/3 division, or 1/2 division, and so on.

Through the above-mentioned marking method, the decoding end can correctly decode the specific shape of the image block. However, the above division method still uses QTBT as the basic framework. On the basis of which the horizontal or vertical division of TT or ABT is still limited, it cannot meet the more complicated texture requirements.

In addition, the combination of the above multiple division modes leads to a complicated indication method of the division manner of the image block, which affects the efficiency of the codec to some extent.

In this regard, the present application provides a decoding method. The decoding method provided by the present application can be implemented in two ways.

In the first implementation manner, the decoding apparatus determines the basic division manner of the current image block to be decoded and the target derivative mode of the sub image block in the current image block to be decoded by parsing the code stream, and according to the basic division manner, the current Decoding the image block to be divided into N (N is an integer greater than or equal to 2) sub-image blocks, and deriving at least two adjacent sub-image blocks of the N sub-image blocks as the first according to the derivative manner indicated by the target derivative mode The image block is derived such that the decoding device can decode the first derived image block. Here, the target derivative mode is one of a set of candidate derivative modes, and the candidate derivative mode set includes M (M is an integer greater than or equal to 0) preset derivation mode, and each of the preset derivation modes is used for the unique indication one. The derivative method is used to derive at least two adjacent sub-image blocks of the above N sub-image blocks into one derived image block. It is easy to understand that the combination of the division mode and the derivative mode can identify a plurality of different division results. Compared with the prior art, the image block is enriched and can be adapted to video images of various texture characteristics.

In a second implementation manner, the decoding device determines, by parsing the code stream, a basic division manner of the current image block to be decoded and a feature parameter (at least one of height and width) of the current image block to be decoded; If the feature parameter of the to-be-decoded image block is less than or equal to the first preset threshold, the decoding device parses the code stream to determine a target derivative mode of the sub-image block in the current image block to be decoded, so that the decoding device may be configured according to the basic division manner. Dividing the current image block to be decoded into N sub-image blocks, and deriving at least two adjacent sub-image blocks of the N sub-image blocks into the first derivative image block according to the target derivative mode, so that the decoding device can be the first Derived image blocks are decoded. The decoding device determines the target derivative mode of the sub-image block in the current image block to be decoded only if the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold. The decoding efficiency of the decoding device in this mode is higher than that of the first implementation.

The decoding device may be a video decoder (as shown in FIG. 9 below), or may be a device having the functions of implementing the first implementable manner or the second implementable manner.

Correspondingly, the present application also provides an encoding method. This encoding method can be implemented in two ways.

In implementation mode A, after determining the basis division manner of the current image block to be encoded and the target derivation mode of the sub image block in the current image block to be encoded, the encoding device divides the current image block to be encoded into N according to the basic division manner. And sub-image blocks, and deriving at least two adjacent sub-image blocks of the N sub-image blocks as the first derivative image block according to the derivative manner indicated by the target derived mode. Further, in a case where the first derivative image block is not divided, the encoding device encodes the first derivative image block to obtain a code stream including a division identifier of the current image block to be encoded and a derivative identifier of the current image block to be encoded, And transmitting the code stream to the decoding device. Here, the target derivative mode is one of a set of candidate derivative modes, and the candidate derivative mode set includes M (M is an integer greater than or equal to 0) preset derivation mode, and each of the preset derivation modes is used for the unique indication one. The derivative method is used to derive at least two adjacent sub-image blocks of the above N sub-image blocks into one derived image block. The division identifier of the current image block to be encoded is used to indicate a basic division manner, and the derived identifier of the current image block to be encoded is used to indicate the target derivative mode. The coding end device in the present application derives at least two adjacent sub-image blocks of the N sub-image blocks into the first derivative image block by using the target derivation mode on the basis of the basic division mode, thereby realizing the current image block to be encoded. Division. That is to say, the coding end device in the present application identifies a plurality of different division results by combining the division mode and the derivative mode, enriches the division manner of the image block, and can adapt to video images of various texture characteristics.

In implementation B, after determining the basic division manner of the current image block to be encoded and the feature parameter of the current image block to be encoded, the encoding device divides the current image block to be encoded into N sub-image blocks according to the basic division manner, and If the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold, parsing the code stream, determining a target derivative mode of the sub image block in the current image block to be encoded, such that the encoding device indicates according to the target derivative mode The derivative method is to derive at least two adjacent sub-image blocks of the N sub-image blocks into the first derivative image block. Further, in a case where the first derivative image block is not divided, the encoding device encodes the first derivative image block to obtain a code stream including a division identifier of the current image block to be encoded and a derivative identifier of the current image block to be encoded, And transmitting the code stream to the decoding device. The encoding device determines the target derivative mode only when the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold. Compared with the foregoing implementation manner A, the implementation B effectively improves the coding efficiency, and the code The transmission resources occupied by the stream are small. In practical applications, implementation B is easier to implement than implementation A.

The above encoding device may be a video encoder (as shown in FIG. 8 below), or may be a device having the functions of implementing the above-described implementation A or the above-described implementation B.

The codec method provided by the present application is applicable to a video codec system. Fig. 7 shows the structure of a video codec system.

As shown in FIG. 7, the video codec system includes a source device 70 and a destination device 71. The source device 70 generates encoded video data, and the source device 70 may also be referred to as a video encoding device or a video encoding device. The destination device 71 may decode the encoded video data generated by the source device 70, and the destination device 71 also It may be referred to as a video decoding device or a video decoding device. Source device 70 and/or destination device 71 can include at least one processor and a memory coupled to the at least one processor. The memory may include, but is not limited to, a read-only memory (ROM), a random access memory (RAM), and an electrically erasable programmable read-only memory (EEPROM). The flash memory or any other medium that can be used to store the desired program code in the form of an instruction or data structure accessible by the computer is not specifically limited herein.

Source device 70 and destination device 71 may comprise various devices, including desktop computers, mobile computing devices, notebook (eg, laptop) computers, tablet computers, set top boxes, telephone handsets such as so-called "smart" phones, A television, a camera, a display device, a digital media player, a video game console, an on-board computer or the like.

Destination device 71 may receive encoded video data from source device 70 via link 72. Link 72 can include one or more media and/or devices capable of moving encoded video data from source device 70 to destination device 71. In one example, link 72 can include one or more communication media that enable source device 70 to transmit encoded video data directly to destination device 71 in real time. In this example, source device 70 may modulate the encoded video data in accordance with a communication standard (eg, a wireless communication protocol) and may transmit the modulated video data to destination device 71. The one or more communication media may include wireless and/or wired communication media, such as a radio frequency (RF) spectrum, one or more physical transmission lines. The one or more communication media described above may form part of a packet-based network, a portion of a packet-based network (eg, a local area network, a wide area network, or a global network (eg, the Internet)). The one or more communication media described above may include routers, switches, base stations, or other devices that enable communication from source device 70 to destination device 71.

In another example, the encoded video data can be output from the output interface 703 to the storage device 73. Similarly, the encoded video data can be accessed from storage device 73 via input interface 713. The storage device 73 can include a variety of local access data storage media, such as Blu-ray Disc, High Density Digital Video Disc (DVD), Compact Disc Read-Only Memory (CD-ROM), flash. Memory, or other suitable digital storage medium for storing encoded video data.

In another example, storage device 73 may correspond to a file server or another intermediate storage device that stores encoded video data generated by source device 70. In this example, destination device 71 may retrieve its stored video data from storage device 73 via streaming or download. The file server can be any type of server capable of storing encoded video data and transmitting the encoded video data to the destination device 71. For example, the file server may include a World Wide Web (Web) server (for example, for a website), a File Transfer Protocol (FTP) server, a Network Attached Storage (NAS) device, and a local disk. driver.

The destination device 71 can access the encoded video data over any standard data connection (e.g., an internet connection). The instance type of the data connection includes a wireless channel, a wired connection (e.g., a cable modem, etc.), or a combination of both, suitable for accessing the encoded video data stored on the file server. The transmission of the encoded video data from the file server may be streaming, downloading, or a combination of both.

The codec method of the present application is not limited to a wireless application scenario. Illustratively, the codec method of the present application can be applied to video codec supporting multiple multimedia applications such as the following applications: aerial television broadcasting, cable television transmission, satellite television transmission, Streaming video transmission (e.g., via the Internet), encoding of video data stored on a data storage medium, decoding of video data stored on a data storage medium, or other application. In some examples, video codec system 1 may be configured to support one-way or two-way video transmission to support applications such as video streaming, video playback, video broadcasting, and/or video telephony.

It should be noted that the video codec system shown in FIG. 7 is only an example of a video codec system, and is not a limitation of the video codec system in this application. The codec method provided by the present application is also applicable to a scenario where there is no data communication between the encoding device and the decoding device. In other examples, the video data to be encoded or the encoded video data may be retrieved from a local memory, streamed over a network, or the like. The video encoding device may encode the encoded video data and store the encoded video data to a memory, and the video decoding device may also acquire the encoded video data from the memory and decode the encoded video data.

In FIG. 7, source device 70 includes a video source 701, a video encoder 702, and an output interface 703. In some examples, output interface 703 can include a regulator/demodulator (modem) and/or a transmitter. Video source 701 can include a video capture device (eg, a video camera), a video archive containing previously captured video data, a video input interface to receive video data from a video content provider, and/or a computer graphic for generating video data. A combination of systems, or such sources of video data.

Video encoder 702 can encode video data from video source 701. In some examples, source device 70 transmits the encoded video data directly to destination device 71 via output interface 703. In other examples, the encoded video data may also be stored on storage device 73 for later access by destination device 71 for decoding and/or playback.

In the example of FIG. 7, destination device 71 includes display device 711, video decoder 712, and input interface 713. In some examples, input interface 713 includes a receiver and/or a modem. Input interface 713 can receive encoded video data via link 72 and/or from storage device 73. The display device 711 can be integrated with the destination device 71 or can be external to the destination device 71. Generally, the display device 711 displays the decoded video data. Display device 711 can include a variety of display devices, such as liquid crystal displays, plasma displays, organic light emitting diode displays, or other types of display devices.

Alternatively, video encoder 702 and video decoder 712 may each be integrated with an audio encoder and decoder, and may include appropriate multiplexer-demultiplexer units or other hardware and software to handle common Encoding of both audio and video in a data stream or in a separate data stream.

The video encoder 702 and the video decoder 712 may include at least one microprocessor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), and a field programmable gate array (Field Programmable). Gate Array, FPGA), discrete logic, hardware, or any combination thereof. If the codec method provided by the present application is implemented in software, the instructions for the software may be stored in a suitable non-transitory computer readable storage medium, and the instructions may be executed in hardware using at least one processor Implement this application. Any of the foregoing (including hardware, software, a combination of hardware and software, etc.) can be considered at least one processor. Video encoder 702 may be included in an encoder, and video decoder 712 may be included in a decoder, which may be part of a combined encoder/decoder (codec) in a respective device.

The video encoder 702 and the video decoder 712 in this application may be operated according to a video compression standard (for example, HEVC), and may also be operated according to other industry standards, which is not specifically limited herein.

The video encoder 702 is configured to determine a basic division manner of the current image block to be encoded and a target derivative mode of the sub image block in the current image block to be encoded, or determine a basic division manner of the current image block to be encoded and a feature of the current image block to be encoded. a parameter, in a case where the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold, determining a target derivative mode of the sub image block in the current image block to be encoded; and dividing the current image block to be encoded according to the basic division mode At least N (N is an integer greater than or equal to 2) sub-image blocks, and at least two adjacent sub-image blocks of the N sub-image blocks are derived as the first derivative image block according to a derivative manner indicated by the target derived mode. Thus, where video encoder 702 does not need to partition the first derived image block, video encoder 702 encodes the first derived image block, resulting in a code stream and transmitting the code stream to video decoder 712. In the case that the video encoder 702 further needs to divide the first derived image block, the video encoder 702 determines the first derived image block as the current image block to be encoded, and repeatedly performs the above process until the first obtained. Derived image blocks are no longer partitioned.

Optionally, the method that the video encoder 702 determines the basic division manner of the current image block to be encoded may be: the video encoder 702 determines at least two candidate division manners corresponding to the current image to be encoded, and calculates at least two types. The rate-distortion cost of each of the candidate partitioning modes is such that the video encoder 702 determines the candidate partitioning mode corresponding to the smallest rate-distortion cost as the basic partitioning mode.

The method of “the video encoder 702 determining the target derived mode of the sub-image block in the current image block to be encoded” may be: the video encoder 702 calculates a rate distortion cost of each of the preset derivative modes in the candidate derived mode set, and The preset derivative mode corresponding to the minimum rate distortion cost is determined as the target derivative mode.

Referring to FIG. 11A or FIG. 11B described below, the video decoder 712 is configured to: parse the code stream, determine a basic division manner of the current image block to be decoded, and a target derivative mode of the sub image block in the current image block to be decoded (S111); or Parsing the code stream, determining a basic division manner of the current image block to be decoded, and a feature parameter of the current image block to be decoded (S111a), and determining whether the feature parameter of the current image block to be decoded is less than or equal to a first preset threshold (S111b), In a case that the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold, the code stream is parsed to determine a target derivative mode of the sub-image block in the current image block to be decoded (S111c). Subsequently, the video decoder 712 divides the current image block to be decoded into N sub-image blocks according to the basic division manner (S112), and at least two adjacent sub-N of the N sub-image blocks according to the derivative manner indicated by the target derivative mode. The image block is derived as a first derived image block (S113). Further, the video decoder 712 decodes the first derived image block (S114).

FIG. 8 is a schematic block diagram of a video encoder 702 in the embodiment of the present application. Video encoder 702 may perform intra coding and inter coding of video blocks within a video strip. Intra coding relies on spatial prediction to reduce or remove spatial redundancy of video within a given video frame or image. Inter-coding relies on temporal prediction to reduce or remove temporal redundancy of adjacent frames of a video sequence or video within an image. The intra coding mode (I mode) may refer to any of a number of space based compression modes. For example, one-way prediction (P mode) or bidirectional prediction (B mode). The inter coding mode may refer to any of a number of time based compression modes.

In the possible embodiment of FIG. 8, video encoder 702 includes a partitioning unit 81, a transform unit 82, a quantization unit 83, a prediction unit 84, an entropy encoding unit 85, a reference image memory 88, and a summer 802. The prediction unit 84 includes a motion estimation unit 841, a motion compensation unit 842, and an intra prediction unit 843. For video block reconstruction, video encoder 702 also includes inverse quantization unit 86, inverse transform unit 87, and summer 801. In addition, video encoder 702 can also include a deblocking filter (not shown in FIG. 8) for filtering block boundaries to remove blockiness artifacts from the reconstructed video. The deblocking filter will typically filter the output of summer 801 as needed. In addition to the deblocking filter, video encoder 702 may also include additional loop filters (in-loop or post-loop).

As shown in FIG. 8, video encoder 702 receives video data, and segmentation unit 81 segments the data into video blocks. This partitioning may also include partitioning into slices, image blocks, or other larger units, and, for example, video block partitioning based on the quadtree structure of the LCU and CU. Illustratively, video encoder 702 encodes components of a video block within a video strip to be encoded. In general, a stripe may be partitioned into multiple video blocks (and possibly into a collection of video blocks called image blocks).

Prediction unit 84 may select one of a plurality of possible coding modes of the current video block based on the coding quality and the cost calculation result (eg, Rate Distortion cost (RD cost)), such as one of a plurality of intra coding modes. One of a plurality of code decoding modes. Prediction unit 84 may provide the resulting intra-coded or inter-coded block to summer 802 to generate residual block data, and provide the resulting intra-coded or inter-coded decoding to summer 801 to reconstruct the The coded block is thus used as a reference image.

Motion estimation unit 841 and motion compensation unit 842 within prediction unit 84 perform inter-predictive decoding of the current video block relative to one or more of the one or more reference pictures to provide temporal compression. Motion estimation unit 841 can be configured to determine an inter prediction mode for the video stripe based on a predetermined pattern of the video sequence. The predetermined mode specifies the video strips in the sequence as P strips, B strips, or GPB strips. Motion estimation unit 841 and motion compensation unit 842 are highly integrated, but are illustrated separately for conceptual purposes. The motion performed by the motion estimation unit 841 is estimated as a process of generating a motion vector of the estimated video block. For example, the motion vector may indicate the displacement of the PU of the video block within the current video frame or image relative to the predicted block within the reference image.

The prediction block is a block of PUs that are found to closely match the video block to be decoded according to the pixel difference, and the pixel difference may be Sum of Absolute Differences (SAD), Sum of Squared Differences (SSD) or Other difference metrics are determined. In some possible implementations, video encoder 702 can calculate a value of a sub-integer pixel location of a reference image stored in reference image memory 88. For example, video encoder 702 can interpolate values of a quarter pixel position, an eighth pixel position, or other fractional pixel position of a reference image. Accordingly, the motion estimation unit 841 can perform motion search with respect to the full pixel position and the fractional pixel position and output a motion vector having fractional pixel precision.

The motion estimation unit 841 calculates a motion vector of the PU of the video block in the inter-decoded slice by comparing the position of the PU with the position of the prediction block of the reference image. The reference images may be selected from a first reference image list (List 0) or a second reference image list (List 1), each of the lists identifying one or more reference images stored in the reference image memory 88. The motion estimation unit 841 transmits the calculated motion vector to the entropy encoding unit 85 and the motion compensation unit 842.

Motion compensation performed by motion compensation unit 842 may involve extracting or generating a prediction block based on motion vectors determined by motion estimation, possibly performing interpolation to sub-pixel precision. After receiving the motion vector of the PU of the current video block, motion compensation unit 842 may locate the prediction block pointed to by the motion vector in one of the reference image lists. Video encoder 702 forms a residual video block by subtracting the pixel values of the prediction block from the pixel values of the current video block being decoded, thereby forming pixel difference values. The pixel difference values form residual data for the block and may include both luminance and chrominance difference components. Summer 801 represents one or more components that perform this subtraction. Motion compensation unit 842 may also generate syntax elements associated with video blocks and video slices for use by video decoder 712 to decode video blocks of the video strip.

If the PU is located in a B-strip, the PU-containing image may be associated with two reference image lists called "List 0" and "List 1". In some possible implementations, an image containing B strips may be associated with a list combination that is a combination of List 0 and List 1.

Furthermore, if the PU is located in a B-strip, motion estimation unit 841 can perform uni-directional prediction or bi-directional prediction for the PU, wherein, in some possible implementations, bi-directional prediction is based on list of reference pictures of list 0 and list 1, respectively. The prediction performed by the image, in other possible embodiments, the bidirectional prediction is prediction based on the reconstructed future frame and the reconstructed past frame in the display order of the current frame, respectively. When the motion estimation unit 841 performs unidirectional prediction for the PU, the motion estimation unit 841 may search for a reference block for the PU in the reference image of list 0 or list 1. Motion estimation unit 841 may then generate a reference index indicating a reference picture containing the reference block in list 0 or list 1 and a motion vector indicating a spatial displacement between the PU and the reference block. The motion estimation unit 841 may output a reference index, a prediction direction identifier, and a motion vector as motion information of the PU. The predicted direction indicator may indicate that the reference index indicates a reference picture in list 0 or list 1. Motion compensation unit 842 may generate a predicted image block of the PU based on the reference block indicated by the motion information of the PU.

When the motion estimation unit 841 performs bidirectional prediction for the PU, the motion estimation unit 841 may search for a reference block for the PU in the reference image in the list 0 and may also search for another one for the PU in the reference image in the list 1 Reference block. Motion estimation unit 841 may then generate a reference index indicating the reference picture containing the reference block in list 0 and list 1 and a motion vector indicating the spatial displacement between the reference block and the PU. The motion estimation unit 841 may output a reference index of the PU and a motion vector as motion information of the PU. Motion compensation unit 842 may generate a predicted image block of the PU based on the reference block indicated by the motion information of the PU.

In some possible implementations, motion estimation unit 841 does not output a complete set of motion information for the PU to entropy encoding unit 85. Rather, the motion estimation unit 841 can signal the motion information of the PU with reference to motion information of another PU. For example, motion estimation unit 841 can determine that the motion information of the PU is sufficiently similar to the motion information of the neighboring PU. In this embodiment, motion estimation unit 841 can indicate an indication value in a syntax structure associated with the PU that indicates to video decoder 712 that the PU has the same motion information as the neighboring PU or has a slave phase The motion information derived by the neighboring PU. In another embodiment, motion estimation unit 841 can identify candidate predicted motion vectors and Motion Vector Differences (MVDs) associated with neighboring PUs in a syntax structure associated with the PU. The MVD indicates the difference between the motion vector of the PU and the indicated candidate predicted motion vector associated with the neighboring PU. Video decoder 712 may determine the motion vector of the PU using the indicated candidate predicted motion vector and MVD.

As described above, prediction unit 84 may generate a list of candidate predicted motion vectors for each PU of the CU. One or more of the candidate predicted motion vector lists may include one or more original candidate predicted motion vectors and one or more additional candidate predicted motion vectors derived from the original candidate predicted motion vectors.

Intra prediction unit 843 within prediction unit 84 may perform intra-predictive decoding of the current video block relative to one or more neighboring blocks in the same image or slice as the current block to be decoded to provide spatial compression. Accordingly, instead of inter prediction (as described above) performed by motion estimation unit 841 and motion compensation unit 842, intra prediction unit 843 may intra-predict the current block. In particular, intra prediction unit 843 can determine an intra prediction mode to encode the current block. In some possible implementations, intra-prediction unit 843 can encode the current block using various intra-prediction modes, for example, during a separate encoding traversal, and intra-prediction unit 843 can select the appropriate frame to use from the tested mode. Internal prediction mode.

After prediction unit 84 generates a prediction block for the current video block via inter prediction or intra prediction, video encoder 702 forms a residual video block by subtracting the prediction block from the current video block. The residual video data in the residual block may be included in one or more TUs and applied to transform unit 82. The transform unit 82 transforms the residual video data into residual transform coefficients using, for example, a Discrete Cosine Transform (DCT) or a transform of a conceptually similar transform (for example, DST for Discrete Sine Transform (DST)). Transform unit 82 may convert the residual video data from the pixel domain to a transform domain (eg, a frequency domain).

Transform unit 82 may send the resulting transform coefficients to quantization unit 83. The quantization unit 83 quantizes the transform coefficients to further reduce the code rate. The quantization process can reduce the bit depth associated with some or all of the coefficients. The degree of quantization can be modified by adjusting the quantization parameters. In some possible implementations, quantization unit 83 may then perform a scan of the matrix containing the quantized transform coefficients. Alternatively, the entropy encoding unit 85 may perform scanning.

After quantization, entropy encoding unit 85 may entropy encode the quantized transform coefficients. For example, the entropy encoding unit 85 may perform Context-Adaptive Varial-Length Coding (CAVLC), Context-based Adaptive Binary Arithmetic Coding (CABAC), and grammar-based Context Adaptive Binary Arithmetic Decoding (SBAC), Probability Interval Partition Entropy (PIPE) decoding or another entropy encoding method or technique. Entropy encoding unit 85 may also entropy encode the motion vectors and other syntax elements of the current video strip being decoded. After entropy encoding by entropy encoding unit 85, the encoded code stream may be transmitted to video decoder 712 or archive for later transmission or retrieved by video decoder 712.

The inverse quantization unit 86 and the inverse transform unit 87 apply inverse quantization and inverse transform, respectively, to reconstruct the residual block in the pixel domain for later use as a reference block of the reference image. Motion compensation unit 842 may calculate the reference block by adding the residual block to a prediction block of one of the reference pictures within one of the reference picture lists. Motion compensation unit 842 may also apply one or more interpolation filters to the reconstructed residual block to calculate sub-integer pixel values for motion estimation. Summer 801 adds the reconstructed residual block to the motion compensated prediction block generated by motion compensation unit 842 to produce a reference block for storage in reference image memory 88. The reference block may be used by the motion estimation unit 841 and the motion compensation unit 842 as reference blocks to inter-predict blocks in subsequent video frames or images.

It should be understood that other structural changes to video encoder 702 can be used to encode the video stream. For example, for certain image blocks or image frames, video encoder 702 may directly quantize the residual signal without the need for processing by transform unit 82, and accordingly need not be processed by inverse transform unit 87; or, for some image blocks Or the image frame, video encoder 702 does not generate residual data, and accordingly does not need to be processed by transform unit 82, quantization unit 83, inverse quantization unit 86, and inverse transform unit 87; alternatively, video encoder 702 can reconstruct the reconstructed The video block is stored directly as a reference block without filtering processing; alternatively, the quantization unit 83 and the inverse quantization unit 86 in the video encoder 702 can be combined.

FIG. 9 is a schematic structural diagram of a video decoder 712 in the embodiment of the present application. As shown in FIG. 9, video decoder 712 includes an entropy decoding unit 90, a prediction unit 91, an inverse quantization unit 94, an inverse transform unit 93, a summer 95, and a reference image memory 92. The prediction unit 91 includes a motion compensation unit 911 and an intra prediction unit 912. In some possible implementations, video decoder 712 can perform an exemplary reciprocal decoding process with respect to the encoding flow described by video encoder 702 from FIG.

During the decoding process, video decoder 712 receives from video encoder 702 an encoded video code stream representing the video blocks of the encoded video slice and associated syntax elements. Entropy decoding unit 90 of video decoder 712 entropy decodes the code stream to produce quantized coefficients, motion vectors, and other syntax elements. The entropy decoding unit 90 forwards the motion vector and other syntax elements to the prediction unit 91. Video decoder 712 can receive syntax elements at the video stripe level and/or video block level.

When the video stripe is decoded into an intra-coded (I) stripe, the intra-prediction unit 912 of the prediction unit 91 may be based on the signaled intra-prediction mode and data from the previously decoded block of the current frame or image. The predicted data of the video block of the current video stripe is generated.

When the video image is decoded into an inter-frame decoded (eg, B, P, or GPB) stripe, the motion compensation unit 911 of the prediction unit 91 generates the current video based on the motion vector and other syntax elements received from the entropy decoding unit 90. A predictive block of video blocks of an image. The predictive block may be generated from one of the reference images within one of the reference image lists. Video decoder 712 may construct a reference image list (List 0 and List 1) using default construction techniques based on reference images stored in reference image memory 92.

Motion compensation unit 911 determines the prediction information for the video block of the current video slice by parsing the motion vector and other syntax elements, and uses the prediction information to generate a predictive block of the current video block being decoded. For example, motion compensation unit 911 uses some of the received syntax elements to determine a prediction mode (eg, intra prediction or inter prediction) of a video block used to decode a video slice, an inter prediction slice type (eg, B strip, P strip, or GPB strip), construction information for one or more of the reference picture lists of the strip, motion vectors for each inter-coded video block of the strip, and each frame of the strip The inter prediction state of the decoded video block and other information used to decode the video block in the current video slice.

Motion compensation unit 911 can also perform interpolation based on the interpolation filter. Motion compensation unit 911 may use the interpolation filters as used by video encoder 702 during encoding of the video block to calculate interpolated values for sub-integer pixels of the reference block. In this application, motion compensation unit 911 can determine the interpolation filters used by video encoder 702 from the received syntax elements and use an interpolation filter to generate the predictive blocks.

If the PU is encoded using inter prediction, motion compensation unit 911 may generate a candidate predicted motion vector list for the PU. Data identifying the location of the selected candidate predicted motion vector in the candidate motion vector list of the PU may be included in the code stream. After generating the candidate prediction motion vector list for the PU, motion compensation unit 911 may generate a predictive image block for the PU based on one or more reference blocks indicated by the motion information of the PU. The reference block of the PU may be in a different time image than the PU. Motion compensation unit 911 may determine motion information for the PU based on the selected motion information from the candidate motion vector list of the PU.

Inverse quantization unit 94 inverse quantizes (eg, dequantizes) the quantized transform coefficients provided in the code stream and decoded by entropy decoding unit 90. The inverse quantization process can include determining the degree of quantization using the quantization parameters calculated by video encoder 702 for each of the video slices, and likewise determining the degree of inverse quantization of the application. Inverse transform unit 93 applies an inverse transform (eg, an inverse DCT, an inverse integer transform, or a conceptually similar inverse transform process) to the transform coefficients to produce a residual block in the pixel domain.

After the motion compensation unit 911 generates a predictive block of the current video block based on the motion vector and other syntax elements, the video decoder 712 sums the residual block from the inverse transform unit 93 with the corresponding predictive block generated by the motion compensation unit 911. To form a decoded video block. Summer 95 represents one or more components that perform this summation operation. A deblocking filter can also be applied to filter the decoded blocks to remove blockiness artifacts as needed. Other loop filters (either in the decoding loop or after the decoding loop) can also be used to smooth pixel transitions or otherwise improve video quality. The decoded video block in a given frame or image is then stored in a reference image memory 92, which stores a reference image for subsequent motion compensation.

It should be understood that other structural changes to video decoder 712 may be used for the decoding of the code stream. For example, for certain image blocks or image frames, entropy decoding unit 90 of video decoder 712 does not decode the quantized coefficients, and accordingly does not need to be processed by inverse quantization unit 94 and inverse transform unit 93. For example, inverse quantization unit 94 and inverse transform unit 93 in video decoder 712 can be combined.

The video encoder 702 and the video decoder 712 may also be implemented by another implementation, for example, by using a general-purpose digital processor system, such as the codec device 10 shown in FIG. Some of the devices in the video encoder 702 may also be part of the video decoder 712.

The codec device 10 may be applied to the encoding side or may be applied to the decoding side. The codec device 10 includes a processor 1001 and a memory 1002. The processor 1001 is coupled to the memory 1002 (e.g., interconnected by a bus 1004). Optionally, the codec device 10 may further include a transceiver 1003, and the transceiver 1003 is connected to the processor 1001 and the memory 1002 for receiving/transmitting data.

The memory 1002 can be a random access memory (RAM), a read-only memory (ROM), an Erasable Programmable Read Only Memory (EPROM), or a portable read only memory. (Compact Disc Read-Only Memory, CD-ROM). The memory 1002 is used to store related program codes and video data.

The processor 1001 may be one or more central processing units (CPUs). In the case that the processor 1001 is a CPU, the CPU may be a single core CPU or a multi-core CPU.

The processor 1001 is configured to read program code stored in the memory 1002, and perform operations of any one of the embodiments corresponding to FIG. 11A, FIG. 11B, FIG. 15A, or FIG. 15B and various possible embodiments thereof.

Hereinafter, the codec method provided by the present application will be described in detail in conjunction with the video codec system shown in FIG. 7 above, the video encoder 702 shown in FIG. 8, and the video decoder 712 shown in FIG.

FIG. 11A is a schematic flowchart of a decoding method according to an embodiment of the present application, and the decoding method can be applied to the video codec system shown in FIG. 7. The flow shown in FIG. 11A is explained by taking the video decoder 712 shown in FIG. 9 as an execution subject as an example.

As shown in FIG. 11A, the decoding method provided by the embodiment of the present application includes:

S110. The video decoder obtains a code stream that includes image data.

Optionally, the code stream obtained by the video decoder including the image data includes a Sequence Parameter Set (SPS), a Picture Parameter Set (PPS), and a slice header or a slice header ( Slice segment header) and other syntax elements.

S111. The video decoder decodes the obtained code stream, determines a basic division manner of the current image block to be decoded, and a target derivative mode of the sub-image block in the current image block to be decoded.

The syntax element of the code stream includes a division identifier of the current image block to be decoded and a derivative identifier of the current graphics block to be decoded, and the division identifier of the current image block to be decoded is used to indicate a basic division manner, and the derivative identifier of the current image block to be decoded is used. Indicates the target derivative mode.

Optionally, the division identifier of the current image block to be decoded is represented by a variable length code, for example, a binary truncation code or an exponential Golomb code. The derived identifier of the current image block to be decoded is represented by a fixed length code.

The division identifier of the current image block to be decoded and the derivative identifier of the current image block to be decoded may be an independent flag. For example, the division identifier of the current image block to be decoded is represented by SplitFlag, and the derivative identifier of the current image block to be decoded adopts DeriveFlag. Said. Certainly, the division identifier of the current image block to be decoded and the derivative identifier of the current image block to be decoded may also be a part of the same identifier, for example, if the derivative identifier of the current image block to be decoded is represented by a fixed length code of length x. The code stream includes flag1 corresponding to the current image block to be decoded, and the last x bit of the flag1 is a derivative identifier of the current image block to be decoded, and the remaining bits of the flag1 except the last x bit are the division identifier of the current image block to be decoded. .

Optionally, the video decoder parses the code stream, and the method for determining the basic division mode and the target derivation mode may be: the video decoder parses the code stream, and the syntax element included in the code stream is included to indicate how to a division identifier (such as SplitFlag) of the currently to-be-decoded image block into which the decoded image block is divided into N sub-image blocks and a current indication for how to derivate at least two adjacent sub-image blocks of the N sub-image blocks into one derived image block Derived identification of the image block to be decoded (such as DeriveFlag), and determining a basic division manner according to the division identifier of the current image block to be decoded, and determining a target derivative mode according to the derived identifier of the current image block to be decoded.

Exemplarily, if the value of a SplitFlag is 011, it indicates that the image block corresponding to the SplitFlag is divided into horizontal quarters. If the value of a certain DeriveFlag is 01, it indicates that at least two adjacent sub-image blocks in the image block corresponding to the DeriveFlag are derived into a derivative image block by using the first preset derivation mode in the candidate derivation mode.

Optionally, the video decoder parses the code stream, and the method for determining the basic division mode and the target derivation mode may further be: the decoding end device parses the code stream, and obtains, by the decoding end, the syntax element included in the code stream, Decoding a first identifier of the image block (such as NSFlag); if the value of the first identifier is a first value (for example, 1), acquiring a second identifier (such as QTSplitFlag) included in the syntax element corresponding to the current image block to be decoded; If the value of the second identifier is a second value (for example, 01), it indicates that the current image block to be decoded is divided into vertical quarters; if the value of the second identifier is a third value (for example, 00), the current to be decoded is The image block is divided into horizontal quarters. In addition, the decoding end device further acquires, from the code stream, a derivative identifier of the current to-be-decoded image block included in the syntax element for indicating how to derivate at least two adjacent sub-image blocks of the N sub-image blocks into one derived image block. (For example, DeriveFlag), and determining a basic division manner according to the division identifier of the current image block to be decoded, and determining a target derivative mode according to the derived identifier of the current image block to be decoded.

Specifically, the basic division manner in the embodiment of the present application is a horizontal N score, a vertical N score, or a horizontal vertical N score.

Here, the horizontal N point refers to horizontally dividing the current image block to be decoded into N sub-image blocks. The height of each of the N sub-image blocks may be 1/N of the height of the current image block to be decoded, that is, the current image block to be decoded is horizontally divided, and is equally divided into N sub-image blocks. As shown in (c) of FIG. 1, the height of each sub-image block is 1/2 of the height of the image block b. In addition, the heights of any two sub-image blocks of the N sub-image blocks may also be unequal, that is, the current image block to be decoded is horizontally divided, and is unevenly divided into N sub-image blocks. As shown in (e) of FIG. 1, the height of the sub-image block j is 1/4 of the height of the image block c, and the height of the sub-image block k is 1/2 of the height of the image block c, and the sub-image block m The height is 1/4 of the height of the image block c.

Similarly, the vertical N point refers to vertically dividing the current image block to be decoded into N sub-image blocks. The width of each of the N sub-image blocks may be 1/N of the width of the current image block to be decoded, that is, the current image block to be decoded is vertically divided, and is equally divided into N sub-image blocks. As shown in (d) of FIG. 1, the width of each sub-image block is 1/2 of the width of the image block d. In addition, the widths of any two sub-image blocks of the N sub-image blocks may not be equal, that is, the current image block to be decoded is vertically divided, and is unevenly divided into N sub-image blocks. As shown in (g) of FIG. 1, the width of the sub-image block p is 1/4 of the width of the image block c, and the width of the sub-image block q is 1/2 of the width of the image block c, and the sub-image block x is The width is 1/4 of the width of the image block c.

Similarly, horizontal and vertical N points mean that the current image block to be decoded is horizontally and vertically divided into N sub-image blocks. The width of each of the N sub-image blocks may be the width of the current image block to be decoded.

And the height of each sub-image block can be the height of the current image block to be decoded.

That is, the current image block to be decoded is horizontally and vertically divided, and is equally divided into N sub-image blocks. As shown in (b) of Figure 1, the width of each sub-image block is the width of CTU A.

(ie 1/2), and the height of each sub-image block is the height of CTU A

(ie 1/2). In addition, the widths of any two sub-image blocks in the N sub-image blocks may not be equal, or the heights of any two sub-image blocks are not equal, that is, the current image block to be decoded is vertically divided, and is unevenly divided into N sub-image blocks.

A typical representation of the horizontal and vertical N points in the embodiment of the present application is a quadtree partition.

For convenience of description, the embodiment of the present application refers to the above-mentioned “leveling the currently to-be-decoded image block horizontally and dividing into N sub-image blocks”, and the vertical N-score refers to “discriminating the current image block to be decoded vertically, and The average is divided into N sub-image blocks, and the horizontal and vertical N points are described as an example of “dividing the current image block to be decoded horizontally and dividing into N sub-image blocks equally”.

In practical applications, for a video encoder, the image block may be divided by a horizontal N-segment or a vertical N-segmentation when a certain image block satisfies a preset condition. For example, the width of an image block is M1 and the height is H1. If M1×H1 is greater than X1×Y1, the video encoder divides the image block by horizontal N or vertical N division; or, if M1×H1 is smaller than X2×Y2, the video encoder divides the image block by horizontal N or vertical N division; or, if M1×H1 is greater than X1×Y1 and smaller than X2×Y2, the video encoder adopts horizontal N or vertical The image block is divided by the N division method. Among them, X1, Y1, X2 and Y2 are all positive integers.

Exemplarily, if M1=4×H1, the video encoder may divide the image block by vertical quadrant division; if H1=4×M, the video encoder may divide the image block by horizontal quadrant division.

Correspondingly, if the current image block to be decoded has a width M1 and a height H1, if M1×H1 is greater than X1×Y1, or M1×H1 is smaller than X2×Y2, or M1×H1 is greater than X1×Y1 and smaller than X2. ×Y2, the basic division method is horizontal N or vertical N.

The target derivative mode in the embodiment of the present application is one of the candidate derivative mode sets, where the candidate derivative mode set includes M (M is an integer) preset derivative mode, and the basic division mode is horizontal N or vertical N when, 0≤M≤2 ^N-1; when the base is divided into horizontal and vertical N time division manner,

Each of the preset derivative modes uniquely indicates a derivative mode, and the derivative method refers to deriving at least two adjacent sub-image blocks of the N sub-image blocks into one derivative image block. It should be noted that there is a preset derivative mode for indicating that no derivation is performed in the preset derivation mode in M.

Derivation in the embodiments of the present application can be understood as fusion/merge. Specifically, after the image block to be decoded is divided into N sub-image blocks, at least two adjacent sub-image blocks are combined into one derived image block.

In one example, as shown in FIG. 12, the image block to be decoded is divided into a sub-image block 1 (which may correspond to the first sub-image block described herein) and a sub-image block 2 in a horizontal quad division manner (corresponding to the description herein) Second sub-image block), sub-image block 3 (which may correspond to the third sub-image block described herein), and sub-image block 4 (which may correspond to the fourth sub-image block described herein), N=4, dividing the to-be-decoded The dividing line of the image block has three (N-1=3) strips: a first dividing line, a second dividing line, and a third dividing line. Correspondingly, the candidate derivative mode corresponding to the image block to be decoded includes at most 24-1=8. Pre-defined derivative mode. A, B, C, D, E, F, G, and H in Fig. 12 show these eight preset derivation modes. Taking the preset derivation mode shown by A in FIG. 12 as an example, the first dividing line is removed, and the sub-image block 1 and the sub-image block 2 are derived (or referred to as fusion/merging) into one derived image block.

In another example, as shown in FIG. 13, the image block to be decoded is divided into a sub-image block 1 (which may correspond to the fifth sub-image block described herein) and a sub-image block 2 in a vertical division manner (corresponding to this document) a sixth sub-image block), a sub-image block 3 (corresponding to the seventh sub-image block described herein), and a sub-image block 4 (corresponding to the eighth sub-image block described herein), N=4, dividing the The dividing line of the decoded image block has three (N-1=3) strips: a first dividing line, a second dividing line, and a third dividing line. Correspondingly, the candidate derivative mode corresponding to the image block to be decoded includes at most 24-1= 8 preset derivative modes. A', B', C', D', E', F', G', H' in Fig. 13 show these eight preset derivation patterns. Taking the preset derivation mode shown by F′ in FIG. 13 as an example, the first dividing line is removed, and the sub-image block 1 and the sub-image block 2 are derived as one derived image block, and the third dividing line is removed, and the sub-image block 3 is removed. And sub-image block 4 is derived as another derived image block.

In another example, as shown in FIG. 14, the image block to be decoded is divided into a sub-image block 1 (which may correspond to the ninth sub-image block described herein) and a sub-image block 2 (corresponding to each other). The tenth sub-image block described herein), sub-image block 3 (which may correspond to the eleventh sub-image block described herein), and sub-image block 4 (which may correspond to the twelfth sub-image block described herein), N=4, Dividing the dividing line of the image block to be decoded into four

a first dividing line, a second dividing line, a third dividing line, and a fourth dividing line. Correspondingly, the candidate derivative mode corresponding to the image block to be decoded includes at most 24=16 preset derivative modes. A", B", C", D", E", F", G", H" in Fig. 14 show 8 of the preset derivation patterns. Taking the preset derivation mode shown by G" in FIG. 14 as an example, the third dividing line and the fourth dividing line are removed, and the sub-image block 1, the sub-image block 3, and the sub-image block 4 are derived as one derived image block.

As can be seen from the above-mentioned FIG. 12, FIG. 13 and FIG. 14, the candidate derivative mode set corresponding to different partitioning modes includes a preset diffraction mode having the same derivative result. For example, if the same current image block to be decoded is derived, the derived image block shown by F in FIG. 12 is the same as the derivative image block shown by E′′ in FIG. 14 above, and F′ in FIG. 13 is shown. The derived image block is the same as the derived image block shown by F" in Fig. 14 described above, and the derived image block shown by H in Fig. 12 is the same as the derivative image block shown by H" in Fig. 14 described above. Therefore, In the candidate derivative mode set corresponding to different partitioning modes, the preset derivative mode with the same derivative result may be deleted by one, and only reserved in the candidate derivative mode set corresponding to a certain dividing mode.

In one example, in combination with FIG. 12 above, in the case where N=4 and the basic division mode is horizontal quarter, the candidate derivative mode set may include only B in FIG. 12 (corresponding to the third preset derivative described herein). Mode), C (corresponding to the second preset derivative mode described herein), D (corresponding to the first preset derivative mode described herein), and F (corresponding to the fourth preset derivative mode described herein) Preset derivative mode.

In another example, in combination with FIG. 13 above, in the case where N=4 and the basic division manner is vertical quarter, the candidate derivative mode set may include only B′ in FIG. 13 (corresponding to the seventh pre-described herein) Derivative mode), D' (corresponding to the fifth preset derivative mode described herein), E' (corresponding to the sixth preset derivative mode described herein), and G' (corresponding to the eighth preset derivative described herein) Mode) shows the preset derivative mode.

In another example, in combination with FIG. 14 above, in the case where N=4 and the basic division manner is horizontal and vertical quarters, the candidate derivative pattern set may include only A′′ in FIG. 14 (corresponding to the ninth described herein) Preset derivative mode), B" (corresponding to the twelfth preset derivative mode described herein), E" (corresponding to the tenth preset derivative mode described herein), and F" (corresponding to the eleventh described herein) The preset derivative mode shown by the preset derivative mode).

Each of the preset derivative modes in the candidate derivative mode set uses a fixed length code identifier, for example, 2 bits, so that the decoding method provided by the present application can identify the image block by using a simple identifier compared to the prior art. The division effectively improves the decoding efficiency.

As can be seen in conjunction with FIG. 12, FIG. 13, and FIG. 14 above, the feature parameter of the derived image block is greater than the feature parameter of each of the N sub-image blocks, the feature parameter being at least one of height and width.

S112. The video decoder divides the current image block to be decoded into N sub-image blocks according to a basic division manner.

Specifically, if the basic division mode is horizontal four points, the video decoder divides the current image block to be decoded into four sub-image blocks by using the division method shown in FIG. 12 described above. If the basic division mode is a vertical quarter, the video decoder divides the current image block to be decoded into 4 sub-image blocks by using the division method as shown in FIG. 13 described above. If the basic division mode is horizontal and vertical quarters, the video decoder divides the current image block to be decoded into four sub-image blocks by using the division method as shown in FIG. 14 described above.

S113. The video decoder derives at least two adjacent sub-image blocks of the N sub-image blocks into the first derivative image block according to a derivative manner of the target derived mode indication.

S114. The video decoder decodes the first derivative image block.

Specifically, in a case where the first derivative image block is no longer divided, the video decoder performs decoding and reconstruction on the first derivative image block to obtain a reconstructed image block. In the case that the first derivative image block is further divided, the video decoder regards the first derivative image block as the current image block to be decoded, and re-executes the above S111-S114 until the re-obtained first derivative image block is no longer used. Until the division.

The video decoder in the present application determines a specific division manner of the current image block to be decoded according to the basic division manner and the target derivative mode. The combination of the division mode and the derivative mode identifies a plurality of different division results. Compared with the prior art, the image block is enriched and can be adapted to video images of various texture characteristics.

As can be seen from the above description, the video decoder in the present application may also determine the target derivative mode only if the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold.

Specifically, in combination with FIG. 11A described above, as shown in FIG. 11B, the above S111 may be replaced with S111a to S111c.

S111a: The video decoder decodes the obtained code stream, determines a basic division manner of the current image block to be decoded, and a feature parameter of the current image block to be decoded.

For the method for the video decoder to determine the basic division mode of the image block to be decoded, refer to the description of S111 above, and details are not described herein again.

The feature parameter of the current image block to be decoded refers to the height of the current image block to be decoded and the width of the current image block to be decoded. The video decoder parses the code stream, and directly determines the feature parameters of the current image block to be decoded.

S111b. The video decoder determines whether a feature parameter of the current image block to be decoded is less than or equal to a first preset threshold.

If the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold, the video decoder performs S111c.

If the feature parameter of the current image block to be decoded is greater than the first preset threshold, the video decoder regards the sub-image block of the current image block to be decoded as the current image block to be decoded, and re-executes S111a.

S111c, the video decoder parses the code stream, and determines a target derivative mode.

For the method for determining the target derivation mode by the video decoder, reference may be made to the description of S111 above, and details are not described herein again.

In the process shown in FIG. 11B, after performing S111a, the video decoder may perform S112 first, then S111b, or S111b may be performed first, then S112 may be performed, and S112 and S111b may be simultaneously performed. Specifically limited.

It is easy to understand that the video decoder determines the target derivative mode only if the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold, and therefore, the feature parameter is greater than the first preset. For the image block of the threshold, the code stream may only include the division identifier of the image block, but does not include the derivative identifier of the image block, which effectively reduces the transmission resource occupied by the code stream. Of course, for an image block whose feature parameter is greater than the first preset threshold, the code stream may also include a derivative identifier of the image block, and the derived identifier of the image block is used to indicate that the sub-image block in the image block is not derivatized.

The video decoder determines the target derivative mode only when the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold, and effectively improves the decoding rate.

In addition, the embodiment of the present application further provides an encoding method, which can be applied to the video codec system shown in FIG. 7.

FIG. 15A is a schematic flowchart of a coding method in an embodiment of the present application. The flow shown in FIG. 15A is exemplified by the video encoder 702 shown in FIG. 8 as an execution subject.

As shown in FIG. 15A, the coding method provided by the embodiment of the present application includes:

S150. The video encoder determines an image block to be currently encoded.

The image block to be encoded may be a CTU or a CU, which is not specifically limited in this embodiment of the present application.

As shown in FIG. 17A or FIG. 17B, the current image block to be encoded may be an image block 170, a sub-image block in the image block 173, an image block 174, or a sub-block in the image block 171. The image block is not specifically limited in this embodiment of the present application.

S151. The video encoder determines a basic division manner of the current image block to be encoded and a target derivative mode of the sub image block in the current image block to be encoded.

In a first implementation manner, the video encoder determines at least two candidate partitioning modes (horizontal N, vertical N, horizontal and vertical N, and no partition) corresponding to the current image to be encoded, and calculates at least two types. The rate-distortion cost of each of the candidate partitioning modes, so that the video encoder determines the candidate partitioning mode corresponding to the smallest rate-distortion cost as the basic partitioning mode. Subsequently, the video encoder calculates a rate distortion cost of each of the preset derivative modes in the candidate derivative mode set corresponding to the basic partition mode, and determines a preset derivative mode corresponding to the minimum rate distortion cost as the target derivative mode.

For the candidate derivation mode, reference may be made to the description of the candidate derivation mode in the embodiment shown in FIG. 11A or FIG. 11B above, and details are not described herein again.

The basic division mode of the current image block to be encoded and the target derivative mode of the sub image block in the current image block to be encoded may be referred to the description in the embodiment described above with reference to FIG. 11A. The difference is that the above-mentioned FIG. 11A is described for the current image block to be decoded, and the embodiment is described for the current image block to be encoded.

The first achievable manner refers specifically to the description of FIG. 16 below, and details are not described herein again.

In a second implementation manner, the video encoder determines at least two candidate partitioning modes (horizontal N, vertical N, horizontal, and non-divided) corresponding to the current image block to be encoded, and subsequently, for each A candidate division mode, where the video encoder derivates at least two adjacent sub-image blocks by using each of the preset derivation modes, so that the video encoder can be calculated in each candidate division mode. The rate distortion cost of each of the preset derivation modes is determined, and the division mode corresponding to the minimum rate distortion cost is determined as the basic division mode, and the derivative mode corresponding to the minimum rate distortion cost is determined as the target derivation mode.

S152. The video encoder divides the current image block to be encoded into N sub-image blocks according to a basic division manner.

S153. The video encoder derives at least two adjacent sub-image blocks of the N sub-image blocks into the first derivative image block according to a derivative manner of the target derived mode indication.

S154. If the video encoder does not divide the first derived image block, the video encoder encodes the first derived image block to obtain a code stream.

Optionally, the syntax element of the code stream includes a division identifier of the current image block to be encoded and a derivative identifier of the current image block to be encoded, and the division identifier of the current image block to be encoded is used to indicate a basic division manner of the image block to be currently encoded. The derived identifier of the current image block to be encoded is used to indicate a target derivative mode of the sub-image block in the current image block to be encoded.

Optionally, the division identifier of the current image block to be encoded is represented by a variable length code, for example, a binary truncation code or an exponential Golomb code. The derived identifier of the current image block to be encoded is represented by a fixed length code.

The split identifier of the current image block to be encoded and the derived identifier of the current image block to be encoded may be independent flags. For example, the split identifier of the current image block to be encoded is represented by SplitFlag, and the derivative identifier of the current image block to be coded is DeriveFlag. Said. Certainly, the division identifier of the current image block to be encoded and the derivative identifier of the current image block to be encoded may also be a part of the same identifier, for example, if the derivative identifier of the current image block to be encoded is represented by a fixed length code of length x The code stream includes flag1 corresponding to the current image block to be encoded, and the last x bit of the flag1 is a derivative identifier of the current image block to be encoded, and the remaining bits of the flag1 except the last x bit are the division identifier of the current image block to be encoded. .

It is easy to understand that if the video encoder further needs to divide the first derivative image block, the video encoder determines the derived image block as the current image block to be encoded, and re-executes S151-S154 until it is re-determined. The first derivative image block is divided.

The first achievable manner in the above S151 will now be described.

With reference to the above-mentioned FIG. 15A, as shown in FIG. 16, the method for determining the basic division mode and the target derivative mode by the video encoder in the embodiment of the present application can be implemented by S161 to S166, that is, the above S151 can be replaced with S161 to S166. S161 ~ S166 are specifically:

S161. The video encoder divides the current image block to be encoded by using at least two candidate division manners.

Candidate division methods include horizontal N points, vertical N points, horizontal vertical N points, and no division.

In one example, as shown in FIG. 17A, if the candidate division mode is horizontal quarter, vertical quarter, horizontal vertical quarter, and no division, and the current image block to be encoded is image block 170, the video encoder may adopt level four. The image block 170 is divided into four, and four rectangular sub-image blocks having a width equal to the width of the image block 170 and a height of 1/4 of the height of the image block 170 are generated, and FIG. 17A uses 171 to represent the image block of the structure. The video encoder may also divide the image block 170 by vertical quadrants to generate four rectangular sub-image blocks having a height equal to the height of the image block 170 and a width of 1/4 of the width of the image block 170, and FIG. 17A indicates the structure by 172. Image block. The video encoder may also divide the image block 170 by horizontal and vertical divisions to generate four square sub-image blocks having a height of 1/2 of the height of the image block 170 and a width of 1/2 of the width of the image block 170, which is adopted in FIG. 17A. 173 denotes an image block of the structure. Of course, the video encoder may also not divide the current coded image block 170, and the undivided image block is represented by 174 in FIG. 17A.

In Fig. 17A, the horizontal four-point is identified by 100, the vertical four-point is identified by 101, and the horizontal and vertical four-points are identified by 110, and the zero mark is not divided.

Of course, in the partitioning mode shown in FIG. 17A, the horizontal four-points may be identified by using 110, the vertical four-points are identified by 101, and the horizontal and vertical four-points are identified by using 101, and the zero-marking is not used. The embodiment of the present application does not specifically limit this. .

In another example, as shown in FIG. 19, if the candidate division mode is horizontal dichotomy, vertical dichotomy, and non-division, and the current image block to be encoded is a sub-image block in the image block 171, the video encoder may adopt horizontal dichotomy. The sub-image block in the image block 171 generates two rectangular sub-image blocks having a width equal to the width of the sub-image block in the image block 171 and a height equal to 1/2 of the height of the sub-graphic block in the image block 171, and FIG. 19 is adopted. 175 denotes an image block of the structure. The video encoder may also divide the sub-image blocks in the image block 171 by vertical bipartite division, and generate two heights equal to the height of the sub-image block in the image block 171, and the width is 1/2 of the width of the sub-image block in the image block 171. A rectangular sub-image block, Figure 19 uses 176 to represent an image block of the structure. Of course, the video encoder may also not divide the sub-image blocks in image block 171, and the un-divided image blocks are represented by 177 in FIG.

In another example, as shown in FIG. 20, if the candidate division mode is horizontal quadrant, vertical quadrant, and no division, and the current image block to be encoded is a sub-image block in the image block 171, the video encoder may adopt a horizontal level. Dividing the sub-image blocks in the image block 171 to generate four rectangular sub-image blocks having a width equal to the width of the sub-image block in the image block 171 and a height equal to 1/4 of the height of the sub-graphic block in the image block 171, Figure 20 shows an image block of the structure using 175'. The video encoder may also divide the sub-image blocks in the image block 171 by vertical quadrants to generate four heights equal to the height of the sub-image blocks in the image block 171, and the width is 1/4 of the width of the sub-image block in the image block 171. The rectangular sub-image block, Figure 20 uses 176' to represent the image block of the structure. Of course, the video encoder may also not divide the sub-image blocks in the image block 171, and the undivided image block 20 is represented by 177'.

Optionally, the candidate division mode indirectly reflects the division level, so that the video encoder may divide the current image block to be coded by using a corresponding division manner on the preset division level according to the candidate division manner.

Illustratively, as shown in FIG. 17B, the video encoder divides the image block 170 in a horizontal and vertical division manner at the first level, generates an image block 173, or does not divide the image block 170. Under the undivided branch, the video encoder may or may not divide the image block 170 at the second level. Under the branch where the video encoder divides the image block 170 at the second level, the video encoder may divide the image block 170 by horizontal quadrant division to generate the image block 171, or divide the image block 170 by vertical quadrant division. , a graphics block 172 is generated.

In Fig. 17B, the division manner shown by the 011 identification image block 171 is adopted, and the division manner shown by the 010 identification image block 172 is adopted, and the division manner shown by the 1 identification image block 173 is employed.

Of course, the numerical value of the division mark in Fig. 17B is just an example. In addition to the example in FIG. 17B, the division manner shown by the 1 identification image block 171 may be employed, and the division manner shown by the 010 identification image block 172 is employed, and the division manner shown by the image block 173 is indicated by the 011 flag.

In a practical application, the video encoder may divide the current image block to be encoded by using a bifurcation/four-fork division method when the current image block to be encoded satisfies a preset condition. For example, the image block 171 has a width of M1 and a height of H1. If M1×H1 is greater than X1×Y1, the video encoder divides the image block 171 by a bifurcation/quadruple division; or, if M1×H1 is smaller than X2 ×Y2, the video encoder divides the image block 171 by a bifurcation/quadruple division method; or, if M1×H1 is larger than X1×Y1 and smaller than X2×Y2, the video encoder adopts bifurcation/quad The image block 171 is divided by the cross division method. Among them, X1, Y1, X2 and Y2 are all positive integers.

Exemplarily, if M1=4×H1, the video encoder divides the current image block to be coded by vertical quadrant; if H1=4×M1, the video encoder divides the current image block to be coded by horizontal quadrant division. .

S162. The video encoder calculates a rate-distortion cost for each candidate partitioning manner, and determines a candidate partitioning manner corresponding to the minimum rate-distortion cost as the basic partitioning manner.

S163. The video encoder derivates the sub-image block obtained by dividing the basic division manner according to each preset derivation mode in the candidate derivative mode set.

Referring to FIG. 12 and FIG. 17B above, as shown in FIG. 18, if the basic division mode is horizontal quarter, the video encoder can adopt the eight preset derivative modes (A, B, C, D, shown in FIG. 12 above). E, F, G, and H) respectively deriving at least two adjacent sub-image blocks in 171, and correspondingly, the image blocks processed by using the above eight preset derivation patterns are respectively represented as 401, 402, 403, and 404. , 405, 406, 407, 408.

Referring to FIG. 13 and FIG. 17B above, as shown in FIG. 18, if the basic division mode is vertical quarter, the video encoder can adopt the eight preset derivative modes (A', B', C' shown in FIG. 13 above. , D′, E′, F′, G′, H′) respectively deriving at least two adjacent sub-image blocks in 172, and correspondingly, the image blocks processed by using the above eight preset derivation modes respectively represent It is 501, 502, 503, 504, 505, 506, 507, 508.

14 and FIG. 17B, as shown in FIG. 18, if the basic division mode is horizontal and vertical quarters, the video encoder can adopt the eight preset derivative modes (A′′, B′′, C shown in FIG. 14 above. ", D", E", F", G", H") respectively deriving at least two adjacent sub-image blocks in 173, and correspondingly, the image blocks processed by using the above eight preset derivation patterns respectively Expressed as 601, 602, 603, 604, 605, 606, 607, 608.

The method of identifying the image block processed by the preset derivative mode may be determined according to the state of the dividing line. Optionally, if a certain dividing line is removed, the dividing line is represented by “0”; if a dividing line is reserved, the dividing line is represented by “1”; according to the removal and retention of the dividing line in the image block, Identifies the image block.

Exemplarily, the image block 171 includes a first dividing line, a second dividing line, and a third dividing line (the representation of the dividing line may refer to FIG. 12 above), if the first dividing line is removed, the second dividing line and the third dividing line Reserved, that is, the first sub-image block and the second sub-image block are combined to generate an image block 401, and the image block 401 can be represented by 011. Correspondingly, the image block 172 includes a first dividing line, a second dividing line, and a third dividing line (the representation of the dividing line can refer to FIG. 13 above), and if the second dividing line is removed, the first dividing line and the third dividing line are retained. That is, the second sub-image block and the third sub-image block are combined to generate an image block 502, and the image block 502 can be represented by 101. The image block 173 includes a first dividing line, a second dividing line, a third dividing line, and a fourth dividing line (the representation of the dividing line can be referred to FIG. 14 above), if the first dividing line is removed, the second dividing line, and the third dividing line The line and the fourth dividing line are reserved, that is, the first sub-image block and the second sub-image block are combined to generate an image block 601, and the image block 601 may be represented by 0111. The identification methods of other image blocks are similar, and will not be repeated here.

Of course, the identification of the image block may also adopt a different method from that of FIG. 18, which is not specifically limited in this embodiment of the present application.

In the actual coding, for each division mode, the video encoder may select a part of the eight preset derivative modes corresponding to the division mode shown in FIG. 18 as the focus mode (the image block generated by the derivative mode) Further division is required) or the deletion mode (regardless of the derivative mode), and the remainder is used as the normal mode (the mode in which the video encoder performs derivation processing). Thus, the set of candidate derived patterns may include only normal patterns.

Exemplarily, if the basic division mode is horizontal quarter, the normal mode may include the preset derivative modes shown by B, C, D, and F in FIG. 12 above, that is, the video encoder generates image blocks 402, 403, and 404. 406.

If the basic division mode is vertical quarter, the normal mode may include the preset derivative modes shown by B', D', E', and G' in FIG. 13 above, that is, the video encoder generates image blocks 502, 504, and 505. 507.

If the basic division mode is horizontal and vertical quarters, the normal mode may include the preset derivative modes shown by A′′, B′′, E′′, and F′′ in FIG. 14 above, that is, the video encoder generates image blocks 601 and 602, 605, 606.

For the above-mentioned common mode, the embodiment of the present application may adopt a bit identifier of a preset length, such as a 2-bit identifier.

As can be seen in conjunction with FIG. 18, the image block 608 and the image block 173 have the same shape. Therefore, if the preset derivative mode corresponding to the image block 608 is the focus mode, the mode can be identified by the flag bit "1" of the image block 173. Compared to the identification bit "1111" of the image block 608, the number of bits of the identification bit "1" is less. If the video encoder determines that the derived mode shown by image block 608 is a target derived mode, the video encoder may directly use 1 to indicate the derived mode of the target derived mode for image block 608 after the last bit of the divided identification.

The derived image blocks in the embodiments of the present application may employ a smaller number of bit identifiers. Compared with the prior art, the coding method provided by the present application can identify the division of image blocks by using simple identification, and effectively improve coding efficiency.

S164. The video encoder calculates a rate distortion cost of each of the preset derivative modes in the candidate derivative mode set corresponding to the basic partition mode.

S165. The video encoder determines the preset derivative mode corresponding to the minimum rate distortion cost as the target derivative mode.

It is easy to understand that if the first derivative image block obtained in the above S153 further needs to be divided, the video encoder regards the first derivative image block as the current image block to be encoded, and re-executes S161-S165 to determine the basic division manner and Target derivative model.

As can be seen from the above description, the video codec in the present application may also determine the target derivative mode only if the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold.

Specifically, in combination with FIG. 15A and FIG. 16 described above, as shown in FIG. 15B, the above S151 may be replaced with S151a to S151c.

S151a. The video encoder determines a basic division manner of the current image block to be encoded and a feature parameter of the current image block to be encoded.

For a method for the video encoder to determine the basic division mode of the image block to be encoded, reference may be made to the descriptions of the foregoing S161 to S162, and details are not described herein again.

The feature parameter of the current image block to be encoded refers to the height of the current image block to be encoded and the width of the current image block to be encoded.

S151b. The video encoder determines whether a feature parameter of the current image block to be encoded is less than or equal to a first preset threshold.

If the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold (for example, 32×32), the video encoder performs S151c.

If the feature parameter of the current image block to be encoded is greater than the first preset threshold, the video encoder regards the sub-image block of the current image block to be encoded as the current image block to be encoded, that is, re-executes S150. For example, in FIG. 21, if the current image block to be encoded is the image block 210, the size of the image block 210 is 64×64, and the first preset threshold is 32×32, the video encoder determines that the basic division mode of the image block 210 is After horizontally dividing the image block 210, the image block 212 is generated. Since the feature parameter of the image block 210 is greater than the first preset threshold, the video encoder uses the sub-image block in the image block 212 as the current image block to be encoded. And re-determining the basic division manner of the sub-image block of the image block 212.

S151c, the video encoder determines the target derivative mode.

For the method for the video encoder to determine the target derivative mode, reference may be made to the descriptions of S163 to S165 above, and details are not described herein again.

In the flow shown in FIG. 15B, after performing S151a, the video encoder may perform S152 first, then S151b, or S151b may be performed first, then S152 may be performed, and S152 and S151b may be simultaneously performed. Specifically limited.

It is easy to understand that the video encoder only determines the target derivative mode if the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold, and therefore, for the image whose feature parameter is greater than the first preset threshold. In the case of a block, the code stream may include only the partition identifier of the image block, and the derivative identifier of the image block is not included, which effectively reduces the transmission resource occupied by the code stream. Certainly, for an image block whose feature parameter is greater than the first preset threshold, the code stream may also include a derivative identifier of the image block, and the derived identifier of the image block is used to indicate that the sub-image block in the image block is not derivatized.

The video encoder determines the target derivative mode only when the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold, and effectively increases the encoding rate. In practical applications, the flow shown in Figure 15B is easier to implement.

In order to facilitate a full understanding of the present application, the image block partitioning and the derivation of sub-image blocks in the present application will now be described in conjunction with the coding tree shown in FIG. Figure 21 illustrates the partitioning of square image blocks and the derivation of sub-image blocks in a square image block using the methods provided herein, as well as the partitioning of non-square image blocks and the derivation of sub-image blocks in non-square image blocks.

The video encoder divides the image block 210, and the image block 210 may be divided by the horizontal vertical quadrant division method at the first level, and the image block 211 may be generated or may not be divided. Under the branch without division, the video encoder may divide the image block 210 by a bifurcation division method at the second level; or may divide the four-fork/non-division division manner. In the branch of the second layer that divides the image block 210 by the bifurcation division mode, the video encoder may specifically divide the image block 210 by the horizontal binary division method to generate the image block 212; or divide the image block by the vertical binary division method. 210. Generate an image block 213. Further, the video encoder may divide the sub-image block of the image block 212 by using a bifurcation division manner, and may also divide the sub-image block of the image block 212 by a four-fork division method. The video encoder may divide the sub-image block of the image block 212 into a horizontal quadrant division manner to generate an image block 219, or a video encoder may divide the sub-image block of the image block 212 by a four-division division method. The sub-image blocks of the image block 212 are divided by a vertical division method to generate an image block 220. In the branch where the video encoder is divided by the four-branch/non-divided partitioning mode, the video encoder may divide the image block 210 by horizontal quadrant division to generate the image block 214, or may adopt the vertical quadrant division mode. The image block 210 is divided to generate an image block 215. The image block 219 and the image block 220 are non-square image blocks, and the image blocks obtained by deriving the at least two adjacent sub-image blocks in the image block 219 are respectively represented as 221 by using four preset derivation modes. 222, 223, 233; image blocks obtained by deriving at least two adjacent sub-image blocks in the image block 220 by using four preset derivation modes are respectively represented as 224, 225, 226, 234. The image block 214 and the image block 215 are square image blocks, and the image blocks obtained by deriving the at least two adjacent sub-image blocks in the image block 214 by using four preset derivation modes are respectively represented as 227, 228, 228, 235. Image blocks obtained by deriving at least two adjacent sub-image blocks in the image block 215 by using four preset derivation modes are respectively represented as 230, 231, 232, and 236.

The individual image blocks in Fig. 21 are identified by the methods shown in the figures, respectively.

The image block 212 is identified by "011". If the basic division mode of the sub-image block of the image block 212 is horizontal four-division and the image block 219 is generated, the basic division mode may be identified by "011111", if the sub-image of the image block 212 The feature parameter of the block is less than or equal to the first preset threshold, and the video encoder may determine one of the four preset derivative modes corresponding to the image block 219 shown in FIG. 21 as the target derivative mode. Since the shape of the image block 233 is the same as that of the image block 219, the derivative mode represented by the image block 233 can be directly identified by 1 , and the remaining 3 preset derivative modes can be identified by a value different from 1 (such as 0), the image block. The logos of 221, 222, and 223 start with 0. Thus, if the video encoder adopts the derivative mode shown by the image block 233 as the target derivative mode, the video encoder can directly add 1 after the identifier "011111" of the basic division mode, that is, form the identifier "0111111". Correspondingly, the video decoder parses the code stream, and after obtaining the identifier "0111111", it can be determined that the derivation is not performed after the image block 219. If the video encoder adopts the derivative mode shown by the image block 223 as the target derivative mode, the video encoder adds 011 after the identifier "011111" of the basic division mode, that is, the identifier "011111011" is formed. Correspondingly, the video decoder parses the code stream, and after obtaining the identifier “011111011”, it may determine that the target derivative mode is the derivative mode shown by the image block 223.

Similarly, the video encoder also processes the image blocks 214, 215, 220 in a similar manner.

The division mode shown in FIG. 21 is implemented on the basis of the existing QTBT division mode. Therefore, in practical applications, the division mode is relatively easy to implement.

The video encoder in the present application identifies a plurality of different division results by combining the division mode and the derivative mode, enriches the division manner of the image block, and can adapt to video images of various texture characteristics.

The embodiment of the present application provides a decoding apparatus, which may be a video decoder. Specifically, the decoding device is configured to perform the steps performed by the video decoder in the above decoding method. The decoding apparatus provided in this embodiment of the present application may include a module corresponding to the corresponding step.

The embodiment of the present application may divide the function module into the decoding device according to the foregoing method example. For example, each function module may be divided according to each function, or two or more functions may be integrated into one processing module. The above integrated modules can be implemented in the form of hardware or in the form of software functional modules. The division of modules in the embodiments of the present application is schematic, and is only a logical function division, and may be further divided in actual implementation.

In the case where the respective functional modules are divided by the respective functions, FIG. 22 shows a possible structural diagram of the decoding device involved in the above embodiment. As shown in FIG. 22, the decoding device 22 includes a parsing unit 220, a dividing unit 221, a deriving unit 222, a decoding unit 223, and a judging unit 224.

The parsing unit 220 is configured to support the decoding apparatus to perform S111 and the like in the above embodiments, and/or other processes for the techniques described herein.

The dividing unit 221 is for supporting the decoding apparatus to execute S112 and the like in the above embodiments, and/or other processes for the techniques described herein.

The derivation unit 222 is for supporting the decoding apparatus to perform S113 and the like in the above embodiments, and/or other processes for the techniques described herein.

The decoding unit 223 is for supporting the decoding apparatus to perform S114 and the like in the above embodiments, and/or other processes for the techniques described herein.

The judging unit 224 is configured to instruct the decoding apparatus to execute S111b and the like in the above embodiment, and/or other processes for the techniques described herein.

All the related content of the steps involved in the foregoing method embodiments may be referred to the functional descriptions of the corresponding functional modules, and details are not described herein again.

The decoding device provided by the embodiment of the present application includes, but is not limited to, the foregoing module. For example, the decoding device may further include a storage unit 225.

The storage unit 225 can be used to store program codes and data of the decoding device.

In the case of an integrated unit, a schematic structural diagram of a decoding apparatus provided by an embodiment of the present application is shown in FIG. In FIG. 23, the decoding device 23 includes a processing module 230 and a communication module 231. The processing module 230 is configured to perform control management on the actions of the decoding device, for example, perform the steps performed by the parsing unit 220, the dividing unit 221, the deriving unit 222, the decoding unit 223, and the determining unit 224, and/or for performing the description herein. Other processes of technology. The communication module 231 is used to support interaction between the decoding device and other devices. As shown in FIG. 23, the decoding apparatus may further include a storage module 232 for storing program codes and data of the decoding apparatus, for example, storing the content held by the storage unit 225.

The processing module 230 can be a processor or a controller, such as a central processing unit (CPU), a general purpose processor, a digital signal processor (DSP), an ASIC, an FPGA, or other programmable Logic device, transistor logic device, hardware component, or any combination thereof. It is possible to implement or carry out the various illustrative logical blocks, modules and circuits described in connection with the present disclosure. The processor may also be a combination of computing functions, for example, including one or more microprocessor combinations, a combination of a DSP and a microprocessor, and the like. The communication module 231 can be a transceiver, an RF circuit or a communication interface, or the like. The storage module 232 can be a memory.

All the relevant content of each scenario involved in the foregoing method embodiments may be referred to the functional description of the corresponding functional module, and details are not described herein again.

Both the decoding device 22 and the decoding device 23 can perform the above-described decoding method shown in FIG. 11A or FIG. 11B. The decoding device 22 and the decoding device 23 can be specifically a video decoding device or other device having a video encoding and decoding function.

The application also provides a decoding device, the decoding device comprising: one or more processors, a memory, a communication interface. The memory, communication interface is coupled to one or more processors; the memory is for storing computer program code, and the computer program code includes instructions that, when executed by one or more processors, perform a decoding method of an embodiment of the present application.

The decoding devices herein may be video display devices, smart phones, portable computers, and other devices that can process video or play video.

The present application also provides a video decoder including a nonvolatile storage medium, and a central processing unit, the nonvolatile storage medium storing an executable program, the central processing unit and the nonvolatile storage The medium is connected, and the executable program is executed to implement the decoding method of the embodiment of the present application.

The present application also provides a decoder, which includes the decoding device (the decoding device 22 and the decoding device 23) and the reconstruction module in the embodiment of the present application, wherein the reconstruction module is used according to the decoding device. The prediction map pixel determines the reconstructed pixel value of the current image block.

Another embodiment of the present application also provides a computer readable storage medium comprising one or more program codes, the one or more programs comprising instructions when a processor in a decoding device is executing the program At the time of the code, the decoding device performs the decoding method as shown in FIG. 11A or FIG. 11B.

In another embodiment of the present application, there is also provided a computer program product comprising computer executed instructions stored in a computer readable storage medium; at least one processor of the decoding device is readable from a computer The readable storage medium reads the computer execution instructions, and the at least one processor executes the computer to execute the instructions such that the terminal implements the step of executing the video decoder in the decoding method illustrated in FIG. 11A or FIG. 11B.

An embodiment of the present application provides an encoding apparatus, which may be a video encoder. Specifically, the encoding device is configured to perform the steps performed by the video encoder in the above encoding method. The encoding device provided by the embodiment of the present application may include a module corresponding to the corresponding step.

The embodiment of the present application may divide the functional modules of the encoding device according to the foregoing method example. For example, each functional module may be divided according to each function, or two or more functions may be integrated into one processing module. The above integrated modules can be implemented in the form of hardware or in the form of software functional modules. The division of modules in the embodiments of the present application is schematic, and is only a logical function division, and may be further divided in actual implementation.

In the case where the respective functional modules are divided by corresponding functions, FIG. 24 shows a possible structural diagram of the encoding apparatus involved in the above embodiment. As shown in FIG. 24, the encoding device 24 includes a determining unit 240, a dividing unit 241, a deriving unit 242, an encoding unit 243, and a judging unit 244.

The determining unit 240 is configured to support the encoding apparatus to perform S150, S151, S161, S162, S163, S164, S165, etc. in the above embodiments, and/or other processes for the techniques described herein.

The dividing unit 241 is for supporting the encoding apparatus to execute S152 and the like in the above embodiments, and/or other processes for the techniques described herein.

The derivation unit 242 is for supporting the encoding apparatus to perform S153 and the like in the above embodiments, and/or other processes for the techniques described herein.

The encoding unit 243 is for supporting the encoding apparatus to execute S154 and the like in the above embodiments, and/or other processes for the techniques described herein.

The judging unit 244 is for supporting the encoding apparatus to execute S151b and the like in the above embodiment, and/or other processes for the techniques described herein.

The encoding device provided by the embodiment of the present application includes, but is not limited to, the foregoing module. For example, the encoding device may further include a storage unit 245.

The storage unit 245 can be used to store program codes and data of the encoding device.

A schematic structural diagram of an encoding apparatus provided by an embodiment of the present application is shown in FIG. 25 in the case of an integrated unit. In FIG. 25, the encoding device 25 includes a processing module 250 and a communication module 251. The processing module 250 is configured to perform control management on the actions of the encoding device, for example, perform the steps performed by the determining unit 240, the dividing unit 241, the deriving unit 242, the encoding unit 243, and the determining unit 244, and/or for performing the description herein. Other processes of technology. The communication module 251 is used to support interaction between the encoding device and other devices. As shown in Fig. 25, the encoding device may further include a storage module 252 for storing program code and data of the encoding device, for example, storing the content held by the storage unit 245.

The processing module 250 can be a processor or a controller, such as a CPU, a general purpose processor, a DSP, an ASIC, an FPGA or other programmable logic device, a transistor logic device, a hardware component, or any combination thereof. It is possible to implement or carry out the various illustrative logical blocks, modules and circuits described in connection with the present disclosure. The processor may also be a combination of computing functions, for example, including one or more microprocessor combinations, a combination of a DSP and a microprocessor, and the like. The communication module 251 can be a transceiver, an RF circuit, a communication interface, or the like. The storage module 252 can be a memory.

Both the encoding device 24 and the encoding device 25 described above can perform the above-described encoding method shown in FIG. 15A or FIG. 15B. The encoding device 24 and the encoding device 25 can be specifically a video encoding device or other device having a video encoding function.

The application also provides an encoding device comprising: one or more processors, a memory, a communication interface. The memory, communication interface is coupled to one or more processors; the memory is for storing computer program code, and the computer program code includes instructions that, when executed by one or more processors, perform an encoding method of an embodiment of the present application.

The encoding device here can be a video display device, a smart phone, a portable computer, and other devices that can process video or play video.

The present application also provides a video encoder, including a non-volatile storage medium, and a central processing unit, the non-volatile storage medium storing an executable program, the central processing unit and the non-volatile storage The medium is connected, and the executable program is executed to implement the encoding method of the embodiment of the present application.

The present application also provides an encoder including the encoding device (encoding device 24 and encoding device 25) in the embodiment of the present application.

Another embodiment of the present application also provides a computer readable storage medium including one or more program codes, the one or more programs including instructions, when a processor in an encoding device is executing the program At the time of the code, the encoding device executes the encoding method as shown in Fig. 15A or Fig. 15B.

In another embodiment of the present application, there is also provided a computer program product comprising computer executed instructions stored in a computer readable storage medium; at least one processor of the encoding device can be from a computer The readable storage medium reads the computer execution instructions, and the at least one processor executes the computer to execute the instructions such that the terminal implements the step of executing the video encoder in the encoding method illustrated in FIG. 15A or FIG. 15B.

In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented using a software program, it may occur in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in accordance with embodiments of the present application are generated in whole or in part.

The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions can be stored in a computer readable storage medium or transferred from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions can be from a website site, computer, server or data center Transfer to another website site, computer, server, or data center by wire (eg, coaxial cable, fiber optic, digital subscriber line (DSL), or wireless (eg, infrared, wireless, microwave, etc.). The computer readable storage medium can be any available media that can be accessed by a computer or a data storage device such as a server, data center, or the like that includes one or more available media. The usable medium may be a magnetic medium (eg, a floppy disk, a hard disk, a magnetic tape), an optical medium (eg, a DVD), or a semiconductor medium (such as a solid state disk (SSD)).

Through the description of the above embodiments, those skilled in the art can clearly understand that for the convenience and brevity of the description, only the division of the above functional modules is illustrated. In practical applications, the above functions can be allocated according to needs. It is completed by different functional modules, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above.

In the several embodiments provided by the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be used. The combination may be integrated into another device, or some features may be ignored or not performed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.

The units described as separate components may or may not be physically separated, and the components displayed as units may be one physical unit or multiple physical units, that is, may be located in one place, or may be distributed to multiple different places. . Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a readable storage medium. Based on such understanding, the technical solution of the embodiments of the present application may be embodied in the form of a software product in the form of a software product in essence or in the form of a contribution to the prior art, and the software product is stored in a storage medium. A number of instructions are included to cause a device (which may be a microcontroller, chip, etc.) or a processor to perform all or part of the steps of the methods described in various embodiments of the present application. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .

The foregoing is only a specific embodiment of the present application, but the scope of protection of the present application is not limited thereto, and any changes or substitutions within the technical scope of the present application should be covered by the scope of the present application. . Therefore, the scope of protection of the present application should be determined by the scope of the claims.

Claims

A decoding method, comprising:

Parsing the code stream to determine a basic division manner of the current image block to be decoded and a target derivative mode of the sub image block in the current image block to be decoded; the target derivative mode is one of a candidate derivative mode set, the candidate The derivative mode set includes M preset derivative modes, where the preset derivative mode is used to indicate a derivative mode, and the derivative mode is used to derive any two adjacent sub-image blocks in the current image block to be decoded into one derivative. Image block, M is an integer greater than or equal to 0;

Decoding, according to the basic division manner, the current image block to be decoded into N sub-image blocks, where N is an integer greater than or equal to 2;

And deriving at least two adjacent sub-image blocks of the N sub-image blocks as the first derivative image block according to the derivative manner of the target-derived mode indication;

Decoding the first derived image block.
A decoding method, comprising:

Parsing the code stream to determine a basic division manner of the current image block to be decoded and a feature parameter of the current image block to be decoded, the feature parameter being at least one of a height and a width;

And if the feature parameter of the current to-be-decoded image block is less than or equal to the first preset threshold, parsing the code stream to determine a target derivative mode of the sub-image block in the current image block to be decoded, the target derivative mode As one of the candidate derivative mode sets, the candidate derivative mode set includes M preset derivative modes, where the preset derivative mode is used to indicate a derivative mode, where the derivative mode is used to use the current image block to be decoded Any two adjacent sub-image blocks are derived into one derivation image block, and the derivation manners of the M preset derivation mode indications are different from each other, and the feature parameters of the derived image block are larger than each of the current image blocks to be decoded. a feature parameter of the sub-image block, the feature parameter being at least one of height and width, and M being an integer greater than or equal to 0;

Decoding, according to the basic division manner, the current image block to be decoded into N sub-image blocks, where N is an integer greater than or equal to 2;

And deriving at least two adjacent sub-image blocks of the N sub-image blocks as the first derivative image block according to the derivative manner of the target-derived mode indication;

Decoding the first derived image block.
A decoding method according to claim 1 or 2, characterized in that

The basic division mode is horizontal N, vertical N or horizontal and vertical N.
The decoding method according to claim 3, characterized in that

In a case where the basic division mode is the horizontal N-score, a width of each of the N sub-image blocks is equal to a width of the current image block to be decoded, and each of the sub-image blocks The height is equal to 1/N of the height of the current image block to be decoded;

In a case where the basic division manner is the vertical N-score, a height of each of the N sub-image blocks is equal to a height of the current image block to be decoded, and each of the sub-image blocks The width is equal to 1/N of the width of the current image block to be decoded.
The decoding method according to claim 3, characterized in that

In a case where the basic division manner is the horizontal vertical N-minute, the width of each of the N sub-image blocks is equal to the width of the current image block to be decoded.
The height of each of the sub-image blocks is equal to the height of the current image block to be decoded.
Is an integer.
A decoding method according to any one of claims 1 to 5, characterized in that

When the basic division mode is horizontal N or vertical N, 0 ≤ M ≤ 2 N-1 ; when the basic division mode is horizontal and vertical N,
Is an integer.
The decoding method according to claim 6, wherein

In the case that N=4, M=4, and the basic division mode is the horizontal four-point, the candidate derivative mode set includes a first preset derivative mode, a second preset derivative mode, and a third preset. a derivative mode and a fourth preset derivative mode; wherein the first preset derivative mode is a first sub-image block, a second sub-image block, and a third sub-image block are derived as one derived image block; Determining that the third sub-image block and the fourth sub-image block are derived as one derived image block; the third preset derivation mode is deriving the second sub-image block and the third sub-image block as a derived image block; the fourth preset derivative mode is that the first sub-image block and the second sub-image block are derived as one derived image block, the third sub-image block and the fourth sub-image Deriving a block into another derived image block; the first sub-image block, the second sub-image block, the third sub-image block, and the fourth sub-image block are in the current image block to be decoded Arranged in order from top to bottom;

In the case that N=4, M=4, and the basic division manner is the vertical quarter, the candidate derivative mode set includes a fifth preset derivative mode, a sixth preset derivative mode, and a seventh preset derivative. And a fifth preset derivative mode, wherein the fifth preset derivative mode is a fifth sub-image block, a sixth sub-image block, and a seventh sub-image block are derived as one derivative image block; the sixth preset The derived mode is that the sixth sub-image block, the seventh sub-image block, and the eighth sub-image block are derived as one derived image block; the seventh preset derivation mode is the sixth sub-image block and the seventh The sub-image block is derived as a derived image block; the eighth preset derivation mode is that the fifth sub-image block, the sixth sub-image block, the seventh sub-image block, and the eighth sub-image block are all maintained Independently; the fifth sub-image block, the sixth sub-image block, the seventh sub-image block, and the eighth sub-image block are sequentially arranged in the current to-be-decoded image block in order from left to right;

In the case that N=4, M=4, and the basic division manner is the horizontal vertical quarter, the candidate derivative mode set includes a ninth preset derivative mode, a tenth preset derivative mode, and an eleventh pre-predetermined a derivation mode and a twelfth preset derivation mode; wherein the ninth pre-derivative mode is a ninth sub-image block and a tenth sub-image block are derived as one derivative image block; The ninth sub-image block and the tenth sub-image block are derived as one derived image block, and the eleventh sub-image block and the twelfth sub-image block are derived as another derived image block; the eleventh preset Derived mode is derived from the tenth sub-image block and the eleventh sub-image block into one derived image block, and the ninth sub-image block and the twelfth sub-image block are derived as another derived image block; The twelfth preset derivation mode is that the ninth sub-image block and the twelfth sub-image block are derived as one derived image block; the ninth sub-image block, the tenth sub-image block, and the An eleven sub-image block and the twelfth sub-image block at the current The image blocks to be decoded are sequentially arranged in a clockwise direction.
The decoding method according to any one of claims 1 to 7, wherein the parsing code stream determines a basic division manner of an image block to be decoded, and specifically includes:

Obtaining, from the code stream, a division identifier of the current image block to be decoded;

Determining, according to the division identifier of the current image block to be decoded, the basic division manner;

Determining a target derivative mode of the current image block to be decoded, specifically:

Obtaining, from the code stream, a derivative identifier of the current image block to be decoded;

Determining the target derivative mode according to the derived identifier of the current image block to be decoded.
The decoding method according to claim 8, wherein

The division identifier of the current image block to be decoded is represented by a variable length code, and the derivative identifier of the current image block to be decoded is represented by a fixed length code.
The decoding method according to any one of claims 1 to 9, wherein each of the candidate derived mode sets adopts a fixed length code identifier.
An encoding method, comprising:

Determining a basic division manner of the current image block to be encoded and a target derivative mode of the sub image block in the current image block to be encoded; the target derivative mode is one of a candidate derivative mode set, and the candidate derivative mode set includes M a preset derivative mode, the preset derivative mode is used to indicate a derivative mode, and the derivative mode is used to derivate any two adjacent sub-image blocks in the current image block to be encoded into a derivative image block, where M is An integer greater than or equal to 0;

Decoding the current image block to be encoded into N sub-image blocks according to the basic division manner, where N is an integer greater than or equal to 2;

And deriving at least two adjacent sub-image blocks of the N sub-image blocks as the first derivative image block according to the derivative manner of the target-derived mode indication;

And encoding the first derived image block to obtain a code stream, where the code stream includes a division identifier of the current to-be-coded image block and the current to-be-supplied a derivative identifier of the coded image block, the partition identifier of the current image block to be encoded is used to indicate the basic partition mode, and the derived identifier of the current image block to be encoded is used to indicate the target derivative mode.
An encoding method, comprising:

Determining a basic division manner of the current image block to be encoded and a feature parameter of the current image block to be encoded;

Decoding the current image block to be encoded into N sub-image blocks according to the basic division manner, where N is an integer greater than or equal to 2;

Determining, in a case that a feature parameter of the current to-be-coded image block is less than or equal to a first preset threshold, a target derivative mode of the sub-image block in the current image block to be encoded; the target derivative mode is a candidate derivative mode set The candidate derivative mode set includes M preset derivative modes, where the preset derivative mode is used to indicate a derivative mode, where the derivative mode is used to use any two phases in the current image block to be encoded. The neighboring image block is derived as a derived image block, and M is an integer greater than or equal to 0;

And deriving at least two adjacent sub-image blocks of the N sub-image blocks as the first derivative image block according to the derivative manner of the target-derived mode indication;

And encoding the derived image block to obtain a code stream, where the code stream includes a division identifier of the current to-be-encoded image block and the current to-be-coded image, where the first derivative image block is not divided. a derivation identifier of the block to be used to indicate the basic division mode, where the derivation identifier of the current to-be-encoded image block is used to indicate the target derivation mode.
The encoding method according to claim 11 or 12, wherein the determining the basic division manner of the image block to be encoded currently includes:

Determining at least two candidate division manners corresponding to the current image to be encoded;

Calculating a rate distortion cost of each of the at least two candidate partitioning modes;

The candidate division manner corresponding to the smallest rate distortion cost is determined as the basic division manner.
The encoding method according to any one of claims 11 to 13, wherein the determining the target derivative mode of the current image block to be encoded comprises:

Calculating a rate distortion cost of each of the preset derivative modes in the candidate derived mode set, wherein when the basic partitioning mode is horizontal N or vertical N, 0≤M≤2 N-1 ; When the basic division method is horizontal and vertical N minutes,
Is an integer;

The preset derivative mode corresponding to the minimum rate distortion cost is determined as the target derivative mode.
The encoding method according to any one of claims 11 to 14, wherein

The division identifier of the current image block to be decoded is represented by a variable length code, and the derivative identifier of the current image block to be decoded is represented by a fixed length code.
The encoding method according to any one of claims 11-15, wherein each of the candidate derived mode sets adopts a fixed length code identifier.
A decoding device, comprising:

a parsing unit, configured to parse the code stream, to determine a basic division manner of the current image block to be decoded, and a target derivation mode of the sub image block in the current image block to be decoded; the target derivation mode is one of the candidate derivation mode sets The candidate derivative mode set includes M preset derivative modes, where the preset derivative mode is used to indicate a derivative mode, where the derivative mode is used to use any two adjacent sub-images in the current image block to be decoded. The block is derived as a derived image block, and M is an integer greater than or equal to 0;

a dividing unit, configured to divide the current image block to be decoded into N sub-image blocks according to the basic division manner determined by the parsing unit, where N is an integer greater than or equal to 2;

a deriving unit, configured to derive at least two adjacent sub-image blocks of the N sub-image blocks obtained by the dividing unit into a first according to a derivative manner of the target derived mode indication determined by the parsing unit Derived image block;

And a decoding unit, configured to decode the first derived image block obtained by the derived unit.
A decoding device, comprising:

Parsing a code stream for parsing the code stream to determine a basic division manner of the current image block to be decoded and a feature parameter of the current image block to be decoded, the feature parameter being at least one of a height and a width;

a determining unit, configured to determine whether a feature parameter of the current graphic block to be decoded is less than or equal to a first preset threshold;

The parsing unit is further configured to parse the code stream to determine the current image to be decoded if the determining unit determines that the feature parameter of the current image block to be decoded is less than or equal to the first preset threshold. a target derivation mode of the block neutron image block, the target derivation mode is one of a candidate derivation mode set, the candidate derivation mode set includes M preset derivation modes, and the preset derivation mode is used to indicate a derivative mode And the deriving manner is used to derivate any two adjacent sub-image blocks in the current to-be-decoded image block into one derived image block, where the derivative manners indicated by the M preset derivation patterns are different from each other, and the derivation The feature parameter of the image block is greater than a feature parameter of each sub-image block in the current image block to be decoded, the feature parameter is at least one of height and width, and M is an integer greater than or equal to 0;

a dividing unit, configured to divide the current image block to be decoded into N sub-image blocks according to the basic division manner determined by the parsing unit, where N is an integer greater than or equal to 2;

a deriving unit, configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks into a first derivative image block according to a derivative manner of the target derived mode indication determined by the parsing unit;

And a decoding unit, configured to decode the first derived image block obtained by the derived unit.
A decoding apparatus according to claim 17 or 18, characterized in that

The basic division mode is horizontal N, vertical N or horizontal and vertical N.
A decoding apparatus according to claim 19, wherein

In a case where the basic division mode is the horizontal N-score, a width of each of the N sub-image blocks is equal to a width of the current image block to be decoded, and each of the sub-image blocks The height is equal to 1/N of the height of the current image block to be decoded;

In a case where the basic division manner is the vertical N-score, a height of each of the N sub-image blocks is equal to a height of the current image block to be decoded, and each of the sub-image blocks The width is equal to 1/N of the width of the current image block to be decoded.
A decoding apparatus according to claim 19, wherein

In a case where the basic division manner is the horizontal vertical N-minute, the width of each of the N sub-image blocks is equal to the width of the current image block to be decoded.
The height of each of the sub-image blocks is equal to the height of the current image block to be decoded.
Is an integer.
A decoding apparatus according to any one of claims 17 to 21, characterized in that

When the basic division mode is horizontal N or vertical N, 0 ≤ M ≤ 2 N-1 ; when the basic division mode is horizontal and vertical N,
Is an integer.
A decoding apparatus according to claim 22, wherein

In the case that N=4, M=4, and the basic division mode is the horizontal four-point, the candidate derivative mode set includes a first preset derivative mode, a second preset derivative mode, and a third preset. a derivative mode and a fourth preset derivative mode; wherein the first preset derivative mode is a first sub-image block, a second sub-image block, and a third sub-image block are derived as one derived image block; Determining that the third sub-image block and the fourth sub-image block are derived as one derived image block; the third preset derivation mode is deriving the second sub-image block and the third sub-image block as a derived image block; the fourth preset derivative mode is that the first sub-image block and the second sub-image block are derived as one derived image block, the third sub-image block and the fourth sub-image Deriving a block into another derived image block; the first sub-image block, the second sub-image block, the third sub-image block, and the fourth sub-image block are in the current image block to be decoded Arranged in order from top to bottom;

In the case that N=4, M=4, and the basic division manner is the vertical quarter, the candidate derivative mode set includes a fifth preset derivative mode, a sixth preset derivative mode, and a seventh preset derivative. And a fifth preset derivative mode, wherein the fifth preset derivative mode is a fifth sub-image block, a sixth sub-image block, and a seventh sub-image block are derived as one derivative image block; the sixth preset The derived mode is that the sixth sub-image block, the seventh sub-image block, and the eighth sub-image block are derived as one derived image block; the seventh preset derivation mode is the sixth sub-image block and the seventh The sub-image block is derived as a derived image block; the eighth preset derivation mode is that the fifth sub-image block, the sixth sub-image block, the seventh sub-image block, and the eighth sub-image block are all maintained Independently; the fifth sub-image block, the sixth sub-image block, the seventh sub-image block, and the eighth sub-image block are sequentially arranged in the current to-be-decoded image block in order from left to right;

In the case that N=4, M=4, and the basic division manner is the horizontal vertical quarter, the candidate derivative mode set includes a ninth preset derivative mode, a tenth preset derivative mode, and an eleventh pre-predetermined a derivation mode and a twelfth preset derivation mode; wherein the ninth pre-derivative mode is a ninth sub-image block and a tenth sub-image block are derived as one derivative image block; The ninth sub-image block and the tenth sub-image block are derived as one derived image block, and the eleventh sub-image block and the twelfth sub-image block are derived as another derived image block; the eleventh preset Derived mode is derived from the tenth sub-image block and the eleventh sub-image block into one derived image block, and the ninth sub-image block and the twelfth sub-image block are derived as another derived image block; The twelfth preset derivation mode is that the ninth sub-image block and the twelfth sub-image block are derived as one derived image block; the ninth sub-image block, the tenth sub-image block, and the An eleven sub-image block and the twelfth sub-image block are in the same The image block to be decoded are sequentially arranged in a clockwise direction.
The decoding device according to any one of claims 17 to 23, wherein the parsing unit is specifically configured to:

Obtaining, from the code stream, a division identifier of the current image block to be decoded;

Determining, according to the division identifier of the current image block to be decoded, the basic division manner;

Obtaining, from the code stream, a derivative identifier of the current image block to be decoded;

Determining the target derivative mode according to the derived identifier of the current image block to be decoded.
A decoding apparatus according to claim 24, wherein

The division identifier of the current image block to be decoded is represented by a variable length code, and the derivative identifier of the current image block to be decoded is represented by a fixed length code.
The decoding apparatus according to any one of claims 17-25, wherein each of the preset derivative modes in the candidate derivative mode set is identified by a fixed length code.
An encoding device, comprising:

a determining unit, configured to determine a basic division manner of the current image block to be encoded and a target derivative mode of the sub image block in the current image block to be encoded; the target derivative mode is one of a candidate derivative mode set, and the candidate The derivative mode set includes M preset derivation modes, where the preset derivation mode is used to indicate a derivation mode, where the derivation mode is used to derive any two adjacent sub-image blocks in the current to-be-coded image block into one derivative. Image block, M is an integer greater than or equal to 0;

a dividing unit, configured to divide the current image block to be encoded into N sub-image blocks according to the basic division manner determined by the determining unit, where N is an integer greater than or equal to 2;

a deriving unit, configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks into a first derivative image block according to a derivative manner of the target derived mode indication determined by the determining unit;

a determining unit, configured to determine whether to divide the first derivative image block obtained by the derivative unit;

a coding unit, configured to: when the determining unit determines that the first derived image block is not divided, encoding the first derived image block obtained by the derived unit to obtain a code stream, where the code stream And including a split identifier of the current to-be-coded image block and a derivative identifier of the current to-be-coded image block, where the partition identifier of the current to-be-coded image block is used to indicate the basic partition mode, where the current image block to be encoded is A derived identifier is used to indicate the target derived mode.
An encoding device, comprising:

a determining unit, configured to determine a basic division manner of the current image block to be encoded and a feature parameter of the current image block to be encoded;

a dividing unit, configured to divide the current image block to be encoded into N sub-image blocks according to the basic division manner determined by the determining unit, where N is an integer greater than or equal to 2;

a determining unit, configured to determine whether a feature parameter of the current image block to be encoded is less than or equal to a first preset threshold;

The determining unit is further configured to: when the determining unit determines that the feature parameter of the current image block to be encoded is less than or equal to the first preset threshold, determine a sub-image block in the current image block to be encoded. Target derivation mode; the target derivation mode is one of a set of candidate derivation modes, the candidate derivation mode set includes M preset derivation modes, the preset derivation mode is used to indicate a derivative mode, and the derivative mode For deriving any two adjacent sub-image blocks in the current to-be-coded image block into one derived image block, where M is an integer greater than or equal to 0;

a deriving unit, configured to derivate at least two adjacent sub-image blocks of the N sub-image blocks into a first derivative image block according to a derivative manner of the target derived mode indication determined by the determining unit;

The determining unit is further configured to determine whether to divide the first derivative image block obtained by the derivative unit;

a coding unit, configured to: when the determining unit determines that the first derived image block is not divided, to encode the derived image block to obtain a code stream, where the code stream includes the current image block to be encoded a partition identifier and a derived identifier of the current image block to be encoded, the partition identifier of the current image block to be encoded is used to indicate the basic partition mode, and the derivative identifier of the current image block to be encoded is used to indicate the target Derivative mode.
The encoding device according to claim 27 or 28, wherein the determining unit is specifically configured to:

Determining at least two candidate division manners corresponding to the current image to be encoded;

Calculating a rate distortion cost of each of the at least two candidate partitioning modes;

The candidate division manner corresponding to the smallest rate distortion cost is determined as the basic division manner.
The encoding device according to any one of claims 27 to 29, wherein the determining unit is specifically configured to:

Calculating a rate distortion cost of each of the preset derivative modes in the candidate derived mode set, wherein when the basic partitioning mode is horizontal N or vertical N, 0≤M≤2 N-1 ; When the basic division method is horizontal and vertical N minutes,
Is an integer;

The preset derivative mode corresponding to the minimum rate distortion cost is determined as the target derivative mode.
An encoding apparatus according to any one of claims 27 to 30, characterized in that

The division identifier of the current image block to be decoded is represented by a variable length code, and the derivative identifier of the current image block to be decoded is represented by a fixed length code.
The encoding apparatus according to any one of claims 27 to 31, wherein each of the preset derivative modes of the candidate derived mode set is identified by a fixed length code.