WO2022227082A1

WO2022227082A1 - Block division methods, encoders, decoders, and computer storage medium

Info

Publication number: WO2022227082A1
Application number: PCT/CN2021/091736
Authority: WO
Inventors: 唐桐
Original assignee: Oppo广东移动通信有限公司
Priority date: 2021-04-30
Filing date: 2021-04-30
Publication date: 2022-11-03
Also published as: CN117063467A

Abstract

Embodiments of the present application disclose block division methods, encoders, decoders, and a computer storage medium. A block division method comprises: determining, on the basis of texture information of a video image, maximum unit size information of a current block; pre-processing the current block according to the maximum unit size information, and determining a division mode of the current block; determining a block division parameter of the current block according to the division mode; and encoding the current block according to the block division parameter. As such, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, an adaptive image texture mechanism is designed for a maximum unit size, the prediction of large-size blocks and calculation of a transformation process can be directly skipped, leading to the total number of recursions for block division being exponentially reduced, thereby significantly reducing encoding complexity while maintaining performance gain to be substantially unchanged, and reducing encoding time, and thus improving the efficiency of encoding and decoding.

Description

Block division method, encoder, decoder, and computer storage medium

technical field

The embodiments of the present application relate to the technical field of video coding and decoding, and in particular, to a block division method, an encoder, a decoder, and a computer storage medium.

Background technique

With the improvement of people's requirements for video display quality, new video application forms such as high-definition and ultra-high-definition video emerge as the times require. H.265/High Efficiency Video Coding (HEVC) has been unable to meet the needs of the rapid development of video applications. The Joint Video Exploration Team (JVET) proposed a new generation of video coding standard H.266/Multiple Versatile Video Coding (VVC), and its corresponding test model is VVC's reference software test platform (VVC Test Model, VTM).

In the current VVC block division technology, the Quad Tree (Quad Tree with nested Multi-type Tree, QTMT) mode leads to the coding complexity of VVC far exceeding HEVC; Depth, HBD) video, the prediction and rate-distortion cost calculation of large-size coding blocks may generate a lot of unnecessary overhead, waste computing resources, and increase coding time.

SUMMARY OF THE INVENTION

Embodiments of the present application provide a block division method, an encoder, a decoder, and a computer storage medium, which can reduce coding complexity and further improve coding and decoding efficiency.

The technical solutions of the embodiments of the present application can be implemented as follows:

In a first aspect, an embodiment of the present application provides a block division method, which is applied to an encoder, and the method includes:

Determine the maximum unit size information of the current block based on the texture information of the video image;

Preprocess the current block according to the maximum unit size information to determine the division mode of the current block;

Determine the block division parameters of the current block according to the division mode;

The current block is encoded according to the block partition parameter.

In a second aspect, an embodiment of the present application provides a block division method, which is applied to a decoder, and the method includes:

Parse the code stream and determine the block division parameters of the current block;

Based on the block division parameters, the code stream is parsed to determine the predicted value of the current block;

Based on the block division parameters, the code stream is parsed to determine the residual value of the current block;

Based on the predicted value and the residual value, the reconstructed value of the current block is determined.

In a third aspect, an embodiment of the present application provides an encoder, the encoder includes a first determination unit, a block division unit, and an encoding unit; wherein,

a first determining unit, configured to determine the maximum unit size information of the current block based on the texture information of the video image;

a block division unit, configured to preprocess the current block according to the maximum unit size information, to determine a division mode of the current block; and to determine a block division parameter of the current block according to the division mode;

an encoding unit, configured to encode the current block according to the block division parameter.

In a fourth aspect, an embodiment of the present application provides an encoder, where the encoder includes a first memory and a first processor; wherein,

a first memory for storing a computer program executable on the first processor;

The first processor is configured to execute the method according to the first aspect when running the computer program.

In a fifth aspect, an embodiment of the present application provides a decoder, where the decoder includes a parsing unit and a second determining unit; wherein,

a parsing unit, configured to parse the code stream, and determine the block division parameters of the current block;

The parsing unit is further configured to parse the code stream based on the block division parameter to determine the predicted value of the current block; and based on the block division parameter, parse the code stream to determine the residual value of the current block;

The second determination unit is configured to determine the reconstruction value of the current block based on the predicted value and the residual value.

In a sixth aspect, an embodiment of the present application provides a decoder, the decoder includes a second memory and a second processor; wherein,

a second memory for storing a computer program executable on the second processor;

The second processor is configured to execute the method according to the second aspect when running the computer program.

In a seventh aspect, an embodiment of the present application provides a computer storage medium, where the computer storage medium stores a computer program, and when the computer program is executed, the method described in the first aspect or the method described in the second aspect is implemented.

The embodiments of the present application provide a block division method, an encoder, a decoder, and a computer storage medium. On the encoder side, based on the texture information of the video image, the maximum unit size information of the current block is determined; The block is preprocessed to determine the division mode of the current block; the block division parameter of the current block is determined according to the division mode; and the current block is encoded according to the block division parameter. On the decoder side, the code stream is parsed to determine the block division parameters of the current block; based on the block division parameters, the code stream is parsed to determine the predicted value of the current block; based on the block division parameters, the code stream is parsed to determine the residual value of the current block; And, based on the predicted value and the residual value, the reconstructed value of the current block is determined. In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

Description of drawings

1 is a schematic structural diagram of a multi-type tree provided by the related art;

2 is a schematic flowchart of a block division provided by the related art;

3 is a schematic structural diagram of another block division provided by the related art;

4A is a schematic block diagram of the composition of an encoder according to an embodiment of the present application;

4B is a schematic block diagram of the composition of a decoder according to an embodiment of the present application;

FIG. 5 is a schematic flowchart of a block division method provided by an embodiment of the present application;

FIG. 6 is a schematic flowchart of determining maximum unit size information according to an embodiment of the present application;

FIG. 7 is a detailed schematic flow chart of determining maximum unit size information according to an embodiment of the present application;

FIG. 8 is a schematic flowchart of another block division method provided by an embodiment of the present application;

9 is a schematic diagram of the composition and structure of an encoder provided by an embodiment of the present application;

10 is a schematic diagram of a specific hardware structure of an encoder provided by an embodiment of the application;

11 is a schematic diagram of the composition and structure of a decoder provided by an embodiment of the application;

FIG. 12 is a schematic diagram of a specific hardware structure of a decoder provided by an embodiment of the present application.

Detailed ways

In order to have a more detailed understanding of the features and technical contents of the embodiments of the present application, the implementation of the embodiments of the present application will be described in detail below with reference to the accompanying drawings.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which this application belongs. The terms used herein are only for the purpose of describing the embodiments of the present application, and are not intended to limit the present application.

In the following description, reference is made to "some embodiments" which describe a subset of all possible embodiments, but it is understood that "some embodiments" can be the same or a different subset of all possible embodiments, and Can be combined with each other without conflict. It should also be pointed out that the term "first\second\third" involved in the embodiments of the present application is only used to distinguish similar objects, and does not represent a specific ordering of objects. It is understood that "first\second\" Where permitted, the specific order or sequence may be interchanged so that the embodiments of the present application described herein can be implemented in sequences other than those illustrated or described herein.

In a video image, a first image component, a second image component, and a third image component are generally used to represent a coding block (Coding Block, CB); wherein, the three image components are a luminance component and a blue chrominance component respectively. and a red chrominance component, specifically, the luminance component is usually represented by the symbol Y, the blue chrominance component is usually represented by the symbol Cb or U, and the red chrominance component is usually represented by the symbol Cr or V; in this way, the video image can use the YCbCr format Representation can also be represented in YUV format.

Before the embodiments of the present application are described in further detail, the nouns and terms involved in the embodiments of the present application will be described first. The nouns and terms involved in the embodiments of the present application are applicable to the following explanations:

Moving Picture Experts Group (MPEG)

International Standardization Organization (ISO)

International Electrotechnical Commission (IEC)

Joint Video Experts Team (JVET)

Alliance for Open Media (AOM)

H.265/High Efficiency Video Coding (HEVC)

H.266/Versatile Video Coding (VVC)

VVC's reference software test platform (VVC Test Model, VTM)

Audio Video Standard (AVS)

High-Performance Model (HPM) of AVS

Binary Tree (BT)

Ternary Tree (TT)

Quad Tree (Quad Tree, QT)

Multi-type Tree (MT)

Quad Tree with nested Multi-type Tree (QTMT)

High Bit Depth (HBD)

Quantization Parameter (QP)

Coding unit (Coding Unit, CU)

Coding Tree Unit (CTU)

Currently, for the block division technology of VVC, it adopts a more complex coding unit division structure than HEVC, such as the QTMT mode. Specifically, on the basis of the HEVC quadtree (QT) division, two kinds of binary trees (ie vertical binary tree and horizontal binary tree) and two kinds of ternary trees (ie vertical ternary tree and horizontal ternary tree) are added, wherein, A binary tree (BT) and a ternary tree (TT) may be collectively referred to as a multi-type tree (MT). FIG. 1 shows a schematic structural diagram of a multi-type tree provided by the related art.

It can be understood that, for the CTU, the CTU is firstly divided by the quadtree, and then the leaf nodes of the quadtree can be further divided by the MT. Specifically, the flow of CU division is shown in FIG. 2 . In Figure 2, for the CTU/quadtree node, first determine whether to perform quadtree division; if the value of the identification information (such as flag) at this time is 1, indicating that quadtree division is performed, then quadtree division can be obtained. tree node; if the value of the identification information (such as flag) at this time is 0, indicating that no quad-tree division is performed, then the quad-leaf node/multi-type tree node can be obtained, and then it is judged again whether to perform multi-type tree division, Until the division obtains a multi-type tree node divided by vertical binary division/multi-type tree node divided by vertical trigeminal division/multi-type tree node divided by horizontal binary division/multi-type tree node divided by horizontal trigeminal division.

It should be noted that several points of Figure 2 are explained as follows:

(a) The default size of the CTU of VVC is 128×128, and the minimum CU size is 4×4;

(b) The CTU is first divided into 4 sub-CUs in QT mode by default;

(c) If a CU adopts MT division, then subsequent QT division cannot be performed;

(d) Theoretically, QT nodes can be divided according to the 5 ways shown in Figure 2, and MT nodes can be divided according to the 4 ways shown in Figure 2. Among them, the results of the five division methods of QT nodes include: quad-tree node, multi-type tree node divided by vertical binary tree, multi-type tree node divided by vertical ternary tree, multi-type tree node divided by horizontal binary tree, multi-type tree node divided by horizontal ternary tree Multi-type tree nodes; the results of the four division methods of MT nodes include: multi-type tree nodes divided by vertical binary tree, multi-type tree nodes divided by vertical trigeminal tree, multi-type tree nodes divided by horizontal binary tree, multi-type tree nodes divided by horizontal trigeminal tree tree node;

(e) FIG. 3 shows a schematic structural diagram of a QTMT block division provided by the related art, which can be regarded as a specific example of a final CTU division manner of VVC.

In the encoder or decoder, the QTMT block division is located in the intra/inter prediction module. According to different block divisions, the corresponding reference blocks are found for prediction, and then the division mode with the least rate-distortion cost is found to obtain the final prediction residual. If it is poor, the next steps such as transformation and quantization can be performed.

In a specific example, an implementation method of an encoder using QTMT is: first, the input image is divided into multiple non-overlapping CTU blocks. Then, each CTU is processed in turn according to the raster scanning order, and the CTU is divided into several CUs, which mainly includes the following four steps: ① Calculate the first rate-distortion cost result of predictive coding when it is not divided (represented by RdCost0); ② Set the The CTU is divided according to the QT mode and predicted and encoded, and the second rate-distortion cost result (represented by RdCost1) is calculated; ③ Compare RdCost0 and RdCost1, if RdCost1 is smaller, continue to process 4 sub-CUs in sequence; ④ Each CU is in QT or MT mode Partition prediction, calculate the rate-distortion cost result of each division method, select the current optimal RdCost by comparison, and repeat recursively until a block division mode with the smallest rate-distortion cost is selected. Finally, the residual block is obtained by calculating the optimal block division mode, and then the residual block is transformed, quantized, and entropy encoded, and the prediction information such as the block division mode is encoded, and the output code stream is waiting for transmission.

In another specific example, an implementation method of a decoder using QTMT is: first, perform entropy decoding, inverse quantization, and inverse transformation on the input code stream to obtain a residual block; then, reconstruct an image according to the residual block , the reconstruction process mainly includes the following three steps: 1. Determine the partition tree of the current CTU according to the prediction information such as the block partition mode; 2. Process each CU of the partition tree in turn according to the raster scan order, and use the motion vector and other information to find the prediction block; 3. The residual value and the predicted value of the current CU are superimposed to obtain the reconstructed CU. Finally, the reconstructed image is sent to the Deblocking Filter (DBF)/Sample Adaptive Offset (SAO) filter/Adaptive Loop Filter (ALF), and the filtered image Send it to the buffer area and wait for the video to play.

However, in the existing VVC block division technology, even if only quadtree division is considered, there are 4 ⁵ +4 ⁴ +4 ³ +4 ² +4 ¹ +4 ⁰ =1365 division modes, far exceeding HEVC of 341 modes. In addition, coupled with the division methods such as binary tree and ternary tree, the total number of divisions is theoretically as high as several thousand. As a result, the current QTMT mode results in much more coding complexity for VVC than HEVC; for example, it takes days to encode a high-definition video sequence (1080p). In addition, the target quality of high-bit-depth (HBD) video encoding is ultra-high-definition. It can be seen from the recommended test configuration of VTM that the encoding QP of HBD sequence in 12bit configuration is -13, -8, -3, 2, 7, 12 , so the final size of most coded blocks is smaller. That is to say, when the existing VVC block division technology encodes HBD video, the prediction and rate-distortion cost (Rate-Distortion cost, RDcost) calculation of large-size coding blocks will generate a lot of unnecessary overhead, waste computing resources, increase encoding time.

The embodiment of the present application provides a block division method. On the encoder side, based on the texture information of a video image, the maximum unit size information of the current block is determined; the current block is preprocessed according to the maximum unit size information, and the division of the current block is determined. mode; determining a block division parameter of the current block according to the division mode; and encoding the current block according to the block division parameter. On the decoder side, the code stream is parsed to determine the block division parameters of the current block; based on the block division parameters, the code stream is parsed to determine the predicted value of the current block; based on the block division parameters, the code stream is parsed to determine the residual value of the current block; And, based on the predicted value and the residual value, the reconstructed value of the current block is determined.

In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

The embodiments of the present application will be described in detail below with reference to the accompanying drawings.

Referring to FIG. 4A , it shows a schematic block diagram of the composition of an encoder provided by an embodiment of the present application. As shown in FIG. 4A, the encoder 10 includes a transform and quantization unit 101, an intra-frame estimation unit 102, an intra-frame prediction unit 103, a motion compensation unit 104, a motion estimation unit 105, an inverse transform and inverse quantization unit 106, and a filter control unit. The analysis unit 107, the filtering unit 108, the encoding unit 109, the decoded image buffering unit 110, etc., wherein the filtering unit 108 can implement DBF filtering/SAO filtering/ALF filtering, and the encoding unit 109 can implement header information encoding and context-based adaptive binary Arithmetic coding (Context-based Adaptive Binary Arithmatic Coding, CABAC). For the input original video signal, a video coding block can be obtained by dividing the coding tree unit (Coding Tree Unit, CTU), and then the residual pixel information obtained after intra-frame or inter-frame prediction is transformed and quantized by the quantization unit 101. The video coding block is transformed, including transforming residual information from the pixel domain to the transform domain, and quantizing the resulting transform coefficients to further reduce the bit rate; the intra-frame estimation unit 102 and the intra-frame prediction unit 103 are used for Intra prediction is performed on the video coding block; specifically, the intra prediction unit 102 and the intra prediction unit 103 are used to determine the intra prediction mode to be used to encode the video coding block; the motion compensation unit 104 and the motion estimation unit 105 is used to perform inter-predictive encoding of the received video encoding block relative to one or more blocks in one or more reference frames to provide temporal prediction information; the motion estimation performed by the motion estimation unit 105 is to generate a motion vector. process, the motion vector can estimate the motion of the video coding block, and then the motion compensation unit 104 performs motion compensation based on the motion vector determined by the motion estimation unit 105; after determining the intra prediction mode, the intra prediction unit 103 also For providing the selected intra prediction data to the encoding unit 109, and the motion estimation unit 105 also sends the calculated motion vector data to the encoding unit 109; in addition, the inverse transform and inverse quantization unit 106 is used for the video Reconstruction of the coding block, reconstructing the residual block in the pixel domain, the reconstructed residual block removing the blocking artifacts by the filter control analysis unit 107 and the filtering unit 108, and then adding the reconstructed residual block to the decoding A predictive block in the frame of the image buffer unit 110 is used to generate a reconstructed video coding block; the coding unit 109 is used for coding various coding parameters and quantized transform coefficients. In the CABAC-based coding algorithm, The context content can be based on adjacent coding blocks, and can be used to encode information indicating the determined intra-frame prediction mode, and output a code stream of the video signal; and the decoded image buffer unit 110 is used to store the reconstructed video coding blocks, for Forecast reference. As the video image coding proceeds, new reconstructed video coding blocks are continuously generated, and these reconstructed video coding blocks are all stored in the decoded image buffer unit 110 .

Referring to FIG. 4B , it shows a schematic block diagram of the composition of a decoder provided by an embodiment of the present application. As shown in FIG. 4B, the decoder 20 includes a decoding unit 201, an inverse transform and inverse quantization unit 202, an intra prediction unit 203, a motion compensation unit 204, a filtering unit 205, a decoded image buffer unit 206, etc., wherein the decoding unit 201 Header decoding and CABAC decoding can be implemented, and the filtering unit 205 can implement DBF filtering/SAO filtering/ALF filtering. After the input video signal is subjected to the encoding process of FIG. 4A, the code stream of the video signal is output; the code stream is input into the video decoding system 20, and firstly passes through the decoding unit 201 to obtain the decoded transform coefficient; Inverse transform and inverse quantization unit 202 processes to generate residual blocks in the pixel domain; intra prediction unit 203 may be used to generate based on the determined intra prediction mode and data from previously decoded blocks of the current frame or picture Prediction data for the current video decoding block; motion compensation unit 204 determines prediction information for the video decoding block by parsing the motion vector and other associated syntax elements, and uses the prediction information to generate predictive information for the video decoding block being decoded block; a decoded video block is formed by summing the residual block from inverse transform and inverse quantization unit 202 and the corresponding predictive block produced by intra prediction unit 203 or motion compensation unit 204; the decoded video signal Video quality may be improved by filtering unit 205 in order to remove blocking artifacts; decoded video blocks are then stored in decoded image buffer unit 206, which stores reference images for subsequent intra prediction or motion compensation , and is also used for the output of the video signal, that is, the restored original video signal is obtained.

It should be noted that the block division method in the embodiment of the present application can be applied to a video codec chip, and the use of the QTMT mode can significantly improve the coding performance. Here, it can be applied to the intra/inter prediction part as shown in FIG. 4A (represented by a black bold box, specifically including the intra-frame estimation unit 102, the intra-frame prediction unit 103, the motion compensation unit 104, the motion estimation unit 105), it can also be applied to the intra/inter prediction part as shown in FIG. 4B (represented by a bold black box, specifically including the intra prediction unit 203 and the motion compensation unit 204). That is to say, the block division method in the embodiments of the present application can be applied to a video encoding system (referred to as "encoder" for short), also can be applied to a video decoding system (referred to as "decoder" for short), or even simultaneously It is applied to the video coding system and the video decoding system, but no limitation is made here.

It should also be noted that when the embodiments of the present application are applied to the encoder 10, the “current block” specifically refers to the block currently to be encoded in the video image (may also be referred to as “encoding blocks” for short); When applied to the decoder 20, the "current block" specifically refers to the block currently to be decoded in the video image (it may also be referred to as a "decoding block" for short).

In an embodiment of the present application, referring to FIG. 5 , it shows a schematic flowchart of a block division method provided by an embodiment of the present application. As shown in Figure 5, the method may include:

S501: Determine maximum unit size information of the current block based on the texture information of the video image.

It should be noted that the block division method in the embodiment of the present application is applied to an encoder. Here, for a video image, the video image can be divided into a plurality of image blocks, and each image block to be encoded can be called an encoding block, and the current block here specifically refers to the encoding block currently to be encoded. It may be a CTU, or even a CU, etc., which is not limited in any embodiment of the present application.

It should also be noted that the embodiments of the present application mainly provide a fast block division technology for high bit depth video based on texture analysis, that is, applied to high bit depth video. Therefore, in some embodiments, the method may further include:

Determine the identification information of the video image;

When the identification information of the video image indicates that the video image is a high bit depth video, the step of determining the maximum unit size information of the current block based on the texture information of the video image is performed.

In this embodiment of the present application, the determination of whether a video image is a high bit depth video may be represented by the identification information of the video image. Specifically, in some embodiments, the determining the identification information of the video image may include:

If the identification information of the video image indicates that the video image is a high bit depth video, then determine that the value of the identification information of the video image is the first value; or,

If the identification information of the video image indicates that the video image is a non-high bit-depth video, the value of the identification information of the video image is determined to be the second value.

It should be noted that the first value and the second value are different, and the first value and the second value may be in the form of parameters or in the form of numbers. In general, the identification information of the video image is a parameter written in the profile (profile), but the identification information of the video image may also be a flag (flag), which is not limited here.

It should also be noted that if the identification information of the video image is a flag, then in a specific example, the first value may be set to 1, and the second value may be set to 0; in another specific example, the first value may be set to 0. One value can also be set to true, and the second value can also be set to false; even in another specific example, the first value can also be set to 0, and the second value can also be set to 1; or, the first value can also be set to Can be set to false, and the second value can also be set to true. The first value and the second value in this embodiment of the present application are not limited in any way.

That is to say, the embodiments of the present application provide an encoding method, and specifically provide a block division method. More specifically, the embodiments of the present application design an adaptive maximum unit size mechanism based on image texture for high-bit-depth video, so that Subsequent computations of prediction and transform processes for large-sized blocks can be skipped directly.

Further, in some embodiments, the method may further include: encoding the identification information of the video image, and writing the encoded bits into the code stream. In this way, after the encoder writes the identification information of the video image into the code stream, the decoder can directly determine whether the video image is a high-bit-depth video by parsing the code stream, so as to facilitate the decoder to perform subsequent operations.

It should also be noted that, for the maximum unit size information, it can be the BT maximum unit size (represented by maxBtSize) used to limit the binary tree division, or the TT maximum unit size information used to limit the ternary tree division (represented by maxTtSize indicates). In other words, the maximum cell size information may be represented by maxBtSize or maxTtSize.

Further, in some embodiments, for the maximum unit size information, its level may be at least one of the following: sequence level and image level.

Exemplarily, when determining the maximum unit size information, a video sequence may be input, and then the initial frame is used as a video image, and the maximum unit size information corresponding to the initial frame is determined accordingly. In a specific example, if the maximum unit size information is at the sequence level, the entire video sequence can use this maximum unit size information. In another specific example, if the maximum unit size information is at the picture level (or called "frame level"), then for the entire video sequence, the maximum unit size information corresponding to each frame may be different, which is Each frame is used as a video image to determine the corresponding maximum unit size information of each frame. It should also be noted that for different blocks in the same frame, the maximum unit size information is the same.

In this way, in the embodiment of the present application, it is first necessary to perform texture analysis on the high-bit-depth video, and after determining the texture information of the video image, the maximum unit size information can be further determined, that is, the adaptive maximum unit size based on the image texture is realized. Cell size mechanism.

S502: Preprocess the current block according to the maximum unit size information to determine the division mode of the current block.

It should be noted that after the maximum unit size information is determined, the current block can be preprocessed according to the maximum unit size information, such as calculating the rate-distortion cost value under different division modes, and then selecting the optimal rate-distortion cost value (or called "minimum rate-distortion cost") to determine the partition mode of the current block.

That is, in this embodiment of the present application, the division mode may be determined by calculating the rate-distortion cost. In some embodiments, the current block is preprocessed according to the maximum unit size information, and the division mode of the current block is determined, which may include:

Use the maximum unit size information to divide the current block to obtain at least one first node sub-block, and calculate the first rate-distortion cost value;

Divide the first node sub-block by using the preset division mode, obtain at least one second node sub-block, and calculate the second rate-distortion cost value;

The division mode of the current block is determined according to the comparison result of the first rate-distortion cost value and the second rate-distortion cost value.

Further, in some embodiments, determining the division mode of the current block according to the comparison result of the first rate-distortion cost value and the second rate-distortion cost value may include:

comparing the first rate-distortion cost value and the second rate-distortion cost value;

In the case where the second rate-distortion cost value is less than the first rate-distortion cost value, use a preset division mode to divide the second node sub-block to obtain at least one next-level second node sub-block, and calculate the third rate-distortion cost value;

The division mode of the current block is determined according to the second rate-distortion cost value and the third rate-distortion cost value.

It should be noted that when the second rate-distortion cost value is less than the first rate-distortion cost value, the node sub-blocks need to be further divided by using the preset division mode, which is a recursive process until the minimum rate-distortion is determined. cost value. In some embodiments, determining the division mode of the current block according to the second rate-distortion cost value and the third rate-distortion cost value may include:

In the case that the third rate-distortion cost value is less than the second rate-distortion cost value, update the second rate-distortion cost value by using the third rate-distortion cost value, and return to perform dividing the second node sub-block by using the preset dividing mode, The steps of obtaining at least one next-level second node sub-block, and calculating the third rate-distortion cost value, until the minimum rate-distortion cost value is determined;

The division mode of the current block is determined according to the division mode corresponding to the minimum rate-distortion cost value.

It should be noted that the first node sub-block may be a node sub-block obtained by dividing the current block for the first time, and the second node sub-block may be a node sub-block obtained by continuing division based on a preset division mode, or It is regarded as starting from the second division, and the node sub-blocks obtained by the subsequent step-by-step division can be collectively referred to as the second node sub-blocks.

Exemplarily, for the ith level, the second rate-distortion cost value represents the rate-distortion cost value that does not continue to divide the ith level; and by using the preset division mode to continue dividing the node sub-blocks of the current level, the ith level can be obtained. For the node sub-block of the +1 level, the third rate-distortion cost value can be calculated at this time, and the third rate-distortion cost value here represents the rate-distortion cost value of continuing to divide the i-th level. Then perform a comparison between the second rate distortion cost value and the third rate distortion cost value. If the third rate distortion cost value is less than the second rate distortion cost value, for the i+1th level, the third rate distortion cost value can be calculated at this time. It is regarded as the second rate-distortion cost value that does not continue to divide the i+1th level; and by using the preset division mode to continue to divide the node sub-block of the current level, the node sub-block of the i+2th level can be obtained, and Calculate the new third rate-distortion cost value. At this time, the third rate-distortion cost value represents the rate-distortion cost value for the continued division of the i+1 level, and execute the second rate-distortion cost value and the third rate-distortion cost value again. value comparison. In this way, when the third rate-distortion cost value is less than the second rate-distortion cost value, the node sub-blocks of the current level can be divided into the next-level node sub-blocks again, and the rate-distortion cost value comparison can be continued, and the recursive cycle has been carried out. Go on until the minimum rate-distortion cost value is determined, and then the division mode corresponding to the minimum rate-distortion cost value is determined as the division mode.

It should also be noted that, in this embodiment of the present application, the preset division mode may include a quad-tree division mode and/or a multi-type tree division mode; wherein, the multi-type tree division mode may include at least one of the following: vertical Binary tree partition mode, horizontal binary tree partition mode, vertical ternary tree partition mode and horizontal ternary tree partition mode.

Here, the vertical binary tree division mode and the horizontal binary tree division mode may be collectively referred to as a binary tree division mode, and the vertical ternary tree division mode and the horizontal ternary tree division mode may be collectively referred to as a ternary tree division mode. In this way, for "using the preset division mode to divide the first node sub-block to obtain at least one second node sub-block", specifically, if the first node sub-block is divided by using the quad-tree division mode, it can be obtained Four second node sub-blocks; if the first node sub-block is divided by the binary tree division mode, two second node sub-blocks can be obtained; if the first node sub-block is divided by the ternary tree division mode, three second node sub-blocks can be obtained. A second node sub-block.

Here, the sub-block of the first node is divided by a preset division mode, which may be a quad-tree division mode, a vertical binary tree division mode, a horizontal binary tree division mode, a vertical ternary tree division mode, a horizontal ternary tree division mode, or the like. Each division mode divides the first node sub-block, and can calculate a rate-distortion cost value respectively, and then select the minimum rate-distortion cost value from the calculated rate-distortion cost value as the third rate-distortion cost value; When the third rate-distortion cost value is less than the second rate-distortion cost value, the obtained second node sub-blocks will continue to be divided, and the recursive cycle will continue until the minimum rate-distortion cost value is determined. The division mode corresponding to the distortion cost value is used to determine the division mode of the current block.

Besides, in some embodiments, the method may further include: when the second rate-distortion cost value is greater than or equal to the first rate-distortion value, directly dividing the current block according to the maximum unit size information Determines the division mode of the current block.

That is to say, if the second rate-distortion cost value is greater than or equal to the first rate-distortion value, it means that the rate-distortion cost is the smallest when the node sub-blocks are no longer divided into the next level, then it can be directly adjusted according to the maximum unit size information. The division mode of the current block is determined as the division mode of the current block, and at this time, it is no longer necessary to continue dividing the obtained node sub-blocks.

In this way, after the maximum unit size information is determined by using the texture information of the video image, the current block can be preprocessed according to the maximum unit size information, and then the division mode of the current block can be determined, so as to realize the block division operation of the current block.

S503: Determine block division parameters of the current block according to the division mode.

S504: Encode the current block according to the block division parameter.

It should be noted that the division mode is a specific block division manner, and the block division parameter here may be identification information indicating block division, such as split_cu_flag[x0][y0]. After the block division parameters are determined, the current block may be encoded according to the block division parameters.

In a possible implementation manner, the encoding of the current block according to the block division parameter may include:

The block division parameters are encoded, and the encoded bits are written into the code stream.

It should be noted that, after the block division parameters are determined, in order to enable the decoder to obtain the block division parameters, the encoder needs to encode the block division parameters, and then writes the code stream to wait for transmission from the encoder to the decoder.

In another possible implementation manner, the encoding of the current block according to the block division parameter may include:

Divide the current block into one or more node sub-blocks according to the block division parameters;

According to the preset processing order of node sub-blocks, the prediction parameters of each node sub-block are sequentially determined;

According to the prediction parameters, determine the predicted value of the node sub-block;

Determine the residual value of the node sub-block according to the original value and the predicted value of the node sub-block;

The prediction parameters and residual values of the node sub-blocks are encoded, and the encoded bits are written into the code stream.

It should be noted that the preset processing order of node sub-blocks may be the preset scanning order. Here, the preset scanning sequence may be diagonal, Zigzag, horizontal, vertical, 4×4 sub-block scanning, or any other raster scanning sequence, which is not limited in this embodiment of the present application.

It should also be noted that after the block division operation is performed on the current block by using the division mode, the residual value can be determined. At this time, the residual value can be transformed, quantized and entropy encoded, and the prediction parameters of the node sub-blocks can be encoded, and then written into the code stream to be transmitted from the encoder to the decoder.

In addition, the embodiment of the present application may also provide a code stream, where the code stream is generated by bit encoding according to relevant parameters. The relevant parameters may include at least one of the following: a block division parameter, a prediction parameter of a node sub-block, a residual value, and identification information of a video image.

This embodiment provides a block division method, which is applied to an encoder. Based on the texture information of the video image, the maximum unit size information of the current block is determined; the current block is preprocessed according to the maximum unit size information, and the division mode of the current block is determined; the block division parameters of the current block are determined according to the division mode; parameter to encode the current block. In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

In another embodiment of the present application, for the determination of the maximum unit size information of the current block, see FIG. 6 , which shows a schematic flowchart of determining the maximum unit size information provided by an embodiment of the present application. As shown in Figure 6, the process may include:

S601: Divide the video image into blocks to obtain N blocks of a preset size; wherein, N is an integer greater than zero, and the N blocks do not overlap each other.

It should be noted that the preset size refers to a preset block size value. Here, the preset size can be any one of 8, 16, 32, 64, etc., or any one of 8×8, 16×16, 32×32, 64×64, etc., which is implemented in this application. Examples are not specifically limited. In a specific example, the preset size may be 64×64; at this time, for the video image, it may be divided into N non-overlapping blocks of 64×64.

S602: Perform texture analysis on the N blocks to determine a first quantity; where the first quantity represents the quantity of blocks whose texture values are smaller than a first threshold in the N blocks.

Here, for the number of blocks with low texture complexity, that is, the determination of the first number, in some embodiments, performing texture analysis on N blocks to determine the first number may include:

Calculate the texture values of N blocks;

comparing the texture values of the N blocks with the first threshold in sequence;

According to the comparison result, the number of blocks whose texture value is less than the first threshold is counted to obtain the first number.

In a specific implementation, the method may further include:

Set the initial value of the statistical value to zero;

comparing the texture value of the ith block with the first threshold;

If the texture value of the i-th block is less than the first threshold, the statistic value is incremented by 1, and i=i+1;

When i is less than N, continue to perform the step of comparing the texture value of the ith block with the first threshold;

When i is equal to N, the obtained statistical value is determined as the first number; wherein, i is an integer greater than or equal to zero.

It should be noted that the first number represents the number of blocks with low texture complexity among the N blocks. Here, the level of texture complexity can be measured by the first threshold. If the texture value is greater than or equal to the first threshold, it indicates that the texture of the block is relatively complex; if the texture value is less than the first threshold, it indicates that the texture of the block is relatively complex. Simple (ie low texture complexity).

It should also be noted that, for the first threshold, it can be represented by T. In general, the value of T is equal to 4×2^(bitdepth-8), but it is not specifically limited.

S603: Determine maximum unit size information of the current block according to the comparison result between the first number and the second threshold.

It should be noted that, after the first number is determined, a second threshold may also be set for determining the maximum unit size information of the current block. In some embodiments, determining the maximum unit size information of the current block according to the comparison result between the first number and the second threshold, which may include:

determining a ratio of the first number to N, and comparing the ratio with a second threshold;

In the case that the ratio is smaller than the second threshold value, the maximum unit size information of the current block is determined to be the first size value.

Further, when the ratio is greater than or equal to the second threshold, the method may further include:

comparing the ratio with a third threshold;

If the ratio is smaller than the third threshold, the maximum unit size information of the current block is determined to be the second size value.

Further, the method may further include: if the ratio is greater than or equal to a third threshold, determining the maximum unit size information of the current block as a default value.

It should be noted that the first threshold is different from the second threshold and the third threshold, and the second threshold is smaller than the third threshold. Here, the first threshold may be represented by c1, and the second threshold may be represented by c2. In a specific example, the value of c1 is equal to 0.15, and the value of c2 is equal to 0.3, but the embodiment of the present application does not specifically limit it.

In addition, the first size value is different from the second size value. In a specific example, the first size value may be 8, and the second size value may be 16, but the embodiment of the present application does not specifically limit it.

It should also be noted that, assuming that the first quantity is represented by j, the ratio can be represented by j/N, and the value of the first size is 8 and the value of the second size is 16. In this way, if j/N<c1, then the maximum cell size information is 8; if c1≤j/N<c2, then the maximum cell size information is 16; if j/N≥c2, then the maximum cell size information is the default value.

In this embodiment of the present application, the calculation of the texture value may be determined according to variance calculation, or may be determined according to other methods, such as a method of summing the absolute values of the horizontal gradient and the vertical gradient.

In a possible implementation manner, the calculating texture values of N blocks may include:

Calculate the variance value of the kth block to obtain the texture value of the kth block.

In another possible implementation manner, the calculating the texture values of the N blocks may include:

Determine the absolute value of the horizontal gradient and the absolute value of the vertical gradient of the kth block;

The absolute value of the horizontal gradient and the absolute value of the vertical gradient are summed to obtain the texture value of the kth block.

Here, since the embodiment of the present application involves a total of N blocks, the value of k is an integer greater than or equal to zero and less than N. In this way, for the texture analysis calculation, the calculated variance value can be determined as the texture value, or the sum of the absolute value of the horizontal gradient and the absolute value of the vertical gradient can be calculated, and the calculated sum value can be determined as the texture value, But it does not make any limitation.

In addition, for the maximum unit size information, its level may be at least one of the following: sequence level and image level.

Exemplarily, when determining the maximum unit size information, a video sequence may be input, and then the initial frame is used as a video image, and the maximum unit size information corresponding to the initial frame is determined accordingly. In a specific example, if the maximum unit size information is at the sequence level, the entire video sequence can use this maximum unit size information. In another specific example, if the maximum unit size information is at the image level, then for the entire video sequence, the maximum unit size information corresponding to each frame is the video image, and then the corresponding maximum unit size information for each frame is determined separately. maximum element size information. It should also be noted that for different blocks in the same frame, the maximum unit size information is the same.

In a specific example, taking the input of a video sequence as an example, see FIG. 7 , which shows a detailed schematic flow chart of determining maximum unit size information provided by an embodiment of the present application. As shown in Figure 7, the detailed process may include:

S701: Input a video sequence, and set j=0, i=0.

S702: Divide the initial frame into N non-overlapping blocks of 64×64.

S703: Calculate the variance value var _i of the ith block.

S704: Determine whether var _i < T?

S705: j=j+1.

S706: i=i+1.

S707: Determine whether i=N?

S708: Judge whether j/N<c1?

S709: Judge whether j/N<c2?

S710: Determine maxBtSize=8 and maxTtSize=8.

S711: Determine maxBtSize=16 and maxTtSize=16.

S712: Determine the default maxBtSize and the default maxTtSize.

Here, T represents the first threshold described in the embodiment of the present application, c1 represents the second threshold described in the embodiment of the present application, and c2 represents the third threshold described in the embodiment of the present application. Exemplarily, T=4×2^(bitdepth-8), c1=0.15, c2=0.3.

In this embodiment of the present application, the maximum unit size information may be the BT maximum unit size (represented by maxBtSize) used to limit the binary tree division, or the TT maximum unit size information used to limit the ternary tree division ( Expressed by maxTtSize). In other words, the maximum unit size information can be represented by maxBtSize or maxTtSize; but under the same conditions, maxBtSize and maxTtSize have the same values.

It should be noted that the maxBtSize and maxTtSize shown in FIG. 7 may be at the sequence level. That is, for the video sequence, maxBtSize and maxTtSize can be determined only according to the initial frame, and the determined maxBtSize and maxTtSize can be used for the entire video sequence. In addition, maxBtSize and maxTtSize can also be modified from sequence level to image level, that is, maxBtSize and maxTtSize can be calculated according to the method shown in FIG. 7 for each frame of video image as block size constraints of the current frame.

It should also be noted that, in FIG. 7 , i represents the execution order of variance calculation for each of the N blocks and whether the variance is less than T, and j represents the cumulative value of the number of blocks with variance less than T among the N blocks. For step S704, if the judgment result is yes, it means that the variance of the i-th block is less than T, then step S705 and S706 are executed, that is, not only the processing of adding 1 to j is performed, but also processing of adding 1 to i is performed; If the result is no, it means that the variance of the i-th block is greater than or equal to T, then step S706 is executed, that is, at this time, it is no longer necessary to perform the processing of adding 1 to j, and only processing of adding 1 to i is performed. For step S707, if the judgment result is no, it means that all the N blocks have not been executed, then return to step S703, that is, continue the operation of the next block (for example, calculate the variance of the ith block, and then further judge Whether the variance of the i-th block is less than T, etc.); if the judgment result is yes, it means that the N blocks are all executed, then execute step S708, that is, after obtaining j, determine the ratio of j to N j/N, and compare j/N with c1. Specifically, for step S708, if the judgment result is yes, it means that j/N is less than c1, then step S710 is executed, that is, it is determined that the maximum unit size information of the current block is 8; if the judgment result is no, it means that j/N N is greater than or equal to c1, then step S709 is executed, and j/N needs to be further compared with c2. Specifically, for step S709, if the judgment result is yes, it means that j/N is less than c2, then step S711 is executed, that is, it is determined that the maximum unit size information of the current block is 16; if the judgment result is no, it means that j/N N is greater than or equal to c2, then step S712 is executed, that is, the maximum unit size information of the current block is determined as the default value (including the default maxBtSize and the default maxTtSize).

In short, the embodiments of the present application provide a high-bit-depth video fast division technology based on texture analysis. The technology first performs texture analysis on the high-bit-depth video, and then designs an adaptive maximum size mechanism for multi-type tree units based on this. When the texture of most areas of the video image is complex, the maximum size of the multi-type tree unit is smaller; and when there is a large flat area in the video image, the maximum size of the multi-type tree unit is larger. See Figure 7 for details. If j/N<c1, then maxBtSize is 8 and maxTtSize is 8; if c1≤j/N<c2, then maxBtSize is 16 and maxTtSize is 16; if j/N≥c2, then maxBtSize is the default maxBtSize, and maxTtSize is the default maxTtSize.

Thus, in another specific example, an implementation method of an HBD sequence encoder using QTMT fast block division is as follows: First, input the first frame of the video sequence, and determine the maximum BT according to the flow shown in FIG. 7 . The cell size is maxBtSize, and the TT maximum cell size is maxTtSize. Next, the input image is divided into non-overlapping CTU blocks. Then, each CTU is processed in turn according to the raster scan order, and the CTU is divided into several CUs, which mainly includes the following four steps: ① Divide the CTU into several non-overlapping CUs according to the maxBtSize×maxBtSize size (or maxTtSize×maxTtSize size) , calculate the rate-distortion cost value RdCost0 of predictive coding at this time. ② The CU is divided and predicted according to the QT mode or the MT mode, and the rate-distortion cost value RdCost1 is calculated. ③ Compare RdCost0 and RdCost1, if RdCost1 is smaller, continue to process each sub-CU in turn. ④ Among them, each CU is divided and predicted according to the QT mode or the MT mode, the rate-distortion cost value of each division mode is calculated, the current relatively better RdCost is compared, and the recursive loop is continued until the rate-distortion cost value with the smallest rate is determined. A block division method, that is, the division mode of the current block. Finally, the residual block is calculated according to the division mode, and the residual block is transformed, quantized, and entropy encoded, and the relevant information such as block division parameters is encoded, and the output code stream is waiting for transmission.

Understandably, in the implementation of VVC, first of all, for the input variable declaration part, a supplementary description of maxBtSize can be added:

For the BT maximum unit size maxBtSize, when encoding the HBD sequence, if j/N<c1, then maxBtSize=8; otherwise, if j/N<c2, then maxBtSize=16.

Secondly, for the input variable declaration part, you can also add a supplementary description of maxTtSize:

For the TT maximum unit size maxTtSize, when encoding the HBD sequence, if j/N<c1, then maxTtSize=8; otherwise, if j/N<c2, then maxTtSize=16.

That is to say, in QTMT technology, for high bit-depth video coding, the block division process of binary tree and ternary tree in MT is adaptively cropped, and for video sequences with extremely complex textures, the MT block division larger than 8×8 is cut off. mode, for video sequences with more complex textures, cut out the block division mode larger than 16×16. After the block division technology provided by the embodiment of the present application is implemented on the VVC reference software VTM11.0, the test is carried out in the HBD test sequence required by JVET under the All Intra condition, and the average change of BD-rate on the Y, Cb, and Cr components are 0.20%, 0.29%, and 0.28%, respectively, and the encoding time is reduced by an average of 56%. This data shows that this technology can save more than half of the encoding time with almost negligible loss of performance gain.

This embodiment provides a block division method, which is applied to an encoder. Through the detailed description of the foregoing embodiments in this embodiment, it can be seen that, on the premise of not affecting the performance, the coding complexity can be significantly reduced with the performance gain almost unchanged by using the technical solution of the present application. Compared with the existing QTMT technology in the related art, the technical solution of the present application directly skips the calculation of the prediction and transformation process of large-size blocks. The performance is only 0.20%, which is almost negligible, but since block sizes such as 128, 64, 32, etc. are directly skipped, the total number of recursion of block division can be reduced exponentially, thus ultimately reducing the encoding time by more than 50%; also That is to say, the technical solution of the present application can significantly reduce the coding complexity while maintaining the coding performance substantially equivalent to that of the prior art.

In another embodiment of the present application, referring to FIG. 8 , it shows a schematic flowchart of another block division method provided by the embodiment of the present application. As shown in Figure 8, the method may include:

S801: Parse the code stream, and determine the block division parameters of the current block.

It should be noted that, the block division method in this embodiment of the present application is applied to a decoder. Here, for a video image, the video image can be divided into multiple image blocks, wherein each image block to be decoded can be called a decoding block, and the current block here specifically refers to the decoding block currently to be decoded ; After decoding is complete, you can wait for the video to play.

It should also be noted that the embodiments of the present application mainly provide a fast block division technology for high bit depth video based on texture analysis, that is, applied to high bit depth video. Here, whether the video image is a high bit depth video can be determined by using the identification information of the video image. Specifically, in some embodiments, the method may further include:

Parse the code stream to obtain the identification information of the video image;

If the value of the identification information of the video image is the first value, it is determined that the identification information of the video image indicates that the video image is a high bit depth video; or,

If the value of the identification information of the video image is the second value, it is determined that the identification information of the video image indicates that the video image is a non-high bit depth video.

It should also be noted that the embodiments of the present application provide a decoding method, and specifically provide a block division method. More specifically, the embodiments of the present application design an image texture-based adaptive maximum unit size mechanism for high-bit-depth video. . In this way, when the encoder determines that the video image is a high-bit-depth video, the identification information of the video image can be written into the code stream, so that the decoder can directly determine whether the video image is a high-bit-depth video by parsing the code stream.

In addition, taking the identification information of the video image as a flag as an example, at this time, for the first value and the second value, in a specific example, the first value can be set to 1, and the second value can be set to 0 ; In another specific example, the first value can also be set to true, and the second value can also be set to false; even in another specific example, the first value can also be set to 0, and the second value can also be set to Set to 1; alternatively, the first value can also be set to false, and the second value can also be set to true. The first value and the second value in this embodiment of the present application are not limited in any way.

In this way, assuming that the first value is 1 and the second value is 0, after the decoder parses the code stream, if the value of the identification information of the video image is 1, it can be determined that the video image is a high-bit-depth video, that is, encoding The block division method described in the embodiments of the present application can be used to save coding speed and significantly reduce coding complexity. Otherwise, if the value of the identification information of the video image is 0, it can be determined that the video image is a non-high bit-depth video at this time, that is, the encoder does not use the block division method described in the embodiment of the present application, for example, according to the related art The block division method is performed.

Further, in some embodiments, after the decoder obtains the block division parameters by decoding, the method may further include:

Determine the division mode of the current block based on the block division parameters;

According to the division mode, a division tree of the current block is determined, wherein the division tree includes one or more node sub-blocks obtained by dividing the current block.

That is to say, after the block division parameters are determined, the division mode of the current block can be determined, and then the division tree of the current block can be determined, so as to sequentially process each node sub-tree of the division tree according to the preset processing order of node sub-blocks piece.

It should also be noted that if the identification information of the video image indicates that the video image is a high bit depth video, then the division mode of the current block is determined according to the block division parameter, and the division mode is associated with the texture information of the video image. That is, in the encoder, the division mode is determined by determining the maximum unit size information of the current block according to the texture information of the video image, and then preprocessing the current block according to the maximum unit size information; The mechanism of adaptive image texture is designed, which can directly skip the calculation of the prediction and transformation process of large-size blocks, thereby reducing the encoding time and significantly reducing the encoding complexity while keeping the performance gain basically unchanged.

S802: Based on the block division parameter, parse the code stream to determine the predicted value of the current block.

It should be noted that, after decoding to obtain the block division parameter, for the determination of the predicted value, in some embodiments, the step of parsing the code stream based on the block division parameter to determine the predicted value of the current block may include:

According to the preset node sub-block processing order, the code stream of each node sub-block of the partition tree is sequentially parsed, and the prediction mode of each node sub-block is determined;

The prediction value of each node sub-block is determined according to the prediction mode.

Here, the preset processing order of node sub-blocks may be the preset scanning order. The preset scanning sequence may be diagonal, Zigzag, horizontal, vertical, 4×4 sub-block scanning, or any other raster scanning sequence, which is not limited in this embodiment of the present application.

It should also be noted that, in this embodiment of the present application, the code stream of each node sub-block of the partition tree can be sequentially parsed according to the preset scanning order, to obtain the prediction mode of each node sub-block, and then determine the prediction of each node sub-block. value.

S803: Based on the block division parameters, parse the code stream to determine the residual value of the current block.

It should be noted that, after decoding to obtain the block division parameter, for the determination of the residual value, in some embodiments, the step of parsing the code stream based on the block division parameter to determine the residual value of the current block may include:

The code stream of each node sub-block of the partition tree is sequentially parsed according to the preset node sub-block processing order, and the residual value of each node sub-block is determined.

Here, the preset processing order of node sub-blocks may be the preset scanning order. That is to say, the embodiment of the present application may sequentially parse the code stream of each node sub-block of the partition tree according to the preset scanning order, and then determine the residual value of each node sub-block.

S804: Determine the reconstruction value of the current block based on the predicted value and the residual value.

In a specific example, the determining the reconstructed value of the current block based on the predicted value and the residual value may include: adding the predicted value and the residual value to determine the reconstructed value of the current block.

It should be noted that, after the block division parameters are obtained by decoding, the predicted value of the current block can also be obtained by parsing the code stream; and the residual value of the current block can also be obtained by parsing the code stream; in this way, by comparing the predicted value and The residual value is added and calculated to determine the reconstruction value of the current block.

It should also be noted that, in a specific example, an implementation method of an HBD sequence decoder using QTMT fast block division is as follows: First, entropy decoding, inverse quantization, and inverse transformation are performed on the input code stream, and the residual error can be obtained. Next, the image is reconstructed according to the residual block, and the reconstruction process here mainly includes the following three steps: 1. Determine the current CTU partition tree according to relevant information such as block partition parameters. ② Process each CU of the partition tree in turn according to the raster scan order, and use information such as motion vectors to find the prediction block. ③ Superimpose the residual value and the predicted value of the current CU to obtain the reconstructed CU. Finally, the reconstructed image is sent to the DBF/SAO/ALF filter, and the filtered image is sent to the buffer area, waiting for the video to play.

This embodiment provides a block division method, which is applied to a decoder. By parsing the code stream, the block division parameters of the current block are determined; based on the block division parameters, the code stream is parsed to determine the predicted value of the current block; based on the block division parameters, the code stream is parsed to determine the residual value of the current block; and based on the predicted value and the residual value to determine the reconstructed value of the current block. In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

In yet another embodiment of the present application, based on the same inventive concept as the foregoing embodiments, see FIG. 9 , which shows a schematic structural diagram of an encoder 90 provided by an embodiment of the present application. As shown in FIG. 9 , the encoder 90 may include: a first determining unit 901, a block dividing unit 902 and an encoding unit 903; wherein,

The first determining unit 901 is configured to determine the maximum unit size information of the current block based on the texture information of the video image;

The block division unit 902 is configured to preprocess the current block according to the maximum unit size information to determine the division mode of the current block; and according to the division mode, determine the block division parameter of the current block;

The encoding unit 903 is configured to encode the current block according to the block division parameter.

In some embodiments, the encoding unit 903 is further configured to encode the block division parameter, and write the encoded bits into the code stream.

In some embodiments, the block division unit 902 is further configured to divide the current block into one or more node sub-blocks according to the block division parameter;

The first determining unit 901 is further configured to sequentially determine the prediction parameter of each node sub-block according to the preset node sub-block processing order; and determine the predicted value of the node sub-block according to the prediction parameter; and according to the original node sub-block value and predicted value, determine the residual value of the node sub-block;

The encoding unit 903 is further configured to encode the prediction parameter and the residual value of the node sub-block, and write the encoded bits into the code stream.

In some embodiments, the block dividing unit 902 is further configured to use the maximum unit size information to divide the current block, obtain at least one first node sub-block, and calculate the first rate-distortion cost value; and use a preset division mode to The first node sub-block is divided to obtain at least one second node sub-block, and the second rate-distortion cost value is calculated;

The first determining unit 901 is further configured to determine the division mode of the current block according to the comparison result of the first rate-distortion cost value and the second rate-distortion cost value.

In some embodiments, the block dividing unit 902 is further configured to compare the first rate-distortion cost value with the second rate-distortion cost value; and in the case that the second rate-distortion cost value is less than the first rate-distortion cost value, Use the preset division mode to divide the second node sub-block to obtain at least one next-level second node sub-block, and calculate the third rate-distortion cost value;

The first determining unit 901 is further configured to determine the division mode of the current block according to the second rate-distortion cost value and the third rate-distortion cost value.

In some embodiments, the block dividing unit 902 is further configured to update the second rate-distortion cost value with the third rate-distortion cost value when the third rate-distortion cost value is smaller than the second rate-distortion cost value, and return to executing The second node sub-block is divided by using the preset division mode to obtain at least one next-level second node sub-block, and the steps of calculating the third rate-distortion cost value until the minimum rate-distortion cost value is determined;

The first determining unit 901 is further configured to determine the division mode of the current block according to the division mode corresponding to the minimum rate-distortion cost value.

In some embodiments, the first determining unit 901 is further configured to directly determine the mode of dividing the current block according to the maximum unit size information as the second rate-distortion cost value is greater than or equal to the first rate-distortion value. The partition mode of the current block.

In some embodiments, the preset division mode includes a quadtree division mode and/or a multi-type tree division mode, and the multi-type tree division mode includes at least one of the following: a vertical binary tree division mode, a horizontal binary tree division mode, and a vertical ternary tree division mode Partition mode and horizontal ternary tree partition mode.

In some embodiments, the block division unit 902 is further configured to perform block division on the video image to obtain N blocks of preset size; wherein, N is an integer greater than zero, and the N blocks do not overlap each other;

The first determining unit 901 is further configured to perform texture analysis on the N blocks to determine a first quantity; wherein the first quantity represents the quantity of blocks whose texture values are less than the first threshold in the N blocks; and according to the first quantity and the second The result of the comparison of the thresholds determines the maximum unit size information of the current block.

In some embodiments, referring to FIG. 9, the encoder 90 may further include a calculation unit 904 configured to calculate the texture values of the N blocks;

The first determining unit 901 is further configured to compare the texture values of the N blocks with the first threshold in sequence; and to count the number of blocks whose texture values are less than the first threshold according to the comparison result to obtain the first number.

In some embodiments, the calculation unit 904 is specifically configured to perform variance value calculation on the kth block to obtain the texture value of the kth block; wherein k is an integer greater than or equal to zero and less than N.

In some embodiments, the calculation unit 904 is specifically configured to determine the absolute value of the horizontal gradient and the absolute value of the vertical gradient of the kth block; and perform a sum calculation on the absolute value of the horizontal gradient and the absolute value of the vertical gradient to obtain the kth block The texture value of ; where k is an integer greater than or equal to zero and less than N.

In some embodiments, the first determining unit 901 is further configured to determine a ratio of the first number to N, compare the ratio with a second threshold; and when the ratio is less than the second threshold, determine the largest unit of the current block The size information is a first size value.

In some embodiments, the first determining unit 901 is further configured to compare the ratio with a third threshold when the ratio is greater than or equal to the second threshold; and if the ratio is less than the third threshold, determine the maximum value of the current block The unit size information is a second size value; wherein the second size value is different from the first size value; the first threshold value is different from the second threshold value and the third threshold value, and the second threshold value is smaller than the third threshold value.

In some embodiments, the first determining unit 901 is further configured to determine identification information of the video image; and when the identification information of the video image indicates that the video image is a high bit-depth video, perform texture information based on the video image to determine the Steps for maximum element size information.

In some embodiments, the first determining unit 901 is further configured to determine that the identification information of the video image is a first value if the identification information of the video image indicates that the video image is a high bit depth video; If the identification information indicates that the video image is a non-high bit depth video, the value of the identification information of the video image is determined to be the second value.

In some embodiments, the encoding unit 903 is further configured to encode the identification information of the video image, and write the encoded bits into the code stream.

In some embodiments, the level of the maximum cell size information is at least one of the following: sequence level, picture level.

It can be understood that, in the embodiments of the present application, a "unit" may be a part of a circuit, a part of a processor, a part of a program or software, etc., of course, it may also be a module, and it may also be non-modular. Moreover, each component in this embodiment may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware, or can be implemented in the form of software function modules.

If the integrated unit is implemented in the form of a software functional module and is not sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this embodiment is essentially or The part that contributes to the prior art or the whole or part of the technical solution can be embodied in the form of a software product, the computer software product is stored in a storage medium, and includes several instructions for making a computer device (which can be It is a personal computer, a server, or a network device, etc.) or a processor (processor) that executes all or part of the steps of the method described in this embodiment. The aforementioned storage medium includes: U disk, mobile hard disk, read only memory (Read Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk and other media that can store program codes.

Therefore, an embodiment of the present application provides a computer storage medium, which is applied to the encoder 90, where the computer storage medium stores a computer program, and when the computer program is executed by the first processor, any one of the foregoing embodiments is implemented. Methods.

Based on the composition of the encoder 90 and the computer storage medium described above, see FIG. 10 , which shows a schematic diagram of a specific hardware structure of the encoder 90 provided by the embodiment of the present application. As shown in FIG. 10 , it may include: a first communication interface 1001 , a first memory 1002 and a first processor 1003 ; each component is coupled together through a first bus system 1004 . It can be understood that the first bus system 1004 is used to realize the connection and communication between these components. In addition to the data bus, the first bus system 1004 also includes a power bus, a control bus and a status signal bus. However, for the sake of clarity, the various buses are designated as the first bus system 1004 in FIG. 10 . in,

The first communication interface 1001 is used for receiving and sending signals in the process of sending and receiving information with other external network elements;

a first memory 1002 for storing a computer program that can run on the first processor 1003;

The first processor 1003 is configured to, when running the computer program, execute:

The current block is encoded according to the block partition parameter.

It can be understood that the first memory 1002 in this embodiment of the present application may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memories. Wherein, the non-volatile memory may be a read-only memory (Read-Only Memory, ROM), a programmable read-only memory (Programmable ROM, PROM), an erasable programmable read-only memory (Erasable PROM, EPROM), an electrically programmable read-only memory (Erasable PROM, EPROM). Erase programmable read-only memory (Electrically EPROM, EEPROM) or flash memory. Volatile memory may be Random Access Memory (RAM), which acts as an external cache. By way of illustration and not limitation, many forms of RAM are available, such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (Double Data Rate SDRAM, DDRSDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchronous link dynamic random access memory (Synchlink DRAM, SLDRAM) And direct memory bus random access memory (Direct Rambus RAM, DRRAM). The first memory 1002 of the systems and methods described herein is intended to include, but not be limited to, these and any other suitable types of memory.

The first processor 1003 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above-mentioned method may be completed by an integrated logic circuit of hardware in the first processor 1003 or an instruction in the form of software. The above-mentioned first processor 1003 can be a general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), a ready-made programmable gate array (Field Programmable Gate Array, FPGA) Or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. The methods, steps, and logic block diagrams disclosed in the embodiments of this application can be implemented or executed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in conjunction with the embodiments of the present application may be directly embodied as executed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor. The software modules may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art. The storage medium is located in the first memory 1002, and the first processor 1003 reads the information in the first memory 1002, and completes the steps of the above method in combination with its hardware.

It will be appreciated that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or a combination thereof. For hardware implementation, the processing unit can be implemented in one or more Application Specific Integrated Circuits (ASIC), Digital Signal Processing (DSP), Digital Signal Processing Device (DSP Device, DSPD), programmable Logic Devices (Programmable Logic Device, PLD), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA), General Purpose Processors, Controllers, Microcontrollers, Microprocessors, Others for performing the functions described herein electronic unit or a combination thereof. For a software implementation, the techniques described herein may be implemented through modules (eg, procedures, functions, etc.) that perform the functions described herein. Software codes may be stored in memory and executed by a processor. The memory can be implemented in the processor or external to the processor.

Optionally, as another embodiment, the first processor 1003 is further configured to execute the method described in any one of the foregoing embodiments when running the computer program.

This embodiment provides an encoder, and the encoder may include a first determination unit, a block division unit, and a coding unit. In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

In yet another embodiment of the present application, based on the same inventive concept as the foregoing embodiments, see FIG. 11 , which shows a schematic structural diagram of a decoder 110 provided by an embodiment of the present application. As shown in FIG. 11 , the decoder 110 may include: a parsing unit 1101 and a second determining unit 1102; wherein,

The parsing unit 1101 is configured to parse the code stream and determine the block division parameter of the current block;

The parsing unit 1101 is further configured to parse the code stream based on the block division parameter to determine the predicted value of the current block; and based on the block division parameter, parse the code stream to determine the residual value of the current block;

The second determining unit 1102 is configured to determine the reconstruction value of the current block based on the predicted value and the residual value.

In some embodiments, the parsing unit 1101 is further configured to parse the code stream to obtain identification information of the video image;

The second determining unit 1102 is further configured to, if the value of the identification information of the video image is the first value, determine that the identification information of the video image indicates that the video image is a high bit depth video; or, if the value of the identification information of the video image is the value of For the second value, it is determined that the identification information of the video image indicates that the video image is a non-high bit-depth video.

In some embodiments, the second determining unit 1102 is further configured to determine a division mode of the current block based on the block division parameter; and determine a division tree of the current block according to the division mode, wherein the division tree comprises dividing the current block to obtain One or more node sub-blocks of .

In some embodiments, the parsing unit 1101 is further configured to sequentially parse the code stream of each node sub-block of the partition tree according to a preset node sub-block processing order, and determine the prediction mode of each node sub-block;

The second determining unit 1102 is further configured to determine the prediction value of each node sub-block according to the prediction mode.

In some embodiments, the parsing unit 1101 is further configured to sequentially parse the code stream of each node sub-block of the partition tree according to the preset node sub-block processing order, and determine the residual value of each node sub-block.

In some embodiments, the division mode has an associated relationship with texture information of the video image.

It can be understood that, in this embodiment, a "unit" may be a part of a circuit, a part of a processor, a part of a program or software, etc., of course, it may also be a module, and it may also be non-modular. Moreover, each component in this embodiment may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware, or can be implemented in the form of software function modules.

If the integrated unit is implemented in the form of a software functional module and is not sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on such understanding, this embodiment provides a computer storage medium, which is applied to the decoder 110, where the computer storage medium stores a computer program, and when the computer program is executed by the second processor, any one of the foregoing embodiments is implemented the method described.

Based on the above-mentioned composition of the decoder 110 and the computer storage medium, see FIG. 12 , which shows a schematic diagram of a specific hardware structure of the decoder 110 provided by the embodiment of the present application. As shown in FIG. 12 , it may include: a second communication interface 1201 , a second memory 1202 and a second processor 1203 ; each component is coupled together through a second bus system 1204 . It can be understood that the second bus system 1204 is used to implement connection communication between these components. In addition to the data bus, the second bus system 1204 also includes a power bus, a control bus, and a status signal bus. However, for the sake of clarity, the various buses are labeled as the second bus system 1204 in FIG. 12 . in,

The second communication interface 1201 is used for receiving and sending signals in the process of sending and receiving information with other external network elements;

a second memory 1202 for storing computer programs that can run on the second processor 1203;

The second processor 1203 is configured to, when running the computer program, execute:

Optionally, as another embodiment, the second processor 1203 is further configured to execute the method described in any one of the foregoing embodiments when running the computer program.

It can be understood that the hardware function of the second memory 1202 is similar to that of the first memory 1002, and the hardware function of the second processor 1203 is similar to that of the first processor 1003; details are not described here.

This embodiment provides a decoder, and the decoder may include a parsing unit and a second determining unit. In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

It should be noted that, in this application, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or device comprising a series of elements includes not only those elements , but also other elements not expressly listed or inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.

The above-mentioned serial numbers of the embodiments of the present application are only for description, and do not represent the advantages or disadvantages of the embodiments.

The methods disclosed in the several method embodiments provided in this application can be arbitrarily combined under the condition of no conflict to obtain new method embodiments.

The features disclosed in the several product embodiments provided in this application can be combined arbitrarily without conflict to obtain a new product embodiment.

The features disclosed in several method or device embodiments provided in this application can be combined arbitrarily without conflict to obtain new method embodiments or device embodiments.

The above are only specific embodiments of the present application, but the protection scope of the present application is not limited to this. should be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be subject to the protection scope of the claims.

Industrial Applicability

In the embodiment of the present application, on the encoder side, the maximum unit size information of the current block is determined based on the texture information of the video image; the current block is preprocessed according to the maximum unit size information, and the division mode of the current block is determined; according to the division mode, determining a block division parameter of the current block; and encoding the current block according to the block division parameter. On the decoder side, the code stream is parsed to determine the block division parameters of the current block; based on the block division parameters, the code stream is parsed to determine the predicted value of the current block; based on the block division parameters, the code stream is parsed to determine the residual value of the current block; And, based on the predicted value and the residual value, the reconstructed value of the current block is determined. In this way, since the maximum unit size information of the current block is determined according to the texture information of the video image, that is, the technical solution of the present application designs an adaptive image texture mechanism for the maximum unit size, which can directly skip the prediction and transformation process of large-size blocks. The calculation of , leads to an exponential decrease in the total number of recursion of block division, thus significantly reducing the coding complexity and reducing the coding time while keeping the performance gain basically unchanged, thereby improving the coding and decoding efficiency.

Claims

A block division method, applied to an encoder, the method comprising:

Determine the maximum unit size information of the current block based on the texture information of the video image;

Perform preprocessing on the current block according to the maximum unit size information to determine a division mode of the current block;

determining the block division parameter of the current block according to the division mode;

The current block is encoded according to the block division parameter.
The method of claim 1, wherein the encoding the current block according to the block division parameter comprises:

The block division parameter is encoded, and the encoded bits are written into the code stream.
The method according to claim 1 or 2, wherein the encoding the current block according to the block division parameter comprises:

dividing the current block into one or more node sub-blocks according to the block division parameter;

According to the preset node sub-block processing order, sequentially determine the prediction parameters of each of the node sub-blocks;

determining the predicted value of the node sub-block according to the prediction parameter;

According to the original value of the node sub-block and the predicted value, determine the residual value of the node sub-block;

The prediction parameters and residual values of the node sub-blocks are encoded, and the encoded bits are written into the code stream.
The method according to claim 1, wherein the preprocessing of the current block according to the maximum unit size information to determine the division mode of the current block comprises:

The current block is divided by using the maximum unit size information to obtain at least one first node sub-block, and a first rate-distortion cost value is calculated;

The first node sub-block is divided by using a preset division mode to obtain at least one second node sub-block, and the second rate-distortion cost value is calculated;

A division mode of the current block is determined according to a comparison result of the first rate-distortion cost value and the second rate-distortion cost value.
The method according to claim 4, wherein determining the division mode of the current block according to the comparison result of the first rate-distortion cost value and the second rate-distortion cost value comprises:

comparing the first rate-distortion cost value to the second rate-distortion cost value;

In the case that the second rate-distortion cost value is less than the first rate-distortion cost value, the second node sub-block is divided by using a preset division mode to obtain at least one next-level second node sub-block, And calculate the third rate distortion cost value;

A division mode of the current block is determined according to the second rate-distortion cost value and the third rate-distortion cost value.
The method according to claim 5, wherein the determining the division mode of the current block according to the second rate-distortion cost value and the third rate-distortion cost value comprises:

In the case that the third rate-distortion cost value is smaller than the second rate-distortion cost value, update the second rate-distortion cost value by using the third rate-distortion cost value, and return to executing the using preset division The mode divides the second node sub-block to obtain at least one next-level second node sub-block, and calculates the third rate-distortion cost value until the minimum rate-distortion cost value is determined;

The division mode of the current block is determined according to the division mode corresponding to the minimum rate-distortion cost value.
The method of claim 5, wherein the method further comprises:

In the case that the second rate-distortion cost value is greater than or equal to the first rate-distortion value, directly determining the mode of dividing the current block according to the maximum unit size information as the division mode of the current block .
The method according to claim 4, wherein the preset division mode comprises a quad-tree division mode and/or a multi-type tree division mode;

The multi-type tree division modes include at least one of the following: a vertical binary tree division mode, a horizontal binary tree division mode, a vertical tri-tree division mode, and a horizontal tri-tree division mode.
The method according to claim 1, wherein the determining the maximum unit size information of the current block based on the texture information of the video image comprises:

Perform block division on the video image to obtain N blocks of preset size; wherein, N is an integer greater than zero, and the N blocks do not overlap each other;

Perform texture analysis on the N blocks to determine a first number; wherein, the first number represents the number of blocks whose texture values are less than a first threshold in the N blocks;

The maximum unit size information of the current block is determined according to the comparison result of the first number and the second threshold.
The method of claim 9, wherein the performing texture analysis on the N blocks to determine the first number comprises:

calculating the texture values of the N blocks;

comparing the texture values of the N blocks with the first threshold in sequence;

According to the comparison result, the number of blocks whose texture value is less than the first threshold is counted to obtain the first number.
The method of claim 10, wherein the calculating the texture values of the N blocks comprises:

Perform variance value calculation on the kth block to obtain the texture value of the kth block; wherein, k is an integer greater than or equal to zero and less than N.
The method of claim 10, wherein the calculating the texture values of the N blocks comprises:

Determine the absolute value of the horizontal gradient and the absolute value of the vertical gradient of the kth block;

The absolute value of the horizontal gradient and the absolute value of the vertical gradient are summed to obtain the texture value of the kth block; wherein, k is an integer greater than or equal to zero and less than N.
The method according to claim 9, wherein the determining the maximum unit size information of the current block according to the comparison result of the first number and the second threshold value comprises:

determining a ratio of the first number to N, and comparing the ratio to a second threshold;

In the case that the ratio is smaller than the second threshold, determining the maximum unit size information of the current block as the first size value.
The method of claim 13, wherein the method further comprises:

if the ratio is greater than or equal to the second threshold, comparing the ratio to a third threshold;

If the ratio is less than the third threshold, determining that the maximum unit size information of the current block is the second size value;

The second size value is different from the first size value; the first threshold value is different from the second threshold value and the third threshold value, and the second threshold value is smaller than the third threshold value.
The method according to any one of claims 1 to 14, wherein the method further comprises:

determining the identification information of the video image;

When the identification information of the video image indicates that the video image is a high bit depth video, the step of determining the maximum unit size information of the current block based on the texture information of the video image is performed.
The method according to claim 15, wherein the determining the identification information of the video image comprises:

If the identification information of the video image indicates that the video image is a high-bit-depth video, determine that the value of the identification information of the video image is the first value; or,

If the identification information of the video image indicates that the video image is a non-high bit-depth video, the value of the identification information of the video image is determined to be a second value.
The method of claim 16, wherein the method further comprises:

The identification information of the video image is encoded, and the encoded bits are written into the code stream.
The method according to claim 1, wherein the level of the maximum cell size information is at least one of the following: sequence level and image level.
A block division method, applied to a decoder, the method comprising:

Parse the code stream and determine the block division parameters of the current block;

Based on the block division parameter, the code stream is parsed, and the predicted value of the current block is determined;

Based on the block division parameter, the code stream is parsed, and the residual value of the current block is determined;

Based on the predicted value and the residual value, a reconstructed value of the current block is determined.
The method of claim 19, wherein the method further comprises:

Parse the code stream to obtain the identification information of the video image;

If the value of the identification information of the video image is the first value, it is determined that the identification information of the video image indicates that the video image is a high bit depth video; or,

If the value of the identification information of the video image is the second value, it is determined that the identification information of the video image indicates that the video image is a non-high bit depth video.
The method of claim 19 or 20, wherein the method further comprises:

determining a division mode of the current block based on the block division parameter;

According to the division mode, a division tree of the current block is determined, wherein the division tree includes one or more node sub-blocks obtained by dividing the current block.
The method according to claim 21, wherein, based on the block division parameter, parsing the code stream to determine the predicted value of the current block comprises:

According to the preset node sub-block processing order, the code stream of each node sub-block of the partition tree is sequentially analyzed, and the prediction mode of each node sub-block is determined;

The prediction value of each node sub-block is determined according to the prediction mode.
The method according to claim 21, wherein, based on the block division parameter, parsing the code stream to determine the residual value of the current block, comprising:

The code stream of each node sub-block of the partition tree is sequentially parsed according to the preset node sub-block processing order, and the residual value of each node sub-block is determined.
The method of claim 21, wherein the division mode has an associated relationship with texture information of the video image.
An encoder comprising a first determination unit, a block division unit and a coding unit; wherein,

The first determining unit is configured to determine the maximum unit size information of the current block based on the texture information of the video image;

the block division unit, configured to preprocess the current block according to the maximum unit size information to determine a division mode of the current block; and determine a block division parameter of the current block according to the division mode;

The encoding unit is configured to encode the current block according to the block division parameter.
An encoder comprising a first memory and a first processor; wherein,

the first memory for storing a computer program executable on the first processor;

The first processor is configured to execute the method according to any one of claims 1 to 18 when running the computer program.
A decoder, the decoder includes a parsing unit and a second determining unit; wherein,

The parsing unit is configured to parse the code stream and determine the block division parameter of the current block;

The parsing unit is further configured to parse the code stream based on the block division parameter to determine the predicted value of the current block; and based on the block division parameter, parse the code stream to determine the residual value of the current block;

The second determination unit is configured to determine the reconstruction value of the current block based on the prediction value and the residual value.
A decoder comprising a second memory and a second processor; wherein,

the second memory for storing a computer program executable on the second processor;

The second processor is configured to execute the method according to any one of claims 19 to 24 when running the computer program.
A computer storage medium, wherein the computer storage medium stores a computer program that, when executed, implements the method according to any one of claims 1 to 18, or any one of claims 19 to 24 the method described.