CN111432212B

CN111432212B - Intra-frame division method, system and storage medium based on texture feature

Info

Publication number: CN111432212B
Application number: CN202010351187.5A
Authority: CN
Inventors: 梁凡; 谭敏仪
Original assignee: Sun Yat Sen University
Current assignee: Hefei Minglong Electronic Technology Co ltd
Priority date: 2020-04-28
Filing date: 2020-04-28
Publication date: 2022-05-10
Anticipated expiration: 2040-04-28
Also published as: CN111432212A

Abstract

The invention discloses an intra-frame division method, a system and a storage medium based on texture features, wherein the method comprises the following steps: extracting first texture information of a coding unit through a first index, a second index and a third index, and dividing the coding unit into a first coding unit, a second coding unit or a third coding unit; wherein, the first coding unit is a coding unit capable of terminating division; then determining a division mode of the second coding unit according to a fourth index and a fifth index; and finally, determining the dividing mode of the third coding unit through a sixth index and a seventh index. The invention deletes redundant partition modes by determining the partition modes of each code and determining the optimal partition mode, thereby reducing the complexity of the encoder and being widely applied to the technical field of data processing.

Description

Intra-frame division method, system and storage medium based on texture feature

技术领域technical field

本发明涉及数据处理技术领域，尤其是基于纹理特征的帧内划分方法、系统及存储介质。The present invention relates to the technical field of data processing, in particular to an intra-frame division method, system and storage medium based on texture features.

背景技术Background technique

VVC/H.266是基于HEVC/H.265编码标准上进一步发展的下一代视频编码标准。VVC标准目前还在制定中，其继承了传统的大多数编码工具以及混合编码框架，包括帧内(间)预测，变换量化，熵编码和环路滤波，但在延续了大多数编码工具的同时也采纳了很多新技术，使得编码效率大幅提升。H.266/VVC的目标是为了与相同质量条件下的H.265/HEVC相比，H.266/VVC的编码码率降低为H.265/HEVC的一半。VVC/H.266 is a next-generation video coding standard based on the HEVC/H.265 coding standard. The VVC standard is still being developed, and it inherits most of the traditional coding tools and the hybrid coding framework, including intra (inter) prediction, transform quantization, entropy coding and loop filtering, but while continuing most coding tools Many new technologies have also been adopted, which greatly improves the coding efficiency. The goal of H.266/VVC is to reduce the coding rate of H.266/VVC to half that of H.265/HEVC compared with H.265/HEVC under the same quality condition.

为了更好地描述视频内容的纹理，H.266/VVC在H.265/HEVC的基础上引入了MTT划分，即基于QT划分的基础上，添加了BT和TT划分。其中，QT划分与H.265/HEVC标准中规定的相同，都是将CU划分为四个面积相等的子块，每个子节点的CU的长宽都是原来父节点CU的一半。而新引入的BT和TT划分都具有水平和垂直两个方向，其中BT划分是将CU划分为两个面积相等的子块，若是水平划分，那么子块的CU的长与父节点CU一致，而宽为父节点CU的一半，反之，若是垂直划分，那么子块的CU的宽与父节点CU一致，而长为父节点CU的一半。TT划分而是将CU划分为三个子块，其面积比例为1:2:1。In order to better describe the texture of video content, H.266/VVC introduces MTT division based on H.265/HEVC, that is, based on QT division, BT and TT division are added. Among them, the QT division is the same as that specified in the H.265/HEVC standard, in which the CU is divided into four sub-blocks of equal area, and the length and width of the CU of each child node are half of the original parent node CU. The newly introduced BT and TT divisions both have horizontal and vertical directions. The BT division is to divide the CU into two sub-blocks with equal areas. If it is divided horizontally, the length of the CU of the sub-block is the same as that of the parent node CU. The width is half of that of the parent node CU. On the contrary, if it is vertically divided, the width of the CU of the child block is the same as that of the parent node CU, and the length is half of the parent node CU. Instead, the TT partition divides the CU into three sub-blocks with an area ratio of 1:2:1.

除了引入新的划分模式之外，H.266/VVC还对帧内编码部分进行改进。首先H.266/VVC细化了角度模式，从HEVC的33种角度模式扩展到65种。除此之外，VVC还引入了宽角预测模式、PDPC预测模式、MRL预测模式和ISP预测模式等技术。新引入的预测模式可以更好地描述视频图像中的纹理，提高压缩效率。其中，图像的特征包括图像的颜色特征，纹理特征，形状特征和空间关系特征等，其可以综合描述一幅图像的内容和不同方面的性质，而图像的内容和性质与编码单元的划分的结果息息相关。In addition to introducing new division modes, H.266/VVC also improves the intra-frame coding part. First of all, H.266/VVC has refined the angle mode, expanding from 33 angle modes of HEVC to 65 kinds. In addition, VVC also introduces technologies such as wide-angle prediction mode, PDPC prediction mode, MRL prediction mode and ISP prediction mode. The newly introduced prediction mode can better describe the texture in the video image and improve the compression efficiency. Among them, the characteristics of the image include the color characteristics, texture characteristics, shape characteristics and spatial relationship characteristics of the image, which can comprehensively describe the content and properties of different aspects of an image, and the content and properties of the image are the result of the division of coding units. closely related.

但VVC在提高压缩效率的同时，也带来了巨大的复杂度，这对VVC投入实际应用带来了困难。However, while improving the compression efficiency, VVC also brings huge complexity, which brings difficulties to the practical application of VVC.

发明内容SUMMARY OF THE INVENTION

有鉴于此，本发明实施例提供一种能够提高降低编码器复杂度的，基于纹理特征的帧内划分方法、系统及存储介质。In view of this, embodiments of the present invention provide a texture feature-based intra-frame division method, system, and storage medium that can improve and reduce the complexity of an encoder.

本发明的第一方面提供了一种基于纹理特征的帧内划分方法，包括：A first aspect of the present invention provides an intra-frame division method based on texture features, including:

通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息，并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元；所述第一编码单元为可终止划分的编码单元，所述第二编码单元为执行三叉划分的编码单元，所述第三编码单元为执行二叉或四叉划分的编码单元；The first texture information of the coding unit is extracted by the first index, the second index and the third index, and the coding unit is divided into the first coding unit, the second coding unit or the third coding unit; the first coding unit is a coding unit that can terminate the division, the second coding unit is a coding unit that performs three-fork division, and the third coding unit is a coding unit that performs two-fork or four-fork division;

根据第四指标和第五指标，确定所述第二编码单元的划分模式；determining the division mode of the second coding unit according to the fourth index and the fifth index;

通过第六指标和第七指标，确定所述第三编码单元的划分模式；Determine the division mode of the third coding unit by the sixth index and the seventh index;

其中，所述第一指标用于确定编码单元的像素一致性；Wherein, the first index is used to determine the pixel consistency of the coding unit;

所述第二指标用于确定编码单元在垂直方向上的第一纹理信息；The second index is used to determine the first texture information of the coding unit in the vertical direction;

所述第三指标用于确定编码单元在水平方向上的第一纹理信息；The third index is used to determine the first texture information of the coding unit in the horizontal direction;

所述第四指标和所述第五指标用于确定编码单元的纹理方向；The fourth index and the fifth index are used to determine the texture direction of the coding unit;

所述第六指标和所述第七指标用于确定子块之间的纹理方向的一致性。The sixth index and the seventh index are used to determine the consistency of texture directions between sub-blocks.

在一些实施例中，所述通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息，并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元这一步骤，具体为：In some embodiments, the first texture information of the coding unit is extracted by using the first index, the second index and the third index, and the coding unit is divided into the first coding unit, the second coding unit or the third coding unit This step of the unit is as follows:

当所述第一纹理信息小于第一阈值，则将所述编码单元确定为第一编码单元；When the first texture information is less than a first threshold, determining the coding unit as the first coding unit;

当所述第一纹理信息大于第二阈值，则将所述编码单元确定为第二编码单元；When the first texture information is greater than a second threshold, determining the coding unit as a second coding unit;

当所述第一纹理信息大于第一阈值且小于第二阈值，则将所述编码单元确定为第三编码单元。When the first texture information is greater than a first threshold and less than a second threshold, the coding unit is determined as a third coding unit.

在一些实施例中，所述根据第四指标和第五指标，确定所述第二编码单元的划分模式这一步骤，具体为：In some embodiments, the step of determining the division mode of the second coding unit according to the fourth index and the fifth index is specifically:

当第四指标大于第五指标，确定所述第二编码单元执行三叉水平划分；When the fourth index is greater than the fifth index, it is determined that the second coding unit performs three-pronged horizontal division;

当第四指标小于第五指标，确定所述第二编码单元执行三叉垂直划分。When the fourth index is smaller than the fifth index, it is determined that the second coding unit performs three-pronged vertical division.

在一些实施例中，所述通过第六指标和第七指标，确定所述第三编码单元的划分模式这一步骤，具体为：In some embodiments, the step of determining the division mode of the third coding unit according to the sixth index and the seventh index is specifically:

根据第六指标和第七指标计算编码单元的第二纹理信息；calculating the second texture information of the coding unit according to the sixth index and the seventh index;

当所述第二纹理信息大于第三阈值，确定所述第三编码单元执行四叉划分；When the second texture information is greater than a third threshold, it is determined that the third coding unit performs quad division;

当所述第二纹理信息小于第四阈值，确定所述第三编码单元执行二叉划分；When the second texture information is less than a fourth threshold, determining that the third coding unit performs binary division;

当所述第二纹理信息大于第四阈值且小于第三阈值，确定所述第三编码单元执行二叉划分或四叉划分。When the second texture information is greater than a fourth threshold and less than a third threshold, it is determined that the third coding unit performs binary division or quadruple division.

在一些实施例中，所述第一指标根据编码单元上各个位置的像素值、编码单元的高度值、编码单元的宽度值以及编码单元中像素值的均值确定；In some embodiments, the first indicator is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit, and the mean value of the pixel values in the coding unit;

所述第一指标的计算公式为：The calculation formula of the first indicator is:

其中，p(x,y)代表编码单元中位于第x列和第y行的位置上的像素值；u该编码单元的高度；v代表该编码单元的宽度；m代表该编码单元中像素值的均值。Wherein, p(x,y) represents the pixel value at the position of the xth column and the yth row in the coding unit; u is the height of the coding unit; v represents the width of the coding unit; m represents the pixel value in the coding unit mean value of .

在一些实施例中，所述第二指标根据编码单元上各个位置的像素值、编码单元的高度值、编码单元的宽度值以及编码单元中每行像素值的均值确定；In some embodiments, the second index is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit, and the mean value of the pixel values of each row in the coding unit;

所述第二指标的计算公式为：The calculation formula of the second indicator is:

其中，m_y是该编码单元中第y行的像素的均值；p(x,y)代表编码单元中位于第x列和第y行的位置上的像素值；u该编码单元的高度；v代表该编码单元的宽度；Wherein, m _y is the mean value of the pixels in the y-th row in the coding unit; p(x, y) represents the pixel value at the position of the x-th column and the y-th row in the coding unit; u is the height of the coding unit; v represents the width of the coding unit;

所述第三指标根据根据编码单元上各个位置的像素值、编码单元的高度值、编码单元的宽度值以及编码单元中每列像素值的均值确定；The third index is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit and the mean value of each column of pixel values in the coding unit;

所述第三指标的计算公式为：The calculation formula of the third indicator is:

其中，m_x是该编码单元中第x列的像素的均值；p(x,y)代表编码单元中位于第x列和第y行的位置上的像素值；u该编码单元的高度；v代表该编码单元的宽度。Wherein, m _x is the mean value of the pixel in the xth column in the coding unit; p(x,y) represents the pixel value at the position of the xth column and the yth row in the coding unit; u is the height of the coding unit; v Represents the width of this coding unit.

在一些实施例中，所述第四指标根据第一指标和第二指标计算得到；In some embodiments, the fourth index is calculated according to the first index and the second index;

所述第五指标根据第一指标和第三指标计算得到；The fifth index is calculated according to the first index and the third index;

其中，所述第四指标的计算公式为：Wherein, the calculation formula of the fourth indicator is:

LEN_H＝|MAD-MAD_H|LEN _H =|MAD-MAD _H |

MAD代表第一指标；MAD_H代表第二指标；MAD represents the first indicator; MAD _H represents the second indicator;

其中，所述第五指标的计算公式为：Wherein, the calculation formula of the fifth index is:

LEN_V＝|MAD-MAD_V|LEN _V = |MAD-MAD _V |

MAD代表第一指标；MAD_V代表第三指标。MAD stands for the first indicator; MAD _V stands for the third indicator.

在一些实施例中，所述第六指标根据各个子块的第二指标计算得到；In some embodiments, the sixth index is calculated according to the second index of each sub-block;

所述第七指标根据各个子块的第三指标计算得到；The seventh index is calculated according to the third index of each sub-block;

其中，所述第六指标的计算公式为：Wherein, the calculation formula of the sixth indicator is:

MAD_H1代表第一子块的第二指标；MAD_H2代表第二子块的第二指标；MAD_H3代表第三子块的第二指标；MAD_H4代表第四子块的第二指标；MAD _H1 represents the second indicator of the first sub-block; MAD _H2 represents the second indicator of the second sub-block; MAD _H3 represents the second indicator of the third sub-block; MAD _H4 represents the second indicator of the fourth sub-block;

所述第七指标的计算公式为：The calculation formula of the seventh index is:

MAD_V1代表第一子块的第三指标；MAD_V2代表第二子块的第三指标；MAD_V3代表第三子块的第三指标；MAD_V4代表第四子块的第三指标。MAD _V1 represents the third indicator of the first sub-block; MAD _V2 represents the third indicator of the second sub-block; MAD _V3 represents the third indicator of the third sub-block; MAD _V4 represents the third indicator of the fourth sub-block.

根据本发明的第二方面，还提供了一种基于纹理特征的帧内划分系统，包括：According to a second aspect of the present invention, there is also provided an intra-frame partitioning system based on texture features, including:

提取模块，用于通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息，并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元；所述第一编码单元为可终止划分的编码单元，所述第二编码单元为执行三叉划分的编码单元，所述第三编码单元为执行二叉或四叉划分的编码单元；an extraction module, configured to extract the first texture information of the coding unit through the first index, the second index and the third index, and divide the coding unit into the first coding unit, the second coding unit or the third coding unit; the The first coding unit is a coding unit that can terminate the division, the second coding unit is a coding unit that performs three-pronged division, and the third coding unit is a coding unit that performs binary or quadruple division;

第一划分模块，用于根据第四指标和第五指标，确定所述第二编码单元的划分模式；a first division module, configured to determine the division mode of the second coding unit according to the fourth index and the fifth index;

第二划分模块，用于通过第六指标和第七指标，确定所述第三编码单元的划分模式；a second division module, configured to determine the division mode of the third coding unit through the sixth index and the seventh index;

所述第四指标和所述第五指标用于确定编码单元的纹理方向。The fourth index and the fifth index are used to determine the texture direction of the coding unit.

根据本发明的第三方面，还提供了一种存储介质，所述存储介质存储有程序，所述程序被处理器执行完成如第一方面所述的方法。According to a third aspect of the present invention, a storage medium is also provided, and the storage medium stores a program, and the program is executed by a processor to complete the method according to the first aspect.

本发明的实施例通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息，并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元；其中，第一编码单元为可终止划分的编码单元；然后根据第四指标和第五指标，确定所述第二编码单元的划分模式；最后通过第六指标和第七指标，确定所述第三编码单元的划分模式。本发明通过明确各个编码的划分模式，通过确定最优的划分模式，删减了冗余的划分模式，进而能够降低编码器的复杂度。The embodiment of the present invention extracts the first texture information of the coding unit by using the first index, the second index and the third index, and divides the coding unit into the first coding unit, the second coding unit or the third coding unit; wherein , the first coding unit is a coding unit that can terminate the division; then according to the fourth index and the fifth index, determine the division mode of the second coding unit; finally through the sixth index and the seventh index, determine the third coding The division mode of the unit. By clarifying the division mode of each coding and determining the optimal division mode, the present invention deletes redundant division modes, thereby reducing the complexity of the encoder.

附图说明Description of drawings

为了更清楚地说明本申请实施例中的技术方案，下面将对实施例描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative effort.

图1为本发明实施例的整体步骤流程图；Fig. 1 is the overall step flow chart of the embodiment of the present invention;

图2为本发明实施例的四个子块示意图。FIG. 2 is a schematic diagram of four sub-blocks according to an embodiment of the present invention.

具体实施方式Detailed ways

下面结合说明书附图和具体实施例对本发明作进一步解释和说明。对于本发明实施例中的步骤编号，其仅为了便于阐述说明而设置，对步骤之间的顺序不做任何限定，实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。The present invention will be further explained and illustrated below in conjunction with the accompanying drawings and specific embodiments of the description. The step numbers in the embodiments of the present invention are set only for the convenience of elaboration, and the sequence between the steps is not limited, and the execution sequence of the steps in the embodiments can be performed according to the understanding of those skilled in the art Adaptive adjustment.

针对现有技术存在的问题，本发明采用7个指标对编码单元的纹理进行描述，分别是MAD、MAD_H、MAD_V、LEN_H、LEN_v、diff_H和diff_V，下面首先对这七个指标的定义与所代表的含义进行解释：In view of the problems existing in the prior art, the present invention uses seven indicators to describe the texture of the coding unit, which are MAD, MAD _H , MAD _V , LEN _H , LEN _v , diff _H and diff _V . The definitions of indicators and what they represent are explained:

1、第一指标：MAD；1. The first indicator: MAD;

其中，p(x,y)代表编码单元中位于(x,y)，也就是第x列和第y行的位置上的像素值，u和v代表该编码单元的高度(Height)和宽度(Width)，m代表该编码单元中像素值的均值，m的定义如下Among them, p(x,y) represents the pixel value located at (x,y) in the coding unit, that is, the pixel value at the position of the xth column and the yth row, and u and v represent the height (Height) and width ( Width), m represents the mean value of the pixel values in the coding unit, and m is defined as follows

其中，MAD用于描述编码单元的像素的一致性。MAD越大，表示该预测单元的纹理越复杂，需要进一步划分的概率也就越大，反之，MAD越小，说明该编码单元中像素值越相似，也就越倾向于终止划分。Among them, MAD is used to describe the consistency of the pixels of the coding unit. The larger the MAD, the more complex the texture of the prediction unit, and the greater the probability that further division is required. On the contrary, the smaller the MAD, the more similar the pixel values in the coding unit, and the more inclined to terminate the division.

2、第二指标：MAD_H；2. The second indicator: MAD _H ;

MAD_H的定义如下MAD _H is defined as follows

其中，m_y是该编码单元中第y行的像素的均值，其定义如下where m _y is the mean of the pixels in the y-th row of the coding unit, which is defined as follows

MAD_H用于衡量该编码单元上垂直方向上的纹理，若MAD_H越大，那么说明该编码单元上的垂直方向的纹理比较复杂。MAD _H is used to measure the texture in the vertical direction on the coding unit. If the MAD _H is larger, it means that the texture in the vertical direction on the coding unit is more complicated.

3、第三指标：MAD_V；3. The third indicator: MAD _V ;

MAD_V的定义如下MAD _V is defined as follows

其中，m_x是该编码单元中，第x列的像素的均值，其定义如下Among them, m _x is the mean value of the pixels in the xth column in the coding unit, which is defined as follows

MAD_v用于衡量该编码单元上水平方向上的纹理，若MAD_v越大，那么说明该编码单元上的水平方向的纹理复杂。MAD _v is used to measure the texture in the horizontal direction on the coding unit. If the MAD _v is larger, it means that the texture in the horizontal direction on the coding unit is complex.

4、第四指标LEN_H和第五指标LEN_v；4. The fourth index LEN _H and the fifth index LEN _v ;

LEN_H和LEN_v的定义如下LEN _H and LEN _v are defined as follows

LEN_H＝|MAD-MAD_H|LEN _H =|MAD-MAD _H |

LEN_V＝|MAD-MAD_V|LEN _V = |MAD-MAD _V |

LEN_H和LEN_v用于衡量一个编码单元中，其纹理的主要方向。这是因为LEN_H定义为MAD_H与MAD差值的绝对值，而LEN_v定义为MAD_v与MAD差值的绝对值。若LEN_H>LEN_v，说明该编码单元上的MAD值主要是由于水平方向上的纹理引起的，也就说明该编码单元中，水平方上的纹理较多。H.266/VVC的编码器VTM在编码该单元时更倾向于进行水平二叉或者三叉划分。相反，若LEN_H<LEN_v，说明该编码单元上的MAD值主要是由于垂直方向上的纹理引起的，也就说明该编码单元中，垂直方上的纹理较多。VTM在编码该单元时更倾向于进行垂直二叉或者三叉划分。LEN _H and LEN _v are used to measure the main direction of the texture in a coding unit. This is because LEN _H is defined as the absolute value of the difference between MAD _H and MAD, and LEN _v is defined as the absolute value of the difference between MAD _v and MAD. If LEN _H >LEN _v , it means that the MAD value on the coding unit is mainly caused by the texture in the horizontal direction, which means that there are many textures in the horizontal direction in the coding unit. The encoder VTM of H.266/VVC is more inclined to perform horizontal binary or triple division when encoding the unit. On the contrary, if LEN _H < LEN _v , it means that the MAD value on the coding unit is mainly caused by the texture in the vertical direction, which means that there are more textures in the vertical direction in the coding unit. The VTM is more inclined to perform vertical binary or triple division when encoding this unit.

5、第六指标diff_H和第七指标diff_V；5. The sixth index diff _H and the seventh index diff _V ;

diff_H和diff_V定义如下diff _H and diff _V are defined as follows

在32×32大小的编码单元中，本发明的实施例将该CU划分为四个大小相同的子块，具体划分方法如图2所示，并且依次将左上、右上、左下和右下的这几个子块标记为1，2，3和4。将其划分为四个相同的子块后，还需要计算出每个子块的MAD、MAD_H和MAD_V值，且记为MAD_i、MAD_Hi和MAD_Vi，其中i为每个子块中的标号。In a coding unit with a size of 32×32, the embodiment of the present invention divides the CU into four sub-blocks of the same size. The specific division method is shown in FIG. 2 , and the upper left, upper right, lower left and lower right Several sub-blocks are labeled 1, 2, 3 and 4. After it is divided into four identical sub-blocks, the MAD, MAD _H and MAD _V values of each sub-block need to be calculated and marked as MAD _i , MAD _Hi and MAD _Vi , where i is the label in each sub-block .

这两个指标用于衡量子块1，2，3和4的纹理方向是否一致。若min(diff_H,diff_V)比较大时，说明该CU在水平和垂直方向上的纹理都比较复杂，这种情况下，最佳的划分方式为四叉划分。而纹理方向比较连贯且一致的情况下，若编码单元的纹理主要为水平方向时，通常diff_H<diff_V。那么VTM在编码的时候更倾向于把该编码单元进行水平划分。反之，若编码单元的纹理主要为垂直方向是，通常diff_H>diff_V，这种情况下，VTM更倾向于把该编码单元进行垂直划分。These two indicators are used to measure whether the texture directions of sub-blocks 1, 2, 3 and 4 are consistent. If min(diff _H , diff _V ) is relatively large, it means that the texture of the CU in both the horizontal and vertical directions is relatively complex. In this case, the best division method is quadratic division. When the texture direction is relatively coherent and consistent, if the texture of the coding unit is mainly in the horizontal direction, usually diff _H <diff _V . Then the VTM is more inclined to divide the coding unit horizontally when coding. On the contrary, if the texture of the coding unit is mainly in the vertical direction, usually diff _H >diff _V , in this case, the VTM is more inclined to divide the coding unit vertically.

下面结合说明书附图1，详细描述本发明实施例的整体步骤流程：Below in conjunction with accompanying drawing 1 of the specification, the overall step flow of the embodiment of the present invention is described in detail:

由于尺寸32×32的编码单元是VTM启用二叉和三叉划分模式。与H.265/HEVC相同，H.266/VVC也是采用递归的方式，通过遍历所有可能的划分模式且通过率失真优化来获得最优的划分模式。本实施例对32×32的编码单元进行处理，通过删减该节点上一些冗余的划分模式来降低编码其的复杂度。具体地，本发明可以分为三个过程，具体的算法如下所述。Since the coding unit of size 32×32 is VTM, binary and trigeminal partition modes are enabled. Like H.265/HEVC, H.266/VVC also adopts a recursive manner, and obtains the optimal division mode by traversing all possible division modes and optimizing the rate-distortion. In this embodiment, the coding unit of 32×32 is processed, and the complexity of coding it is reduced by pruning some redundant division modes on the node. Specifically, the present invention can be divided into three processes, and the specific algorithm is as follows.

S1、通过MAD、MAD_H、MAD_V这三个指标提取编码单元的第一纹理信息texture₁，并将texture₁定义为texture₁＝min(MAD,MAD_H,MAD_V)S1. Extract the first texture information texture ₁ of the coding unit through the three indicators of MAD, MAD _H and MAD _V , and define texture ₁ as texture ₁ =min(MAD,MAD _H ,MAD _V )

然后，通过第一阈值th₁和第二阈值th₂将编码单元分为A、B和C三类。其中，A类的编码单元(即第一编码单元)像素表平坦，可以提前终止划分；B类编码单元(即第二编码单元)纹理复杂，后续执行三叉划分；C类编码单元(即第三编码单元)纹理复杂度介于A类和B类之间，此类编码单元后续只执行二叉或四叉划分模式。Then, the coding units are classified into three categories A, B, and C by the first threshold th ₁ and the second threshold th ₂ . Among them, the pixel table of the coding unit of type A (ie the first coding unit) is flat, and the division can be terminated in advance; the coding unit of type B (ie the second coding unit) has complex texture, and the three-point division is performed subsequently; the coding unit of type C (ie the third coding unit) Coding unit) The texture complexity is between A and B, and this type of coding unit only performs binary or quadratic partition mode subsequently.

具体地，如果texture₁<th1,那么，将该编码单元归类为A类；如果texture₁>th2，那么该编码单元归类为B类；如果th1<texture₁<th2，那么将该编码单元归类为C类。Specifically, if texture ₁ < th1, then the coding unit is classified as class A; if texture ₁ > th2, then the coding unit is classified as class B; if th1 < texture ₁ < th2, then the coding unit is classified as class B Classified as Class C.

S2、将B类的编码单元的划分模式进一步简化。S2. Further simplify the division mode of the coding unit of type B.

本实施例通过利用LEN_H和LEN_V这两个指标对纹理进行描述。当LEN_H>LEN_V时，确定第二编码单元执行三叉水平划分；当LEN_H<LEN_V时，确定第二编码单元执行三叉垂直划分。The present embodiment describes the texture by using two indicators, LEN _H and LEN _V. When LEN _H > LEN _V , it is determined that the second coding unit performs three-way horizontal division; when LEN _H < LEN _V , it is determined that the second coding unit performs three-way vertical division.

S3、将C类的编码单元的划分模式进一步简化。S3. Further simplify the division mode of the coding unit of the C type.

本实施例通过diff_H和diff_V对C类编码单元的第二纹理信息texture₂进行分析，同时将texture₂定义为texture₂＝min(diff_H,diff_V)In this embodiment, the second texture information texture ₂ of the C-type coding unit is analyzed through diff _H and diff _V , and texture ₂ is defined as texture ₂ =min(diff _H ,diff _V )

然后通过第三阈值th₃和第四阈值th₄将C类编码单元分成三类。若texture₂>th3，那么执行四叉划分；若texture₂<th4，那么对纹理进行二叉划分；最后，若th4<texture₂<th3，那么，四叉和二叉划分模式进行竞争。The C-type coding units are then divided into three categories by a third threshold th ₃ and a fourth threshold th ₄ . If texture ₂ > th3, then perform quadratic division; if texture ₂ < th4, then perform binary division on the texture; finally, if th4 < texture ₂ < th3, then the quad and binary division modes compete.

本发明实施例还提供了一种基于纹理特征的帧内划分系统，包括：An embodiment of the present invention also provides an intra-frame division system based on texture features, including:

根据本发明实施例还提供了一种存储介质，所述存储介质存储有程序，所述程序被处理器执行完成如图1所示的方法。According to an embodiment of the present invention, a storage medium is also provided, and the storage medium stores a program, and the program is executed by a processor to complete the method shown in FIG. 1 .

综上所述，本发明通过明确各个编码的划分模式，通过确定最优的划分模式，删减了冗余的划分模式，进而能够降低编码器的复杂度。To sum up, the present invention can reduce the complexity of the encoder by clarifying the division mode of each coding and by determining the optimal division mode.

在一些可选择的实施例中，在方框图中提到的功能/操作可以不按照操作示图提到的顺序发生。例如，取决于所涉及的功能/操作，连续示出的两个方框实际上可以被大体上同时地执行或所述方框有时能以相反顺序被执行。此外，在本发明的流程图中所呈现和描述的实施例以示例的方式被提供，目的在于提供对技术更全面的理解。所公开的方法不限于本文所呈现的操作和逻辑流程。可选择的实施例是可预期的，其中各种操作的顺序被改变以及其中被描述为较大操作的一部分的子操作被独立地执行。In some alternative implementations, the functions/operations noted in the block diagrams may occur out of the order noted in the operational diagrams. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/operations involved. Furthermore, the embodiments presented and described in the flowcharts of the present invention are provided by way of example in order to provide a more comprehensive understanding of the technology. The disclosed methods are not limited to the operations and logic flows presented herein. Alternative embodiments are contemplated in which the order of the various operations are altered and in which sub-operations described as part of larger operations are performed independently.

此外，虽然在功能性模块的背景下描述了本发明，但应当理解的是，除非另有相反说明，所述的功能和/或特征中的一个或多个可以被集成在单个物理装置和/或软件模块中，或者一个或多个功能和/或特征可以在单独的物理装置或软件模块中被实现。还可以理解的是，有关每个模块的实际实现的详细讨论对于理解本发明是不必要的。更确切地说，考虑到在本文中公开的装置中各种功能模块的属性、功能和内部关系的情况下，在工程师的常规技术内将会了解该模块的实际实现。因此，本领域技术人员运用普通技术就能够在无需过度试验的情况下实现在权利要求书中所阐明的本发明。还可以理解的是，所公开的特定概念仅仅是说明性的，并不意在限制本发明的范围，本发明的范围由所附权利要求书及其等同方案的全部范围来决定。Furthermore, while the invention is described in the context of functional modules, it is to be understood that, unless stated to the contrary, one or more of the described functions and/or features may be integrated in a single physical device and/or or software modules, or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be appreciated that a detailed discussion of the actual implementation of each module is not necessary to understand the present invention. Rather, given the attributes, functions, and internal relationships of the various functional modules in the apparatus disclosed herein, the actual implementation of the modules will be within the routine skill of the engineer. Accordingly, those skilled in the art, using ordinary skill, can implement the invention as set forth in the claims without undue experimentation. It is also to be understood that the specific concepts disclosed are illustrative only and are not intended to limit the scope of the invention, which is to be determined by the appended claims along with their full scope of equivalents.

所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、只读存储器(ROM，Read-Only Memory)、随机存取存储器(RAM，Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .

在流程图中表示或在此以其他方式描述的逻辑和/或步骤，例如，可以被认为是用于实现逻辑功能的可执行指令的定序列表，可以具体实现在任何计算机可读介质中，以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用，或结合这些指令执行系统、装置或设备而使用。就本说明书而言，“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in connection with an instruction execution system, apparatus, or apparatus.

计算机可读介质的更具体的示例(非穷尽性列表)包括以下：具有一个或多个布线的电连接部(电子装置)，便携式计算机盘盒(磁装置)，随机存取存储器(RAM)，只读存储器(ROM)，可擦除可编辑只读存储器(EPROM或闪速存储器)，光纤装置，以及便携式光盘只读存储器(CDROM)。另外，计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质，因为可以例如通过对纸或其他介质进行光学扫描，接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序，然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.

应当理解，本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中，多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如，如果用硬件来实现，和在另一实施方式中一样，可用本领域公知的下列技术中的任一项或他们的组合来实现：具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路，具有合适的组合逻辑门电路的专用集成电路，可编程门阵列(PGA)，现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施例，本领域的普通技术人员可以理解：在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型，本发明的范围由权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, The scope of the invention is defined by the claims and their equivalents.

以上是对本发明的较佳实施进行了具体说明，但本发明并不限于所述实施例，熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换，这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the described embodiments, and those skilled in the art can also make various equivalent deformations or replacements without departing from the spirit of the present invention, These equivalent modifications or substitutions are all included within the scope defined by the claims of the present application.

Claims

1. The method for dividing the frame based on the texture features is characterized by comprising the following steps:

extracting the minimum value of the first index, the second index and the third index to obtain first texture information, comparing the first texture information with a threshold value, and dividing a coding unit into a first coding unit, a second coding unit or a third coding unit according to the comparison result; the first coding unit is a coding unit capable of terminating partitioning, the second coding unit is a coding unit for executing trifurcate partitioning, and the third coding unit is a coding unit for executing binary or quaternary partitioning;

determining that the second coding unit performs trifurcated horizontal division or trifurcated vertical division according to the size comparison result of the fourth index and the fifth index;

extracting the minimum value of the sixth index and the seventh index to obtain second texture information, comparing the second texture information with a threshold value, and determining that the third coding unit executes quad partitioning or binary partitioning according to the comparison result;

wherein, the first index is used for measuring the pixel consistency of the coding unit;

the second index is used for measuring the texture complexity of the coding unit in the vertical direction;

the third index is used for measuring the texture complexity of the coding unit in the horizontal direction;

the fourth index is the absolute value of the difference between the first index and the second index;

the fifth index is the absolute value of the difference between the first index and the third index;

comparing the fourth index with the fifth index, and dividing the coding unit into four subblocks with the same size, wherein the fourth index is used for measuring the main direction of the texture in the coding unit;

the sixth index is obtained by calculation according to the second index of each sub-block;

the seventh index is obtained by calculation according to the third index of each sub-block;

and comparing the sixth index with the seventh index for measuring the consistency of the texture directions among the sub-blocks.

2. The texture feature-based intra partition method according to claim 1, wherein the step of extracting the first texture information of the coding unit by the first index, the second index and the third index and dividing the coding unit into the first coding unit, the second coding unit or the third coding unit comprises:

when the first texture information is smaller than a first threshold value, determining the coding unit as a first coding unit;

when the first texture information is larger than a second threshold value, determining the coding unit as a second coding unit;

and when the first texture information is larger than a first threshold and smaller than a second threshold, determining the coding unit as a third coding unit.

3. The texture feature-based intra partition method according to claim 1, wherein the step of determining the partition mode of the second coding unit according to a fourth index and a fifth index specifically comprises:

when the fourth index is larger than the fifth index, determining that the second coding unit executes trifurcate horizontal division;

and when the fourth index is smaller than the fifth index, determining that the second coding unit executes the three-fork vertical division.

4. The texture feature-based intra partition method according to claim 1, wherein the step of determining the partition mode of the third coding unit according to a sixth index and a seventh index specifically comprises:

calculating second texture information of the coding unit according to the sixth index and the seventh index;

when the second texture information is larger than a third threshold value, determining that the third coding unit executes quad-fork division;

when the second texture information is smaller than a fourth threshold value, determining that the third coding unit executes binary partitioning;

and when the second texture information is larger than a fourth threshold and smaller than a third threshold, determining that the third coding unit executes binary division or quad division.

5. The texture feature-based intra partition method according to claim 1, wherein the first index is determined according to a pixel value of each position on the coding unit, a height value of the coding unit, a width value of the coding unit, and a mean value of pixel values in the coding unit;

the calculation formula of the first index is as follows:

wherein p (x, y) represents a pixel value located at the position of the x-th column and the y-th row in the coding unit; u height of the coding unit; v represents the width of the coding unit; m represents the mean value of the pixel values in the coding unit.

6. The texture feature-based intra partition method according to claim 1, wherein the second index is determined according to pixel values of respective positions on the coding unit, a height value of the coding unit, a width value of the coding unit, and a mean value of pixel values of each row in the coding unit;

the calculation formula of the second index is as follows:

wherein m is_yIs the average of the pixels of the y-th row in the coding unit; p (x, y) represents a pixel value located at the position of the x-th column and the y-th row in the coding unit; u height of the coding unit; v represents the width of the coding unit;

the third index is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit and the average value of the pixel values of each column in the coding unit;

the calculation formula of the third index is as follows:

wherein m is_xIs the average of the pixels of the x-th column in the coding unit; p (x, y) represents a pixel value located at the position of the x-th column and the y-th row in the coding unit; u height of the coding unit; v represents the width of the coding unit.

7. The texture feature-based intra frame division method of claim I,

the fourth index is obtained by calculation according to the first index and the second index;

the fifth index is obtained by calculation according to the first index and the third index;

wherein the calculation formula of the fourth index is as follows:

LEN_H＝|MAD-MAD_H|

MAD represents a first index; MAD_HRepresents a second index;

wherein the calculation formula of the fifth index is:

LEN_V＝|MAD-MAD_V|

MAD represents a first index; MAD_VRepresenting a third index.

8. The texture feature-based intra frame division method of claim I,

wherein the calculation formula of the sixth index is:

diff_H＝｜MAD_H1-MAD_H2|+|MAD_H3-MAD_H4|

MAD_H1a second index representing the first sub-block; MAD_H2A second index representing a second sub-block; MAD_H3A second index representing a third sub-block; MAD_H4A second index representing a fourth sub-block;

the calculation formula of the seventh index is as follows:

diff_V＝|MAD_V1-MAD_V3|+|MAD_V2-MAD_V4|

MAD_V1a third index representing the first sub-block; MAD_V2A third index representing the second sub-block; MAD_V3A third index representing a third sub-block; MAD_V4Representing the third index of the fourth sub-block.

9. An intra-frame partitioning system based on texture features, comprising:

the extraction module is used for extracting the minimum value of the first index, the second index and the third index to obtain first texture information, comparing the first texture information with a threshold value, and dividing a coding unit into a first coding unit, a second coding unit or a third coding unit according to a comparison result; the first coding unit is a coding unit capable of terminating partitioning, the second coding unit is a coding unit for executing trifurcate partitioning, and the third coding unit is a coding unit for executing binary or quaternary partitioning;

the first division module is used for determining that the second coding unit executes trifurcate horizontal division or trifurcate vertical division according to the size comparison result of the fourth index and the fifth index;

the second dividing module is used for extracting the minimum value of the sixth index and the seventh index to obtain second texture information, comparing the second texture information with a threshold value, and determining that the third coding unit executes four-fork division or two-fork division according to the comparison result;

and comparing the sixth index with the seventh index, and measuring the consistency of the texture directions among the sub-blocks.

10. A storage medium, characterized in that the storage medium stores a program, which is executed by a processor to perform the method according to any one of claims 1-8.