CN111432212B - Intra-frame division method, system and storage medium based on texture feature - Google Patents
Intra-frame division method, system and storage medium based on texture feature Download PDFInfo
- Publication number
- CN111432212B CN111432212B CN202010351187.5A CN202010351187A CN111432212B CN 111432212 B CN111432212 B CN 111432212B CN 202010351187 A CN202010351187 A CN 202010351187A CN 111432212 B CN111432212 B CN 111432212B
- Authority
- CN
- China
- Prior art keywords
- index
- coding unit
- mad
- texture
- division
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/14—Coding unit complexity, e.g. amount of activity or edge presence estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
技术领域technical field
本发明涉及数据处理技术领域,尤其是基于纹理特征的帧内划分方法、系统及存储介质。The present invention relates to the technical field of data processing, in particular to an intra-frame division method, system and storage medium based on texture features.
背景技术Background technique
VVC/H.266是基于HEVC/H.265编码标准上进一步发展的下一代视频编码标准。VVC标准目前还在制定中,其继承了传统的大多数编码工具以及混合编码框架,包括帧内(间)预测,变换量化,熵编码和环路滤波,但在延续了大多数编码工具的同时也采纳了很多新技术,使得编码效率大幅提升。H.266/VVC的目标是为了与相同质量条件下的H.265/HEVC相比,H.266/VVC的编码码率降低为H.265/HEVC的一半。VVC/H.266 is a next-generation video coding standard based on the HEVC/H.265 coding standard. The VVC standard is still being developed, and it inherits most of the traditional coding tools and the hybrid coding framework, including intra (inter) prediction, transform quantization, entropy coding and loop filtering, but while continuing most coding tools Many new technologies have also been adopted, which greatly improves the coding efficiency. The goal of H.266/VVC is to reduce the coding rate of H.266/VVC to half that of H.265/HEVC compared with H.265/HEVC under the same quality condition.
为了更好地描述视频内容的纹理,H.266/VVC在H.265/HEVC的基础上引入了MTT划分,即基于QT划分的基础上,添加了BT和TT划分。其中,QT划分与H.265/HEVC标准中规定的相同,都是将CU划分为四个面积相等的子块,每个子节点的CU的长宽都是原来父节点CU的一半。而新引入的BT和TT划分都具有水平和垂直两个方向,其中BT划分是将CU划分为两个面积相等的子块,若是水平划分,那么子块的CU的长与父节点CU一致,而宽为父节点CU的一半,反之,若是垂直划分,那么子块的CU的宽与父节点CU一致,而长为父节点CU的一半。TT划分而是将CU划分为三个子块,其面积比例为1:2:1。In order to better describe the texture of video content, H.266/VVC introduces MTT division based on H.265/HEVC, that is, based on QT division, BT and TT division are added. Among them, the QT division is the same as that specified in the H.265/HEVC standard, in which the CU is divided into four sub-blocks of equal area, and the length and width of the CU of each child node are half of the original parent node CU. The newly introduced BT and TT divisions both have horizontal and vertical directions. The BT division is to divide the CU into two sub-blocks with equal areas. If it is divided horizontally, the length of the CU of the sub-block is the same as that of the parent node CU. The width is half of that of the parent node CU. On the contrary, if it is vertically divided, the width of the CU of the child block is the same as that of the parent node CU, and the length is half of the parent node CU. Instead, the TT partition divides the CU into three sub-blocks with an area ratio of 1:2:1.
除了引入新的划分模式之外,H.266/VVC还对帧内编码部分进行改进。首先H.266/VVC细化了角度模式,从HEVC的33种角度模式扩展到65种。除此之外,VVC还引入了宽角预测模式、PDPC预测模式、MRL预测模式和ISP预测模式等技术。新引入的预测模式可以更好地描述视频图像中的纹理,提高压缩效率。其中,图像的特征包括图像的颜色特征,纹理特征,形状特征和空间关系特征等,其可以综合描述一幅图像的内容和不同方面的性质,而图像的内容和性质与编码单元的划分的结果息息相关。In addition to introducing new division modes, H.266/VVC also improves the intra-frame coding part. First of all, H.266/VVC has refined the angle mode, expanding from 33 angle modes of HEVC to 65 kinds. In addition, VVC also introduces technologies such as wide-angle prediction mode, PDPC prediction mode, MRL prediction mode and ISP prediction mode. The newly introduced prediction mode can better describe the texture in the video image and improve the compression efficiency. Among them, the characteristics of the image include the color characteristics, texture characteristics, shape characteristics and spatial relationship characteristics of the image, which can comprehensively describe the content and properties of different aspects of an image, and the content and properties of the image are the result of the division of coding units. closely related.
但VVC在提高压缩效率的同时,也带来了巨大的复杂度,这对VVC投入实际应用带来了困难。However, while improving the compression efficiency, VVC also brings huge complexity, which brings difficulties to the practical application of VVC.
发明内容SUMMARY OF THE INVENTION
有鉴于此,本发明实施例提供一种能够提高降低编码器复杂度的,基于纹理特征的帧内划分方法、系统及存储介质。In view of this, embodiments of the present invention provide a texture feature-based intra-frame division method, system, and storage medium that can improve and reduce the complexity of an encoder.
本发明的第一方面提供了一种基于纹理特征的帧内划分方法,包括:A first aspect of the present invention provides an intra-frame division method based on texture features, including:
通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息,并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元;所述第一编码单元为可终止划分的编码单元,所述第二编码单元为执行三叉划分的编码单元,所述第三编码单元为执行二叉或四叉划分的编码单元;The first texture information of the coding unit is extracted by the first index, the second index and the third index, and the coding unit is divided into the first coding unit, the second coding unit or the third coding unit; the first coding unit is a coding unit that can terminate the division, the second coding unit is a coding unit that performs three-fork division, and the third coding unit is a coding unit that performs two-fork or four-fork division;
根据第四指标和第五指标,确定所述第二编码单元的划分模式;determining the division mode of the second coding unit according to the fourth index and the fifth index;
通过第六指标和第七指标,确定所述第三编码单元的划分模式;Determine the division mode of the third coding unit by the sixth index and the seventh index;
其中,所述第一指标用于确定编码单元的像素一致性;Wherein, the first index is used to determine the pixel consistency of the coding unit;
所述第二指标用于确定编码单元在垂直方向上的第一纹理信息;The second index is used to determine the first texture information of the coding unit in the vertical direction;
所述第三指标用于确定编码单元在水平方向上的第一纹理信息;The third index is used to determine the first texture information of the coding unit in the horizontal direction;
所述第四指标和所述第五指标用于确定编码单元的纹理方向;The fourth index and the fifth index are used to determine the texture direction of the coding unit;
所述第六指标和所述第七指标用于确定子块之间的纹理方向的一致性。The sixth index and the seventh index are used to determine the consistency of texture directions between sub-blocks.
在一些实施例中,所述通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息,并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元这一步骤,具体为:In some embodiments, the first texture information of the coding unit is extracted by using the first index, the second index and the third index, and the coding unit is divided into the first coding unit, the second coding unit or the third coding unit This step of the unit is as follows:
当所述第一纹理信息小于第一阈值,则将所述编码单元确定为第一编码单元;When the first texture information is less than a first threshold, determining the coding unit as the first coding unit;
当所述第一纹理信息大于第二阈值,则将所述编码单元确定为第二编码单元;When the first texture information is greater than a second threshold, determining the coding unit as a second coding unit;
当所述第一纹理信息大于第一阈值且小于第二阈值,则将所述编码单元确定为第三编码单元。When the first texture information is greater than a first threshold and less than a second threshold, the coding unit is determined as a third coding unit.
在一些实施例中,所述根据第四指标和第五指标,确定所述第二编码单元的划分模式这一步骤,具体为:In some embodiments, the step of determining the division mode of the second coding unit according to the fourth index and the fifth index is specifically:
当第四指标大于第五指标,确定所述第二编码单元执行三叉水平划分;When the fourth index is greater than the fifth index, it is determined that the second coding unit performs three-pronged horizontal division;
当第四指标小于第五指标,确定所述第二编码单元执行三叉垂直划分。When the fourth index is smaller than the fifth index, it is determined that the second coding unit performs three-pronged vertical division.
在一些实施例中,所述通过第六指标和第七指标,确定所述第三编码单元的划分模式这一步骤,具体为:In some embodiments, the step of determining the division mode of the third coding unit according to the sixth index and the seventh index is specifically:
根据第六指标和第七指标计算编码单元的第二纹理信息;calculating the second texture information of the coding unit according to the sixth index and the seventh index;
当所述第二纹理信息大于第三阈值,确定所述第三编码单元执行四叉划分;When the second texture information is greater than a third threshold, it is determined that the third coding unit performs quad division;
当所述第二纹理信息小于第四阈值,确定所述第三编码单元执行二叉划分;When the second texture information is less than a fourth threshold, determining that the third coding unit performs binary division;
当所述第二纹理信息大于第四阈值且小于第三阈值,确定所述第三编码单元执行二叉划分或四叉划分。When the second texture information is greater than a fourth threshold and less than a third threshold, it is determined that the third coding unit performs binary division or quadruple division.
在一些实施例中,所述第一指标根据编码单元上各个位置的像素值、编码单元的高度值、编码单元的宽度值以及编码单元中像素值的均值确定;In some embodiments, the first indicator is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit, and the mean value of the pixel values in the coding unit;
所述第一指标的计算公式为:The calculation formula of the first indicator is:
其中,p(x,y)代表编码单元中位于第x列和第y行的位置上的像素值;u该编码单元的高度;v代表该编码单元的宽度;m代表该编码单元中像素值的均值。Wherein, p(x,y) represents the pixel value at the position of the xth column and the yth row in the coding unit; u is the height of the coding unit; v represents the width of the coding unit; m represents the pixel value in the coding unit mean value of .
在一些实施例中,所述第二指标根据编码单元上各个位置的像素值、编码单元的高度值、编码单元的宽度值以及编码单元中每行像素值的均值确定;In some embodiments, the second index is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit, and the mean value of the pixel values of each row in the coding unit;
所述第二指标的计算公式为:The calculation formula of the second indicator is:
其中,my是该编码单元中第y行的像素的均值;p(x,y)代表编码单元中位于第x列和第y行的位置上的像素值;u该编码单元的高度;v代表该编码单元的宽度;Wherein, m y is the mean value of the pixels in the y-th row in the coding unit; p(x, y) represents the pixel value at the position of the x-th column and the y-th row in the coding unit; u is the height of the coding unit; v represents the width of the coding unit;
所述第三指标根据根据编码单元上各个位置的像素值、编码单元的高度值、编码单元的宽度值以及编码单元中每列像素值的均值确定;The third index is determined according to the pixel value of each position on the coding unit, the height value of the coding unit, the width value of the coding unit and the mean value of each column of pixel values in the coding unit;
所述第三指标的计算公式为:The calculation formula of the third indicator is:
其中,mx是该编码单元中第x列的像素的均值;p(x,y)代表编码单元中位于第x列和第y行的位置上的像素值;u该编码单元的高度;v代表该编码单元的宽度。Wherein, m x is the mean value of the pixel in the xth column in the coding unit; p(x,y) represents the pixel value at the position of the xth column and the yth row in the coding unit; u is the height of the coding unit; v Represents the width of this coding unit.
在一些实施例中,所述第四指标根据第一指标和第二指标计算得到;In some embodiments, the fourth index is calculated according to the first index and the second index;
所述第五指标根据第一指标和第三指标计算得到;The fifth index is calculated according to the first index and the third index;
其中,所述第四指标的计算公式为:Wherein, the calculation formula of the fourth indicator is:
LENH=|MAD-MADH|LEN H =|MAD-MAD H |
MAD代表第一指标;MADH代表第二指标;MAD represents the first indicator; MAD H represents the second indicator;
其中,所述第五指标的计算公式为:Wherein, the calculation formula of the fifth index is:
LENV=|MAD-MADV|LEN V = |MAD-MAD V |
MAD代表第一指标;MADV代表第三指标。MAD stands for the first indicator; MAD V stands for the third indicator.
在一些实施例中,所述第六指标根据各个子块的第二指标计算得到;In some embodiments, the sixth index is calculated according to the second index of each sub-block;
所述第七指标根据各个子块的第三指标计算得到;The seventh index is calculated according to the third index of each sub-block;
其中,所述第六指标的计算公式为:Wherein, the calculation formula of the sixth indicator is:
diffH=|MADH1-MADH2|+|MADH3-MADH4|diff H = |MAD H1 -MAD H2 |+|MAD H3 -MAD H4 |
MADH1代表第一子块的第二指标;MADH2代表第二子块的第二指标;MADH3代表第三子块的第二指标;MADH4代表第四子块的第二指标;MAD H1 represents the second indicator of the first sub-block; MAD H2 represents the second indicator of the second sub-block; MAD H3 represents the second indicator of the third sub-block; MAD H4 represents the second indicator of the fourth sub-block;
所述第七指标的计算公式为:The calculation formula of the seventh index is:
diffV=|MADV1-MADV3|+|MADV2-MADV4|diff V =|MAD V1 -MAD V3 |+|MAD V2 -MAD V4 |
MADV1代表第一子块的第三指标;MADV2代表第二子块的第三指标;MADV3代表第三子块的第三指标;MADV4代表第四子块的第三指标。MAD V1 represents the third indicator of the first sub-block; MAD V2 represents the third indicator of the second sub-block; MAD V3 represents the third indicator of the third sub-block; MAD V4 represents the third indicator of the fourth sub-block.
根据本发明的第二方面,还提供了一种基于纹理特征的帧内划分系统,包括:According to a second aspect of the present invention, there is also provided an intra-frame partitioning system based on texture features, including:
提取模块,用于通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息,并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元;所述第一编码单元为可终止划分的编码单元,所述第二编码单元为执行三叉划分的编码单元,所述第三编码单元为执行二叉或四叉划分的编码单元;an extraction module, configured to extract the first texture information of the coding unit through the first index, the second index and the third index, and divide the coding unit into the first coding unit, the second coding unit or the third coding unit; the The first coding unit is a coding unit that can terminate the division, the second coding unit is a coding unit that performs three-pronged division, and the third coding unit is a coding unit that performs binary or quadruple division;
第一划分模块,用于根据第四指标和第五指标,确定所述第二编码单元的划分模式;a first division module, configured to determine the division mode of the second coding unit according to the fourth index and the fifth index;
第二划分模块,用于通过第六指标和第七指标,确定所述第三编码单元的划分模式;a second division module, configured to determine the division mode of the third coding unit through the sixth index and the seventh index;
其中,所述第一指标用于确定编码单元的像素一致性;Wherein, the first index is used to determine the pixel consistency of the coding unit;
所述第二指标用于确定编码单元在垂直方向上的第一纹理信息;The second index is used to determine the first texture information of the coding unit in the vertical direction;
所述第三指标用于确定编码单元在水平方向上的第一纹理信息;The third index is used to determine the first texture information of the coding unit in the horizontal direction;
所述第四指标和所述第五指标用于确定编码单元的纹理方向。The fourth index and the fifth index are used to determine the texture direction of the coding unit.
根据本发明的第三方面,还提供了一种存储介质,所述存储介质存储有程序,所述程序被处理器执行完成如第一方面所述的方法。According to a third aspect of the present invention, a storage medium is also provided, and the storage medium stores a program, and the program is executed by a processor to complete the method according to the first aspect.
本发明的实施例通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息,并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元;其中,第一编码单元为可终止划分的编码单元;然后根据第四指标和第五指标,确定所述第二编码单元的划分模式;最后通过第六指标和第七指标,确定所述第三编码单元的划分模式。本发明通过明确各个编码的划分模式,通过确定最优的划分模式,删减了冗余的划分模式,进而能够降低编码器的复杂度。The embodiment of the present invention extracts the first texture information of the coding unit by using the first index, the second index and the third index, and divides the coding unit into the first coding unit, the second coding unit or the third coding unit; wherein , the first coding unit is a coding unit that can terminate the division; then according to the fourth index and the fifth index, determine the division mode of the second coding unit; finally through the sixth index and the seventh index, determine the third coding The division mode of the unit. By clarifying the division mode of each coding and determining the optimal division mode, the present invention deletes redundant division modes, thereby reducing the complexity of the encoder.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative effort.
图1为本发明实施例的整体步骤流程图;Fig. 1 is the overall step flow chart of the embodiment of the present invention;
图2为本发明实施例的四个子块示意图。FIG. 2 is a schematic diagram of four sub-blocks according to an embodiment of the present invention.
具体实施方式Detailed ways
下面结合说明书附图和具体实施例对本发明作进一步解释和说明。对于本发明实施例中的步骤编号,其仅为了便于阐述说明而设置,对步骤之间的顺序不做任何限定,实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。The present invention will be further explained and illustrated below in conjunction with the accompanying drawings and specific embodiments of the description. The step numbers in the embodiments of the present invention are set only for the convenience of elaboration, and the sequence between the steps is not limited, and the execution sequence of the steps in the embodiments can be performed according to the understanding of those skilled in the art Adaptive adjustment.
针对现有技术存在的问题,本发明采用7个指标对编码单元的纹理进行描述,分别是MAD、MADH、MADV、LENH、LENv、diffH和diffV,下面首先对这七个指标的定义与所代表的含义进行解释:In view of the problems existing in the prior art, the present invention uses seven indicators to describe the texture of the coding unit, which are MAD, MAD H , MAD V , LEN H , LEN v , diff H and diff V . The definitions of indicators and what they represent are explained:
1、第一指标:MAD;1. The first indicator: MAD;
其中,p(x,y)代表编码单元中位于(x,y),也就是第x列和第y行的位置上的像素值,u和v代表该编码单元的高度(Height)和宽度(Width),m代表该编码单元中像素值的均值,m的定义如下Among them, p(x,y) represents the pixel value located at (x,y) in the coding unit, that is, the pixel value at the position of the xth column and the yth row, and u and v represent the height (Height) and width ( Width), m represents the mean value of the pixel values in the coding unit, and m is defined as follows
其中,MAD用于描述编码单元的像素的一致性。MAD越大,表示该预测单元的纹理越复杂,需要进一步划分的概率也就越大,反之,MAD越小,说明该编码单元中像素值越相似,也就越倾向于终止划分。Among them, MAD is used to describe the consistency of the pixels of the coding unit. The larger the MAD, the more complex the texture of the prediction unit, and the greater the probability that further division is required. On the contrary, the smaller the MAD, the more similar the pixel values in the coding unit, and the more inclined to terminate the division.
2、第二指标:MADH;2. The second indicator: MAD H ;
MADH的定义如下MAD H is defined as follows
其中,my是该编码单元中第y行的像素的均值,其定义如下where m y is the mean of the pixels in the y-th row of the coding unit, which is defined as follows
MADH用于衡量该编码单元上垂直方向上的纹理,若MADH越大,那么说明该编码单元上的垂直方向的纹理比较复杂。MAD H is used to measure the texture in the vertical direction on the coding unit. If the MAD H is larger, it means that the texture in the vertical direction on the coding unit is more complicated.
3、第三指标:MADV;3. The third indicator: MAD V ;
MADV的定义如下MAD V is defined as follows
其中,mx是该编码单元中,第x列的像素的均值,其定义如下Among them, m x is the mean value of the pixels in the xth column in the coding unit, which is defined as follows
MADv用于衡量该编码单元上水平方向上的纹理,若MADv越大,那么说明该编码单元上的水平方向的纹理复杂。MAD v is used to measure the texture in the horizontal direction on the coding unit. If the MAD v is larger, it means that the texture in the horizontal direction on the coding unit is complex.
4、第四指标LENH和第五指标LENv;4. The fourth index LEN H and the fifth index LEN v ;
LENH和LENv的定义如下LEN H and LEN v are defined as follows
LENH=|MAD-MADH|LEN H =|MAD-MAD H |
LENV=|MAD-MADV|LEN V = |MAD-MAD V |
LENH和LENv用于衡量一个编码单元中,其纹理的主要方向。这是因为LENH定义为MADH与MAD差值的绝对值,而LENv定义为MADv与MAD差值的绝对值。若LENH>LENv,说明该编码单元上的MAD值主要是由于水平方向上的纹理引起的,也就说明该编码单元中,水平方上的纹理较多。H.266/VVC的编码器VTM在编码该单元时更倾向于进行水平二叉或者三叉划分。相反,若LENH<LENv,说明该编码单元上的MAD值主要是由于垂直方向上的纹理引起的,也就说明该编码单元中,垂直方上的纹理较多。VTM在编码该单元时更倾向于进行垂直二叉或者三叉划分。LEN H and LEN v are used to measure the main direction of the texture in a coding unit. This is because LEN H is defined as the absolute value of the difference between MAD H and MAD, and LEN v is defined as the absolute value of the difference between MAD v and MAD. If LEN H >LEN v , it means that the MAD value on the coding unit is mainly caused by the texture in the horizontal direction, which means that there are many textures in the horizontal direction in the coding unit. The encoder VTM of H.266/VVC is more inclined to perform horizontal binary or triple division when encoding the unit. On the contrary, if LEN H < LEN v , it means that the MAD value on the coding unit is mainly caused by the texture in the vertical direction, which means that there are more textures in the vertical direction in the coding unit. The VTM is more inclined to perform vertical binary or triple division when encoding this unit.
5、第六指标diffH和第七指标diffV;5. The sixth index diff H and the seventh index diff V ;
diffH和diffV定义如下diff H and diff V are defined as follows
diffH=|MADH1-MADH2|+|MADH3-MADH4|diff H = |MAD H1 -MAD H2 |+|MAD H3 -MAD H4 |
diffV=|MADV1-MADV3|+|MADV2-MADV4|diff V =|MAD V1 -MAD V3 |+|MAD V2 -MAD V4 |
在32×32大小的编码单元中,本发明的实施例将该CU划分为四个大小相同的子块,具体划分方法如图2所示,并且依次将左上、右上、左下和右下的这几个子块标记为1,2,3和4。将其划分为四个相同的子块后,还需要计算出每个子块的MAD、MADH和MADV值,且记为MADi、MADHi和MADVi,其中i为每个子块中的标号。In a coding unit with a size of 32×32, the embodiment of the present invention divides the CU into four sub-blocks of the same size. The specific division method is shown in FIG. 2 , and the upper left, upper right, lower left and lower right Several sub-blocks are labeled 1, 2, 3 and 4. After it is divided into four identical sub-blocks, the MAD, MAD H and MAD V values of each sub-block need to be calculated and marked as MAD i , MAD Hi and MAD Vi , where i is the label in each sub-block .
这两个指标用于衡量子块1,2,3和4的纹理方向是否一致。若min(diffH,diffV)比较大时,说明该CU在水平和垂直方向上的纹理都比较复杂,这种情况下,最佳的划分方式为四叉划分。而纹理方向比较连贯且一致的情况下,若编码单元的纹理主要为水平方向时,通常diffH<diffV。那么VTM在编码的时候更倾向于把该编码单元进行水平划分。反之,若编码单元的纹理主要为垂直方向是,通常diffH>diffV,这种情况下,VTM更倾向于把该编码单元进行垂直划分。These two indicators are used to measure whether the texture directions of
下面结合说明书附图1,详细描述本发明实施例的整体步骤流程:Below in conjunction with accompanying drawing 1 of the specification, the overall step flow of the embodiment of the present invention is described in detail:
由于尺寸32×32的编码单元是VTM启用二叉和三叉划分模式。与H.265/HEVC相同,H.266/VVC也是采用递归的方式,通过遍历所有可能的划分模式且通过率失真优化来获得最优的划分模式。本实施例对32×32的编码单元进行处理,通过删减该节点上一些冗余的划分模式来降低编码其的复杂度。具体地,本发明可以分为三个过程,具体的算法如下所述。Since the coding unit of size 32×32 is VTM, binary and trigeminal partition modes are enabled. Like H.265/HEVC, H.266/VVC also adopts a recursive manner, and obtains the optimal division mode by traversing all possible division modes and optimizing the rate-distortion. In this embodiment, the coding unit of 32×32 is processed, and the complexity of coding it is reduced by pruning some redundant division modes on the node. Specifically, the present invention can be divided into three processes, and the specific algorithm is as follows.
S1、通过MAD、MADH、MADV这三个指标提取编码单元的第一纹理信息texture1,并将texture1定义为texture1=min(MAD,MADH,MADV)S1. Extract the first texture information texture 1 of the coding unit through the three indicators of MAD, MAD H and MAD V , and define texture 1 as texture 1 =min(MAD,MAD H ,MAD V )
然后,通过第一阈值th1和第二阈值th2将编码单元分为A、B和C三类。其中,A类的编码单元(即第一编码单元)像素表平坦,可以提前终止划分;B类编码单元(即第二编码单元)纹理复杂,后续执行三叉划分;C类编码单元(即第三编码单元)纹理复杂度介于A类和B类之间,此类编码单元后续只执行二叉或四叉划分模式。Then, the coding units are classified into three categories A, B, and C by the first threshold th 1 and the second threshold th 2 . Among them, the pixel table of the coding unit of type A (ie the first coding unit) is flat, and the division can be terminated in advance; the coding unit of type B (ie the second coding unit) has complex texture, and the three-point division is performed subsequently; the coding unit of type C (ie the third coding unit) Coding unit) The texture complexity is between A and B, and this type of coding unit only performs binary or quadratic partition mode subsequently.
具体地,如果texture1<th1,那么,将该编码单元归类为A类;如果texture1>th2,那么该编码单元归类为B类;如果th1<texture1<th2,那么将该编码单元归类为C类。Specifically, if texture 1 < th1, then the coding unit is classified as class A; if texture 1 > th2, then the coding unit is classified as class B; if th1 < texture 1 < th2, then the coding unit is classified as class B Classified as Class C.
S2、将B类的编码单元的划分模式进一步简化。S2. Further simplify the division mode of the coding unit of type B.
本实施例通过利用LENH和LENV这两个指标对纹理进行描述。当LENH>LENV时,确定第二编码单元执行三叉水平划分;当LENH<LENV时,确定第二编码单元执行三叉垂直划分。The present embodiment describes the texture by using two indicators, LEN H and LEN V. When LEN H > LEN V , it is determined that the second coding unit performs three-way horizontal division; when LEN H < LEN V , it is determined that the second coding unit performs three-way vertical division.
S3、将C类的编码单元的划分模式进一步简化。S3. Further simplify the division mode of the coding unit of the C type.
本实施例通过diffH和diffV对C类编码单元的第二纹理信息texture2进行分析,同时将texture2定义为texture2=min(diffH,diffV)In this embodiment, the second texture information texture 2 of the C-type coding unit is analyzed through diff H and diff V , and texture 2 is defined as texture 2 =min(diff H ,diff V )
然后通过第三阈值th3和第四阈值th4将C类编码单元分成三类。若texture2>th3,那么执行四叉划分;若texture2<th4,那么对纹理进行二叉划分;最后,若th4<texture2<th3,那么,四叉和二叉划分模式进行竞争。The C-type coding units are then divided into three categories by a third threshold th 3 and a fourth threshold th 4 . If texture 2 > th3, then perform quadratic division; if texture 2 < th4, then perform binary division on the texture; finally, if th4 < texture 2 < th3, then the quad and binary division modes compete.
本发明实施例还提供了一种基于纹理特征的帧内划分系统,包括:An embodiment of the present invention also provides an intra-frame division system based on texture features, including:
提取模块,用于通过第一指标、第二指标和第三指标提取编码单元的第一纹理信息,并将所述编码单元划分为第一编码单元、第二编码单元或第三编码单元;所述第一编码单元为可终止划分的编码单元,所述第二编码单元为执行三叉划分的编码单元,所述第三编码单元为执行二叉或四叉划分的编码单元;an extraction module, configured to extract the first texture information of the coding unit through the first index, the second index and the third index, and divide the coding unit into the first coding unit, the second coding unit or the third coding unit; the The first coding unit is a coding unit that can terminate the division, the second coding unit is a coding unit that performs three-pronged division, and the third coding unit is a coding unit that performs binary or quadruple division;
第一划分模块,用于根据第四指标和第五指标,确定所述第二编码单元的划分模式;a first division module, configured to determine the division mode of the second coding unit according to the fourth index and the fifth index;
第二划分模块,用于通过第六指标和第七指标,确定所述第三编码单元的划分模式;a second division module, configured to determine the division mode of the third coding unit through the sixth index and the seventh index;
其中,所述第一指标用于确定编码单元的像素一致性;Wherein, the first index is used to determine the pixel consistency of the coding unit;
所述第二指标用于确定编码单元在垂直方向上的第一纹理信息;The second index is used to determine the first texture information of the coding unit in the vertical direction;
所述第三指标用于确定编码单元在水平方向上的第一纹理信息;The third index is used to determine the first texture information of the coding unit in the horizontal direction;
所述第四指标和所述第五指标用于确定编码单元的纹理方向。The fourth index and the fifth index are used to determine the texture direction of the coding unit.
根据本发明实施例还提供了一种存储介质,所述存储介质存储有程序,所述程序被处理器执行完成如图1所示的方法。According to an embodiment of the present invention, a storage medium is also provided, and the storage medium stores a program, and the program is executed by a processor to complete the method shown in FIG. 1 .
综上所述,本发明通过明确各个编码的划分模式,通过确定最优的划分模式,删减了冗余的划分模式,进而能够降低编码器的复杂度。To sum up, the present invention can reduce the complexity of the encoder by clarifying the division mode of each coding and by determining the optimal division mode.
在一些可选择的实施例中,在方框图中提到的功能/操作可以不按照操作示图提到的顺序发生。例如,取决于所涉及的功能/操作,连续示出的两个方框实际上可以被大体上同时地执行或所述方框有时能以相反顺序被执行。此外,在本发明的流程图中所呈现和描述的实施例以示例的方式被提供,目的在于提供对技术更全面的理解。所公开的方法不限于本文所呈现的操作和逻辑流程。可选择的实施例是可预期的,其中各种操作的顺序被改变以及其中被描述为较大操作的一部分的子操作被独立地执行。In some alternative implementations, the functions/operations noted in the block diagrams may occur out of the order noted in the operational diagrams. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/operations involved. Furthermore, the embodiments presented and described in the flowcharts of the present invention are provided by way of example in order to provide a more comprehensive understanding of the technology. The disclosed methods are not limited to the operations and logic flows presented herein. Alternative embodiments are contemplated in which the order of the various operations are altered and in which sub-operations described as part of larger operations are performed independently.
此外,虽然在功能性模块的背景下描述了本发明,但应当理解的是,除非另有相反说明,所述的功能和/或特征中的一个或多个可以被集成在单个物理装置和/或软件模块中,或者一个或多个功能和/或特征可以在单独的物理装置或软件模块中被实现。还可以理解的是,有关每个模块的实际实现的详细讨论对于理解本发明是不必要的。更确切地说,考虑到在本文中公开的装置中各种功能模块的属性、功能和内部关系的情况下,在工程师的常规技术内将会了解该模块的实际实现。因此,本领域技术人员运用普通技术就能够在无需过度试验的情况下实现在权利要求书中所阐明的本发明。还可以理解的是,所公开的特定概念仅仅是说明性的,并不意在限制本发明的范围,本发明的范围由所附权利要求书及其等同方案的全部范围来决定。Furthermore, while the invention is described in the context of functional modules, it is to be understood that, unless stated to the contrary, one or more of the described functions and/or features may be integrated in a single physical device and/or or software modules, or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be appreciated that a detailed discussion of the actual implementation of each module is not necessary to understand the present invention. Rather, given the attributes, functions, and internal relationships of the various functional modules in the apparatus disclosed herein, the actual implementation of the modules will be within the routine skill of the engineer. Accordingly, those skilled in the art, using ordinary skill, can implement the invention as set forth in the claims without undue experimentation. It is also to be understood that the specific concepts disclosed are illustrative only and are not intended to limit the scope of the invention, which is to be determined by the appended claims along with their full scope of equivalents.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in connection with an instruction execution system, apparatus, or apparatus.
计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.
应当理解,本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
尽管已经示出和描述了本发明的实施例,本领域的普通技术人员可以理解:在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, The scope of the invention is defined by the claims and their equivalents.
以上是对本发明的较佳实施进行了具体说明,但本发明并不限于所述实施例,熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换,这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the described embodiments, and those skilled in the art can also make various equivalent deformations or replacements without departing from the spirit of the present invention, These equivalent modifications or substitutions are all included within the scope defined by the claims of the present application.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010351187.5A CN111432212B (en) | 2020-04-28 | 2020-04-28 | Intra-frame division method, system and storage medium based on texture feature |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010351187.5A CN111432212B (en) | 2020-04-28 | 2020-04-28 | Intra-frame division method, system and storage medium based on texture feature |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111432212A CN111432212A (en) | 2020-07-17 |
CN111432212B true CN111432212B (en) | 2022-05-10 |
Family
ID=71554697
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010351187.5A Active CN111432212B (en) | 2020-04-28 | 2020-04-28 | Intra-frame division method, system and storage medium based on texture feature |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111432212B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112954321B (en) * | 2021-02-01 | 2022-09-16 | 中山大学 | An image feature intra-frame division method, device, device and medium |
CN113747177B (en) * | 2021-08-05 | 2023-06-20 | 中山大学 | Intra-frame coding speed optimization method, device and medium based on historical information |
CN115802044B (en) * | 2023-02-06 | 2023-08-18 | 深流微智能科技(深圳)有限公司 | Determination device, apparatus and storage medium for coding block division mode |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110446036B (en) * | 2019-07-29 | 2021-11-16 | 中南大学 | Coding unit rapid partitioning method based on intra-frame coding in multipurpose coding |
CN110650338B (en) * | 2019-09-20 | 2021-11-16 | 中山大学 | Method, system and storage medium for dividing multifunctional video coding frame |
CN110691254B (en) * | 2019-09-20 | 2022-01-18 | 中山大学 | Quick judgment method, system and storage medium for multifunctional video coding |
-
2020
- 2020-04-28 CN CN202010351187.5A patent/CN111432212B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN111432212A (en) | 2020-07-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111147867B (en) | Multifunctional video coding CU partition rapid decision-making method and storage medium | |
CN110290388B (en) | Intra-frame prediction method, video encoding method, computer device and storage device | |
CN110087087B (en) | VVC inter-frame coding unit prediction mode early decision and block division early termination method | |
CN111432212B (en) | Intra-frame division method, system and storage medium based on texture feature | |
CN105491376B (en) | Image encoding and decoding method and device | |
CN108712648B (en) | Rapid intra-frame coding method for depth video | |
JP5399416B2 (en) | Video coding system with reference frame compression | |
CN104754362B (en) | Image compression method using finely divided block matching | |
CN105430415B (en) | Fast encoding method in a kind of 3D HEVC deep video frames | |
CN104244007A (en) | Image compression method and device based on arbitrary shape matching | |
CN109510987B (en) | Method, device and coding equipment for determining the division mode of coding tree nodes | |
CN106688238A (en) | Improved Reference Pixel Selection and Filtering for Intra-frame Coding of Depth Maps | |
CN114827604B (en) | A method and system for CTU division within a high-efficiency video coding frame | |
CN109963151B (en) | Coding unit division determining method and device, terminal device and readable storage medium | |
EP3343446A1 (en) | Method and apparatus for encoding and decoding lists of pixels | |
CN110383837A (en) | Method and device for video processing | |
CN104811730A (en) | Video image intra-frame encoding unit texture analysis and encoding unit selection method | |
CN114040211B (en) | AVS 3-based intra-frame prediction fast decision method | |
Hu et al. | Deep inter prediction with error-corrected auto-regressive network for video coding | |
CN103856776A (en) | Image coding and decoding method for boundary and curvature drive repairing | |
CN105791863B (en) | 3D-HEVC depth map intra-frame predictive encoding method based on layer | |
CN115883835B (en) | Video coding method, device, equipment and storage medium | |
CN113115042B (en) | An intra-frame decision-making method, device, equipment and medium based on ISP optimization | |
CN110312129B (en) | Method and device for constructing most probable mode list, intra-frame prediction and coding | |
CN116016937A (en) | Sample point adaptive compensation method and device in video coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20250519 Address after: Ganquan road Shushan District of Hefei City, Anhui Province, Jiangmen 230000 Business Garden No. 81 floor B-1015 Patentee after: HEFEI MINGLONG ELECTRONIC TECHNOLOGY Co.,Ltd. Country or region after: China Address before: 510275 Xingang West Road, Guangdong, China, No. 135, No. Patentee before: SUN YAT-SEN University Country or region before: China |
|
TR01 | Transfer of patent right |