CN103517069B

CN103517069B - A kind of HEVC intra-frame prediction quick mode selection method based on texture analysis

Info

Publication number: CN103517069B
Application number: CN201310445775.5A
Authority: CN
Inventors: 张永飞; 李哲; 李波
Original assignee: Beihang University
Current assignee: Beijing Ckleader Software Technology Co ltd
Priority date: 2013-09-25
Filing date: 2013-09-25
Publication date: 2016-10-26
Anticipated expiration: 2033-09-25
Also published as: CN103517069A

Abstract

The present invention discloses a fast mode selection method for HEVC intra-frame prediction based on texture analysis. Before performing intra-frame prediction on the coding tree unit, the method uses the absolute value of the gradient in the four directions of horizontal, vertical, left-bottom, and right-bottom. and determine the main texture direction and texture complexity of each 4×4 unit in the coding tree unit, and determine the division of the current coding tree unit according to the principle that a larger coding unit is used in the texture smooth area and a smaller coding unit is used in the texture complex area. When predicting, according to the main texture direction of the prediction unit, several least likely prediction modes are excluded, and then rough mode selection and rate-distortion optimization mode selection are performed according to the HEVC coding standard. The fast mode selection method for HEVC intra-frame prediction based on texture analysis proposed by the present invention can significantly improve the encoding speed under the premise of ensuring the encoding quality.

Description

A Fast Mode Selection Method for HEVC Intra Prediction Based on Texture Analysis

技术领域technical field

本发明属于多媒体编码领域，具体涉及一种针对HEVC视频编码标准的帧内预测快速模式选择方法，是一种根据图像纹理分析进行编码单元划分和预测方向快速模式选择的方法。The invention belongs to the field of multimedia coding, and in particular relates to a fast mode selection method for intra-frame prediction aimed at the HEVC video coding standard, which is a method for dividing coding units and fast mode selection for prediction directions according to image texture analysis.

背景技术Background technique

随着新一代视频编码标准HEVC（High Efficiency Video Coding）的发布，视频的编码效率进一步提升。相比于以前的视频编码标准H.264/AVC，在相同编码质量的情况下，可以节省近一半的码率，然而这是以牺牲编码复杂度为代价的。如何在不改变码流结构和保持原有码率的前提下，最大程度的降低算法复杂度，显著提高编码速度，是HEVC大规模推广与应用的关键所在。With the release of the next-generation video coding standard HEVC (High Efficiency Video Coding), video coding efficiency has been further improved. Compared with the previous video coding standard H.264/AVC, under the same coding quality, it can save nearly half of the code rate, but this is at the expense of coding complexity. How to minimize the complexity of the algorithm and significantly improve the encoding speed without changing the code stream structure and maintaining the original code rate is the key to the large-scale promotion and application of HEVC.

帧内编码可以为帧间编码提供参考信息，并且可以阻止由误码引起的错误漂移，因此在视频编码中帧内编码是必不可少的。影响HEVC帧内编码耗时的主要有两个部分：编码单元的划分和预测方向的选择。Intra-frame coding can provide reference information for inter-frame coding, and can prevent error drift caused by bit errors, so intra-frame coding is essential in video coding. There are mainly two parts that affect the time-consuming of HEVC intra-frame encoding: the division of coding units and the selection of prediction directions.

针对编码单元的快速划分主要有两种方法，一是根据已编码单元的率失真代价利用贝叶斯等统计学习的方法确定编码单元的划分，然而离线的学习结果不能很好的通用于所有的视频序列，在线的学习方式又会引入额外的计算复杂度；二是根据当前编码单元的图像内容复杂度确定编码单元的划分，这种方法的关键在于计算图像复杂度以及选择合适的阈值。There are mainly two methods for fast division of coding units. One is to use Bayesian and other statistical learning methods to determine the division of coding units according to the rate-distortion cost of the coded units. However, the off-line learning results cannot be used well for all For video sequences, the online learning method will introduce additional computational complexity; the second is to determine the division of the coding unit according to the image content complexity of the current coding unit. The key to this method is to calculate the image complexity and select an appropriate threshold.

针对预测方向的快速模式选择主要有三种方法，一是根据相邻区域或者相邻帧间具有较强的相关性，利用周围编码单元的信息减少当前编码单元的预测方向候选集，然而实际应用中，经常出现场景切换或纹理丰富的视频，这种情况下时空相关性较弱，导致编码性能明显下降；二是利用最佳的预测方向与编码单元的纹理方向间较高的相关性，减少候选预测方向的数量；三是通过一种低复杂度的方式估计每一种预测模式的码率和失真降低率失真代价的计算复杂度，这种方法保留了所有的预测方向，并且没有增加额外的计算复杂度，但是由于自然图像视频序列的随机性较强，很难构造一个适用于所有视频序列的率失真估计模型，相比于第二种方法，这种方法的编码性能下降比较明显。There are three main methods for fast mode selection of the prediction direction. One is to use the information of the surrounding coding units to reduce the prediction direction candidate set of the current coding unit according to the strong correlation between adjacent regions or adjacent frames. However, in practical applications , there are often scene switching or texture-rich videos. In this case, the spatio-temporal correlation is weak, resulting in a significant drop in coding performance; the second is to use the high correlation between the best prediction direction and the texture direction of the coding unit to reduce candidate The number of prediction directions; the third is to estimate the code rate and distortion of each prediction mode in a low-complexity way to reduce the computational complexity of the rate-distortion cost. This method retains all prediction directions and does not add additional Computational complexity, but due to the strong randomness of natural image video sequences, it is difficult to construct a rate-distortion estimation model suitable for all video sequences. Compared with the second method, the coding performance of this method has dropped significantly.

在新一代视频编码标准中，HEVC引入了新的编码树单元（Coding Tree Unit,CTU）概念，用于替代现有视频编码标准中的宏块概念，另外还有编码单元、预测单元、变换单元等概念。编码树单元是一个递归的四叉树结构，深度可以从0到3，即编码单元的大小可以从64×64到8×8。对于最小的编码单元来说，HEVC的帧内编码又可以将其划分为4×4的预测单元，预测单元的大小可以从64×64到4×4。而对于帧内预测来说，HEVC对每一个预测单元都提供了35种预测模式。相比于现有视频编码标准H.264/AVC，编码单元划分和预测方向选择的优化问题更为复杂。本发明根据纹理分析构造当前编码树单元的图像复杂度描述，并据此快速确定编码单元的划分；利用纹理分析获得的纹理方向与预测方向间较强的相关性，减少预测模式的数量。In the new generation of video coding standards, HEVC introduces a new coding tree unit (Coding Tree Unit, CTU) concept, which is used to replace the concept of macroblocks in existing video coding standards. In addition, there are coding units, prediction units, and transformation units. and other concepts. The coding tree unit is a recursive quadtree structure, and the depth can range from 0 to 3, that is, the size of the coding unit can be from 64×64 to 8×8. For the smallest coding unit, HEVC intra-frame coding can divide it into 4×4 prediction units, and the size of the prediction unit can range from 64×64 to 4×4. For intra prediction, HEVC provides 35 prediction modes for each prediction unit. Compared with the existing video coding standard H.264/AVC, the optimization problems of coding unit division and prediction direction selection are more complicated. The invention constructs the image complexity description of the current coding tree unit according to the texture analysis, and quickly determines the division of the coding unit according to the texture analysis; utilizes the strong correlation between the texture direction obtained by the texture analysis and the prediction direction, and reduces the number of prediction modes.

发明内容Contents of the invention

为了克服现有技术的上述缺陷，本发明提出了一种基于纹理分析的HEVC帧内预测快速模式选择方法，该方法根据视频内容的复杂度确定编码树单元的划分，并根据图像的纹理方向减小预测方向的候选集。实验证明本方法能够很好的适应高清视频图像的实时压缩应用，大大提高了编码速度。In order to overcome the above-mentioned defects of the prior art, the present invention proposes a fast mode selection method for HEVC intra prediction based on texture analysis, which determines the division of coding tree units according to the complexity of the video content, and reduces Candidate set for small prediction directions. Experiments prove that this method can well adapt to the real-time compression application of high-definition video images, and greatly improves the encoding speed.

为实现上述目的，本发明采用的技术方案为：To achieve the above object, the technical solution adopted in the present invention is:

一种基于纹理分析的HEVC帧内预测快速模式选择方法，该方法步骤如下：A fast mode selection method for HEVC intra-frame prediction based on texture analysis, the method steps are as follows:

步骤（1）、对当前编码树单元，计算每个可能的编码单元的图像复杂度和主纹理方向；Step (1), for the current coding tree unit, calculate the image complexity and main texture direction of each possible coding unit;

步骤（2）、对当前编码树单元，按照Z扫描顺序从上往下重复执行步骤（3）至步骤（6），直至扫描完当前编码树单元；Step (2), for the current coding tree unit, repeat step (3) to step (6) from top to bottom according to the Z scanning order until the current coding tree unit is scanned;

步骤（3）、若当前编码单元的图像复杂度小于或等于阈值，则判定该编码单元为平滑单元，否则判定为复杂单元；Step (3), if the image complexity of the current coding unit is less than or equal to the threshold, it is determined that the coding unit is a smooth unit, otherwise it is determined as a complex unit;

步骤（4）、对于平滑的编码单元，利用步骤（5）计算当前编码单元的率失真代价；Step (4), for a smooth coding unit, use step (5) to calculate the rate-distortion cost of the current coding unit;

步骤（5）、对于当前编码单元中的每一个预测单元，保留该单元主纹理方向周围的8个预测方向以及planar预测和DC预测模式，按照HEVC视频编码标准进行粗略模式选择和率失真优化选择；Step (5), for each prediction unit in the current coding unit, retain the 8 prediction directions around the main texture direction of the unit, planar prediction and DC prediction mode, and perform rough mode selection and rate-distortion optimization selection according to the HEVC video coding standard ;

步骤（6）、对于复杂的编码单元，按照步骤（3）至步骤（6）依次递归扫描四个子单元，若四个子单元都判定为平滑单元，则利用步骤（5）计算当前编码单元的率失真代价，并根据率失真代价最小原则判定当前编码单元是否划分，同时更新阈值。Step (6), for complex coding units, follow steps (3) to (6) to recursively scan four sub-units in sequence, and if all four sub-units are determined to be smooth units, use step (5) to calculate the rate of the current coding unit Distortion cost, and determine whether the current coding unit is divided according to the principle of minimum rate-distortion cost, and update the threshold at the same time.

所述步骤（1）具体包括如下步骤：The step (1) specifically includes the following steps:

步骤（11）、对当前编码树单元中的每一个4×4的单元，分别沿水平、垂直、左下、右下四个方向计算梯度并计算梯度的绝对值和SAG；Step (11), for each 4×4 unit in the current coding tree unit, calculate the gradient along the horizontal, vertical, lower left, and lower right directions and calculate the absolute value of the gradient and SAG;

步骤（12）、将最小梯度绝对值和的梯度方向作为该单元的主纹理方向，所述主纹理方向是指该单元大部分像素的纹理方向；Step (12), taking the gradient direction of the minimum gradient absolute value sum as the main texture direction of the unit, and the main texture direction refers to the texture direction of most pixels of the unit;

步骤（13）、将主纹理方向与其正交方向上SAG差的绝对值定义为该单元的纹理复杂度，并利用量化步长与该单元大小的乘积作为初始阈值。Step (13), define the absolute value of the SAG difference between the main texture direction and its orthogonal direction as the texture complexity of the unit, and use the product of the quantization step size and the unit size as the initial threshold.

所述步骤（5）具体包括如下步骤：The step (5) specifically includes the following steps:

步骤（51）、定义一个候选预测模式数组；Step (51), defining an array of candidate prediction modes;

步骤（52）、根据步骤（1）获取当前预测单元的主纹理方向，并将主纹理方向及其周围的8个预测方向作为候选预测方向；Step (52), obtain the main texture direction of the current prediction unit according to step (1), and use the main texture direction and eight surrounding prediction directions as candidate prediction directions;

步骤（53）、将候选预测方向映射到HEVC编码标准中的角度预测模式，并将选择的角度预测模式和planar以及DC模式添加到候选预测模式数组；Step (53), mapping the candidate prediction direction to the angle prediction mode in the HEVC coding standard, and adding the selected angle prediction mode, planar and DC mode to the candidate prediction mode array;

步骤（54）、对候选预测模式数组中的每一个预测模式，按照HEVC编码标准进行粗略模式选择和率失真优化模式选择。Step (54), for each prediction mode in the candidate prediction mode array, perform rough mode selection and rate-distortion optimization mode selection according to the HEVC coding standard.

所述步骤（6）具体包括如下步骤：The step (6) specifically includes the following steps:

步骤（61）、按照步骤（3）至步骤（6）依次递归扫描四个子单元，若四个子单元都判定为平滑单元，则利用步骤（5）计算当前编码单元的率失真代价Cost_coarse，否则继续递归扫描下一层次的子单元；Step (61), recursively scan the four sub-units according to step (3) to step (6), if all four sub-units are determined to be smooth units, use step (5) to calculate the rate-distortion cost Cost _coarse of the current coding unit, otherwise Continue to recursively scan the subunits of the next level;

步骤（62）、若当前编码单元的率失真代价Cost_coarse小于或等于四个子单元的率失真代价和Cost_fine，则将阈值调大，否则将阈值调小。Step (62): If the rate-distortion cost Cost _coarse of the current coding unit is less than or equal to the rate-distortion cost and Cost _fine of the four sub-units, increase the threshold; otherwise, decrease the threshold.

所述步骤（62）中阈值调大的更新策略为：The update strategy for increasing the threshold value in the step (62) is:

TH′＝TH+α×(Cost_fine-Cost_coarse)/2TH'＝TH+α×(Cost _fine -Cost _coarse )/2

式中TH为更新前的阈值，TH′为更新后的阈值，α为纹理复杂度与率失真代价间的线性模型的乘法因子。where TH is the threshold before updating, TH' is the threshold after updating, and α is the multiplication factor of the linear model between texture complexity and rate-distortion cost.

所述步骤（62）中阈值调小的更新策略为：The update strategy for reducing the threshold value in the step (62) is:

TH′＝TH-aTH'=TH-a

式中TH为更新前的阈值，TH′为更新后的阈值,a为常数。In the formula, TH is the threshold before updating, TH' is the threshold after updating, and a is a constant.

本发明与现有技术相比的优点在于：The advantage of the present invention compared with prior art is:

1、本发明从纹理分析的角度出发，将编码树单元的划分与预测方向的选择结合考虑，利用梯度既能够提取纹理复杂度又可以提取纹理的方向，在保证编码质量的前提下，可以节省更多的编码时间。1. From the perspective of texture analysis, the present invention considers the division of coding tree units and the selection of prediction direction, and uses the gradient to extract both texture complexity and texture direction. On the premise of ensuring the coding quality, it can save More coding time.

2、本发明根据最终的率失真代价及时更新决定编码单元是否划分的阈值，能够充分利用编码树单元内相邻区域的强相关性，以减弱阈值选取对编码质量的影响。2. The present invention timely updates the threshold for determining whether to divide a coding unit according to the final rate-distortion cost, and can make full use of the strong correlation of adjacent regions in the coding tree unit to weaken the influence of threshold selection on coding quality.

附图说明Description of drawings

图1为基于纹理分析的HEVC帧内预测快速模式选择方法流程图；Fig. 1 is the flow chart of HEVC intra prediction fast mode selection method based on texture analysis;

图2为HEVC视频编码标准中的编码树单元划分结构图；FIG. 2 is a structural diagram of coding tree unit division in the HEVC video coding standard;

图3为方向梯度计算方法示意图；Fig. 3 is a schematic diagram of the method for calculating the direction gradient;

图4为基于梯度的编码单元快速划分详细流程图。FIG. 4 is a detailed flow chart of gradient-based fast division of coding units.

具体实施方式detailed description

下面结合附图和具体实施方式对本发明作进一步的说明。The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

为了满足高清视频图像的实时压缩应用需求，提高HEVC视频编码器的编码速度，本发明提出了一种基于纹理分析的HEVC帧内预测快速模式选择方法。该方法主要包括两部分：基于纹理复杂度的编码单元快速划分和基于纹理方向的预测模式快速选择，编码单元的快速划分涉及发明内容中的步骤（1）至步骤（4）以及步骤（6），预测模式的快速选择涉及发明内容中的步骤（5）。首先利用梯度算子计算当前编码树单元中各子单元的复杂度，确定一个较细的编码单元划分，再根据该划分确定一个较粗的编码单元划分，最终根据后续帧内预测的率失真代价确定最终的编码单元划分；在选择最佳预测方向的过程中，排除掉与主纹理方向相差较远的预测模式，以节省帧内预测的时间。下面展开具体说明。In order to meet the real-time compression application requirements of high-definition video images and improve the encoding speed of HEVC video encoders, the present invention proposes a fast mode selection method for HEVC intra-frame prediction based on texture analysis. The method mainly includes two parts: fast partitioning of coding units based on texture complexity and fast selection of prediction mode based on texture direction. The fast partitioning of coding units involves steps (1) to (4) and (6) in the summary of the invention. , the quick selection of the prediction mode involves step (5) in the summary of the invention. First, use the gradient operator to calculate the complexity of each sub-unit in the current coding tree unit, determine a finer coding unit division, and then determine a coarser coding unit division according to the division, and finally according to the rate-distortion cost of the subsequent intra-frame prediction Determine the final coding unit division; in the process of selecting the best prediction direction, exclude the prediction modes that are far from the main texture direction, so as to save the time of intra prediction. The following is a detailed description.

1.基于纹理复杂度的编码单元快速划分1. Fast division of coding units based on texture complexity

新一代视频编码标准HEVC引入了编码树单元(Coding Tree Unit,CTU)概念，一帧视频由多个互不重叠的编码树单元组成。默认情况下，编码树单元的大小为64×64。编码树单元是一个四叉树形式的递归结构，它可以灵活的划分为多个层次的编码单元，如图1所示。编码单元的大小可以从64×64到8×8。在HEVC标准的参考编码器中，利用率失真优化选择的方法对每一种可能的划分都进行一次预测、变换、量化、熵编码、反量化、反变换、重建等过程，相比于H.264/AVC，编码复杂度大幅提升，严重制约了HEVC的实际应用。The new-generation video coding standard HEVC introduces the concept of Coding Tree Unit (CTU). A frame of video consists of multiple non-overlapping coding tree units. By default, the size of the coding tree unit is 64×64. The coding tree unit is a recursive structure in the form of a quadtree, which can be flexibly divided into multiple levels of coding units, as shown in FIG. 1 . The size of a coding unit can range from 64×64 to 8×8. In the reference encoder of the HEVC standard, the method of using rate-distortion optimization selection performs a process of prediction, transformation, quantization, entropy coding, inverse quantization, inverse transformation, and reconstruction for each possible division. Compared with H. 264/AVC, the coding complexity is greatly increased, which seriously restricts the practical application of HEVC.

通过实验统计发现，编码单元的大小与图像内容的纹理复杂度密切相关。纹理简单的区域采用较大的编码单元，而纹理复杂的区域采用较小的编码单元。纹理复杂度的度量方式有多种，如方差、梯度、熵等。梯度不仅包含图像的纹理复杂度信息，还包含了图像的纹理方向信息，可以协助快速确定预测单元的预测模式。基于此，本发明提出一种基于梯度的编码单元快速划分方法。Through experimental statistics, it is found that the size of the coding unit is closely related to the texture complexity of the image content. Regions with simple textures adopt larger coding units, while regions with complex textures adopt smaller coding units. There are many ways to measure texture complexity, such as variance, gradient, entropy, etc. The gradient contains not only the texture complexity information of the image, but also the texture direction information of the image, which can help to quickly determine the prediction mode of the prediction unit. Based on this, the present invention proposes a gradient-based fast coding unit division method.

在介绍编码单元快速划分的实施方式之前，首先给出本发明中所使用的梯度的计算方法以及主纹理方向、纹理复杂度的定义。本发明中的梯度是指像素亮度值沿水平、垂直、左下、右下等方向获得的方向导数，如图3所示。计算公式如（1）—（4）式所示。Before introducing the implementation of fast division of coding units, the calculation method of gradient used in the present invention and the definition of main texture direction and texture complexity are firstly given. The gradient in the present invention refers to the directional derivative of the pixel brightness value obtained along the horizontal, vertical, lower left, lower right directions, etc., as shown in FIG. 3 . The calculation formula is shown in formula (1)-(4).

$\{\begin{matrix} {g g}_{h h} ((x x,, y the y)) = = p p ((x x,, y the y)) - - p p ((x x + + 22,, y the y)) \\ {g g}_{h h} ((x x + + 11,, y the y)) = = p p ((x x + + 11,, y the y)) - - p p ((x x + + 33,, y the y)) \\ {g g}_{h h} ((x x,, y the y + + 11)) = = p p ((x x,, y the y + + 22)) - - p p ((x x + + 22,, y the y + + 22)) \\ {g g}_{h h} ((x x + + 11,, y the y + + 11)) = = p p ((x x + + 11,, y the y + + 22)) - - p p ((x x + + 33,, y the y + + 22)) \end{matrix} - - - - - - ((11))$

$\{\begin{matrix} {g g}_{v v} ((x x,, y the y)) = = p p ((x x,, y the y)) - - p p ((x x,, y the y + + 22)) \\ {g g}_{v v} ((x x + + 11,, y the y)) = = p p ((x x + + 22,, y the y)) - - p p ((x x + + 22,, y the y + + 22)) \\ {g g}_{v v} ((x x,, y the y + + 11)) = = p p ((x x + + 11,, y the y + + 11)) - - p p ((x x + + 11,, y the y + + 33)) \\ {g g}_{v v} ((x x + + 11,, y the y + + 11)) = = p p ((x x + + 33,, y the y + + 11)) - - p p ((x x + + 33,, y the y + + 33)) \end{matrix} - - - - - - ((22))$

$\{\begin{matrix} {g g}_{ld ld} ((x x,, y the y)) = = \sqrt{22} [[p p ((x x,, y the y)) - - p p ((x x + + 22,, y the y + + 22))]] \\ {g g}_{ld ld} ((x x + + 11,, y the y)) = = \sqrt{22} [[p p ((x x + + 11,, y the y)) - - p p ((x x + + 33,, y the y + + 22))]] \\ {g g}_{ld ld} ((x x,, y the y + + 11)) = = \sqrt{22} [[p p ((x x,, y the y + + 11)) - - p p ((x x + + 22,, y the y + + 33))]] \\ {g g}_{ld ld} ((x x + + 11,, y the y + + 11)) = = \sqrt{22} [[p p ((x x + + 11,, y the y + + 11)) - - p p ((x x + + 33,, y the y + + 33))]] \end{matrix} - - - - - - ((33))$

$\{\begin{matrix} {g g}_{rd rd} ((x x,, y the y)) = = \sqrt{22} [[p p ((x x + + 22,, y the y)) - - p p ((x x,, y the y + + 22))]] \\ {g g}_{rd rd} ((x x + + 11,, y the y)) = = \sqrt{22} [[p p ((x x + + 33,, y the y)) - - p p ((x x + + 11,, y the y + + 22))]] \\ {g g}_{rd rd} ((x x,, y the y + + 11)) = = \sqrt{22} [[p p ((x x + + 22,, y the y + + 11)) - - p p ((x x,, y the y + + 33))]] \\ {g g}_{rd rd} ((x x + + 11,, y the y + + 11)) = = \sqrt{22} [[p p ((x x + + 33,, y the y + + 11)) - - p p ((x x + + 11,, y the y + + 33))]] \end{matrix} - - - - - - ((44))$

式中g_h(·)、g_v(·)、g_ld(·)、g_rd(·)分别表示水平、垂直、左下、右下方向上的梯度滤波器，p(·)表示当前编码树单元中的亮度分量像素值，(x,y)表示每个4×4单元左上角像素的坐标。本发明中利用SAG表示沿某一方向的梯度绝对值和，计算公式如（5）式所示。where g _h ( ), g _v ( ), g _ld ( ), and g _rd ( ) represent the gradient filters in the horizontal, vertical, lower-left, and lower-right directions, respectively, and p( ) represents the current coding tree unit The pixel value of the luminance component in , (x, y) represents the coordinates of the upper left corner pixel of each 4×4 unit. In the present invention, SAG is used to represent the absolute sum of gradients along a certain direction, and the calculation formula is shown in formula (5).

SAG_dir＝∑|g_dir(x,y)|,dir∈{h,v,ld,rd} （5）SAG _dir =∑|g _dir (x,y)|,dir∈{h,v,ld,rd} (5)

将最小梯度绝对值和的方向作为主纹理方向，主纹理方向的梯度绝对值和记作SAG_min，其正交方向的梯度绝对值和记作SAG_org。纹理复杂度complexity定义为SAG_org与SAG_min间的差值，即：The direction of the minimum gradient absolute value sum is taken as the main texture direction, the gradient absolute value sum of the main texture direction is denoted as SAG _min , and the gradient absolute value sum of its orthogonal direction is denoted as SAG _org . The texture complexity complexity is defined as the difference between SAG _org and SAG _min , namely:

complexity＝SAG_org-SAG_min （6）complexity = SAG _org - SAG _min (6)

划分的详细流程如图4所示。对每一个编码树单元，首先将其划分为N×N个4×4的单元，对每一个单元分别计算水平、垂直、左下、右下方向上的梯度值。然后对编码树单元从64×64到8×8递归的计算各编码单元的纹理复杂度以及划分，最终确定整个编码树单元的划分。递归过程描述如下：The detailed process of partitioning is shown in Figure 4. For each coding tree unit, it is firstly divided into N×N 4×4 units, and the gradient values in the horizontal, vertical, lower left, and lower right directions are calculated for each unit. Then recursively calculate the texture complexity and division of each coding tree unit from 64×64 to 8×8, and finally determine the division of the entire coding tree unit. The recursive process is described as follows:

（1）对当前编码单元，利用（5）式分别计算四个方向上的梯度绝对值和SAG_h、SAG_v、SAG_ld、SAG_rd，并获取当前编码单元的主纹理方向和纹理复杂度complexity。(1) For the current coding unit, use formula (5) to calculate the absolute value of the gradient in the four directions and SAG _h , SAG _v , SAG _ld , SAG _rd , and obtain the main texture direction and texture complexity complexity of the current coding unit .

（2）若当前编码单元的纹理复杂度complexity小于阈值TH，直接将当前编码单元设置为平滑区域，并计算率失真代价，不再进行划分，返回上一层递归。(2) If the texture complexity complexity of the current coding unit is less than the threshold TH, directly set the current coding unit as a smooth area, and calculate the rate-distortion cost, no longer divide, and return to the previous layer of recursion.

（3）否则，检查当前编码单元是否可以划分，若不可以划分，则计算率失真代价并返回上一层递归。(3) Otherwise, check whether the current coding unit can be divided, if not, calculate the rate-distortion cost and return to the previous layer of recursion.

（4）否则，将当前编码单元划分为四个子单元，依次进入下一层递归。(4) Otherwise, divide the current coding unit into four subunits, and enter the next layer of recursion in turn.

（5）检查四个子单元是否都被确定为平滑区域，如是平滑区域则计算当前编码单元的率失真代价，否则转步骤（8）。(5) Check whether all four sub-units are determined to be smooth areas, if they are smooth areas, calculate the rate-distortion cost of the current coding unit, otherwise go to step (8).

（6）若当前编码单元的率失真代价小于或等于四个子单元的率失真代价和，则当前编码单元不划分，同时将阈值TH调大。(6) If the rate-distortion cost of the current coding unit is less than or equal to the sum of the rate-distortion costs of the four sub-units, the current coding unit is not divided, and the threshold TH is increased.

（7）若当前编码单元的率失真代价大于四个子单元的率失真代价和，则当前编码单元划分，同时将阈值TH微调小。(7) If the rate-distortion cost of the current coding unit is greater than the sum of the rate-distortion costs of the four sub-units, the current coding unit is divided, and the threshold TH is fine-tuned at the same time.

（8）检测是否有未编码单元，若有则返回上一层递归，否则退出递归。(8) Detect whether there are unencoded units, if so, return to the previous level of recursion, otherwise exit the recursion.

一般说来，现有视频编码标准所对应的编码器产生的失真主要是由量化引起的，若相邻像素间的差异小于量化步长，则说明这一差异经过量化后就不存在了。因此可以通过量化步长设定纹理复杂度的阈值，然而视频编码标准中的量化步长是针对变换域的。本发明根据空域和变换域间的能量守恒定律，将递归过程中的初始阈值定义为当前编码单元的大小与量化步长的乘积，即TH＝CUsize×QPstep。Generally speaking, the distortion produced by the encoder corresponding to the existing video coding standards is mainly caused by quantization. If the difference between adjacent pixels is smaller than the quantization step size, it means that the difference will not exist after quantization. Therefore, the threshold of texture complexity can be set through the quantization step size, but the quantization step size in the video coding standard is for the transform domain. According to the law of energy conservation between space domain and transform domain, the present invention defines the initial threshold in the recursive process as the product of the size of the current coding unit and the quantization step, ie TH=CUsize×QPstep.

阈值TH调大的更新策略为：TH′＝TH+α×(Cost_fine-Cost_coarse)/2，这里的α为纹理复杂度与率失真代价间的线性模型的乘法因子。The update strategy for increasing the threshold TH is: TH′=TH+α×(Cost _fine -Cost _coarse )/2, where α is the multiplication factor of the linear model between the texture complexity and the rate-distortion cost.

阈值TH微调小的更新策略为：TH′为TH′＝TH-a，这里的a为常数。The update strategy for fine-tuning the threshold TH is as follows: TH' is TH'=TH-a, where a is a constant.

这样就可以根据纹理复杂度以及率失真代价获得编码树单元的划分。In this way, the division of coding tree units can be obtained according to texture complexity and rate-distortion cost.

2.基于纹理方向的预测模式快速选择2. Fast selection of prediction mode based on texture direction

相比于H.264/AVC，新一代视频编码标准HEVC引入了预测单元(Prediction Unit,PU)的概念。对于帧内预测来说，预测单元有两种划分方式。对于最小的编码单元，预测单元可以与编码单元相同，也可以在编码单元基础上进一步四叉分；而对于其余的编码单元，预测单元只能与编码单元相同。同时HEVC还提供了更为精细的预测方向，每一个预测单元都有35种预测模式，大大增加了编码复杂度。Compared with H.264/AVC, HEVC, a new generation of video coding standard, introduces the concept of prediction unit (Prediction Unit, PU). For intra prediction, there are two ways to divide the prediction unit. For the smallest coding unit, the prediction unit can be the same as the coding unit, or can be further divided into four branches on the basis of the coding unit; while for the remaining coding units, the prediction unit can only be the same as the coding unit. At the same time, HEVC also provides a more refined prediction direction. Each prediction unit has 35 prediction modes, which greatly increases the coding complexity.

提高编码效率的根本在于在保证编码质量的前提下，尽量减少候选预测模式。通常情况下，最终选择的预测模式与纹理方向相近。表1给出了各种预测单元中最佳预测模式与纹理方向相近的概率。从表中可以看出，最佳预测模式与纹理方向相近的概率可以达到80%以上。另外，在本发明的第一步编码单元的快速划分中，已经获得了每一个编码单元的主纹理方向。因此，可以利用这一中间结果尽量减少候选预测模式的数量，从而提高编码效率。需要说明的是，对于所有的预测单元，HEVC所提供的planar和DC预测模式都保留，本发明只考虑减少角度预测的候选预测方向。The root of improving the coding efficiency is to minimize the number of candidate prediction modes under the premise of ensuring the coding quality. Usually, the final selected prediction mode is close to the texture direction. Table 1 gives the probability that the best prediction mode is close to the texture direction in various prediction units. It can be seen from the table that the probability that the best prediction mode is similar to the texture direction can reach more than 80%. In addition, in the first step of fast division of coding units in the present invention, the main texture direction of each coding unit has been obtained. Therefore, this intermediate result can be used to minimize the number of candidate prediction modes, thereby improving coding efficiency. It should be noted that, for all prediction units, the planar and DC prediction modes provided by HEVC are reserved, and the present invention only considers the candidate prediction directions of reduced angle prediction.

表1 各预测单元在不同量化步长下最佳预测模式与纹理方向相近的概率Table 1 The probability that the best prediction mode of each prediction unit is similar to the texture direction under different quantization steps

量化步长Quantization step size 64×6464×64 32×3232×32 16×1616×16 8×88×8 4×44×4 22twenty two 100%100% 79.63%79.63% 80.76%80.76% 84.32%84.32% 87.45%87.45% 2727 86.36%86.36% 78.80%78.80% 83.64%83.64% 87.67%87.67% 89.24%89.24% 3232 80.43%80.43% 82.21%82.21% 89.47%89.47% 88.88%88.88% 89.86%89.86% 3737 78.67%78.67% 86.53%86.53% 89.83%89.83% 89.20%89.20% 89.06%89.06% 平均average 86.37%86.37% 81.79%81.79% 85.93%85.93% 87.52%87.52% 88.90%88.90%

由于最佳预测模式与纹理方向相近的概率较高，本发明利用主纹理方向周围的几个预测方向进行角度预测。具体实施方式如下：Since the probability that the best prediction mode is close to the texture direction is high, the present invention uses several prediction directions around the main texture direction for angle prediction. The specific implementation is as follows:

（1）定义一个大小为11的候选预测模式数组，用于存储筛选的候选预测模式，其中包括planar模式、DC模式、主纹理方向以及主纹理方向周围的若干预测方向。在本发明的具体实施中，在主纹理方向两侧各取了4个预测方向。(1) Define an array of candidate prediction modes with a size of 11, which is used to store the selected candidate prediction modes, including planar mode, DC mode, main texture direction and several prediction directions around the main texture direction. In the specific implementation of the present invention, four prediction directions are taken on both sides of the main texture direction.

（2）获取当前预测单元的主纹理方向所对应的预测模式编号mode_texture，并利用公式（7）计算出角度预测范围[mode_left,mode_right]。(2) Obtain the prediction mode number mode _texture corresponding to the main texture direction of the current prediction unit, and use the formula (7) to calculate the angle prediction range [mode _left t, mode _right ].

$\{\begin{matrix} {mode mode}_{left left} = = {mode mode}_{texture texture} - - 44 \\ {mode mode}_{right right} = = {mode mode}_{texture texture} + + 44 \end{matrix} - - - - - - ((77))$

（3）利用公式（8）获取主纹理方向左侧的预测模式。(3) Use formula (8) to obtain the prediction mode on the left side of the main texture direction.

$\{\begin{matrix} mode mode Idx Idx = = mode mode + + 3333 & mode mode < < 22 \\ mode mode Idx Idx = = mode mode & mode mode &GreaterEqual; &Greater Equal; 22 \end{matrix},, mode mode &Element; &Element; [[{mode mode}_{left left},, {mode mode}_{texture texture} - - 11]] - - - - - - ((88))$

（4）利用公式（9）获取主纹理方向右侧的预测模式。(4) Use formula (9) to obtain the prediction mode on the right side of the main texture direction.

$\{\begin{matrix} mode mode Idx Idx = = mode mode - - 3333 & mode mode > > 3434 \\ mode mode Idx Idx = = mode mode & mode mode \leq \leq 3434 \end{matrix},, mode mode &Element; &Element; [[{mode mode}_{texture texture} + + 11,, {mode mode}_{right right}]] - - - - - - ((99))$

（5）对筛选的候选预测模式集合{modeIdx}，存储到候选预测模式数组中。按照HEVC标准中的粗略模式选择和率失真优化选择从中选取最佳的预测模式。(5) Store the filtered candidate prediction mode set {modeIdx} in the candidate prediction mode array. Select the best prediction mode according to the coarse mode selection and rate-distortion optimization selection in the HEVC standard.

以上公开的仅为本发明的具体实施例。根据本发明提供的技术思想，本领域的技术人员所能思及的变化，都应落入本发明的保护范围内。The above disclosures are only specific embodiments of the present invention. According to the technical ideas provided by the present invention, all changes conceivable by those skilled in the art shall fall within the protection scope of the present invention.

Claims

1. a HEVC intra-frame prediction quick mode selection method based on texture analysis, it is characterised in that: the method step As follows:

Step (1), to present encoding tree unit, calculate the image complexity of each possible coding unit and main grain direction；

Step (2), to present encoding tree unit, according to Z scanning sequency repeated execution of steps from top to bottom (3) to step (6), Until scanning through present encoding tree unit；

Step (3) if the image complexity of current coded unit is less than or equal to threshold value, then judges that this coding unit is as smooth single Unit, is otherwise judged to complex unit；

Step (4), for smooth coding unit, utilize step (5) to calculate the rate distortion costs of current coded unit；

Step (5), for each predicting unit in current coded unit, only retain 8 around the main grain direction of this unit Individual prediction direction and planar prediction and DC predictive mode, according to HEVC video encoding standard carry out coarse mode select and Rate-distortion optimization selects；

Step (6), for complicated coding unit, according to step (3) to four subelements of step (6) successively recursive scanning, If four subelements are all judged to smooth unit, then step (5) is utilized to calculate the rate distortion costs of current coded unit, and root Judge whether current coded unit divides, and updates threshold value simultaneously according to rate distortion costs minimum principle；

Described step (6) specifically includes following steps:

Step (61), according to step (3) to four subelements of step (6) successively recursive scanning, if four subelements are all sentenced It is set to smooth unit, then utilizes step (5) to calculate rate distortion costs Cost of current coded unit_coarse, otherwise continue recurrence Scan the subelement of next level；

Step (62) if rate distortion costs Cost of current coded unit_coarseRate distortion generation less than or equal to four subelements Valency and Cost_fine, then threshold value is tuned up, otherwise threshold value is turned down；

The more New Policy that in described step (62), threshold value tunes up is:

TH '=TH+ α × (Cost_fine-Cost_coarse)/2

In formula, TH is the threshold value before updating, and TH ' is the threshold value after updating, and α is the line between Texture complication and rate distortion costs The multiplication factor of property model.

2. HEVC intra-frame prediction quick mode selection method based on texture analysis as claimed in claim 1, its feature exists Following steps are specifically included in: described step (1):

Step (11), to each unit of 4 × 4 in present encoding tree unit, respectively along level, vertical, lower-left, the right side Lower four direction calculates gradient and calculates absolute value and the SAG of gradient；

Step (12), using the gradient direction of minimal gradient absolute value sum as the main grain direction of this code tree unit, described master Grain direction refers to the grain direction of this code tree unit major part pixel；

Step (13), the absolute value of SAG difference on orthogonal to that for main grain direction direction is defined as the Texture complication of this unit, And utilize the quantization step product with this cell size as initial threshold.

3. HEVC intra-frame prediction quick mode selection method based on texture analysis as claimed in claim 1, its feature exists Following steps are specifically included in: described step (5):

Step (51), one candidate modes array of definition；

Step (52), obtain the main grain direction of current prediction unit according to step (1), and by main grain direction and about 8 prediction direction as candidate prediction direction；

Step (53), the angle predictive mode that candidate prediction direction is mapped in HEVC coding standard, and the angle that will select Degree predictive mode and planar and DC pattern add candidate modes array to；

Step (54), to each predictive mode in candidate modes array, carry out roughly according to HEVC coding standard Model selection and R-D optimized mode selection.

4. HEVC intra-frame prediction quick mode selection method based on texture analysis as claimed in claim 1, its feature exists In: the more New Policy that in described step (62), threshold value is turned down is:

TH '=TH-a

In formula, TH is the threshold value before updating, and TH ' is the threshold value after updating, and a is constant.