CN108174204B - Decision tree-based inter-frame rapid mode selection method - Google Patents

Decision tree-based inter-frame rapid mode selection method

Info

Publication number
CN108174204B
CN108174204B (application CN201810184642.XA)
Authority
CN
China
Prior art keywords
mode
current
coding unit
current coding
distortion cost
Prior art date
Legal status
Active
Application number
CN201810184642.XA
Other languages
Chinese (zh)
Other versions
CN108174204A (en)
Inventor
张昊
雷诗哲
王塞博
牟凡
符婷
Current Assignee
Central South University
Original Assignee
Central South University
Priority date
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN201810184642.XA priority Critical patent/CN108174204B/en
Publication of CN108174204A publication Critical patent/CN108174204A/en
Application granted granted Critical
Publication of CN108174204B publication Critical patent/CN108174204B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/147 Data rate or code amount at the encoder output according to rate distortion criteria
    • H04N19/109 Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
    • H04N19/119 Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • H04N19/137 Motion inside a coding unit, e.g. average field, frame or block difference
    • H04N19/159 Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • H04N19/176 Adaptive coding characterised by the coding unit, the unit being an image region, the region being a block, e.g. a macroblock

Abstract

The invention discloses a decision tree-based inter-frame fast mode selection method. The method obtains information from CUs at specific positions with strong correlation to the current CU, first performs predictive coding with the best mode predicted by a decision tree, collects information from the current CU in real time during coding, and then uses the correlation of temporal and spatial information, combined with the correlation information of neighboring CUs, to fine-tune the number and order of the inter coding modes. The scheme predicts the inter mode in advance, adjusts the mode order in real time during inter mode prediction and skips unnecessary mode predictions, which greatly shortens inter prediction time and thus reduces coding time; the method is simple, easy to implement, and helps the industrial adoption of the new-generation video coding standard.

Description

Decision tree-based inter-frame rapid mode selection method
Technical Field
The invention relates to the field of video coding and decoding, in particular to a decision tree-based inter-frame fast mode selection method.
Background
In video coding, inter-frame prediction is one of the core techniques. Inter prediction exploits the temporal correlation of video images: the image to be coded is predicted from the information of already coded images, and the prediction residual, rather than the original pixel values, then undergoes transformation, quantization, entropy coding and other operations. Inter coding largely removes the temporal correlation between video images, greatly reduces the amount of data to be coded, and markedly improves coding efficiency.
In 2013, the VCEG (Video Coding Experts Group) of ITU-T and the MPEG (Moving Picture Experts Group) of ISO/IEC jointly released the HEVC (High Efficiency Video Coding) video compression standard. Since 2016, VCEG and MPEG have been researching a new generation of video encoders and established an expert group, JVET (Joint Video Exploration Team), with the aim of further increasing the compression rate beyond HEVC. The reference software of the new-generation encoder, JEM, is developed on the framework of HM and follows the basic framework of HEVC, but introduces many new techniques and tools inside each module. These tools improve the compression rate and video quality, but they also increase coding complexity, which degrades practical application performance.
New inter prediction modes are added in the new-generation coding software JEM; its inter prediction proceeds as follows (a sketch of this mode loop is given after the list):
Step one: the Affine Merge mode is performed. The rate-distortion cost of the Affine Merge mode is calculated, and the Affine Merge mode is set as the optimal mode.
Step two: it is determined whether the optimal mode is the 2Nx2N Merge mode. The rate-distortion cost of the 2Nx2N Merge mode is calculated; if it is smaller than that of the current optimal mode, the 2Nx2N Merge mode is selected as the optimal mode.
Step three: it is determined whether the optimal mode is the FRUC Merge mode. The FRUC Merge mode includes two sub-modes, Bilateral and Template. The rate-distortion cost of the FRUC Merge mode is calculated; if it is smaller than that of the current optimal mode, the FRUC Merge mode is selected as the optimal mode, and a flag bit indicates which sub-mode it belongs to.
Step four: it is determined whether the optimal mode is the 2Nx2N mode. The rate-distortion cost of the 2Nx2N mode is calculated; if it is smaller than that of the current optimal mode, the 2Nx2N mode is selected as the optimal mode.
Step five: the intra prediction mode is determined.
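The following minimal sketch illustrates this rate-distortion mode loop. The function rd_cost(cu, mode) is a hypothetical stand-in for the encoder's rate-distortion evaluation of one mode, not an actual JEM API; the mode names are taken from the steps above.

```python
# Sketch of the JEM inter mode decision loop described in steps one to five.
# rd_cost(cu, mode) is a hypothetical stand-in for the encoder's
# rate-distortion evaluation of a single coding mode.

INTER_MODES = ["AffineMerge", "2Nx2N_Merge", "FRUC_Bilateral",
               "FRUC_Template", "Inter_2Nx2N", "Intra"]

def select_best_mode(cu, rd_cost):
    # Step one: Affine Merge is evaluated first and taken as the initial optimal mode.
    best_mode = INTER_MODES[0]
    best_rd = rd_cost(cu, best_mode)

    # Steps two to five: each following mode replaces the optimal mode
    # only if its rate-distortion cost is smaller.
    for mode in INTER_MODES[1:]:
        cost = rd_cost(cu, mode)
        if cost < best_rd:
            best_mode, best_rd = mode, cost
    return best_mode, best_rd
```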
Inter mode prediction occupies more than half of the total encoding time, so improving it is necessary. If the order of the inter modes can be adjusted using related information so that the mode most likely to be optimal is predicted first, the traversal of all modes is avoided and modes with a low probability of being selected are skipped, then the encoding time and the encoding complexity can be greatly reduced.
Disclosure of Invention
Aiming at the drawbacks of the JEM (Joint Exploration Model) encoder, namely its large number of inter modes, excessively long coding time and low coding efficiency, the invention provides a decision tree-based inter-frame fast mode selection method.
A decision tree-based inter-frame fast mode selection method comprises the following steps:
step 1: constructing a current Coding Unit (CU) optimal prediction mode classifier;
randomly selecting four video test sequences from international standard test sequences, and acquiring a left adjacent block CU of a current coding unit CU from 100 frames of coding information of the selected four test sequencesLeftUpper neighboring block CUAboveAnd co-located block CUColThe related information of (2);
with CULeft、CUAbove、CUColBest mode of (1), CUColMV, CUColThe average pixel of the current coding unit CU is used as input data, the optimal prediction mode of the current coding unit CU is used as output data, 10-fold cross validation is selected, the J48 decision tree in weka is trained, and the optimal prediction mode classifier of the current coding unit CU based on the decision tree is obtained;
step 2: acquiring an optimal prediction mode estimated value mode (P) of a current coding unit CU in a coding process;
extracting CU of current coding unit in coding processLeft、CUAbove、CUColBest mode of (1), CUColMV, CUColInputting the residual error, the coding QP value and the average pixel of the current coding unit CU into the current coding unit CU best mode classifier constructed in the step 1, and acquiring the best prediction mode pre-estimated value mode (P) of the current coding unit CU in the coding process;
and step 3: obtaining a left neighbor block CULeftUpper neighboring block CUAboveAnd co-located block CUColBest prediction mode ofjAnd distortion D in the corresponding best prediction modejAnd rate distortion cost RDjIf mode (P) and modejIf not, entering step 4, otherwise, sequentially predicting all inter-frame modes and entering step 9;
and 4, step 4: counting the number i of the same optimal prediction modes, wherein j represents Left, Above and Col;
i = 3 if mode_Left, mode_Above and mode_Col are all the same; i = 2 if exactly two of them are the same; i = 0 if all three are different;
and 5: acquiring an optimal prediction mode candidate value of the current coding unit CU according to the value of i;
step 6: predicting an affinity Merge mode of the current coding unit CU, and marking the affinity Merge mode as the current best mode of the current coding unit CU*And simultaneously marking the rate distortion cost in the affinity Merge mode as the current optimal rate distortion cost RD*
And 7: predicting mode (P) of current coding unit CU, and updating current optimal prediction mode*And the current best rate-distortion cost RD*
And 8: based on the optimal prediction mode prediction value mode (P) of the current coding unit CU, sequentially performing mode prediction on the optimal prediction mode candidate value of the current coding unit CU, and updating the current optimal prediction mode after each mode prediction*And the current best rate-distortion cost RD*
Under the values of various i, sequentially selecting the optimal prediction mode candidate value of the current coding unit CU to perform mode prediction based on the optimal prediction mode prediction value mode (P) of the current coding unit CU, thereby finishing the prediction of the inter-frame mode of the current coding unit CU;
and step 9: the prediction of the inter mode is ended.
Further, the best prediction mode candidate values of the current coding unit CU are obtained as follows:
1) if i = 0, arrange the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col in order of increasing distortion D_j, and mark the best prediction modes corresponding to the arranged blocks, in sequence, as the first best prediction mode candidate mode(X), the second best prediction mode candidate mode(Y) and the third best prediction mode candidate mode(Z);
2) if i = 2, mark the mode shared by two of CU_Left, CU_Above and CU_Col as the first best prediction mode candidate mode(X), and the differing mode as the second best prediction mode candidate mode(Y);
3) if i = 3, mark the common mode of CU_Left, CU_Above and CU_Col as the first best prediction mode candidate mode(X);
where X, Y and Z are selected from Merge, Bilateral, Template and Inter 2Nx2N.
Further, the specific process of step 8 is as follows:
A) if i = 0, judge whether mode(P) is the same as mode(X); if so, go to step 8.2, otherwise go to step 8.1;
Step 8.1: predict mode(X) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; judge whether the distortion D_j and the rate-distortion cost RD_j of the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col are all larger than the distortion D and the best rate-distortion cost RD* of the current CU under the current best prediction mode mode*; if so, go to step 9, otherwise go to step 8.2;
Step 8.2: judge whether mode(P) is the same as mode(Y); if so, go to step 8.4, otherwise go to step 8.3;
Step 8.3: predict mode(Y) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; judge whether the distortion D_j and the rate-distortion cost RD_j of at least two of CU_Left, CU_Above and CU_Col are both larger than the distortion D and the rate-distortion cost RD* of the current CU under the current best prediction mode mode*; if so, go to step 9, otherwise go to step 8.4;
Step 8.4: judge whether mode(P) is the same as mode(Z); if so, go to step 9; if not, predict mode(Z) for the current CU, update the current best prediction mode mode* and the current best rate-distortion cost RD*, and go to step 9;
B) if i = 2, judge whether mode(P) is the same as mode(X); if so, go to step 8.6, otherwise go to step 8.5;
Step 8.5: predict mode(X) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; judge whether the current best prediction mode mode* of the current CU is mode(X) and whether the rate-distortion cost RD_j of at least two of CU_Left, CU_Above and CU_Col is larger than the rate-distortion cost RD* of the current CU under its current best mode mode*; if so, go to step 9, otherwise go to step 8.6;
Step 8.6: judge whether mode(P) is the same as mode(Y); if so, go to step 8.8, otherwise go to step 8.7;
Step 8.7: predict mode(Y) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; go to step 9;
Step 8.8: judge whether the current best mode mode* of the current CU is mode(P); if so, go to step 9; if not, predict the remaining unexecuted modes, update the current best prediction mode mode* and the current best rate-distortion cost RD* according to the rate-distortion cost, and go to step 9;
C) if i = 3, judge whether mode(P) is the same as mode(X); if so, go to step 9, otherwise go to step 8.9;
Step 8.9: predict mode(X) for the current CU, update the current best prediction mode mode* and the current best rate-distortion cost RD*, predict the remaining unexecuted modes, and go to step 9;
updating the current best prediction mode mode* and the current best rate-distortion cost RD* means that, after mode prediction is performed on the current coding unit, if the obtained rate-distortion cost is smaller than RD*, the executed mode is taken as the current best prediction mode mode* and the obtained rate-distortion cost is taken as the current best rate-distortion cost RD*;
Step 9: the prediction of the inter mode is ended.
Advantageous effects
The invention provides a decision tree-based inter-frame fast mode selection method. The method obtains information from CUs at specific positions with strong correlation to the current CU, first performs predictive coding with the best mode predicted by a decision tree, collects information from the current CU in real time during coding, and then uses the correlation of temporal and spatial information, combined with the correlation information of neighboring CUs, to fine-tune the number and order of the inter coding modes. The scheme predicts the inter mode in advance, adjusts the mode order in real time during inter mode prediction and skips unnecessary mode predictions, which greatly shortens inter prediction time and thus reduces coding time; the method is simple, easy to implement, and helps the industrial adoption of the new-generation video coding standard.
Drawings
FIG. 1 is a schematic diagram illustrating the position relationship between a CU and neighboring blocks, wherein (a) is a reference frame and (b) is a current frame;
FIG. 2 is an overall flow chart of the present invention.
Detailed Description
For ease of understanding, the technical solution of the invention is described in detail below, by way of example, based on JEM, the reference software of the new-generation video coding, with reference to Figs. 1 and 2.
To reduce coding time and improve efficiency, the invention adopts the following technical scheme: obtain information from CUs at specific positions with strong correlation to the current CU, first perform predictive coding with the best mode predicted by the decision tree, collect information from the current CU in real time during coding, and use the correlation of temporal and spatial information, combined with the correlation information of neighboring CUs, to fine-tune the number and order of the inter coding modes.
A decision tree-based inter-frame fast mode selection method comprises the following steps:
step 1: constructing a current Coding Unit (CU) optimal prediction mode classifier;
randomly selecting four video test sequences from international standard test sequences, and acquiring a left adjacent block CU of a current coding unit CU from 100 frames of coding information of the selected four test sequencesLeftUpper neighboring block CUAboveAnd co-located block CUColThe related information of (2);
with CULeft、CUAbove、CUColBest mode of (1), CUColMV, CUColThe average pixel of the current coding unit CU is used as input data, the optimal prediction mode of the current coding unit CU is used as output data, 10-fold cross validation is selected, the J48 decision tree in weka is trained, and the optimal prediction mode classifier of the current coding unit CU based on the decision tree is obtained;
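As an illustration only, the sketch below shows how such a classifier could be trained offline, using scikit-learn's DecisionTreeClassifier (a CART tree) as a stand-in for the weka J48 (C4.5) tree named in the text; the column names and the training CSV file are assumptions, with the neighbor modes assumed to be stored as integer codes.

```python
# Offline training sketch for the per-CU best-mode classifier.
# scikit-learn's CART tree is used as a stand-in for weka's J48;
# feature column names and the CSV file are illustrative assumptions.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

FEATURES = ["mode_left", "mode_above", "mode_col",   # best modes of CU_Left/Above/Col (integer codes)
            "mv_col_x", "mv_col_y",                   # MV of the co-located CU
            "residual_col",                           # residual of the co-located CU
            "qp",                                     # coding QP value
            "avg_pixel"]                              # average pixel of the current CU
TARGET = "best_mode"                                  # best prediction mode of the current CU

def train_classifier(csv_path="cu_training_data.csv"):
    data = pd.read_csv(csv_path)          # coding information gathered from 100 frames per sequence
    X, y = data[FEATURES], data[TARGET]

    clf = DecisionTreeClassifier()
    # 10-fold cross validation, as selected in step 1.
    scores = cross_val_score(clf, X, y, cv=10)
    print("10-fold CV accuracy: %.3f" % scores.mean())

    clf.fit(X, y)                         # final classifier used during encoding
    return clf
```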
step 2: acquiring an optimal prediction mode estimated value mode (P) of a current coding unit CU in a coding process;
extracting CU of current coding unit in coding processLeft、CUAbove、CUColBest mode of (1), CUColMV, CUColInputting the residual error, the coding QP value and the average pixel of the current coding unit CU into the current coding unit CU best mode classifier constructed in the step 1, and acquiring the best prediction mode pre-estimated value mode (P) of the current coding unit CU in the coding process;
and step 3: obtaining a left neighbor block CULeftUpper neighboring block CUAboveAnd co-located block CUColBest prediction mode ofjAnd distortion D in the corresponding best prediction modejAnd rate distortion cost RDjIf mode (P) and modejIf not, entering step 4, otherwise, sequentially predicting all inter-frame modes and entering step 9;
and 4, step 4: counting the number i of the same optimal prediction modes, wherein j represents Left, Above and Col;
i = 3 if mode_Left, mode_Above and mode_Col are all the same; i = 2 if exactly two of them are the same; i = 0 if all three are different.
For example: if mode_Left, mode_Above and mode_Col are all Bilateral, then i = 3.
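A small sketch of this count, under the assumption that the three neighboring best modes are available as simple values (illustrative helper names):

```python
# Count how many of the three neighboring best modes coincide:
# 3 -> all equal, 2 -> exactly two equal, 0 -> all different.
def count_identical_modes(mode_left, mode_above, mode_col):
    modes = [mode_left, mode_above, mode_col]
    max_count = max(modes.count(m) for m in modes)
    return max_count if max_count > 1 else 0

# Example from the text: all three neighbors use Bilateral -> i = 3.
assert count_identical_modes("Bilateral", "Bilateral", "Bilateral") == 3
assert count_identical_modes("Merge", "Merge", "Template") == 2
assert count_identical_modes("Merge", "Bilateral", "Template") == 0
```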
And 5: acquiring an optimal prediction mode candidate value of the current coding unit CU according to the value of i;
1) if i = 0, arrange the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col in order of increasing distortion D_j, and mark the best prediction modes corresponding to the arranged blocks, in sequence, as the first best prediction mode candidate mode(X), the second best prediction mode candidate mode(Y) and the third best prediction mode candidate mode(Z);
2) if i = 2, mark the mode shared by two of CU_Left, CU_Above and CU_Col as the first best prediction mode candidate mode(X), and the differing mode as the second best prediction mode candidate mode(Y);
3) if i = 3, mark the common mode of CU_Left, CU_Above and CU_Col as the first best prediction mode candidate mode(X);
where X, Y and Z are selected from Merge, Bilateral, Template and Inter 2Nx2N.
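The candidate ordering of step 5 can be sketched as follows; the Neighbor structure and the example values are illustrative assumptions, not encoder data types:

```python
# Derive the ordered candidate list mode(X), mode(Y), mode(Z)
# from the three neighboring blocks according to the value of i.
from collections import Counter, namedtuple

Neighbor = namedtuple("Neighbor", ["name", "best_mode", "distortion"])

def candidate_modes(i, left, above, col):
    neighbors = [left, above, col]
    if i == 0:
        # All modes differ: order the neighbors by increasing distortion D_j.
        ordered = sorted(neighbors, key=lambda n: n.distortion)
        return [n.best_mode for n in ordered]            # [mode(X), mode(Y), mode(Z)]
    if i == 2:
        # Exactly two modes agree: the shared mode first, the differing mode second.
        counts = Counter(n.best_mode for n in neighbors)
        shared = counts.most_common(1)[0][0]
        different = next(m for m in counts if counts[m] == 1)
        return [shared, different]                       # [mode(X), mode(Y)]
    if i == 3:
        # All three agree: a single candidate.
        return [left.best_mode]                          # [mode(X)]
    return []                                            # i takes no other values here

# Example: two neighbors chose Merge, one chose Template.
left = Neighbor("Left", "Merge", 120.0)
above = Neighbor("Above", "Merge", 98.5)
col = Neighbor("Col", "Template", 150.2)
print(candidate_modes(2, left, above, col))              # ['Merge', 'Template']
```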
Step 6: predicting an affinity Merge mode of the current coding unit CU, and marking the affinity Merge mode as the current best mode of the current coding unit CU*And simultaneously marking the rate distortion cost in the affinity Merge mode as the current optimal rate distortion cost RD*
And 7: predicting mode (P) of current coding unit CU, and updating current optimal prediction mode*And the current best rate-distortion cost RD*
And 8: based on the optimal prediction mode prediction value mode (P) of the current coding unit CU, sequentially performing mode prediction on the optimal prediction mode candidate value of the current coding unit CU, and updating the current optimal prediction mode after each mode prediction*And the current best rate-distortion cost RD*
And under various values of i, sequentially selecting the optimal prediction mode candidate value of the current coding unit CU to perform mode prediction based on the optimal prediction mode prediction value mode (P) of the current coding unit CU, thereby finishing the prediction of the inter-frame mode of the current coding unit CU.
A) if i = 0, judge whether mode(P) is the same as mode(X); if so, go to step 8.2, otherwise go to step 8.1;
Step 8.1: predict mode(X) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; judge whether the distortion D_j and the rate-distortion cost RD_j of the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col are all larger than the distortion D and the best rate-distortion cost RD* of the current CU under the current best prediction mode mode*; if so, go to step 9, otherwise go to step 8.2;
Step 8.2: judge whether mode(P) is the same as mode(Y); if so, go to step 8.4, otherwise go to step 8.3;
Step 8.3: predict mode(Y) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; judge whether the distortion D_j and the rate-distortion cost RD_j of at least two of CU_Left, CU_Above and CU_Col are both larger than the distortion D and the rate-distortion cost RD* of the current CU under the current best prediction mode mode*; if so, go to step 9, otherwise go to step 8.4;
Step 8.4: judge whether mode(P) is the same as mode(Z); if so, go to step 9; if not, predict mode(Z) for the current CU, update the current best prediction mode mode* and the current best rate-distortion cost RD*, and go to step 9;
B) if i = 2, judge whether mode(P) is the same as mode(X); if so, go to step 8.6, otherwise go to step 8.5;
Step 8.5: predict mode(X) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; judge whether the current best prediction mode mode* of the current CU is mode(X) and whether the rate-distortion cost RD_j of at least two of CU_Left, CU_Above and CU_Col is larger than the rate-distortion cost RD* of the current CU under its current best mode mode*; if so, go to step 9, otherwise go to step 8.6;
Step 8.6: judge whether mode(P) is the same as mode(Y); if so, go to step 8.8, otherwise go to step 8.7;
Step 8.7: predict mode(Y) for the current CU, and update the current best prediction mode mode* and the current best rate-distortion cost RD*; go to step 9;
Step 8.8: judge whether the current best mode mode* of the current CU is mode(P); if so, go to step 9; if not, predict the remaining unexecuted modes, update the current best prediction mode mode* and the current best rate-distortion cost RD* according to the rate-distortion cost, and go to step 9;
C) if i = 3, judge whether mode(P) is the same as mode(X); if so, go to step 9, otherwise go to step 8.9;
Step 8.9: predict mode(X) for the current CU, update the current best prediction mode mode* and the current best rate-distortion cost RD*, predict the remaining unexecuted modes, and go to step 9;
updating the current best prediction mode mode* and the current best rate-distortion cost RD* means that, after mode prediction is performed on the current coding unit, if the obtained rate-distortion cost is smaller than RD*, the executed mode is taken as the current best prediction mode mode* and the obtained rate-distortion cost is taken as the current best rate-distortion cost RD*.
Step 9: the prediction of the inter mode is ended (the overall decision flow is sketched below).
To verify the performance of the algorithm, this example uses two metrics for evaluation: BDBR (Bjøntegaard Delta Bit Rate) and ΔT. BDBR evaluates the influence of the algorithm on video quality; the larger the BDBR, the greater the impact on video quality, i.e. the poorer the performance of the algorithm. ΔT reflects the improvement in encoder efficiency brought by the method of the invention and is calculated as follows:
ΔT = (T_org - T_new) / T_org × 100%
where T_org denotes the time used for encoding with the original encoder without any fast algorithm, T_new denotes the time required for encoding with the method of the invention, and ΔT denotes the percentage efficiency improvement of the encoder obtained with the method of the invention.
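As a quick check of the formula, with made-up timings (the numbers are illustrative, not measured results; 1000 s versus 841.2 s gives the 15.88% average reported below):

```python
# Illustrative computation of the time-saving metric Delta T.
def delta_t(t_org, t_new):
    # Percentage of encoding time saved relative to the original encoder.
    return (t_org - t_new) / t_org * 100.0

# Hypothetical timings in seconds: original encoder vs. the proposed method.
print("Delta T = %.2f%%" % delta_t(1000.0, 841.2))   # prints Delta T = 15.88%
```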
Through experimental simulation, the experimental results of the invention are shown in table 1.
TABLE 1 results of the experiment
[Table 1: per-sequence BDBR and ΔT experimental results; image not reproduced]
According to the experimental simulation results shown in Table 1, the encoding time is reduced by 15.88% while the BDBR increases by only 0.83. The results show that the method greatly improves coding efficiency while preserving the subjective quality of the video, achieving the aim of the invention.

Claims (3)

1. A decision tree-based inter-frame fast mode selection method is characterized by comprising the following steps:
step 1: constructing a best prediction mode classifier for the current coding unit (CU);
randomly selecting four video test sequences from the international standard test sequences, and obtaining, from 100 frames of coding information of the four selected test sequences, the related information of the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col of the current CU;
with the best modes of CU_Left, CU_Above and CU_Col, the MV of CU_Col, the residual of CU_Col, the coding QP value and the average pixel of the current CU as input data, and the best prediction mode of the current CU as output data, selecting 10-fold cross validation and training the J48 decision tree in weka to obtain the decision-tree-based best prediction mode classifier for the current CU;
step 2: obtaining the best prediction mode estimate mode(P) of the current CU during encoding;
during encoding, extracting the best modes of CU_Left, CU_Above and CU_Col, the MV of CU_Col, the residual of CU_Col, the coding QP value and the average pixel of the current CU, inputting them into the current-CU best mode classifier constructed in step 1, and obtaining the best prediction mode estimate mode(P) of the current CU;
step 3: obtaining the best prediction mode mode_j of the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col, together with the distortion D_j and the rate-distortion cost RD_j under the corresponding best prediction mode; if mode(P) and the mode_j are not the same, going to step 4; otherwise, sequentially predicting all inter modes and going to step 9;
step 4: counting the number i of identical best prediction modes, where j denotes Left, Above and Col:
i = 3 if mode_Left, mode_Above and mode_Col are all the same; i = 2 if exactly two of them are the same; i = 0 if all three are different;
step 5: obtaining the best prediction mode candidate values of the current CU according to the value of i;
step 6: predicting the Affine Merge mode of the current CU, marking the Affine Merge mode as the current best mode mode* of the current CU, and marking the rate-distortion cost of the Affine Merge mode as the current best rate-distortion cost RD*;
step 7: predicting mode(P) for the current CU, and updating the current best prediction mode mode* and the current best rate-distortion cost RD*;
step 8: based on the best prediction mode estimate mode(P) of the current CU, sequentially performing mode prediction with the best prediction mode candidate values of the current CU, and updating the current best prediction mode mode* and the current best rate-distortion cost RD* after each mode prediction;
under each value of i, sequentially selecting the best prediction mode candidate values of the current CU for mode prediction based on the best prediction mode estimate mode(P) of the current CU, thereby completing the inter mode prediction of the current CU;
step 9: ending the prediction of the inter mode.
2. The method according to claim 1, wherein the best prediction mode candidate values of the current coding unit CU are obtained as follows:
1) if i = 0, arranging the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col in order of increasing distortion D_j, and marking the best prediction modes corresponding to the arranged blocks, in sequence, as the first best prediction mode candidate mode(X), the second best prediction mode candidate mode(Y) and the third best prediction mode candidate mode(Z);
2) if i = 2, marking the mode shared by two of CU_Left, CU_Above and CU_Col as the first best prediction mode candidate mode(X), and the differing mode as the second best prediction mode candidate mode(Y);
3) if i = 3, marking the common mode of CU_Left, CU_Above and CU_Col as the first best prediction mode candidate mode(X);
wherein X, Y and Z are selected from Merge, Bilateral, Template and Inter 2Nx2N.
3. The method according to claim 2, wherein the specific process of step 8 is as follows:
A) if i = 0, judging whether mode(P) is the same as mode(X); if so, going to step 8.2, otherwise going to step 8.1;
step 8.1: predicting mode(X) for the current CU, and updating the current best prediction mode mode* and the current best rate-distortion cost RD*; judging whether the distortion D_j and the rate-distortion cost RD_j of the left neighboring block CU_Left, the upper neighboring block CU_Above and the co-located block CU_Col are all larger than the distortion D and the best rate-distortion cost RD* of the current CU under the current best prediction mode mode*; if so, going to step 9, otherwise going to step 8.2;
step 8.2: judging whether mode(P) is the same as mode(Y); if so, going to step 8.4, otherwise going to step 8.3;
step 8.3: predicting mode(Y) for the current CU, and updating the current best prediction mode mode* and the current best rate-distortion cost RD*; judging whether the distortion D_j and the rate-distortion cost RD_j of at least two of CU_Left, CU_Above and CU_Col are both larger than the distortion D and the rate-distortion cost RD* of the current CU under the current best prediction mode mode*; if so, going to step 9, otherwise going to step 8.4;
step 8.4: judging whether mode(P) is the same as mode(Z); if so, going to step 9; if not, predicting mode(Z) for the current CU, updating the current best prediction mode mode* and the current best rate-distortion cost RD*, and going to step 9;
B) if i = 2, judging whether mode(P) is the same as mode(X); if so, going to step 8.6, otherwise going to step 8.5;
step 8.5: predicting mode(X) for the current CU, and updating the current best prediction mode mode* and the current best rate-distortion cost RD*; judging whether the current best prediction mode mode* of the current CU is mode(X) and whether the rate-distortion cost RD_j of at least two of CU_Left, CU_Above and CU_Col is larger than the rate-distortion cost RD* of the current CU under its current best mode mode*; if so, going to step 9, otherwise going to step 8.6;
step 8.6: judging whether mode(P) is the same as mode(Y); if so, going to step 8.8, otherwise going to step 8.7;
step 8.7: predicting mode(Y) for the current CU, and updating the current best prediction mode mode* and the current best rate-distortion cost RD*; going to step 9;
step 8.8: judging whether the current best mode mode* of the current CU is mode(P); if so, going to step 9; if not, predicting the remaining unexecuted modes, updating the current best prediction mode mode* and the current best rate-distortion cost RD* according to the rate-distortion cost, and going to step 9;
C) if i = 3, judging whether mode(P) is the same as mode(X); if so, going to step 9, otherwise going to step 8.9;
step 8.9: predicting mode(X) for the current CU, updating the current best prediction mode mode* and the current best rate-distortion cost RD*, predicting the remaining unexecuted modes, and going to step 9;
wherein updating the current best prediction mode mode* and the current best rate-distortion cost RD* means that, after mode prediction is performed on the current coding unit, if the obtained rate-distortion cost is smaller than RD*, the executed mode is taken as the current best prediction mode mode* and the obtained rate-distortion cost is taken as the current best rate-distortion cost RD*;
step 9: ending the prediction of the inter mode.
CN201810184642.XA 2018-03-06 2018-03-06 Decision tree-based inter-frame rapid mode selection method Active CN108174204B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810184642.XA CN108174204B (en) 2018-03-06 2018-03-06 Decision tree-based inter-frame rapid mode selection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810184642.XA CN108174204B (en) 2018-03-06 2018-03-06 Decision tree-based inter-frame rapid mode selection method

Publications (2)

Publication Number Publication Date
CN108174204A CN108174204A (en) 2018-06-15
CN108174204B true CN108174204B (en) 2020-01-03

Family

ID=62511846

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810184642.XA Active CN108174204B (en) 2018-03-06 2018-03-06 Decision tree-based inter-frame rapid mode selection method

Country Status (1)

Country Link
CN (1) CN108174204B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109040764B (en) * 2018-09-03 2021-09-28 重庆邮电大学 HEVC screen content intra-frame rapid coding algorithm based on decision tree
WO2020069651A1 (en) * 2018-10-05 2020-04-09 Huawei Technologies Co., Ltd. A candidate mv construction method for affine merge mode
CN109361920B (en) * 2018-10-31 2021-09-28 南京大学 Multi-scene-oriented inter-frame rapid prediction algorithm for adaptive decision tree selection
EP4307678A2 (en) 2018-11-06 2024-01-17 Beijing Bytedance Network Technology Co., Ltd. Side information signaling for inter prediction with geometric partitioning
CN109729351B (en) * 2018-12-18 2020-09-29 中南大学 HEVC (high efficiency video coding) rapid mode selection method under low complexity configuration
CN113170166B (en) * 2018-12-30 2023-06-09 北京字节跳动网络技术有限公司 Use of inter prediction with geometric partitioning in video processing
CN112383776B (en) * 2020-12-08 2022-05-03 重庆邮电大学 Method and device for quickly selecting SHVC (scalable video coding) video coding mode

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9473778B2 (en) * 2013-09-27 2016-10-18 Apple Inc. Skip thresholding in pipelined video encoders
CN103888770B (en) * 2014-03-17 2018-03-09 北京邮电大学 A kind of video code conversion system efficiently and adaptively based on data mining
CN105430396B (en) * 2015-12-15 2019-04-19 浙江大学 The method for video coding of decision coded block size by the way of classification
CN105791826B (en) * 2016-05-11 2019-03-08 南京大学 A kind of HEVC interframe fast schema selection method based on data mining
CN107147911B (en) * 2017-07-05 2019-07-26 中南大学 Quick interframe coding mode selection method and device based on local luminance compensation LIC
CN107222742B (en) * 2017-07-05 2019-07-26 中南大学 Video coding Merge mode quick selecting method and device based on time-space domain correlation
CN107396102B (en) * 2017-08-30 2019-10-08 中南大学 A kind of inter-frame mode fast selecting method and device based on Merge technological movement vector

Also Published As

Publication number Publication date
CN108174204A (en) 2018-06-15

Similar Documents

Publication Publication Date Title
CN108174204B (en) Decision tree-based inter-frame rapid mode selection method
CN108347616B (en) Depth prediction method and device based on optional time domain motion vector prediction
CN103975587B (en) Method and device for encoding/decoding of compensation offsets for a set of reconstructed samples of an image
CN104581163B (en) Method and apparatus for encoding and decoding image by using large transformation unit
CN104883578B (en) The method decoded to intra prediction mode
TWI514855B (en) Method and apparatus for performing localized multihypothesis prediction during video coding of a coding unit
CN101394565B (en) Intra-frame prediction method
CN100461867C Image predicting encoding method in frame
KR101670532B1 (en) Method for decoding a stream representative of a sequence of pictures, method for coding a sequence of pictures and coded data structure
KR20070005848A (en) Method and apparatus for intra prediction mode decision
JP6789256B2 (en) Image coding device, moving image coding method, moving image decoding device and moving image decoding method
TWI722842B (en) Image prediction decoding method
JP2005130509A (en) Intra-prediction method and apparatus
CN110365982B (en) Multi-transformation selection accelerating method for intra-frame coding in multipurpose coding
CN108174208B (en) Efficient video coding method based on feature classification
CN113794877B (en) Decoding method, encoding method, device, equipment and machine readable storage medium
CN106993187A (en) A kind of coding method of variable frame rate and device
JP4994877B2 (en) Method and system for selecting a macroblock coding mode in a video frame sequence
CN110225339A (en) A kind of HEVC video sequence coding/decoding accelerated method
JPWO2006100946A1 (en) Image signal re-encoding device and image signal re-encoding method
CN105704494A (en) Depth correlation-based inter-frame fast encoding method for coding screen content
TWI403175B (en) Intra prediction mode optimization methods and image compression methods and devices
CN113873257B (en) Method, device and equipment for constructing motion information candidate list
JP2005184241A (en) System for determining moving picture interframe mode
CN109168000B (en) HEVC intra-frame prediction rapid algorithm based on RC prediction

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant