CN110365982B

CN110365982B - Multi-transformation selection accelerating method for intra-frame coding in multipurpose coding

Info

Publication number: CN110365982B
Application number: CN201910698624.8A
Authority: CN
Inventors: 张昊; 符婷
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2019-07-31
Filing date: 2019-07-31
Publication date: 2022-01-04
Anticipated expiration: 2039-07-31
Also published as: CN110365982A

Abstract

The embodiment of the invention provides a multi-transform selection accelerating method for intra-frame coding in multipurpose coding, which is divided into two stages: the first stage is 'quick skip', the second stage is 'quick termination', wherein the first stage aims at prejudging whether MTS is needed or not, if the prejudged CU does not need to carry out MTS, the rate distortion optimization of MTS is quickly skipped, otherwise, the second stage is entered, and in the second stage, the rate distortion calculation of candidate transformation of some intra-frame prediction modes is considered to be terminated in advance when the MTS is executed, so that the execution time of the transformation process is reduced as much as possible, the problem of long encoding time of a VVC standard encoder VTM in the prior art is solved, the encoding time is shortened, and the applicability is expanded.

Description

Multi-transformation selection accelerating method for intra-frame coding in multipurpose coding

Technical Field

The invention belongs to the technical field of video coding, and particularly relates to a multi-transform selection acceleration method for intra-frame coding in multipurpose coding.

Background

Although High Efficiency Video Coding (HEVC) is currently the main global Video Coding technology, it still cannot meet the Video application and Video requirements that are being developed. Therefore, in 10 months of 2015, a Joint Video encoding Team (Joint Video encoding Team, abbreviated as jviet) was established by a Moving Picture Experts Group (MPEG) and a Video Coding Experts Group (VCEG), and a new generation Video encoding technology is researched to exceed the compression efficiency of HEVC and adapt to the rapid development of the current Video service.

At the 10 th jfet meeting in 4 months in 2018, the jfet defines the first draft of a new generation of Video Coding technology, and names the new Video standard as multifunctional Video Coding (VVC), and issues a corresponding encoder test model VTM 1.0. Different from a quadtree Coding structure of HEVC, VTM1.0 adopts quadtree and a nested multi-type tree structure (QTMT), so that the partitioning of Coding units (Coding units, abbreviated as CUs) is more flexible. From 7 months to 10 months in 2018, jvt successively released second and third drafts of VVC, and the corresponding test models were also updated to VTM2.0 and VTM 3.0.

With the development of video coding technology, many new coding tools, such as a partition structure for separating luminance and chrominance and a Multiple Transform Selection (MTS), are adopted by the VVC and integrated into a VTM (video Test model), which greatly improve the coding efficiency of the VTM, but also greatly increase the time complexity of coding, especially intra-frame coding. The intra coding time complexity of VTM is about several times that of HEVC. The high complexity not only hinders the further development and research process of the VVC, but also is not beneficial to popularization and application in the future.

Disclosure of Invention

The embodiment of the invention solves the problem of long encoding time of a VVC standard encoder VTM in the prior art by providing a multi-transform selection acceleration method for intra-frame encoding in multipurpose encoding, shortens the encoding time and expands the applicability.

A multi-transform selection acceleration method for intra-frame coding in multipurpose coding comprises the following steps:

s1: after the current CU is subjected to DCT-II transformation, judging whether the current CU is the last sub-CU, and if so, performing step S2;

s2: determining whether the CU and the sub-CUs thereof described in step S1 satisfy the formula:

if not, go to step S3; if so, go to step S5, where RD_pAnd RD_i(i is more than or equal to 1 and less than or equal to N) respectively represents the rate distortion cost of the parent CU and the ith child CU, N is the number of the child CUs, and alpha is an adjustable factor and is used for correcting errors;

s3: counting the frequency of each MTS candidate in the adjacent block of the CU, and sorting the MTS candidate list L from high to low according to the frequency of each MTS candidate in the adjacent block of the CU;

s4: the jth candidate transformation of the MTS candidate list L is set as cand_jJ has an initial value of 0, sets the prediction mode list to S, has a length of N, N is an integer greater than 0, mode_iFor the ith prediction mode in S, with an initial value of 0 for i, will be isSkip (mode)_i) As whether to skip mode_iRate-distortion computation flag, isSkip (mode)_i) Are all 0, RD_minAs rate-distortion cost minimum, RD, for each mode at each candidate transformation_minIs a sufficiently large number, e.g. 1.79769e +308, RD (mode)_i,cand_j) Is a mode_iIn cand_jRate-distortion cost obtained by coding calculation under transform, if j>3, performing step S5;

s5: the MTS procedure is ended.

In step S2, RD is_pAnd RD_i(1 ≦ i ≦ N) representing the rate-distortion cost of the parent CU and its ith child CU, respectively, N being the number of child CUs, and RD here_pRefers to the minimum rate-distortion cost of the parent CU at the current stage. For example, when calculating the rate-distortion cost of a child CU of a parent CU under vertical binary partitioning, RD_pMinimum rate-distortion cost for parent CU no partition and horizontal binary partition. According to different division modes, the value of N can be 2 (binary division), 3 (three-fork division) and 4 (four-fork division). RD_dctThe rate-distortion cost of the DCT-II transform for the current CU. Alpha is an adjustable factor for correcting errors and takes a real number between 0 and 1, and particularly, when the value of alpha is 0.76, better balance between coding quality and coding time saving can be achieved. Rate-distortion cost RD of DCT-II transform of the last sub-CU when performing MTS of that sub-CU_dctRate-distortion cost RD of other sub-CUs_i(1≤i<N) and rate distortion cost RD of parent CU_pAre all known.

In step S3, the CU neighbor block positions are reordered with respect to the MTS candidate list L as shown in fig. 1, and the process proceeds to step S4.

Wherein steps S2 to S4 are the key steps of the present invention, and the decision of skipping MTS in advance and terminating MTS in advance in the steps plays a key role in reducing the encoding time.

The initialization operation in step S4 is performed only once, and it is not necessary to perform initialization when going to S4, as long as j >3 is determined.

Preferably, in step S1, if no, step S3 is performed.

Preferably, in step S2, the value of the adjustable factor α is a real number between 0 and 1.

Preferably, in step S4, if no, pair cand_jThe following steps are carried out: and judging whether j is more than or equal to 1 and the adjacent blocks of the CU all select DCT-II transformation, if j is more than or equal to 1 and the adjacent blocks of the CU all select DCT-II transformation, performing step S5.

Further preferably, if j ≧ 1 is not satisfied or all adjacent blocks of the CU are not selected for DCT-II transform, it is determined whether i is greater than or equal to N, if i ≧ N, i is set to 0 and j is added by 1, proceeding to step S4, where proceeding to S4, and the initialization step in S4 is not necessary as long as it is determined whether j is greater than 3 and the subsequent operations.

Still more preferably, if i < N, the following steps are performed for pattern i in S:

judging whether j is greater than or equal to 1 and is (mode)_i) If so, then i is incremented by 1 and the process goes to step S4.

Still further preferably, if not, for mode_iBy cand_jCoding is carried out to obtain the corresponding rate distortion cost RD (mode)_i,cand_j) And judging RD (mode)_i,cand_j) Whether or not it is less than RD_minIf yes, then RD will be used_minIs updated to RD (mode)_i,cand_j) And i is incremented by 1 and then goes to step S4.

Even more preferably, if RD (mode)_i,cand_j)≥RD_minThen isSkip (mode)_i) Is updated to 1, i is incremented by 1, and the process proceeds to step S4.

In the intra-frame coding module of VTM, flexible QTMT partition structure and newly introduced coding tools such as MTS and the like are the main reasons for the increase of coding complexity.

Like HEVC and h.264/AVC, VVC also employs a block-based hybrid coding framework. Based on the framework, the VVC coding technology has corresponding module optimization for various redundancies. Fig. 2 shows a typical VVC video encoding flow. As shown in fig. 2, an input picture is first divided into square tiles of equal size, which are called Coding Tree Units (CTUs), which are root nodes of a quad-Tree and nested multi-type Tree division structure. The CTU further divides the partition structure according to the quadtree and the nested multi-type tree into Coding Units (CU), where a CU is a basic Unit for an encoder to perform subsequent processing on a video signal. A CU first performs intra prediction or inter prediction based on its intra-frame and inter-frame properties. If the prediction is intra-frame prediction, the pixel prediction value of the current CU is obtained by mainly utilizing the reference pixels adjacent in space through linear interpolation, and if the prediction is inter-frame prediction, the pixel prediction value of the current CU is obtained by utilizing the reference pixels adjacent in time (the previous frame or the previous frames) through displacement compensation. And then subtracting the original value from the predicted value of the CU to obtain a residual error, and transforming the residual error to further reduce the spatial correlation of the errors of adjacent pixel points and obtain a corresponding residual error coefficient. After the residual coefficient is quantized, entropy coding is carried out by combining information such as a coding mode and related coding parameters, so that a compressed code stream is obtained. On the other hand, the quantized residual coefficient is subjected to inverse quantization and inverse transformation, then the residual and the predicted value are added to obtain a reconstructed pixel, and the reconstructed image is filtered to generate a reference frame and stored in a decoded image buffer to be used as a reference pixel in the following CU intra-frame prediction or inter-frame prediction.

The VVC video coding standard adopts a more flexible structure in the image division technology, namely a structure based on a quadtree and a nested multi-type tree, and supports binary division and trigeminal division in different directions besides the quadtree division. The VVC has five partition modes, namely a quad partition, a vertical binary partition (SPLIT _ BT _ VER), a horizontal binary partition (SPLIT _ BT _ HOR), a vertical trigeminal partition (SPLIT _ TT _ VER), and a horizontal trigeminal partition (SPLIT _ TT _ HOR), wherein the quad partition is called a quad tree structure, and the vertical binary partition, the horizontal binary partition, the vertical trigeminal partition, and the horizontal trigeminal partition are called a multi-type tree structure. Each division mode in the multi-type tree structure is shown in fig. 3-6, fig. 3 is a schematic diagram of vertical binary division, fig. 4 is a schematic diagram of horizontal binary division, fig. 5 is a schematic diagram of vertical trigeminal division, and fig. 6 is a schematic diagram of horizontal trigeminal division. In fig. 3 and 4, the binary partition of CU is a symmetric partition, and in fig. 5 and 6, the trifurcate partition is a partition with a side length of 1:2: 1. Under the multi-type tree division structure, the VVC can better adapt to video sequences with larger and larger resolution, richer and richer contents and more complex textures.

When an image enters the VVC encoder, it is first divided into equal CTUs, which is the largest coding unit and is typically 128 × 128 pixels in size. The QTMT division technology carries out four-fork division by taking the CTU as a root node, and leaf nodes of the four-fork division can be further divided in a recursion mode by a multi-type tree structure until the leaf nodes are divided to a set minimum value. Each node in this structure is a CU. Except special conditions, each division of the nodes of the multi-type tree can traverse four division modes in the multi-type tree structure, and the nodes of the quadtree can traverse the four division modes in the multi-type tree structure and can also try the quadtree division modes. In the process, the optimal partitioning mode and the optimal partitioning depth need to be selected through rate distortion optimization. Taking an intra-frame CU with the size of 64 multiplied by 64 as an example, sequentially performing intra-frame prediction, four-fork division, horizontal two-fork division, vertical two-fork division, horizontal three-fork division and vertical three-fork division on the CU, respectively calculating intra-frame prediction and rate distortion costs under the five divisions, and selecting a mode with the minimum rate distortion cost as a final coding mode.

In HEVC, other coding blocks employ only DCT-II transform, except that luma blocks of 4 × 4 size within a frame may use DST-VII transform and DCT-II transform. In VVC, MTS is adopted to improve the efficiency of residual transformation besides DCT-II transformation. MTS introduces two new transformation kernels: DST-VII and DCT-VIII, both of which can be used for the vertical or horizontal direction of the residual block. That is, the MTS has four candidate transformations, and table 1 lists the candidate transformation numbers of the MTS and their corresponding transformation kernels.

TABLE 1 MTS candidate transformation sequence numbers and corresponding transformation kernels

If the MTS is enabled in the Sequence Parameter Set (SPS), the encoder firstly transforms the residual block by DCT-II to calculate the rate-distortion cost RD-DCTII, and then sequentially checks four candidate transformations of the MTS to respectively obtain the rate-distortion costs of 4 transformations. The transform with the least rate-distortion cost is the optimal transform. Fig. 7 shows the residual transform process for VVC intra coding. As shown in FIG. 7, the VVC intra-CU transform process goes through two stages and uses the CU level MTS flag (MTS flag) to indicate whether the current CU enables MTS. And in the first stage, the MTS flag is 0, the CU is transformed by DCT-II, and in the second stage, the MTS flag is 1, and the CU is sequentially transformed by MTS candidate transformation. The encoder transforms the transform with the lowest chosen rate-distortion cost to the optimal transform.

The introduction of MTS improves the coding efficiency of VVC, but it is time consuming to search for the best transform from these candidate transforms for each intra prediction mode of the CU. Therefore, VVC also employs two fast algorithms to reduce the complexity of the residual transformation process, which are represented by the dashed lines in fig. 7. As shown, the core idea of the first fast algorithm is: in the first stage (MTS flag is 0), when the rate distortion cost of a CU in a certain intra-frame prediction mode exceeds a certain threshold value, the rate distortion optimization of the second stage is skipped for the mode, and the next mode is directly coded. Since the rate-distortion cost of one prediction mode transformed with DCT-II may be very large when the rate-distortion cost of the mode transformed with MTS is much larger than that of the other mode, and thus the mode may not be the optimal mode, the MTS rate-distortion cost calculation of the mode may be skipped. The core idea of the second fast algorithm is: in the second phase (MTS flag is 1), rate distortion optimization of the remaining MTS candidate transforms is skipped when the Coded Block Flag (CBF) of the CU under a certain MTS candidate transform is 0. This is because when the transform coefficients of a CU are all 0, the encoder sets its CBF to 0, which means that the CU can most effectively reduce the spatial correlation of the adjacent pixel errors under the transform, and therefore, it is not necessary to perform the rate-distortion cost calculation of other transforms. Furthermore, VVC provides that MTS can only be enabled if a CU satisfies the following conditions: (1) the current coding block is a brightness block; (2) the length and width of the current coding block are all less than or equal to 32; (3) the CBF of the coding block transformed with DCT-II is equal to 1. In the prior art, although the VVC effectively reduces the coding complexity caused by the MTS by using the above simplified scheme, the MTS still has a large promotion space.

The invention aims to provide a multi-transform selection accelerating method based on a multipurpose coding intra-frame coding mode, aiming at the defect of long coding time of a VVC standard coder VTM and the defects of the prior art, so as to shorten the coding time and improve the practical applicability of the method.

And when the sum of the rate-distortion costs of the sub-CUs in a certain division mode is larger than or equal to the current minimum rate-distortion cost of the parent CU, the parent CU does not select the division mode. The MTS rate-distortion cost calculation of the last sub-CU in this partition mode is therefore unnecessary. If this can be predicted, the MTS calculation of the last sub-CU can be skipped in advance, saving coding time.

In addition, the rate distortion calculation order of the MTS candidate transformation in the VTM is fixed (from 0 to 3) according to the candidate index, and the rate distortion calculation order of the MTS candidate transformation is dynamically allocated by fully utilizing the spatial correlation of the CU on transformation selection. As shown in fig. 1, blocks a through E are neighboring blocks of the current CU, whose MTS candidate transform indices are all available at the time of encoding the current CU due to the "zigzag" coding order of the VTM (if blocks a through E select the MTS transform). The order of the MTS candidates of the current CU is decided according to the frequency with which each MTS candidate is selected in the neighboring blocks. Due to the spatial correlation of the CU on the transform selection, the higher the frequency with which an MTS candidate is selected by neighboring blocks, the higher the probability that the current CU selects the candidate transform. Therefore, when deciding the rate-distortion calculation order of the MTS candidates of the current CU, placing candidates with high selection probability in the front is beneficial to improve the accuracy of the MTS fast termination algorithm.

According to the spatial correlation of the CU on the transformation selection, in addition to dynamically allocating the rate-distortion calculation order of the MTS candidate transformation, whether the CU selects the MTS transformation can be predicted. For example, when the neighboring blocks a-E in fig. 1 all select the DCT-II transform, illustrating that the probability of the current CU selecting the MTS is low, the sequential MTS candidates may be considered skipped at this time.

On the basis of sorting of the MTS candidate transformation, the rate distortion cost of each prediction mode under each MTS candidate transformation can be considered to be combined to terminate the prediction mode with lower priority or the MTS candidate transformation in advance.

The embodiment of the invention has the beneficial effects

1. The invention provides a multi-transform selection accelerating method for intra-frame coding in multipurpose coding, which is divided into two stages: the method comprises the steps that a first stage is 'quick skip', a second stage is 'quick termination', wherein the first stage aims at prejudging whether MTS is needed or not, if the prejudging CU does not need to carry out the MTS, rate-distortion optimization of the MTS is quickly skipped, otherwise, the second stage is entered, and in the second stage, rate-distortion calculation of candidate transformation of some intra-frame prediction modes is considered to be terminated in advance when the MTS is executed, so that the execution time of the transformation process is reduced as much as possible;

2. the method provided by the invention has simple steps and small calculated amount, and can be conveniently put into practical application.

Drawings

Fig. 1 is a schematic diagram of CU neighbor block locations.

Fig. 2 is a typical VCC video encoding flow diagram.

Fig. 3 is a schematic diagram of vertical binary partitioning.

Fig. 4 is a schematic diagram of horizontal binary partitioning.

Fig. 5 is a schematic diagram of vertical trifurcation division.

Fig. 6 is a schematic diagram of horizontal trifurcation division.

Fig. 7 is a flowchart of MTS rate-distortion calculation.

Detailed Description

The embodiment of the invention solves the problem of long encoding time of a VVC standard encoder VTM in the prior art by providing a multi-transform selection acceleration method based on a multi-purpose encoding intra-frame encoding mode, shortens the encoding time and expands the applicability.

In order to better understand the above technical solutions, the above technical solutions will be described in detail with reference to specific embodiments.

Example 1

s4: converting the jth candidate of L in step S3 into cand_jJ has an initial value of 0, sets the prediction mode list to S, has a length of N, N is an integer greater than 0, mode_iFor the ith prediction mode in S, with an initial value of 0 for i, will be isSkip (mode)_i) As whether to skip mode_iRate-distortion computation flag, isSkip (mode)_i) Are all 0, RD_minAs rate-distortion cost minimum, RD, for each mode at each candidate transformation_minIs a sufficiently large number, e.g. 1.79769e +308, RD (mode)_i,cand_j) Is a mode_iIn cand_jRate-distortion cost obtained by coding calculation under transform, if j>3, performing step S5;

s5: the MTS procedure is ended.

In step S2, it should be noted that RD here_pRefers to the minimum rate-distortion cost of the parent CU at the current stage. RD_pAnd RD_i(1. ltoreq. i. ltoreq.N) represents the parent CU and the parent CUAnd the rate distortion cost of the ith sub-CU is N, wherein N is the number of the sub-CUs. Note that, here, RD_pRefers to the minimum rate-distortion cost of the parent CU at the current stage. For example, when calculating the rate-distortion cost of a child CU of a parent CU under vertical binary partitioning, RD_pMinimum rate-distortion cost for parent CU no partition and horizontal binary partition. According to different division modes, the value of N can be 2 (binary division), 3 (three-fork division) and 4 (four-fork division). RD_dctThe rate-distortion cost of the DCT-II transform for the current CU. Alpha is an adjustable factor for correcting errors and takes a real number between 0 and 1, and particularly, when the value of alpha is 0.76, better balance between coding quality and coding time saving can be achieved. Rate-distortion cost RD of DCT-II transform of the last sub-CU when performing MTS of that sub-CU_dctRate-distortion cost RD of other sub-CUs_i(1≤i<N) and rate distortion cost RD of parent CU_pAre all known.

In step S3, the CU neighbor block position is reordered on the MTS candidate list L as shown in fig. 1, and the process proceeds to step four.

In step S1, if no, step S3 is performed.

In step S2, the adjustable factor α takes a real number between 0 and 1.

In step S4, if not, the pair cand_jThe following steps are carried out: and judging whether j is more than or equal to 1 and the adjacent blocks of the CU all select DCT-II transformation, if j is more than or equal to 1 and the adjacent blocks of the CU all select DCT-II transformation, performing step S5.

If j is not equal to or larger than 1 or the adjacent blocks of the CU are not all DCT-II transformed, judging whether i is larger than or equal to N, if i is equal to or larger than N, setting i to 0 and going to step S4. If i < N, performing the following steps on the mode i in S:

judging whether j is greater than or equal to 1 and is isSkip (mo)de_i) If so, then i is incremented by 1 and the process goes to step S4. If not, then for mode_iBy cand_jCoding is carried out to obtain the corresponding rate distortion cost RD (mode)_i,cand_j) And judging RD (mode)_i,cand_j) Whether or not it is less than RD_minIf yes, then RD will be used_minIs updated to RD (mode)_i,cand_j) And i is incremented by 1 and then goes to step S4.

If RD (mode)_i,cand_j)≥RD_minThen isSkip (mode)_i) Is updated to 1, i is incremented by 1, and the process proceeds to step S4.

Example 2

This example was implemented based on the VVC official reference platform VTM3.0 and experiments were performed under the general test conditions of JEVT. On the setting of an encoder, the default setting in All-Intra configuration is used, the video sequence used for testing is 22 sequences of six types recommended by JFET, the B-E type test video sequence is the same as the standard test video sequence of HEVC, the input bit depth is 8 bits, the resolution is different from 1920 x 1080 to 416 x 240, the A1-A2 type test video sequence is VVC new ultra high definition test video sequence, and the input bit depth is 10 bits. The encoding performance is mainly evaluated by two indexes of BDBR (Bjotegaard DeltaBit rate) and TS, and the encoding performance of the algorithm is evaluated by taking the original VTM3.0 encoder as a reference (as shown in Table 2, wherein Y, U, V is Y, U, V BDBR respectively). The BDBR represents the code rate difference of two coding methods under the same objective quality, and is obtained by respectively coding and calculating the code rate and the PSNR of the same video under four QP values (22, 27, 32 and 37). The BDBR can comprehensively reflect the code rate and the quality of the video, and the larger the value of the BDBR is, the higher the code rate of the proposed fast algorithm is compared with the original encoder, and the worse the compression performance of the algorithm is. The TS is used to measure the reduction degree of the fast algorithm to the encoding time based on the original encoder, and is calculated as follows:

wherein, T_pTo be fastTotal encoding time, T, after embedding of fast algorithm into VTM3.0_oIs the total encoding time of the original encoder VTM 3.0.

TABLE 2 results of the experiment

Overall, the BDBR of the UV component is reduced by 0.04% and 0.05% on average, and is approximately none although its RD performance is improved, i.e., the MTS fast algorithm proposed herein has no effect on the chrominance component. This is because VVC adopts a structure of luminance-chrominance separation division, and MTS is used only for luminance. The RD performance of different sequences of chrominance components varies greatly from individual to individual and appears random. For example, some sequences (e.g., CatRobot1, markeplace) have a reduced UV component BDBR, some sequences (e.g., ParkRunning3, ritualdarce) have an increased RD performance, and even for the same sequence (e.g., FoodMarket4, dayightroad 2, etc.), the RD performance of the UV component is opposite. This is because, in the structure of luma and chroma division, when the chroma components are coded after luma coding, the entropy coded context state is changed by the MTS fast algorithm, and the influence of the change on the bit rate is random, so the RD performance of chroma is not consistent from an example. As can be seen from table 2, the algorithm exhibits a high degree of consistency in the impact of BDBR and coding time for different types of sequences, and embodies the robustness and universality of the algorithm.

The specific embodiments described herein are merely illustrative of the spirit of the invention. Various modifications, additions and substitutions for the described embodiments may be made by those skilled in the art without departing from the spirit or scope of the invention as defined in the accompanying claims.

Claims

1. A method for accelerating multiple transform selection for intra coding in multipurpose coding, comprising the steps of:

if not, go to step S3; if so, go to step S5, where RD_pAnd RD_i(i is more than or equal to 1 and less than or equal to N) respectively represents the rate distortion cost of the parent CU and the ith child CU, N is the number of the child CUs, alpha is an adjustable factor and is used for correcting errors, RD_dctRate-distortion cost of the DCT-II transform for the current CU;

s4: the jth candidate transformation of the MTS candidate list L is set as cand_jJ has an initial value of 0, sets the prediction mode list to S, has a length of N, N is an integer greater than 0, mode_iFor the ith prediction mode in S, with an initial value of 0 for i, will be isSkip (mode)_i) As whether to skip mode_iRate-distortion computation flag, isSkip (mode)_i) Are all 0, RD_minAs the minimum value of the rate-distortion cost, RD (mode), for each mode under each candidate transformation_i,cand_j) Is a mode_iIn cand_jRate-distortion cost obtained by coding calculation under transform, if j>3, performing step S5;

s5: the MTS procedure is ended.

2. The method of claim 1, wherein in step S1, if not, step S3 is performed.

3. The method of claim 1, wherein in step S2, the adjustable factor α is a real number between 0 and 1.

4. The method as claimed in claim 1, wherein in step S4, if not, the method is applied to cand_jThe following steps are carried out: and judging whether j is more than or equal to 1 and the adjacent blocks of the CU all select DCT-II transformation, if j is more than or equal to 1 and the adjacent blocks of the CU all select DCT-II transformation, performing step S5.

5. The method as claimed in claim 4, wherein if j ≧ 1 is unsatisfied or all adjacent CU blocks have not selected DCT-II transform, it is determined whether i is greater than or equal to N, if i ≧ N, i is set to 0 and j is added to 1, and the method proceeds to step S4, where i indicates the ith prediction mode in S described in step S4.

6. The method of claim 5, wherein if i < N, then performing the following steps for mode i in S:

7. The method of claim 6, wherein the multi-transform selection acceleration for intra coding in multipurpose coding,

if not, then for mode_iBy cand_jCoding is carried out to obtain the corresponding rate distortion cost RD (mode)_i,cand_j) And judging RD (mode)_i,cand_j) Whether or not it is less than RD_minIf yes, then RD will be used_minIs updated to RD (mode)_i,cand_j) And i is incremented by 1 and then goes to step S4.

8. The method of claim 7, wherein the mode is RD (mode)_i,cand_j)≥RD_minThen isSkip (mode)_i) Is updated to 1, and after 1 is added to each of i and j, the process proceeds to step S4.