CN102547263B

CN102547263B - Variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform

Info

Publication number: CN102547263B
Application number: CN201010607560.5A
Authority: CN
Inventors: 孙皓; 陈小敬
Original assignee: Leadcore Technology Co Ltd
Current assignee: Leadcore Technology Co Ltd
Priority date: 2010-12-27
Filing date: 2010-12-27
Publication date: 2016-09-14
Anticipated expiration: 2030-12-27
Also published as: CN102547263A

Abstract

The present invention discloses a variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform (IDCT), designed primarily to reduce the redundant operations in existing accelerated IDCT algorithms and to further speed up the IDCT module. The invention first decomposes the dequantized DCT coefficient matrix F into a sum of multiple term matrices; it then applies the inverse discrete cosine transform to each term matrix obtained from the decomposition; finally, it adds the transformed term matrices to obtain the restored matrix f. The algorithm reduces the redundant computation of existing algorithms and effectively speeds up the inverse discrete cosine transform.

Description

Variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform

Technical field

The present invention relates to the technical field of digital image and video compression, and in particular to the implementation of a variable-complexity table-lookup fast algorithm for the IDCT (inverse discrete cosine transform) module in compression standards such as JPEG, MPEG and H.26x.

Background art

The forward and inverse discrete cosine transforms are important components of most current multimedia compression standards (including JPEG, MPEG and H.26x). The two-dimensional discrete cosine transform (2D-DCT) redistributes the energy of the spatial-domain signal so that the low-frequency components, which carry most of the energy, are concentrated together; different lossy compression strategies are then applied according to the energy distribution, so that the energy loss in the frequency domain is controlled. The two-dimensional inverse discrete cosine transform (2D-IDCT) restores the compressed frequency-domain signal to the spatial domain and, together with the other modules, reconstructs the image signal; the detailed flow is shown in Fig. 1. Both the DCT and the IDCT are computationally intensive, and where real-time requirements are high, completing the IDCT quickly becomes the key to decoding.

The 8-point one-dimensional inverse discrete cosine transform is given by:

f(x) = \frac{1}{2} \sum_{u=0}^{7} C(u) \cdot F(u) \cos \frac{(2x+1) \pi u}{16}    (1)

where x, u = 0, 1, ..., 6, 7.

The two-dimensional inverse discrete cosine transform of an 8 × 8 matrix is given by:

f(x, y) = \frac{1}{4} \sum_{u=0}^{7} \sum_{v=0}^{7} C(u) \cdot C(v) \cdot F(u, v) \cos \frac{(2x+1) \pi u}{16} \cdot \cos \frac{(2y+1) \pi v}{16}    (2)

where x, y, u, v = 0, 1, ..., 6, 7.

F(u, v) is the value at position (u, v) of the DCT coefficient matrix, and f(x, y) is the value at position (x, y) of the restored matrix. In the usual convention for this form of the transform, C(u) = 1/\sqrt{2} for u = 0 and C(u) = 1 otherwise.
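As a reference point, the two-dimensional formula above can be evaluated literally. The sketch below is not part of the patent; it assumes the standard normalisation C(0) = 1/√2 and C(k) = 1 otherwise, and the names `c` and `idct2_direct` are illustrative only.

```python
import math

N = 8

def c(k):
    # Assumed normalisation factor C(k): 1/sqrt(2) for k == 0, otherwise 1.
    return 1 / math.sqrt(2) if k == 0 else 1.0

def idct2_direct(F):
    """Evaluate the 2D inverse DCT sum term by term for an 8x8 matrix F."""
    f = [[0.0] * N for _ in range(N)]
    for x in range(N):
        for y in range(N):
            s = 0.0
            for u in range(N):
                for v in range(N):
                    s += (c(u) * c(v) * F[u][v]
                          * math.cos((2 * x + 1) * math.pi * u / 16)
                          * math.cos((2 * y + 1) * math.pi * v / 16))
            f[x][y] = s / 4
    return f

# A block holding only a DC coefficient decodes to a flat block of F(0,0)/8.
F = [[0.0] * N for _ in range(N)]
F[0][0] = 80.0
flat = idct2_direct(F)
```

This direct form costs 64 multiply-accumulate steps per output sample, which is why practical decoders use faster schemes.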

Existing two-dimensional IDCT techniques are based mainly on extending the one-dimensional inverse discrete cosine transform (1D-IDCT): the 1D-IDCT is applied to the DCT coefficient matrix first in the horizontal direction and then in the vertical direction, which completes the 2D-IDCT. Exploiting the periodicity and half-wave odd symmetry of the trigonometric functions, the 1D-IDCT is accelerated mainly with butterfly operations (see Fig. 2). An 8-point 1D-IDCT requires 24 multiplications and 24 additions, so the 2D-IDCT of an 8 × 8 matrix, which consists of 16 such transforms, requires 384 multiplications and 384 additions. To speed up the IDCT further, all-zero 8 × 8 blocks are flagged outside the IDCT module and the IDCT is skipped for them, which saves a great deal of computation when all-zero blocks are numerous.
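The row-column scheme described here can be sketched as follows. This is an illustration only: the 1D transform is written as a plain sum rather than a butterfly, the normalisation C(0) = 1/√2 is assumed, and the names `idct1` and `idct2_rowcol` are hypothetical.

```python
import math

N = 8

def c(k):
    # Assumed normalisation factor: 1/sqrt(2) for k == 0, otherwise 1.
    return 1 / math.sqrt(2) if k == 0 else 1.0

def idct1(vec):
    """Plain 8-point 1D inverse DCT (no butterfly optimisation)."""
    return [0.5 * sum(c(u) * vec[u] * math.cos((2 * x + 1) * math.pi * u / 16)
                      for u in range(N))
            for x in range(N)]

def idct2_rowcol(F):
    """2D inverse DCT as 8 row passes followed by 8 column passes."""
    rows = [idct1(row) for row in F]                      # transform along v
    cols = [idct1([rows[u][y] for u in range(N)])         # transform along u
            for y in range(N)]
    return [[cols[y][x] for y in range(N)] for x in range(N)]

# 16 one-dimensional transforms at 24 multiplications each, per the text.
MULTS_2D = 16 * 24

demo = idct2_rowcol([[80.0 if (u, v) == (0, 0) else 0.0 for v in range(N)]
                     for u in range(N)])
```

The cost of this scheme is the same for every block, regardless of how many coefficients are zero.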

The prior art has the following shortcomings and problems:

1. The prior art involves a large number of multiplications and additions.

2. The computational burden is fixed. The prior art distinguishes only coarsely between all-zero and non-all-zero blocks: even if the DCT coefficient matrix contains only a single nonzero coefficient, a complete IDCT must still be performed on the whole matrix.

3. It is difficult to couple with other modules and hard to parallelize. The IDCT cannot start until all coefficients of the DCT coefficient matrix have been decoded, and a buffer for the DCT coefficient matrix must also be allocated and initialized.

Summary of the invention

To overcome the above shortcomings and problems, the present invention provides a variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform, which reduces the number of additions and multiplications in the IDCT and thereby speeds up the IDCT module.

To achieve the above object, the variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform of the present invention comprises the following steps:

(1) transforming the dequantized DCT coefficient matrix F into a sum of multiple term matrices;

(2) applying the inverse discrete cosine transform to each term matrix;

(3) adding the term matrices after the inverse discrete cosine transform to obtain the restored matrix f.

Further, the algorithm also comprises: establishing lookup tables with the term matrices after the inverse discrete cosine transform as constants;

Step (3) then further comprises: finding the term matrices after the inverse discrete cosine transform by searching the lookup tables.

Further, to increase the calculation speed of the invention, step (2) also comprises: scaling up and rounding the elements of each term matrix after the inverse discrete cosine transform, according to the required precision;

Step (3) then further comprises: scaling down each element of the resulting restored matrix f by the same factor used for scaling up, yielding the precision-controlled restored matrix f′.

In particular, step (1) is specifically:

The dequantized DCT coefficient matrix F is transformed into:

F = F(0,0) × I₀₀ + ... + F(u,v) × I_uv + ... + F(n,n) × I_nn

where u, v = 0, 1, ..., n, with n set by the number of rows and columns of the matrix F (the indices run up to 7 for an 8 × 8 matrix); F(u,v) denotes the value at position (u,v) of the DCT coefficient matrix F; and I_uv denotes the matrix whose (u,v) element is 1 and whose other elements are all 0;

Step (2) is specifically:

For each nonzero F(u,v) of the decomposed DCT coefficient matrix F, the inverse discrete cosine transform is applied to the corresponding I_uv to obtain the matrix i_uv;

Step (3) is specifically:

Each nonzero F(u,v) is multiplied by its i_uv, and the products are summed to give the restored matrix f of the DCT coefficient matrix F after the inverse discrete cosine transform.

Further, the algorithm also comprises: establishing lookup tables with the matrices i_uv as constants;

Step (3) then further comprises: finding the matrix i_uv by searching the lookup tables.

The lookup tables are established as follows:

Determine the symmetric structure type of each matrix i_uv;

Find the basic unit from which the whole matrix can be recovered according to the symmetric structure type of the matrix i_uv;

List the basic units of matrices i_uv with similar symmetric structure types in the same lookup table.

Here, matrices i_uv with similar symmetric structure types include: all matrices i_uv of the same symmetric structure type, and any two matrices i_uv and i_vu that are transposes of each other.

Further, based on the lookup tables, step (3) is calculated as follows:

According to the matrix i_uv corresponding to each nonzero F(u,v), search the corresponding lookup table to obtain the basic unit of that matrix i_uv, and multiply F(u,v) by the elements of the basic unit;

Add the products for matrices i_uv with similar symmetric structure types into the corresponding buffer;

Extend and superpose the matrices held in the buffers obtained in the preceding step to obtain the restored matrix f.

Further, step (2) also comprises: scaling up and rounding the elements of the matrices i_uv according to the required precision.

The beneficial effects of the present invention are as follows:

1. The invention effectively reduces the number of additions and multiplications. Unlike existing algorithms of fixed complexity, the computational complexity of this algorithm is variable and depends on the number and positions of the nonzero coefficients. It precisely eliminates the redundant additions and multiplications that zero elements of the DCT coefficient matrix contribute to the IDCT, and flexibly distinguishes and handles the various distributions of elements in the DCT coefficient matrix. When nonzero coefficients are few, the IDCT calculation speed can be greatly improved (in the extreme case where only the DC coefficient is present, just 1 multiplication and 1 addition are needed);

2. The invention uses table lookup to speed up the IDCT implementation: the commonly used basic unit matrices are tabulated, which reduces repeated calculation;

3. Coupling with the coefficient decoding and dequantization modules. Each time a nonzero coefficient F(u,v) is decoded, its dequantization and IDCT can be completed immediately within the same flow, which increases parallelism, eliminates the interface buffers and parameter passing between modules, and speeds up the overall process.

Brief description of the drawings

Fig. 1 is a flow chart of how the existing algorithm computes the restored matrix;

Fig. 2 is a schematic diagram of the 8-point IDCT butterfly operation;

Fig. 3 shows the C1-type symmetric structure matrix;

Fig. 4 shows the C2-type symmetric structure matrix;

Fig. 5 shows the C3-type symmetric structure matrix;

Fig. 6 shows the C4-type symmetric structure matrix;

Fig. 7 shows the C5-type symmetric structure matrix;

Fig. 8 shows the C6-type symmetric structure matrix;

Fig. 9 shows the C7-type symmetric structure matrix;

Fig. 10 shows the C8-type symmetric structure matrix;

Fig. 11 shows the C9-type symmetric structure matrix;

Fig. 12 shows the C10-type symmetric structure matrix;

Fig. 13 shows the C11-type symmetric structure matrix;

Fig. 14 shows the C12-type symmetric structure matrix;

Fig. 15 shows the C13-type symmetric structure matrix;

Fig. 16 shows the C14-type symmetric structure matrix;

Fig. 17 shows the C15-type symmetric structure matrix;

Fig. 18 shows the C16-type symmetric structure matrix;

Fig. 19 shows the extension and superposition of the IDCT results of all coefficients;

Fig. 20 is the calculation flow chart of the algorithm of the present invention.

Detailed description of the invention

The specific embodiments of the present invention are described in detail below with reference to the accompanying drawings.

The technical solution of the invention is organized around the characteristics of the dequantized DCT coefficient matrix F(u,v): after dequantization the matrix contains a large number of zero values, and the nonzero coefficients are mostly concentrated in its upper-left corner. By decomposing the DCT coefficient matrix F(u,v) and performing the IDCT only for its nonzero coefficients to obtain the restored matrix, the amount of computation can be greatly reduced. The algorithm mainly comprises the following steps:

(1) transforming the dequantized DCT coefficient matrix F into a sum of multiple term matrices;

(2) applying the inverse discrete cosine transform to each term matrix;

(3) adding the term matrices after the inverse discrete cosine transform to obtain the restored matrix f.

To further increase the calculation speed, the algorithm of the invention also comprises: establishing lookup tables with the basic unit matrices after the inverse discrete cosine transform as constants. During the inverse discrete cosine transform, a value that would otherwise be recomputed can be obtained by table lookup, which reduces the number of repeated calculations. That is, in step (3) the term matrices after the inverse discrete cosine transform are found by searching the lookup tables, and the matrices found are then added to obtain the restored matrix f.

In actual IDCT decoding, floating-point arithmetic is generally not used directly; to increase calculation speed, floating-point operations are usually converted to fixed-point operations. To realize fixed-point arithmetic in the algorithm of the invention, step (2) should also comprise: scaling up and rounding the elements of each term matrix after the inverse discrete cosine transform, according to the required precision;

Step (3) should then also comprise: scaling down each element of the resulting restored matrix f by the same factor used for scaling up, yielding the precision-controlled restored matrix f′.

In one specific embodiment, the variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform is described using an 8 × 8 matrix as an example. Its implementation steps are as follows:

For ease of description, the inverse discrete cosine transform formula, i.e. formula (2) of the background art, is abbreviated as:

f = IDCT(F)    (3)

where f and F are the restored matrix and the DCT coefficient matrix, respectively;

First, the DCT coefficient matrix F is transformed into a sum of multiple term matrices:

F = \sum_{u=0}^{7} \sum_{v=0}^{7} F(u, v) \times I_{uv}    (4)

where F(u,v) denotes the value at position (u,v) of the coefficient matrix, and I_uv denotes the matrix whose (u,v) element is 1 and whose other elements are all 0.

Then, substituting formula (4) into formula (3) gives the restored matrix f:

f = IDCT(F) = F(0,0) \times IDCT(I_{00}) + F(0,1) \times IDCT(I_{01}) + \cdots + F(7,7) \times IDCT(I_{77}) = \sum_{u=0}^{7} \sum_{v=0}^{7} F(u, v) \times IDCT(I_{uv}) = \sum_{u=0}^{7} \sum_{v=0}^{7} F(u, v) \times i_{uv}    (5)

where i_uv is the restored matrix of I_uv after the IDCT; i_uv is called a basic matrix unit.

Formula (5) means that the IDCT of the matrix F is converted into the superposition of 64 matrices i_uv, each weighted by the coefficient F(u,v). If F contains many zero elements, f can be computed without superposing all 64 matrices; for example, if there are only 10 nonzero coefficients, only 10 sub-matrices need to be superposed. This removes the effect of the zero elements of F on the computation of the restored matrix f, and computes exactly the contribution of each nonzero coefficient to the restored matrix f.
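The superposition in formula (5) can be illustrated with a small sketch that stores all 64 basic matrix units in an uncompressed table and accumulates only the nonzero coefficients. It is a simplified illustration, not the compressed scheme of the patent; the normalisation C(0) = 1/√2 is assumed, and `basis`, `TABLE` and `idct2_sparse` are hypothetical names.

```python
import math

N = 8

def c(k):
    # Assumed normalisation factor: 1/sqrt(2) for k == 0, otherwise 1.
    return 1 / math.sqrt(2) if k == 0 else 1.0

def basis(u, v):
    """i_uv = IDCT(I_uv): the restored block for a single unit coefficient."""
    return [[0.25 * c(u) * c(v)
             * math.cos((2 * x + 1) * math.pi * u / 16)
             * math.cos((2 * y + 1) * math.pi * v / 16)
             for y in range(N)] for x in range(N)]

# Precompute all 64 basic matrix units once (the uncompressed 64 x 64 table).
TABLE = {(u, v): basis(u, v) for u in range(N) for v in range(N)}

def idct2_sparse(coeffs):
    """coeffs: dict {(u, v): F(u, v)}; zero coefficients are skipped."""
    f = [[0.0] * N for _ in range(N)]
    for (u, v), value in coeffs.items():
        if value == 0:
            continue
        i_uv = TABLE[(u, v)]
        for x in range(N):
            for y in range(N):
                f[x][y] += value * i_uv[x][y]
    return f

block = idct2_sparse({(0, 0): 80.0, (0, 1): -12.0, (2, 0): 5.0})
```

The work done is proportional to the number of nonzero coefficients, which is the sense in which the complexity is variable.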

The invention converts the DCT coefficient matrix F into the form of formula (4), so every IDCT uses the same basic matrix units i_uv. The i_uv can therefore be treated as constants, stored in lookup tables and obtained by table lookup, which removes the burden of computing IDCT(I_uv) each time. A first estimate of the data space occupied by such a table is 64 × 64 = 4096 elements. In the most complex case, where all 64 positions hold nonzero coefficients, one 8 × 8 IDCT requires 64 × 64 = 4096 multiplications and 64 × 63 = 4032 additions. This is worse than the butterfly operation in both data space and computational complexity, so the data space must be compressed and the number of operations reduced. The invention exploits the symmetry of the matrices i_uv computed from i_uv = IDCT(I_uv) to greatly reduce both.

Taking the 8 × 8 matrix as an example, the 64 matrices i_uv obtained from IDCT(I_uv) can be divided into 16 kinds according to their symmetric structure type, as shown in Fig. 3 to Fig. 18. The black box in each figure marks the basic unit of the matrix, i.e. the whole matrix can be recovered from the basic unit according to its structure type, so tabulating these basic units is equivalent to listing all 64 i_uv. Moreover, i_uv and i_vu are transposes of each other: the elements of their basic units have the same values and differ only in position, so they can share one table, which compresses the table space further. The lookup tables are established as follows:

First, determine the symmetric structure type of each matrix i_uv;

Then, find the basic unit from which the whole matrix can be recovered according to the symmetric structure type of the matrix i_uv;

Finally, list the basic units of matrices i_uv with similar symmetric structure types in the same lookup table.
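The division into 16 types can be reproduced with a short sketch. The grouping rule used here is inferred from the members listed for types C1 to C16 and is not stated in the source: each index is assigned to one of four classes (0, 4, {2, 6}, odd), and a type is a pair of classes. The helper names are hypothetical, and the normalisation C(0) = 1/√2 is assumed.

```python
import math
from collections import defaultdict

N = 8

def c(k):
    # Assumed normalisation factor: 1/sqrt(2) for k == 0, otherwise 1.
    return 1 / math.sqrt(2) if k == 0 else 1.0

def basis(u, v):
    """i_uv = IDCT(I_uv) for an 8x8 block."""
    return [[0.25 * c(u) * c(v)
             * math.cos((2 * x + 1) * math.pi * u / 16)
             * math.cos((2 * y + 1) * math.pi * v / 16)
             for y in range(N)] for x in range(N)]

def freq_class(k):
    """Inferred symmetry class of a frequency index (an assumption)."""
    if k == 0:
        return "0"
    if k == 4:
        return "4"
    if k in (2, 6):
        return "2,6"
    return "odd"

# Group the 64 index pairs by the classes of u and v.
groups = defaultdict(list)
for u in range(N):
    for v in range(N):
        groups[(freq_class(u), freq_class(v))].append((u, v))

def is_transpose(a, b):
    """True when matrix b is the transpose of matrix a."""
    return all(abs(a[x][y] - b[y][x]) < 1e-12
               for x in range(N) for y in range(N))

type_sizes = sorted(len(members) for members in groups.values())
```

Under this rule the odd-odd group has 16 members, matching the C16 listing, and `is_transpose(basis(2, 5), basis(5, 2))` confirms the transpose relation that lets i_uv and i_vu share a table entry.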

The lookup tables are described further below with reference to the drawings.

C1-type structure, shown in Fig. 3: i₀₀ belongs to this type. In Lookup table 1 the basic unit is [a], with 1 element;

Lookup table 1

Symmetric structure type	Matrix	Basic unit elements
C1 type	i₀₀	[a]

C2-type structure, shown in Fig. 4: i₀₄ belongs to this type. In Lookup table 2 the basic unit is [a], with 1 element;

Lookup table 2

Symmetric structure type	Matrix	Basic unit elements
C2 type, C3 type	i₀₄, i₄₀	[a]

C3-type structure, shown in Fig. 5: i₄₀ belongs to this type. i₄₀ and i₀₄ are transposes of each other, so i₄₀ can share Lookup table 2 with i₀₄;

C4-type structure, shown in Fig. 6: i₄₄ belongs to this type. In Lookup table 3 the basic unit is [a], with 1 element;

Lookup table 3

Symmetric structure type	Matrix	Basic unit elements
C4 type	i₄₄	[a]

C5-type structure, shown in Fig. 7: i₀₂ and i₀₆ belong to this type. In Lookup table 4 the basic unit is [a, b]; the table contains two entries of 2 elements each, so it holds 2 × 2 = 4 elements in total;

Lookup table 4

Symmetric structure type	Matrix	Basic unit elements
C5 type, C6 type	i₀₂, i₂₀	[a, b]
C5 type, C6 type	i₀₆, i₆₀	[a, b]

C6-type structure, shown in Fig. 8: i₂₀ and i₆₀ belong to this type. i₂₀ and i₆₀ are the transposes of i₀₂ and i₀₆ respectively, so i₂₀, i₀₂, i₆₀ and i₀₆ can share Lookup table 4: i₂₀ shares the i₀₂ entry and i₆₀ shares the i₀₆ entry of Lookup table 4;

C7-type structure, shown in Fig. 9: i₄₂ and i₄₆ belong to this type. In Lookup table 5 the basic unit is [a, b]; the table contains two entries of 2 elements each, so it holds 2 × 2 = 4 elements in total;

Lookup table 5

C8-type structure, shown in Fig. 10: i₂₄ and i₆₄ belong to this type. i₂₄ and i₆₄ are the transposes of i₄₂ and i₄₆ respectively, so i₂₄, i₄₂, i₆₄ and i₄₆ can share Lookup table 5: i₂₄ shares the i₄₂ entry and i₆₄ shares the i₄₆ entry of Lookup table 5;

C9-type structure, shown in Fig. 11: i₂₂, i₆₆, i₂₆ and i₆₂ belong to this type. Each basic unit in Lookup table 6 has 4 elements. The table covers four matrices, but i₂₆ and i₆₂ are transposes of each other and can use the same entry, so the table holds 4 × 3 = 12 elements in total;

Lookup table 6

C10-type structure, shown in Fig. 12: i₀₁, i₀₃, i₀₅ and i₀₇ belong to this type. In Lookup table 7 the basic unit is [a, b, c, d]; the table contains four entries of 4 elements each, so it holds 4 × 4 = 16 elements in total;

Lookup table 7

C11-type structure, shown in Fig. 13: i₁₀, i₃₀, i₅₀ and i₇₀ belong to this type. i₁₀, i₃₀, i₅₀ and i₇₀ are the transposes of i₀₁, i₀₃, i₀₅ and i₀₇ respectively, so i₁₀, i₀₁, i₃₀, i₀₃, i₅₀, i₀₅, i₇₀ and i₀₇ can share Lookup table 7.

C12-type structure, shown in Fig. 14: i₄₁, i₄₃, i₄₅ and i₄₇ belong to this type. In Lookup table 8 the basic unit is [a, b, c, d]; the table contains four entries of 4 elements each, so it holds 4 × 4 = 16 elements in total;

Lookup table 8

C13-type structure, shown in Fig. 15: i₁₄, i₃₄, i₅₄ and i₇₄ belong to this type. i₁₄, i₃₄, i₅₄ and i₇₄ are the transposes of i₄₁, i₄₃, i₄₅ and i₄₇ respectively, so i₁₄, i₄₁, i₃₄, i₄₃, i₅₄, i₄₅, i₇₄ and i₄₇ can share Lookup table 8;

C14-type structure, shown in Fig. 16: i₂₁, i₂₃, i₂₅, i₂₇, i₆₁, i₆₃, i₆₅ and i₆₇ belong to this type. Each basic unit in Lookup table 9 has 8 elements; the table contains eight entries, so it holds 8 × 8 = 64 elements in total;

Lookup table 9

C15-type structure, shown in Fig. 17: i₁₂, i₃₂, i₅₂, i₇₂, i₁₆, i₃₆, i₅₆ and i₇₆ belong to this type. i₁₂, i₃₂, i₅₂, i₇₂, i₁₆, i₃₆, i₅₆ and i₇₆ are the transposes of i₂₁, i₂₃, i₂₅, i₂₇, i₆₁, i₆₃, i₆₅ and i₆₇ respectively, so i₁₂, i₂₁, i₃₂, i₂₃, i₅₂, i₂₅, i₇₂, i₂₇, i₁₆, i₆₁, i₃₆, i₆₃, i₅₆, i₆₅, i₇₆ and i₆₇ can share Lookup table 9;

C16-type structure, shown in Fig. 18: i₁₁, i₃₃, i₅₅, i₇₇, i₁₅, i₃₁, i₅₇, i₇₃, i₁₇, i₃₅, i₅₃, i₇₁, i₁₃, i₃₇, i₅₁ and i₇₅ belong to this type. Each basic unit in Lookup table 10 has 16 elements. Because i_uv and i_vu can share the same entry, the table contains ten entries, and the total number of elements is 16 × 4 + (16 × 12)/2 = 160;

Lookup table 10

In summary, this completes the lookup tables, which hold 1 + 1 + 1 + 4 + 4 + 12 + 16 + 16 + 64 + 160 = 279 elements in total.
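The table sizes quoted above can be cross-checked with a few lines of arithmetic. The entry counts and basic-unit sizes below are read off the text; the data layout itself is only an illustration.

```python
# (table number, stored entries after transpose sharing, elements per basic unit)
LOOKUP_TABLES = [
    (1, 1, 1),     # C1: i00
    (2, 1, 1),     # C2/C3: i04, shared with i40
    (3, 1, 1),     # C4: i44
    (4, 2, 2),     # C5/C6: i02 and i06, shared with i20 and i60
    (5, 2, 2),     # C7/C8: i42 and i46, shared with i24 and i64
    (6, 3, 4),     # C9: i22, i66, and one entry shared by i26 and i62
    (7, 4, 4),     # C10/C11
    (8, 4, 4),     # C12/C13
    (9, 8, 8),     # C14/C15
    (10, 10, 16),  # C16: 4 symmetric matrices plus 12 / 2 transpose pairs
]

sizes = {table: entries * elements for table, entries, elements in LOOKUP_TABLES}
total_elements = sum(sizes.values())
uncompressed = 64 * 64  # one full 8x8 matrix per coefficient position
```

Here `total_elements` evaluates to 279, against 4096 elements for the uncompressed table.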

Once the basic-unit lookup tables have been established, the fast inverse discrete cosine transform can be computed by table lookup. The final IDCT restored matrix f is obtained by multiplying the nonzero DCT coefficient F(u,v) at each position by i_uv and accumulating the products, i.e. by computing f = \sum_{u=0}^{7} \sum_{v=0}^{7} F(u, v) \times i_{uv}, where i_uv is obtained by table lookup. In the table-lookup calculation, F(u,v) × i_uv does not require multiplying F(u,v) by all 64 elements of i_uv: F(u,v) is multiplied only by the elements of the basic unit, the products are accumulated within the basic units, and the result is finally extended to restore the final IDCT result.

The table-lookup calculation of the fast inverse discrete cosine transform is described further below for the case of highest computational complexity (i.e. all DCT coefficients F(u,v) are nonzero), and its amount of computation is counted.

First, according to the matrix i_uv corresponding to each nonzero F(u,v), search the corresponding lookup table to obtain the basic unit of that matrix, and multiply F(u,v) by the elements of the basic unit. Let S_uv = F(u,v) × i_uv.

Then, superpose the products for matrices i_uv with similar symmetric structure types and accumulate them into the corresponding buffers, where each of f₁ to f₉ corresponds to one buffer.

f₁ = (S₀₀+S₀₄)+(S₄₀+S₄₄): 1 × 4 = 4 multiplications, 2 + 2 + 4 = 8 additions;

f₂ = (S₂₀+S₆₀)+(S₂₄+S₆₄): 2 × 4 = 8 multiplications, 2 + 2 + 4 = 8 additions;

f₃ = (S₀₂+S₀₆)+(S₄₂+S₄₆): 2 × 4 = 8 multiplications, 2 + 2 + 4 = 8 additions;

f₄ = (S₁₀+S₃₀+S₅₀+S₇₀+S₁₄+S₃₄+S₅₄+S₇₄): 4 × 8 = 32 multiplications, 4 × 8 = 32 additions;

f₅ = (S₀₁+S₀₃+S₀₅+S₀₇+S₄₁+S₄₃+S₄₅+S₄₇): 4 × 8 = 32 multiplications, 4 × 8 = 32 additions;

f₆ = (S₂₂+S₆₆)+(S₂₆+S₆₂): 4 × 4 = 16 multiplications, 4 × 3 = 12 additions;

f₇ = (S₂₁+S₂₃+S₂₅+S₂₇+S₆₁+S₆₃+S₆₅+S₆₇): 8 × 8 = 64 multiplications, 7 × 8 = 56 additions;

f₈ = (S₁₂+S₃₂+S₅₂+S₇₂+S₁₆+S₃₆+S₅₆+S₇₆): 8 × 8 = 64 multiplications, 7 × 8 = 56 additions;

f₉ = (S₁₁+S₃₃+S₅₅+S₇₇+S₃₁+S₅₇+S₁₅+S₇₃+S₃₅+S₁₇+S₅₃+S₇₁+S₁₃+S₃₇+S₅₁+S₇₅): 196 multiplications, 180 additions.

Finally, adding together first those of f₁ to f₉ whose symmetry is similar reduces the number of additions; the nine matrices are therefore extended and superposed in the order given by formula (6) to obtain the final restored matrix:

f = {[(f₁+f₂)+(f₃+f₆)]+f₉} + [(f₄+f₈)+(f₅+f₇)]    (6)

This requires 8 + 8 + 16 + 32 + 16 + 16 + 32 + 64 = 192 additions in total.

The calculation is illustrated in Fig. 19, where solid arrows denote superposition and hollow arrows denote extension. For example, let the basic unit of S₀₀ be [a₁] and that of S₀₄ be [a₂]. According to the C1 and C2 symmetric structures, [a₁] and [a₂] are extended and then superposed, giving [a₁+a₂  a₁−a₂] as the basic unit of S₀₀+S₀₄. Similarly, if the basic unit of S₄₀ is [a₃] and that of S₄₄ is [a₄], the C3 and C4 symmetric structures give [a₃+a₄  a₃−a₄] as the basic unit of S₄₀+S₄₄. From the C1, C2, C3 and C4 symmetric structures, the basic unit of f₁ = (S₀₀+S₀₄)+(S₄₀+S₄₄) is \begin{bmatrix} a₁+a₂+a₃+a₄ & a₁−a₂+a₃−a₄ \\ a₁+a₂−a₃−a₄ & a₁−a₂−a₃+a₄ \end{bmatrix}. Continuing in the same way, the accumulated results are extended to give the final restored matrix.
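The extension and superposition for f₁ can be checked numerically. The sketch below assumes the normalisation C(0) = 1/√2, under which every element of i₀₀, i₀₄, i₄₀ and i₄₄ has magnitude 1/8; the sign pattern of cos((2k+1)π/4) then decides which element of a 2 × 2 unit each output position takes. The function names and the 2 × 2 layout are illustrative assumptions.

```python
import math

N = 8

def c(k):
    # Assumed normalisation factor: 1/sqrt(2) for k == 0, otherwise 1.
    return 1 / math.sqrt(2) if k == 0 else 1.0

def basis(u, v):
    """i_uv = IDCT(I_uv) for an 8x8 block."""
    return [[0.25 * c(u) * c(v)
             * math.cos((2 * x + 1) * math.pi * u / 16)
             * math.cos((2 * y + 1) * math.pi * v / 16)
             for y in range(N)] for x in range(N)]

def sign_class(k):
    """0 where cos((2k+1)*pi/4) is positive, 1 where it is negative."""
    return 0 if k in (0, 3, 4, 7) else 1

def f1_direct(F00, F04, F40, F44):
    """Sum the four full 8x8 contributions S00 + S04 + S40 + S44."""
    terms = [(F00, basis(0, 0)), (F04, basis(0, 4)),
             (F40, basis(4, 0)), (F44, basis(4, 4))]
    return [[sum(w * m[x][y] for w, m in terms) for y in range(N)]
            for x in range(N)]

def f1_from_unit(F00, F04, F40, F44):
    """Same result from 4 multiplications and 2 + 2 + 4 = 8 additions."""
    a = 1 / 8
    a1, a2, a3, a4 = a * F00, a * F04, a * F40, a * F44
    top = (a1 + a2, a1 - a2)        # basic unit of S00 + S04
    bottom = (a3 + a4, a3 - a4)     # basic unit of S40 + S44
    unit = [[top[0] + bottom[0], top[1] + bottom[1]],
            [top[0] - bottom[0], top[1] - bottom[1]]]
    # Extension: replicate the 2x2 unit over the block by sign class.
    return [[unit[sign_class(x)][sign_class(y)] for y in range(N)]
            for x in range(N)]

direct = f1_direct(80.0, -12.0, 5.0, 3.0)
fast = f1_from_unit(80.0, -12.0, 5.0, 3.0)
```

Both functions return the same 8 × 8 block, while the second touches only four stored values per output position class.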

In the extreme case of 64 nonzero coefficients, a complete 8 × 8 IDCT costs 424 multiplications and 584 additions. This exceeds the complexity of the butterfly operation, but in practice the case rarely occurs. Actual DCT coefficient matrices contain only a few nonzero coefficients, and the complexity of the algorithm then drops sharply (for example, when only the DC coefficient is present, the algorithm needs just 1 multiplication and 1 addition). The complexity is variable and depends on the number and positions of the nonzero coefficients. During encoding and decoding, the number of nonzero coefficients is determined mainly by the quantization parameter QP. In tests with an H.263 decoder decoding the Foreman bitstream in QCIF format, for QP values from 6 to 30 and at equal precision, the IDCT was 40% to 70% more efficient than the conventional butterfly operation.
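The worst-case totals can be cross-checked against the per-buffer figures. The counts below are the ones stated in the text for f₁ to f₉ and for the final combination; only the tallying code is added here.

```python
# (multiplications, additions) per buffer, as stated in the text
BUFFER_OPS = {
    "f1": (4, 8), "f2": (8, 8), "f3": (8, 8),
    "f4": (32, 32), "f5": (32, 32), "f6": (16, 12),
    "f7": (64, 56), "f8": (64, 56), "f9": (196, 180),
}

# Additions needed to extend and combine f1..f9 in the order of formula (6)
FINAL_ADDS = 8 + 8 + 16 + 32 + 16 + 16 + 32 + 64

total_mults = sum(m for m, _ in BUFFER_OPS.values())
total_adds = sum(a for _, a in BUFFER_OPS.values()) + FINAL_ADDS

# Fixed cost of the row-column butterfly scheme quoted in the background
BUTTERFLY_MULTS, BUTTERFLY_ADDS = 384, 384
```

The tally gives 424 multiplications and 584 additions, both above the 384 of the butterfly scheme, which is why the gain depends on the block being sparse.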

In addition, actual IDCT decoding generally does not use floating-point arithmetic directly; to increase calculation speed, floating-point operations are usually converted to fixed-point operations. To realize fixed-point arithmetic in the algorithm of the invention, the elements of the tables i_uv are scaled up and rounded. The precision can be controlled by adjusting the scaling factor according to the required precision. The multiply-accumulate calculations are then carried out on fixed-point data, and after the accumulation of formula (6) the result must be scaled down by the same factor.
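A minimal fixed-point sketch of this idea follows. The scaling factor of 2¹² is an arbitrary assumption (the text leaves the factor to the precision requirement), full 8 × 8 table entries are used instead of the compressed basic units, and the names `SHIFT`, `basis_fixed` and `idct2_fixed` are hypothetical.

```python
import math

N = 8
SHIFT = 12  # assumed scaling factor 2**12; the text leaves the factor open

def c(k):
    # Assumed normalisation factor: 1/sqrt(2) for k == 0, otherwise 1.
    return 1 / math.sqrt(2) if k == 0 else 1.0

def basis_float(u, v):
    """Floating-point i_uv = IDCT(I_uv)."""
    return [[0.25 * c(u) * c(v)
             * math.cos((2 * x + 1) * math.pi * u / 16)
             * math.cos((2 * y + 1) * math.pi * v / 16)
             for y in range(N)] for x in range(N)]

def basis_fixed(u, v):
    """Table entry scaled up by 2**SHIFT and rounded to integers."""
    return [[round(val * (1 << SHIFT)) for val in row]
            for row in basis_float(u, v)]

def idct2_fixed(coeffs):
    """coeffs: dict {(u, v): integer F(u, v)}; returns an 8x8 integer block."""
    acc = [[0] * N for _ in range(N)]
    for (u, v), value in coeffs.items():
        table = basis_fixed(u, v)
        for x in range(N):
            for y in range(N):
                acc[x][y] += value * table[x][y]  # integer multiply-accumulate
    half = 1 << (SHIFT - 1)
    # Scale back down by the same factor, rounding to nearest.
    return [[(acc[x][y] + half) >> SHIFT for y in range(N)] for x in range(N)]

pixels = idct2_fixed({(0, 0): 80, (1, 2): -30, (3, 3): 17})
```

A larger `SHIFT` reduces the rounding error of the table at the cost of wider accumulators.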

With reference to Fig. 20, the following describes how the variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform is combined with the decoding and dequantization modules to achieve parallelism in the decoding process.

When the coefficient F(u,v) obtained after decoding and dequantization is zero, it is skipped without performing the IDCT, and decoding continues with the level and position (u,v) of the next coefficient;

When the coefficient F(u,v) obtained after decoding and dequantization is nonzero, i_uv is obtained by table lookup, and F(u,v) × i_uv is computed and accumulated into the corresponding buffer among f₁ to f₉. For example, if the decoded nonzero coefficient F(u,v) is at position (5,2), S₅₂ is added to the buffer corresponding to f₈. Once all nonzero F(u,v) have been processed, i.e. when decoding encounters the last==1 signal, f₁ to f₉ are accumulated in the order of formula (6) to give the restored matrix f, i.e. the IDCT result.

Computing f₁ to f₉ one after another would require a fixed scanning order over the 8 × 8 DCT coefficient matrix F, which differs from the order in which nonzero coefficients emerge from actual decoding. To merge the three steps of decoding a nonzero coefficient F(u,v), dequantizing it, and computing its contribution to the IDCT (as shown in Fig. 20), nine buffers corresponding to f₁ to f₉ must be allocated, with a total size of 4 + 4 + 4 + 8 + 8 + 4 + 8 + 8 + 16 = 64.
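Routing each decoded coefficient to its buffer can be sketched as a small lookup. The mapping below is inferred from the members listed for f₁ to f₉ and from the buffer sizes given in the text; the function and variable names are hypothetical.

```python
# Buffer sizes for f1..f9 as given in the text
BUFFER_SIZES = {1: 4, 2: 4, 3: 4, 4: 8, 5: 8, 6: 4, 7: 8, 8: 8, 9: 16}

def _index_class(k):
    """Coarse class of a frequency index: {0, 4}, {2, 6} or odd."""
    if k in (0, 4):
        return "04"
    if k in (2, 6):
        return "26"
    return "odd"

# Inferred from the sums that define f1..f9
_BUFFER_OF = {
    ("04", "04"): 1, ("26", "04"): 2, ("04", "26"): 3,
    ("odd", "04"): 4, ("04", "odd"): 5, ("26", "26"): 6,
    ("26", "odd"): 7, ("odd", "26"): 8, ("odd", "odd"): 9,
}

def buffer_index(u, v):
    """Number k of the buffer f_k that receives S_uv."""
    return _BUFFER_OF[(_index_class(u), _index_class(v))]

def route(events):
    """Group decoded (u, v, value) triples by destination buffer.

    Zero coefficients are skipped, mirroring the decoding flow in the text.
    """
    routed = {k: [] for k in BUFFER_SIZES}
    for u, v, value in events:
        if value != 0:
            routed[buffer_index(u, v)].append((u, v, value))
    return routed

routed = route([(0, 0, 80), (5, 2, -7), (0, 1, 0), (3, 3, 4)])
```

Because the destination depends only on (u, v), coefficients can be accumulated in whatever order the entropy decoder emits them.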

The above are merely preferred embodiments of the present invention, but the scope of protection of the invention is not limited to them. Any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed by the invention shall fall within the scope of protection of the invention. The scope of protection of the invention shall therefore be determined by the scope of protection defined by the claims.

Claims

1. A variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform, characterized in that it is applied to the technical field of digital image and video compression;

the variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform comprises the following steps:

(1) transforming the dequantized DCT coefficient matrix F into a sum of multiple term matrices, whereby the IDCT of F is converted into the superposition of the basic matrix unit of each position multiplied by the weight coefficient of that position;

(3) finding, by searching lookup tables, the term matrices after the inverse discrete cosine transform, and adding the term matrices after the inverse discrete cosine transform to obtain the restored matrix f;

wherein the lookup tables are established by taking as constants the basic matrix units obtained by applying the inverse discrete cosine transform to each term matrix of the multiple term matrices.

2. The variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform according to claim 1, characterized in that step (2) further comprises: scaling up and rounding the elements of each term matrix after the inverse discrete cosine transform, according to the required precision;

step (3) further comprises: scaling down each element of the resulting restored matrix f by the same factor used for scaling up, yielding the precision-controlled restored matrix f′.

3. The variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform according to claim 1, characterized in that step (1) is specifically:

the dequantized DCT coefficient matrix F is transformed into:

F = F(0,0) × I₀₀ + ... + F(u,v) × I_uv + ... + F(n,n) × I_nn

where u, v = 0, 1, ..., n, with n set by the number of rows and columns of the matrix F; F(u,v) denotes the value at position (u,v) of the DCT coefficient matrix F; I_uv denotes the matrix whose (u,v) element is 1 and whose other elements are all 0;

step (2) is specifically:

step (3) is specifically:

each nonzero F(u,v) is multiplied by its i_uv, and the products are summed to give the restored matrix f of the DCT coefficient matrix F after the inverse discrete cosine transform.

4. The variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform according to claim 3, characterized in that it further comprises: establishing lookup tables with the matrices i_uv as constants;

5. The variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform according to claim 4, characterized in that the lookup tables are established as follows:

determining the symmetric structure type of each matrix i_uv;

listing the basic units of matrices i_uv with similar symmetric structure types in the same lookup table;

wherein matrices i_uv with similar symmetric structure types include: all matrices i_uv of the same symmetric structure type, and any two matrices i_uv and i_vu that are transposes of each other.

6. The variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform according to claim 4 or 5, characterized in that, based on the lookup tables, step (3) is calculated as follows:

7. The variable-complexity table-lookup fast algorithm for the inverse discrete cosine transform according to claim 3 or 4, characterized in that step (2) further comprises: scaling up and rounding the elements of the matrices i_uv according to the required precision;