CN104113763A

CN104113763A - Optimized integer transform radix applied to image coding

Info

Publication number: CN104113763A
Application number: CN201310138425.4A
Authority: CN
Inventors: 吴君钦; 李艳丽
Original assignee: Jiangxi University of Science and Technology
Current assignee: Jiangxi University of Science and Technology
Priority date: 2013-04-22
Filing date: 2013-04-22
Publication date: 2014-10-22

Abstract

The invention discloses an optimized integer transform radix applied to image coding. Integer DCT matrix filling rules are improved, so that a type of new integer transform radixes is obtained and, the integer transform radixes are named as variety integer DCT radixes. A kind of optimized integer transform radix is obtained through searching a large number of variety integer DCT radixes. The optimized integer transform radix has the following characteristics that: matrix elements are integers; the optimized integer transform radix satisfies an orthogonal constraint condition; a normalized orthogonal matrix and the original DCT radixes have similar decorrelation performance; one-dimensional transform has a complete butterfly fast algorithm; only addition and shifting operation is required in a transform process, and multiplication does not exist, and therefore, the accuracy of transform operation is high; and forward and reverse transform are completely reversible, and no floating point drifts, and therefore, the transform operation is easy to realize through hardware. With the transform radix and transform radix transform processing method applied to video image coding adopted, the complexity of an encoder can be reduced.

Description

A kind of preferred integer translation base that is applied to Image Coding

one, technical field

The present invention relates to image coding technology field, relate in particular to a kind of image conversion coding method (coding method of class Integer DCT Transform, or excellent integer transform method).

two, background technology

Image Coding refers to meeting under the condition of certain mass (requirement of signal to noise ratio or subjective assessment score), with the technology of institute's inclusion information in less bit number presentation video or image.

Discrete cosine transform (Discrete Cosine Transform) after being proposed by people such as N.Ahmed for 1974, owing to can greatly removing the correlation of pictorial element at transform domain, be considered to the accurate optimal mapping that performance approaches optimal transformation Karhunen-Loeve transformation, be widely used at image and video compression coding field.Conventional dct transform matrix represents by floating number, more multiplying and add operation are adopted, and multiplying needs to consume more system resource in general-purpose computations chip, take more operation time, under limited word length condition, there is truncated error in floating-point operation, and precision is not high, there is drift in Code And Decode data, can cause and not mate phenomenon.Integer DCT Transform adopts integer to replace conventional floating number to represent transformation matrix.The core of conversion is integer transform, calculates without floating number, and accuracy is high; Kernel kernal mapping only needs simple addition and shift operation to complete, and has reduced computation complexity; There is not trueness error in integer arithmetic, ensure the invertibity of coding, solved the matching problem between the positive inverse transformation of encoder and decoder, therefore audio frequency and video compression standard of new generation H.264/AVC, China national audio/video encoding standard AVS etc., standard has all adopted Integer DCT Transform.

The selection of integer translation base plays conclusive effect to the performance of orthogonal transform.Because integer transform basic matrix is not unique, Integer DCT Transform base is one of numerous integer translation bases, and selecting transform-based criterion is that its normalization coefficient matrix and dct transform matrix are approaching as far as possible, to reach best decorrelation performance.And the present invention " class Integer DCT Transform " selection criterion of base is: decorrelation characteristic and the DCT characteristic of transform-based are close as far as possible, and do not pursue its normalization coefficient matrix and DCT matrix approaches, and have the object of complete dish-shaped fast algorithm to reach transform operation.

three, summary of the invention

The object of example of the present invention is to propose a kind of preferred integer translation base that is applied to Image Coding, be intended to solve two aspect problems below: (1) existing dct transform matrixing adopts floating-point multiplication and addition, take hardware resource many, computational accuracy is not high, the positive inverse transformation of DCT can not be mated completely, converts irreversible problem; (2) DCT, integer DCT are without complete fast algorithm, and hardware implementation complexity is high.Preferably the acquisition of integer translation base is that complexity, the raising performance that further reduces image encoder has huge practical value.

Method of the invention process is achieved in that integer translation base builds matrix hfor:

It is characterized in that: wherein a ₀, a ₁, a ₂, a ₃, a ₄, a ₅, a ₆be 1 ~ 15 integer, excellent integer transform base element a ₀, a ₁, a ₂, a ₃, a ₄, a ₅, a ₆value be { Isosorbide-5-Nitrae, 3,2,5,6,8}.Described matrix hmust meet orthogonality, for diagonal matrix, that is:

In formula , orthogonality requirement , .

Preferably integer translation base is for receiving 8 × 8 Image Residual data of Image Coding prediction module output x , carrying out transition coding processing, the matrix relationship of the positive inverse transformation of its two dimension is expressed as:

Wherein x for original input residual error data matrix, y for transformation results, g for h normalization matrix, for matrix dot multiplication, so integer transform is reversible.

For the preferred integer translation base transformation matrix of Image Coding h middle all elements is integer, and conversion is integer arithmetic entirely, normalization dot product computing can with quantize to merge, in conversion final stage processing, normalized can narrow down to suitable size by the conversion coefficient having amplified.

The preferred integer translation base of Image Coding, is characterized in that, two-dimentional class Integer DCT Transform can be decomposed into one-dimensional transform twice.One dimension class integer DCT direct transform matrix expression is , wherein x, Xfor a dimensional vector before and after conversion, be launched into direct transform evaluator and be expressed as:

Wherein with , with , with , with between these four groups of data, form butterfly computation relation, computing obtains four groups of new data with , with , with , with , between four groups of new data, can again utilize butterfly computation, its result is multiplied by respectively different coefficients, obtains last transformation results through applying butterfly computation for the third time.Its computing flow process as shown in Figure 1, has complete butterfly fast algorithm.

four, brief description of the drawings

Fig. 1 is the flow chart of preferred integer transform fast algorithm of the present invention;

Fig. 2 is the application example of the present invention in video encoder flow chart.

five, embodiment

In order to make object of the present invention, technical scheme and advantage clearer, below in conjunction with accompanying drawing and embodiment, the present invention is further elaborated, for convenience of explanation, only show the part relevant to the invention process example.Should be appreciated that the concrete embodiment that this place is described, only for explaining the present invention, not in order to limit the present invention.

The present invention proposes the integer transform that a class is new and (is called: " class integer DCT " conversion, or preferred integer transform) and fast algorithm, the decorrelation performance of preferred class integer translation base orthonormalization matrix and dct transform is close, alternative Integer DCT Transform role in Image Coding.

case study on implementation

The MPEG4 video encoder Xvidcore that increases income is the coding and decoding video of telotism, comprise infra-frame prediction, estimation and inter prediction, dct transform and the entropy several parts composition of encoding, encoder input is the video sequence of QCIF, and output is the video file of MPEG4 form.Input video two field picture in piecemeal, frame/inter prediction, form 8 × 8 residual error data pieces and send into conversion, quantization modules, finally by the harmless entropy output squeezing video data of encoding, detailed process is shown in accompanying drawing 2.In order to analyze preferred integer transform performance, positive the DCT in encoder inverse transformation operator source code is changed to the positive inverse operator of preferred integer transform of the present invention, and insert Performance Evaluation code, the decorrelation efficiency of calculation code device, pSNR, the isoparametric population mean of compression ratio, the modules such as dct transform in accompanying drawing 2, decorrelation efficiency estimation, quantification, inverse quantization, inverse dct transform are seen in concrete insertion position.

Statistic property parameter calculation formula and being described as follows:

y-PSNR: ,

Wherein , f( i, j) be input picture, goriginal image is gone back in (i, j) decoding,

compression ratio meter: ,

Wherein yUV_LEN isinput original YUV ASCII stream file ASCII length, mPEG4_LEN isoutput MPEG4 ASCII stream file ASCII length

decorrelation efficiency:

If be 8 8 input residual matrixes, for converting rear coefficient matrix, scan by " Z " word the dimensional vector obtaining , , n =64, and column vector is carried out to periodicity and expand , yeach rank auto-correlation coefficient computing formula is:

The auto-correlation extent index of output sequence computing formula is:

To given input residual error data x, the decorrelation efficiency of transform operation is defined as:

standard dct transform matrix pfill method :

Wherein coefficient , , , , , , value can be calculated by cosine function, wherein , , , , , , ,

integer DCT direct transform matrix ifill method :

Wherein Integer DCT Transform matrix can use integer quotient , , , , , , vectorial unique expression,

Table 1 is standard DCT, integer DCT, preferably integer translation base is filled element vector table.

Table 2, table 3 have provided each transform-based and have been applied to the Performance Evaluation result that MPEG4 encodes.

Can find out from table 2, table 3, preferably integer transform is applied to MPEG4 coding, and the deviation of the performance index such as its compression ratio, Y-PSNR, decorrelation efficiency and standard dct transform is less than 5%, suitable with integer DCT base.

Before accompanying drawing 1 fast algorithm, two-stage is without multiplication, the last third level needs multiplication of integers operation, can adopt the summation that is shifted by turn to replace multiplication, realize 3 shifting functions of maximum needs, the 3 sub-addition operations of 4Bit × 10Bit parallel multiplier, if the complexity of 1 sub-addition or shifting function is 1, can be with reference to the accompanying drawings 1 to obtain preferred integer transform fast algorithm overall complexity be 66.And the overall complexity of Integer DCT Transform fast algorithm is 148.So the preferred integer transform of the present invention is realized and can be saved 55% gate resource with hardware.

Claims

1. an integer translation base that is applied to Image Coding, is characterized in that: transform-based matrix is 8 × 8 INTEGER MATRICES h , matrix element fill method is:

Wherein a ₀, a ₁, a ₂, a ₃, a ₄, a ₅, a ₆be 1 ~ 15 integer, optimum integer transform base element a ₀, a ₁, a ₂, a ₃, a ₄, a ₅, a ₆value is { Isosorbide-5-Nitrae, 3,2,5,6,8}.

2. the Image Coding integer transform basic matrix as described in claim 1 h ,it is characterized in that h must meet orthogonality, i.e. requirement hH ^tfor diagonal matrix:

In formula k=4 ( a ₁ a ₅- a ₂ a ₆), for meeting orthogonality, must meet k=0, a ₁ a ₅= a ₂ a ₆ .

3. integer translation base as described in claim 1, is characterized in that, for receiving 8 × 8 Image Residual data of Image Coding prediction module output x , for transition coding processing, the matrix relationship of the positive inverse transformation of its two dimension is expressed as:

Wherein x for original input residual error data matrix, y for transformation results, g for h normalization matrix, for matrix dot computing, so integer transform is reversible.

4. the integer translation base of the Image Coding as described in claim 1, is characterized in that transformation matrix h middle all elements is integer, and conversion be full integer arithmetic, normalization with quantize to merge, converting final stage processing, normalized can narrow down to suitable size by the conversion coefficient having amplified.

5. the Image Coding integer translation base as described in claim 1, is characterized in that, two-dimentional class Integer DCT Transform can be decomposed into one-dimensional transform twice, and one dimension class integer DCT direct transform matrix expression is x= hx, wherein x, Xfor a dimensional vector before and after conversion, be launched into evaluator and be expressed as:

Wherein x ₀with x ₇ , x ₁with x ₆ , x ₂with x ₅ , x ₃with x ₄ ,between these four groups of data, form butterfly computation relation, computing obtains four groups of new data x ₀+ x ₇with x ₀ -x ₇, x ₁+ x ₆with x ₁ -x ₆, x ₂+ x ₅with x ₂ -x ₅, x ₃+ x ₄with x ₃ -x ₄, between four groups of new data, can again utilize butterfly computation, its result is multiplied by respectively different coefficients, obtains last transformation results through applying butterfly computation for the third time, and its operational flowchart provides in Figure of description 1.