CN1564602A - Integral conversing matrix selection method of video coding and related integral conversion method - Google Patents
Integral conversing matrix selection method of video coding and related integral conversion method Download PDFInfo
- Publication number
- CN1564602A CN1564602A CNA2004100128571A CN200410012857A CN1564602A CN 1564602 A CN1564602 A CN 1564602A CN A2004100128571 A CNA2004100128571 A CN A2004100128571A CN 200410012857 A CN200410012857 A CN 200410012857A CN 1564602 A CN1564602 A CN 1564602A
- Authority
- CN
- China
- Prior art keywords
- transform
- integer
- matrix
- eval
- eta
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- A—HUMAN NECESSITIES
- A41—WEARING APPAREL
- A41B—SHIRTS; UNDERWEAR; BABY LINEN; HANDKERCHIEFS
- A41B5/00—Fold-line formings for collars or cuffs
-
- A—HUMAN NECESSITIES
- A41—WEARING APPAREL
- A41B—SHIRTS; UNDERWEAR; BABY LINEN; HANDKERCHIEFS
- A41B1/00—Shirts
- A41B1/08—Details
-
- A—HUMAN NECESSITIES
- A41—WEARING APPAREL
- A41B—SHIRTS; UNDERWEAR; BABY LINEN; HANDKERCHIEFS
- A41B1/00—Shirts
- A41B1/08—Details
- A41B1/18—Shirt-fronts
- A41B1/22—False shirt-fronts, e.g. dickeys, with or without attached collars; Means for attaching or stretching
-
- A—HUMAN NECESSITIES
- A41—WEARING APPAREL
- A41B—SHIRTS; UNDERWEAR; BABY LINEN; HANDKERCHIEFS
- A41B7/00—Cuffs
Abstract
The invention is especially relative to integer transformation for image data compression in video codec and introduces a method for selecting transform radix. Through estimating decorrelation efficiency, energy concentration string, transform dynamical range and calculation complexity of tranform radix, the two groups of 8X8 integer transform radix (5,6,4,1) and (4,5,3,1) are introduced, and a fast algorithm based on the two groups transform radix is obtained.
Description
Technical field
The invention belongs to image processing technique, particularly the integer transform of Image Data Compression in the Video Codec.Mainly comprise integer transform transform-based (transformation matrix) system of selection and based on the implementation method of the piece conversion of selected transform base.
Background technology
Existing international video standard as H.264, among the motion picture expert group MPEG-4, vision signal is divided into sequence, frame, band, macro block, piece by level; Minimum processing unit is a piece.At coding side, by infra-frame prediction or inter prediction, obtain the prediction residual of piece and do the piece conversion, with concentration of energy in a few coefficient; Again by quantification, scanning, Run-Length Coding and entropy coding, with Image Data Compression and write encoding code stream.Then process is opposite in decoding end, extracts the block conversion coefficient of entropy coding from code stream, by inverse quantization and inverse transformation, recovers the prediction residual of piece, in conjunction with information of forecasting, finally recovers the video data of piece.In the encoding and decoding flow process, conversion module is the basis of video compression, and property directly influences the combination property of codec.
Early stage international standard such as MPEG-1, H.261 adopt discrete cosine transform DCT.DCT after proposing in 1974, obtained using widely at image and field of video encoding, its property is highly significant in all suboptimum conversion, can greatly remove the correlation of pictorial element in transform domain, for high efficiency image compression is laid a good foundation.But because the transformation matrix of DCT is to represent with floating number, the floating-point operation amount is bigger, takies than multi-system resource.In order to improve conversion efficiency, develop again and approach floating-point operation DCT with fixed-point computation or bigger integer transform, owing to there is trueness error,, can not recover view data fully after the inverse transformation even without through quantizing, just Bian Ma invertibity is not strong.The generation of integer transform has solved the problem of computational accuracy and code efficiency simultaneously, is characterized in replacing with the integer transform matrix floating number transformation matrix of DCT, and conversion process is integer arithmetic fully like this, does not have trueness error, has guaranteed the invertibity of encoding; Multiplication of integers can replace with addition and subtraction and displacement simultaneously, so conversion process can realize that by addition and subtraction and displacement operand reduces significantly fully.Up-to-date international standard H.264/MPEG-4 Part 10 just adopts integer transform, and has obtained extraordinary transform effect.In recent years, at image and field of video processing many researchs at integer transform are arranged, existing foreign patent about integer transform mainly contains:
1.U.S.Patent No.5999957A " Lossless Transform System For Digital Signals "; This patent multiply by fixed numbers by each row to the dct transform matrix, rounds approximately again, and the transformation matrix coefficient is become integer to realize inverible transform.This transformation matrix derivation is not considered the orthogonality of conversion, can not guarantee that integer transform is orthogonal transform, thereby have influence on property.And quantizing process has repeatedly multiplication and division, calculation of complex.Repeatedly multiplication is arranged in the fast transform algorithm, influence conversion efficiency.
2.WO?01/08001A1“Integer?Cosine?Transform?Using?Integer?Operations”;
3.U.S.Patent No.20020111979A1 " Integer Transform Matrix For PictureCoding "; This patent provides a kind of evaluation method of integer transform matrixing performance, and main by the degree of approximation of comparison with the dct transform matrix, this method guarantees the orthogonality of conversion.Patent has provided 4 and has taken advantage of 4,8 to take advantage of 8,16 to take advantage of best in theory transformation matrix under 16 3 kinds of situations.This method is not considered the influence of computation complexity to property yet, and for the vector normal form that guarantees each row or column is identical, the performance of selected transformation matrix is not near DCT's.
4.U.S.Patent No.2003/0093452A1 " Video Block Transform "; This patent provide based on H.26L 4 take advantage of 4 quadrature and integer transform, the inverse transformation matrix of nonopiate form, macro block DC coefficient transformation matrix and orthogonal transform corresponding quantitative step-length.The transformation matrix yardstick of this patent is different with the present invention, and the transformation matrix of small scale is not suitable for application such as high definition TV.
8 take advantage of 8 discrete cosines can use following formulate:
Wherein
C(w)=1,(w=1,…,7)。Form with matrix can be expressed as: Y=P
0XP
0 T, wherein X 8 takes advantage of 8 pixel prediction residual matrixes, and Y is the matrix after the conversion.
Wherein
H.264 to the rewriting process of 4 * 4DCT conversion, it is as follows to rewrite 8 * 8 conversion: from P according to international standard
0Each row a common coefficient is proposed, obtain vectorial V
8=[a, m, f, m, a, m, f, m], wherein m is matrix P
0The common coefficient that proposes of even number line, the value of m is the positive number that is not more than k4.Then transformation matrix is rewritten as:
Y=P
1XP
1 TE
8 (2)
Wherein represents the multiplication cross (the element correspondence that is same position multiplies each other) of two matrixes.For (2) formula can with matrix E
8Multiplication cross computing and quantization operation put together, make conversion obtain simplifying.Therefore conversion focuses on P
1XP
1 TCalculating, wherein X is 8 * 8 pixel prediction residual matrix, is the shaping data.If P
1In variable k1, k2, k3, k4, k5 are integer, then whole conversion will all be converted into integer arithmetic.So following work just needs to determine k1, k2, k3, k4, the choosing value of five parameters such as k5.The present invention is by experimental results demonstrate, as selected k1, k2, k3, behind the k4, it is best that k5 gets 2 o'clock property, and Cham is at his article (Development of Integer Cosine Transform by the Principle of DyadicSymmetry.IEE Proceedings, 1989,136 (4): 276-288) also proposed similar conclusion.Thereby the present invention with k5 fixedly value be 2, only study the selection of all the other four parameters, and the definition (k3 k4) is the transform-based of integer transform for k1, k2.Corresponding transformation matrix P is
Summary of the invention
The present invention proposes a kind of system of selection of integer transform matrix and relevant integer transform method of video coding, first audio/video encoding standard (AVS) that will formulate at current China adopts 8 to take advantage of 8 integer class dct transforms, a kind of transform-based system of selection of integer transform has been proposed, two indexs of the decorrelation efficiency of overall merit transform-based and energy compaction efficiency and transform-based conversion dynamic range and computation complexity, and propose 8 of two groups of excellent performances by the method and take advantage of 8 integer translation bases (5,6,4,1) and (4,5,3,1), and obtain integer transform fast algorithm based on these two groups of bases.
The selection of transform-based is mainly based on following several principles:
Principle 1: conversion orthogonality.The characteristic of orthogonal transform guarantees that conversion only is the rotation to coordinate system, and the energy of image remains unchanged.In order to guarantee the orthogonality of conversion, the P in the formula (2) must satisfy the following formula condition:
P·P
T=Diag (3)
Wherein Diag is a diagonal matrix, and promptly its non-leading diagonal element is zero.Quantizing process by adjusting quantization matrix, makes conversion satisfy orthogonality again.
Principle 2: concentration of energy.The purpose of DCT exchange is to remove the correlation between the element, the energy after the conversion is concentrated on a few coefficients as far as possible, so that improve the compression efficiency that quantizes the back entropy coding.The selection of integer translation base equally also needs to follow this principle.
Principle 3: fast transform algorithm simplicity.Requirement transform-based numerical value can not be too big, and the calculation times of fast algorithm is the least possible.
The integer transform matrix system of selection of a kind of video coding of the present invention in turn includes the following steps:
(1) at first all integer translation bases of orthogonality condition are satisfied in search within the specific limits, take advantage of 8 integer transform matrix P for 8, the definition transform-based be (k1, k2, k3, k4),
The span of transform-based coefficient k 1, k2, k3, k4 is k1, k2, k3 ∈ [1,10], and k4 ∈ [1,4] obtains all and satisfies PP
TThe integer orthogonal transform base of=Diag;
(2) set up the covariance matrix COV (X that the input picture residual error data was got at cross-correlation coefficient ρ at 0.75,0.8,0.85,0.9,0.95 o'clock
v),
If length is 8 image residual error data one-dimensional vector is X
v=[x
1, x
2... x
8], set up X by the single order Markov model
vCovariance matrix COV (the X of element
v), COV (X
v)
(i, j)=ρ
| i-j|(0≤i, j≤7), wherein ρ is X
vThe cross-correlation coefficient of adjacent element (ρ≤1),
(3), obtain the covariance matrix COV (Y of transform domain by the transformation matrix P of transform-based correspondence
v),
Transform-based (k1, k2, k3, k4) Dui Ying transformation matrix P, normalization, promptly each row element of P obtains orthogonal matrix P divided by the length of this row vector
uTo X
vMake orthogonal transform Y
v=P
uX
v, Y
vCovariance matrix be:
COV(Y
v)=P
u·COV(X
v)·P
u T (4)
(4) respectively organize the energy compaction efficiency η that transform-based was got at cross-correlation coefficient ρ at 0.75,0.8,0.85,0.9,0.95 o'clock by (2), (3) calculating
EValue and decorrelation efficiency η
CValue,
Definition energy compaction efficiency η
EFor:
Decorrelation efficiency η
cFor:
(5) calculate the energy compaction efficiency η of each transform-based under the cross-correlation coefficient ρ of appointment
EValue and decorrelation efficiency η
CNormalized result, under the same ρ, i transform-based energy compaction efficiency η
EThe normalization result be:
I transform-based decorrelation efficiency η
CThe normalization result be:
(6), obtain every group of base energy compaction efficiency η under each cross-correlation coefficient ρ by weighted sum
EValue, the comprehensive evaluation result Eval of decorrelation efficiency η C
E,,, Eval
C, the weight of 5 ρ correspondences is respectively 1/15,2/15,3/15,4/15,5/15;
(7) pass through Eval
CAnd Eval
EThese two index weighted sums obtain the comprehensive evaluation value Eval of transform-based performance, Eval
CAnd Eval
ECorresponding weight is respectively 0.4,0.6.
The integer transform matrix system of selection of described video coding, after it is further characterized in that the comprehensive evaluation value Eval that obtains the transform-based performance, increase transform-based (k1, k2, k3, the evaluation procedure of computation complexity k4): at first select the higher transform-based of comprehensive evaluation value Eval; If it is addition and subtraction and the few transform-based of displacement number of times that the Eval gap, pays the utmost attention in the application that real-time is had relatively high expectations that computation complexity has a clear superiority in less than 0.02.
The integer transform method of a kind of video coding of the present invention at coding side, by infra-frame prediction or inter prediction, obtains the prediction residual of piece and does the piece conversion, with concentration of energy in a few coefficient; Again by quantification, scanning, Run-Length Coding and entropy coding, with Image Data Compression and write encoding code stream; Then from code stream, extract the block conversion coefficient of entropy coding in decoding end,, recover the prediction residual of piece,, recover the video data of piece in conjunction with information of forecasting by inverse quantization and inverse transformation; It is characterized in that:
(1) 8 take advantage of transformation matrix P that 8 integer transforms adopt to obtain in the video coding by the integer transform matrix system of selection of aforementioned video coding, as shown in the formula:
Corresponding integer translation base is (5,6,4,1);
(2) direct transform takes advantage of 8 image residual error data piece to do integer transform, shape such as Y=PXP to 8
T, the elementary cell of conversion is 8 one-dimensional transforms of shape such as y=Px, x=[x0 wherein, x1, x2, x3, x4, x5, x6, x7]
T, the y=[y0 of output, y1, y2, y3, y4, y5, y6, y7]
T, computational process is as follows:
A.a0=x0-x7,a1=x1-x6,a2=x2-x5,a3=x3-x4,a4=x0+x7,a5=x1+x6,a6=x2+x5,a7=x3+x4;
B.b0=a4+a7,b1=a5+a6,b2=a4-a7,b3=a5-a6;
C.y0=b0+b1,y4=b0-b1,y2=b2<<1+b3,y6=b2-b3<<1;
Finish the calculation procedure of suitable calculating following formula again:
D.c0=a0<<2+a0+a3;c1=a2-a1-a1<<2;c2=a1+a2+a2<<2;c3=a3<<2+a3-a0;
E.y1=c0-c1+c2;y3=c0-c2-c3;y5=c0+c1+c3;y7=c1+c2-c3;
(3) inverse transformation makes that the one-dimensional transform elementary cell is x=P
TY.Y=[y0 wherein, y1, y2, y3, y4, y5, y6, y7]
T, x=[x0, x1, x2, x3, x4, x5, x6, x7]
T. the one dimension contravariant is changed to:
A.m0=y0+y4;m1=y0-y4;m2=y2<<1+y6;m3=y2-y6<<1;
B.b0=m0+m2;b1=m1+m3;b2=m1-m3;b3=m0-m2;
C. calculate 4 of following formula and take advantage of 4 matrix multiplications:
4 of computational process and direct transform takes advantage of 4 matrix multiplications identical, just the exchange of inputoutput data vector;
D.x0=a0+b0;x1=a1+b1;x2=a2+b2;x3=a3+b3;
x7=-a0+b0;x6=-a1+b1;x5=-a2+b2;x4=-a3+b3;
Wherein "<<" expression is to the left dislocation computing, its priority is higher than addition and subtraction, " a<<b " represent that a is to left dislocation b position.
The integer transform method of another video coding of the present invention at coding side, by infra-frame prediction or inter prediction, obtains the prediction residual of piece and does the piece conversion, with concentration of energy in a few coefficient; Again by quantification, scanning, Run-Length Coding and entropy coding, with Image Data Compression and write encoding code stream; Then from code stream, extract the block conversion coefficient of entropy coding in decoding end,, recover the prediction residual of piece,, recover the video data of piece in conjunction with information of forecasting by inverse quantization and inverse transformation; It is characterized in that:
(1) 8 take advantage of transformation matrix P that 8 integer transforms adopt to obtain in the video coding by the integer transform matrix system of selection of aforementioned video coding, also can as shown in the formula:
Corresponding integer translation base is (4,5,3,1);
(2) direct transform takes advantage of 8 image residual error data piece to do integer transform, shape such as Y=PXP to 8
T, the elementary cell of conversion is 8 one-dimensional transforms of shape such as y=Px, x=[x0 wherein, x1, x2, x3, x4, x5, x6, x7]
T, the y=[y0 of output, y1, y2, y3, y4, y5, y6, y7]
TComputational process is as follows:
A.a0=x0-x7,a1=x1-x6,a2=x2-x5,a3=x3-x4,a4=x0+x7,a5=x1+x6,a6=x2+x5,
a7=x3+x4;
B.b0=a4+a7,b1=a5+a6,b2=a4-a7,b3=a5-a6;
C.y0=b0+b1,y4=b0-b1,y2=b2<<1+b3,y6=b2-b3<<1;
Finish the calculation procedure of suitable calculating following formula again:
D.c0=a0<<2+a3;c1=a2-a1<<2;c2=a1+a2<<2;c3=a3<<2-a0;
E.y1=c0-c1+c2;y3=c0-c2-c3;y5=c0+c1+c3;y7=c1+c2-c3;
(3) inverse transformation:
Make that the one-dimensional transform elementary cell is x=P
TY.Y=[y0 wherein, y1, y2, y3, y4, y5, y6, y7]
T,
X=[x0, x1, x2, x3, x4, x5, x6, x7]
T. the one dimension contravariant is changed to:
A.m0=y0+y4;m1=y0-y4;m2=y2<<1+y6;m3=y2-y6<<1;
B.b0=m0+m2;b1=m1+m3;b2=m1-m3;b3=m0-m2;
C. calculate 4 of following formula and take advantage of 4 matrix multiplications:
4 of the computational process of fast algorithm and direct transform takes advantage of 4 matrix multiplications identical, just the exchange of inputoutput data vector;
D.x0=a0+b0;x1=a1+b1;x2=a2+b2;x3=a3+b3;
x7=-a0+b0;x6=-a1+b1;x5=-a2+b2;x4=-a3+b3;
Wherein "<<" expression is to the left dislocation computing, its priority is higher than addition and subtraction, " a<<b " represent that a is to left dislocation b position.
The present invention proposes a kind of integrated evaluating method of integer translation base performance, and filter out the transform-based of several groups of better performances, and provide the fast transform approach of two groups of transform-based according to this method.The ABT 8 that the performance of the bright preferred several groups of transform-based of measured result proof this law of high definition video cycle tests is better than JVT takes advantage of 8 conversion, and wherein the property of (10,9,6,2) is best, the computation complexity of (4,5,3,1) is minimum, the performance of (5,6,4,1) falls between.Take advantage of 8 conversion with respect to ABT 8, these 3 groups of bases all have certain advantage aspect property and the computation complexity etc. two.The measured performance of selected transform-based has also been verified the accuracy and the feasibility of transform-based system of selection of the present invention, this method not only is applicable to the integer transform matrix, also can be used for Performance Evaluation, have very strong directive significance for the selection of transformation matrix to all kinds of transformation matrixs.
Figure of description
Fig. 1: transform-based is estimated FB(flow block).
Fig. 2: transform-based (5,6,4,1) direct transform fast algorithm.
Fig. 3: transform-based (5,6,4,1) inverse transformation fast algorithm.
Fig. 4: transform-based (4,5,3,1) direct transform fast algorithm.
Fig. 5: transform-based (4,5,3,1) inverse transformation fast algorithm.
Embodiment
(1) selection of transform-based
The evaluation flow process of transform-based as shown in Figure 1.
The cross-correlation coefficient ρ of all kinds of image residual error data mainly is distributed between 0.75~0.95, is the energy compaction efficiency η of each transform-based correspondence of 0.75,0.8,0.85,0.9,0.95 five by calculating ρ
E, again with each basic η under the same ρ
ENormalization.To same transform-based η under different cross-correlation coefficient ρ
ENormalization is weighted sum as a result, obtains the comprehensive evaluation value Eval that this organizes basic energy compaction efficiency
E, wherein weight is the probability size decision according to different ρ.The weight of getting 5 ρ correspondences among the present invention is followed successively by 1/15,2/15,3/15,4/15,5/15.In like manner obtain the decorrelation efficiency η of transform-based
cComprehensive evaluation result Eval
C
Pass through Eval at last
EAnd Eval
CWeighted sum obtains the comprehensive evaluation value Eval of transform-based energy compaction efficiency and decorrelation efficiency.Consider the compression performance after energy compaction efficiency directly influences conversion, weight is bigger, and the weight of definition energy compaction efficiency and decorrelation efficiency evaluation of estimate is respectively among the present invention: 0.6,0.4.
When comprehensive evaluation value Eval near the time, the performance of the transform-based that computation complexity is low is better.
The span of having listed transform-based in the following table is k1, k2, k3 ∈ [1,10], during k4 ∈ [1,4], and the η of 5 groups of bases
EAnd η
cComprehensive evaluation value, finish needed addition of one dimension 8 point transformation and displacement number of times (the calculation number of times far away of positive inverse transformation is identical).
K1, k2, k3, η
EAnd η
cComprehensively
The addition number of times+/-the displacement number of times<<
The k4 evaluation of estimate
10,9,6,2 0.9859 36 10
5,6,4,1 0.8579 32 6
6,6,3,2 0.8441 36 10
6,7,5,1 0.8409 32 10
4,5,3,1 0.8249 28 6
(10,9,6,2) and (6,6,3,2) are suggested in relevant document.The decorrelation efficiency of (5,6,4,1) and the comprehensive evaluation value of energy compaction efficiency are only second to (10,9,6,2), and computation complexity is less, and the comprehensive evaluation value of (4,5,3,1) is a little less than (6,6,3,2), but the computation complexity advantage is the most obvious.Practical video sequential test result shows that the distortion performance of (5,6,4,1), (4,5,3,1), (6,7,5,1) is better than (6,6,3,2), and is very approaching with (10,9,6,2).The realization of (two) 8 * 8 integer transform fast algorithms
X0, x1, x2, x3, x4, x5, x6, x7 represent input eight point value of the one dimension direct transform of integer transform among Fig. 2-Fig. 5, are 8 output numerical value of inverse transformation simultaneously; Y0, y1, y2, y3, y4, y5, y6, y7 are 8 output numerical value of direct transform, are 8 output numerical value of inverse transformation simultaneously.The data processing direction from left to right, two lines that intersect at a round dot are represented two data additions, three data additions of three-way expression.Square expression data are taken advantage of a coefficient, and wherein negative value is got in "-" expression, and " 2 " expression takes advantage of 2, promptly moves to left one, and " 4 " expression takes advantage of 4, promptly moves to left two.
1. direct transform
Image residual error data piece to 8x8 is done integer transform, and the elementary cell of conversion is 8 one-dimensional transforms of shape such as y=Px, establishes x=[x0, x1, x2, x3, x4, x5, x6, x7]
T, the y=[y0 of output, y1, y2, y3, y4, y5, y6, y7]
TComputational process is as follows:
When at first calculating is done conversion with each different transformation matrix P, identical calculation procedure:
(1)a0=x0-x7,a1=x1-x6,a2=x2-x5,a3=x3-x4,a4=x0+x7,a5=x1+x6,a6=x2+x5,a7=x3+x4;
(2)b0=a4+a7,b1=a5+a6,b2=a4-a7,b3=a5-a6;
(3)y0=b0+b1,y4=b0-b1,y2=b2<<1+b3,y6=b2-b3<<1;
Same section calculates needs addition and subtraction 16 times, displacement 2 times.
Calculate different calculation procedures again, this part is equivalent to calculate following formula:
For transform-based (5,6,4,1) calculation procedure be:
(1)c0=a0<<2+a0+a3;c1=a2-a1-a1<<2;c2=a1+a2+a2<<2;c3=a3<<2+a3-a0;
(2)y1=c0-c1+c2;y3=c0-c2-c3;y5=c0+c1+c3;y7=c1+c2-c3;
Need 16 addition and subtractions and 4 displacements altogether.
For transform-based (4,5,3,1) calculation procedure be:
(1)c0=a0<<2+a3;c1=a2-a1<<2;c2=a1+a2<<2;c3=a3<<2-a0;
(2)y1=c0-c1+c2;y3=c0-c2-c3;y5=c0+c1+c3;y7=c1+c2-c3;
Need 12 addition and subtractions and 4 displacements altogether.
Therefore finish a y=Px computing for transform-based (5,6,4,1), shared 32 addition and subtractions and 6 displacements; Shared 28 addition and subtractions of transform-based (4,5,3,1) and 6 displacements.Finish operand of integer transform of 8 * 8 and be the said units amount of calculation 16 times.The fast algorithm of (5,6,4,1) direct transform as shown in Figure 2, the fast algorithm of (4,5,3,1) direct transform is as shown in Figure 4.
2. inverse transformation
Make that the one-dimensional transform elementary cell is x=P
TY.
Y=[y0 wherein, y1, y2, y3, y4, y5, y6, y7]
T, x=[x0, x1, x2, x3, x4, x5, x6, x7]
T. following process is an x=P
TThe y computing.
(1)m0=y0+y4;m1=y0-y4;m2=y2<<1+y6;m3=y2-y6<<1;
(2)b0=m0+m2;b1=m1+m3;b2=m1-m3;b3=m0-m2;
(3) calculate 4 of following formula and take advantage of 4 matrix multiplications:
4 * 4 matrix multiplications are identical in calculating formula and the direct transform, and algorithm is the same, just the exchange of inputoutput data vector.Amount of calculation is identical.Need 16 addition and subtractions and 4 displacements for transform-based (5,6,4,1);
Need 12 addition and subtractions and 4 displacements for transform-based (4,5,3,1).
(4)x0=a0+b0;x1=a1+b1;x2=a2+b2;x3=a3+b3;
x7=-a0+b0;x6=-a1+b1;x5=-a2+b2;x4=-a3+b3;
Wherein "<<" expression is to the left dislocation computing, its priority is higher than addition and subtraction, " a<<b " expression a is to left dislocation b position, and the public part amount of calculation of different transform-based is: addition and subtraction 16 times, displacement 2 times.Therefore finish x=P one time for transform-based (5,6,4,1)
TThe y computing, shared 32 addition and subtractions and 6 displacements; Shared 28 addition and subtractions of transform-based (4,5,3,1) and 6 displacements.The fast algorithm of (5,6,4,1) inverse transformation as shown in Figure 3, the fast algorithm of (4,5,3,1) inverse transformation is as shown in Figure 5.The operand of finishing the integer transform inverse transformation of taking advantage of 8 for one time 8 is 16 times of said units amount of calculation.
Claims (4)
1. the integer transform matrix system of selection of a video coding in turn includes the following steps:
(1) at first all integer translation bases of orthogonality condition are satisfied in search within the specific limits, take advantage of 8 integer transform matrix P for 8, the definition transform-based be (k1, k2, k3, k4),
The span of transform-based coefficient k 1, k2, k3, k4 is k1, k2, k3 ∈ [1,10], and k4 ∈ [1,4] obtains all and satisfies PP
TThe integer orthogonal transform base of=Diag, wherein Diag is a diagonal matrix;
(2) set up the covariance matrix COV (X that the input picture residual error data was got at cross-correlation coefficient ρ at 0.75,0.8,0.85,0.9,0.95 o'clock
v),
If length is 8 image residual error data one-dimensional vector is X
V=[x
1, x
2... x
8], set up X by the single order Markov model
VCovariance matrix COV (the X of element
v), COV (X
v)
(i, j)=ρ
| i-j|(0≤i, j≤7), wherein ρ is X
VThe cross-correlation coefficient of adjacent element (ρ≤1);
(3), obtain the covariance matrix COV (Y of transform domain by the transformation matrix P of transform-based correspondence
v),
Transform-based (k1, k2, k3, k4) Dui Ying transformation matrix P, normalization, promptly each row element of P obtains orthogonal matrix P divided by the length of this row vector
uTo X
VMake orthogonal transform Y
V=P
uX
V, Y
VCovariance matrix be:
(4) respectively organize the energy compaction efficiency η that transform-based was got at cross-correlation coefficient ρ at 0.75,0.8,0.85,0.9,0.95 o'clock by (2), (3) calculating
EValue and decorrelation efficiency η
CValue,
Definition energy compaction efficiency η
EFor:
Decorrelation efficiency η
cFor:
(5) calculate the energy compaction efficiency η of each transform-based under the cross-correlation coefficient ρ of appointment
EValue and decorrelation efficiency η
CNormalized result, under the same ρ, i transform-based energy compaction efficiency η
EThe normalization result be:
I transform-based decorrelation efficiency η
CThe normalization result be:
(6), obtain every group of base energy compaction efficiency η under each cross-correlation coefficient ρ by weighted sum
EValue, decorrelation efficiency η
CComprehensive evaluation result Eval
E,,, Eval
C, the weight of 5 ρ correspondences is respectively 1/15,2/15,3/15,4/15,5/15;
(7) pass through Eval
CAnd Eval
EThese two index weighted sums obtain the comprehensive evaluation value Eval of transform-based performance, Eval
CAnd Eval
ECorresponding weight is respectively 0.4,0.6.
2. the integer transform matrix system of selection of video coding as claimed in claim 1, after it is characterized in that obtaining the comprehensive evaluation value Eval of transform-based performance, increase transform-based (k1, k2, k3, the evaluation procedure of computation complexity k4): at first select the higher transform-based of comprehensive evaluation value Eval; If it is addition and subtraction and the few transform-based of displacement number of times that the Eval gap, pays the utmost attention in the application that real-time is had relatively high expectations that computation complexity has a clear superiority in less than 0.02.
3. the integer transform method of a video coding at coding side, by infra-frame prediction or inter prediction, obtains the prediction residual of piece and does the piece conversion, with concentration of energy in a few coefficient; Again by quantification, scanning, Run-Length Coding and entropy coding, with Image Data Compression and write encoding code stream; Then from code stream, extract the block conversion coefficient of entropy coding in decoding end,, recover the prediction residual of piece,, recover the video data of piece in conjunction with information of forecasting by inverse quantization and inverse transformation; It is characterized in that:
(1) 8 take advantage of transformation matrix P that 8 integer transforms adopt to obtain in the video coding by the integer transform matrix system of selection of claim 1 or 2 described video codings, as shown in the formula:
Corresponding integer translation base is (5,6,4,1);
(2) direct transform takes advantage of 8 image residual error data piece to do integer transform, shape such as Y=PXP to 8
T, the elementary cell of conversion is 8 one-dimensional transforms of shape such as y=Px, x=[x0 wherein, x1, x2, x3, x4, x5, x6, x7]
T, the y=[y0 of output, y1, y2, y3, y4, y5, y6, y7]
T, computational process is as follows:
A.a0=x0-x7,a1=x1-x6,a2=x2-x5,a3=x3-x4,a4=x0+x7,a5=x1+x6,a6=x2+x5,a7=x3+x4;
B.b0=a4+a7,b1=a5+a6,b2=a4-a7,b3=a5-a6;
C.y0=b0+b1,y4=b0-b1,y2=b2<<1+b3,y6=b2-b3<<1;
Finish the calculation procedure of suitable calculating following formula again:
D.c0=a0<<2+a0+a3;c1=a2-a1-a1<<2;c2=a1+a2+a2<<2;c3=a3<<2+a3-a0;
E.y1=c0-c1+c2;y3=c0-c2-c3;y5=c0+c1+c3;y7=c1+c2-c3;
(3) inverse transformation:
Make that the one-dimensional transform elementary cell is x=Py, y=[y0 wherein, y1, y2, y3, y4, y5, y6, y7]
T, x=[x0, x1, x2, x3, x4, x5, x6, x7]
TThe one dimension contravariant is changed to:
A.m0=y0+y4;m1=y0-y4;m2=y2<<1+y6;m3=y2-y6<<1;
B.b0=m0+m2;b1=m1+m3;b2=m1-m3;b3=m0-m2;
C. calculate 4 of following formula and take advantage of 4 matrix multiplications:
4 of computational process and direct transform takes advantage of 4 matrix multiplications identical, just the exchange of inputoutput data vector;
D.x0=a0+b0;x1=a1+b1;x2=a2+b2;x3=a3+b3;
x7=-a0+b0;x6=-a1+b1;x5=-a2+b2;x4=-a3+b3;
Wherein "<<" expression is to the left dislocation computing, its priority is higher than addition and subtraction, as " a<<b " represent that a is to left dislocation b position.
4. the integer transform method of a video coding at coding side, by infra-frame prediction or inter prediction, obtains the prediction residual of piece and does the piece conversion, with concentration of energy in a few coefficient; Again by quantification, scanning, Run-Length Coding and entropy coding, with Image Data Compression and write encoding code stream; Then from code stream, extract the block conversion coefficient of entropy coding in decoding end,, recover the prediction residual of piece,, recover the video data of piece in conjunction with information of forecasting by inverse quantization and inverse transformation; It is characterized in that:
(1) 8 take advantage of transformation matrix P that 8 integer transforms adopt to obtain in the video coding by the integer transform matrix system of selection of claim 1 or 2 described video codings, as shown in the formula:
Corresponding integer translation base is (4,5,3,1);
(2) direct transform takes advantage of 8 image residual error data piece to do integer transform, shape such as Y=PXP to 8
T, the elementary cell of conversion is 8 one-dimensional transforms of shape such as y=Px, x=[x0 wherein, x1, x2, x3, x4, x5, x6, x7]
T, the y=[y0 of output, y1, y2, y3, y4, y5, y6, y7]
T, computational process is as follows:
A.a0=x0-x7,a1=x1-x6,a2=x2-x5,a3=x3-x4,a4=x0+x7,a5=x1+x6,a6=x2+x5,a7=x3+x4;
B.b0=a4+a7,b1=a5+a6,b2=a4-a7,b3=a5-a6;
C.y0=b0+b1, y4=b0-b1, y2=b2<<1+b3, y6=b2-b3<<1; Finish the calculation procedure of suitable calculating following formula again:
D.c0=a0<<2+a3;c1=a2-a1<<2;c2=a1+a2<<2;c3=a3<<2-a0;
E.y1=c0-c1+c2;y3=c0-c2-c3;y5=c0+c1+c3;y7=c1+c2-c3;
(3) inverse transformation:
Make that the one-dimensional transform elementary cell is x=Py, y=[y0 wherein, y1, y2, y3, y4, y5, y6, y7]
T, x=[x0, x1, x2, x3, x4, x5, x6, x7]
T, the one dimension contravariant is changed to:
A.m0=y0+y4;m1=y0-y4;m2=y2<<1+y6;m3=y2-y6<<1;
B.b0=m0+m2;b1=m1+m3;b2=m1-m3;b3=m0-m2;
C. calculate 4 of following formula and take advantage of 4 matrix multiplications:
4 of the computational process of fast algorithm and direct transform takes advantage of 4 matrix multiplications identical, just the exchange of inputoutput data vector.
D.x0=a0+b0;x1=a1+b1;x2=a2+b2;x3=a3+b3;
x7=-a0+b0;x6=-a1+b1;x5=-a2+b2;x4=-a3+b3;
Wherein "<<" expression is to the left dislocation computing, its priority is higher than addition and subtraction, as " a<<b " represent that a is to left dislocation b position.
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2004100128571A CN100433837C (en) | 2004-03-18 | 2004-03-18 | Integral conversing matrix selection method of video coding and related integral conversion method |
KR1020050018437A KR100636225B1 (en) | 2004-03-18 | 2005-03-05 | Integer transform matrix selection method in video coding and related integer transform method |
US11/081,829 US20050213835A1 (en) | 2004-03-18 | 2005-03-17 | Integer transform matrix selection method in video coding and related integer transform method |
JP2005080456A JP2005269657A (en) | 2004-03-18 | 2005-03-18 | Integral transform matrix selection method in video coding and related integral transform method |
EP20050251662 EP1577790A2 (en) | 2004-03-18 | 2005-03-18 | Integer transform matrix selection method in video coding and related integer transform matrix |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2004100128571A CN100433837C (en) | 2004-03-18 | 2004-03-18 | Integral conversing matrix selection method of video coding and related integral conversion method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1564602A true CN1564602A (en) | 2005-01-12 |
CN100433837C CN100433837C (en) | 2008-11-12 |
Family
ID=34478014
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100128571A Expired - Fee Related CN100433837C (en) | 2004-03-18 | 2004-03-18 | Integral conversing matrix selection method of video coding and related integral conversion method |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR100636225B1 (en) |
CN (1) | CN100433837C (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009127135A1 (en) * | 2008-04-15 | 2009-10-22 | The Chinese University Of Hong Kong | Method and apparatus for deriving an order-16 integer transform |
WO2009127134A1 (en) * | 2008-04-15 | 2009-10-22 | The Chinese University Of Hong Kong | Generation of an order-2n transform from an order-n transform |
CN100563337C (en) * | 2007-10-24 | 2009-11-25 | 上海广电(集团)有限公司中央研究院 | Integer transform based on AVS quantizes and the integral inverse transform quantification method |
CN101931817A (en) * | 2010-04-12 | 2010-12-29 | 中山大学 | Transform coding method based on transform base self-adaption |
WO2011047579A1 (en) * | 2009-10-23 | 2011-04-28 | 华为技术有限公司 | Method and device for encoding and decoding video |
CN102067606A (en) * | 2007-09-26 | 2011-05-18 | 高通股份有限公司 | Efficient transformation techniques for video coding |
CN101529917B (en) * | 2006-10-23 | 2011-08-31 | 高通股份有限公司 | Signalling of maximum dynamic range of inverse discrete cosine transform |
WO2011124163A1 (en) * | 2010-04-09 | 2011-10-13 | 华为技术有限公司 | Method and device for encoding or decoding video data, method and device for transform processing |
CN102281445A (en) * | 2011-04-20 | 2011-12-14 | 深圳市融创天下科技股份有限公司 | Video coding 8*8 integer transformation method |
CN102333217A (en) * | 2011-04-20 | 2012-01-25 | 深圳市融创天下科技股份有限公司 | Video coding 4*4 integer transform method |
CN102413331A (en) * | 2011-12-01 | 2012-04-11 | 上海大学 | Coding method for images or videos |
CN102547285A (en) * | 2012-01-09 | 2012-07-04 | 上海大学 | Hexagonal transform quick-processing method for image or video coding |
CN103250412A (en) * | 2010-02-02 | 2013-08-14 | 数码士有限公司 | Image encoding/decoding method for rate-istortion optimization and apparatus for performing same |
CN103561265A (en) * | 2013-11-15 | 2014-02-05 | 武汉大学 | Method for integer DST conversion base selection for video coding |
CN103748885A (en) * | 2011-06-18 | 2014-04-23 | 三星电子株式会社 | Method and device for the transformation and method and device for the reverse transformation of images |
CN103782600A (en) * | 2011-09-02 | 2014-05-07 | 数码士有限公司 | Image encoding/decoding method for rate-distortion optimization and device for performing same |
CN110856000A (en) * | 2019-10-22 | 2020-02-28 | 深圳市华星光电技术有限公司 | Image decompression method and device |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102014475B (en) | 2010-01-08 | 2012-01-04 | 华为技术有限公司 | Resource mapping and code division multiplexing method and device |
CN102571255B (en) * | 2010-01-08 | 2014-08-20 | 华为技术有限公司 | Resource mapping and code division multiplexing method and device |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2914226B2 (en) * | 1995-06-16 | 1999-06-28 | 日本電気株式会社 | Transformation encoding of digital signal enabling reversible transformation |
AU6118700A (en) * | 1999-07-23 | 2001-02-13 | Trustees Of Boston University | Integer discrete cosine transform using integer operations |
CN1242621C (en) * | 2000-08-12 | 2006-02-15 | 罗伯特-博希股份公司 | Method for carrying out integer approximation of transform coefficients, and coder and decoder |
US6990506B2 (en) * | 2000-12-13 | 2006-01-24 | Sharp Laboratories Of America, Inc. | Integer cosine transform matrix for picture coding |
US7185037B2 (en) * | 2001-08-23 | 2007-02-27 | Texas Instruments Incorporated | Video block transform |
US7007055B2 (en) | 2002-03-12 | 2006-02-28 | Intel Corporation | Method of performing NxM Discrete Cosine Transform |
CN1209926C (en) * | 2003-03-26 | 2005-07-06 | 浙江大学 | Method and device of integer transform applicable to image coding and video frequency coding |
-
2004
- 2004-03-18 CN CNB2004100128571A patent/CN100433837C/en not_active Expired - Fee Related
-
2005
- 2005-03-05 KR KR1020050018437A patent/KR100636225B1/en not_active IP Right Cessation
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101529917B (en) * | 2006-10-23 | 2011-08-31 | 高通股份有限公司 | Signalling of maximum dynamic range of inverse discrete cosine transform |
US8654833B2 (en) | 2007-09-26 | 2014-02-18 | Qualcomm Incorporated | Efficient transformation techniques for video coding |
CN102067606A (en) * | 2007-09-26 | 2011-05-18 | 高通股份有限公司 | Efficient transformation techniques for video coding |
CN102067606B (en) * | 2007-09-26 | 2014-09-17 | 高通股份有限公司 | Efficient transformation techniques for video coding |
CN100563337C (en) * | 2007-10-24 | 2009-11-25 | 上海广电(集团)有限公司中央研究院 | Integer transform based on AVS quantizes and the integral inverse transform quantification method |
CN101990763B (en) * | 2008-04-15 | 2013-01-02 | 香港中文大学 | Generation of an order-2n transform from an order-n transform |
CN101990763A (en) * | 2008-04-15 | 2011-03-23 | 香港中文大学 | Generation of an order-2n transform from an order-n transform |
WO2009127134A1 (en) * | 2008-04-15 | 2009-10-22 | The Chinese University Of Hong Kong | Generation of an order-2n transform from an order-n transform |
WO2009127135A1 (en) * | 2008-04-15 | 2009-10-22 | The Chinese University Of Hong Kong | Method and apparatus for deriving an order-16 integer transform |
WO2011047579A1 (en) * | 2009-10-23 | 2011-04-28 | 华为技术有限公司 | Method and device for encoding and decoding video |
US9723313B2 (en) | 2009-10-23 | 2017-08-01 | Huawei Technologies Co., Ltd. | Method and device for encoding and decoding videos using a best transformation matrix among multiple transformation matrices |
CN102045560B (en) * | 2009-10-23 | 2013-08-07 | 华为技术有限公司 | Video encoding and decoding method and video encoding and decoding equipment |
CN102045560A (en) * | 2009-10-23 | 2011-05-04 | 华为技术有限公司 | Video encoding and decoding method and video encoding and decoding equipment |
CN103250412A (en) * | 2010-02-02 | 2013-08-14 | 数码士有限公司 | Image encoding/decoding method for rate-istortion optimization and apparatus for performing same |
WO2011124163A1 (en) * | 2010-04-09 | 2011-10-13 | 华为技术有限公司 | Method and device for encoding or decoding video data, method and device for transform processing |
CN101931817A (en) * | 2010-04-12 | 2010-12-29 | 中山大学 | Transform coding method based on transform base self-adaption |
WO2012142777A1 (en) * | 2011-04-20 | 2012-10-26 | 深圳市融创天下科技股份有限公司 | 8×8 integer transform method in video coding |
CN102333217A (en) * | 2011-04-20 | 2012-01-25 | 深圳市融创天下科技股份有限公司 | Video coding 4*4 integer transform method |
CN102333217B (en) * | 2011-04-20 | 2017-04-12 | 深圳市云宙多媒体技术有限公司 | Video coding 4*4 integer transform method |
CN102281445A (en) * | 2011-04-20 | 2011-12-14 | 深圳市融创天下科技股份有限公司 | Video coding 8*8 integer transformation method |
CN103748885A (en) * | 2011-06-18 | 2014-04-23 | 三星电子株式会社 | Method and device for the transformation and method and device for the reverse transformation of images |
CN103782600A (en) * | 2011-09-02 | 2014-05-07 | 数码士有限公司 | Image encoding/decoding method for rate-distortion optimization and device for performing same |
CN102413331A (en) * | 2011-12-01 | 2012-04-11 | 上海大学 | Coding method for images or videos |
CN102547285A (en) * | 2012-01-09 | 2012-07-04 | 上海大学 | Hexagonal transform quick-processing method for image or video coding |
CN102547285B (en) * | 2012-01-09 | 2014-05-14 | 上海大学 | Hexagonal transform quick-processing method for image or video coding |
CN103561265A (en) * | 2013-11-15 | 2014-02-05 | 武汉大学 | Method for integer DST conversion base selection for video coding |
CN110856000A (en) * | 2019-10-22 | 2020-02-28 | 深圳市华星光电技术有限公司 | Image decompression method and device |
CN110856000B (en) * | 2019-10-22 | 2020-10-27 | 深圳市华星光电技术有限公司 | Image decompression method and device |
US11436759B2 (en) | 2019-10-22 | 2022-09-06 | Tcl China Star Optoelectronics Technology Co., Ltd. | Image decompression method, device and display terminal |
Also Published As
Publication number | Publication date |
---|---|
KR20060043435A (en) | 2006-05-15 |
CN100433837C (en) | 2008-11-12 |
KR100636225B1 (en) | 2006-10-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1564602A (en) | Integral conversing matrix selection method of video coding and related integral conversion method | |
CN1764277A (en) | Improved block transform and quantization for image and video coding | |
CN1791222A (en) | Reversible transform for lossy and lossless 2-D data compression | |
CN1280709C (en) | Parameterization for fading compensation | |
CN1697328A (en) | Fast video codec transform implementations | |
JP2005269657A (en) | Integral transform matrix selection method in video coding and related integral transform method | |
CN1575546A (en) | Implementation of a transform and of a subsequent quantization | |
CN1461527A (en) | Signal encoding method and apparatus and decording method and apparatus | |
CN1805548A (en) | Reversible 2-dimensional pre-/post-filtering for lapped biorthogonal transform | |
CN101047849A (en) | Discrete cosine inverse transformation method and its device | |
CN1455599A (en) | 2-D transformation of image and video-frequency coding | |
CN1286322C (en) | Quantizing device for low complicated degree integer 4x4 discrete cosine transform and its realizing method | |
CN1190084C (en) | Apparatus and method for compressing video information | |
CN1245028C (en) | Non-uniform multilayer hexaploid lattice full pixel kinematic search method | |
CN1552042A (en) | Method and device for calculating similarity | |
CN1216495C (en) | Video image sub-picture-element interpolation method and device | |
CN1284378C (en) | Moving estimation quick semi pixel grade searching method based on mathematical model prediction | |
CN1179570C (en) | Coding device for directional interpolator node and its method | |
CN1165176C (en) | Method based on correcting predictive direction/statistical prejudgement for fast estimation of sub-pixel motion | |
CN1399766A (en) | Fast and efficient computation of cubic-spline interpolation for data compression | |
CN1816144A (en) | 2-D discrete cosine conversion device and method | |
CN1209926C (en) | Method and device of integer transform applicable to image coding and video frequency coding | |
CN1471674A (en) | Faster transforms using scaled terms, early aborts, and precision refinements | |
CN1324533C (en) | Decoding method for encoding image data | |
CN1222171C (en) | Image transformation apparatus and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20081112 Termination date: 20140318 |