CN102137260A - Video coding 4*4 integral number transformation method - Google Patents

Video coding 4*4 integral number transformation method Download PDF

Info

Publication number
CN102137260A
CN102137260A CN2011101014046A CN201110101404A CN102137260A CN 102137260 A CN102137260 A CN 102137260A CN 2011101014046 A CN2011101014046 A CN 2011101014046A CN 201110101404 A CN201110101404 A CN 201110101404A CN 102137260 A CN102137260 A CN 102137260A
Authority
CN
China
Prior art keywords
integer
transformation
transform
matrix
video coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011101014046A
Other languages
Chinese (zh)
Inventor
徐苏珊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHENZHEN RONGCHUANG TIANXIA TECHNOLOGY DEVELOPMENT Co Ltd
Shenzhen Temobi Science and Technology Co Ltd
Original Assignee
SHENZHEN RONGCHUANG TIANXIA TECHNOLOGY DEVELOPMENT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHENZHEN RONGCHUANG TIANXIA TECHNOLOGY DEVELOPMENT Co Ltd filed Critical SHENZHEN RONGCHUANG TIANXIA TECHNOLOGY DEVELOPMENT Co Ltd
Priority to CN2011101014046A priority Critical patent/CN102137260A/en
Priority to PCT/CN2011/073582 priority patent/WO2012142776A1/en
Publication of CN102137260A publication Critical patent/CN102137260A/en
Priority to CN201110278320.XA priority patent/CN102333217B/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/12Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
    • H04N19/122Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock

Abstract

The invention discloses a video coding 4*4 integral number transformation method, comprising the following steps of: receiving residual data X of a 4*4 image output by a predication module in a coder, and selecting different integral number transform bases (selecting one from (1, 9, 4), (1, 7, 3) and (1, 8, 3)) to construct a transformation matrix P; according to the transformation matrix P, constructing a scaling matrix PF (PF=ZT*Z); carrying out forward transformation (Y=PXPT) on the image residual data X, and adopting a corresponding integral number transformation computing method according to different integral number transformation bases; and scaling the data Y obtained by 4*4 forward transformation. The transformation method disclosed by the invention has excellent energy concentration performance, the energy concentration ratio is close to 4*4DCT (discrete cosine transformation), and the correlation of video data in a transformation space can be eliminated; the computational complexity is low, the transformation method can be realized by addition and shifting and is easy to realize by hardware, the operational precision is high, and the defect that the forward transformation and a backward transformation are not matched can be avoided.

Description

A kind of video coding 4x4 integer transform method
Technical field
The present invention relates to technical field of video coding, relate in particular to a kind of video coding 4x4 integer transform method.
Background technology
The complete video coding system of one cover is made up of series of algorithms modules such as prediction, conversion, quantification, entropy codings.Wherein inter prediction is to utilize the correlation packed data of video data on room and time in the frame.Conversion module be image residual error data with prediction module output by spatial transform to another space, make data concentration of energy on transformation space of flat distribution on the time domain, the energy major part of data is concentrated in the low frequency region in space.In quantification and entropy coding module subsequently, can effectively utilize the data distribution characteristics after the conversion, further packed data.Obviously, conversion module is a part important in the video coding system, and the performance of conversion directly influences the performance of video coding system.
In existing video coding system, dct transform has obtained using widely, and the concentration of energy of its conversion is fine, can greatly eliminate the correlation of video data on transformation space.But this algorithm exists not enough, and the dct transform matrix is a floating point representation, uses a large amount of floating-point multiplication and addition in the computational process, and it is many to take hardware resource, calculation of complex big.Because the processor calculating figure place influences, and there is truncated error in floating-point operation, and the Floating-point Computation precision is not high, and the positive inverse transformation of DCT can not be mated fully, conversion is irreversible simultaneously.
Summary of the invention
The purpose of the embodiment of the invention is to propose a kind of video coding 4x4 integer transform method, be intended to solve prior art dct transform matrix and adopt floating-point multiplication and addition, it is many to take hardware resource, calculation of complex big, computational accuracy is not high, the positive inverse transformation of DCT can not be mated fully, the irreversible problem of conversion.
The method of the embodiment of the invention is achieved in that a kind of video coding 4x4 integer transform method, comprises
Step 1, the image residual error data X of the 4x4 of prediction module output in the received code device;
Step 2 is chosen integer translation base, makes up transformation matrix,
The general purpose integer transformation matrix of 4x4 is:
C = k 0 k 0 k 0 k 0 k 1 k 2 - k 2 - k 1 k 0 - k 0 - k 0 k 0 k 2 - k 1 k 1 - k 2
In the Matrix C, integer translation base is [k0k1k1];
Step 3 according to transformation matrix P, makes up scaled matrix PF,
PF=Z T*Z
Wherein, Z T=[1/||m1||1/||m2||1/||m3||1/||m4||]; Z is
Figure BSA00000479386100022
m iBe the row vector of transformation matrix P, the i value is 1-4; || m i|| be the vectorial m of row iVector norm;
Step 4, direct transform is carried out integer transform to the image residual error data X of 4x4, and transformation for mula is Y=PXP TWherein P is described transformation matrix; X is described image residual error data;
Step 5 is handled carrying out convergent-divergent through the data Y after the 4x4 direct transform,
Y ′ = Y ⊗ PF
Figure BSA00000479386100024
Be the point multiplication operation of matrix, Y is that PF is a scaled matrix through the data after the 4x4 direct transform, and Y ' is the output valve after this convergent-divergent is handled, and represents video data to transform to information on the frequency domain;
Described integer translation base is in the following numerical value:
[1,9,4],[1,7,3],[1,8,3]。
Wherein, the elementary cell of conversion is 4 one-dimensional transforms, the wherein x=[x1x2x3x4 of shape such as y=Px in the step 4] T, the y=[y1y2y3y4 of output] T, x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform;
When described integer translation base value is [1,9,4]:
Described transformation matrix P = 1 1 1 1 9 4 - 4 - 9 1 - 1 - 1 1 4 - 9 9 - 4
Z T=[0.5,0.071796,0.5,0.071796]
PF = 0.25 0.035898 0.25 0.035898 0.035898 0.0051546 0.035898 0.0051546 0.25 0.035898 0.25 0.035898 0.035898 0.0051546 0.035898 0.0051546
" the image residual error data X to 4x4 carries out integer transform, and transformation for mula is Y=PXP T" computational process is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;y2=b4<<3+b4+b3<<2;y4=-b3<<3-b3+b4<<2;
Wherein "<<" expression is to left shift operation, priority is higher than addition and subtraction, and elementary cell y=Px needs 10 additions and 4 displacements altogether.
When described integer translation base value is [1,7,3]:
Described transformation matrix P = 1 1 1 1 4 3 - 3 - 7 1 - 1 - 1 1 3 - 7 7 - 3
Z T=[0.5,0.0928480.5,0.092848]
PF = 0.25 0.046424 0.25 0.046424 0.046424 0.0086207 0.046424 0.0086207 0.25 0.046424 0.25 0.046424 0.046424 0.0086207 0.046424 0.0086207
" the image residual error data X to 4x4 carries out integer transform, and transformation for mula is Y=PXP T" computational process is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;Y2=b4<<3-b4+b3<<2-b3;Y4=-b3<<3+b3+b4<<2-b4;
Wherein "<<" expression is to left shift operation, priority is higher than addition and subtraction, and elementary cell y=Px needs 12 additions and 4 displacements altogether.
When described integer translation base value is [1,8,3]:
Described transformation matrix P = 1 1 1 1 8 3 - 3 - 8 1 - 1 - 1 1 3 - 8 8 - 3
Z T=[0.5,0.082761,0.5,0.082761]
Scaled matrix PF = 0.25 0.04138 0.25 0.04138 0.04138 0.0068493 0.04138 0.0068493 0.25 0.04138 0.25 0.04138 0.04138 0.0068493 0.04138 0.0068493
" the image residual error data X to 4x4 carries out integer transform, and transformation for mula is Y=PXP T" computational process is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;Y2=b4*8+4*b3-b3;Y4=-b3*8+4*b4-b4;
"<<" expression is to left shift operation, and priority is higher than addition and subtraction, and elementary cell y=Px needs 10 additions and 4 displacements altogether.
Beneficial effect of the present invention:
The integer transform matrix of three groups the 4x4 that the embodiment of the invention proposes, and provided the fast algorithm of conversion, the transform method concentration of energy performance of the embodiment of the invention is outstanding, and encircled energy can greatly be eliminated the correlation of video data on transformation space near 4x4DCT.Computation complexity is low, can realize by addition and displacement, and be easy to hardware and realize, and the operational precision height, do not deposit the unmatched problem of positive inverse transformation.
Description of drawings
Fig. 1 is 1 one kinds of video coding 4x4 of embodiment of the invention integer transform method flow chart;
Fig. 2 is when integer translation base corresponding integer transform computational methods schematic diagram when choosing [1,9,4] in the embodiment of the invention 1;
Fig. 3 is the computational methods figure of the computing module M1 in the integer transform computational methods schematic diagram among Fig. 2;
Fig. 4 is when integer translation base corresponding integer transform computational methods schematic diagram when choosing [1,7,3] in the embodiment of the invention 1;
Fig. 5 is the computational methods figure of the computing module M2 in the integer transform computational methods schematic diagram among Fig. 4;
Fig. 6 is when integer translation base corresponding integer transform computational methods schematic diagram when choosing [1,8,3] in the embodiment of the invention 1;
Fig. 7 is the computational methods figure of the computing module M3 in the integer transform computational methods schematic diagram among Fig. 6.
Embodiment
In order to make purpose of the present invention, technical scheme and advantage clearer, below in conjunction with drawings and Examples, the present invention is further elaborated, for convenience of explanation, only show the part relevant with the embodiment of the invention.Should be appreciated that the specific embodiment that this place is described, only be used to explain the present invention, not in order to restriction the present invention.
The present invention chooses different integer translation bases by the image residual error data X of the 4x4 of prediction module output in the received code device, makes up transformation matrix P; According to transformation matrix P, make up scaled matrix PF (PF=Z T* Z); X carries out direct transform (Y=PXP to the image residual error data T), and, adopt corresponding integer transform computational methods according to different integer variation bases; Handle carrying out convergent-divergent through the data Y after the 4x4 direct transform.
Embodiment one
Be illustrated in figure 1 as 1 one kinds of video coding 4x4 of embodiment of the invention integer transform method, said method comprising the steps of:
S101, the image residual error data X of the 4x4 of prediction module output in the received code device;
S102 chooses integer translation base, makes up transformation matrix P;
The general purpose integer transformation matrix of 4x4 is:
C = k 0 k 0 k 0 k 0 k 1 k 2 - k 2 - k 1 k 0 - k 0 - k 0 k 0 k 2 - k 1 k 1 - k 2
In the Matrix C, integer translation base is [k0k1k1];
In the embodiment of the invention, integer translation base can be chosen in the following numerical value: [1,9,4], [1,7,3], [1,8,3].
S103 according to transformation matrix P, makes up scaled matrix PF;
PF=Z T*Z
Wherein, Z T=[1/||m1||1/||m2||1/||m3||1/||m4||]; Z is
Figure BSA00000479386100052
m iBe the row vector of transformation matrix P, the i value is 1-4; || m i|| be the vectorial m of row iVector norm;
S104, direct transform is carried out integer transform to the image residual error data X of 4x4, and transformation for mula is Y=PXP TWherein P is described transformation matrix; X is described image residual error data;
The elementary cell of conversion is 4 one-dimensional transforms, the wherein x=[x1x2x3x4 of shape such as y=Px] T,
The y=[y1y2y3y4 of output] T, x1x2x3x4 represents the one dimension direct transform of integer transform
Import four point value; Y1y2y3y4 is 4 output numerical value of direct transform;
Change base according to different integers, adopt corresponding integer transform computational methods.
S105 handles carrying out convergent-divergent through the data Y after the 4x4 direct transform;
Y ′ = Y ⊗ PF
Figure BSA00000479386100054
Be the point multiplication operation of matrix, Y is that PF is a scaled matrix through the data after the 4x4 direct transform, and Y ' is the output valve after this convergent-divergent is handled, and represents video data to transform to information on the frequency domain.
The transform method concentration of energy performance of the embodiment of the invention is outstanding, and encircled energy can greatly be eliminated the correlation of video data on transformation space near 4x4DCT; Computation complexity is low, can realize by addition and displacement, and be easy to hardware and realize, and the operational precision height, do not deposit the unmatched problem of positive inverse transformation.
Embodiment two
Below be a kind of video coding 4x4 integer transform method of the embodiment of the invention 2, it is the integer translation base of 4x4 general purpose integer transformation matrix that the embodiment of the invention 2 is chosen [1,9,4], said method comprising the steps of:
S201, the image residual error data X of the 4x4 of prediction module output in the received code device;
S202 chooses integer translation base, makes up transformation matrix P;
The general purpose integer transformation matrix of 4x4 is:
C = k 0 k 0 k 0 k 0 k 1 k 2 - k 2 - k 1 k 0 - k 0 - k 0 k 0 k 2 - k 1 k 1 - k 2
In the Matrix C, integer translation base is [k0k1k1];
It is integer translation base that the embodiment of the invention is chosen [1,9,4], is built into transformation matrix P.
When transform-based was [1,9,4], transformation matrix P was:
P = 1 1 1 1 9 4 - 4 - 9 1 - 1 - 1 1 4 - 9 9 - 4
S203 according to transformation matrix P, makes up scaled matrix PF;
PF=Z T*Z
Wherein, Z T=[1/||m1||1/||m2|| 1/||m3|| 1/||m4||]; Z is
Figure BSA00000479386100063
m iBe the row vector of transformation matrix P, the i value is 1-4; || m i|| be the vectorial m of row iVector norm;
When the transform-based of transformation matrix P is [1,9,4],
Z T=[0.5,0.071796,0.5,0.071796]
PF = 0.25 0.035898 0.25 0.035898 0.035898 0.0051546 0.035898 0.0051546 0.25 0.035898 0.25 0.035898 0.035898 0.0051546 0.035898 0.0051546
S204, direct transform is carried out integer transform to the image residual error data X of 4x4, and transformation for mula is Y=PXP TWherein P is described transformation matrix; X is described image residual error data; The elementary cell of conversion is 4 one-dimensional transforms, the wherein x=[x1x2x3x4 of shape such as y=Px] T, the y=[y1y2y3y4 of output] T, x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform;
The computational process of described integer transform is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;y2=b4<<3+b4+b3<<2;y4=-b3<<3-b3+b4<<2;
Wherein, b1, b2, b3, b4 are the intermediate variable of integer transform;
Wherein "<<" expression is to left shift operation, priority is higher than addition and subtraction, and elementary cell y=Px needs 10 additions and 4 displacements altogether.
The direct transform quick calculation method of this transform-based as shown in Figure 2, wherein computing module M1 is as shown in Figure 3.Wherein x1 x2 x3 x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform; The data processing direction is from left to right, and the intermediate variable value that intersects at a round dot equals the data addition that round dot left side line is represented; Square expression data are taken advantage of a coefficient; Negative value is got in "-" expression; " 2 " expression takes advantage of 2, promptly moves to left one; " 4 " expression takes advantage of 4, promptly moves to left two; " 8 " expression takes advantage of 8, promptly moves to left 3.The input data 1 of " in1 " expression M1 module link to each other with b3 in the drawings; The input data 2 of " in2 " expression M1 module link to each other with b4 in the drawings.The dateout 1 of " out1 " expression M1 module links to each other with y2 in the drawings; The dateout 2 of " out2 " expression M1 module links to each other with y4 in the drawings.
S205 handles carrying out convergent-divergent through the data Y after the 4x4 direct transform;
Y ′ = Y ⊗ PF
Figure BSA00000479386100072
Be the point multiplication operation of matrix, Y is that PF is a scaled matrix through the data after the 4x4 direct transform, and Y ' is the output valve after this convergent-divergent is handled, and represents video data to transform to information on the frequency domain.
The transform method concentration of energy performance of the embodiment of the invention is outstanding, and encircled energy can greatly be eliminated the correlation of video data on transformation space near 4x4DCT; Computation complexity is low, can realize by addition and displacement, and be easy to hardware and realize, and the operational precision height, do not deposit the unmatched problem of positive inverse transformation.
Embodiment three
Below be a kind of video coding 4x4 integer transform method of the embodiment of the invention 3, it is the integer translation base of 4x4 general purpose integer transformation matrix that the embodiment of the invention 3 is chosen [1,7,3], said method comprising the steps of:
S301, the image residual error data X of the 4x4 of prediction module output in the received code device.
S302 chooses integer translation base, makes up transformation matrix P;
The general purpose integer transformation matrix of 4x4 is:
C = k 0 k 0 k 0 k 0 k 1 k 2 - k 2 - k 1 k 0 - k 0 - k 0 k 0 k 2 - k 1 k 1 - k 2
In the Matrix C, transform-based is [k0k1k1],
It is integer translation base that the embodiment of the invention is chosen [1,7,3], is built into transformation matrix P.
When transform-based was [1,7,3], transformation matrix P was:
P = 1 1 1 1 4 3 - 3 - 7 1 - 1 - 1 1 3 - 7 7 - 3
S303 according to transformation matrix P, makes up scaled matrix PF;
PF=Z T*Z
Wherein, Z T=[1/||m1||1/||m2||1/||m3||1/||m4||]; Z is
Figure BSA00000479386100083
m iBe the row vector of transformation matrix P, the i value is 1-4; || m i|| be the vectorial m of row iVector norm;
When the transform-based of transformation matrix P is [1,7,3],
Z T=[0.5,0.0928480.5,0.092848]
Scaled matrix PF = 0.25 0.046424 0.25 0.046424 0.046424 0.0086207 0.046424 0.0086207 0.25 0.046424 0.25 0.046424 0.046424 0.0086207 0.046424 0.0086207
S304, direct transform is carried out integer transform to the image residual error data X of 4x4, and transformation for mula is Y=PXP TWherein P is described transformation matrix; X is described image residual error data;
The elementary cell of conversion is 4 one-dimensional transforms, the wherein x=[x1x2x3x4 of shape such as y=Px] T, the y=[y1y2y3y4 of output] T, x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform;
The computational process of described integer transform is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;Y2=b4<<3-b4+b3<<2-b3;Y4=-b3<<3+b3+b4<<2-b4;
Wherein, b1, b2, b3, b4 are the intermediate variable of integer transform;
Wherein "<<" expression is to left shift operation, priority is higher than addition and subtraction, and elementary cell y=Px needs 12 additions and 4 displacements altogether.
The direct transform quick calculation method of this transform-based as shown in Figure 4, wherein computing module M2 is as shown in Figure 5.Wherein x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform; The data processing direction is from left to right, and the intermediate variable value that intersects at a round dot equals the data addition that round dot left side line is represented; Square expression data are taken advantage of a coefficient; Negative value is got in "-" expression; " 2 " expression takes advantage of 2, promptly moves to left one; " 4 " expression takes advantage of 4, promptly moves to left two; " 8 " expression takes advantage of 8, promptly moves to left 3.The input data 1 of " in1 " expression M2 module link to each other with b3 in the drawings; The input data 2 of " in2 " expression M2 module link to each other with b4 in the drawings.The dateout 1 of " out1 " expression M2 module links to each other with y2 in the drawings; The dateout 2 of " out2 " expression M2 module links to each other with y4 in the drawings.
S305 handles carrying out convergent-divergent through data Y after the 4x4 direct transform;
Y ′ = Y ⊗ PF
Figure BSA00000479386100092
Be the point multiplication operation of matrix, Y is that PF is a scaled matrix through the data after the 4x4 direct transform, and Y ' is the output valve after this convergent-divergent is handled, and represents video data to transform to information on the frequency domain.
The transform method concentration of energy performance of the embodiment of the invention is outstanding, and encircled energy can greatly be eliminated the correlation that video data is asked at the conversion sky near 4x4DCT; Computation complexity is low, can realize by addition and displacement, and be easy to hardware and realize, and the operational precision height, do not deposit the unmatched problem of positive inverse transformation.
Embodiment four
Below be a kind of video coding 4x4 integer transform method of the embodiment of the invention 4, it is the integer translation base of 4x4 general purpose integer transformation matrix that the embodiment of the invention 4 is chosen [1,8,3], said method comprising the steps of:
S401, the image residual error data X of the 4x4 of prediction module output in the received code device.
S402 chooses integer translation base, makes up transformation matrix P;
The general purpose integer transformation matrix of 4x4 is:
C = k 0 k 0 k 0 k 0 k 1 k 2 - k 2 - k 1 k 0 - k 0 - k 0 k 0 k 2 - k 1 k 1 - k 2
In the Matrix C, transform-based is [k0k1k1],
It is integer translation base that the embodiment of the invention is chosen [1,8,3], is built into transformation matrix P.
When transform-based was [1,8,3], transformation matrix P was
P = 1 1 1 1 8 3 - 3 - 8 1 - 1 - 1 1 3 - 8 8 - 3
S403 according to transformation matrix P, makes up scaled matrix PF;
PF=Z T*Z
Wherein, Z T=[1/ //m1||1/||m2||1/||m3||1/||m4||]; Z is
Figure BSA00000479386100101
m iBe the row vector of transformation matrix P, the i value is 1-4; || m i|| be the vectorial m of row iVector norm;
When the transform-based of transformation matrix P is [1,8,3],
Z T=[0.5,0.082761,0.5,0.082761]
Scaled matrix PF = 0.25 0.04138 0.25 0.04138 0.04138 0.0068493 0.04138 0.0068493 0.25 0.04138 0.25 0.04138 0.04138 0.0068493 0.04138 0.0068493
S404, direct transform is carried out integer transform to the image residual error data X of 4x4, and transformation for mula is Y=PXP TWherein P is described transformation matrix; X is described image residual error data;
The elementary cell of conversion is 4 one-dimensional transforms, the wherein x=[x1x2x3x4 of shape such as y=Px] T, the y=[y1y2y3y4 of output] T, x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform;
The computational process of described integer transform is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;Y2=b4*8+4*b3-b3;Y4=-b3*8+4*b4-b4;
Wherein, b1, b2, b3, b4 are the intermediate variable of integer transform;
"<<" expression is to left shift operation, and priority is higher than addition and subtraction, and elementary cell y=Px needs 10 additions and 4 displacements altogether.
The direct transform quick calculation method of this transform-based as shown in Figure 6, wherein computing module M3 is as shown in Figure 7.Wherein x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform; The data processing direction is from left to right, and the intermediate variable value that intersects at a round dot equals the data addition that round dot left side line is represented; Square expression data are taken advantage of a coefficient; Negative value is got in "-" expression; " 2 " expression takes advantage of 2, promptly moves to left one; " 4 " expression takes advantage of 4, promptly moves to left two; " 8 " expression takes advantage of 8, promptly moves to left 3.In1 " the input data 1 of expression M3 module, link to each other with b3 in the drawings; The input data 2 of " in2 " expression M3 module link to each other with b4 in the drawings.The dateout 1 of " out1 " expression M3 module links to each other with y2 in the drawings; The dateout 2 of " out2 " expression M3 module links to each other with y4 in the drawings.
S405 handles carrying out convergent-divergent through data Y after the 4x4 direct transform;
Y ′ = Y ⊗ PF
Be the point multiplication operation of matrix, Y is that PF is a scaled matrix through the data after the 4x4 direct transform, and Y ' is the output valve after this convergent-divergent is handled, and represents video data to transform to information on the frequency domain.
The transform method concentration of energy performance of the embodiment of the invention is outstanding, and encircled energy can greatly be eliminated the correlation of video data on transformation space near 4x4DCT; Computation complexity is low, can realize by addition and displacement, and be easy to hardware and realize, and the operational precision height, do not deposit the unmatched problem of positive inverse transformation.
Those having ordinary skill in the art will appreciate that, all or part of step in realization the foregoing description method can be finished by the program command related hardware, described program can be stored in the computer read/write memory medium, and described storage medium can be ROM, RAM, disk, CD etc.
The above only is preferred embodiment of the present invention, not in order to restriction the present invention, all any modifications of being done within the spirit and principles in the present invention, is equal to and replaces and improvement etc., all should be included within protection scope of the present invention.

Claims (8)

1. a video coding 4x4 integer transform method comprises
Step 1, the image residual error data X of the 4x4 of prediction module output in the received code device;
Step 2 is chosen integer translation base, makes up transformation matrix,
The general purpose integer transformation matrix of 4x4 is:
Figure FSA00000479386000011
In the Matrix C, integer translation base is [k0k1k1];
Step 3 according to transformation matrix P, makes up scaled matrix PF,
PF=Z T*Z
Wherein, Z T=[1//|| m1||1/||m2||1/||m3||1/||m4||]; Z is
Figure FSA00000479386000012
m iBe the row vector of transformation matrix P, the i value is 1-4; || m i|| be the vectorial m of row iVector norm;
Step 4, direct transform is carried out integer transform to the image residual error data X of 4x4, and transformation for mula is Y=PXP TWherein P is described transformation matrix; X is described image residual error data;
Step 5 is handled carrying out convergent-divergent through the data Y after the 4x4 direct transform,
Figure FSA00000479386000013
Be the point multiplication operation of matrix, Y is that PF is a scaled matrix through the data after the 4x4 direct transform, and Y ' is the output valve after this convergent-divergent is handled, and represents video data to transform to information on the frequency domain;
It is characterized in that:
Described integer translation base is in the following numerical value:
[1,9,4],[1,7,3],[1,8,3]。
2. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, the elementary cell of conversion is 4 one-dimensional transforms, the wherein x=[x1x2x3x4 of shape such as y=Px in the step 4] T, the y=[y1y2y3y4 of output] T, x1x2x3x4 represents input four point value of the one dimension direct transform of integer transform; Y1y2y3y4 is 4 output numerical value of direct transform.
3. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, when described integer translation base value is [1,9,4]:
Described transformation matrix
Figure FSA00000479386000021
Z T=[0.5,0.071796,0.5,0.071796]
Figure FSA00000479386000022
4. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, described when the integer translation base value is [1,9,4], " the image residual error data X to 4x4 carries out integer transform, and transformation for mula is Y=PXP T" computational process is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;y2=b4<<3+b4+b3<<2;y4=-b3<<3-b3+b4<<2;
Wherein "<<" expression is to left shift operation, priority is higher than addition and subtraction, and elementary cell y=Px needs 10 additions and 4 displacements altogether.
5. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, when described integer translation base value is [1,7,3]:
Described transformation matrix
Figure FSA00000479386000023
Z T=[0.5,0.0928480.5,0.092848]
Figure FSA00000479386000024
6. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, described when the integer translation base value is [1,7,3], " the image residual error data X to 4x4 carries out integer transform, and transformation for mula is Y=PXP T" computational process is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;Y2=b4<<3-b4+b3<<2-b3;Y4=-b3<<3+b3+b4<<2-b4;
Wherein "<<" expression is to left shift operation, priority is higher than addition and subtraction, and elementary cell y=Px needs 12 additions and 4 displacements altogether.
7. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, when described integer translation base value is [1,8,3]:
Described transformation matrix
Figure FSA00000479386000031
Z T=[0.5,0.082761,0.5,0.082761]
Scaled matrix
Figure FSA00000479386000032
8. video coding 4x4 integer transform method as claimed in claim 1 is characterized in that, described when the integer translation base value is [1,8,3], " the image residual error data X to 4x4 carries out integer transform, and transformation for mula is Y=PXP T" computational process is as follows:
b1=x1+x4;b2=x2+x3;b3=x2-x3;b4=x1-x4;
y1=b1+b2;y3=b1-b2;Y2=b4*8+4*b3-b3;Y4=-b3*8+4*b4-b4;
"<<" expression is to left shift operation, and priority is higher than addition and subtraction, and elementary cell y=Px needs 10 additions and 4 displacements altogether.
CN2011101014046A 2011-04-20 2011-04-20 Video coding 4*4 integral number transformation method Pending CN102137260A (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN2011101014046A CN102137260A (en) 2011-04-20 2011-04-20 Video coding 4*4 integral number transformation method
PCT/CN2011/073582 WO2012142776A1 (en) 2011-04-20 2011-04-29 Video coding 4×4 integer transform method
CN201110278320.XA CN102333217B (en) 2011-04-20 2011-09-19 Video coding 4*4 integer transform method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011101014046A CN102137260A (en) 2011-04-20 2011-04-20 Video coding 4*4 integral number transformation method

Publications (1)

Publication Number Publication Date
CN102137260A true CN102137260A (en) 2011-07-27

Family

ID=44296902

Family Applications (2)

Application Number Title Priority Date Filing Date
CN2011101014046A Pending CN102137260A (en) 2011-04-20 2011-04-20 Video coding 4*4 integral number transformation method
CN201110278320.XA Expired - Fee Related CN102333217B (en) 2011-04-20 2011-09-19 Video coding 4*4 integer transform method

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201110278320.XA Expired - Fee Related CN102333217B (en) 2011-04-20 2011-09-19 Video coding 4*4 integer transform method

Country Status (2)

Country Link
CN (2) CN102137260A (en)
WO (1) WO2012142776A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018142823A1 (en) 2017-01-31 2018-08-09 Sharp Kabushiki Kaisha Systems and methods for scaling transform coefficient level values

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7185037B2 (en) * 2001-08-23 2007-02-27 Texas Instruments Incorporated Video block transform
CN100372382C (en) * 2003-03-24 2008-02-27 华为技术有限公司 Video image coding method
CN100433837C (en) * 2004-03-18 2008-11-12 华中科技大学 Integral conversing matrix selection method of video coding and related integral conversion method
CN1662069A (en) * 2005-01-12 2005-08-31 华中科技大学 Method for encoding/decoding image and video signals
US8331454B2 (en) * 2007-11-12 2012-12-11 Cisco Technology, Inc. Integer transform function for video compression systems
CN101931817A (en) * 2010-04-12 2010-12-29 中山大学 Transform coding method based on transform base self-adaption

Also Published As

Publication number Publication date
CN102333217B (en) 2017-04-12
CN102333217A (en) 2012-01-25
WO2012142776A1 (en) 2012-10-26

Similar Documents

Publication Publication Date Title
TWI680409B (en) Method for matrix by vector multiplication for use in artificial neural network
US9613669B2 (en) Matrix transposing circuit
CN102231788B (en) Method and apparatus for high-speed and low-complexity piecewise geometric transformation of signals
CN107590106A (en) A kind of computational methods for being applied to symmetrical matrix and vector multiplication
US20080126467A1 (en) Technique for transposing nonsymmetric sparse matrices
JP2023014091A (en) efficient convolutional engine
CN104244010A (en) Method for improving digital signal conversion performance and digital signal conversion method and device
CN115641285A (en) Binocular vision stereo matching method based on dense multi-scale information fusion
CN104320668B (en) HEVC/H.265 dct transform and the SIMD optimization methods of inverse transformation
CN114007079A (en) Conversion circuit, method, device and encoder
CN102547263B (en) The inverse discrete cosine transform of variable complexity is tabled look-up fast algorithm
CN104144346B (en) For executing the data processing equipment and method of the transformation between airspace and frequency domain
CN102137260A (en) Video coding 4*4 integral number transformation method
CN110766136B (en) Compression method of sparse matrix and vector
CN1187698C (en) Design method of built-in parallel two-dimensional discrete wavelet conversion VLSI structure
CN103533378A (en) Three-dimensional integer DCT (Discrete Cosine Transform) transformation system on basis of FPGA (Field Programmable Gate Array) and transformation method thereof
CN102227135A (en) Video coding 8X8 integer transformation method
CN104270643B (en) Address mapping algorithm for transposed matrix based on single-port SRAM
CN102137261A (en) 16*16 integer transformation method for video coding
CN101562744B (en) Two-dimensional inverse transformation device
CN102025988B (en) Mode-related fast transformation method
KR101395143B1 (en) Integer transform method for image processing and device threof
CN102333216B (en) A kind of 16*16 integer transformation method for video coding
CN104581174A (en) High-throughput DCT and IDCT hardware multiplexing structure suitable for HEVC standard
Mishra et al. Architectures for FPGA-based implementation of motion estimation of dynamic obstacles for autonomous robot navigation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20110727

REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1159373

Country of ref document: HK

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1159373

Country of ref document: HK