CN1107413C - Motion vector field coding - Google Patents

Motion vector field coding Download PDF

Info

Publication number
CN1107413C
CN1107413C CN95198000A CN95198000A CN1107413C CN 1107413 C CN1107413 C CN 1107413C CN 95198000 A CN95198000 A CN 95198000A CN 95198000 A CN95198000 A CN 95198000A CN 1107413 C CN1107413 C CN 1107413C
Authority
CN
China
Prior art keywords
section
matrix
vector
motion
coefficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN95198000A
Other languages
Chinese (zh)
Other versions
CN1205153A (en
Inventor
J·尼维罗斯基
M·卡泽维茨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Mobile Phones Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Mobile Phones Ltd filed Critical Nokia Mobile Phones Ltd
Priority to CN95198000A priority Critical patent/CN1107413C/en
Publication of CN1205153A publication Critical patent/CN1205153A/en
Application granted granted Critical
Publication of CN1107413C publication Critical patent/CN1107413C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to an image encoding decoder which comprises an encoder of a motion information group for minimizing the number of motion coefficients of an operating vector information group. A first block in the encoder of a motion information group comprises a device for generating a new matrix representation of an information group of motion vectors. A new information group of coded motion vectors is linear. A second main block is a device for merging adjacent section pairs, the adjacent section pairs are merged by the second main block under the condition that an information group of a combining section can be predicated by an information group of common motion, and merging information is transferred to a decoder. A third main block comprises a device for eliminating basic functions of a motion information group. Square prediction errors are calculated and an elimination process is continued to be executed after each elimination step until the square prediction errors can not be accepted. A final motion coefficient is obtained by computing a linear matrix equation. The present invention has a result that motion coefficients of reducing numbers of each section are obtained. The motion coefficients are transferred to the image encoding decoder.

Description

Motion vector field coding
Technical field
The relevant image compression of the present invention.More precisely, the invention relates to an a kind of estimating motion information sets of coding (motion field) and reach the method that in an image sequence, generates movable information.
Background of invention
Motion compensated prediction is the key link in most of picture coding schemes.Fig. 1 utilizes motion compensation technique at one, is used for the schematic diagram of an encoder of compressed image sequence.Essential elements in this encoder is 1, one motion estimator 2 of a motion-compensated prediction block and a movable information group coding device 3.The operation principle of moving compensation image encoder is the incoming frame I that is encoded that compression is called as present frame n(x is y) with a predictive frame
Figure C9519800000061
Between the predicated error E of difference n(x, y), wherein: E n ( x , y ) = I n ( x , y ) - I ^ n ( x , y ) - - - ( 1 ) Predictive frame
Figure C9519800000063
Make up by motion-compensated prediction block 1, and utilize the front or some other be called as reference frame, be designated as The pixel value of coded frame, and the motion vector of the pixel between present frame and reference frame is set up.Motion vector is calculated by movable information group estimation device 2, and then, the Vector Message group of gained was encoded before being transmitted to prediction piece 1 in some way.Predictive frame is: I ^ n ( x , y ) = I ~ n - 1 [ x + Δx ( x , y ) , y + Δy ( x , y ) ] - - - ( 2 ) Several to [x+ Δ x (and x, y), y+ Δ y (x, y)] be called as position in the present frame (x, the motion vector of the pixel of y) locating, and Δ x (x, y) (x y) is the displacement in the horizontal and vertical directions of this pixel with Δ y.Present frame I n(x, y) in the motion vector of one group of all pixel be called as motion vector information group (motion vector field).The encoding motion vector information sets also is transmitted to decoder as movable information.
Fig. 2 is in decoder, by at reference frame
Figure C9519800000066
In obtain the prediction of picture element Reconstruct present frame I n(x, picture element y).Motion-compensated prediction block 21 is utilized the movable information and the reference frame that receive (in the figure, reference frame is identical with present frame), the generation forecast frame.Then, in predicated error decoder 22, decoded predicated error E n(x, y) with the predictive frame addition, the result is original present frame I n
The general purpose of motion compensated prediction is to minimize the amount of information that need send decoder to.Should minimize by such as E n(the predicated error amount that x, energy y) measure also will minimize the required amount of information of reproduction motion vector information group.
By H.Nguen, " movable information of reproduction picture coding " (" the Representation of motion information for image coding ") that E.Dubois showed, see proceedings, picture coding collection of thesis ' 90, Massachusetts, Cambridge, March nineteen ninety 18-26, the 841-845 page or leaf has provided the commentary of a movable information group coding technology in this article.Rule of thumb, reduce predicated error, just need more complicated movable information group, promptly must have more position to be used for coding.Like this, the general purpose of picture coding is an encoding motion vector information sets as far as possible closely, simultaneously, keeps the value of predicated error low as far as possible.
Fig. 1, movable information group prediction piece 1, the motion vector of all picture elements of a given section of calculating, it minimizes the value of predicated error in this section, for example square prediction error.Movable information group valuation technology is at the model of movable information group and make aspect the minimized algorithm two of selected prediction error value all different.
Because in a frame, the number of picture element is very big, so if will transmit the single motion vector of each picture element, efficient is very low.And in most picture coding schemes, present frame all is divided into big visual section, and like this, motion vector all in the visual section can be described by parameter seldom.The image section can be square block, for example, and according to international standard ISO/IECMPEG-1 or ITU-TH.261, coding decoder uses the piece of 16 * 16 picture elements, and perhaps, they can comprise the district's information sets that is entirely arbitrary shape, for example, the district's information sets that draws by segmentation algorithm.In actual applications, section comprises tens picture elements at least.
In order to reappear the motion vector of picture element in the section densely, their value can be described preferably with the function that few parameters is arranged.This function is called as the motion vector information group model.A kind of known model family is the linear movement model, and wherein, motion vector is the linear combination of movable information group basic function.In this model, the motion vector of image section is represented by a common-used formula: Δx ( x , y ) = Σ i = 1 N c i f i ( x , y ) - - - ( 3 ) Δy ( x , y ) = Σ i = N + 1 N + M c i f i ( x , y )
Wherein, parameters C iBe called as kinematic coefficient, they are transmitted to decoder.Function f i(x y) is called as movable information group basic function, and they are fixed, and encoder is all known this function.
When utilization had the linear movement model of above-mentioned formula, the problem of existence was how to minimize the kinematic coefficient C that will be fed to decoder iNumber, keep predicated error E simultaneously n(x, value y) is low as far as possible.This process is in encoder, is carried out by movable information group coding piece 3, sees Fig. 1.It is carried out after piece 2 is finished movable information group valuation very complicated in the calculating.Thereby crucial is, the movable information group coding should be simple on calculating, and like this, it could not increase extra burden to encoder.
The number that need be transmitted to the kinematic coefficient of decoder depends on the number of kinematic coefficient in the number of section in the image and each section.Like this, have at least two kinds of approach can reduce the sum of kinematic coefficient.
First method is those can be predicted with a common motion vector information sets, and can not cause that those sections that predicated error increases significantly make up (merging) together, thereby reduce the number of section.The number of section can reduce in the frame, because, usually, be close to, that is, contiguous section can be predicted well by identical kinematic coefficient.The process that makes up this section is called as auxiliary merge (the motion assisted merging) of motion.
Second method is to choose a motion model for each section, and it can obtain gratifying low predicated error, and coefficient is few as much as possible.Because between frame and the frame and between section and the section, it is different that the quantity of motion and complexity all have, so each section is all always used the N+M movement function, effect is also bad.Be necessary for every section minimum number of all finding out the kinematic coefficient that can obtain gratifying low predicated error.This coefficient adaptively selected is called as kinematic coefficient cancellation (motion coefficient removal).
Fig. 3 shows a frame that is divided into section.In the prior art of kinematic coefficient coding, comprise that several motions assist folding.At the motion vector of all sections all after the valuation, carry out that motion is auxiliary to be merged.It is based on every couple of adjacent sections S iAnd S jAnd their kinematic coefficient C separately iAnd C jFinish.Combination section S iAnd S jDistrict's field mark be S IjIf S IjDistrict's information sets can be with one group of kinematic coefficient C IjPredict, and can not cause than by independent prediction S iAnd S jWhen section gained error has the predicated error of excessive increase, S then iAnd S jMerged.The auxiliary method that merges of motion is at the single kinetic system array C that seeks the good prediction that can provide the section of combining IjMode on, have essence difference.
A kind of method is arranged, be called as nothing omission motion estimation and merge.This method is every couple of adjacent sections S iAnd S jEstimate " starting all over again from the beginning " new kinematic parameter group C IjIf S IjPredicated error excessively do not increase section S then iAnd S jMerged.Although this method can be selected the merged section of energy well, it is also infeasible in realization, because generally speaking, it will make the complexity of encoder increase by several orders of magnitude.
Another kind method is called as to expand by the movable information group and merges.This method is judged district information sets S IjWhether can utilize kinematic parameter C iOr C jPredict, and predicated error does not have excessive increase.The characteristics of this method are that computational complexity is very low, because it is without any need for new estimating motion.But, this method often can not merge the section of this merging because be motion compensation that a section calculates the gained coefficient seldom can be simultaneously the contiguous section of prediction well.
In addition, also have a kind of method, be called as by the match of movable information group and merge.In this method, calculate kinematic coefficient C by approximation method IjThis is by estimating in each section that several motion vectors seldom finish.Fig. 3 has described section S iAnd S jIn some motion vectors.Utilize some known approximating methods, by these vectors, common motion vector information sets of match, thereby calculation of sector S IjThe movable information group.The shortcoming of this method is, and is not accurate enough by the movable information group that match obtains, and regular meeting causes the unacceptable increase of predicated error.
H.Nicolas and C.labit showed " utilize the prejudgementing character scheme of approaching one by one to do estimating motion (" Region-based motion estimationusing deterministic relaxation schemes for image sequencecoding "); to see proceedings; 1994 annual acoustics international symposiums; language and signal processing; No. 265~268, III based on the interval for coding sequence of pictures; and the symmetrical directed split plot design of the estimating motion and the coding sequence of pictures in interval " efficiently based on " that P.Cicconi and H.Nicolas showed, see the IEEE collection of thesis, the Circuits and Systems of image technology, No. the 3rd, the 4th volume, in June, 1994, the 357th~364 page.In these two pieces of documents, proposed to carry out estimating motion and select only a kind of method with different models.These methods attempt to rely on the complexity of motion to adopt motion model by carrying out estimation and select only one with different models.The major defect of these methods is complexity that it calculates and in fact, can be for the quantity of the different motion information sets model that detects seldom.
How to make the kinematic coefficient C that transmits toward decoder iNumber minimum, keep simultaneously predicated error En (x, is value y) as far as possible little? in the foregoing method, neither one can be independently solved this problem.
Summary of the invention
An object of the present invention is to generate a movable information group coding device, it can reduce the quantity of the movable information group Vector Message that is generated by some known motion estimator largely, and can not cause the bigger increase of predicated error.The complexity of movable information group coding device should be lower, so that in practice, can realize on available signal processor or general purpose microprocessor.
By the present invention, movable information group coding device comprises three main pieces.
First main piece is called as a QR motion analyzer.Its task is to find out the new expression of the input motion district information sets that is produced by movable information group estimator.This new expression is fed to second main piece.The operation of this first main piece comprises the many steps that contain matrix manipulation: in the first step, utilize some known approximation method linearisation predictive frames, so that predictive frame becomes the linear function about motion vector.In second step, structural matrix E iAnd Y i, be used for minimizing of square prediction error.In the 3rd step, use a known QR factoring algorithm, with matrix E iBe decomposed into two matrix Q iAnd R iProduct.In addition, by coefficient matrix Q iWith matrix Y i, calculate an auxiliary vector Z iMatrix R iWith auxiliary vector Z iA part be sent to second main piece.
Second main piece is called a section and merges piece.This piece is by judging that whether the right combination region information sets of adjacent sections can utilize a common motion district information sets prediction, carries out union operation.In matrix manipulation, at first form a Matrix Formula, surplus with known matrix computations method then, handle the factor matrix.The gained result is a matrix-style, and here, a matrix includes some, on its basis, can calculate the square prediction error in merging section district information sets at an easy rate.If according to a selected rule, the variation of square prediction error is can be received, and then these two sections are merged.
To after having been investigated, the output that section merges section is at all sections:
I. the new division of the image that reduced of sector number,
Ii. to each new section, this piece is output matrix R all Ij 1With vector Z Ij 1
Iii. be sent to the pooling information of decoder, it can help decoder to determine merged.
The 3rd main piece is called as coefficient cancellation piece, and this piece receives the present frame of importing of repartitioning with the form of section, and to each section, its receives by section and merges the matrix R that piece produces k 1, Z k 1And C kEvery section motion vector is represented by some kinematic coefficients.Kinematic coefficient cancellation piece is every section and determines whether to simplify the movable information group model, and predicated error is excessively increased.Some basic functions are by cancellation from motion model, thereby only need less coefficient to describe the movable information group model of such simplification.
Operation in this 3rd main piece is a matrix manipulation, and wherein, at first cancellation delegation and row from coefficient matrix are revised Matrix Formula, then, Matrix Formula are carried out trigonometric ratio.Cancellation delegation and row are equivalent to basic function of cancellation from motion model.The variation of the square prediction error of the section that causes by basic function of cancellation equal in the formula one square.
If according to a selected rule, the variation of predicated error is can be received, then coefficient of cancellation from coefficient sets.Then repeat these matrix manipulations, can also reduce the more coefficient of this section.Behind the coefficient of cancellation sufficient amount,, calculate the final kinematic coefficient of this section by finding the solution final linear formula.This formula can utilize a kind of the finding the solution in the algorithm known, for example back-substitution algorithm.
The 3rd main piece is each processed section output selection information, and it shows has by cancellation from the movable information group model for which basic function.In addition, its output is corresponding to the new kinematic coefficient of residue basic function.Selection information and kinematic coefficient all are transmitted to decoder.
Accompanying drawing is described
Fig. 1 is the schematic diagram of a known encoder;
Fig. 2 is the schematic diagram of a known decoder;
Fig. 3 has described the adjacent sections that is used to merge;
Fig. 4 shows the merging of being done by motor area information sets approximation method;
Fig. 5 be one according to movable information group coding device of the present invention;
Fig. 6 is the sketch of a QR motion analyzer.
Embodiment describes
Fig. 5 shows according to movable information group coding device of the present invention.It is corresponding to the piece among Fig. 13, but its input also has reference frame and present frame.The 3rd input of this piece is the motion vector information group [Δ x (), Δ y ()] that is generated by movable information group valuation piece among Fig. 12.
Our hypothesis, the output of picture coding device is the condensed frame that is divided into section, and each section has kinematic coefficient, comprising P coordinate for one so is (x i, y i); I=1,2 ... the section of the picture element of P, the task of movable information group coding device are to find out a compressing motion vector information sets Kinematic coefficient, wherein motion vector is by a linear movement model representation, the form of motion vector information group is: Δ ~ x ( x , y ) = Σ i = 1 N c ~ i f i ( x , y ) - - - ( 4 a ) Δ ~ y ( x , y ) = Σ i = N + 1 N + M c ~ i f i ( x , y ) - - - ( 4 b ) Like this, it minimizes square prediction error: Σ i = 1 P ( I n ( x i , y i ) - I ~ n - 1 ( x i + Δx ~ ( x i , y i ) , y i + Δy ~ ( x i , y i ) ) ) 2 . - - - ( 5 )
For finishing described task, movable information group coding device comprises three main pieces that make up, and they are QR motion analyzer pieces, and section merges piece and kinematic coefficient cancellation piece.Section merges the quantity of piece and kinematic coefficient cancellation piece minimizing movable information, consequently causes the increase of square prediction error.
The purpose of QR motion analyzer is to find out the new expression of motor area information sets.Be used in two pieces that are illustrated in the back that this is new so that fast and flexible find out kinematic coefficient that merges section and the cancellation that is used for coefficient.
The operation of QR motion analyzer comprises the steps:
The first step is the linearisation of error.In this step, utilize some known approximation methods, the predictive frame I in the approximation formula (7) N-1() so that it with respect to
Figure C9519800000122
Linearisation.Like this, every coefficient C that becomes of summation in the formula (5) iLinear combination Σ j = 1 P ( e j , 1 c ~ 1 + e j , 2 c ~ 2 + · · · + e j , N + M c ~ N + M - y j ) 2 - - - ( 6 )
The 2nd step was a structural matrix.It is based on such fact, and promptly formula (6) minimizes and matrix expression Minimize fully and be equal to, E here iAnd Y iFor: E i = e 1,1 e 1,2 . . . e 1 , N + M e 2,1 e 2,2 . . . e 2 , N + M . . . . . . . . . . . . e P , 1 e P , 2 . . . e P , N + M , y = y 1 y 2 . . . y P - - - ( 7 )
The 3rd step was the QR factorization." matrix computations " second edition that G.H.Golub and C.Van Loan are shown, famous QR factoring algorithm 1989, has been described by Johns Hopkins university press in this book.This algorithm is used for E iBe decomposed into the product of two matrixes
E i=Q iR i(8) in this step, also calculated a companion matrix Z i, wherein
z i=Q T iy i (9)
In the 4th step, the output of calculating QR motion analyzer piece.Output packet contains a matrix R i 1, it contains matrix R iPreceding N+M capable, also comprise a vector Z i 1, it contains Z iPreceding N+M element.
In section merges piece, for adjacent sections to S iAnd S jCarry out union operation, see Fig. 4, this operation is by judging their combination region information sets S IjWhether can utilize by kinematic coefficient C IjA common motion information sets describing is predicted and is finished.Union operation may further comprise the steps:
The 1st step was matrix computations, and the present invention has used a unknown in advance parameter, kinematic coefficient C IjCan obtain by finding the solution a system of linear equations: R 1 i R 1 j c ij = z 1 i z 1 j - - - ( 10 )
R wherein i 1, Z i 1And R j 1Z j 1By QR analyzer piece respectively to section S iAnd S jFind the solution and obtain.
In the 2nd step, to the 1st step gained matrix trigonometric ratio.Matrix R i 1, R i 1For upper triangular matrix and equation group (10) have the form described in the document as previously mentioned:
Figure C9519800000132
Wherein symbol X represents nonzero element.According to what said in the above-mentioned document, by scalar each row is carried out a series of multiplying each other by the heel additional row, come the trigonometric ratio equation group, that is, it is converted into this form:
Figure C9519800000141
In the 3rd step, estimate pooled error, by section S iAnd S jMerge caused district information sets S IjIn the change of square prediction error, calculate according to the method for being said in the above-mentioned document, wherein: ΔE ij = Σ k = 1 N + M q 2 k - - - ( 13 )
At last, in the 4th step, if according to selected rule, the change of square prediction error is can be received in the formula (13), then section S iAnd S jMerged, for the new section S of gained Ij, capable by the preceding N+M that gets equation group (12), structural matrix R Ij 1With vector Z Ij 1, that is, provide by following formula:
Sections all in frame are obtained the output that section merges piece to after all investigating.Output comprises three category informations.At first, it provides the new division of the image that the number of section reduced.Secondly, it provides the piece output matrix R of each new section Ij 1, vector Z Ij 1Once more, it provides pooling information, and this information is sent in the decoder, helps decoder to discern merged section.Now, can be by the solving equation group R ij 1 C ij = Z ij 1 , Calculation of sector S IjKinematic coefficient C Ij=(C 1, C 2... C N+M), but, if use next piece, i.e. coefficient cancellation piece then there is no need to calculate them.
Now, we have a look the operation of coefficient cancellation piece.This piece receives the new division of the present frame of input with the form of section, and for each section S k, it receives by front section and merges the matrix R that piece produced k 1, Z k 1And C kEvery section motion vector is represented by the N+M kinematic coefficient.
To a given section S k, kinematic coefficient cancellation piece determines whether to simplify the movable information group model, and does not have the excessive increase of predicated error.When some basic functions by from should with background technology in the model of the formula (3) told about during cancellation, the movable information group model that can obtain simplifying.The movable information group model of describing such simplification only needs less coefficient.
To each section, carry out following processes, its objective is and find out that i basic function (and i coefficient) whether can be by cancellation from the movable information group model:
The 1st step comprised matrix modifications, here, and by from R k 1In remove i row and from C kIn remove i element, revise system of linear equations
R 1 kc k=z 1 k (15)
The 2nd step comprised the matrix trigonometric ratio, here, with a kind of known way, by the scalar by the heel additional row a series of row was multiplied each other, and made equation group (15) trigonometric ratio, that is, it is converted into such form:
Figure C9519800000151
The 3rd step comprised error estimation, the variation of the square prediction error of the section that is caused by i coefficient of cancellation and the q in the formula (16) i 2Item equates.
The 4th step comprised the cancellation of coefficient.If according to a given rule, the change of predicated error is can be received, then coefficient C iBy cancellation from coefficient sets.New coefficient number is N+M-1.Matrix R k 1With vector Z k 1Be modified to form:
Figure C9519800000161
Utilize the matrix (17) in the formula (15) and repeated for 1~4 step, also can further reduce coefficient number.
The 5th step comprised coefficient calculations.Behind the coefficient of enough numbers of cancellation, begin this step.In this step,, calculate a section S by finding the solution system of linear equations kLast kinematic coefficient:
R 1 kc k=z 1 k (18)
Matrix R wherein k 1With vector Z k 1Be the front result in 1~4 step.Can utilize an algorithm known, for example back-substitution algorithm solving equation group.
Kinematic coefficient cancellation piece is each processed section output selection information, and it can tell decoder which basic function all to be arranged by cancellation from the movable information group model.In addition, its output is to being applied to the new kinematic coefficient of remaining basic function.Selection information and kinetic system number average are fed to decoder.
As in above all pieces result in steps, generate as encoder of the present invention: pooling information, which section quilt the notice decoder has merged; Selection information, which basic function the notice decoder has by cancellation; And kinematic coefficient information.
The present invention compares with prior art, and biggest advantage is that it has the ability that reduces movable information quantity largely, and can not cause the increase that predicated error is bigger.In addition, the complexity of whole system is very low, and in actual applications, it can be realized on existing signal processor or general microprocessor.
Section merges piece a distinctive ability, and it can be from given for finding out the motion vector of combination section the single section estimated motion vectors.Can prove that in fact, the motion vector that is produced by its is best in the square error that keeps combined segment aspect minimizing.This has just explained why this piece has and can significantly reduce the number of section, and only causes the ability that very little square prediction error increases.
For the actual quantity and the type of the motion in the instantaneous adaptation image of the motion model scene, kinematic coefficient cancellation piece is a strong tool.This piece can utilize very a large amount of models to detect at an easy rate and predict the outcome (the square prediction error value of a section), for example, utilize all possible combination of movable information group basic function.Known method without any other has big like this flexibility.A very big benefit of this scheme is, it does not need the repeating motion estimation process, thereby, on calculating, be very simple.
By after the QR motion analyzer, using estimating motion, movable information group coding device can be by finding the solution very simple system of linear equations, for the movable information group model in the combination of any desired image section or any desired section is found out new kinematic coefficient.
Most preferred embodiment is described
In most preferred embodiment, used quadratic polynomial motion vector information group model with 12 coefficients: Δ x (x, y)=c 1+ c 2X+c 3Y+c 4Xy+c 5x 2+ c 6y 2Δ y (x, y)=c 7+ c 8X+c 9Y+c 10Xy+c 11x 2+ c 12y 2
In actual applications, this model can be provided in the image sequence by very complicated motion and predicting the outcome of providing well.
In QR motion analyzer piece, by each the picture element (x around point i, y i) (i=1,2 ... P) locate to use I N-1The Taylor expansion of () is finished the linearisation in the 1st step: x ' i=x i+ Δ x (x i, y i) y ' i=y i+ Δ y (x i, y i) utilize ∑ a 2=∑ (a) 2Characteristic, predicated error is: Σ i = 1 , P ( I ~ n - 1 ( x i ′ , y i ′ ) + ( Δ ~ x ( x i , y i ) - Δx ( x i , y i ) ) G x ( x i ′ , y i ′ ) + ( Δ ~ y ( x i , y i ) - Δy ( x i , y i ) ) G y ( x i ′ , y i ′ ) - I n ( x i , y i ) ) 2 Utilize following formula to calculate instrumental value: g j(x i, y i): Here, function f j(x i, y i) be basic function as definition at formula (4a) and (4b).Matrix E in the formula (9) and vector Y are utilized the following formula structure: g 1 ( x 1 , y 1 ) g 2 ( x 1 , y 1 ) . . . g N + M ( x 1 , y 1 ) g 1 ( x 2 , x 2 ) g 2 ( x 2 , y 2 ) . . . g N + M ( x 2 , y 2 ) . . . . . . . . . . . . g 1 ( x P , y P ) g 2 ( x P , y P ) . . . g N + M ( x P , y P ) , y = I n ( x 1 , y 1 ) - I ~ n - 1 ( x 1 ′ , y 1 ′ ) + G x ( x 1 ′ , y 1 ′ ) Δx ( x 1 , y 1 ) + G y ( x 1 ′ , y 1 ′ ) Δy ( x 1 , y 1 ) I n ( x 2 , y 2 ) - I ~ n - 1 ( x 2 ′ , y 2 ′ ) + G x ( x 2 ′ , y 2 ′ ) Δx ( x 2 , y 2 ) + G y ( x 2 ′ , y 2 ′ ) Δy ( x 2 , y 2 ) . . . I n ( x P , y P ) - I ~ n - 1 ( x P ′ , y P ′ ) + G x ( x P ′ , y P ′ ) Δx ( x P , y P ) + G y ( x P ′ , y P ′ ) Δy ( x P , y P )
G x(x, y) and G y(x y) is reference frame I N-1(they are calculated by following formula for x, level y) and vertical slope value: G x ( x , y ) = I ~ n - 1 ( x + 1 , y ) - I ~ n - 1 ( x - 1 , y ) , G y ( x , y ) = I ~ n - 1 ( x , y + 1 ) - I ~ n - 1 ( x , y - 1 ) .
Fig. 6 shows the sketch of a QR motion analyzer.Row selects piece only to select the preceding N+M of input matrix capable.
Merge in the piece at section, used following section to merge skill:
A. select a thresholding T, it is corresponding to the square prediction error added value that allows in the entire frame,
B. the adjacent sections to all is right, utilizes formula (13) to calculate Δ E Ij
C. Δ E IjMinimum section is to being merged.
D. repeat b-c, up to corresponding to the Δ E of all merged sections to sum IjGreater than T.
Δ E to sum IjGreater than T.
For the trigonometric ratio of equation group (11), a series of Givens circulations have been used.
In kinematic coefficient cancellation piece, used the skill of following elimination factor: a. to select a thresholding T, it is corresponding to the permission added value of square prediction error in the entire frame.B. to all sections and all functions, utilize formula (16) to calculate q i 2C. the minimum q of a section of cancellation i 2The basic function of value.D. repeat b-c, up to in different sections all by all corresponding q of the basic function of cancellation i 2And greater than T.
Use a series of Givens circulations, equation group (16) is realized trigonometric ratio.
Utilize back-substitution algorithm, by solution formula (18), the final kinematic coefficient of calculation of sector.
Pixel value I N-1(x, y), G x(x, y) and G y(x y), only is integer coordinate points X and Y definition.At many X or Y is not under the integer-valued situation, comes calculating pixel values by the nearest picture element that the rounded coordinate value is arranged is carried out bilinear interpolation.
Under the situation that does not break away from the spirit and scope of the present invention, system can realize in a different manner.For example, in formula (3), can use different linear movement models.Can with diverse ways go in the linearization equations (5) the item.In addition, can remove to determine whether to merge two sections with different rules.Determine the skill whether a given basic function is somebody's turn to do cancellation from model that difference is also arranged.Can carry out the trigonometric ratio of matrix in formula (10) and (15) with different algorithms.And can separate the algorithm known of system of linear equations with some, solution formula (18) solves final coefficient.At last, can ask the I at non-integer coordinates point place with different interpolation algorithms N-1(x, y), G x(x, y) and G y(x, value y).

Claims (11)

1. image coding-decoding device, the present frame I of the image sequence that is used to encode n, so that the predictive frame of a coding to be provided This image coding-decoding device comprises a movable information group coding device, and this movable information group coding device comprises:
A motion analyzer is used for receiving
-present frame I n,
-reference frame I N-1, and
-motion vector information group is to the present frame I of multi-region section nEach section, this motion vector information group have one group of motion vector [x+ Δ x (and x, y), y+ Δ y (x, y)], each motion vector is with the reference frame of a section Be mapped to a section present frame I corresponding to this vector n,
This motion analyzer comprises:
A kind of device is used for approaching a prediction error functions between each present frame section and reference frame section, the reference frame section then passes through with one group of N+M kinematic coefficient C iWith one group of basic function f iApproach corresponding motion vector [x+ Δ x (and x, y), y+ Δ y (x, y)] and be mapped to the present frame section, this approximated function comprises a kinematic coefficient C iLinear combination, and according to E iC i-y iCan be expressed as a matrix expression and
The matrix processing unit is used for being each present frame section, by described matrix expression, forms one first matrix R i 1With the first vector Z i 1, wherein R i 1 C i = Z i 1 , And the first matrix R i 1Line number equal the number of kinematic coefficient N+M;
Section merges device
Be connected on the motion analyzer, to receive the described first matrix R i 1With the first vector Z i 1, and be used for
To present frame I nAdjacent sections to (S i, S j), by each each right matrix R i 1, R j 1With the first vector Z i 1, Z j 1, determine that merges a matrix R Ij 1With combined vector Z Ij 1, and
With merging matrix R Ij 1With combined vector Z Ij 1Replace the first paired matrix R i 1With the first vector Z i 1, wherein, merges matrix and vector and in predetermined threshold, provide predicated error, and it is right to merge corresponding section; The coefficient destructor is connected to section and merges the first remaining matrix R of reception on the device i 1With the first vector Z i 1And the merging matrix R that replaces Ij 1With combined vector Z Ij 1, and be used for
Each section or merging section to present frame utilize corresponding matrix R k 1With vector Z k 1, determine one group of kinematic coefficient C that simplifies kThereby, the variation that makes the prediction error functions of approaching in a preset range, and
A present frame I is provided nEncoding scheme, merge section for each section or each, it comprises the kinematic coefficient that described number reduces and shows the mark which coefficient has been omitted.
2. image coding-decoding device as claimed in claim 1, wherein, for each section in the present frame, to its prediction error functions approximation, this function is by motion analyzer: Σ j = i P ( e j , 1 c ~ 1 + e j , 2 c ~ 2 + · · · + e j , N + M c ~ N + M - y i ) 2 E i = e 1,1 e 1,2 . . . e 1 , N + M e 2,1 e 2,2 . . . e 2 , N + M · · · · · · · · · · · · e P , 1 e P , 2 . . . e P , N + M With y i = y 1 y 2 . . . y P
3. image coding-decoding device as claimed in claim 1, wherein, described matrix processing unit comprises QR factorization device, is used for matrix E iBe decomposed into a matrix Q iAnd R iProduct, and be used to form an auxiliary vector Z i, Z wherein i=Q T iY i, and, by matrix R iThe capable formation of preceding N+M R i 1, by vector Z iPreceding N+M element form vector Z i 1
4. image coding-decoding device as claimed in claim 1, wherein, section merges device and comprises the trigonometric ratio device, to described adjacent sections to (S i, S j), it is with formula R i 1 R j 1 c ij = z i 1 z j 1 Triangle turns to following form
Figure C9519800000035
C wherein IjBe to merge one group of right N+M kinematic coefficient of adjacent sections.
5. image coding-decoding device as claimed in claim 4, wherein section merges device according to formula ΔE ij = Σ k = 1 N + M q k 2 .
By being every couple of adjacent sections (S i, S j) calculating square prediction error Δ E IjVariation, determine that predicated error is whether in preset range.
6. as claim 4 or 5 described image coding-decoding devices, wherein, section merges device and utilizes the preceding N+M of trigonometric ratio Matrix Formula capable, determines described merging matrix R Ij 1With combined vector Z Ij 1, like this
7. image coding-decoding device as claimed in claim 1 wherein, includes the mark that merges section in the encoding scheme of described present frame In.
8. image coding-decoding device as claimed in claim 1, wherein, described coefficient destructor is by selectively from matrix R k 1In cancellation i row and from vector Z k 1In i element of cancellation, thereby the Matrix Formula of a simplification is provided R K 1 C K = Z K 1 , From then on, determine the kinematic coefficient C of number minimizing one by one for each section k
9. image coding-decoding device as claimed in claim 8, wherein, the coefficient destructor turns to following form with the Matrix Formula triangle of simplifying:
And by formula ΔE K = q i 2 Calculate the variation of square prediction error.
10. image coding-decoding device as claimed in claim 9, if the variation of square prediction error is in preset range, then the coefficient destructor is with kinematic coefficient C iFrom coefficient matrix C kMiddle cancellation, and, by cancellation last column from second described trigonometric ratio Matrix Formula, represent this formula again.
11. image coding-decoding device as claimed in claim 1, wherein, the coefficient destructor is by finding the solution Matrix Formula R K 1 C K = Z K 1 Or the reduced form of this formula, determine kinematic coefficient C k
CN95198000A 1995-10-20 1995-10-20 Motion vector field coding Expired - Fee Related CN1107413C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN95198000A CN1107413C (en) 1995-10-20 1995-10-20 Motion vector field coding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN95198000A CN1107413C (en) 1995-10-20 1995-10-20 Motion vector field coding

Publications (2)

Publication Number Publication Date
CN1205153A CN1205153A (en) 1999-01-13
CN1107413C true CN1107413C (en) 2003-04-30

Family

ID=5083489

Family Applications (1)

Application Number Title Priority Date Filing Date
CN95198000A Expired - Fee Related CN1107413C (en) 1995-10-20 1995-10-20 Motion vector field coding

Country Status (1)

Country Link
CN (1) CN1107413C (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102204256B (en) * 2008-10-31 2014-04-09 法国电信公司 Image prediction method and system
EP2490448A1 (en) * 2011-02-18 2012-08-22 Siemens Aktiengesellschaft Encoding method and image encoding device for compressing an image sequence

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2628864A1 (en) * 1988-03-21 1989-09-22 France Etat Segmentation of point velocity vectors in image sequence - segmenting velocity data based on movement prediction to generate representative vector for processing
DE4138517A1 (en) * 1991-11-23 1993-05-27 Univ Hannover Object oriented coding of image signals for describing moving objects - identifies objects within vision field that have parameters encoded and used in synthesising process

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2628864A1 (en) * 1988-03-21 1989-09-22 France Etat Segmentation of point velocity vectors in image sequence - segmenting velocity data based on movement prediction to generate representative vector for processing
DE4138517A1 (en) * 1991-11-23 1993-05-27 Univ Hannover Object oriented coding of image signals for describing moving objects - identifies objects within vision field that have parameters encoded and used in synthesising process

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
IEEE TRANSACTIONS ON IMAGE PROCESSING VOL.4,NO.9 1995-09-01 h.zheng:motion-based object segmentation and estimation using the mdl principle *
VISUAL COMMUNICATION AND IMAGE PROCESSING VOL.1605 PART2/02 1991-11-01 labit c:compact motion representation based on global features for semantic image sequence coding *
VISUAL COMMUNICATION AND IMAGE PROCESSING VOL.2501,NO.2 1995-05-01 t.ebrahimi:joint motion estimation and segmentation for very low bit rate video coding *

Also Published As

Publication number Publication date
CN1205153A (en) 1999-01-13

Similar Documents

Publication Publication Date Title
JP7466849B2 (en) TRISOUP node size per slice
US6212235B1 (en) Video encoder and decoder using motion-based segmentation and merging
JP4966669B2 (en) Data compression using matching tracking
JP4573438B2 (en) Video coding
EP1523196A2 (en) Encoding system of motion image containing arbitrary object
JP3781194B2 (en) Motion vector field coding
EP0709809A2 (en) Image encoding and decoding method and apparatus using edge synthesis and inverse wavelet transform
CN101584215B (en) Integrated spatial-temporal prediction
CN112866694B (en) Intelligent image compression optimization method combining asymmetric convolution block and condition context
KR20020026243A (en) Method and device for variable complexity decoding of motion-compensated block-based compressed digital video
CN1370376A (en) Adaptive motion vector field coding
JP2004520744A (en) Method and apparatus for image encoding and decoding and related applications
Efstratiadis et al. Hierarchical partition priority wavelet image compression
US6594375B1 (en) Image processing apparatus, image processing method, and storage medium
JPH08275157A (en) Video signal encoding device
CN1107413C (en) Motion vector field coding
CN105338354A (en) Motion vector estimation method and device
CN112534814B (en) Method and apparatus for coefficient context modeling in video coding
JPH08317410A (en) Method and apparatus for coding image signal
CN1263309C (en) Motion vector prediction method used for video coding
JP5043849B2 (en) Variable shape motion estimation in video sequences
CN100429946C (en) Method and apparatus for processing digital motion picture
CN107181963B (en) A kind of video-frequency compression method and device
CN1222283A (en) Video encoder and decoder using motion-based segmentation and merging
EP4244813B1 (en) Devices and methods for scalable coding for point cloud compression

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20030430

Termination date: 20111020