CN101540902B - Method and device for scaling motion vectors, and method and system for coding/decoding - Google Patents

Method and device for scaling motion vectors, and method and system for coding/decoding Download PDF

Info

Publication number
CN101540902B
CN101540902B CN 200810102353 CN200810102353A CN101540902B CN 101540902 B CN101540902 B CN 101540902B CN 200810102353 CN200810102353 CN 200810102353 CN 200810102353 A CN200810102353 A CN 200810102353A CN 101540902 B CN101540902 B CN 101540902B
Authority
CN
China
Prior art keywords
motion vector
coordinate
scaling
transformation
motion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN 200810102353
Other languages
Chinese (zh)
Other versions
CN101540902A (en
Inventor
林永兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN 200810102353 priority Critical patent/CN101540902B/en
Priority to PCT/CN2008/073864 priority patent/WO2009114986A1/en
Publication of CN101540902A publication Critical patent/CN101540902A/en
Application granted granted Critical
Publication of CN101540902B publication Critical patent/CN101540902B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/112Selection of coding mode or of prediction mode according to a given display mode, e.g. for interlaced or progressive display mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/523Motion estimation or motion compensation with sub-pixel accuracy
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/53Multi-resolution motion estimation; Hierarchical motion estimation

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

本发明实施例涉及一种运动矢量的缩放方法和装置、编解码方法和系统,该运动矢量的缩放方法包括:根据隔行扫描图像的底场与隔行扫描图像的顶场之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量;对所述第二运动矢量进行缩放,得到第三运动矢量;根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量。本发明实施例根据顶场图像和底场图像在空间位置上的偏移,对缩放前后的运动矢量进行半像素补偿,或者说进行坐标变换,变换到反映场图像之间真实位置关系的坐标系下,进行运动矢量的缩放处理,更加符合运动模型,保证了运动矢量缩放的准确性,提高了运动矢量预测精度,从而提高了场图像的编码压缩效率。

Figure 200810102353

The embodiment of the present invention relates to a motion vector scaling method and device, encoding and decoding method and system, the motion vector scaling method includes: according to the coordinate offset between the bottom field of the interlaced scanning image and the top field of the interlacing scanning image performing coordinate transformation on the first motion vector to obtain a second motion vector; scaling the second motion vector to obtain a third motion vector; performing coordinate inverse transformation on the third motion vector according to the coordinate offset, Get the fourth motion vector. In the embodiment of the present invention, half-pixel compensation is performed on the motion vector before and after scaling according to the offset in the spatial position of the top field image and the bottom field image, or coordinate transformation is performed to a coordinate system that reflects the real positional relationship between the field images Next, the scaling process of the motion vector is more in line with the motion model, which ensures the accuracy of the motion vector scaling, improves the prediction accuracy of the motion vector, and thus improves the encoding and compression efficiency of the field image.

Figure 200810102353

Description

The Zoom method of motion vector and device, decoding method and system
Technical field
The present invention relates to multimedia technology field, especially a kind of Zoom method of motion vector and device, decoding method and system.
Background technology
In the video coding compression standard, adopted the motion-vector prediction technology to reduce the required overhead bit of transmitting moving vector.The motion-vector prediction technology comprises spatial domain motion-vector prediction and time domain motion-vector prediction.In the motion-vector prediction of spatial domain, can be according to predicting the motion vector of current macro with current macro motion vector of adjacent macroblocks on the locus; And in the time domain motion-vector prediction, then can be according to predicting the motion vector of current macro with the motion vector of current macro adjacent or close macro block on the time shaft position.
Still be the time domain motion-vector prediction for the spatial domain motion-vector prediction no matter, the motion vector convergent-divergent is generally to adopt in the motion-vector prediction technology.Motion vector is one of most important data in the compressed bit stream, and the accuracy of motion vector convergent-divergent influences the precision of motion vector, and further influences the video coding compression performance.In general, the motion vector convergent-divergent is followed rigid body straight line uniform motion model, carries out corresponding convergent-divergent according to the time domain distance and handles.
In general, video sequence comprises sequence and interlaced line by line.The image of forming video sequence comprises that the two field picture of lining by line scan is that progressive scanning picture and interleaved field picture are horizontally interlaced image, 1 frame can be divided into 2, promptly there are a time difference in field, top and field, the end between field, top and the end field picture, and this time difference equals half of frame period.The field, top is made up of the odd-numbered line pixel, and field, the end is made up of the even number line pixel.Sequence only is made up of two field picture line by line, and interlacedly can comprise two field picture and field picture.
In the prior art, for two field picture and field picture, all carry out identical convergent-divergent and handle.But, because field picture is different from two field picture, there is the skew (being offset pixel half) on the locus between the field, top of field picture and the field, the end, adopt the motion vector convergent-divergent identical to handle the accuracy that can reduce the motion vector convergent-divergent, caused reduction the encoding compression efficient of field picture with the two field picture convergent-divergent.
Summary of the invention
The embodiment of the invention provides a kind of Zoom method and device, decoding method and system of motion vector, in order to improve the accuracy of motion vector convergent-divergent, improves the encoding compression efficient of field picture.
The embodiment of the invention provides a kind of Zoom method of motion vector, comprising:
According to the coordinate offset amount between end field picture and the top field picture first motion vector is carried out coordinate transform, obtain second motion vector;
Described second motion vector is carried out convergent-divergent, obtain the 3rd motion vector;
According to described coordinate offset amount described the 3rd motion vector is carried out the coordinate inverse transformation, obtain the 4th motion vector.
The embodiment of the invention provides a kind of device for zooming of motion vector, comprising:
First conversion module is used for according to the coordinate offset amount between end field picture and the top field picture first motion vector being carried out coordinate transform, obtains second motion vector;
Zoom module is used for described second motion vector is carried out convergent-divergent, obtains the 3rd motion vector;
Second conversion module is used for according to described coordinate offset amount described the 3rd motion vector being carried out the coordinate inverse transformation, obtains the 4th motion vector.
The embodiment of the invention provides a kind of coding method, comprising:
In cataloged procedure, when carrying out the motion-vector prediction processing, the step of motion vector being carried out convergent-divergent comprises:
According to the coordinate offset amount between end field picture and the top field picture first motion vector is carried out coordinate transform, obtain second motion vector;
Described second motion vector is carried out convergent-divergent, obtain the 3rd motion vector;
According to described coordinate offset amount described the 3rd motion vector is carried out the coordinate inverse transformation, obtain the 4th motion vector;
Described the 4th motion vector is used for motion-vector prediction to be handled.
The embodiment of the invention provides a kind of coding/decoding method, comprising:
In decode procedure, when carrying out the motion-vector prediction processing, the step of motion vector being carried out convergent-divergent comprises:
According to the coordinate offset amount between end field picture and the top field picture first motion vector is carried out coordinate transform, obtain second motion vector;
Described second motion vector is carried out convergent-divergent, obtain the 3rd motion vector;
According to described coordinate offset amount described the 3rd motion vector is carried out the coordinate inverse transformation, obtain the 4th motion vector;
Described the 4th motion vector is used for motion-vector prediction to be handled.
The embodiment of the invention provides a kind of coded system, comprising:
First device for zooming is used for first motion vector is carried out convergent-divergent, obtains the 4th motion vector;
First prediction unit is used for carrying out motion-vector prediction at the 4th motion vector that cataloged procedure obtains according to described first device for zooming and handles;
Wherein, described first device for zooming comprises:
First conversion module is used for according to the coordinate offset amount between end field picture and the top field picture first motion vector being carried out coordinate transform, obtains second motion vector;
Zoom module is used for described second motion vector is carried out convergent-divergent, obtains the 3rd motion vector;
Second conversion module is used for according to described coordinate offset amount described the 3rd motion vector being carried out the coordinate inverse transformation, obtains the 4th motion vector.
The embodiment of the invention provides a kind of decode system, comprising:
Second device for zooming is used for first motion vector is carried out convergent-divergent, obtains the 4th motion vector;
Second prediction unit is used for carrying out motion-vector prediction at the 4th motion vector that decode procedure obtains according to described second device for zooming and handles;
Wherein, described second device for zooming comprises:
First conversion module is used for according to the coordinate offset amount between end field picture and the top field picture first motion vector being carried out coordinate transform, obtains second motion vector;
Zoom module is used for described second motion vector is carried out convergent-divergent, obtains the 3rd motion vector;
Second conversion module is used for according to described coordinate offset amount described the 3rd motion vector being carried out the coordinate inverse transformation, obtains the 4th motion vector.
As shown from the above technical solution, embodiment of the invention skew on the locus according to top field picture and end field picture, motion vector before and after the convergent-divergent is carried out the half-pix compensation, carry out coordinate transform in other words, transform under the coordinate system of actual position relation between the reflection field picture, carry out the convergent-divergent of motion vector and handle, meet motion model more, guarantee the accuracy of motion vector convergent-divergent, improved the motion-vector prediction precision, thereby improved the encoding compression efficient of field picture.
Below in conjunction with accompanying drawing specific embodiments of the invention are described in further detail.
Description of drawings
Fig. 1 is the schematic flow sheet of first embodiment of the Zoom method of motion vector of the present invention;
Fig. 2 is the convergent-divergent schematic diagram of motion vector among first embodiment of the Zoom method of motion vector of the present invention;
Fig. 3 is the schematic flow sheet of second embodiment of the Zoom method of motion vector of the present invention;
Fig. 4 is the convergent-divergent schematic diagram of motion vector among second embodiment of the Zoom method of motion vector of the present invention;
Fig. 5 is the structural representation of the device for zooming embodiment of motion vector of the present invention;
Fig. 6 is the schematic flow sheet of coding method embodiment of the present invention;
Fig. 7 is the schematic flow sheet of coding/decoding method embodiment of the present invention;
Fig. 8 is the structural representation of coded system embodiment of the present invention;
Fig. 9 is the structural representation of decode system embodiment of the present invention.
Embodiment
The Zoom method of the motion vector of the embodiment of the invention carries out coordinate transform according to the coordinate offset amount between end field picture and the top field picture to first motion vector, can obtain second motion vector; Second motion vector is carried out convergent-divergent handle, thereby can obtain the 3rd motion vector; According to above-mentioned coordinate offset amount the 3rd motion vector is carried out the coordinate inverse transformation, just can obtain the 4th motion vector, for carrying out follow-up motion-vector prediction.The Zoom method of the motion vector of the embodiment of the invention can guarantee the accuracy of motion vector convergent-divergent, thereby improves the motion-vector prediction precision, and the encoding compression efficient of field picture.
As shown in Figure 1, be the schematic flow sheet of first embodiment of the Zoom method of motion vector of the present invention.Present embodiment is applicable to the motion-vector prediction that carries out based on the spatial domain, may further comprise the steps:
Suppose that scaled motion vector is MV1, the motion vector behind the convergent-divergent is MV2.
Between step 101, top field picture and the end field picture coordinate offset amount delt.
For horizontally interlaced image, there are the skew of half-pixel in vertical direction in top field picture and end field picture, corresponding coordinate offset amount delt=0.5, and unit is whole pixel (pixel);
Step 102, the vertical component of scaled motion vector MV1 is carried out coordinate transform, obtain motion vector MV1 ' according to coordinate offset amount delt.
Scaled MV1 is the motion vector under the former coordinate system; MV1 ' after the corresponding coordinate transform is the motion vector under the new coordinate system.The operation of coordinate transform specifically comprises following situation (only the vertical component of MV1 being carried out coordinate transform):
When motion vector MV1 by top->bottom (top point to a field, the end), MV1 '=MV1+delt;
When motion vector MV1 by bottom->top (field, the end point to top), MV1 '=MV1-delt;
When motion vector MV1 by top->top (top point to top), MV1 '=MV1;
When motion vector MV1 by bottom->bottom (field, the end point to field, the end), MV1 '=MV1.
From above-mentioned situation as can be seen, when the parity of the field picture at the starting point of motion vector and terminal point place not simultaneously, need to consider the influence of coordinate offset delt;
Step 103, to the MV1 ' after the coordinate transform, carry out linear scale according to the rigid body uniform rectilinear motion model along the direction of MV1 ', obtain motion vector MV2 '.
Wherein, MV2 '=scale * MV1 '; In the formula, scale is a zoom factor, and it is relevant with the time gap of the motion vector of convergent-divergent front and back, and this time gap is that motion vector is projected in the length on the time shaft.
Step 104, carry out the coordinate inverse transformation, obtain motion vector MV2 according to the vertical component of the motion vector MV2 ' of coordinate offset amount delt after to convergent-divergent.
Motion vector MV2 ' behind the convergent-divergent is the motion vector under the new coordinate system; MV2 after the corresponding coordinate inverse transformation is the motion vector under the former coordinate system.The operation of coordinate transform specifically comprises following situation (only the vertical component of MV2 ' being carried out coordinate transform):
When motion vector MV2 by top->bottom (top point to a field, the end), MV2=MV2 '-delt;
When motion vector MV2 by bottom->top (field, the end point to top), MV2=MV2 '+delt;
When motion vector MV2 by top->top (top point to top), MV2=MV2 ';
When motion vector MV2 by bottom->bottom (field, the end point to field, the end), MV2=MV2 '.
From above-mentioned situation as can be seen, when the parity of the field picture at the starting point of motion vector and terminal point place not simultaneously, need to consider the influence of coordinate offset delt.
Present embodiment is the offset d elt on the locus according to top field picture and end field picture, its unit is identical with the unit of motion vector, motion vector MV1 before and after the convergent-divergent and MV2 ' are carried out the half-pix compensation, carry out coordinate transform in other words, transform under the coordinate system O ' of actual position relation between the reflection field picture, carrying out the convergent-divergent of motion vector handles, meet motion model more, guaranteed the accuracy of motion vector convergent-divergent, improve the motion-vector prediction precision, thereby improved the encoding compression efficient of field picture.
As shown in Figure 2, be the convergent-divergent schematic diagram of motion vector among first embodiment of the Zoom method of motion vector of the present invention.Original coordinates is O, and new coordinate is O ', and filled circles is represented the top field picture, and open circles is represented end field picture, and both difference are the position difference of field, the end.Under new coordinate system, exist the half-pixel on the vertical direction poor between field, top and the field, the end.New coordinate system has reflected the actual position relation between the field picture, and the convergent-divergent of motion vector should carry out under new coordinate system.Among the figure, time express time axle; Y axle remarked pixel position in vertical direction; Top represents the top field picture; Bottom represents end field picture; Delt is the coordinate offset amount; MV1 and MV2 are respectively the motion vector under the coordinate system O; MV1 ' and MV2 ' are respectively MV1 and MV2 and transform to motion vector under the coordinate system O '; Scale is a zoom factor, under illustrated case, and scale=3.Under new coordinate system, motion vector MV2 ' behind the convergent-divergent and scaled motion vector MV1 ' are overlapping.
Further, in the present embodiment, in step 101, if field picture has been adopted interpolation processing, so corresponding coordinate offset amount is calculated as follows: if adopt 1/4 interpolation, then coordinate offset amount delt=0.5 * 4=2, unit is 1/4 pixel, and the expression half-pixel is equivalent to 2 1/4 samples.For the interpolation method of other precision, can analogize.
Further, in the present embodiment, when the motion vector behind the convergent-divergent and the motion vector direction before the convergent-divergent not simultaneously, zoom factor scale related in the step 103 can be negative.
In the present embodiment, what above-mentioned motion vector was described is the motion of pixel.Further, motion vector can also be used to description block (Block) or macro block (Macro Block, motion MB).Piece and macro block are made up of the pixel of some.Macro block is made up of 16 * 16 pixels, and macro block can be divided into piece.
As shown in Figure 3, be the schematic flow sheet of second embodiment of the Zoom method of motion vector of the present invention.Present embodiment is applicable to the motion-vector prediction that carries out based on time domain, compares with a last embodiment, and the step 103 among the last embodiment item becomes:
Step 103 ', to the MV1 ' after the coordinate transform, carry out linear scale according to the rigid body uniform rectilinear motion model along the direction parallel with MV1 ', obtain motion vector MV2 '.
The starting point of motion vector MV2 ' behind scaled motion vector MV1 ' and the convergent-divergent is positioned on the different time-domain positions, but on the position, spatial domain corresponding relation is arranged.
As shown in Figure 4, be the convergent-divergent schematic diagram of motion vector among second embodiment of the Zoom method of motion vector of the present invention.Original coordinates is O, and new coordinate is O ', and filled circles is represented the top field picture, and open circles is represented end field picture, and both difference are the position difference of field, the end.Under new coordinate system, exist the half-pixel on the vertical direction poor between field, top and the field, the end.New coordinate system has reflected the actual position relation between the field picture, and the convergent-divergent of motion vector should carry out under new coordinate system.Among the figure, time express time axle; Y axle remarked pixel position in vertical direction; Top represents the top field picture; Bottom represents end field picture; Delt is the coordinate offset amount; MV1 and MV2 are respectively the motion vector under the coordinate system O; MV1 ' and MV2 ' are respectively MV1 and MV2 and transform to motion vector under the coordinate system O '; Scale is a zoom factor, under illustrated case, and scale=1/3.Under new coordinate system, the motion vector MV2 ' behind the convergent-divergent is parallel with scaled motion vector MV1 '.
Among the Zoom method embodiment of the invention described above motion vector, the skew on the locus according to top field picture and end field picture, after motion vector before and after the convergent-divergent carried out the half-pix compensation, carrying out the convergent-divergent of motion vector again handles, can guarantee the accuracy of motion vector convergent-divergent, improve the motion-vector prediction precision, thereby improved the encoding compression efficient of field picture.
As shown in Figure 5, be the structural representation of the device for zooming embodiment of motion vector of the present invention.Present embodiment comprises first conversion module 10, Zoom module 20 and second conversion module 30 that connects in turn.Wherein, the coordinate offset amount that first conversion module 10 is used between end field picture and the top field picture is carried out coordinate transform to first motion vector, obtains second motion vector; Zoom module 20 is used for described second motion vector is carried out convergent-divergent, obtains the 3rd motion vector; Second conversion module 30 is used for according to described coordinate offset amount described the 3rd motion vector being carried out the coordinate inverse transformation, obtains the 4th motion vector.
Among the device for zooming embodiment of the invention described above motion vector, the skew on the locus of first conversion module and second conversion module according to top field picture and end field picture, after respectively the motion vector before and after the convergent-divergent being carried out the half-pix compensation, carry out the convergent-divergent processing of motion vector again by Zoom module, can guarantee the accuracy of motion vector convergent-divergent, improve the motion-vector prediction precision, thereby improved the encoding compression efficient of field picture.
Further, still as shown in Figure 5, first conversion module 10 can comprise interconnective first recognition unit 11 and first converter unit 12 in the present embodiment.Wherein, first recognition unit 11 is used to discern the direction of described first motion vector; First converter unit 12 is used for according to the recognition result of described first recognition unit 11 and described coordinate offset amount the vertical component of described first motion vector being carried out coordinate transform, obtains second motion vector.Second conversion module 30 can comprise interconnective second recognition unit 31 and second converter unit 32.Second recognition unit 31 is used to discern the direction of described the 3rd motion vector; Second converter unit 32 is used for according to the recognition result of described second recognition unit 31 and described coordinate offset amount the vertical component of described the 3rd motion vector being carried out coordinate transform, obtains the 4th motion vector.
As shown in Figure 6, be the schematic flow sheet of coding method embodiment of the present invention.Present embodiment is in cataloged procedure, and when carrying out the motion-vector prediction processing, the step of motion vector being carried out convergent-divergent comprises:
Step 601, first motion vector is carried out coordinate transform, obtain second motion vector according to the coordinate offset amount between end field picture and the top field picture;
Step 602, described second motion vector is carried out convergent-divergent, obtain the 3rd motion vector;
Step 603, described the 3rd motion vector is carried out the coordinate inverse transformation, obtain the 4th motion vector according to described coordinate offset amount;
Step 604, described the 4th motion vector is used for the cataloged procedure motion-vector prediction handles.
Present embodiment when carrying out the motion-vector prediction processing, carries out convergent-divergent to motion vector and handles in cataloged procedure, promptly according to the coordinate offset amount between end field picture and the top field picture first motion vector is carried out coordinate transform, can obtain second motion vector; Second motion vector is carried out convergent-divergent handle, thereby can obtain the 3rd motion vector; According to above-mentioned coordinate offset amount the 3rd motion vector is carried out the coordinate inverse transformation, just can obtain the 4th motion vector; The 4th motion vector is used for motion-vector prediction to be handled.Present embodiment has guaranteed the accuracy of motion vector convergent-divergent, thereby has improved the motion-vector prediction precision, and the encoding compression efficient of field picture, has further improved the efficient of coding.
As shown in Figure 7, be the schematic flow sheet of coding/decoding method embodiment of the present invention.Present embodiment is in decode procedure, and when carrying out the motion-vector prediction processing, the step of motion vector being carried out convergent-divergent comprises:
Step 701, first motion vector is carried out coordinate transform, obtain second motion vector according to the coordinate offset amount between end field picture and the top field picture;
Step 702, described second motion vector is carried out convergent-divergent, obtain the 3rd motion vector;
Step 703, described the 3rd motion vector is carried out the coordinate inverse transformation, obtain the 4th motion vector according to described coordinate offset amount;
Step 704, the motion-vector prediction that described the 4th motion vector is used for decode procedure are handled.
Present embodiment when carrying out the motion-vector prediction processing, carries out convergent-divergent to motion vector and handles in decode procedure, promptly according to the coordinate offset amount between end field picture and the top field picture first motion vector is carried out coordinate transform, can obtain second motion vector; Second motion vector is carried out convergent-divergent handle, thereby can obtain the 3rd motion vector; According to above-mentioned coordinate offset amount the 3rd motion vector is carried out the coordinate inverse transformation, just can obtain the 4th motion vector; The 4th motion vector is used for motion-vector prediction to be handled.Present embodiment has guaranteed the accuracy of motion vector convergent-divergent, thereby has improved the motion-vector prediction precision, and the encoding compression efficient of field picture, has further improved the efficient of decoding.
As shown in Figure 8, be the structural representation of coded system embodiment of the present invention.Present embodiment comprises interconnective first device for zooming 1 and first prediction unit 2.Wherein, first device for zooming 1 is used for first motion vector is carried out convergent-divergent, obtains the 4th motion vector; First prediction unit 2 is used for carrying out motion-vector prediction at the 4th motion vector that cataloged procedure obtains according to first device for zooming 1 to be handled.Wherein, first device for zooming 1 can comprise first conversion module 10, Zoom module 20 and second conversion module 30 that connects in turn.Wherein, first conversion module 10 is used for according to the coordinate offset amount between end field picture and the top field picture first motion vector being carried out coordinate transform, obtains second motion vector; Zoom module 20 is used for described second motion vector is carried out convergent-divergent, obtains the 3rd motion vector; Second conversion module 30 is used for according to described coordinate offset amount described the 3rd motion vector being carried out the coordinate inverse transformation, obtains the 4th motion vector.
Among the invention described above coded system embodiment, the skew on the locus of first conversion module in first device for zooming and second conversion module according to top field picture and end field picture, after respectively the motion vector before and after the convergent-divergent being carried out the half-pix compensation, carry out the convergent-divergent processing of motion vector again by Zoom module, handle according to the motion-vector prediction that the 4th motion vector carries out in the cataloged procedure by first prediction unit at last, can guarantee the accuracy of motion vector convergent-divergent, improved the motion-vector prediction precision, thereby improved the encoding compression efficient of field picture, further improved the efficient of coding.
As shown in Figure 9, be the structural representation of decode system embodiment of the present invention.Present embodiment comprises interconnective second device for zooming 3 and second prediction unit 4.Wherein, first device for zooming 3 is used for first motion vector is carried out convergent-divergent, obtains the 4th motion vector; First prediction unit 4 is used for carrying out motion-vector prediction at the 4th motion vector that decode procedure obtains according to second device for zooming 3 to be handled.Wherein, first device for zooming 3 can comprise first conversion module 10, Zoom module 20 and second conversion module 30 that connects in turn.Wherein, first conversion module 10 is used for according to the coordinate offset amount between end field picture and the top field picture first motion vector being carried out coordinate transform, obtains second motion vector; Zoom module 20 is used for described second motion vector is carried out convergent-divergent, obtains the 3rd motion vector; Second conversion module 30 is used for according to described coordinate offset amount described the 3rd motion vector being carried out the coordinate inverse transformation, obtains the 4th motion vector.
Among the invention described above decode system embodiment, the skew on the locus of first conversion module in second device for zooming and second conversion module according to top field picture and end field picture, after respectively the motion vector before and after the convergent-divergent being carried out the half-pix compensation, carry out the convergent-divergent processing of motion vector again by Zoom module, handle according to the motion-vector prediction that the 4th motion vector carries out in the decode procedure by second prediction unit at last, can guarantee the accuracy of motion vector convergent-divergent, improved the motion-vector prediction precision, thereby improved the encoding compression efficient of field picture, further improved the efficient of decoding.
In the above embodiment of the present invention, there are the fact of the position deviation on the vertical direction in top field picture and end field picture, have influenced code efficiency.In the video coding framework, to consider that in motion vector prediction process other also will consider this point as interpolation, loop filtering etc. the above-mentioned this point.By improving interpolation, loop filtering, improve precision, thereby improve encoding compression efficient.
Through the above description of the embodiments, those of ordinary skill in the art can be well understood to the present invention and can realize by hardware, also can realize by the mode that software adds essential general hardware platform.Based on such understanding, technical scheme of the present invention can be written as computer executable program, and computer equipment can be realized method provided by the invention by carrying out this program.This computer executable program can be stored in the non-volatile memory medium and (comprise CD-ROM, USB flash disk, portable hard drive etc.).
It should be noted that at last: above embodiment only in order to technical scheme of the present invention to be described, is not intended to limit; Although with reference to previous embodiment the present invention is had been described in detail, those of ordinary skill in the art is to be understood that: it still can be made amendment to the technical scheme that aforementioned each embodiment put down in writing, and perhaps part technical characterictic wherein is equal to replacement; And these modifications or replacement do not make the essence of appropriate technical solution break away from the spirit and scope of various embodiments of the present invention technical scheme.

Claims (18)

1.一种运动矢量的缩放方法,其特征在于包括:1. A scaling method of a motion vector, characterized in that it comprises: 根据底场图像与顶场图像之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量;performing coordinate transformation on the first motion vector according to the coordinate offset between the bottom field image and the top field image to obtain a second motion vector; 对所述第二运动矢量进行缩放,得到第三运动矢量;Scaling the second motion vector to obtain a third motion vector; 根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量。Perform coordinate inverse transformation on the third motion vector according to the coordinate offset to obtain a fourth motion vector. 2.根据权利要求1所述的运动矢量的缩放方法,其特征在于所述坐标偏移量的单位与运动矢量的单位相同。2. The scaling method of the motion vector according to claim 1, characterized in that the unit of the coordinate offset is the same as the unit of the motion vector. 3.根据权利要求2所述的运动矢量的缩放方法,其特征在于若未对场图像作插值处理,所述坐标偏移量为0.5,单位为像素。3. The scaling method of the motion vector according to claim 2, wherein if no interpolation process is performed on the field image, the coordinate offset is 0.5, and the unit is pixel. 4.根据权利要求2所述的运动矢量的缩放方法,其特征在于若对场图像作1/4插值处理,所述坐标偏移量等于0.5×4也即等于2,单位为1/4像素。4. The scaling method of the motion vector according to claim 2, wherein if the field image is subjected to 1/4 interpolation processing, the coordinate offset is equal to 0.5×4, which is equal to 2, and the unit is 1/4 pixel . 5.根据权利要求1、2、3或4所述的运动矢量的缩放方法,其特征在于所述对第一运动矢量进行坐标变换具体包括:对第一运动矢量的垂直分量进行坐标变换。5 . The scaling method for motion vectors according to claim 1 , 2 , 3 or 4 , characterized in that said performing coordinate transformation on the first motion vector specifically comprises: performing coordinate transformation on a vertical component of the first motion vector. 6 . 6.根据权利要求5所述的运动矢量的缩放方法,其特征在于所述对第一运动矢量的垂直分量进行坐标变换具体包括:6. The scaling method of the motion vector according to claim 5, wherein said carrying out coordinate transformation to the vertical component of the first motion vector specifically comprises: 若所述第一运动矢量由顶场指向底场,则将所述第一运动矢量的垂直分量加上所述坐标偏移量;If the first motion vector points from the top field to the bottom field, add the coordinate offset to the vertical component of the first motion vector; 若所述第一运动矢量由底场指向顶场,则将所述第一运动矢量的垂直分量减去所述坐标偏移量; If the first motion vector points to the top field from the bottom field, then subtract the coordinate offset from the vertical component of the first motion vector; 若所述第一运动矢量由顶场指向顶场或者由底场指向底场,则不对所述第一运动矢量的垂直分量作变换。 If the first motion vector points from the top field to the top field or from the bottom field to the bottom field, the vertical component of the first motion vector is not transformed. the 7.根据权利要求1、2、3或4所述的运动矢量的缩放方法,其特征在于所述对所述第二运动矢量进行缩放具体包括:按照刚体匀速直线运动模型对所述第二运动矢量进行线性缩放。 7. The scaling method of the motion vector according to claim 1, 2, 3 or 4, wherein said scaling the second motion vector specifically comprises: scaling the second motion according to a rigid body uniform linear motion model Vectors are scaled linearly. the 8.根据权利要求7所述的运动矢量的缩放方法,其特征在于所述按照刚体匀速直线运动模型对所述第二运动矢量进行线性缩放具体包括:按照刚体匀速直线运动模型沿所述第二运动矢量的方向或与所述第二运动矢量平行的方向对所述第二运动矢量进行线性缩放。 8. The scaling method of the motion vector according to claim 7, wherein said performing linear scaling on the second motion vector according to the rigid body uniform linear motion model specifically comprises: according to the rigid body uniform linear motion model along the second The direction of the motion vector or a direction parallel to the second motion vector linearly scales the second motion vector. the 9.根据权利要求8所述的运动矢量的缩放方法,其特征在于线性缩放的缩放系数取决于所述第一运动矢量与所述第四运动矢量投影在时间轴上的长度,若所述第一运动矢量与所述第四运动矢量的方向不相同时,所述缩放系数为负数。 9. The scaling method of motion vectors according to claim 8, characterized in that the scaling coefficient of linear scaling depends on the lengths projected on the time axis of the first motion vector and the fourth motion vector, if the first motion vector When a motion vector is in a different direction from the fourth motion vector, the scaling factor is a negative number. the 10.根据权利要求1、2、3或4所述的运动矢量的缩放方法,其特征在于所述对所述第三运动矢量进行坐标反变换具体包括:对所述第三运动矢量的垂直分量进行坐标反变换。 10. The scaling method of motion vector according to claim 1, 2, 3 or 4, characterized in that said inverse coordinate transformation of said third motion vector specifically comprises: vertical component of said third motion vector Perform coordinate inverse transformation. the 11.根据权利要求10所述的运动矢量的缩放方法,其特征在于所述对所述第三运动矢量的垂直分量进行坐标反变换具体包括: 11. The scaling method of motion vector according to claim 10, it is characterized in that said carrying out coordinate inverse transformation to the vertical component of described 3rd motion vector specifically comprises: 若所述第三运动矢量由顶场指向底场,则将所述第三运动矢量的垂直分量减去所述坐标偏移量; If the third motion vector is directed from the top field to the bottom field, then subtract the coordinate offset from the vertical component of the third motion vector; 若所述第三运动矢量由底场指向顶场,则将所述第三运动矢量的垂直分量加上所述坐标偏移量; If the third motion vector points from the bottom field to the top field, then add the coordinate offset to the vertical component of the third motion vector; 若所述第三运动矢量由顶场指向顶场或者由底场指向底场,则不对所述第三运动矢量的垂直分量作变换。 If the third motion vector points from the top field to the top field or from the bottom field to the bottom field, the vertical component of the third motion vector is not transformed. the 12.一种运动矢量的缩放装置,其特征在于包括: 12. A scaling device for motion vectors, characterized in that it comprises: 第一变换模块,用于根据底场图像与顶场图像之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量; The first transformation module is used to carry out coordinate transformation to the first motion vector according to the coordinate offset between the bottom field image and the top field image, to obtain the second motion vector; 缩放模块,用于对所述第二运动矢量进行缩放,得到第三运动矢量; A scaling module, configured to scale the second motion vector to obtain a third motion vector; 第二变换模块,用于根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量。 The second transformation module is configured to perform coordinate inverse transformation on the third motion vector according to the coordinate offset to obtain a fourth motion vector. the 13.根据权利要求12所述的运动矢量的缩放装置,其特征在于所述第一变换模块包括: 13. The scaling device of motion vector according to claim 12, is characterized in that described first transformation module comprises: 第一识别单元,用于识别所述第一运动矢量的方向; a first identification unit, configured to identify the direction of the first motion vector; 第一变换单元,用于根据所述第一识别单元的识别结果和所述坐标偏移量对所述第一运动矢量的垂直分量进行坐标变换,得到第二运动矢量。 The first transformation unit is configured to perform coordinate transformation on the vertical component of the first motion vector according to the recognition result of the first recognition unit and the coordinate offset to obtain a second motion vector. the 14.根据权利要求12所述的运动矢量的缩放装置,其特征在于所述第二变换模块包括: 14. The scaling device of motion vector according to claim 12, is characterized in that described second transformation module comprises: 第二识别单元,用于识别所述第三运动矢量的方向; a second identification unit, configured to identify the direction of the third motion vector; 第二变换单元,用于根据所述第二识别单元的识别结果和所述坐标偏移量对所述第三运动矢量的垂直分量进行坐标变换,得到第四运动矢量。 The second transformation unit is configured to perform coordinate transformation on the vertical component of the third motion vector according to the recognition result of the second recognition unit and the coordinate offset to obtain a fourth motion vector. the 15.一种编码方法,其特征在于,包括: 15. A coding method, characterized in that, comprising: 在编码过程中,在进行运动矢量预测处理时,对运动矢量进行缩放的步骤包括: In the encoding process, when performing motion vector prediction processing, the steps of scaling the motion vector include: 根据底场图像与顶场图像之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量; Carry out coordinate transformation to the first motion vector according to the coordinate offset between the bottom field image and the top field image to obtain the second motion vector; 对所述第二运动矢量进行缩放,得到第三运动矢量; Scaling the second motion vector to obtain a third motion vector; 根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量; Perform coordinate inverse transformation on the third motion vector according to the coordinate offset to obtain a fourth motion vector; 将所述第四运动矢量用于运动矢量预测处理。 The fourth motion vector is used for motion vector prediction processing. the 16.一种解码方法,其特征在于,包括: 16. A decoding method, characterized in that, comprising: 在解码过程中,进行运动矢量预测处理时,对运动矢量进行缩放的步骤包括: In the decoding process, when performing motion vector prediction processing, the steps of scaling the motion vector include: 根据底场图像与顶场图像之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量; Carry out coordinate transformation to the first motion vector according to the coordinate offset between the bottom field image and the top field image to obtain the second motion vector; 对所述第二运动矢量进行缩放,得到第三运动矢量; Scaling the second motion vector to obtain a third motion vector; 根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量; Perform coordinate inverse transformation on the third motion vector according to the coordinate offset to obtain a fourth motion vector; 将所述第四运动矢量用于运动矢量预测处理。 The fourth motion vector is used for motion vector prediction processing. the 17.一种编码系统,其特征在于,包括: 17. A coding system, characterized in that, comprising: 第一缩放装置,用于对第一运动矢量进行缩放,得到第四运动矢量; The first scaling device is used to scale the first motion vector to obtain the fourth motion vector; 第一预测装置,用于在编码过程中根据所述第一缩放装置得到的第四运动矢量进行运动矢量预测处理; The first prediction means is used to perform motion vector prediction processing according to the fourth motion vector obtained by the first scaling means during the encoding process; 其中,所述第一缩放装置包括: Wherein, the first scaling device includes: 第一变换模块,用于根据底场图像与顶场图像之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量; The first transformation module is used to carry out coordinate transformation to the first motion vector according to the coordinate offset between the bottom field image and the top field image, to obtain the second motion vector; 缩放模块,用于对所述第二运动矢量进行缩放,得到第三运动矢量; A scaling module, configured to scale the second motion vector to obtain a third motion vector; 第二变换模块,用于根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量。 The second transformation module is configured to perform coordinate inverse transformation on the third motion vector according to the coordinate offset to obtain a fourth motion vector. the 18.一种解码系统,其特征在于,包括: 18. A decoding system, characterized in that, comprising: 第二缩放装置,用于对第一运动矢量进行缩放,得到第四运动矢量; The second scaling device is used to scale the first motion vector to obtain the fourth motion vector; 第二预测装置,用于在解码过程中根据所述第二缩放装置得到的第四运动矢量进行运动矢量预测处理; The second prediction means is used to perform motion vector prediction processing according to the fourth motion vector obtained by the second scaling means during the decoding process; 其中,所述第二缩放装置包括: Wherein, the second scaling device includes: 第一变换模块,用于根据底场图像与顶场图像之间的坐标偏移量对第一运动矢量进行坐标变换,得到第二运动矢量; The first transformation module is used to carry out coordinate transformation to the first motion vector according to the coordinate offset between the bottom field image and the top field image, to obtain the second motion vector; 缩放模块,用于对所述第二运动矢量进行缩放,得到第三运动矢量; A scaling module, configured to scale the second motion vector to obtain a third motion vector; 第二变换模块,用于根据所述坐标偏移量对所述第三运动矢量进行坐标反变换,得到第四运动矢量。 The second transformation module is configured to perform coordinate inverse transformation on the third motion vector according to the coordinate offset to obtain a fourth motion vector. the
CN 200810102353 2008-03-20 2008-03-20 Method and device for scaling motion vectors, and method and system for coding/decoding Active CN101540902B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN 200810102353 CN101540902B (en) 2008-03-20 2008-03-20 Method and device for scaling motion vectors, and method and system for coding/decoding
PCT/CN2008/073864 WO2009114986A1 (en) 2008-03-20 2008-12-30 Method, device for scaling motion vector and encoding, decoding method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200810102353 CN101540902B (en) 2008-03-20 2008-03-20 Method and device for scaling motion vectors, and method and system for coding/decoding

Publications (2)

Publication Number Publication Date
CN101540902A CN101540902A (en) 2009-09-23
CN101540902B true CN101540902B (en) 2011-02-02

Family

ID=41090480

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200810102353 Active CN101540902B (en) 2008-03-20 2008-03-20 Method and device for scaling motion vectors, and method and system for coding/decoding

Country Status (2)

Country Link
CN (1) CN101540902B (en)
WO (1) WO2009114986A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5786478B2 (en) * 2011-06-15 2015-09-30 富士通株式会社 Moving picture decoding apparatus, moving picture decoding method, and moving picture decoding program
PL231159B1 (en) * 2011-09-09 2019-01-31 Kt Corp Method for achieving temporary predictive vector of motion and the device for application of this method
CN106331722B (en) 2015-07-03 2019-04-26 华为技术有限公司 Image prediction method and related equipment
CN107135397B (en) * 2017-04-28 2018-07-06 中国科学技术大学 A kind of panorama video code method and apparatus
CN109729352B (en) * 2017-10-27 2020-07-21 华为技术有限公司 Method and device for determining motion vector of affine coding block
CN110868611B (en) * 2018-08-27 2023-09-12 北京三星通信技术研究有限公司 Video encoding and decoding methods and devices
CN109068140B (en) * 2018-10-18 2021-06-22 北京奇艺世纪科技有限公司 Method and device for determining motion vector in video coding and decoding equipment
CN112087626B (en) * 2020-08-21 2024-07-26 西安万像电子科技有限公司 Image processing method, device and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1142733A (en) * 1995-08-08 1997-02-12 大宇电子株式会社 Image processing system using feature point-based motion estimation
CN1189057A (en) * 1996-10-31 1998-07-29 松下电器产业株式会社 Motion picture encoding method and device and its encoding program recording medium
US6430223B1 (en) * 1997-11-01 2002-08-06 Lg Electronics Inc. Motion prediction apparatus and method
CN1520179A (en) * 2003-02-03 2004-08-11 三星电子株式会社 Method and apparatus for encoding/decoding interlaced video signals

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4158442B2 (en) * 2002-07-18 2008-10-01 富士通株式会社 Video encoding method and video decoding method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1142733A (en) * 1995-08-08 1997-02-12 大宇电子株式会社 Image processing system using feature point-based motion estimation
CN1189057A (en) * 1996-10-31 1998-07-29 松下电器产业株式会社 Motion picture encoding method and device and its encoding program recording medium
US6430223B1 (en) * 1997-11-01 2002-08-06 Lg Electronics Inc. Motion prediction apparatus and method
CN1520179A (en) * 2003-02-03 2004-08-11 三星电子株式会社 Method and apparatus for encoding/decoding interlaced video signals

Also Published As

Publication number Publication date
CN101540902A (en) 2009-09-23
WO2009114986A1 (en) 2009-09-24

Similar Documents

Publication Publication Date Title
CN101540902B (en) Method and device for scaling motion vectors, and method and system for coding/decoding
US8009923B2 (en) Method and system for motion estimation with multiple vector candidates
CN102316317A (en) Method and device for generating predicted value of picture
US11805259B2 (en) Non-affine blocks predicted from affine motion
CN104427345A (en) Motion vector acquisition method, acquisition device, video codec and method thereof
US11115678B2 (en) Diversified motion using multiple global motion models
KR20110117082A (en) Image processing apparatus, decoding method, intra-frame decoding apparatus, intra-frame decoding method, and intra-frame encoding apparatus
CN107360433B (en) Inter-frame prediction coding method and device
Guo et al. Learning cross-scale weighted prediction for efficient neural video compression
CN102611887B (en) The coordinate figure of non-Integer Pel position motion vector rounds method and apparatus
JP2023514215A (en) Intra prediction for image and video compression
US20240146963A1 (en) Method and apparatus for talking face video compression
CN100502511C (en) Interpolated Image Memory Organization Method for Fractional Pixel Precision Motion Prediction
CN110278434A (en) Method, device and storage medium for fast multi-composite frame video coding
CN113709458B (en) Displacement vector prediction method, device and equipment in video coding and decoding
CN112203091B (en) Motion vector prediction method, system and computer medium based on quadratic polynomial
CN101194513B (en) Up-down alternative motion vector
EP3952314B1 (en) Inter-frame prediction method and apparatus, device and storage medium
CN114727114B (en) Method and device for determining motion vector
KR20090014371A (en) Motion detection device
CN1283107C (en) Quick movement prediction method and structure thereof
CN113906741A (en) Image decoding device, image decoding method, and program
CN102651814A (en) Video decoding method, video encoding method and terminal
KR101126294B1 (en) Apparatus and method for coding multi-view video
JP2866325B2 (en) Motion vector search device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant