CN101090491B - Be used for the block-based motion estimation algorithm of the enhancing of video compress - Google Patents

Be used for the block-based motion estimation algorithm of the enhancing of video compress Download PDF

Info

Publication number
CN101090491B
CN101090491B CN200610064777.XA CN200610064777A CN101090491B CN 101090491 B CN101090491 B CN 101090491B CN 200610064777 A CN200610064777 A CN 200610064777A CN 101090491 B CN101090491 B CN 101090491B
Authority
CN
China
Prior art keywords
motion vector
value
position candidate
image
piece
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200610064777.XA
Other languages
Chinese (zh)
Other versions
CN101090491A (en
Inventor
区子廉
黄海明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hong Kong University of Science and Technology HKUST
Original Assignee
Hong Kong University of Science and Technology HKUST
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/455,009 external-priority patent/US8761258B2/en
Application filed by Hong Kong University of Science and Technology HKUST filed Critical Hong Kong University of Science and Technology HKUST
Publication of CN101090491A publication Critical patent/CN101090491A/en
Application granted granted Critical
Publication of CN101090491B publication Critical patent/CN101090491B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A kind of method, system and software are proposed, for obtaining the piece of first image similar with the piece of the second image (" reference picture "). Piece to the first image is sequentially processed, and is the each multiple position candidate that obtain in the second image, and is each position candidate assessment cost function. Each position candidate in the second image is to be replaced by the motion vector separately of the piece from the first image. In a first aspect of the present invention, described cost function is the function of the motion vectors of the following piece (also there is no the piece of the first processed image) of the first image. In second aspect present invention, described motion vector is provided by positional value, and described positional value is not all both full-pixel space, half-pix space or 1/4 pixel space.

Description

Be used for the block-based motion estimation algorithm of the enhancing of video compress
Related application
The application's request in the priority of the U.S. Provisional Patent Application 60/691,181 of on June 17th, 2005 application,At this, its entirety is incorporated herein by reference.
Technical field
The present invention relates generally to the method and system for data signal compression, coding and expression, and more definitelySay, the present invention relates to use the method and system of multiframe estimation (ME). The invention further relates to a kind of computer programProduct, such as recording medium, the programmed instruction that carrying can be read by computing equipment, to make described computing equipment carry out basisA kind of method of the present invention.
Background technology
Due to the huge size of the raw digital video data (or image sequence) that used by modern multimedia application, mustMust compress to can transmit and store these data this data. There are many important video compression standards, compriseISO/IECMPEG-1, MPEG-2, MPEG-4 standard and ITU-TH.261, H.263, standard H.264. ISO/IECMPEG-1/2/4 standard apply to widely show business with distribution film, comprise the digital video of video compact disc or VCD (MPEG-1)Broadcast, digital video disk or digital multi-purpose CD or DVD (MPEG-2), recordable DVD (MPEG-2), digital videoBroadcast, DVB or DVB (MPEG-2), video request program or VOD (MPEG-2), high-definition television in USOr HDTV (MPEG-2) etc. MPEG-4 standard is more advanced than MPEG-2, can under lower bit rate, realize high-qualityAmount video, this make its be very suitable for internet, digital wireless network (for example 3G network), MMS (Multimedia Message Service) (fromThe MMS standard of 3GPP) etc. on video stream. MPEG-4 is by fine definition DVD of future generation (HD-DVD) standard and MMS markAccurate institute accepts. ITU-TH.261/3/4 standard design is for low deferred telegram phonovision and video conferencing system. In early days H.261Standard design is at p*Under 64kbit/s, work, p=1,2 .., 31. H.263 the standard in later stage is extremely successful, is used widelyIn advance TV conference system, and for the video stream of broadband network and wireless network, wherein wireless network comprisesMMS (Multimedia Message Service) (MMS) in 2.5G and 3G network and other networks. Newest standards, H.264 (also referred to as MPEG-4Version 10, or MPEG-4AVC) be the video compression standard of current state-of-the-art. It is so powerful so that MPEG decisionDevelop jointly with the ITU-T in the framework of JointVideoTeam (JVT). H.264 new standard is called in ITU-T,And be known as MPEG-4 advanced video coding (MPEG-4AVC), or MPEG-4 version 10. H.264 for HD-DVD standard,Directly video broadcasting (DVB) standard and may be used for MMS standard. Based on H.264, currently developing and be called audiovisual in ChinaThe relevant criterion of standard (AVS). AVS1.0 is designed for high-definition television (HDTV). AVS-M is designed for mobile application.H.264 have and exceed MPEG-1/2/4 and the H.261/3 target of standard and subjective video quality. Except using, integer 4x4 is discreteCosine transform (DCT) replaces beyond traditional 8x8DCT, and H.263 the basic coding class of algorithms H.264[1] is similar to or MPEG-4, and also have extra feature, comprise the inter-frame forecast mode of I frame, for the multiple block size of motion estimation/compensation andMultiple reference frame, for deblocking effect (in-in 1/4th pixel precisions of estimation, loopLoopdeblocking) wave filter, context adaptive binary arithmetic coding (contextadaptivebinaryArithmeticcoding), etc.
Estimation is the core of most of video compression standards (such as MPEG-1/2/4 and H.261/3/4), itsThe redundancy that employs one's time to the best advantage, therefore its performance directly affect video coding system compression efficiency, well as subjective video quality andCoding rate.
Estimate in (BMME) at block matching motion, in ME, the most general to the distortion between current block and reference blockMeasurement be absolute difference and (SAD), for N × N piece, be defined as
SAD ( mvx , mvy ) = Σ m = 0 , n = 0 N - 1 | F t ( x + m , y + n ) - F t - 1 ( x + m + mvx , y + n + mvy ) |
Wherein FtPresent frame, Ft-1Be standard frame, (mvx, mvy) represents current motion vector (MV). For width=X,Highly=Y, and the frame of block size=N × N, in hunting zone ±WIn need to assess to search optimum fortune to SADThe search point sum of dynamic vector equals:
( X N ) ( Y N ) ( 2 W + 1 ) 2 ,
For X=352, Y=288, in the situation of N=16 and W=32, it equals 1673100. This is at Video codingIn device, consume the huge numerical value of huge computing capability. Having proposed many fast algorithms [2]-[9] reduces in ME and searches forThe number of point, for example three step search (TSS) [11], 2D logarithm search [12], new three step search (NTSS) [3], MVFAST[7],And PMVFAST[2]. MVFAST and PMVFAST are better than first three algorithm significantly, because they use intermediate value motion vector pre-Survey device and depart from ME as search center implementation center, reduced thus the number of the position of MV coding by smooth motion vector field.
PMVFAST algorithm (it is the important improvement to MVFAST and other fast algorithm, and thereby by mpeg standard[10] accept) consider at first one group of MV fallout predictor, comprise intermediate value, zero point, the left side, top, top-right and previous common position(co-located) MV fallout predictor. Fig. 1 for example understand current block, left, top block, upper right square, upper right square andThe position of right (it is the following piece " of ", that is, processed piece after current block). It calculates the SAD cost of each prediction. ?In development afterwards, PMVFAST is modified to calculate RD (rate distortion) cost [13] and substitute the following cost letter of useThe SAD cost of number:
J(m,λmotion)=SAD(s,c(m))+λmotion(R(m-p))(1)
Wherein s is raw video signal, and c is reference video signal, and m is current MV, and p is the intermediate value MV prediction of current blockDevice, λmotionBe Lagrange multiplier, R (m-p) represents the position for coded motion information. Next step in PMVFASTBe the MV fallout predictor of selecting to have minimum cost, and it is large to carry out number of executions according to the value of the minimum cost obtaining from MV fallout predictorDiamond search (diamondsearch) or little diamond search.
In definition current video coding standard, independent still major issue is to use sub-pix pr-set vector, comprises half pictureElement, 1/4 pixel or 1/8 pixel motion vector even, it provides the describing more accurately of motion, and can carryThe PSNR gain of the about 1dB estimating for whole pixel motion. Adopt half-pixel accuracy, motion vector can adopt equidistant positionValue, such as 0.0,0.5,1.0,1.5,2.0 etc. Adopt 1/4 pixel precision, motion vector can adopt such as 0.00,0.25,0.50,0.75,1.00,1.25,1.50,1.75,2.00 etc. and so on positional value. Adopt 1/8 pixel precision, motion vector canTo adopt such as 0.000,0.125,0.250,0.375,0.500,0.625,0.750,0.875,1.000,1.125,1.250,1.375,1.500,1.625,1.750,1.875,2.000 etc. and so on positional value.
As everyone knows, motion vector distribution trends towards off-centring, this means that motion vector trends towards approaching in the extremeIn (0,0). In Fig. 6 (a), this situation has been shown, its be depicted as (0,0) MV use completely search (FS) algorithmMotion vector distribution in Foreman sequence. In addition,, as shown in Fig. 6 (b), motion vector distribution is also to median prediction device (intermediate valueMV) skew, it is the intermediate value of left of motion vector, top block and upper right square shown in Figure 1. In addition, as Fig. 6 (c) instituteShow, motion vector is also to institute in the adjacent motion vectors in present frame (leftMV, topMV, topRightMV) and previous frameMotion vector (preMV) skew arranging, as shown in Fig. 6 (d). These can be all for the motion vector of current vector canThe fallout predictor of considering, and they can be used to PMVFAST.
Summary of the invention
The present invention seeks to be provided for the new and useful technology of estimation, it is applicable to data signal and pressesContracting, coding and the method and system of expressing.
Particularly, the present invention manages to provide new and useful effective exercise estimation technique, and it can for example be applied inMPEG-1, MPEG-2, MPEG-4, H.261, H.263, H.264 or in AVS or other associated video coding standard.
The realization of first aspect present invention based on such: although the estimation of PMVFAST algorithm is with compared with front technologyReally there is advantage, but it is not best. In principle, for the each frame in video, existence makes whole on the wholeMotion vector field { the m of the RD Least-cost of individual framei,j,i=0..M-1,j=0..N-1}:
total _ RD _ Cost = Σ i = 0 M - 1 Σ j = 0 N - 1 [ SAD ( s i , j , c ( m i , j ) ) + λ i , j ( R ( m i , j - p i , j ) ) ] - - - ( 2 )
Wherein (i, j) is illustrated in (i, j) the individual piece in the frame that comprises M × N piece. For fixing Qp(it is to quantize ginsengNumber), λi,j=λ=constant, and
pi,j=median(mi,j-1,mi-1,j,mi-1,j+1)(3)
But, consider that whole RD costs of whole frame need exponential computational complexity simultaneously, this is unpractiaca.Thereby PMVFAST and other algorithm known are each only considers the only RD cost of a piece, instead of whole pieces in a frame.
Particularly, MVFAST or PMVFAST do not consider that this causes in the time deriving with respect to the motion vector of current blockThe variation in intermediate value MV fallout predictor of next piece. This can affect the level and smooth of whole motion vector field.
In general, a first aspect of the present invention is by improving the cost definition of PMVFAST and motion predictor candidateSelect and proposed a kind of new ME algorithm. Particularly, for each current block of the first image (current block can be 16 ×16,16 × 8,8 × 16,8 × 8,4 × 8,8 × 4,4 × 4 or other rectangle length, even non-rectangle), according to a cost functionSelect the second image (reference picture) similar, described cost function comprises two: (i) current block and the not phase of similarSeemingly measure the item of (for example SAD, SAE), and (ii) conduct is at least the prediction of the motion vector of the following piece to the first imageThe item of function.
Particularly, the algorithm proposing (will not be located by comprising current intermediate value MV fallout predictor and comprising future so farReason) the estimation intermediate value MV fallout predictor of encoding block, making to improve sports ground flatness becomes possibility.
Many variations of the present invention are possible. Particularly, described can have any size and any shape.
Can there be multiple the second images (being multiple reference values) and described search can be included in all the second imagesPosition candidate.
In addition, can by form together larger region in described the first image and adopt and determined by coded digitalMultiple sub-blocks that the coded sequence of justice is encoded are carried out described new cost function. These sub-blocks need not have formed objects orShape.
The present invention has on the other hand, and it can or independently use with first aspect present invention combination. In general, thisBright second aspect propose when to the first image (current block can be 16 × 16,16 × 8,8 × 16,8 × 8,4 × 8,8 × 4,4 ×4, or other rectangle size, even non-rectangle) when current block is encoded, use selected, have from known technologyThe motion vector of the positional value (i.e. assembly separately in two axis directions) of selecting in the different class value of middle used valueEncode.
Consider a possible motion vector predictor: (0,0). And the routine techniques of whole pixel allows motion vector to adoptSuch as-2.0 ,-1.0,0,1.0, the positional value of 2.0 grades, second aspect present invention proposes amendment close to one of described fallout predictorGroup possible position value. For the positional value 1.0 that approaches most 0, we can use another location value such as 0.85, can thereby makeThe positional value allowing will comprise-2.0 ,-0.85,0,0.85,2.0 etc. Its advantage is, in statistics, motion vector becomesTo close to 0. And, thereby by selecting the position close to 0 more, we will approach true motion vector more, andThereby can provide the better motion compensation that can cause high compression efficiency.
Thereby, in particular expression of the present invention, can be chosen as at least one direction of principal axis and select one group of possible positionValue, thus make them cannot all write Lm, wherein m=-..., 2 ,-1,0,1,2..., and L is constant (for example 1 pictureElement interval, 1/2 pixel separation, or 1/4 pixel separation); , positional value is inhomogeneous. Particularly, can be chosen as at least oneIndividual direction of principal axis is selected one group of possible position value, thereby makes them cannot all write m/n, wherein m=-..., and 2 ,-1,0,1,2 ... and n is 1 or 2 power.
Note, a second aspect of the present invention is not limited to chosen position value from one group of locus heterogeneous value;Compared with traditional positional value group, be also not limited to and only select two positional values close to zero. As example, at thisIn another example of bright second aspect, positional value 2.0 becomes 1.9, thereby admissible positional value will comprise-1.9- 0.85,0,0.851.9Deng.
Thereby in the interchangeable particular expression of second aspect present invention, (be at least one described in direction) selects oneGroup possible position value, can write as LA to comprisemOne or more positional values of m/n, wherein m=-..., 2 ,-1,0,1,2.. n is 1 or 2 power, L is constant (for example 1 pixel separation, 1/2 pixel separation, or 1/4 pixel separation), and AmTo be less than1 is at least still 0.75 value (optionally different for different m values), is at least more preferably 0.80, and optimumSelection of land is at least 0.85.
We have found that AmOptimum value depend on video.
An advantage of the specific embodiment of second aspect present invention is, the motion vector that they produce can adoptThe form type code identical with conventional algorithm encoded, and makes except the regular code of this positional value should be interpreted as respectively this embodimentWith possible position value outside. For example, if the positional value being used by specific embodiment be-1.9-0.85,0,0.851.9Deng, the regular code of positional value 1.0 should be interpreted as 0.85, and the regular code of positional value 2.0 should be interpreted as 1.9, etc.
Method according to a second aspect of the invention can comprise the following steps: definition of search region, and in the described field of searchThe multiple position candidate of definition in territory, described multiple position candidate comprise by the new positional value of second aspect present invention definedOne group of multiple position. These positional values be from key position (for example, (0,0) motion vector position, motion vectors, etc.)The corresponding shift value of the position candidate at place. For each candidate motion vector, our calculation cost function, described cost letterNumber is the similarity measurement (example between the piece at the described candidate motion vector place in current block and the second image in the first imageAs SAD, SAE) function. Alternatively, it can also be the function of following motion vector: described candidate motion vector, current pre-Survey motion vector, and alternatively, as first aspect present invention, one or more future anticipation motion vectors. For example, optionalGround, as first aspect present invention, can given this cost function.
Brief description of the drawings
Now will only the embodiment of the present invention be described with reference to the following drawings as example, wherein:
Fig. 1 shows current block, left, top block, upper right square, upper right square and right;
Fig. 2 shows the exemplary distribution of Diff, its be | mi,j+1—pi,j+1| with | mi,j—pi,j+1| between difference;
Fig. 3 a shows the way of search of the huge diamond search using as first embodiment of the invention;
Fig. 3 b shows the way of search of the huge diamond search of the amendment as used in first embodiment of the invention;
Fig. 4 shows the way of search of the little diamond search as used in first embodiment of the invention;
Fig. 5 a and 5b compare the flatness in the MV territory of PMVFAST and first embodiment of the invention;
Fig. 6 shows configuring in (a) (0,0) MV, (b) intermediate value MV, (c) adjacent MV and previous frame in present frameBottom-right MV (PreBottomRightMV) in previous frame of MV, (d) use that search (FS) algorithm completely obtainsMotion vector distribution in Foreman sequence;
Fig. 7 is the flow chart of first embodiment of the invention.
Detailed description of the invention
First embodiment of the invention has adopted many features of PMVFAST algorithm, but is by considering several contiguous blocksInstead of only a piece comes PMVFAST (and other existing algorithm) in addition improved. Equation (2) and (3) illustrate, current blockThe selection of MV directly affects the RD cost of contiguous block, and described contiguous block comprises right (or (i, j+1) piece), lower-left square (or(i+1, j-1) piece), and lower block (or (i+1, j) piece). This is because current MV will affect these contiguous blocksTherefore and then affect the optimum movement vector of those pieces the MV that predicts, and. These are " future " pieces, because when processing current blockTime, also they are not carried out to estimation. We cannot side by side calculate with current block the optimal movement arrow of these following piecesAmount, because we need to calculate in whole frame according to equation (2) optimum movement vector of all simultaneously, this will answer very muchAssorted.
As an alternative, in order to evaluate the implication of selection of current MV of the current block on right or (i, j+1) piece,We can be that the RD cost function of the current block of equation (1) increases by one:
R(|mi,j+1-pi,j+1|)(4)
Wherein, mi,j+1The optimum movement vector of right of the current MV based on current block, and pi,j+1Based on currentThe intermediate value MV fallout predictor of right of the current MV of piece,
pi,j+1=median(mi,j,mi-1,j+1,mi-1,j+2)(5)
But, the m in equation (4)i,j+1Unknown, because the estimation of right (following piece) is not also carried out. SoAnd we notice, | mi,j+1-pi,j+1| can pass through | mi,j-pi,j+1| approximate well. Allow Diff be | mi,j+1-pi,j+1|-|mi,j-pi,j+1|. We test and study the distribution of Diff to many video measurement orders. Fig. 2 has shown Foreman orderThe probability density function (probability-distribution function) of row, as shown in Figure 2, typical result is seriously partial to zero. This means this twoIndividual amount in most of situation, be essentially identical (identical in about 70% situation, and about 23% in the situation that onlyDiffer 1). This means, | mi,j-pi,j+1| be | mi,j+1-pi,j+1| good approximate. Therefore, R (| mi,j+1-pi,j+1|) canBy W*R (| mi,j-pi,j+1|) be similar to wherein W > 0. Equally, we can add addition Item to generation to lower-left square and lower blockIn valency.
For (i, j) piece, allow medianMV represent to provide intermediate value MV fallout predictor by equation 3. Allow FmedianMV represent byThe following intermediate value MV fallout predictor (for the intermediate value MV fallout predictor of right) that equation (5) provides. Thereby FmedianMV is MV candidateFunction. At this, the first embodiment is called as " the strengthening motion vectors field adaptive random search " for (i, j) piece(E-PMVFAST). The step of this embodiment is as follows.
Each step of embodiment is as follows, is shown in Fig. 7.
To any candidate MV, as the cost of giving a definition.
cost(MV)=SAD+λ*[w*R(MV-medianMV)+(1-w)*R(MV-FmedianMV)]
(6)
1. calculate the cost of three motion vector predictor: (i) intermediate value MV fallout predictor (" medianMV "), (ii) rightEstimated motion vector (" futureMV "), it is defined as follows:
FutureMV ≡ median (TopMV, TopRightMV, TopRightRightMV) and (iii) from past pieceThe MV fallout predictor of (" pastMV "), it is previous common position MV (" PreMV ") and is away from the previously bottom-right of medianMVIn MV (" PreBottomRightMV ") one,
pastMV≡argmaxMV∈{PreMv,PreBottomRightMV}{abs(MV-medianMV)}
Note, (ii) can by for another contiguous following piece (such as lower left, bottom, and/or bottom-rightPiece) estimated motion vector supplement or replace.
Shall also be noted that in item (iii), previous bottom-right MV can be predicted by the previous MV for another contiguous blockDevice supplements or replaces.
Note (ii) and (iii) independent aspects of shape cost invention.
For example, if above any one MV fallout predictor unavailable border of frame (), skips this fallout predictor.
2., if the minimum cost of motion vector predictor is less than threshold value T1, stops search and forward step 7 to. No, the motion vector of selecting to have a minimum cost is as currentMV (current MV) and forward next step to. Note 3 fortuneThe cost of dynamic vector fallout predictor can adopt predefined procedure to calculate that (for example medianMV, is then futureMV, is thenPastMV), and at any time,, if the cost of any motion vector predictor is less than specific threshold, search may stopOnly and forward step 7 to.
3. carry out an iteration of directed little diamond search around currentMV. The directed little diamond search of explained laterConcept.
4., if minimum cost is less than threshold value T2, stops search and forward step 7 to. Otherwise, select to have minimum generationThe motion vector of valency is as currentMV and forward next step to.
If (currentMV=medianMV) and current minimum cost be less than threshold value T3, carry out little rhombus and searchRope and forward step 7 to.
6., if video is not interlacing scan, carry out large diamond search, as shown in Fig. 3 (a); Otherwise, carry out as Fig. 3(b) the large diamond search of the amendment shown in. In each step in these steps, to each index point assessment cost of rhombusFunction.
7. select to have the MV of minimum cost.
In our experiment, find that the value about 0.8 of w is effective.
Explain now the step of directed little diamond search. Suppose, centerMV is current search center, and MV1, MV2,MV3 and MV4 be four around search point, as shown in Figure 4. For each MVi calculates R (MVi-medianMV). If R is (MVi-MedianMV) < R (centerMV-medianMV), calculates SAD and the cost of MVi. Otherwise, ignore this MVi. Selection hasThe MV of low-cost is as currentMV. Note, the concept of directed square search is considered to new, and structure cost invention oneIndividual independent aspects, its need not with combine to carry out by the concept of futureMV.
The step of the large diamond search of large diamond search and amendment is identical, but described search be for Fig. 3 (a) andAll set of the point illustrating respectively in 3 (b) complete.
We consider the multiple possible variant of embodiment within the scope of the present invention now.
First, note, for different pieces, the weight coefficient w in cost function can be different. In addition, optionalSelecting ground, can be different for w described in different MV candidates. In particular, the definition of w can be depended on such as MV candidate and isWhether the no X-axis component close to medianMV and/or futureMV or MV candidate or Y-axis component be with described FmedianMV'sThe situation that X-axis component or Y-axis component are identical and so on.
In addition, cost function can be not restricted to the form of equation (6). It can be (for example to comprise distortion measurement itemSAD, distortion quadratic sum (SSD), average deviation distortion (MAD), MSD etc.) and to current block and some contiguous block (for example consideredRight, lower block, lower-left piece etc.) encode any function of item of necessary position of motion vector.
In addition,, in step 1, the definition of futureMV is not limited to the form that above step 1 provides. For futureMVTwo possible replacements definition be:
futureMV≡median(leftMV,TopRightMV,TopRightRightMV)
futureMV≡median(medianMV,TopRightMV,TopRightRightMV)
In addition, in the above step 1 of expressing, pastMV be chosen as possible MV in previous frame (preMV withPreBottomRlghtMV) farthest one of distance medianMV in list collection. But the MV list that will consider can be wrappedContaining two above possible MV (for example preMV, preLeftMV, preRightMV, preTopMV, preTopLeftMV,PreTopRightMV, preBottomMB, preBottomLeftMV, preBottomRightMV, etc.). In addition, from oneThe MV of above previous coding frame can be included in that in described list, (if for example present frame is frame N, described list can compriseFrame N-1, N-2, N-3 ...). If present frame is B frame, the list of previous coding frame can comprise following P frame.
In addition,, in step 1, selecting pastMV is distance reference MV (medianMV in step 1) possibility farthestMV. Other is also fine with reference to MV, comprises leftMV or TopMV or TopRightMV or certain combination. From possible MVList in other method of selecting be also possible.
In step 2, the cost of described 3 motion vector predictor obtains according to certain predefined order.Possible predefined order comprises:
A) medianMV, follows by futureMV, follows by pastMV
B) medianMV, follows by pastMV, follows by futureMV
C) futureMV, follows by medianMV, follows by pastMV
D) futureMV, follows by pastMV, follows by medianMV
E) pastMV, follows by medianMV, follows by futureMV
F) pastMV, follows by futureMV, follows by medianMV
In addition, although express as above, in step 3, carried out an iteration of directed little diamond search, can applyMore than one iteration.
Analog result
We provide the analog result of embodiment E-PMVFAST now. Described embodiment is embedded into H.264 reference softwareJM9.3[13] in, and use various QP, video sequence, resolution ratio and hunting zone to simulate it. Table 1 (a-c) and 2(a-c) some typical analog results are shown. PSNR (ratio of peak signal and noise) changes and BR (bit rate) variation isWith respect to the search PSNR of (FS) and the variation of bit rate completely. Analog result illustrates, the bit rate of the E-PMVFAST proposingTrend towards with search and PMVFAST are similar completely with PSNR, but on large-scale video sequence and bit rate, E-PMVFAST trends towards than PMVFAST fast about 40%. A key character of E-PMVFAST is, its motion vector field trendIn level and smooth in the extreme, thereby make the motion vector can be than other fast moving algorithm for estimating moving of indicated object more exactlyMoving.
In Fig. 5 (a) and 5 (b), the image in left side illustrates the motion vector that (as short-term) obtained by PMVFAST algorithm, and the imaging on right side illustrates the motion vector of the identical imaging obtaining by described embodiment. The motion vector of E-PMVFASTField is more level and smooth than the motion vector field of PMVFAST significantly, particularly in irised out region. For perception conversion is compiledCode (perceptualtrans-coding), speed control, multiple block-sized motion estimation, multiple standards frame estimation etc.In video motion content classify, level and smooth sports ground is very useful.
The analog result of form 1a-foremanCIF sequence
The analog result of form 1b-CoastguardCIF sequence
The analog result of form 1c-HallCIF sequence
The analog result of form 2a-foremanQCIF sequence
The analog result of form 2b-AkiyoQCIF sequence
The analog result of form 2c-CoastguardQCIF sequence
We turn to the second embodiment of the present invention now, and it shows a second aspect of the present invention.
As mentioned above, conventional integer pixel completely allows motion vector in each direction, to adopt-2.0 ,-1.0,0,1.0,2.0 etc. positional value. In second embodiment of the invention, select the possible position value close to fallout predictor. For approaching most 0Positional value, we can use (replace 1.0) another location value, such as 0.85, thereby the positional value of permission can be wrappedDraw together-2.0,-0.85,0,0.85, 2.0 etc. This advantage is, in statistics, motion vector tends to close to 0. Therefore, pass throughSelect more to approach 0 position, we can be more close to real motion vector, and thereby can provide and can causeThe better motion compensation of high compression efficiency. Similarly, can change other positional value. For example, positional value 2.0 can be changedBe 1.9, thereby the positional value allowing can be comprised-1.9-0.85, 0,0.85,1.9 etc. Except 1.0 coding motion is vowedAmount position should be interpreted as 0.85, and 2.0 encoding motion vector position should be interpreted as beyond 1.9, the change proposingAdvantage be to use same movement vector code.
Half-pixel accuracy allows motion vector to adopt such as 0.0,0.5,1.0,1.5,2.0 etc. and so on positional value. IThese positional values of proposed amendments, particularly those are close to the positional value of fallout predictor. For the positional value that is in close proximity to 00.5, we advise using a different value. For example, a kind of possibility is to use 0.4 to replace 0.5. In other words, positional valueTo comprise 0.0,0.4, 1.0,1.5,2.0. Similarly, other positional value can be modified. For example, positional value 1.0 can changeBe 0.9, thereby make this group reposition value will comprise 0.0,0.4,0.95,1.5,2.0 etc. Equally, this can contribute to carryHigh compression efficiency. Similarly, can revise other positional value to improve compression efficiency. But changing this position can be remarkableGround causes the higher computational efficiency at both places of encoder. Conventionally, most of compression efficiency gain comes from positionThe value of putting is changed into close to fallout predictor.
1/4 pixel precision allows motion vector to adopt such as 0.00,0.25,0.50,0.75,1.00 etc. and so on positionValue. We can location revision value, and those are close to the positional value of fallout predictor especially. For example, we can be revised as them0.00,0.20,0.47,0.73,0.99 etc.
Note, the method proposing allows us between each integer position value, to select the positional value of any amount. ExampleAs, between positional value 0 and 1, half-pixel accuracy is used 1 positional value, and { 0.5}, 1/4 pixel precision uses 3 positional values0.25,0.50,0.75}, and 7 positional values of 1/8 pixel precision use 0.125,0.250,0.375,0.500,0.625,0.750,0.875}. The method proposing allows us between 0 and 1, to select any N positional value. For example, we can select N=2 values are such as 0.3 and 0.6.
The inhomogeneous sub-pix estimation proposing and compensation are not each regions that must be applied to each frame.On the contrary, some position can be incorporated in head, with indicate each region (for example sheet) for described frame of video its whether byOpen or close. In addition, it can directly apply to existing standard in the situation that changing without any grammer, because canTo apply identical motion vector code.
Adopt H.264JM82 software to simulate proposed inhomogeneous sub-pix estimation and compensation, andIts result is shown in above form, and wherein QP represents quantization parameter. This simulation in x and y direction use location value (...-1 ,-0.75 ,-0.5 ,-0.15,0,0.15,0.5,0.75,1..). , with the standard method that uses 1/4 pixel space positional valueCompare, only revised-0.25 and the positional value at+0.25 place. Except using new positional value, described algorithm is at itHis aspect is identical with known H.264 canonical algorithm. Shown at table, the second embodiment is realizing similar PSNR'sMeanwhile, reduced significantly bit rate. Do not need amendment grammer H.264.
Although be only several embodiments of the present invention described above, many within the scope of the present invention variations are all possible.
For example, the description of the invention more than providing is the piece of the fixed size of the P frame for having a reference frame.But the present invention can be applied to the piece with multiple sub-block size, and described must be not non-overlapped. Can haveMore than one reference frame, and reference frame can be the past with respect to present frame or in the future any of video sequence.
For video, a pictorial element (pixel) can have one or more components, such as luminance component, redness,Green, blue (RGB) component, YUV component, YCrCb component, infrared component, X-ray or other component. Each point of pictorial elementAmount is to be expressed as digital symbol, and described numeral can be natural number, integer, real number or even plural number. NaturalIn situation, they can be 12,8, or any other bit resolution. Although being the rectangle having, the pixel in video adoptsThe 2 dimension samples in sample grid and uniform sampling cycle, but described sampling grid is not to be necessary for rectangle and described sampling periodNot that to be necessary for be uniform.
Industrial applicibility
Each embodiment of the present invention is suitable for by MPEG-1, MPEG-2, MPEG-4, H.261, H.263, H.264, AVSOther can be modified to include above it associated video coding standard or quick, low delay and the low-cost software of methodRealize with hardware. Possible application comprises DVB (ground, satellite, wired), digital camera, numeral canCamcorder (camcorder), digital VTR, Set Top Box, personal digital assistant (PDA), can use multimedia honeycombFormula phone (2.5G, 3G and more than), video conferencing system, video on-demand system, WLAN devices, bluetooth applications, networkVideo streaming services device in server, low or bandwidth applications, video code interpreter are (from a format conversion to another latticeFormula) and other television communications system etc.
Bibliography
Disclosure below with reference to document is introduced in this comprehensively:
[1]JointVideoTeamofITU-TandISO/1ECJTC1,“DraftITU-TRecommendationandFinalDraftinternationalStandardofJointVideoSpecification(ITU-TRec.H.264|ISO/IEC14496-10AVC)”,docunmentJVT-G050r1,may2003
[2]A.M.Tourapis,O.C.AuandM.L.Liou,“PredictiveMotionVectorFieldAdaptiveSearchTechnique(PMVFAST)”,ISO/IECJTC1/SC29/WG11MPEG2000,Noordwijkerhout,ML,March’2000
[3]R.LI,B.ZengandM.L.Liou,“Anewthree-stepsearchalgorithmforblockmotionestimation”,OnCircuitsandSystemsforVideoTechnology,vol4,no4,pp438-42,Aug’94
[4] Z.L.He and M.L.Liou, " Ahighperformancefastsearchalgorithmforblockmatchingmotionestimation”,IEEETrans.onCircuitsandSystemsforVld&o2Technology,vol.7,no5,pp826-8,Oct’97
[5]A.M.Tourapis,O.C.Au,andM.L.Liou,“FastMotionEstimationusingCircularZonalSearch”,Proc.ofSPIESym.OfVisualComm.&ImaggProcessin,vol2,pp.1496-1504,Jan.25-27,‘99
[6]A,M.Tourapis,O.C.Au,M.LLiou,G.Shen,andI.Ahmad,“OptimizingtheMpeg-4Encoder-AdvancedDiamondZonalSearch”,inPros.of2000IEEEInter.Sym.onCircuitsandSystems,Geneva,Switzerland,May,2000
[7]K.K.MaandP.I.Hosur,“PerformanceReportofMotionVectorFieldAdaptiveSearchTechnique(MVFAST)”,inISO/IECJTC1/SC29/WG11MPEG99/m81,Noordwijkerhout,NLMar’00
[8]A.M.Tourapis,O.C.Au,andM.L.Liou,“FastBlock-MatchingMotionEstimationusingPredictiveMotionVectorFieldAdaptiveSearchTechnique(PMVFAST)”,inISO/IEC/JTC1/SC29/WG11MPEG2000/M5866,Noordwijkerhout,NL,Mar’00
[9]ImplementationStudyGroup,“Experimentalconditionsforevaluatingencodermotionestimationalgorithms”,inISO/IECJTC1/SC29/WG11MPEG99/n3141,Hawaii,USA,Dec’99
[10]“MPEG-4OptimizationModelVersion1.0”,inISO/IECJTC1/SC29/WG11MPEG2000/n3324,Noordwijkerhout,NL,Mar’00
[11]T,Koga,K.linuma,A.Hirano,Y.lijima,andT.Ishlguro,“Motioncompensatedinterframecodingforvideoconferencing”Proc.Nat.Telecommun.Conf.,NewOrleans,LA,pp.G.5.3.1-G.5.3.5,Dec’81。
[12]J.R.JainandA.K.Jain,“Displacementmeasurementanditsapplicationininterframeimagecoding”,JEEETrans.OnCommunications,vol.COM-29,pp.1799-808,Dec’81
[13]JVTreferencesoftwareJM9.2forJVT/H.264FRext

Claims (8)

1. one kind is that each in a series of block of pixels in the first image is selected the similar pixel separately in the second imageThe method of piece, described method comprises:
For the current block in described a series of block of pixels of described the first image, the multiple position candidate in described the second imageBetween select, described multiple position candidate comprise by two defined position candidate of positional value on two direction of principal axisGroup, at least one positional value in described positional value is inhomogeneous for described two axial at least one direction of principal axis.
2. the method for claim 1, wherein each in the group of described position candidate is worth with each by relevant positionFrom motion vector be associated, described motion vector is different from motion vectors.
3. the method for claim 1, wherein each in the group of described position candidate is worth with each by relevant positionFrom motion vector be associated, described motion vector is different from (0,0) motion vector.
4. method as claimed in claim 1, wherein, described positional value can be write DAmM/n, wherein m=-..., 2 ,-1,0,1,2 ...,N is 1 or 2 power, and D is constant, and at least one the value of m AmTo be less than 1 value, and for all values of m0.75≤Am<1。
5. method as claimed in claim 4, wherein, each in the group of described position candidate is worth with each by relevant positionFrom motion vector be associated, described motion vector is different from motion vectors.
6. method as claimed in claim 4, wherein, each in the group of described position candidate is worth with each by relevant positionFrom motion vector be associated, described motion vector is different from (0,0) motion vector.
7. method as claimed in claim 4, wherein, for all values of m, AmValue be at least 0.85.
8. one kind is that each in a series of block of pixels in the first image is selected the similar pixel separately in the second imageThe system of piece, described system comprises processor, described processor is configured to, and is described a series of block of pixels of described the first imageIn current block, between the multiple position candidate in described the second image, select, described multiple position candidate comprise byThe group of two defined position candidate of positional value on two direction of principal axis, at least one positional value in described positional value forDescribed two axial at least one direction of principal axis are inhomogeneous.
CN200610064777.XA 2006-06-16 2006-12-15 Be used for the block-based motion estimation algorithm of the enhancing of video compress Expired - Fee Related CN101090491B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/455,009 2006-06-16
US11/455,009 US8761258B2 (en) 2005-06-17 2006-06-16 Enhanced block-based motion estimation algorithms for video compression

Publications (2)

Publication Number Publication Date
CN101090491A CN101090491A (en) 2007-12-19
CN101090491B true CN101090491B (en) 2016-05-18

Family

ID=38943617

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200610064777.XA Expired - Fee Related CN101090491B (en) 2006-06-16 2006-12-15 Be used for the block-based motion estimation algorithm of the enhancing of video compress

Country Status (1)

Country Link
CN (1) CN101090491B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9445120B2 (en) 2011-04-12 2016-09-13 Sun Patent Trust Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus and moving picture coding and decoding apparatus
US9456214B2 (en) 2011-08-03 2016-09-27 Sun Patent Trust Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus, and moving picture coding and decoding apparatus
US9456217B2 (en) 2011-05-24 2016-09-27 Sun Patent Trust Coding method and apparatus with candidate motion vectors
US9560373B2 (en) 2011-05-31 2017-01-31 Sun Patent Trust Image coding method and apparatus with candidate motion vectors
US11570444B2 (en) 2011-05-27 2023-01-31 Sun Patent Trust Image coding method, image coding apparatus, image decoding method, image decoding apparatus, and image coding and decoding apparatus

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI566586B (en) * 2009-10-20 2017-01-11 湯姆生特許公司 Method for coding a block of a sequence of images and method for reconstructing said block
CN106851306B (en) * 2011-01-12 2020-08-04 太阳专利托管公司 Moving picture decoding method and moving picture decoding device
WO2012108701A2 (en) * 2011-02-09 2012-08-16 엘지전자 주식회사 Method for storing motion information and method for inducing temporal motion vector predictor using same
WO2012117728A1 (en) 2011-03-03 2012-09-07 パナソニック株式会社 Video image encoding method, video image decoding method, video image encoding device, video image decoding device, and video image encoding/decoding device
US9485518B2 (en) 2011-05-27 2016-11-01 Sun Patent Trust Decoding method and apparatus with candidate motion vectors
TW201304552A (en) 2011-05-31 2013-01-16 Panasonic Corp Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus, and moving picture coding and decoding apparatus
CN103535039B (en) 2011-06-30 2018-08-28 太阳专利托管公司 Picture decoding method, image encoding method, picture decoding apparatus, picture coding device and arrangement for encoding
CN107404651B (en) * 2011-09-09 2020-03-06 株式会社Kt Method for decoding video signal
EP3923572A1 (en) 2011-10-19 2021-12-15 Sun Patent Trust Image encoding method, image encoding device, image decoding method, and picture decoding device
KR20200012957A (en) 2017-06-30 2020-02-05 후아웨이 테크놀러지 컴퍼니 리미티드 Inter-frame Prediction Method and Device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9445120B2 (en) 2011-04-12 2016-09-13 Sun Patent Trust Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus and moving picture coding and decoding apparatus
US9456217B2 (en) 2011-05-24 2016-09-27 Sun Patent Trust Coding method and apparatus with candidate motion vectors
US11570444B2 (en) 2011-05-27 2023-01-31 Sun Patent Trust Image coding method, image coding apparatus, image decoding method, image decoding apparatus, and image coding and decoding apparatus
US9560373B2 (en) 2011-05-31 2017-01-31 Sun Patent Trust Image coding method and apparatus with candidate motion vectors
US9456214B2 (en) 2011-08-03 2016-09-27 Sun Patent Trust Moving picture coding method, moving picture coding apparatus, moving picture decoding method, moving picture decoding apparatus, and moving picture coding and decoding apparatus

Also Published As

Publication number Publication date
CN101090491A (en) 2007-12-19

Similar Documents

Publication Publication Date Title
CN101090491B (en) Be used for the block-based motion estimation algorithm of the enhancing of video compress
US8761258B2 (en) Enhanced block-based motion estimation algorithms for video compression
US10362329B2 (en) Video coding using reference motion vectors
KR100681370B1 (en) Predicting motion vectors for fields of forward-predicted interlaced video frames
CN102084655B (en) Video encoding by filter selection
TWI720750B (en) Dynamic image encoding device and dynamic image encoding method
JP6457483B2 (en) Frame interpolation with motion compensation using smoothness constraints
CN110463201A (en) Use the prediction technique and device of reference block
JP5913283B2 (en) Motion prediction method
CN102415098B (en) Image processing apparatus and method
JP2007053561A (en) Device and method for encoding image
CN102077598A (en) Apparatus and method for computationally efficient intra prediction in a video coder
CN102422643A (en) Image processing device, method, and program
JP2004207802A (en) Image data processing apparatus and method, recording medium, and program
CN102077599A (en) Apparatus and method for high quality intra mode prediction in a video coder
JPWO2005107267A1 (en) Image encoding / decoding device, encoding / decoding program, and encoding / decoding method
Hussain et al. A survey on video compression fast block matching algorithms
WO2014058796A1 (en) Method and apparatus for video coding using reference motion vectors
FI109634B (en) Method and apparatus for encoding a video image
Munagala et al. Enhanced holoentropy-based encoding via whale optimization for highly efficient video coding
Liu et al. Low-cost H. 264/AVC inter frame mode decision algorithm for mobile communication systems
JP2014027676A (en) Image decoding device
Singh et al. Direction-adaptive motion estimation (DAME) for efficient video compression
Yu et al. Efficient multiple-reference temporal error concealment algorithm based on H. 264/AVC
Karmakar et al. Efficient motion modelling with variable-sized blocks from hierarchical cuboidal partitioning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1115255

Country of ref document: HK

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1115255

Country of ref document: HK

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160518

Termination date: 20161215