A video signal is three-dimensional in nature, in that it is treated in terms of its horizontal, vertical and temporal components, and can be expressed as a continuous function f_3(x, y, t). Assume that its moving object undergoes only a uniform-velocity rigid-body translation v = (v_x, v_y); the Fourier transform of the progressive video f_3() can then be expressed as:

F_3(f_x, f_y, f_t) = F_2(f_x, f_y) · δ(f_x·v_x + f_y·v_y + f_t)    Eq(1)
wherein F_2(f_x, f_y) is the Fourier transform of the two-dimensional video signal f_2(x, y), and δ(f_x·v_x + f_y·v_y + f_t) represents an oblique plane in the three-dimensional frequency space defined by the equation f_x·v_x + f_y·v_y + f_t = 0; the baseband therefore exists only on a two-dimensional frequency plane. Eq(1) is disclosed in a paper by R.A.F. Belfor et al., "Motion Compensated Subsampling of HDTV", SPIE Vol. 1605, Visual Communications and Image Processing '91, pp. 274-284 (1991). A spatio-temporal bandwidth can be predicted from the position of the baseband spectrum. That is, for a given temporal bandwidth f_t^w, the relation among the temporal bandwidth f_t^w, the spatial bandwidths f_x^w and f_y^w, and the velocity components v_x and v_y can be derived from Eq(1) as follows:

f_t^w = f_x^w·v_x + f_y^w·v_y    Eq(2)
wherein f_x^w and f_y^w are the corresponding spatial bandwidth components in the x and y directions. It can be seen from Eq(2) that the temporal bandwidth is directly proportional to the speed of the moving object; and that, for a fixed temporal bandwidth, the spatial bandwidth becomes inversely proportional to the speed of the moving object.
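By way of illustration only, the proportionality expressed by Eq(2) can be checked numerically. The sketch below evaluates f_t^w for the speeds used in Figs. 1A to 1C; the units (cycles/pixel for spatial bandwidth, pixels/frame period for velocity) are illustrative assumptions, not taken from the text.

```python
# Numeric check of Eq(2): the temporal bandwidth grows linearly
# with the speed of the moving object.

def temporal_bandwidth(fx_w, fy_w, vx, vy):
    """Eq(2): f_t^w = f_x^w * v_x + f_y^w * v_y."""
    return fx_w * vx + fy_w * vy

# Fix the spatial bandwidth at the Nyquist limit (0.5 cycles/pixel).
for v in (1, 2, 3):  # pixels per frame period, as in Figs. 1A-1C
    print(v, temporal_bandwidth(0.5, 0.0, v, 0.0))
# Doubling the speed doubles the temporal bandwidth; conversely, for a
# fixed temporal bandwidth the admissible spatial bandwidth halves.
```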
Since the video signal to be filtered is sampled at spatial and temporal sampling frequencies, the sampled video signal can be represented as three-dimensional sampled data, i.e., pixels. The sampling of the continuous function f_3() can therefore be expressed as the continuous function f_3(x, y, t) multiplied by a three-dimensional array of δ functions. The spectral distribution of the pixels is then given by the convolution of the Fourier transform of f_3() with a δ-function array. Accordingly, owing to the nature of the δ function, the spectrum of the pixels is repeated at intervals of the sampling frequencies.
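A tiny numeric illustration of the spectral replication just described may be helpful: sampling makes frequencies that differ by the sampling frequency indistinguishable, which is the time-domain face of the spectrum repeating at intervals of the sampling frequency. The sinusoid frequencies below are illustrative choices of ours.

```python
import math

# Sampling a sinusoid at rate fs: a component at f and one at f + fs
# produce identical samples, i.e., the spectrum is periodic in fs.

fs = 1.0  # temporal sampling frequency, normalized to 1 as in Figs. 1A-1C

def samples(f, n=8):
    return [round(math.cos(2 * math.pi * f * k / fs), 9) for k in range(n)]

# A 0.2 cycles/sample sinusoid and a 1.2 cycles/sample sinusoid
# give identical sample sequences:
print(samples(0.2) == samples(0.2 + fs))  # True
```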
Referring first to Figs. 1A, 1B and 1C, there are shown the baseband spectrum distributions as a function of the speed of the moving object, for v_x = 1 pixel/frame period, v_x = 2 pixels/frame period and v_x = 3 pixels/frame period, respectively, wherein the solid lines represent replicas of the baseband; the temporal sampling frequency is normalized to 1; and the spatial (x-axis direction) and temporal frequencies are denoted f_x and f_t, respectively.
The motion of a pixel A in the moving object causes the spectrum to tilt away from the spatial frequency axis, as shown in Fig. 1A. As shown in Figs. 1A, 1B and 1C, the tilt angle θ increases with the speed. The reason for the tilt can be readily understood from Eq(2) by examining the temporal frequency at a pixel of the video signal: since the distribution of the spectrum in the spatio-temporal frequency domain is related to the product of the spatial frequency and the speed of the moving object, a higher speed of the moving object gives a higher temporal frequency. It should be emphasized that the spectrum is sheared, not rotated.
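The shear described above can be sketched numerically. For horizontal motion, the spectral support of Eq(1) is the line f_x·v_x + f_t = 0, whose slope magnitude is v_x; the tilt-angle formula below is our own reading of that geometry, not a formula stated in the text.

```python
import math

# Tilt of the baseband support away from the f_x axis, per Figs. 1A-1C:
# on the line f_t = -f_x * v_x, the slope magnitude is |v_x|, so the
# tilt angle is theta = atan(|v_x|).

def tilt_angle_deg(vx):
    return math.degrees(math.atan(abs(vx)))

for vx in (1, 2, 3):  # pixels/frame period
    print(vx, round(tilt_angle_deg(vx), 1))
# The angle grows with speed (45.0, 63.4, 71.6 degrees), but the
# support remains a plane: the spectrum is sheared, not rotated.
```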
Referring to Fig. 2, there is shown the result of low-pass filtering in the temporal domain with a fixed temporal cut-off frequency f_t^c. To carry out the temporal filtering, the following two assumptions are made: first, that the baseband spectra have no spatially overlapping parts; and second, for simplicity, that there exists only a simple horizontal motion at a uniform velocity (represented along f_x). In Fig. 2, the filtering result includes, e.g., the spatial high-frequency components B of temporally overlapping adjacent spectra. That is, the spatial high-frequency components affect the temporal low-frequency components of the adjacent replicas. In other words, interference between the spatial high-frequency components of the adjacent replicas and the low-frequency components appears in the displayed image.
As can be seen from Eq(1) and Eq(2), the relation between the spatial frequency f_s (comprising the vertical and horizontal components) and the temporal frequency f_t can be expressed as follows:

f_t = f_s·|v|    Eq(3)

wherein the spatial frequency f_s is defined on the f_x-f_y plane. As seen from Eq(3), it should be appreciated that, when the temporal cut-off frequency is fixed in order to limit the temporal bandwidth, the spatial cut-off frequency becomes inversely proportional to the absolute value of the speed of the moving object.
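The inverse proportionality just stated can be sketched as follows; the symbol names (ft_c for the temporal cut-off, f_s_c for the resulting spatial cut-off) are our own illustrative choices.

```python
# For a fixed temporal cut-off ft_c, the admissible spatial cut-off
# f_s_c = ft_c / |v| shrinks as the speed of the moving object grows.

def spatial_cutoff(ft_c, vx, vy):
    speed = (vx * vx + vy * vy) ** 0.5
    if speed == 0.0:
        return float("inf")  # a static scene imposes no spatial limit
    return ft_c / speed

for v in (1.0, 2.0, 4.0):
    print(v, spatial_cutoff(0.25, v, 0.0))
# 0.25, 0.125, 0.0625: doubling the speed halves the spatial cut-off.
```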
Assume that h() is the impulse response of a low-pass temporal filter and, for simplicity, that there exists only a simple horizontal motion (in the x-axis direction); the temporally band-limited video signal g(x, t') can then be expressed as follows:

g(x, t') = ∫ h(τ)·f(x, t−τ) dτ    Eq(4)

wherein a linear-phase filter is used to reduce the group-delay effect of the filter response. From the assumptions of the uniform-velocity rigid-body translation v = (v_x, v_y) and the simple horizontal motion, the filtering input function can be expressed as follows:
f(x, t−τ) = f(x + v_x·τ, t)    Eq(5)
From Eq(5), the displacement of a moving pixel along the time axis can be represented by its trajectory in the spatial domain at any point on the time axis. Accordingly, Eq(4) can be rewritten as:

g(x, t') = ∫ h(τ)·f(x + v_x·τ, t) dτ    Eq(6)
On the other hand, in the case of an actual video signal, the assumption of the uniform-velocity rigid-body translation does not always hold. Moreover, even if there is no moving object, each pixel value of the video signal varies owing to changes in the light source and in the characteristics of the video signal generating equipment, e.g., a camera. In these cases, Eq(5) holds only over a short time interval, and can be rewritten as follows:

f(x, t−(k+1)Δt) = f(x + v_x(t−kΔt)·Δt, t−kΔt)    Eq(7)

wherein Δt represents a short time interval, e.g., a frame period, and k is an integer. According to Eq(7), Eq(6) can be rewritten as:
g(x, t') = Σ_k ∫_{kΔt}^{(k+1)Δt} h(τ)·f(x + v(x, t−kΔt)·(τ−kΔt), t−kΔt) dτ    Eq(8)
From Eq(8), it will be appreciated that the temporal filtering of Eq(4) can be accomplished through a spatio-temporal filtering with the filtering input function f().

Eq(8) is a continuous description of the motion-adaptive spatio-temporal filtering. A similar result also holds in the discrete case: the integration is replaced with a summation, and dτ is represented by Δτ together with the indices j and l. Eq(8) is then given by:

g(x, n) = Σ_{j=−N}^{N} Σ_{l=0}^{L−1} h(j, l)·f(x + v(x, n−j)·l·Δτ, n−j)    Eq(9)

wherein n is a frame index; the velocity and the filtering position are replaced by the vectors v and x; the filter impulse response h(), comprising (2N+1)×L filter coefficients, is predetermined together with the temporal cut-off frequency and the predetermined numbers N and L (N and L being positive integers); and, if the inter-pixel distance is denoted Δx, Δτ is selected so as to satisfy |v()·Δτ| ≤ |Δx| (if Δτ cannot satisfy this condition, spatial overlap results).
Therefore, as can be seen from Eq(9), the temporal band limitation can be achieved through a spatio-temporal filtering, i.e., a low-pass filtering, of the filtering input function over the spatial and temporal domains.
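A minimal one-dimensional sketch of the discrete filtering of Eq(9) follows. The uniform averaging coefficients, the single displacement value per frame, and the rounding to the nearest pixel are all simplifications of ours for brevity; the patent's filter coefficients are designed separately and off-grid positions are interpolated.

```python
# 1-D sketch of Eq(9): the filtered value at pixel x of frame n is a
# weighted sum of (2N+1)*L filtering input data taken along the motion
# trajectory through the neighboring frames.

def motion_adaptive_filter(frames, x, n, disp, N=1, L=4):
    """frames: list of 1-D pixel lists; disp[j]: displacement (pixels
    per frame) in frame j; returns the filtered datum g(x, n)."""
    taps = (2 * N + 1) * L
    h = [1.0 / taps] * taps          # stand-in low-pass coefficients
    g = 0.0
    for j in range(-N, N + 1):
        frame = frames[n - j]
        for l in range(L):
            # position x + v(x, n-j) * l * dtau, rounded to a pixel here
            # (bilinear interpolation would be used off-grid)
            pos = int(round(x + disp[n - j] * l / L))
            pos = max(0, min(len(frame) - 1, pos))
            g += h[(j + N) * L + l] * frame[pos]
    return g

# A flat (constant) image is left unchanged by the averaging filter:
flat = [[5.0] * 16 for _ in range(3)]
print(motion_adaptive_filter(flat, 8, 1, disp=[2.0, 2.0, 2.0]))  # 5.0
```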
On the other hand, if ΔT is the frame interval, then L·Δτ equals ΔT and v()·ΔT equals D(), wherein D() represents the displacement of a pixel between two adjacent frames. Eq(9) can then be revised as follows:

g(x, n) = Σ_{j=−N}^{N} Σ_{l=0}^{L−1} h(j, l)·f(x + D(x, n−j)·l/L, n−j)    Eq(10)

wherein L is selected so as to satisfy |D()| ≤ |Δx|·L (this condition is equivalent to the previously described condition |v()·Δτ| ≤ |Δx|; therefore, if L cannot satisfy it, spatial overlap results). Eq(10) is one realization of Eq(9). The temporal band limitation is achieved through a spatio-temporal filtering, i.e., a low-pass filtering, of the filtering input function f(), which comprises a multiplicity of, e.g., (2N+1), sets of filtering input data, wherein each set comprises a predetermined number (e.g., L) of filtering input data obtained from the pixel values of a corresponding frame of the video signal. In Eq(10), the position (x + D(x, n−j)·l/L) of a filtering input datum in the (n−j)th frame of the video signal may not coincide with an exact pixel position. In such a case, the filtering input datum can be determined from the neighboring pixels located around that position by using, e.g., a bilinear interpolation method which determines a weighted sum of the neighboring pixel values as the filtering input datum. That is, the filtering input function is derived along the trajectory of the moving object in the spatio-temporal domain. Specifically, a set of input data included in the filtering input function f() can be determined from the pixel values of a corresponding frame by using a motion vector which represents the displacement of the moving object between that frame of the video signal and its preceding frame, as will be described below in conjunction with Fig. 3.
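The bilinear interpolation mentioned above can be sketched as follows; the two-by-two test image and the coordinate convention (fractional column x, fractional row y) are illustrative assumptions of ours.

```python
# When a trajectory position does not fall on an exact pixel, the
# filtering input datum is a weighted sum of the four surrounding
# pixel values (bilinear interpolation).

def bilinear(frame, x, y):
    """frame: 2-D list indexed [row][col]; (x, y): fractional
    column/row position with x0+1 and y0+1 inside the frame."""
    x0, y0 = int(x), int(y)
    dx, dy = x - x0, y - y0
    return ((1 - dx) * (1 - dy) * frame[y0][x0]
            + dx * (1 - dy) * frame[y0][x0 + 1]
            + (1 - dx) * dy * frame[y0 + 1][x0]
            + dx * dy * frame[y0 + 1][x0 + 1])

img = [[0.0, 4.0],
       [8.0, 12.0]]
print(bilinear(img, 0.5, 0.5))  # 6.0, the average of the four neighbours
```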
On the other hand, the filter impulse response comprises a multiplicity of, i.e., (2N+1)×L, filter coefficients which serve to limit the bandwidth of the video signal to a predetermined bandwidth; these filter coefficients can be predetermined according to the desired temporal cut-off frequency and the predetermined numbers N and L. For example, when the temporal cut-off frequency is f_t^c, the filter impulse response is designed with a cut-off frequency of f_t^c/L.
In effect, as can be seen from Eq(10), the filtered data g(), i.e., the band-limited data, is obtained by convolving each set of filtering input data with its corresponding filter coefficients and then summing the convolved sets of filtering input data.
Referring to Fig. 3, there is shown an explanatory diagram of the filtering input function of the motion-adaptive spatio-temporal filtering method of the present invention. For simplicity, each frame, e.g., F_{c-1}, F_c and F_{c+1}, is represented by a line, and N and L of Eq(10) are assumed to be 1 and 4, respectively. In other words, to obtain the filtered data of a target pixel in a target frame F_c, three filtering input frames are used in the filtering process, i.e., the target frame F_c, which contains the target pixel on which the filtering operation is to be performed, and its two neighboring frames F_{c-1} and F_{c+1}, wherein c−1, c and c+1 denote frame indices; and, on each filtering input frame, four filtering input data are determined according to the motion vector of the pixel located at the target pixel position in that frame. The positions of the target pixel in the frames F_{c-1}, F_c and F_{c+1} are denoted x_10, x_20 and x_30, respectively, and the vertical axis is the time axis.
To derive the filtered data of the target pixel at x_20 in the target frame F_c, a multiplicity of (i.e., three) sets of filtering input data are determined, each set comprising a predetermined number (e.g., 4) of filtering input data lying on the corresponding motion trajectory of the target pixel in the corresponding filtering input frame. Specifically, according to the motion vectors D(x_10, c−1), D(x_20, c) and D(x_30, c+1) of the pixels at the target pixel positions in the frames F_{c-1}, F_c and F_{c+1}, three sets of filtering input data located at (x_10, x_11, x_12, x_13), (x_20, x_21, x_22, x_23) and (x_30, x_31, x_32, x_33), respectively, are determined on the pixel trajectories.
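The four input positions per frame in Fig. 3 can be generated as sketched below: starting from the target-pixel position in a frame, the positions step along the motion trajectory by D/L per step (L = 4 here). The names mirror the figure; the numeric values of x_10 and D are illustrative assumptions of ours.

```python
# Sample positions along the motion trajectory within one frame
# interval, per the position term x + D * l / L of Eq(10).

def trajectory_positions(x_start, D, L=4):
    return [x_start + D * l / L for l in range(L)]

x10, D = 0.0, 2.0   # position in frame F_{c-1}, displacement to F_c
print(trajectory_positions(x10, D))  # [0.0, 0.5, 1.0, 1.5]
```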
As shown in Fig. 3, it can be readily appreciated that the filtering input data are equivalent to the target pixel values in temporally interpolated, or up-sampled, frames of the video signal. For example, the filtering input datum at x_11 in the frame F_{c-1} (at the time t = −ΔT, the target frame F_c being at t = 0) is equivalent to the pixel value at x_10 at the time t = −3ΔT/4. This can be expressed as follows:

f(x_11, −ΔT) = f(x_10, −3ΔT/4)    Eq(11)
The equivalence between the spatial domain and the temporal domain is shown by the phantom lines in Fig. 3.
Referring to Figs. 4A to 4D, there is illustrated the result of the low-pass temporal filtering of a video signal in the spatio-temporal domain by using the motion-adaptive spatio-temporal filtering method. Fig. 4A shows the baseband spectrum of the original video signal. As described above, the process of obtaining the sets of filtering input data is equivalent to temporal up-sampling, or interpolation, as shown in Fig. 4B. If the desired cut-off frequency of the temporal low-pass filtering is f_t^c, the cut-off frequency of the filter of the present invention is f_t^c/L, as shown in Fig. 4C. The final spectrum of the filtering result is illustrated in Fig. 4D; it is the spectrum of Fig. 4C as it appears once the interpolated frames are discarded (note that no filtering results are provided for the inserted frames). Compared with the temporal band limitation of Fig. 2, it should be readily appreciated that the spatio-temporal band limitation of the present invention is not subject to the temporal aliasing effects.
As can be seen from Eq(10) and Figs. 3 and 4A to 4D, it will be understood that the filtering operation is carried out along the trajectory of the moving object in the spatio-temporal domain, whereby a temporal band limitation is obtained. Therefore, the inventive filter can effectively eliminate the temporal aliasing which may appear between the repeated spectra as the speed of the moving object increases, thereby greatly reducing the visible artifacts appearing in the moving areas of an image.
Referring to Fig. 5, there is shown an image encoding apparatus employing the motion-adaptive spatio-temporal filtering device in accordance with a preferred embodiment of the present invention. The image encoding apparatus comprises a filter circuit 100 for carrying out the motion-adaptive spatio-temporal filtering in accordance with the present invention; and a video encoding circuit 60 for eliminating redundancies in the filtered video signal so as to compress it to a more manageable size for transmission. The video signal is generated from a video signal source, e.g., a video camera (not shown), and fed to the filter circuit 100.
The filter circuit 100 carries out the motion-adaptive spatio-temporal filtering operation according to Eq(10), as described above. The filter circuit 100 comprises a frame buffer 10, a motion estimator 20, a motion vector buffer 30, a filtering input determiner 40 and a filtering calculator 50. The frame buffer 10 stores a current frame being input to the filter circuit 100 and a plurality of, e.g., (2N+1), preceding frames, i.e., the filtering input frames to be used in the filtering. Specifically, assuming N = 1, the frame buffer 10 stores the current frame F_{c+2} and the three filtering input frames F_{c-1}, F_c and F_{c+1}, wherein c+2, c+1, c and c−1 are frame indices. The motion estimator 20 receives two successive frames of the video signal, i.e., the current frame F_{c+2} input directly from the video signal source and its preceding frame F_{c+1} stored in the frame buffer 10, and extracts a motion vector associated with each of the pixels contained in the current frame F_{c+2}. To extract the motion vectors, various motion estimation methods well known in the art can be employed (see, e.g., MPEG Video Simulation Model 3, International Organisation for Standardisation, Coded Representation of Picture and Audio Information, 1990, ISO-IEC/JTC1/SC2/WG8, MPEG 90/041).
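A minimal full-search block-matching sketch of the kind of motion estimation the motion estimator 20 may perform on two successive frames follows; the one-dimensional signals, block size and search range are illustrative assumptions of ours, not parameters taken from the text.

```python
# Full-search block matching: find the displacement d that minimizes
# the sum of absolute differences (SAD) between a block of the current
# frame and a shifted block of the previous frame.

def block_match_1d(cur, prev, x, block=4, search=3):
    """Return the displacement d minimizing the SAD between
    cur[x:x+block] and prev[x+d:x+d+block]."""
    best_d, best_sad = 0, float("inf")
    for d in range(-search, search + 1):
        if x + d < 0 or x + d + block > len(prev):
            continue
        sad = sum(abs(cur[x + i] - prev[x + d + i]) for i in range(block))
        if sad < best_sad:
            best_sad, best_d = sad, d
    return best_d

prev = [0, 0, 1, 2, 3, 4, 0, 0, 0, 0]
cur  = [0, 0, 0, 0, 1, 2, 3, 4, 0, 0]  # pattern shifted right by 2
print(block_match_1d(cur, prev, 4))  # -2: the match lies 2 pixels back
```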
The extracted motion vectors are coupled to the motion vector buffer 30 and stored therein. In accordance with the present invention, the motion vector buffer 30 stores the motion vectors of the frames F_{c+2}, F_{c+1}, F_c and F_{c-1}.
The filtering input frames stored in the frame buffer 10 and the motion vectors associated with the filtering input frames stored in the motion vector buffer 30 are coupled to the filtering input determiner 40. The filtering input determiner 40 determines the sets of, e.g., 3, filtering input data constituting the filtering input function f() of Eq(10). As described above, if a filtering input datum is determined to lie on a position which does not fall on an exact pixel position, the filtering input determiner 40 provides the filtering input datum by calculating a weighted sum of its four neighboring pixels. The filtering input data are coupled to the filtering calculator 50.
In the filtering calculator 50, the filtered data g() represented by Eq(10) is calculated from the filtering input data fed from the filtering input determiner 40.
The filter impulse response comprising a plurality of, e.g., (2N+1)×L, filter coefficients is determined according to the desired temporal cut-off frequency f_t^c, N and L, wherein f_t^c, N and L are predetermined, by taking the characteristics of the video signal into consideration, so as to satisfy the conditions described above in conjunction with Eq(10). The filter coefficients can be predetermined before the filtering and stored in the filtering calculator 50. As described above, the filter circuit 100 carries out the motion-adaptive spatio-temporal filtering operation, thereby deriving a temporally band-limited video signal.
The filtered video signal output from the filtering calculator 50 is coupled to the video encoding circuit 60, wherein the video signal is compressed by using various methods known in the art (see, e.g., MPEG Video Simulation Model 3, International Organisation for Standardisation, Coded Representation of Picture and Audio Information, 1990, ISO-IEC/JTC1/SC2/WG8, MPEG 90/041). The encoded video signal is then coupled to a transmitter for transmission.
While the present invention has been shown and described with reference to the particular embodiments, it will be apparent to those skilled in the art that many changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.