JP2980810B2

JP2980810B2 - Motion vector search method and apparatus

Info

Publication number: JP2980810B2
Application number: JP10444094A
Authority: JP
Inventors: 真樹佐藤; 亮磨大網; 清晴相澤; 光俊羽鳥
Original assignee: GURAFUITSUKUSU KOMYUNIKEESHON RABORATORIIZU KK
Current assignee: GURAFUITSUKUSU KOMYUNIKEESHON RABORATORIIZU KK
Priority date: 1994-04-20
Filing date: 1994-04-20
Publication date: 1999-11-22
Anticipated expiration: 2014-11-22
Also published as: JPH07298265A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to a motion vector search method and apparatus for moving picture coding. Specifically, it provides a novel method and apparatus that use a quad-tree to represent a motion vector field with a smaller number of representative motion vectors, thereby reducing the amount of information in the motion vector field.

[0002]

[Prior Art] Moving picture coding commonly combines inter-frame prediction based on estimated motion information with coding of the prediction error. For coding at lower rates (less image data), how the motion information is estimated and used is important. Motion vector estimation by the block matching method, which has long been used in moving picture coding, has the problem that estimation accuracy degrades markedly when a block contains objects with different motions. Several references describe methods that address this problem.

[0003] Reference 1. J. L. Barron, D. J. Fleet, S. S. Beauchemin, T. A. Burkitt: "Performance of Optical Flow Techniques", Proceedings CVPR, June 1992, pp. 236-242

[0004] Reference 2. Ryoma Oami, Maki Sato, Kiyoharu Aizawa, Mitsutoshi Hatori: "A Basic Study of Motion Estimation Based on One-Pixel Matching", Television Society Technical Report (Terebi Gakugihō), ICS94-38 (1994)

[0005] Reference 1 estimates a denser vector field for higher-level processing in fields such as computer vision. If a dense motion vector field is available, estimation accuracy does not degrade markedly even in boundary regions between different motions.

[0006] Reference 2 generates a dense vector field by using the error information obtained from matching on a single-pixel basis. With this method, the error plane held for each pixel can be used effectively in the computation.

[0007] The content of Reference 2 is described below.

[0008] Motion estimation based on one-pixel matching

[0009] In general, when motion is estimated by the block matching method, a matching window W_m centred on pixel x and a search range W_s are set, and the motion vector Δx that minimizes the error function e_B(Δx) of equation (1) is found within W_s:

$$e_B(\Delta x) = \sum_{x \in W_m} \left| I_{k+1}(x + \Delta x) - I_k(x) \right| \tag{1}$$

Here the sum runs over all pixels x contained in the matching window W_m, I_{k+1}(x + Δx) is the reference image, and I_k(x) is the original image.
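As an illustration of equation (1), the following is a minimal sketch of exhaustive block matching in pure Python. The image layout (`image[y][x]`), the square window given by a half-width `half`, the square search range `search`, and the function names are assumptions made for the example; they are not taken from the patent.

```python
from typing import List, Tuple

Image = List[List[int]]  # image[y][x], grey levels (assumed layout)


def block_error(cur: Image, ref: Image, cx: int, cy: int,
                half: int, dx: int, dy: int) -> int:
    """e_B(dx, dy) of equation (1) for the window centred on (cx, cy)."""
    total = 0
    for y in range(cy - half, cy + half + 1):
        for x in range(cx - half, cx + half + 1):
            # Assumption: the caller keeps the window and the shifted
            # window inside both images, so no bounds check is done here.
            total += abs(ref[y + dy][x + dx] - cur[y][x])
    return total


def block_match(cur: Image, ref: Image, cx: int, cy: int,
                half: int, search: int) -> Tuple[int, int]:
    """Return the displacement in the search range that minimises e_B."""
    candidates = [(dx, dy)
                  for dy in range(-search, search + 1)
                  for dx in range(-search, search + 1)]
    return min(candidates,
               key=lambda d: block_error(cur, ref, cx, cy, half, d[0], d[1]))


# Tiny example: one bright pixel moves one column to the right.
cur = [[0] * 6 for _ in range(6)]   # original image I_k
ref = [[0] * 6 for _ in range(6)]   # reference image I_{k+1}
cur[2][2] = 100
ref[2][3] = 100
print(block_match(cur, ref, cx=2, cy=2, half=1, search=1))
```

Because a single vector is chosen for the whole window, a window that straddles two differently moving objects cannot match both, which is the accuracy problem described in paragraph [0002].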

[0010] When the number of pixels in the matching window W_m is reduced to one, that is, when matching is performed on a single pixel rather than on a block, equation (1) becomes equation (2):

$$e_x(\Delta x) = \left| I_{k+1}(x + \Delta x) - I_k(x) \right| \tag{2}$$

This is defined as the one-pixel matching error function e_x(Δx) at pixel x.
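The per-pixel error function of equation (2) can be precomputed once for every pixel and every displacement and then reused. The sketch below stores it as a dictionary from displacement to a per-pixel layer. The dictionary layout, the name `error_plane`, and the `penalty` value used when the displaced pixel falls outside the reference image are assumptions for illustration only.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]                    # image[y][x], grey levels
Disp = Tuple[int, int]                     # displacement (dx, dy)
ErrorPlane = Dict[Disp, List[List[int]]]   # plane[(dx, dy)][y][x] = e_x(dx, dy)


def error_plane(cur: Image, ref: Image, search: int,
                penalty: int = 255) -> ErrorPlane:
    """Evaluate equation (2) for every pixel and every displacement."""
    h, w = len(cur), len(cur[0])
    plane: ErrorPlane = {}
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Assumption: displacements that leave the image get a fixed penalty.
            layer = [[penalty] * w for _ in range(h)]
            for y in range(h):
                for x in range(w):
                    ry, rx = y + dy, x + dx
                    if 0 <= ry < h and 0 <= rx < w:
                        layer[y][x] = abs(ref[ry][rx] - cur[y][x])
            plane[(dx, dy)] = layer
    return plane
```

With such a table in memory, the block error of equation (1) for any window is just a sum of stored values, so later stages do not need to touch the images again.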

[0011] FIG. 7 shows how the motion vector Δx₀ at pixel x₀ is obtained by applying an operator F to the one-pixel matching error function e_x(Δx) of the many pixels x contained in a region R_{x₀} (shown in black in the figure) that includes pixel x₀. The assumed motion model may be, for example, a translation model or an affine model. In the basic framework of one-pixel matching, matching is first performed on single pixels, and some operator F is then applied to the result to obtain the motion. Because the error function e_x(Δx) is computed in advance for each pixel and its values (the per-pixel distortion plane, or error plane) are retained, the processing gains freedom, and using a nonlinear function or the like as the operator F allows more flexible motion estimation.

[0012] Specifically, when the motion vector Δx₀ of pixel x₀ is to be found, the basic framework of motion estimation based on one-pixel matching is as follows.

1) First, for a threshold T, the displacements Δx₀ satisfying e_{x₀}(Δx₀) ≤ T are selected as motion vector candidates. This set of motion vectors is written

$$V_{x_0} = \{\, \Delta x_0 \mid e_{x_0}(\Delta x_0) \le T \,\}$$

2) For a region R_{x₀} containing x₀, and under some assumed motion model, an operator F[·] serving as an evaluation function is applied to e_x(Δx), where x is contained in R_{x₀}.

3) The motion vector Δx₀ contained in V_{x₀} is found such that

$$\Delta x_0 = \arg\min_{\Delta x_0 \in V_{x_0}} F\left[\, e_x(\Delta x_0),\ x \in R_{x_0} \,\right]$$
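The three steps above can be sketched as follows, assuming a translation model and taking the operator F to be a plain sum of the per-pixel errors over the region. That choice of F, the error-plane layout `plane[(dx, dy)][y][x]`, and the function name are assumptions for this example; the text leaves F and the region open.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Disp = Tuple[int, int]
ErrorPlane = Dict[Disp, List[List[int]]]   # plane[(dx, dy)][y][x] = e_x(dx, dy)


def estimate_vector(plane: ErrorPlane, x0: Tuple[int, int],
                    region: Sequence[Tuple[int, int]],
                    threshold: int) -> Optional[Disp]:
    """Steps 1) to 3): threshold candidates at x0, then minimise F over the region."""
    px, py = x0
    # Step 1: candidate set V_x0 = {d | e_x0(d) <= T}.
    candidates = [d for d, layer in plane.items() if layer[py][px] <= threshold]
    if not candidates:
        return None  # assumption: no vector is reported when V_x0 is empty

    # Step 2: operator F, here the sum of e_x(d) over the region (assumed choice).
    def F(d: Disp) -> int:
        return sum(plane[d][y][x] for (x, y) in region)

    # Step 3: the candidate that minimises F.
    return min(candidates, key=F)


# One-row example with two displacements.
plane = {(0, 0): [[0, 5, 5]], (1, 0): [[1, 0, 0]]}
region = [(0, 0), (1, 0), (2, 0)]
print(estimate_vector(plane, (0, 0), region, threshold=2))
```

In this example both displacements pass the threshold at x₀, and the region decides between them; with a tighter threshold only the displacement that fits x₀ itself would remain.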

[0013] Within this framework there is freedom in specifying the operator F and the region R_{x₀}, so motion estimation suited to the properties of each region of the image becomes possible.

[0014] Thus various ways of deriving a vector field by one-pixel matching are conceivable, depending on the choice of the operator F and the control of the region R_{x₀}. To finally extract representative motion vectors, however, it is desirable that the field be segmented into regions to a suitable degree. Here, motion and regions are estimated simultaneously by a technique similar to distance transformation and clustering.

[0015] This technique consists of creating binary label images labelled "1" or "0" and repeatedly applying a distance transform to those images. First, the one-pixel matching error function e_x(Δx) is binarized with a threshold T_m, giving the binarized error function f(Δx) of equation (3):

$$f(\Delta x) = \begin{cases} 1, & e_x(\Delta x) \le T_m \\ 0, & e_x(\Delta x) > T_m \end{cases} \tag{3}$$
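Equation (3) amounts to thresholding each layer of the error plane, which gives one label image per displacement. A minimal sketch, assuming the error plane is stored as `plane[(dx, dy)][y][x]` (an assumed layout, as is the function name):

```python
from typing import Dict, List, Tuple

Disp = Tuple[int, int]
ErrorPlane = Dict[Disp, List[List[int]]]   # plane[(dx, dy)][y][x] = e_x(dx, dy)
LabelImages = Dict[Disp, List[List[int]]]  # labels[(dx, dy)][y][x] = f(dx, dy) at pixel x


def label_images(plane: ErrorPlane, t_m: int) -> LabelImages:
    """Apply equation (3): label 1 where e_x <= T_m, label 0 otherwise."""
    return {d: [[1 if e <= t_m else 0 for e in row] for row in layer]
            for d, layer in plane.items()}


plane = {(0, 0): [[0, 3], [4, 10]], (1, 0): [[9, 2], [3, 0]]}
print(label_images(plane, t_m=3))
```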

[0016] For each motion vector Δx, a distance transform is applied to the label image given by the binarized error function f(Δx), and the resulting distance transform value d_x(Δx) is used as an index.

[0017] FIG. 8 shows the distance transform of the label images, which are sorted by motion vector Δx. The black portions represent pixels x for which no match was obtained. The distance transform value d at point A corresponds to the distance d at which a circle grown around the point first touches one of the unmatched pixels, pixel 79. Inside the circle 71 of radius d_x(Δx) around pixel x there are only matched pixels. A displacement Δx for which d_x(Δx) is large is therefore likely to be the true motion vector, so the Δx that gives the maximum of d_x(Δx) is taken as the motion vector. That is, the motion vector is the Δx that gives

$$d_{x,\max} = \max_{\Delta x} d_x(\Delta x) \tag{4}$$
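The selection rule of equation (4) can be sketched with a brute-force distance computation. The Euclidean metric, the convention that a label image without any unmatched pixel yields an infinite distance, and the function names are assumptions for illustration; the text does not fix them.

```python
import math
from typing import Dict, List, Tuple

Disp = Tuple[int, int]
Label = List[List[int]]   # label[y][x]: 1 = matched, 0 = unmatched


def distance_value(label: Label, x: int, y: int) -> float:
    """d_x for one label image: distance from (x, y) to the nearest 0 pixel."""
    return min((math.hypot(xx - x, yy - y)
                for yy, row in enumerate(label)
                for xx, v in enumerate(row) if v == 0),
               default=math.inf)  # assumption: inf when no pixel is unmatched


def best_displacement(labels: Dict[Disp, Label],
                      x: int, y: int) -> Tuple[Disp, float]:
    """Equation (4): the displacement with the largest distance value at (x, y)."""
    best = max(labels, key=lambda d: distance_value(labels[d], x, y))
    return best, distance_value(labels[best], x, y)


labels = {
    (0, 0): [[1, 1, 0], [1, 1, 1], [1, 1, 1]],
    (1, 0): [[1, 0, 1], [1, 1, 1], [1, 1, 1]],
}
print(best_displacement(labels, x=0, y=2))
```

A production implementation would use a proper distance transform rather than this quadratic search; the sketch only shows which quantity is maximised.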

[0018] Specifically, to allow for the influence of noise and the like, the following procedure is used.

1. A distance transform is applied to the label image of each motion vector Δx. A pixel whose maximum distance d_{x,max} exceeds a value d₀ is assigned, as its motion vector, the Δx that gives d_{x,max}.

2. For each pixel x whose motion vector has been fixed, the binarized error function f(Δx) is relabelled as

$$f(\Delta x) = \begin{cases} 1, & \text{if } \Delta x \text{ gives } d_{x,\max} \\ 0, & \text{otherwise} \end{cases} \tag{5}$$

and the label images are updated.

3. The above is repeated while the distance d₀ is decreased little by little.
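The three-step procedure can be sketched as a loop over a decreasing schedule of d₀ values. The brute-force Euclidean distance, the explicit `d0_schedule` argument, the rule that an infinite distance is used when a label image has no unmatched pixel, and the names are assumptions made for this example.

```python
import math
from typing import Dict, List, Sequence, Tuple

Disp = Tuple[int, int]
Label = List[List[int]]   # label[y][x]: 1 = matched, 0 = unmatched


def distance_value(label: Label, x: int, y: int) -> float:
    """Distance from (x, y) to the nearest unmatched pixel (inf if none)."""
    return min((math.hypot(xx - x, yy - y)
                for yy, row in enumerate(label)
                for xx, v in enumerate(row) if v == 0),
               default=math.inf)


def assign_vectors(labels: Dict[Disp, Label],
                   d0_schedule: Sequence[float]) -> Dict[Tuple[int, int], Disp]:
    """Assign a displacement to each pixel, most reliable pixels first."""
    labels = {d: [row[:] for row in img] for d, img in labels.items()}  # copy
    first = next(iter(labels.values()))
    h, w = len(first), len(first[0])
    assigned: Dict[Tuple[int, int], Disp] = {}
    for d0 in d0_schedule:                       # step 3: d0 shrinks each pass
        # Step 1: distance transform of every label image.
        dist = {d: [[distance_value(img, x, y) for x in range(w)]
                    for y in range(h)]
                for d, img in labels.items()}
        for y in range(h):
            for x in range(w):
                if (x, y) in assigned:
                    continue
                best = max(dist, key=lambda d: dist[d][y][x])
                if dist[best][y][x] > d0:
                    assigned[(x, y)] = best
                    for d in labels:             # step 2: relabel, equation (5)
                        labels[d][y][x] = 1 if d == best else 0
    return assigned


A, B = (0, 0), (1, 0)
labels = {A: [[1, 1, 1, 0, 0]], B: [[0, 0, 0, 1, 1]]}
print(assign_vectors(labels, d0_schedule=[2.5, 1.5, 0.5]))
```

Pixels far from any unmatched pixel are fixed in the early passes, and their relabelling then shapes the distance values seen by the less certain pixels in later passes.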

[0019] In the vector field obtained in this way, the motion vector Δx is uniform within each region R_{x₀}, which makes the field well suited to interpolative approximation by representative motion vectors. This concludes the outline of what Reference 2 discloses.

[0020] A quad-tree is often used with the block matching method for motion vector extraction. When the prediction error of a block is larger than a threshold, the block is split into four, block matching is performed on each of the four blocks, and their prediction errors are examined; any block whose error still exceeds the threshold is split into four again and examined in the same way. The threshold, however, depends on the image. If it is small, splitting proceeds further than necessary, which increases the side information, and the noise sensitivity of small block sizes may cause erroneous motion estimates. If it is large, small regions become hard to extract, and both the reproducibility of the motion vector field and the prediction error deteriorate. Conventional quad-tree division therefore uses a merging procedure in which the image is first divided into small sub-blocks and blocks having the same vector are then merged.

[0021]

[Problems to Be Solved by the Invention] The vector fields obtained by the methods disclosed in References 1 and 2 are dense, with one vector per pixel, and remain accurate in motion boundary regions. However, when such a field is transmitted as inter-frame prediction information in very-low-rate coding, the amount of information is still too large, and this problem had to be solved. With the block matching method, the number of vectors can be reduced by enlarging the block size. This needs no side information and reduces the amount of information easily, but the vector becomes uniform within each block, so the prediction error increases and the reproducibility of the vector field degrades.

[0022] When conventional quad-tree division is used instead, with the image first divided into small sub-blocks and sub-blocks having the same vector then merged, the merging procedure increases the amount of processing, and this problem remained unsolved.

[0023]

[Means for Solving the Problems] The present invention has been made to solve these problems. When blocks are divided by a quad-tree, the one-pixel matching error function e_x(Δx) is first computed for every pixel and a dense per-pixel vector field is obtained, so that the prediction error computed from that vector field can serve as the threshold for block division. As for the side information that accompanies division, the division order is fixed in advance, so that the vector field can be reproduced by transmitting only the hierarchy level of each division.

[0024]

[Operation] Because the one-pixel matching error function e_x(Δx) is computed and a dense vector field is obtained before any block is divided, the prediction error computed from that vector field can be used as the threshold that decides whether to divide. The threshold therefore adapts to the block being divided, which makes possible the block division with the best coding efficiency. Furthermore, because block division can start from the whole image, representative motion vectors can be extracted globally, and the merging procedure that was a problem in the prior art becomes unnecessary. Since the per-pixel error function e_x(Δx) is computed in advance, the threshold can be calculated without recomputing prediction errors. By fixing the quad-tree generation procedure in this way, the side information needed to describe the regions is limited to the hierarchy level, and because the order in which the representative vectors are extracted preserves positional continuity, the amount of information can be reduced further by differential coding that exploits the continuity of the vectors.

[0025]

[Embodiment] FIG. 1 shows the circuit configuration of an embodiment of the present invention. Here, 11 is an error plane calculator, 12 is a memory, 13 is an original vector field generator, 14 is a threshold generator, 15 is a division determiner, 16 is a block selector, 17 is a block matching unit, 18 is a quad-tree generator, and 31 to 45 are signal lines for exchanging various kinds of information. The information carried by signal lines 31 to 45 is as follows.

Signal line 31: the original image I_k(x)
Signal line 32: the reference image I_{k+1}(x)
Signal line 33: the one-pixel matching error function e_x(Δx)
Signal line 34: the one-pixel matching error function e_x(Δx) read from the memory 12
Signal line 35: the vector field V(x)
Signal line 36: the sum of the per-pixel average prediction error e_r and the threshold T_q
Signal line 37: the one-pixel matching error function e_x(Δx) read from the memory 12
Signal line 38: the division instruction
Signal line 39: the block b_{N,M}
Signal line 40: the vector V_{bN,M}
Signal line 41: the per-pixel average prediction error e_{bN,M}
Signal line 42: the one-pixel matching error function e_x(Δx) read from the memory 12
Signal line 43: the block position M(N)
Signal line 44: the representative vector component V_R
Signal line 45: the hierarchy level N

[0026] FIG. 2 shows the principle by which the circuit of FIG. 1 extracts representative vectors using the quad-tree. Part (a) of the figure shows an original vector field 80 containing a circle 82 and a square 83. The broken line 84 is a divided block, and 85 is the portion where the divided block 84 overlaps the square 83. The arrows represent some of the many vectors present in the original vector field 80.

[0027] Part (b) of the figure shows the divided block 84 of (a) and the hatched overlap portion 85. The arrow represents the vector obtained by block matching on the divided block 84, but it does not represent the vector of the overlap portion 85, which shows that the error is large at this hierarchy level of the division.

[0028] Part (c) of the figure shows the divided block 84 of (b) split into four. The vector of the upper-right block, which contains the overlap portion 85, still does not accurately represent the vector of the overlap portion 85, which shows that the error remains large at the hierarchy level of this four-way split.

[0029] In part (d) of the figure, the vector of the block that contains the overlap portion 85 in (c) is not accurate, and its error (the average, over the block containing the overlap portion 85, of the one-pixel matching error function on signal line 37) exceeds the threshold. That block is therefore split into four once more and a vector is obtained for each resulting block. Only at this stage is an accurate vector obtained for the overlap portion 85.

[0030] Part (e) of the figure shows the final division result 90 for the vector field 80 of (a). Division has been carried out until the vectors within each block are nearly uniform, and a circle 92 and a square 93 corresponding to the circle 82 and the square 83 of (a) appear.

[0031] FIGS. 3 to 5 show the operation flow of the circuit of FIG. 1, which is described below with reference to FIG. 1.

[0032] The original image I_k(x) from signal line 31 and the reference image I_{k+1}(x) from signal line 32 are input to the error plane calculator 11, which computes the one-pixel matching error function representing the per-pixel error plane,

$$e_x(\Delta x) = \left| I_{k+1}(x + \Delta x) - I_k(x) \right|$$

and sends it over signal line 33 to the memory 12, where it is stored.

[0033] The original vector field generator 13 reads the one-pixel matching error function e_x(Δx) from the memory 12 over signal line 34, obtains the vector field V(x) of the original image by one-pixel matching, and outputs it on signal line 35 (S1, FIG. 3).

[0034] Conditions are set in advance for each circuit component. In particular, the block division procedure is defined in the quad-tree generator 18.

[0035] FIG. 6 shows an example of the block division procedure of the quad-tree. The hierarchy level of the quad-tree is denoted N, and the current block position, in the order in which the divided blocks are generated, is denoted M(N). N = 0 represents the whole undivided screen, whose block position is M(0) = 0. At hierarchy level N = 1 the whole screen is split into four, and as in the figure the upper left is M(1) = 0, the upper right is M(1) = 1 (not shown), the lower left is M(1) = 2, and the lower right is M(1) = 3. When the error of the divided block M(1) = 1 is at or above the threshold, the hierarchy level becomes N = 2 and the block is split into four again, with the upper left set to M(2) = 0, the upper right to M(2) = 1, the lower left to M(2) = 2, and the lower right to M(2) = 3 (S2). The lowest hierarchy level is N = N_L. The m-th representative vector obtained is denoted V_m, and the level at which V_m was obtained is denoted N_{Vm}.
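The position numbering described here (0 = upper left, 1 = upper right, 2 = lower left, 3 = lower right at every level) determines a block's rectangle from the sequence of positions chosen on the way down. A minimal sketch, assuming a square image whose side is a power of two and a hypothetical `path` list holding M(1), M(2), ... in order:

```python
from typing import Sequence, Tuple


def block_rect(size: int, path: Sequence[int]) -> Tuple[int, int, int]:
    """Return (x0, y0, side) of the block reached by following `path`.

    `path` lists the block positions M(1), M(2), ... from the top level down;
    an empty path is the whole, undivided image (N = 0).
    """
    x0 = y0 = 0
    for m in path:
        size //= 2            # each level halves the block side
        if m in (1, 3):       # right-hand blocks
            x0 += size
        if m in (2, 3):       # lower blocks
            y0 += size
    return x0, y0, size


print(block_rect(16, []))       # whole image
print(block_rect(16, [1]))      # upper-right quarter
print(block_rect(16, [1, 2]))   # lower-left quarter of the upper-right quarter
```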

[0036] First, the hierarchy level is initialized to N = 0, the block position to M(N) = 0, and the vector count to m = 0. The quad-tree generator 18 outputs the current block position M(N) on signal line 43 and the hierarchy level N (N = 0) on signal line 45 (S3). On receiving the block position M(N) and the level N, the block selector 16 determines the block b_{N,M} from which a representative vector is to be extracted and outputs it on signal line 39 (S4).

[0037] FIG. 5 shows the procedure for determining the block b_{N,M} in detail. When the block position is M(N) = 3 and the flag fl, which requests recursive processing of the block determination procedure (fl = 1 means recursion), is "1" (S21 Y, FIG. 5), the hierarchy level is set to N = N − 1 (S22). If N is not 0 (S23 N), the procedure returns to step S21. If N = 0, all division has finished, so the processing ends (S23 Y).

[0038] When the block position is not M(N) = 3 or the flag is not fl = 1 (S21 N), the flag is set to fl = 0 (S25), the block b_{N,M} to be processed is set, and the block selector 16 outputs the block b_{N,M} on signal line 39 and leaves the block-setting subroutine (S26).

[0039] On receiving the block b_{N,M} to be processed over signal line 39, the block matching unit 17 reads the one-pixel matching error function e_x(Δx) from the memory 12 over signal line 42, performs block matching, and outputs the resulting motion vector V_{bN,M} on signal line 40 (S5). The block matching unit 17 and the threshold generator 14 then compute two per-pixel average prediction errors, e_{bN,M} and e_r, from the vector V_{bN,M} and from the vectors V(x) (signal line 35) of the pixels x of the original image I_k(x) that belong to the block b_{N,M} (signal line 39). With S(b_{N,M}) denoting the total number of pixels in the block b_{N,M},

$$e_{bN,M} = \frac{1}{S(b_{N,M})} \sum_{x \in b_{N,M}} e_x(V_{bN,M})$$

$$e_r = \frac{1}{S(b_{N,M})} \sum_{x \in b_{N,M}} e_x(V(x))$$

Here x ranges over all pixels in the block b_{N,M} (signal line 39), e_x(V_{bN,M}) is the prediction error at point x for the vector V_{bN,M}, e_x(V(x)) is the prediction error at point x for the vector V(x), and each sum is taken over all points x in the block b_{N,M}. The per-pixel average prediction error e_{bN,M} is computed by the block matching unit 17 and output on signal line 41. The average prediction error e_r of the original vector field is computed by the threshold generator 14, which receives the block b_{N,M} to be processed (signal line 39), the vector field V(x) (signal line 35), and the one-pixel matching error function e_x(Δx) (signal line 37); the threshold T_q is added to it and the sum is output on signal line 36 (S6).
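Both averages can be read directly from the stored error plane, which is why no prediction error has to be recomputed. The sketch below assumes the error plane is stored as `plane[(dx, dy)][y][x]`, the dense field as `field[y][x]`, and a block as a hypothetical `(x0, y0, side)` tuple; block matching is taken to be the displacement with the smallest average error over the block.

```python
from typing import Dict, List, Tuple

Disp = Tuple[int, int]
ErrorPlane = Dict[Disp, List[List[int]]]   # plane[(dx, dy)][y][x] = e_x(dx, dy)
Field = List[List[Disp]]                   # field[y][x] = V(x), dense vector field
Block = Tuple[int, int, int]               # (x0, y0, side), assumed representation


def block_pixels(block: Block) -> List[Tuple[int, int]]:
    x0, y0, side = block
    return [(x, y) for y in range(y0, y0 + side) for x in range(x0, x0 + side)]


def mean_block_error(plane: ErrorPlane, block: Block, v: Disp) -> float:
    """e_bN,M: average of e_x(v) over the block for one shared vector v."""
    pix = block_pixels(block)
    return sum(plane[v][y][x] for x, y in pix) / len(pix)


def mean_field_error(plane: ErrorPlane, block: Block, field: Field) -> float:
    """e_r: average of e_x(V(x)) over the block, using each pixel's own vector."""
    pix = block_pixels(block)
    return sum(plane[field[y][x]][y][x] for x, y in pix) / len(pix)


def block_vector(plane: ErrorPlane, block: Block) -> Disp:
    """Block matching from the stored error plane: the vector minimising e_bN,M."""
    return min(plane, key=lambda d: mean_block_error(plane, block, d))


A, B = (0, 0), (1, 0)
plane = {A: [[0, 4], [0, 4]], B: [[6, 0], [6, 0]]}
field = [[A, B], [A, B]]
block = (0, 0, 2)
v = block_vector(plane, block)
print(v, mean_block_error(plane, block, v), mean_field_error(plane, block, field))
```

In this toy case the best single vector leaves an average error of 2.0 while the dense field achieves 0.0, so with a small T_q the block would be judged worth dividing.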

[0040] The division determiner 15 compares the per-pixel average prediction error e_{bN,M} received on signal line 41 with e_r + T_q, the per-pixel average prediction error e_r of the original vector field plus the threshold T_q, received on signal line 36. When e_{bN,M} > e_r + T_q and the hierarchy level N has not reached the lowest level N_L (S7 N), the hierarchy level is incremented (N = N + 1), a division instruction is output on signal line 38, the upper-left block of the four-way split is designated as position M(N) = 0, the quad-tree generator 18 divides the block b_{N,M}, and the operation returns to step S4 (S8).

[0041] When e_{bN,M} ≤ e_r + T_q or N = N_L (S7 Y), the quad-tree generator 18 sets the representative vector V_m to V_{bN,M} and its hierarchy level N_{Vm} to N, and outputs the representative vector component V_R on signal line 44 and the hierarchy level N on signal line 45. The index m of the representative vector is then advanced to m + 1 (S9, FIG. 4) and the block position M(N) to M(N) + 1 (S10). If M(N) = 4 (S11 Y), the level is set to N = N − 1 and the flag to fl = 1, indicating that the block determination procedure must recurse (S12). In that case, and also when M(N) has not reached 4 (S11 N), the hierarchy level N is checked (S13). If it is not 0, the operation returns to step S4 (S13 N). If it is 0, all division has finished, and the processing ends.
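The overall extraction loop can be sketched compactly as a recursive depth-first traversal. This is a simplification, not the flag-based flowchart of FIGS. 3 to 5: it assumes that visiting the four sub-blocks in the order upper left, upper right, lower left, lower right reproduces the same emission order. The data layouts (`plane[(dx, dy)][y][x]`, `field[y][x]`), the square power-of-two image, and the names are likewise assumptions for illustration.

```python
from typing import Dict, List, Tuple

Disp = Tuple[int, int]
ErrorPlane = Dict[Disp, List[List[int]]]   # plane[(dx, dy)][y][x] = e_x(dx, dy)
Field = List[List[Disp]]                   # field[y][x] = V(x)


def extract_representatives(plane: ErrorPlane, field: Field, size: int,
                            t_q: float, n_l: int) -> List[Tuple[Disp, int]]:
    """Return (representative vector, hierarchy level) pairs in traversal order."""
    out: List[Tuple[Disp, int]] = []

    def visit(x0: int, y0: int, side: int, level: int) -> None:
        pix = [(x, y) for y in range(y0, y0 + side) for x in range(x0, x0 + side)]

        def mean_err(d: Disp) -> float:
            return sum(plane[d][y][x] for x, y in pix) / len(pix)

        v = min(plane, key=mean_err)       # block matching on the error plane
        e_b = mean_err(v)                  # error with one vector for the block
        e_r = sum(plane[field[y][x]][y][x] for x, y in pix) / len(pix)
        # Split while the block vector is clearly worse than the dense field
        # and the lowest level N_L has not been reached.
        if e_b > e_r + t_q and level < n_l and side > 1:
            half = side // 2
            for oy, ox in ((0, 0), (0, half), (half, 0), (half, half)):
                visit(x0 + ox, y0 + oy, half, level + 1)
        else:
            out.append((v, level))

    visit(0, 0, size, 0)
    return out


# 4x4 example: the upper-right quarter moves by (1, 0), the rest is static.
A, B = (0, 0), (1, 0)
field = [[B if (x >= 2 and y < 2) else A for x in range(4)] for y in range(4)]
plane = {d: [[0 if field[y][x] == d else 10 for x in range(4)] for y in range(4)]
         for d in (A, B)}
print(extract_representatives(plane, field, size=4, t_q=1.0, n_l=2))
```

Because the comparison uses e_r measured on the same block, the effective threshold rises in blocks where even the dense field predicts poorly, which is the adaptive behaviour the text describes.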

[0042]

[Effects of the Invention] As is clear from the above description, according to the present invention the generation order of the quad-tree is fixed in advance, so the side information needed to describe the regions is limited to the hierarchy level. Moreover, because the order in which the representative vectors are extracted preserves positional continuity, the amount of information can be reduced by differential coding that exploits the continuity of the vectors. The effect of the present invention is therefore very large.
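To illustrate why the hierarchy levels alone suffice as side information, the following hypothetical decoder rebuilds the block layout from the sequence of levels, assuming the fixed order upper left, upper right, lower left, lower right and a square power-of-two image. It also shows one simple form of differential coding, in which each vector is replaced by its difference from the previous one. Neither function is specified in the text; both are sketches under these assumptions.

```python
from typing import List, Sequence, Tuple

Disp = Tuple[int, int]


def decode_layout(levels: Sequence[int], size: int) -> List[Tuple[int, int, int]]:
    """Rebuild (x0, y0, side) for each block from the transmitted levels."""
    blocks: List[Tuple[int, int, int]] = []
    pos = 0  # index of the next level to consume

    def visit(x0: int, y0: int, side: int, level: int) -> None:
        nonlocal pos
        if levels[pos] == level:
            # The next vector was emitted at this level: the block is a leaf.
            blocks.append((x0, y0, side))
            pos += 1
        else:
            # The next vector lies deeper, so this block must have been split.
            half = side // 2
            for oy, ox in ((0, 0), (0, half), (half, 0), (half, half)):
                visit(x0 + ox, y0 + oy, half, level + 1)

    visit(0, 0, size, 0)
    return blocks


def differential(vectors: Sequence[Disp]) -> List[Disp]:
    """Replace each vector by its difference from the previous one."""
    out: List[Disp] = []
    prev = (0, 0)  # assumption: the first vector is coded relative to zero
    for v in vectors:
        out.append((v[0] - prev[0], v[1] - prev[1]))
        prev = v
    return out


print(decode_layout([1, 2, 2, 2, 2, 1, 1], size=8))
print(differential([(0, 0), (1, 0), (1, 0), (0, 0)]))
```

Neighbouring blocks in the traversal order tend to share similar motion, so many of the differences are zero or small, which is what makes them cheaper to code.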

[Brief description of the drawings]

[FIG. 1] A circuit diagram showing an embodiment of the present invention.

[FIG. 2] A representative vector extraction diagram explaining the principle by which the circuit of FIG. 1 extracts representative vectors using the quad-tree.

[FIG. 3] A flowchart showing the flow of the representative vector extraction performed with the quad-tree by the circuit of FIG. 1.

[FIG. 4] A flowchart that, together with FIG. 3, shows the flow of the representative vector extraction performed with the quad-tree by the circuit of FIG. 1.

[FIG. 5] A flowchart showing part of the operation of FIG. 3 in detail.

[FIG. 6] A diagram of block division by the quad-tree in the operation of FIG. 3.

[FIG. 7] A diagram of conventional motion estimation based on one-pixel matching.

[FIG. 8] A diagram of the conventional distance transform of a label image.

[Explanation of symbols]

11: error plane calculator
12: memory
13: original vector field generator
14: threshold generator
15: division determiner
16: block selector
17: block matching unit
18: quad-tree generator
31 to 45: signal lines
70: screen
71: circle
79: pixels for which no match was found
80: original vector field
82: circle
83: square
84: divided block
85: overlapping portion
90: final division result of the original vector field 80
92: circle corresponding to circle 82
93: square corresponding to square 83
b_{N,M}: block
d: distance
e_{bN,M}, e_r: average prediction error in units of one pixel
e_x(Δx): error function of one-pixel matching
F: operator
fl: flag
M(1), M(2), M(N): block position
N, N_{Vm}: hierarchical level
N_L: lowest hierarchical level
T_q: threshold
V(x): vector field (or the vector at the position of point x)
V_m: m-th representative vector
x: position

Continuation of the front page
(72) Inventor: Mitsutoshi Hatori, 1-6-24-808 Sengoku, Bunkyo-ku, Tokyo
(56) References cited: JP-A-1-69182 (JP, A)
(58) Fields searched (Int. Cl.6, DB name): H04N 7/24 - 7/68

Claims

(57) [Claims]

1. A motion vector search method comprising: performing a motion vector generation process (11, 12, 13) for obtaining a motion vector field (35) in units of one pixel between frames of a moving image; performing a threshold generation process (14) for obtaining the average prediction error in units of one pixel of the original image, calculated for each block to be processed using the prediction error obtained from the one-pixel-unit motion vector field (35) and the error function (37) of one-pixel matching within the block (39) to be processed, and for generating a threshold (36) that changes according to that average prediction error; performing a block selection process (16) for selecting the block (39) to be processed from the position (43) of the block to be processed and the hierarchical level (45); performing a block matching process (17) in which the block (39) to be processed is designated and a motion vector (40) and an average prediction error (41) in units of one pixel are obtained from the error function (42) of one-pixel matching corresponding to the designated block; and performing quad-tree processing (15, 18) in which, when the average prediction error (41) obtained in the block matching process (17) exceeds the threshold (36), the motion vector field is divided by a quad-tree while a representative motion vector (44) is selected and the hierarchical level (45) of the division is made additional information of the representative vector (44).

2. A motion vector search method comprising: performing a motion vector generation process (11, 12, 13) for obtaining, between frames in moving picture coding, an error function (33, 34, 37, 42) of one-pixel matching and a motion vector field (35) in units of one pixel; performing a block selection process (16) for selecting a block (39) to be processed from the position (43) of the block to be processed and the hierarchical level (45); performing a block matching process (17) in which the block (39) to be processed is designated and a motion vector (40) and an average prediction error (41) in units of one pixel are obtained from the error function (42) of one-pixel matching corresponding to the designated block; performing a threshold generation process (14) for calculating the average of the prediction errors from the error function (37) of one-pixel matching and the one-pixel-unit motion vector field (35) and generating an adaptive threshold (36) according to that average prediction error; performing a division determination process (15) for comparing the average prediction error (41) in units of one pixel with the adaptive threshold (36), determining whether or not to perform quad-tree division, and issuing a division command (38); and performing a quad-tree generation process (18) for receiving the division command (38) and the motion vector (40), outputting the position (43) of the block to be processed and the hierarchical level (45) for quad-tree generation, and outputting the motion vector (40) received at the end of the quad-tree division as a representative motion vector (44).
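The block matching process (17) and the division determination process (15) recited above can be pictured with the following minimal Python sketch. It assumes, as an interpretation and not as a statement of the embodiment, that block matching averages the stored one-pixel error functions over the block for each candidate displacement and keeps the minimum. The names `block_match`, `should_divide`, `error_planes` and `candidates` are hypothetical.

```python
def block_match(block_pixels, error_planes, candidates):
    """Return (motion vector, average prediction error per pixel) for a block.

    error_planes[x][d] is assumed to hold the one-pixel matching error
    e_x(d) of pixel x for candidate displacement d.
    """
    best_vec, best_err = None, float("inf")
    for d in candidates:
        # Average the per-pixel errors of displacement d over the block.
        err = sum(error_planes[x][d] for x in block_pixels) / len(block_pixels)
        if err < best_err:
            best_vec, best_err = d, err
    return best_vec, best_err


def should_divide(avg_err, threshold):
    """Division determination: divide when the block error exceeds the threshold."""
    return avg_err > threshold


# Two pixels that prefer different displacements.
pixels = [(0, 0), (1, 0)]
planes = {
    (0, 0): {(0, 0): 0.0, (1, 0): 4.0},
    (1, 0): {(0, 0): 8.0, (1, 0): 2.0},
}
vector, error = block_match(pixels, planes, [(0, 0), (1, 0)])
```

In this toy case no single displacement suits both pixels, so the block error stays well above the per-pixel errors and a sufficiently low threshold triggers division.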

3. The motion vector search method according to claim 2, wherein the threshold generation process (14) includes a calculation expressed as follows: letting the block (39) to be processed be b_{N,M}, the error function (37) of one-pixel matching be e_x(Δx), the one-pixel-unit motion vector field (35) be V(x), the prediction error of the motion vector field V(x) be e_x(V(x)), an arbitrary threshold constant be T_q, x be the position of a pixel included in the block b_{N,M} to be processed, the total number of pixels included in the block b_{N,M} to be processed be S(b_{N,M}), and the adaptive threshold (36) be e_r + T_q, then e_r = (1 / S(b_{N,M})) Σ e_x(V(x)), where Σ denotes the cumulative sum over the positions x of the pixels included in the block b_{N,M} to be processed.
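The calculation of the adaptive threshold e_r + T_q can be written as a few lines of Python. This is a sketch only: the function name `adaptive_threshold`, the dictionary representation of V(x), and the toy error function used in the example are assumptions made for illustration.

```python
def adaptive_threshold(block_pixels, vector_field, error_fn, t_q):
    """Return e_r + T_q for one block.

    block_pixels : positions x contained in the block b_{N,M}
    vector_field : mapping x -> V(x), the one-pixel-unit motion vector
    error_fn     : error_fn(x, v) standing in for e_x(v)
    t_q          : the threshold constant T_q
    """
    # e_r = (1 / S(b_{N,M})) * sum of e_x(V(x)) over the pixels of the block
    total = sum(error_fn(x, vector_field[x]) for x in block_pixels)
    e_r = total / len(block_pixels)
    return e_r + t_q


# Toy example: the error of a pixel is |vx| + |vy| of its own vector.
pixels = [(0, 0), (1, 0), (0, 1), (1, 1)]
field = {(0, 0): (1, 0), (1, 0): (1, 1), (0, 1): (2, 1), (1, 1): (0, 2)}
threshold = adaptive_threshold(pixels, field, lambda x, v: abs(v[0]) + abs(v[1]), 0.5)
```

Because e_r is recomputed for every block, the threshold follows the local prediction error, and T_q alone sets how much worse a single block vector may be before the block is divided.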

4. A motion vector search device comprising: motion vector generating means (11, 12, 13) for obtaining a motion vector field (35) in units of one pixel between frames of a moving image; threshold generating means (14) for obtaining the average prediction error in units of one pixel of the original image, calculated for each block to be processed using the prediction error obtained from the one-pixel-unit motion vector field (35) and the error function (37) of one-pixel matching within the block (39) to be processed, and for generating a threshold (36) that changes according to that average prediction error; block selecting means (16) for selecting the block (39) to be processed from the position (43) of the block to be processed and the hierarchical level (45); block matching means (17) for obtaining, with the block (39) to be processed designated, a motion vector (40) and an average prediction error (41) in units of one pixel from the error function (42) of one-pixel matching corresponding to the designated block; and quad-tree means (15, 18) for, when the average prediction error (41) obtained by the block matching means (17) exceeds the threshold (36), dividing the motion vector field by a quad-tree while selecting a representative motion vector (44) and making the hierarchical level (45) of the division additional information of the representative vector (44).

5. A motion vector search device comprising: motion vector generating means (11, 12, 13) for obtaining, between frames in moving picture coding, an error function (33, 34, 37, 42) of one-pixel matching and a motion vector field (35) in units of one pixel; block selecting means (16) for selecting a block (39) to be processed from the position (43) of the block to be processed and the hierarchical level (45); block matching means (17) for obtaining, with the block (39) to be processed designated, a motion vector (40) and an average prediction error (41) in units of one pixel from the error function (42) of one-pixel matching corresponding to the designated block; threshold generating means (14) for calculating the average of the prediction errors from the error function (37) of one-pixel matching and the one-pixel-unit motion vector field (35) and generating an adaptive threshold (36) according to that average prediction error; division determining means (15) for comparing the average prediction error (41) in units of one pixel with the adaptive threshold (36), determining whether or not to perform quad-tree division, and issuing a division command (38); and quad-tree generating means (18) for receiving the division command (38) and the motion vector (40), outputting the position (43) of the block to be processed and the hierarchical level (45) for quad-tree generation, and outputting the motion vector (40) received at the end of the quad-tree division as a representative motion vector (44).

6. The motion vector search device according to claim 5, wherein the threshold generating means (14) performs a calculation expressed as follows: letting the block (39) to be processed be b_{N,M}, the error function (37) of one-pixel matching be e_x(Δx), the one-pixel-unit motion vector field (35) be V(x), the prediction error of the motion vector field V(x) be e_x(V(x)), an arbitrary threshold constant be T_q, x be the position of a pixel included in the block b_{N,M} to be processed, the total number of pixels included in the block b_{N,M} to be processed be S(b_{N,M}), and the adaptive threshold (36) be e_r + T_q, then e_r = (1 / S(b_{N,M})) Σ e_x(V(x)), where Σ denotes the cumulative sum over the positions x of the pixels included in the block b_{N,M} to be processed.