JPH09182082A

JPH09182082A - Method and apparatus for motion compensation predictive coding of moving image

Info

Publication number: JPH09182082A
Application number: JP33720595A
Authority: JP
Inventors: Hirotaka Jiyosawa; 裕尚如沢; Kazuto Kamikura; 一人上倉
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1995-12-25
Filing date: 1995-12-25
Publication date: 1997-07-11

Abstract

(57)【要約】【課題】平行移動のみならず回転、拡大、縮小、せん
断、等の変形を補償することによりオーバーラップ動き
補償の予測性能を向上させる動画像の動き補償予測符号
化方法と装置の提供。【解決手段】符号化対象画像を小領域に分割し、直前
の位置情報である参照画像と比較して小領域毎の動きパ
ラメータをアフィン変換パラメータを使用することによ
り、平行移動のみならず、拡大、縮小、せん断、回転等
各種の変形移動の動きを計算して予測画像を作成し、そ
れぞれの差分を量子化して符号化するので、動画像の補
償符号化が可能になる。 (57) Abstract: A motion compensation predictive coding method for a moving image, which improves prediction performance of overlap motion compensation by compensating not only translation but also deformation such as rotation, enlargement, reduction, shearing, and the like. Providing equipment. An image to be coded is divided into small regions, compared with a reference image which is position information immediately before, and the motion parameter for each small region is used as an affine transformation parameter, so that not only parallel movement but enlargement is performed. , Predictive images are created by calculating motions of various deformation movements such as reduction, shearing, and rotation, and the respective differences are quantized and coded, so that it is possible to perform compensation coding of moving images.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、画像通信・画像記
録等に利用される画像信号のディジタル圧縮符号化方法
とその装置とに関し、特に動画像の符号化における動き
補償部分の符号化と装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for digitally compressing and encoding an image signal used for image communication, image recording, etc. Regarding

【０００２】[0002]

【従来の技術】動画像のディジタル圧縮符号化におい
て、動画像信号の時間冗長性を抑制する手段として、動
き補償フレーム間予測が用いられる。このフレーム間予
測では、通常、符号化対象画像を１６画素Ｘ１６ライン
等のブロックに区切り、各ブロック毎に参照画像との間
の動き量（動きベクトル）を検出し、参照画像を動きベ
クトル分シフトして生成した予測画像と符号化対象画像
との差分（動き補償予測誤差）信号を符号化する。動き
補償フレーム間予測により動画像のフレーム間相関は飛
躍的に向上し、単純フレーム間予測に比べ大幅な情報圧
縮が得られる。さらに、動き補償予測誤差信号に対して
離散コサイン変換（ＤＣＴ：Discrete Cosine Transfor
m)や、サブバンド分割を施すことにより、空間方向の冗
長性も抑圧され、一層の情報圧縮が図られた。このた
め、テレビ電話、会議用ビデオ符号化（ＩＴＵ−Ｔ勧告
Ｈ．２６１）、蓄積用ビデオ符号化（ＩＳＯ／ＩＥＣ
１１１７２（ＭＰＥＧ−１））、蓄積、放送、通信向け
汎用ビデオ符号化（ＩＳＯ／ＩＥＣ１３８１８（ＭＰ
ＥＧ−２））等では、動き補償フレーム間予測による残
差信号をＤＣＴ符号化するハイブリッド符号化構成が採
用されている。以下、これらを代表して、Ｈ．２６１の
符号化アルゴリズムを図３を用いて簡単に説明する。2. Description of the Related Art In digital compression encoding of moving images, motion-compensated interframe prediction is used as a means for suppressing temporal redundancy of moving image signals. In this inter-frame prediction, an encoding target image is usually divided into blocks such as 16 pixels × 16 lines, a motion amount (motion vector) between each block and the reference image is detected, and the reference image is shifted by the motion vector. A difference (motion compensation prediction error) signal between the prediction image generated in this way and the encoding target image is encoded. The motion-compensated inter-frame prediction dramatically improves the inter-frame correlation of a moving image, and provides a great amount of information compression as compared with the simple inter-frame prediction. Furthermore, a DCT (Discrete Cosine Transfor) is applied to the motion compensation prediction error signal.
By applying m) and sub-band division, the spatial redundancy is also suppressed and further information compression is achieved. Therefore, videophone, video coding for conference (ITU-T Recommendation H.261), video coding for storage (ISO / IEC
11172 (MPEG-1), general purpose video coding for storage, broadcasting and communication (ISO / IEC 13818 (MP
In EG-2)) and the like, a hybrid coding configuration for DCT-coding a residual signal by motion-compensated interframe prediction is adopted. Hereinafter, as a representative of these, H.264. The encoding algorithm of H.261 will be briefly described with reference to FIG.

【０００３】まず、符号化対象画像１は、動きベクトル
検出部３１に入力され、１６画素×１６ラインのマクロ
ブロックと称する正方形のブロックに分割される。動き
ベクトル検出部３１では、符号化対象画像１の中の各マ
クロブロック毎に、参照画像との間の動き量を検出し、
得られた動きベクトル３２をブロック動き補償部４１に
送る。ここで、各マクロブロックの動きベクトルは、基
準方向ｘおよびそれに垂直な方向ｙの平行移動量として
求められる。すなわち、参照画像においては、着目のマ
クロブロックとのマッチング度が最も高いブロックの座
標と着目のマクロブロックの座標との変位として表され
る。動きベクトルの探索範囲は、着目のマクロブロック
の座標とその周囲の±１５画素×±１５ラインに制限さ
れる。First, the image to be coded 1 is inputted to the motion vector detecting section 31 and divided into square blocks called macroblocks of 16 pixels × 16 lines. The motion vector detection unit 31 detects the amount of motion with respect to the reference image for each macroblock in the encoding target image 1,
The obtained motion vector 32 is sent to the block motion compensator 41. Here, the motion vector of each macroblock is obtained as a translation amount in the reference direction x and the direction y perpendicular thereto. That is, in the reference image, it is represented as a displacement between the coordinates of the block having the highest degree of matching with the macroblock of interest and the coordinates of the macroblock of interest. The motion vector search range is limited to the coordinates of the macroblock of interest and ± 15 pixels × ± 15 lines around it.

【０００４】次に、ブロック動き補償部４１では、各マ
クロブロック毎に動きベクトル３２分シフトした座標に
位置する局部復号画像６を当該ブロックの予測値とし、
動き補償予測画像９を生成する。ここで得られる動き補
償予測画像９は符号化対象画蔵王１と共に減算器１０に
入力される。両者の差分すなわち動き補償予測誤差１１
は、ＤＣＴ／量子化部４２においてＤＣＴ変換され、さ
らに量子化されて圧縮差分データ１３となる。ここで、
ＤＣＴのブロックサイズ８×８である。圧縮差分データ
１３（量子化インデックス）は差分データ符号化部１４
においてデータ圧縮され、差分画像符号化データ１５と
なる。一方、動きベクトル３２は動きベクトル符号化部
３４において符号化され、得られた動きベクトル符号化
データ３５は差分画像符号化データ１５と共に多重化部
２２にて多重化され、多重化データ２３として伝送され
る。Next, in the block motion compensating section 41, the locally decoded image 6 located at the coordinate shifted by 32 motion vectors for each macroblock is used as the prediction value of the block,
A motion compensation predicted image 9 is generated. The motion-compensated predicted image 9 obtained here is input to the subtractor 10 together with the encoding target image store King 1. Difference between the two, that is, motion compensation prediction error 11
Is subjected to DCT conversion in the DCT / quantization unit 42 and further quantized to become compressed difference data 13. here,
The block size of DCT is 8 × 8. The compressed difference data 13 (quantization index) is stored in the difference data encoding unit 14
In (1), the data is compressed and becomes the differential image coded data 15. On the other hand, the motion vector 32 is coded by the motion vector coding unit 34, and the obtained motion vector coded data 35 is multiplexed by the multiplexing unit 22 together with the difference image coded data 15 and transmitted as multiplexed data 23. To be done.

【０００５】なお、復号器と同じ復号画像と符号器内で
も得るため、圧縮差分データ１３（量子化インデック
ス）は逆量子化／逆ＤＣＴ部４３で量子化代表値に戻さ
れ、さらに逆ＤＣＴ変換された後、復号差分画像１７と
なる。復号差分画像１７と動き補償画像９は加算器１８
で加算され、局部復号画像１９となる。この局部復号画
像１９はフレームメモリ５に蓄積され、次のフレームの
符号化時に参照画像として用いられる。Since the same decoded image and encoder are obtained in the decoder, the compressed difference data 13 (quantization index) is returned to the quantized representative value by the inverse quantization / inverse DCT unit 43, and the inverse DCT conversion is performed. Then, the decoded difference image 17 is obtained. The decoded difference image 17 and the motion compensation image 9 are added by an adder 18
Are added to form a locally decoded image 19. The locally decoded image 19 is stored in the frame memory 5 and is used as a reference image when the next frame is encoded.

【０００６】[0006]

【発明が解決しようとする課題】上述した従来の動き補
償予測方法は、１６画素×１６ライン等のブロック毎に
１組の動きベクトルを与え、各ブロック単位に平行移動
量を補償するものである。The above-described conventional motion compensation prediction method is to provide a set of motion vectors for each block such as 16 pixels × 16 lines and to compensate the translation amount for each block. .

【０００７】したがって、隣接するブロックの動きベク
トルが異なる場合、予測画像中にブロック状の不連続歪
みが発生する。動きの激しい部分では、この不連続歪み
が特に顕著となり、予測誤差画像の符号化に十分な符号
量を割り当てることができない低レート符号化時には、
視覚的に大きな妨害となる。さらに、ブロック単位に求
めた動きベクトルにより動き補償を行った場合、プロッ
ク中心部の予測誤差は小さいが、ブロック境界に近付く
につれて予測誤差が大きくなる。Therefore, when the motion vectors of adjacent blocks are different, block-like discontinuous distortion occurs in the predicted image. This discontinuous distortion becomes particularly noticeable in a portion with a lot of movement, and at the time of low-rate encoding in which a sufficient code amount cannot be allocated for encoding the prediction error image,
It is a great visual obstacle. Furthermore, when motion compensation is performed using the motion vector obtained in block units, the prediction error at the center of the block is small, but the prediction error increases as the block boundary approaches.

【０００８】上記問題点を解決するための一手段とし
て、予測ブロックを互いにオーバーラップさせ、各オー
バーラップブロックの重み加算により予測画像を生成す
るオーバーラップ動き補償方式が提案されている。本方
式は、M.Ohta等による"EntropyCoding for Waveleet Tr
ansform of Image and Its Application for Motion Pi
cture Coding"(SPIE Symposium on Visial Communicati
ons and Image Processing '91,pp. 456, 466, 1991
１１月)と、H. Watanabe等による"Windowed Motion Com
pensation"(SPIE Symposium on Visial Communications
and Image Processing '91, pp. 582-589, 1991 １１
月)で同時に提案されたものである。As a means for solving the above problems, an overlap motion compensation method has been proposed in which prediction blocks are overlapped with each other and a weighted sum of each overlap block is used to generate a prediction image. This method is based on "Entropy Coding for Waveleet Tr
ansform of Image and Its Application for Motion Pi
cture Coding "(SPIE Symposium on Visial Communicati
ons and Image Processing '91, pp. 456, 466, 1991
November) and "Windowed Motion Com by H. Watanabe and others.
pensation "(SPIE Symposium on Visial Communications
and Image Processing '91, pp. 582-589, 1991 11
It was proposed at the same time.

【０００９】さらに、１９９５年１１月に正式に勧告さ
れる低レート用符号化ＩＴＵ−ＴＨ．２６３にも採用さ
れている。以下、オーバーラップ動き補償を、図４、図
５により簡単に説明する。Further, the low-rate coding ITU-TH. It is also used in 263. The overlap motion compensation will be briefly described below with reference to FIGS.

【００１０】図４に示すように、符号化対象画像１を１
６画素×１６ラインのマクロブロックに分割し、動きベ
クトル検出部３１で各マクロブロック毎の動きベクトル
３２を求める。次に、オーバーラップ動き補償器３３に
おいて、当該マクロブロックの動きベクトルと、その周
囲の３つのマクロブロックの動きベクトル、合わせて４
本の動きベクトルを用い、４つの予測値の重み付け加算
平均をとることにより、最終的な予測値を計算する。以
下に、ｊ行ｉ列目のマクロブロックＢ_i内の座標（ｘ、
ｙ）に位置する画素の予測値を計算する手順を図６によ
り詳しく述べる。As shown in FIG. 4, the encoding target image 1 is set to 1
It is divided into macroblocks of 6 pixels × 16 lines, and the motion vector detection unit 31 obtains a motion vector 32 for each macroblock. Next, in the overlap motion compensator 33, the motion vector of the macroblock in question and the motion vectors of the three macroblocks around it are combined into a total of 4
The final predicted value is calculated by using the motion vector of the book and taking the weighted average of the four predicted values. Hereinafter, j th row and the i-th column of the macroblock B _i in the coordinates (x,
The procedure for calculating the predicted value of the pixel located at y) will be described in detail with reference to FIG.

【００１１】１．左上のマクロブロックＢ_i-1,j-1の動
きベクトル（Ｖｘ_i-1,j-1，Ｖｙ_i-1, _j-1）を用い、現フ
レームの画素Ｐ（ｘ、ｙ）に対応する１つ目の予測値
Ｐ’（ｘ＋Ｖｘ_i-1,j-1，ｙ＋Ｖｙ_i-1,j-1）を求める。1. The motion vector (Vx _{i-1, j-1} , Vy _i-1, _j-1 ) of the upper left macroblock B _{i-1, j-1} is used to correspond to the pixel P (x, y) of the current frame. The first predicted value P '(x + Vx _{i-1, j-1} , y + Vy _{i-1, j-1} ) is obtained.

【００１２】２．直上マクロブロックＢ_i,j-1の動きベ
クトル（Ｖｘ_i,j-1，Ｖｙ_i,j-1）を用い、現フレームの
画素Ｐ（ｘ、ｙ）に対する２つ目の予測値Ｐ'（ｘ＋Ｖ
ｘ_i,j _-1，ｙ＋Ｖｙ_i,j-1）を求める。2. Using the motion vector (Vx _{i, j-1} , Vy _{i, j-1} ) of the macroblock B _{i, j-1} immediately above, the second prediction value P ′ (for the pixel P (x, y) of the current frame is x + V
_{_{x i, j -1, y +}} Vy i, j-1) obtained.

【００１３】３．直左マクロブロックＢ_i-1,jの動きベ
クトル（Ｖｘ_i-1,j，Ｖｙ_i-1,j）を用い、現フレームの
画素Ｐ（ｘ、ｙ）に対する３つ目の予測値Ｐ'（ｘ＋Ｖ
ｘ_i-1 _,j，ｙ＋Ｖｙ_i-1,j）を求める。3. Using the motion vector (Vx _{i-1, j} , Vy _{i-1, j} ) of the immediate left macroblock B _{i-1, j} , the third prediction value P ′ for the pixel P (x, y) of the current frame (X + V
x _i−1 _{, j} , y + Vy _{i−1, j} ) is obtained.

【００１４】４．当該マクロブロックＢ_i,jの動きベク
トル（Ｖｘ_i,j，Ｖｙ_i,j）を用い、現フレームの画素Ｐ
（ｘ、ｙ）に対する４つ目の予測値Ｐ'（ｘ＋Ｖｘ_i,j，
ｙ＋Ｖｙ_i、j）を求める。4. Using the motion vector (Vx _{i, j} , Vy _{i, j} ) of the macroblock B _{i, j} , the pixel P of the current frame
The fourth predicted value P ′ (x + Vx _{i, j} , for (x, y)
y + Vy _{i, j} ) is obtained.

【００１５】５．以上の４つの予測値それぞれに対して
重み係数Ｗ_i-1,j-1，（ｘ，ｙ），Ｗ_i,j-1（ｘ，ｙ），
Ｗ_i-1,j（ｘ，ｙ），Ｗ_i,jを掛け、これらの総和を
画素Ｐ（ｘ、ｙ）に対する予測値Ｐ”（ｘ、ｙ）とす
る。すなわち5. Weighting factors W _{i-1, j-1} , (x, y), W _{i, j-1} (x, y), for each of the above four predicted values
W _{i-1, j} (x, y) and W _{i, j} are multiplied, and the sum of these is taken as the predicted value P ″ (x, y) for the pixel P (x, y).

【００１６】[0016]

【数１】となる。但し、Ｗ_i-1,j-1（ｘ，ｙ）＋Ｗ_i,j-1（ｘ，ｙ）＋Ｗ_i-1,j（ｘ，ｙ）＋Ｗ_i,j（ｘ，ｙ）＝１（２）である。４つの予測値に対する重み係数は、例えばＩＴ
Ｕ一ＴＨ．２６３では，図７のように定義されてい
る。Ｈ２６３では、当該ブロックの２行３列目の画素
の予測値を求める際には、図おり、当該ブロックＢ_i,j
の動きベクトル（Ｖｘ_i,j，Ｖｙ_i,j）が指すＰ’（ｘ＋
Ｖｘ_i,j，ｙ＋Ｖｙ_i,j）ｎｉ対する重み係数Ｗ
_i,j（ｘ，ｙ）は，５／８となる。また、図７（ｂ）よ
り、直上ブロックＢ_i、j _ー1の動きベクトル（Ｖｘ_i,j-1，
Ｖｙ_i,j-1）が指すＰ'（ｘ＋Ｖｘ_i,j-1，ｙ＋Ｖ
ｙ_i,j-1）に対する重み係数Ｗ_i、jー1（ｘ，ｙ）は２／８
となる。同様に、図７（ｃ）より直左ブロックＢ_i-1,y
の動きベクトル（Ｖｘ_i-1,j，Ｖｙ_i-1,j）が指すＰ’
（ｘ＋Ｖｘ_i-1,j，ｙ＋Ｖｙ_i-1,j）に対する重み係数Ｗ
_i-1,j（ｘ，ｙ）は，１／８となる。なお、Ｈ．２６３
では、当該ブロックの斜め方向に位置するマクロブロッ
クの動きベクトルを用いた予測は行わない。すなわち、Ｗ_i-1,j-1（ｘ，ｙ）＝０（３）である。[Equation 1] Becomes However, W _{i-1, j-1} (x, y) + W _{i, j-1} (x, y) + W _{i-1, j} (x, y) + W _{i, j} (x, y) = 1 (2 ) Is. The weighting factors for the four predicted values are, for example, IT
U-TH. In H.263, it is defined as shown in FIG. In H 263, when the predicted value of the pixel in the second row and third column of the block is calculated _,
Of the motion vector (Vx _{i, j} , Vy _{i, j} ) of P '(x +
Vx _{i, j} , y + Vy _{i, j} ) ni weighting coefficient W
_{i, j} (x, y) becomes 5/8. Further, from FIG. 7 (b), the immediately above block B _i, the motion vector of the _j _{over _{1 (Vx i, j-1}} ,
Py (x + Vx _{i, j-1} , y + V) indicated by Vy _{i, j-1} )
The weighting coefficient W _{i, j-1} (x, y) for y _{i, j-1} ) is 2/8
Becomes Similarly, as shown in FIG. 7C, the block on the immediate left B _{i-1, y}
Of the motion vector (Vx _{i-1, j} , Vy _{i-1, j} ) of P
Weighting coefficient W for (x + Vx _{i-1, j} , y + Vy _{i-1, j} )
_{i-1, j} (x, y) becomes 1/8. In addition, H. 263
Then, the prediction using the motion vector of the macroblock located in the diagonal direction of the block is not performed. That is, W _{i-1, j-1} (x, y) = 0 (3).

【００１７】図６に示すように、Ｐ'（ｘ＋Ｖｘ_i,j，ｙ
＋Ｖｙ_i,j）を除いたＰ'（ｘ＋Ｖｘ _i-1,j-1，ｙ＋Ｖｙ
_i-1,j-1）、Ｐ'（Ｖｘ_i,j-1，ｙ＋Ｖｙ_i,j-1）、Ｐ'
（ｘ＋Ｖｘ_i-1,j，ｙ＋Ｖｙ_i-1,j）の３つは、それぞれ
マクロブロックＢ_i-1,j-1、Ｂ_i-1 _,、，Ｂ_i,j-1が対応す
るブロックの外側に位置している。つまり、マクロブロ
ックの外側に４画素づつオーバーラップさせた２４画素
×２４ラインのブロックを用いて予測していることとな
る。オーバーラップ動き補償と称されるのはこのためで
ある。As shown in FIG. 6, P '(x + Vx_{i, j}, Y
+ Vy_{i, j}) Is excluded, P '(x + Vx _{i-1, j-1}, Y + Vy
_{i-1, j-1}), P '(Vx_{i, j-1}, Y + Vy_{i, j-1}), P '
(X + Vx_{i-1, j}, Y + Vy_{i-1, j}) Are three
Macro block B_{i-1, j-1}, B_i-1 _,,, B_{i, j-1}Corresponds to
It is located outside the block. In other words, Macroblo
24 pixels with 4 pixels overlapping each outside
It is assumed that prediction is performed using blocks of × 24 lines.
You. This is why it is called overlap motion compensation.
is there.

【００１８】以上は、（ｘ、ｙ）がＢ_i,jの左上にある
場合の例である。例えば、（ｘ、ｙ）がＢ_i,jの右上に
ある場合は、直上マクロブロックＢ_i,j-1、右上マクロ
ブロックＢ_i+1,j-1、当該マクロブロックＢ_i,jの動きベ
クトルを用いる。同様に、（ｘ、ｙ）がＢ_i,jの左下に
ある場合には、直左マクロブロックＢ_i-1,j、当該マク
ロブロックＢ_i,j、左下マクロブロックＢ_i-1,j+1、直下
マクロブロックＢ_i,j+1を用い、（ｘ、ｙ）がＢ_i,jの右
下にある場合は、当該マクロブロックＢ_i,j直右マクロ
ブロックＢ_i+1,j、直下マクロブロックＢ_i,j+1、右下マ
クロブロックＢ_i+1j ₊₁を用いる。The above is an example of the case where (x, y) is at the upper left of B _{i, j} . For example, when (x, y) is at the upper right of B _{i, j} , the macroblock B _{i, j-1} immediately above, the macroblock B _{i + 1, j-1 at the} upper right _{, and} the motion of the macroblock B _{i, j} . Use a vector. Similarly, when (x, y) is at the lower left of B _{i, j} , the immediate left macroblock B _{i-1, j} , the macroblock B _{i, j} , and the lower left macroblock B _{i-1, j +. 1} , using the immediate lower macroblock B _{i, j + 1} and (x, y) is at the lower right of B _{i, j} , the macroblock B _{i, j} immediately right macroblock B _{i + 1, j} , The immediately lower macroblock B _{i, j + 1} and the lower right macroblock B _{i + 1j} ₊₁ are used.

【００１９】しかし、前記の動き補償予測方法は、各ブ
ロックの平行移動のみを補償するものである。したがっ
て、被写体の回転、拡大・縮小等の変形を補償すること
は不可能である。すなわち、物体の変形を伴う場合に
は、予測効率が大幅に低下するという問題がある。However, the above motion compensation prediction method compensates only the parallel movement of each block. Therefore, it is impossible to compensate for the deformation of the subject such as rotation and enlargement / reduction. That is, when the object is deformed, there is a problem that the prediction efficiency is significantly reduced.

【００２０】本発明の目的は、上記問題点を解決し、回
転、拡大・縮小等の変形を補償することによりオーバー
ラップ動き補償の予測性能を一層向上させる動画像の動
き補償予測符号化方法とその装置を提供することであ
る。An object of the present invention is to solve the above problems and to provide a motion compensation predictive coding method for a moving image, which further improves the prediction performance of overlap motion compensation by compensating for deformations such as rotation and enlargement / reduction. It is to provide the device.

【００２１】[0021]

【課題を解決するための手段】本発明の動画像の動き補
償予測符号化方法は、符号化対象画像を複数の小領域に
分割し、符号化対象画像と予測参照画像との間の動き量
を前記小領域毎に求め、小領域内の各画素の予測値を求
めるにあたり、当該小領域の動きベクトルが指す位置の
濃淡値と、隣接小領域の動きベクトルが指す位置の濃淡
値の各々に重み係数を掛け、これらを加算した値を予測
値とすることによって予測画像を生成し、該予測画像と
符号化対象画像との差分を符号化する動画像の動き補償
予測符号化方法において、前記小領域の動きが、基準と
する方向（ｘ）およびそれに垂直な方向（ｙ）への平行
移動、回転、拡大、縮小、せん断、変形のうちの少なく
とも２つを表現する、それぞれの動きパラメータにより
記述される手順を有している。A motion-compensated predictive coding method for a moving picture according to the present invention divides a coding target image into a plurality of small areas, and calculates a motion amount between the coding target image and a prediction reference image. Is calculated for each of the small areas, and in obtaining the predicted value of each pixel in the small area, the gray value of the position indicated by the motion vector of the small area and the gray value of the position indicated by the motion vector of the adjacent small area are respectively set. A motion-compensated predictive coding method for a moving image, wherein a predictive image is generated by multiplying a weighting coefficient, and a value obtained by adding these is used as a predictive value, and a difference between the predictive image and an encoding target image is encoded. Depending on the respective motion parameters, the movement of the small area expresses at least two of translation, rotation, enlargement, reduction, shearing, and deformation in the reference direction (x) and the direction (y) perpendicular thereto. The steps described It is.

【００２２】本発明の動画像の動き補償予測符号化装置
は、符号化対象画像を複数の小領域に分割し、符号化対
象画像と予測参照画像との間の動き量を前記小領域毎に
求め、小領域内の各画素の予測値を求めるにあたり、当
該小領域の動きベクトルが指す位置の濃淡値と、隣接小
領域の動きベクトルが指す位置の濃淡値の各々に重み係
数を掛け、これらを加算した値を予測値とすることによ
って予測画像を生成し、該予測画像と符号化対象画像と
の差分を符号化する動画像の動き補償予測符号化装置に
おいて、前記小領域の動きを、基準とする方向（ｘ）お
よびそれに垂直な方向（ｙ）への平行移動、回転、拡
大、縮小、変形のうちの少なくとも２つを表現する、そ
れぞれの動きパラメータにより記述する手段を有してい
る。The motion-compensated predictive coding apparatus for a moving image according to the present invention divides a coding target image into a plurality of small regions, and calculates a motion amount between the coding target image and the prediction reference image for each of the small regions. When obtaining the predicted value of each pixel in the small area, multiply the gray value of the position indicated by the motion vector of the small area and the gray value of the position indicated by the motion vector of the adjacent small area by a weighting factor, In the motion-compensated predictive coding apparatus for a moving image, which generates a predicted image by setting a value obtained by adding the predicted value and codes a difference between the predicted image and the image to be coded, It has means for describing at least two of translation, rotation, enlargement, reduction, and deformation in a reference direction (x) and a direction (y) perpendicular to the reference direction by each motion parameter. .

【００２３】また、本発明の動画像の動き補償予測符号
化装置の一実施態様は、符号化対象画像を予め決められ
ている小領域にに分割する手段と、前記各小領域毎の直
前の状態の画像を参照画像として再生する手段と、各少
領域毎の動きベクトルを前記参照画像と比較して検出す
る手段と、該動きベクトルを動きの状態によって成分別
に分解し前記参照画像に最もマッチング度が高い小領域
を選出して各小領域毎の変位とする手段と当該小領域の
動きベクトルが指す位置の濃淡値と、隣接小領域の動き
ベクトルが指す位置の濃淡値の各々に重み係数を掛け、
これらを加算した値を予測値とすることによって各小領
域毎の補償予測値を生成する手段と、各小領域毎の補償
予測値と符号化対象画像との差から動き補償予測誤差を
算出する手段と、該動き補償予測誤差の空間冗長度を圧
縮する手段と、量子化された圧縮差分を符号化する手段
と、前記小領域毎の動きベクトルを符号化する手段と、
前記符号化された差分データおよび前記動きベクトル符
号化データとを多重化して出力する手段を有する、動画
像の動き補償予測符号化装置において、前記各小領域毎
の補償予測を生成する手段が、平行移動、拡大、縮小、
反転、せん断、回転、変形のうちの少なくとも２つを表
現する動きパラメータにより動き補償予測値を生成する
ものである。Further, an embodiment of the motion compensation predictive coding apparatus for a moving picture according to the present invention comprises means for dividing a coding target picture into predetermined small areas, and means for immediately preceding each small area. Means for reproducing an image of a state as a reference image, means for detecting a motion vector of each small area by comparing it with the reference image, and the motion vector being decomposed into components according to the motion state to best match the reference image. A means for selecting a small area having a high degree of displacement and a displacement for each small area, a gray value of a position indicated by a motion vector of the small area, and a weighting coefficient for each gray value of a position indicated by a motion vector of an adjacent small area. Multiply
A means for generating a compensation prediction value for each small area by using a value obtained by adding these as a prediction value, and a motion compensation prediction error is calculated from the difference between the compensation prediction value for each small area and the image to be encoded. Means, means for compressing the spatial redundancy of the motion compensation prediction error, means for encoding the quantized compression difference, and means for encoding the motion vector for each small region,
In the motion compensation predictive coding apparatus for a moving image, having means for multiplexing and outputting the coded difference data and the motion vector coded data, a means for generating the compensation prediction for each of the small regions, Translation, enlargement, reduction,
The motion compensation prediction value is generated by a motion parameter expressing at least two of inversion, shearing, rotation, and deformation.

【００２４】また、前記動き補償予測値を生成する手段
がアフィン変換パラメータを使用する計算によるもの
が、好ましい態様である。In a preferred mode, the means for generating the motion-compensated predicted value is calculated by using an affine transformation parameter.

【００２５】[0025]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面を参照して説明する。Next, embodiments of the present invention will be described with reference to the drawings.

【００２６】図１は本発明の動画像の動き補償予測符号
化方法が適用された符号化装置のブロック図である。FIG. 1 is a block diagram of a coding device to which the motion compensation predictive coding method of the present invention is applied.

【００２７】この符号化装置では、始めに符号化対象画
像１が動きモデル２と共に動きパラメータ検出部３に入
力され、各小領域毎の動きパラメータ４が求められる。
ここで、小領域の大きさと形状としては、一般的には１
６画素×１６ライン等の矩形ブロックであるが、必ずし
もこれによる必要はない。In this encoding apparatus, the image to be encoded 1 is first input to the motion parameter detector 3 together with the motion model 2 and the motion parameter 4 for each small area is obtained.
Here, the size and shape of the small area is generally 1
It is a rectangular block of 6 pixels × 16 lines, but it is not always necessary.

【００２８】動きモデル２の例としては、射影変換があ
る。射影変換の式を以下に示す。An example of the motion model 2 is projective transformation. The formula of the projective transformation is shown below.

【００２９】[0029]

【数２】 [Equation 2]

【００３０】[0030]

【数３】射影変間は２次元変換の一般表現であり、ｐ＝ｑ＝０、
ｓ＝１としたものがアフィン変換と呼ばれる。アフィン
変換の式を以下に示す。(Equation 3) The projective transformation interval is a general expression of a two-dimensional transformation, and p = q = 0,
The one with s = 1 is called an affine transformation. The affine transformation formula is shown below.

【００３１】ｘ'＝ａｘ＋ｂｙ＋ｔ_x （６）ｙ'＝ｃｘ＋ｄｙ＋ｔ_y （７）これを行列表記に書き換えると以下のようになる。[0031] x '= ax + by + t x (6) y' = cx + dy + t y (7) is as follows rewrite this matrix notation.

【００３２】[0032]

【数４】ここで，ｔ_x，ｔ_y は，それぞれｘ方向とｙ方向の平行
移動量を表す。ａ＝ｄ＝１で、かつ、ｂ＝ｃ＝０の場
合、従来の平行移動のみによる動き補償となる。すなわ
ち(Equation 4) Here, t _x and t _y respectively represent parallel movement amounts in the x direction and the y direction. When a = d = 1 and b = c = 0, the conventional motion compensation is performed only by the parallel movement. Ie

【００３３】[0033]

【数5】アフィン変換パラメータａにより、以下の変換が実現で
きる。(Equation 5) The following conversion can be realized by the affine conversion parameter a.

【００３４】ａ＞１：ｘ軸方向の拡大ａ＝１：変化なし０＜ａ＜１：ｘ軸方向の縮小ａ＝０：ｙ軸に対する投影ー１＜ａ＜０：ｘ軸方向に縮小後、ｙ軸に関し反転ａ＝ー１：ｙ軸に関する反転ａ＜ー１：ｘに方向に拡大後、ｙ軸に関し反転同様に、アフィン変換パラメータｄによる変換はｙ軸方
向に関する拡大／縮小を表現し、、ｄが負ならｘ軸に関
する反転を伴う。A> 1: enlargement in the x-axis direction a = 1: no change 0 <a <1: reduction in the x-axis direction a = 0: projection on the y-axis-1 <a <0: after reduction in the x-axis direction , Inversion with respect to y-axis a = -1: inversion with respect to y-axis a <-1: after expansion in the direction to x, then inversion with respect to y-axis Similarly, the conversion by the affine transformation parameter d expresses expansion / contraction in the y-axis direction. ,, d is negative, with inversion about the x-axis.

【００３５】アフィン変換パラメータｂは、ｘ軸方向の
せん断を表現する。例えば、ａ＝ｄ＝１でかつｃ＝ｔ_x
＝ｔ_y＝０の場合、変換式は以下のようになる。The affine transformation parameter b represents shear in the x-axis direction. For example, a = d = 1 and c = t _x
= T _y = 0, the conversion formula is as follows.

【００３６】ｘ'＝ｘ＋ｂｙ（１０）ｙ'＝ｙ（１１）X ′ = x + by (10) y ′ = y (11)

【００３７】上式から分かるように、この変換によりｘ
軸上の点は動かず、ｘ軸上にない点は、ｂ×ｙだけ右側
にシフトする（図５（ａ））。この意味は正方形にこの
変換を適用した変換図形（図５（ｂ））が、せん断力を
正方形の弾性体に加えた場合と同じである。同様に、ア
フィン変換パラメータｃは、ｙ軸方向のせん断を表現す
る。ａ＝ｄ＝１でかつｂ＝０の場合、ｙ軸上の点は動か
ず、ｙ軸上にない点はｃ×ｘだけ上方にシフトする。さ
らに、ａ＝ｃｏｓθ，ｂ＝ｓｉｎθ，ｃ＝−ｓｉｎθ，
ｄ＝ｃｏｓθの場合は角度θの回転を表現できる。以上
のように、アフィン変換により拡大、縮小、反転、せん
断、回転等の様々な変換とこれらの組み合せを自由に表
現し得る。As can be seen from the above equation, this conversion results in x
Points on the axis do not move, and points on the x-axis shift to the right by b × y (FIG. 5 (a)). This meaning is the same as in the case where the transformation figure in which this transformation is applied to a square (FIG. 5B) is applied with a shearing force on a square elastic body. Similarly, the affine transformation parameter c represents shear in the y-axis direction. For a = d = 1 and b = 0, points on the y-axis do not move, points not on the y-axis shift up by c × x. Furthermore, a = cos θ, b = sin θ, c = −sin θ,
When d = cos θ, the rotation of the angle θ can be expressed. As described above, various transformations such as enlargement, reduction, inversion, shearing, rotation, and combinations thereof can be freely expressed by the affine transformation.

【００３８】（８）式によるアフィン変換を簡略化し、
せん断を省略した式を以下に示す。Simplifying the affine transformation by the equation (8),
The equation without shearing is shown below.

【００３９】[0039]

【数６】を用いることもできる（H. Jozawa "Segment-Based Vid
eo Coding Using An Affine Model", SPIE Symposium o
n visial Communications and Image Processing'94pp.
1605-1614, September 1994)。以上が本発明のオーバ
ーラップ動き補償に用いる動きモデルの例である。(Equation 6) Can also be used (H. Jozawa "Segment-Based Vid
eo Coding Using An Affine Model ", SPIE Symposium o
n visial Communications and Image Processing'94pp.
1605-1614, September 1994). The above is an example of the motion model used for the overlap motion compensation of the present invention.

【００４０】動き検出部３では、例えば特願平６ー１１
６２６０「動きパラメータ探索方法」で出願した方法に
より動きパラメータ４を求めることができる。特願平６
ー１１６２６０における動きパラメータ発生方法を以下
に示す。ここで、動きモデルは式（１２）の５パラメー
タのアフィン変換である。In the motion detector 3, for example, Japanese Patent Application No. 6-11
The motion parameter 4 can be obtained by the method applied in 6260 “Motion Parameter Search Method”. Japanese Patent Application No. 6
The method of generating motion parameters in -116260 will be described below. Here, the motion model is the five-parameter affine transformation of equation (12).

【００４１】図２に示すように、動きパラメータ検出部
３は、入力画像１と参照画像１０２は平行移動パラメー
タ探索部１０３に入力され、ブロックマッチング法によ
る平行移動パラメータ（ｔ_x、ｔ_y）探索の結果、誤差評
価値の小さいものから順に所定の個数の初期平行移動パ
ラメータ１０４が求められる。初期平行移動パラメータ
１０４は入力画像と参照画像１０２と共に平行移動、拡
大、縮小、回転パラメータ探索部１０５に入力される。
平行移動、拡大、縮小、回転パラメータ探索部１０５で
は、初期平行移動パラメータ１０４について、該初期値
を含む所定の微小範囲内で平行移動パラメータを変化さ
せると共に、拡大、縮小パラメータ（Ｃ _x、Ｃ_y）および
回転パラメータ（θ）を変化させ、その結果最も小さい
マッチング誤差評価値を与えるパラメータの組み合せを
最終的なアフィン変換パラメータ１０６として出力す
る。探索方法の一例を以下に示す。As shown in FIG. 2, the motion parameter detector
3 is a translation parameter for the input image 1 and the reference image 102.
Input to the data search unit 103, and the block matching method is used.
Translation parameter (t_x, T_y) Search results, error evaluation
Starting from the smallest value, a predetermined number of initial translation
The parameter 104 is required. Initial translation parameter
Reference numeral 104 denotes a translation and expansion together with the input image and the reference image 102.
It is input to the large, reduction, and rotation parameter search unit 105.
In parallel translation, enlargement, reduction, rotation parameter search unit 105
Is the initial value of the initial translation parameter 104.
The translation parameter is changed within a predetermined minute range including
And enlargement / reduction parameter (C _x, C_y)and
Rotation parameter (θ) is changed, resulting in the smallest
A combination of parameters that gives a matching error evaluation value
Output as the final affine transformation parameter 106
You. An example of the search method is shown below.

【００４２】まず、第一段（平行移動パラメータ探索部１０３）に
て、ｘおよびｙ方向に±１５画素を１画素刻みで探索
し、３１×３１＝９６１個の探索点の内、誤差評価の小
さい方から順に４点を初期平行移動パラメータ１０４と
する。次に、第二段（平行移動・拡大／縮小・回転パラ
メータ探索部１０５）により、前記４点の周囲±０．５
画素の範囲を０．５画素刻みで変化させると共に、拡大
／縮小パラメータについては０．８〜１．２の範囲を
０．１刻みで、回転パラメータについては０〜２πの範
囲をπ／４刻みで変化させ、全ての探索点（９×５×８
＝３６０点）の内誤差評価値の最も小さいものをアフィ
ン変換パラメータとして求める。６パラメータ（ａ，
ｂ，ｃ，ｄ，ｔ_x，ｔ_y）の場合も全く同様である。[0042] First, in the first stage (translation parameter search unit 103), ± 15 pixels are searched in 1-pixel increments in the x and y directions, and from 31 × 31 = 961 search points, the one with the smallest error evaluation is selected. The four points are set as the initial translation parameter 104 in order. Next, the second stage (parallel movement / enlargement / reduction / rotation parameter search unit 105) performs ± 0.5 around the four points.
The pixel range is changed in 0.5 pixel steps, and the enlargement / reduction parameter is in the range of 0.8 to 1.2 in 0.1 steps and the rotation parameter is in the range of 0 to 2π in π / 4 steps. Change all search points (9 × 5 × 8
= 360 points), the smallest error evaluation value is obtained as an affine transformation parameter. 6 parameters (a,
b, c, d, t _x , t _y ) is exactly the same.

【００４３】さて、動き検出部３で求められた小領域毎
の動きパラメータ４は、フレームメモリ５に蓄積された
局部復号画像６と共にオーバーラップ動き補償部７に入
力される。オーバーラップ動き補償部７では、各小領域
（以下マクロブロックと称す）毎にｘ、ｙ方向の平行移
動に加え、回転，拡大，縮小等の変形を伴った予測を行
い、動き補償予測画像９を生成する。オーバーラップ動
き補償部７における予測画像の生成例を図８に示す。以
下に、ｊ行ｉ列目のマクロブロックＢ_i,j内の座標
（ｘ、ｙ）に位置する画素を計算する手順を詳しく述べ
る。The motion parameter 4 for each small area obtained by the motion detector 3 is input to the overlap motion compensator 7 together with the locally decoded image 6 stored in the frame memory 5. The overlap motion compensation unit 7 performs prediction with deformation such as rotation, enlargement, and reduction in addition to parallel movement in the x and y directions for each small area (hereinafter referred to as macroblock), and a motion compensation predicted image 9 To generate. FIG. 8 shows an example of generation of a predicted image in the overlap motion compensation unit 7. The procedure for calculating the pixel located at the coordinate (x, y) in the macroblock B _{i, j} on the j-th row and the i-th column will be described in detail below.

【００４４】１．左上のマクロブロックＢ_i-1,j-1の動
きパラメータ（ａ_i-1,j-1，ｂ_i-1,j- ₁，ｃ_i-1,j-1，ｄ
_i-1,j-1，ｔｘ_i-1,j-1，ｔｙ_i-1,j-1）を用い現フレー
ムの画素Ｐ（ｘ、ｙ）に対応する１つ目の予測値Ｐ'
（ｘ'_i-1,j-1，ｙ'_i-1,j-1）を求める。1. The motion parameters (a _{i-1, j-1} , b _{i-1, j-} ₁ , c _{i-1, j-1} , d of the upper left macroblock B _{i-1, j-1} are
_{i-1, j-1} , tx _{i-1, j-1} , ty _{i-1, j-1} ), and the first predicted value P'corresponding to the pixel P (x, y) of the current frame
(X ' _{i-1, j-1} , y'i _{-1, j-1} ) is calculated.

【００４５】[0045]

【数７】図８はｘ、ｙ方向の拡大（ａ，ｄ＞１，ｂ＝ｃ＝０）の
例を示している。(Equation 7) FIG. 8 shows an example of enlargement in the x and y directions (a, d> 1, b = c = 0).

【００４６】２．直上のマクロブロックＢ_i,j-1の動き
パラメータ（ａ_i,j-1，ｂ_i,j-1，ｃ_i _,j-1，ｄ_i,j-1，ｔ
ｘ_i,j-1，ｔｙ_i,j-1を用い現フレームの画素Ｐ（ｘ、
ｙ）に対応する２つ目の予測値Ｐ’（ｘ'_i,j-1，ｙ'
_i,j-1）を求める。2. The motion parameters (a _{i, j-1} , b _{i, j-1} , c _i _{, j-1} , d _{i, j-1} , t of the macroblock B _{i, j-1} immediately above it.
x _{i, j−1} , ty _{i, j−1} are used for the pixel P (x,
second predicted value P '( _{x'i, j-1} , y'corresponding to y)
_{i, j-1} ).

【００４７】[0047]

【数８】図８は、ｘ、ｙ方向の縮小（０＜ａ，ｄ＜１，ｂ＝ｃ＝
０）の例を示す。(Equation 8) FIG. 8 shows reduction in the x and y directions (0 <a, d <1, b = c =
0) is shown.

【００４８】３．直左のマクロブロックＢ_i-1,jの動き
パラメータ（ａ_i-1,j，ｂ_i-1.j，ｃ_i _-1,j，ｄ_i-1,j，ｔ
ｘ_i-1.j，ｔｙ_i-1,jを用い、現フレームの画素Ｐ（ｘ、
ｙ）に対応する３つ目の予測値Ｐ’（ｘ'_i-1,j，ｙ'
_i-1,j）を求める。3. The motion parameters (a _{i-1, j} , b _i-1.j , c _i _{-1, j} , d _{i-1, j} , t of the macro blocks B _{i-1, j on} the immediate left are
x _i−1.j , ty _{i−1, j} , the pixel P (x,
The third predicted value P '(x' _{i-1, j} , y'corresponding to y)
_{i-1, j} ) is obtained.

【００４９】[0049]

【数９】図８は、ｘ、ｙ方向の縮小と回転を組み合せた例を示し
ている（ａ＝ｄ＝ｃｏｓθ，ｂ＝ｓｉｎθ，ｃ＝−ｓｉ
ｎθでかつ０＜ａ，ｄ＜１）。[Equation 9] FIG. 8 shows an example in which reduction and rotation in the x and y directions are combined (a = d = cos θ, b = sin θ, c = −si).
nθ and 0 <a, d <1).

【００５０】４．当該マクロブロックＢ_i,jの動きパラ
メータ（ａ_i,j，ｂ_i,j，ｃ_i,j,ｄ_i.j，ｔｘ_i,j，ｔｙ
_i,j）を用い、現フレームの画素Ｐ（ｘ、ｙ）に対応す
る４つ目の予測値Ｐ’（ｘ'_i,j，ｙ'_i,j）を求める。4. The motion parameters (a _{i, j} , b _{i, j} , c _{i, j} , d _ij , tx _{i, j} , ty) of the macro block B _{i, j.}
_{i, j} ) is used to obtain the fourth predicted value P ′ (x ′ _{i, j} , y ′ _{i, j} ) corresponding to the pixel P (x, y) of the current frame.

【００５１】[0051]

【数１０】図８は、回転の例を示している（ａ＝ｄ＝ｃｏｓθ，
ｂ＝ｓｉｎθ，ｃ＝−ｓｉｎθ）。(Equation 10) FIG. 8 shows an example of rotation (a = d = cos θ,
b = sin θ, c = −sin θ).

【００５２】また、アフィン変換により求められる
（ｘ，ｙ）の対応点（ｘ'，ｙ'）は必ずしも整数ではな
い。したがって、参照画像中で内挿計算を行う必要があ
る。この場合は、例えば図９に示すように、マッチング
先の座標の周囲４近傍の画素値Ｐ'（ｘ₁，ｙ₁）、Ｐ'、
（ｘ₁＋１，ｙ₁）、Ｐ'（ｘ₁，ｙ₁＋１）、Ｐ'（ｘ₁＋
１，ｙ₁＋１）から内挿計算を行う（ｘ₁＝ｘ’の整数部
分、ｙ₁＝ｙ’の整数部分）。ｗｘ₁、ｗｘ₂、ｗｙ₁、ｗ
ｙ₂は重み係数で、これらの重みを掛けた値の総和The corresponding point (x ', y') of (x, y) obtained by affine transformation is not necessarily an integer. Therefore, it is necessary to perform interpolation calculation in the reference image. In this case, for example, as shown in FIG. 9, pixel values P ′ (x ₁ , y ₁ ), P ′, and 4 ′ around the coordinates of the matching destination are shown.
(X ₁ +1, y ₁ ), P '(x ₁ , y ₁ +1), P' (x ₁ +
1, y ₁ +1) is interpolated (x ₁ = x ′ integer part, y ₁ = y ′ integer part). wx ₁ , wx ₂ , wy ₁ , w
y ₂ is a weighting coefficient, which is the sum of values multiplied by these weights.

【００５３】[0053]

【数１１】 [Equation 11]

【００５４】[0054]

【数１２】 (Equation 12)

【００５５】[0055]

【数１３】 (Equation 13)

【００５６】[0056]

【数１４】 [Equation 14]

【００５７】[0057]

【数１５】を予測値とする。(Equation 15) Is the predicted value.

【００５８】５．以上の４つの予測値それぞれに対して
重み係数Ｗ_i-1,j-1（ｘ，ｙ），Ｗ_i,j-1（ｘ，ｙ），Ｗ
_i-1,j（ｘ，ｙ），Ｗ_i,j（ｘ，ｙ）を掛け、これらの総
和を画素Ｐ（ｘ、ｙ）に対する予測値Ｐ”（ｘ、ｙ）と
する。すなわち、5. Weighting factors W _{i-1, j-1} (x, y), W _{i, j-1} (x, y), W for each of the above four predicted values
_{i-1, j} (x, y) and W _{i, j} (x, y) are multiplied, and the sum of these is taken as the predicted value P ″ (x, y) for the pixel P (x, y).

【００５９】[0059]

【数１６】となる。ただし、(Equation 16) Becomes However,

【００６０】[0060]

【数１７】である。４つの予測値に対する重み係数は、例えばＩＴ
Ｕ−ＴＨ．２６３では図７のように定義されている。
Ｈ．２６３の場合、当該ブロックの２行３列目の画素の
予測値を求める際には、図７（ａ）より、当該ブロック
Ｂ_i,jの動きベクトル（Ｖｘ_i,j，Ｖｙ_i,j）が指すＰ'
（ｘ＋Ｖｘ_i,j，ｙ＋Ｖｙ_i,j）に対する重み係数Ｗ_i,j
（ｘ，ｙ）は、５／８となる。また、図７（ｂ）より、
直上ブロックＢ_i,j-1の動きベクトル（Ｖｘ_i,j-1，Ｖｙ
_i,j-1）が指すＰ’（ｘ＋Ｖｘ_i,j _-1，ｙ＋Ｖｙ_i,j-1）
に対する重み係数Ｗ_i,j-1（ｘ，ｙ）は、２／８とな
る。同様に図７（ｃ）より、直左ブロックＢ
_i-1,j（ｘ，ｙ）は、１／８となる。また、Ｈ．２６３
では、当該ブロックの斜め方向に位置するマクロブロッ
クの動きベクトルを用いた予測は行わない。すなわち、Ｗ_i-1,j-1（ｘ，ｙ）＝０（２４）である。[Equation 17] It is. The weighting factors for the four predicted values are, for example, IT
U-TH. In H.263, it is defined as shown in FIG.
H. In the case of H.263, the motion vector (Vx _{i, j} , Vy _{i, j} ) of the block B _{i, j} is calculated from FIG. 7A when the prediction value of the pixel in the second row and third column of the block is calculated. Pointed to by P '
Weighting coefficient W _{i, j} for (x + Vx _{i, j} , y + Vy _{i, j} )
(X, y) becomes 5/8. Also, from FIG. 7 (b),
The motion vector (Vx _{i, j-1} , Vy of the immediately preceding block B _{i, j-1}
_{i, j-1)} pointed _{P '(x + Vx i,} j -1, y + Vy i, j-1)
The weighting coefficient W _{i, j-1} (x, y) with respect to is 2/8. Similarly, as shown in FIG.
_{i-1, j} (x, y) becomes 1/8. H. 263
Then, the prediction using the motion vector of the macroblock located in the diagonal direction of the block is not performed. That is, W _{i-1, j-1} (x, y) = 0 (24).

【００６１】以上は、（ｘ、ｙ）がＢ_i,j,の左上にある
場合の例である。例えば、（ｘ、ｙ）がＢ_i,jの右上に
ある場合には、直上のマクロブロックＢ_i,j-1，右上の
マクロブロックＢ_i+1,j-1，当該マクロブロックＢ_i,j、
直右マクロブロックＢ_i+1,jの動きベクトルを用いる。
同様に、（ｘ、ｙ）がＢ_i,jの左下マクロブロックＢ_i-1
_,j、当該マイクロブロックＢ_i,j、左下マイクロブロッ
クＢ_i-1,j+1、直下マクロブロックＢ_i,j+1を用い、
（ｘ、ｙ）がＢ_i,jの直下にある場合は当該マクロブロ
ックＢ_i,j、直右マクロブロックＢ_i+1,j、直下マクロブ
ロックＢ_i,j+1右下マクロブロックＢ_i+1,j+1を用いる。
以上が、アフィン変換を用いたオーバーラップ動き補償
における予測補償の計算方法である。さて、動き補償予
測画像９は符号化対象画像１と共に減算器１０に入力さ
れ、それらの差分データすなわち動き補償予測誤差１１
は、空間冗長度圧縮部１２において空間冗長どの抑圧が
行われる。一方、現在の符号化対象画像１の局部復号画
像１９を得るため、空間冗長度圧縮部１２より出力され
る圧縮差分データ１３は差分データ伸張部１６により伸
張差分画像１７に復号される。伸張差分画像１７は空間
冗長度を抑圧された動き補償予測誤差信号である。伸張
差分画像１７は加算器１８により動き補償予測画像９と
加算され、現在の符号化対象画像の局部復号画像１９と
なる。局部復号画像１９はフレームメモリ５に蓄積さ
れ、以降のフレームの符号化において参照される。The above is an example of the case where (x, y) is at the upper left of B _{i, j ,} . For example, when (x, y) is located at the upper right of B _{i, j} , the macro block B _{i, j−1} immediately above, the macro block B _{i + 1, j−1 at} the upper right, and the macro block B _{i, j} ,
The motion vector of the immediate right macroblock B _{i + 1, j} is used.
Similarly, (x, y) is the lower left macroblock B _{i−1 of} B _{i, j}
_{, j} , the relevant microblock B _{i, j} , the lower left microblock B _{i-1, j + 1} , and the immediate lower macroblock B _{i, j + 1} ,
(X, y) is B _i, the macroblock B _i if immediately below the _{_j, j,} straight right macroblock B _{i + 1, j,} directly under the macro block B _{i, j + 1} lower right macroblock B _{i Use + 1, j + 1} .
The above is the calculation method of the prediction compensation in the overlap motion compensation using the affine transformation. Now, the motion compensation prediction image 9 is input to the subtractor 10 together with the encoding target image 1, and their difference data, that is, the motion compensation prediction error 11 is input.
Is suppressed by the spatial redundancy compressing unit 12. On the other hand, in order to obtain the locally decoded image 19 of the current encoding target image 1, the compressed difference data 13 output from the spatial redundancy compression unit 12 is decoded by the difference data expansion unit 16 into the expanded difference image 17. The decompression difference image 17 is a motion compensation prediction error signal whose spatial redundancy is suppressed. The decompressed difference image 17 is added to the motion compensation predicted image 9 by the adder 18 and becomes the locally decoded image 19 of the current image to be encoded. The locally decoded image 19 is stored in the frame memory 5 and is referred to in the coding of subsequent frames.

【００６２】また、動き補償予測誤差１１に対する圧縮
差分データ１３は差分データ符号化部１４にてデータ圧
縮符号化され、差分画像符号化データとなる。小領域毎
の動きパラメータ４は動きパラメータ符号化部２０によ
りデータ圧縮符号化され、動きパラメータ符号化データ
２１となる。差分画像符号化でーた１５と動きパラメー
タ符号化データ２１は多重化部２２において多重化さ
れ、多重化データ２３として伝送または蓄積される。The compressed difference data 13 corresponding to the motion compensation prediction error 11 is data-compressed and encoded by the difference data encoding unit 14 to be difference image encoded data. The motion parameter 4 for each small region is data-compressed and coded by the motion parameter coding unit 20 and becomes motion parameter coded data 21. The difference image coding 15 and the motion parameter coded data 21 are multiplexed in the multiplexing unit 22 and transmitted or stored as multiplexed data 23.

【００６３】上述した本発明の動画像の動き補償符号化
方法のフローチャートを図１０にまとめた。すなわち、
符号化対象画像が入力されると（ステップ５１）、小領
域に分割し（ステップ５２）、小領域毎に参照画像と比
べてアフィン変換パラメータ１０６を動きパラメータ４
として作成し（ステップ５３）、動きパラメータ４か
ら、さらに参照図との比較において動き補償予測画像９
を生成し（ステップ５４）、入力時の符号化対象画像を
動き補償予測画像９から減算して動き補償予測誤差を算
出し（ステップ５５）、該誤差を圧縮して差分データを
作成する（ステップ５６）。一方においてこの差分デー
タをステップ５４による動き補償予測画像と加算して参
照画像を作成し（ステップ５７）、この参照画像が次の
処理の参照に使用される。FIG. 10 shows a flowchart of the above-described motion compensation coding method for moving images according to the present invention. That is,
When the image to be encoded is input (step 51), it is divided into small areas (step 52), and the affine transformation parameter 106 is set to the motion parameter 4 for each small area compared to the reference image.
(Step 53), the motion-compensated predicted image 9 is compared with the motion parameter 4 in comparison with the reference diagram.
Is generated (step 54), the image to be coded at the time of input is subtracted from the motion compensation prediction image 9 to calculate a motion compensation prediction error (step 55), and the error is compressed to create difference data (step). 56). On the other hand, this difference data is added to the motion-compensated predicted image in step 54 to create a reference image (step 57), and this reference image is used for reference in the next process.

【００６４】他方において、差分データを符号化し（ス
テップ５８）、動きパラメータ４と多重化して多重化デ
ータ２３として出力する（ステップ５９）。On the other hand, the difference data is encoded (step 58), multiplexed with the motion parameter 4 and output as multiplexed data 23 (step 59).

【００６５】[0065]

【発明の効果】以上説明したように本発明は、動画像の
動き補償に平行移動のみならず拡大、縮小、せん断、回
転などの各種の変形およびこれらの自由な組み合せの動
きを補償することができるので、動き補償予測の正確度
が向上しかつ予測の効果が大幅に改善されトータルの圧
縮性能が一層向上できる効果がある。As described above, the present invention is capable of compensating not only parallel movement but also various deformations such as enlargement, reduction, shearing, and rotation, and movements freely combining these for motion compensation of moving images. Therefore, the accuracy of the motion compensation prediction is improved, the prediction effect is significantly improved, and the total compression performance is further improved.

[Brief description of the drawings]

【図１】本発明の動画像の動き補償予測符号化方法が適
用された動き補償予測符号化装置の一実施例の構成ブロ
ック図である。FIG. 1 is a configuration block diagram of an embodiment of a motion compensation predictive coding apparatus to which a moving image motion compensation predictive coding method of the present invention is applied.

【図２】図１に示す動き検出部３の詳細構成ブロック図
である。FIG. 2 is a detailed configuration block diagram of a motion detection unit 3 shown in FIG.

【図３】従来の動画像の動き補償予測符号化装置の一例
のブロックずである。FIG. 3 is a block diagram of an example of a conventional motion compensation predictive coding apparatus for a moving image.

【図４】従来の平行移動補償によるオーバーラップ動き
補償予測符号化装置の構成を示すブロック図である。FIG. 4 is a block diagram showing a configuration of a conventional overlap motion compensation predictive coding apparatus by parallel movement compensation.

【図５】従来の平行移動補償によるオーバーラップ動き
補償予測方法の説明図で、（ａ）はフレームの画素Ｐ
（ｘ、ｙ）のｘ方向への平行移動を示し、（ｂ）は上辺
ＱＲのみのｘ方向への動きを示すせん断を示す図であ
る。FIG. 5 is an explanatory diagram of a conventional overlapping motion compensation prediction method by parallel movement compensation, in which (a) is a pixel P of a frame.
(X, y) is a diagram showing parallel movement in the x direction, and (b) is a diagram showing shearing showing the movement in the x direction of only the upper side QR.

【図６】アフィン変換で記述し得るせん断変換の一例を
示す図である。FIG. 6 is a diagram showing an example of shear transformation that can be described by affine transformation.

【図７】オーバーラップ動き補償における各ブロックの
重み係数の一例を示す図であり、（ａ）は当該ブロック
の、（ｂ）は直上ブロックの、（ｃ）は直左ブロックの
動きベクトルの指す画素の予測値に対する重み係数の例
であり、図中の重み係数は１／８の正規化係数が省略さ
れており、実際には各値に１／８を掛けた値が重み係数
である。FIG. 7 is a diagram showing an example of weighting factors of each block in overlap motion compensation, where (a) indicates the motion vector of the block, (b) indicates the immediately upper block, and (c) indicates the motion vector of the immediately left block. This is an example of the weighting coefficient for the predicted value of the pixel, and the normalization coefficient of 1/8 is omitted in the weighting coefficient in the figure, and the value obtained by multiplying each value by 1/8 is actually the weighting coefficient.

【図８】本発明の一実施例におけるアフィン変換を用い
たオーバーラップ動き補償方法を示す図である。FIG. 8 is a diagram showing an overlap motion compensation method using affine transformation according to an embodiment of the present invention.

【図９】アフィン変換によるマッチング化素の計算方法
の一例を示す図である。FIG. 9 is a diagram showing an example of a method of calculating a matching element by affine transformation.

【図１０】本発明の動画像の動き補償予測符号化方法の
フローチャートである。[Fig. 10] Fig. 10 is a flowchart of a motion compensation predictive coding method for a moving image according to the present invention.

[Explanation of symbols]

１符号化対象画像２動きモデル３動きパラメータ検出部４動きパラメータ５フレームメモリ６局部復号画像７オーバーラップ動き補償部８重み関数、９動き補償予測画像１０減算器１１動き補償予測誤差１２空間冗長度圧縮部１３圧縮差分データ１４差分データ符号化部１５差分画像符号化データ１６差分データ伸張部１７伸張差分画像１８加算器１９局部復号画像２０動きパラメータ符号化部２１動きパラメータ符号化データ２２多重化部２３多重化データ３１動きベクトル検出部３２動きベクトル３３オーバーラップ動き補償部３４動きベクトル符号化部３５動きベクトル符号化データ４１ブロック動き補償部４２ＤＣＴ／量子化部４３逆ＤＣＴ／量子化部１０１フレームメモリ１０２参照画像１０３平行移動パラメータ探索部１０４初期平行移動パラメータ１０５平行移動、拡大、縮小回転パラメータ探索部１０６アフィン変換パラメータ 1 image to be encoded 2 motion model 3 motion parameter detection unit 4 motion parameter 5 frame memory 6 locally decoded image 7 overlap motion compensation unit 8 weighting function, 9 motion compensation predicted image 10 subtractor 11 motion compensation prediction error 12 spatial redundancy Compressor 13 Compressed difference data 14 Difference data encoder 15 Difference image coded data 16 Difference data decompressor 17 Expanded difference image 18 Adder 19 Locally decoded image 20 Motion parameter encoder 21 Motion parameter encoded data 22 Multiplexer 23 multiplexed data 31 motion vector detection unit 32 motion vector 33 overlap motion compensation unit 34 motion vector coding unit 35 motion vector coded data 41 block motion compensation unit 42 DCT / quantization unit 43 inverse DCT / quantization unit 101 frame Memory 102 Image 103 translation parameters searching unit 104 initial translation parameters 105 parallel movement, enlargement, reduction rotation parameter searching unit 106 affine transformation parameters

Claims

[Claims]

1. An image to be encoded is divided into a plurality of small regions, a motion amount between an image to be encoded and a prediction reference image is obtained for each of the small regions, and a prediction value of each pixel in the small region is calculated. In obtaining, the grayscale value at the position pointed to by the motion vector of the small area and the grayscale value at the position pointed to by the motion vector of the adjacent small area are each multiplied by a weighting coefficient, and the sum of these values is used as the predicted value to obtain the predicted image. In the motion compensation predictive coding method for a moving image, wherein the motion of the small area is a reference direction (x) and a direction (x) that is perpendicular to the reference direction. translation to y), rotation, enlargement, reduction,
A motion-compensated predictive coding method for a moving image, characterized in that at least two of shear and deformation are described by respective motion parameters.

2. An image to be encoded is divided into a plurality of small regions, a motion amount between an image to be encoded and a prediction reference image is obtained for each of the small regions, and a prediction value of each pixel in the small region is calculated. In obtaining, the gray value at the position pointed to by the motion vector of the small area and the gray value at the position pointed to by the motion vector of the adjacent small area are each multiplied by a weighting coefficient, and the value obtained by adding these is used as the predicted value to predict. A motion-compensated predictive coding apparatus for generating a picture and coding a difference between the predicted picture and a coding target picture, in a direction (x) based on the motion of the small region and a direction perpendicular thereto. Translation to (y), rotation, enlargement, reduction,
A motion-compensated predictive coding apparatus for a moving picture, comprising means for describing at least two of the transformations by respective motion parameters.

3. A means for dividing an image to be encoded into predetermined small areas, a means for reproducing an image in a state immediately before each small area as a reference image, and a motion vector for each small area. Means for detecting by comparing with the reference image,
Means for decomposing the motion vector into components according to the state of motion, selecting a small area having the highest degree of matching in the reference image and setting the displacement for each small area, and a gray value of the position indicated by the motion vector of the small area. And a means for generating a compensation prediction value for each small area by multiplying each gray value of the position indicated by the motion vector of the adjacent small area by a weighting coefficient, and making the value obtained by adding these to the prediction value, Means for calculating a motion compensation prediction error from the difference between the compensation prediction value for each region and the image to be encoded, means for compressing the spatial redundancy of the motion compensation prediction error, and encoding the quantized compression difference. A motion compensation predictive coding device for a moving image, which has a means, a means for coding a motion vector for each of the small regions, and a means for multiplexing and outputting the coded difference data and the motion vector coded data. In the above, the means for generating the motion compensation prediction value for each small area generates the motion compensation prediction value by a motion parameter expressing at least two of translation, enlargement, reduction, inversion, shearing, rotation, and deformation. A motion-compensated predictive coding apparatus for a moving image.

4. The motion-compensated predictive coding apparatus for a moving picture according to claim 3, wherein the means for generating the motion-compensated predicted value is calculated by using an affine transformation parameter.