JPH09266575A

JPH09266575A - Method and device for encoding dynamic image

Info

Publication number: JPH09266575A
Application number: JP7355896A
Authority: JP
Inventors: Atsushi Sagata; 淳嵯峨田; Hirotaka Jiyosawa; 裕尚如沢; Kazuto Kamikura; 一人上倉; Yutaka Watanabe; 裕渡辺
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1996-03-28
Filing date: 1996-03-28
Publication date: 1997-10-07

Abstract

PROBLEM TO BE SOLVED: To decrease the arithmetic quantity of motion vector detection and to improve the efficiency and speed of motion vector detection by performing the detection of a motion vector based on area form information. SOLUTION: A small area dividing means 1 divides an encoding object image into encoding units, and a belonging area discriminating means 2 detects an area, to which the encoding unit as an object to find motion belongs, based on known area form information or area form information detected from an input signal. Besides, a small area dividing means 3 divides a reference image into the small areas of reference objects and a belonging area discriminating means 4 detects an area, to which the small area of the reference image as a reference object belongs, based on the area form information of the reference image and detects whether or not this detected area is equal with the area, to which the encoding unit as the object to find the motion belongs. A motion vector detecting means 5 detects the motion vector by collating the encoding unit of the encoding object image with the small area of reference of the reference image.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は，画像通信，画像記
録等に利用される画像信号のディジタル圧縮符号化方法
およびディジタル圧縮符号化装置に関し，詳しくは画像
信号を端末間で送受信する場合やファイルに蓄積する場
合などに用いられる動画像符号化方法および動画像符号
化装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a digital compression coding method and a digital compression coding apparatus for an image signal used for image communication, image recording, and the like, and more specifically, for transmitting and receiving an image signal between terminals and a file. The present invention relates to a moving picture coding method and a moving picture coding apparatus used for storing in a storage medium.

【０００２】[0002]

【従来の技術】従来，動画像のディジタル圧縮符号化に
おいて，動画像信号の時間冗長性を抑圧する手段とし
て，動き補償フレーム間予測がしばしば用いられてい
る。2. Description of the Related Art Conventionally, motion-compensated inter-frame prediction is often used as a means for suppressing temporal redundancy of a moving picture signal in digital compression coding of a moving picture.

【０００３】図７は，従来の動ベクトル検出法を説明す
る図であり，図８は，得られた動ベクトルに基づく動き
補償を説明する図である。従来のフレーム間予測では，
通常，図７に示すように，符号化対象画像７１を符号化
単位として１６画素×１６ライン等の矩形ブロックに区
切り，各ブロック毎に動きを求める対象となっている符
号化対象ブロック７２を中心とする探索範囲７３を参照
画像７４内に設定し，その範囲内で対象ブロックを平行
移動させながら，符号化対象ブロック７２と最も類似す
るブロックを，絶対値差分や差分自乗和等を評価関数と
して使用し，設定された探索範囲７３内を全探索するこ
とで探し出し，参照画像７４との相対位置を動ベクトル
（ｖ_x，ｖ_y）７５として検出する。FIG. 7 is a diagram for explaining a conventional motion vector detecting method, and FIG. 8 is a diagram for explaining motion compensation based on the obtained motion vector. In conventional inter-frame prediction,
Normally, as shown in FIG. 7, the encoding target image 71 is divided into rectangular blocks of 16 pixels × 16 lines as the encoding unit, and the encoding target block 72 for which the motion is to be obtained is centered for each block. The search range 73 is set in the reference image 74, and the block that is most similar to the encoding target block 72 is used as an evaluation function with the absolute value difference, the sum of squared differences, etc., while moving the target block in parallel within the range. It is used to search for the entire search range 73 that has been set, and the relative position with respect to the reference image 74 is detected as a motion vector (v _x , v _y ) 75.

【０００４】符号化対象画像７１の座標（ｘ，ｙ）上の
画像データと最もよくマッチングする画像は，参照画像
７４中の座標（ｘ_r，ｙ_r）上の画像データとして対応
づけられる。The image that best matches the image data on the coordinates (x, y) of the image to be encoded 71 is associated as the image data on the coordinates (x _r , y _r ) in the reference image 74.

【０００５】[0005]

【数１】 [Equation 1]

【０００６】動ベクトルに基づく動き補償フレーム間予
測では，図８に示すように，参照画像７４中の同じ位置
（ｘ，ｙ）のブロックではなく，得られた動ベクトル
（ｖ_x，ｖ_y）分平行移動した位置のブロックを予測画
像とし，この予測画像と符号化対象画像との差分（動き
補償予測誤差）信号を符号化する。これにより，動画像
のフレーム間相関は飛躍的に向上し，フレーム間予測効
率を大幅に向上することが可能となる。In the motion-compensated inter-frame prediction based on the motion vector, as shown in FIG. 8, instead of the block at the same position (x, y) in the reference image 74, the obtained motion vector (v _x , v _y ) A block at a position parallelly moved by a minute is used as a prediction image, and a difference (motion compensation prediction error) signal between this prediction image and the encoding target image is encoded. As a result, the inter-frame correlation of moving images is dramatically improved, and the inter-frame prediction efficiency can be significantly improved.

【０００７】図９は，従来の動ベクトル検出方式のブロ
ック図である。符号化対象画像９０１は，フレームメモ
リ９０２に蓄えられる。フレームメモリ９０２に蓄えら
れた現フレームの画像９０６が，（ｘ_cf，ｙ_cf）の２座
標成分からなるアドレス信号９０８（ブロックの最も左
上の画素に対応するアドレス）の値をアドレス値として
読み出され，小領域分割部９０７に入力される。小領域
分割部９０７は，現フレームの画像９０６からｍ×ｎの
ブロックデータを読み出し（通常，ｍ＝ｎ＝１６），符
号化対象ブロック信号９１２を出力する。FIG. 9 is a block diagram of a conventional motion vector detecting method. The encoding target image 901 is stored in the frame memory 902. The image 906 of the current frame stored in the frame memory 902 reads the value of the address signal 908 (the address corresponding to the upper leftmost pixel of the block) consisting of two coordinate components of (x _cf , y _cf ) as an address value. And input to the small area dividing unit 907. The small area dividing unit 907 reads m × n block data from the image 906 of the current frame (usually m = n = 16), and outputs the encoding target block signal 912.

【０００８】また，符号化対象画像９０１は１フレーム
遅延９０３の後に参照画像９０４となり，フレームメモ
リ９０５に蓄えられる。フレームメモリ９０５に蓄えら
れた参照フレームの画像９０９が，（ｘ_rf，ｙ_rf）の２
座標成分からなるアドレス信号９１１（ブロックの最も
左上の画素に対応するアドレス）の値をアドレス値とし
て読み出され，小領域分割部９１０に入力される。小領
域分割部９１０は参照フレームの画像９０９からｍ×ｎ
のブロックデータを読み出し（通常，ｍ＝ｎ＝１６），
参照フレームブロック信号９１３を出力する。The image 901 to be coded becomes a reference image 904 after one frame delay 903 and is stored in the frame memory 905. The reference frame image 909 stored in the frame memory 905 is (x _rf , y _rf ) 2
The value of the address signal 911 (address corresponding to the upper leftmost pixel of the block) composed of coordinate components is read as an address value and input to the small area dividing unit 910. The small area dividing unit 910 extracts m × n from the image 909 of the reference frame.
Block data is read (usually m = n = 16),
The reference frame block signal 913 is output.

【０００９】マッチング誤差計算部９１４では，二つの
小領域分割部９０７，９１０から出力される符号化対象
ブロック信号９１２と参照フレームブロック信号９１３
の対応する各画素値を演算し，それを誤差評価値９１５
として比較器９１６に出力する。In the matching error calculation unit 914, the encoding target block signal 912 and the reference frame block signal 913 output from the two small area dividing units 907 and 910 are output.
Each corresponding pixel value of is calculated, and the calculated error evaluation value 915
Is output to the comparator 916.

【００１０】比較器９１６は，最小値メモリ９１７を有
しており，入力された誤差評価値９１５が最小値メモリ
９１７に蓄えられている値より小さければ，最小値メモ
リ９１７の値を更新し，それぞれｘ，ｙの２座標成分か
らなる参照ブロックの位置を示すアドレス信号（ｘ_rf，
ｙ_rf）９１１から，符号化対象ブロックの位置を示すア
ドレス信号（ｘ_cf，ｙ_cf）９０８を減じた値である動ベ
クトル（ｖ_x，ｖ_y）９１８をベクトルメモリ９１９に
記憶する。The comparator 916 has a minimum value memory 917. If the input error evaluation value 915 is smaller than the value stored in the minimum value memory 917, the value in the minimum value memory 917 is updated, An address signal (x _rf , which indicates the position of a reference block composed of two coordinate components x and y, respectively)
The motion vector (v _x , v _y ) 918, which is a value obtained by subtracting the address signal (x _cf , y _cf ) 908 indicating the position of the target block from (y _rf ) 911, is stored in the vector memory 919.

【００１１】探索範囲に対する上記操作が終了した時，
ベクトルメモリ９１９には検出された動ベクトル９１８
が蓄えられている。When the above operation for the search range is completed,
The detected motion vector 918 is stored in the vector memory 919.
Is stored.

【００１２】[0012]

【発明が解決しようとする課題】上記の従来技術では，
動ベクトルを検出するにあたり，動きを求める対象とな
っている符号化単位（ブロック）を中心とする探索範囲
を，映像コンテンツおよびそれに伴う領域形状情報とは
無関係に参照画像内に作成し，その設定された範囲内の
すべてを探索するため，計算量が膨大になってしまう。SUMMARY OF THE INVENTION In the above prior art,
When detecting a motion vector, a search range centered on the coding unit (block) that is the target of motion is created in the reference image regardless of the video content and the area shape information accompanying it, and the setting is made. Since it searches all within the specified range, the amount of calculation becomes enormous.

【００１３】また，従来の動き検出方法では，映像信号
の領域形状を考慮せずに動き検出を行なっているため，
動き予測が外れて，動きを求める対象となっている符号
化単位が属する領域とは異なる領域から，誤って動ベク
トルが検出されてしまう可能性があり，隣接する符号化
単位の動ベクトルの値がバラバラになることが多い。こ
の場合，後に復号器側に各ブロックのベクトル値を伝送
するときに，隣接ベクトルの差分値が大きくなってしま
い，符号化効率が低下してしまう欠点があった。Further, in the conventional motion detecting method, the motion is detected without considering the area shape of the video signal.
There is a possibility that a motion vector may be wrongly detected and a motion vector may be erroneously detected from a region different from the region to which the coding unit for which motion is to be calculated belongs. Often fall apart. In this case, when the vector value of each block is transmitted to the decoder side later, the difference value between the adjacent vectors becomes large, and the coding efficiency is reduced.

【００１４】本発明の目的は，上記問題点を解決し，入
力された映像信号の領域形状情報を利用することで，動
ベクトル検出の精度を落とさずに，探索時の演算量を減
少させる動ベクトル検出方法を用いた動画像符号化方法
および動画像符号化装置を提供することにある。An object of the present invention is to solve the above problems and to use the area shape information of the input video signal to reduce the amount of calculation at the time of search without lowering the accuracy of motion vector detection. An object of the present invention is to provide a moving picture coding method and a moving picture coding apparatus using a vector detection method.

【００１５】[0015]

【課題を解決するための手段】上記目的達成のため，本
発明は，符号化対象画像を符号化単位に分割し，各符号
化単位ごとに符号化対象画像と参照画像との間の動き量
を検出し，その動き量から求めた動ベクトルを用いて動
き補償を行なって予測画像を作成し，該予測画像と符号
化対象画像との差分を符号化する動画像符号化方法にお
いて，前記符号化単位ごとの動き量を検出するにあた
り，入力された映像信号の領域形状情報が既知の場合に
はそれを用い，入力された映像信号の領域形状情報が既
知でない場合には入力映像信号の領域形状を検出し，該
領域形状に基づいて，動ベクトルを探索する範囲を，参
照画像内で，かつ，動きを求める対象となっている符号
化単位が属する領域と同じ領域の中のみとする。In order to achieve the above object, the present invention divides an image to be encoded into encoding units, and for each encoding unit, a motion amount between the image to be encoded and a reference image. Is detected, motion compensation is performed using the motion vector obtained from the motion amount, a predicted image is created, and the difference between the predicted image and the image to be coded is coded. When detecting the motion amount for each conversion unit, if the area shape information of the input video signal is known, it is used, and if the area shape information of the input video signal is not known, the area of the input video signal is used. The shape is detected and the motion vector is searched based on the area shape only within the reference image and within the same area as the area to which the coding unit whose motion is to be calculated belongs.

【００１６】図１は本発明の原理説明図である。本発明
では，符号化対象画像の既知の領域形状情報または入力
された映像信号から検出した符号化対象画像領域形状情
報と，それに対応する参照画像領域形状情報とを入力し
て，次のような手段を用いることにより，動き補償を行
うための動ベクトルを検出する。FIG. 1 is a diagram for explaining the principle of the present invention. In the present invention, the known region shape information of the encoding target image or the encoding target image region shape information detected from the input video signal and the corresponding reference image region shape information are input, and the following is performed. By using the means, a motion vector for motion compensation is detected.

【００１７】図１において，小領域分割手段１は，符号
化対象画像を符号化単位に分割する手段である。所属領
域判定手段２は，符号化対象画像の既知の領域形状情報
または入力された映像信号から検出した領域形状情報を
もとに，動きを求める対象となっている符号化単位が属
する領域を検出する手段である。In FIG. 1, the small area dividing means 1 is means for dividing an image to be encoded into encoding units. The belonging area determination means 2 detects the area to which the coding unit, which is the target of the motion, belongs, based on the known area shape information of the image to be coded or the area shape information detected from the input video signal. Is a means to do.

【００１８】また，小領域分割手段３は，参照画像を参
照対象の小領域に分割する手段である。所属領域判定手
段４は，参照画像の領域形状情報をもとに，参照対象と
なっている参照画像の小領域が属する領域を検出し，前
記動きを求める対象となっている符号化単位が属する領
域と同じ領域であるか否かを検出する手段である。The small area dividing means 3 is means for dividing the reference image into small areas to be referred to. The belonging area determination unit 4 detects the area to which the small area of the reference image that is the reference object belongs, based on the area shape information of the reference image, and the coding unit that is the target of the motion belongs. It is a means for detecting whether or not the area is the same as the area.

【００１９】動ベクトル検出手段５は，符号化対象画像
の符号化単位と参照画像の参照の小領域とを照合し，動
ベクトルを検出する手段であって，特に所属領域判定手
段４による判定結果によって，参照画像の小領域が属す
る領域が，前記動きを求める対象となっている符号化単
位が属する領域と同じ領域でない場合に，参照画像の他
の小領域について動ベクトルの探索範囲を変更する手段
を持つ。The motion vector detecting means 5 is a means for detecting the motion vector by collating the coding unit of the image to be coded with the reference small area of the reference image. When the area to which the small area of the reference image belongs is not the same as the area to which the coding unit for which the motion is to be calculated belongs, the search range of the motion vector is changed for other small areas of the reference image. Have means.

【００２０】これらによって，符号化単位ごとの動き量
を検出するにあたり，領域形状情報を用いることによ
り，動ベクトルを探索する範囲を，参照画像内で，かつ
動きを求める対象となっている符号化単位が属する領域
と同じ領域の中とする。With these, in detecting the motion amount for each coding unit, by using the area shape information, the range in which the motion vector is searched is coded within the reference image and for which the motion is to be obtained. It is in the same area as the area to which the unit belongs.

【００２１】すなわち，本発明では，入力された映像信
号の領域形状情報が既知の際にはそれを用い，領域形状
情報が既知でない場合には，例えばＫ平均クラスタリン
グ法や等濃度領域を抽出するなどして領域形状を検出す
る。そして，符号化対象画像を符号化単位に分割し，各
符号化単位ごとの動ベクトルを検出するにあたり，映像
信号の領域形状情報を基に，動ベクトル探索範囲を，参
照画像のうち動ベクトルを求める対象となっている符号
化単位が属する領域と同じ領域に制限する。That is, in the present invention, when the area shape information of the input video signal is known, it is used, and when the area shape information is not known, for example, the K-means clustering method or the equal density area is extracted. The area shape is detected by performing Then, the image to be coded is divided into coding units, and when detecting the motion vector for each coding unit, the motion vector search range is set based on the area shape information of the video signal, and the motion vector in the reference image is set to It is limited to the same area as the area to which the coding unit to be obtained belongs.

【００２２】例えば図２に示す動画像の場合について考
える。図２の例では簡単のため符号化単位は矩形ブロッ
クとしてある。今，図２の（ａ），（ｂ），（ｃ）は，
順に時系列に並べられている。（ｂ）は（ａ）を参照画
像として符号化され，（ｃ）は（ｂ）を参照画像として
予測符号化されるとする。また，図２の（ｄ），
（ｅ），（ｆ）は，それぞれ図２（ａ），（ｂ），
（ｃ）の領域形状情報である。領域形状情報はクロマキ
ーを用いて映像を作成することで容易に取得することが
できる。また，近年コンピュータグラフィックス（Ｃ
Ｇ）による映像が作成されることが多くなり，このよう
な場合にも領域形状情報は既知である。Consider, for example, the case of the moving image shown in FIG. In the example of FIG. 2, the coding unit is a rectangular block for simplicity. Now, (a), (b), (c) of FIG.
They are arranged in chronological order. It is assumed that (b) is encoded using (a) as a reference image and (c) is predictively encoded using (b) as a reference image. 2 (d),
(E) and (f) are shown in FIGS.
It is the area shape information of (c). The area shape information can be easily acquired by creating an image using a chroma key. In recent years, computer graphics (C
The image according to G) is often created, and the area shape information is known even in such a case.

【００２３】図２の例では，領域形状情報としては映像
の前面にいる人と，背景との２つ領域に分けられてい
る。今，図２の（ｄ），（ｅ），（ｆ）において，黒で
示される前景である人の部分を領域α，白で示される背
景の部分を領域βとする。In the example of FIG. 2, the area shape information is divided into two areas, a person in front of the image and a background. Now, in FIGS. 2D, 2E, and 2F, the foreground part of the person shown in black is a region α, and the background part shown in white is a region β.

【００２４】この場合，図２（ｂ）の人の領域内（領域
α）にある符号化単位，ブロックｂ１，ｂ２の動ベクト
ルを検出するときは，参照画像である図２（ａ）のう
ち，図２（ｄ）の領域α内に対応する部分のみを探索す
る。In this case, when detecting the motion vectors of the coding units, blocks b1 and b2, within the human region (region α) of FIG. 2B, among the reference images of FIG. , Only the portion corresponding to the area α in FIG. 2D is searched.

【００２５】同様に，図２（ｃ）の人の領域内（領域
α）にある符号化単位，ブロックｂ３の動ベクトルを検
出するときは，参照画像である図２（ｂ）のうち，図２
（ｅ）の領域α内に対応する部分のみを探索する。Similarly, when detecting the coding unit in the human area (area α) in FIG. 2C and the motion vector of the block b3, one of the reference images in FIG. Two
Only the portion corresponding to the area α in (e) is searched.

【００２６】また，図２（ｃ）の背景の領域内（領域
β）にある符号化単位，ブロックｂ４の動ベクトルを検
出するときは，参照画像である図２（ｂ）のうち，図２
（ｅ）の領域β内に対応する部分のみを探索する。Further, when detecting the coding unit in the background area (area β) of FIG. 2C and the motion vector of the block b4, among the reference images shown in FIG.
Only the portion corresponding to the area β in (e) is searched.

【００２７】本発明の作用を従来技術と対比して説明す
ると以下のとおりである。従来の動画像符号化方式で
は，領域形状に関係なく，動ベクトルを求める対象とな
っている符号化単位（ブロック）近傍を全探索している
ため，ＣＧによって作成された画像や，クロマキーなど
を用いて作成された映像や，それとＣＧとの合成映像な
ど，映像信号の領域形状情報が既知の場合には極めて動
ベクトルの探索効率が悪く，リアルタイム処理などが要
求される際に非常に大きな問題となっていた。The operation of the present invention will be described below in comparison with the prior art. In the conventional moving image coding method, since the entire search is performed in the vicinity of the coding unit (block) for which the motion vector is obtained regardless of the area shape, the image created by CG, the chroma key, etc. When the area shape information of the video signal is known, such as the video created by using it and the composite video of it and CG, the motion vector search efficiency is extremely poor, and a very big problem when real-time processing is required. It was.

【００２８】また，隣接するブロックの動ベクトルは，
本来なら近い値になるが，従来の動ベクトル検出方法で
は，全探索するため隣接ブロックでありながら相関の低
いベクトルが検出されてしまうことがある。これは，復
号器にブロック単位の動ベクトルを伝送する際に，隣接
ブロックの動ベクトルの差分を符号化・伝送する既存の
符号化方式においては，かえって情報量が増大してしま
うという問題があった。The motion vectors of adjacent blocks are
Originally, the values are close to each other, but in the conventional motion vector detection method, since a full search is performed, a vector with a low correlation may be detected even though it is an adjacent block. This is because in the case of transmitting the motion vector in block units to the decoder, the existing coding method that encodes / transmits the difference between the motion vectors of adjacent blocks has a problem that the amount of information is rather increased. It was

【００２９】本発明を用い，領域形状情報に基づいて動
ベクトルの検出を行うことにより，動ベクトル検出の演
算量を減少させることが可能となり，動ベクトルの検出
効率および速度が改善されるので，本発明はソフトウェ
アによる映像のリアルタイム符号化実現のための基盤技
術となり得る。また，ハードウェア化した際には計算量
の低下に伴い発熱量が低下するため，チップの設計が容
易になる。By using the present invention to detect the motion vector based on the area shape information, it is possible to reduce the calculation amount of the motion vector detection and improve the motion vector detection efficiency and speed. INDUSTRIAL APPLICABILITY The present invention can be a basic technology for realizing real-time encoding of video by software. Also, when hardware is used, the amount of heat generated decreases as the amount of calculation decreases, which facilitates chip design.

【００３０】さらに本発明を用いることで，動ベクトル
の検出精度が向上し，かつ隣接する符号化単位（ブロッ
ク）の動ベクトルが比較的近い値になるため，動ベクト
ルの符号化・伝送に必要な情報量が少なくて済み，無線
伝送路やモデムを介しての画像通信など低レートである
ことが要求される場合でも，効率的に映像を符号化する
ことが可能となる。Further, by using the present invention, the motion vector detection accuracy is improved, and the motion vectors of adjacent coding units (blocks) have relatively close values, which is necessary for the motion vector coding / transmission. It is possible to efficiently encode video even when a small amount of information is required and a low rate is required such as image communication via a wireless transmission line or a modem.

【００３１】[0031]

【発明の実施の形態】以下，図面を用いて，本発明の実
施の形態を，符号化単位として矩形ブロックを用いた例
で説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings using an example in which a rectangular block is used as an encoding unit.

【００３２】図３は，本発明の実施の一形態における動
画像符号化装置の動ベクトル検出器の構成例を示す図で
ある。動画像符号化装置の他の構成部分については，従
来用いられている技術と同様でよいので図示を省略す
る。FIG. 3 is a diagram showing a configuration example of a motion vector detector of the moving picture coding apparatus according to the embodiment of the present invention. The other components of the moving picture coding device may be similar to the conventionally used technique, and therefore their illustration is omitted.

【００３３】符号化対象画像１０１は，フレームメモリ
１０２に蓄えられる。フレームメモリ１０２に蓄えられ
た現フレームの画像１０６が，（ｘ_cf，ｙ_cf）の２座標
成分からなるアドレス信号１０７（ブロックの最も左上
の画素に対応するアドレス）の値をアドレス値として読
み出され，小領域分割部１０８に入力される。The image 101 to be coded is stored in the frame memory 102. The image 106 of the current frame stored in the frame memory 102 reads the value of an address signal 107 (address corresponding to the upper leftmost pixel of the block) consisting of two coordinate components (x _cf , y _cf ) as an address value. And input to the small area dividing unit 108.

【００３４】小領域分割部１０８は，現フレームの画像
１０６からｍ×ｎのブロックデータを読み出し（通常，
ｍ＝ｎ＝１６），符号化対象ブロック信号１０９を出力
する。（ｘ_cf，ｙ_cf）の２座標成分からなるアドレス信
号１０７は，同時に，符号化対象画像領域形状情報１１
０と共に，所属領域判定部１１１に入力される。The small area dividing unit 108 reads m × n block data from the image 106 of the current frame (normally,
m = n = 16), and outputs the encoding target block signal 109. The address signal 107 composed of the two coordinate components (x _cf , y _cf ) simultaneously generates the encoding target image area shape information 11
It is input together with 0 to the belonging area determination unit 111.

【００３５】所属領域判定部１１１では，現在符号化し
ようとしている対象のブロックが，符号化対象画像領域
形状情報１１０が示す領域のうち，どの領域に含まれて
いるかを判定し，符号化対象ブロック所属領域フラグ１
１２を出力する。The belonging area determination unit 111 determines which of the areas indicated by the encoding target image area shape information 110 the target block currently to be encoded is included in the encoding target block. Belonging area flag 1
12 is output.

【００３６】また，符号化対象画像１０１は１フレーム
の遅延１０３の後に参照画像１０４となり，フレームメ
モリ１０５に蓄えられる。フレームメモリ１０５に蓄え
られた参照フレームの画像１１３が，（ｘ_rf，ｙ_rf）の
２座標成分からなるアドレス信号１１４（ブロックの最
も左上の画素に対応するアドレス）の値をアドレス値と
して読み出され，小領域分割部１１５に入力される。The image 101 to be coded becomes the reference image 104 after the delay 103 of one frame and is stored in the frame memory 105. The image 113 of the reference frame stored in the frame memory 105 reads the value of the address signal 114 (the address corresponding to the upper leftmost pixel of the block) consisting of two coordinate components (x _rf , y _rf ) as the address value. And input to the small area dividing unit 115.

【００３７】小領域分割部１１５は，参照フレームの画
像１１３からｍ×ｎのブロックデータを読み出し（通
常，ｍ＝ｎ＝１６），参照フレームブロック信号１１６
を出力する。（ｘ_rf，ｙ_rf）の２座標成分からなるアド
レス信号１１４は，同時に，参照画像領域形状情報１１
７，符号化対象ブロック所属領域フラグ１１２と共に，
所属領域判定部１１８に入力される。The small area dividing unit 115 reads m × n block data from the image 113 of the reference frame (usually m = n = 16), and the reference frame block signal 116.
Is output. The address signal 114 composed of two coordinate components of (x _rf , y _rf ) simultaneously generates the reference image area shape information 11
7, together with the encoding target block belonging area flag 112,
It is input to the belonging area determination unit 118.

【００３８】所属領域判定部１１８では，符号化対象ブ
ロック信号１０９を符号化する際に参照するブロック
（参照ブロック）が参照画像領域形状情報１１７が示す
領域のうち，どの領域に含まれているかを判定すると共
に，その領域と符号化対象ブロック所属領域フラグ１１
２が示す領域が同一のものであるかを判定し，符号化対
象ブロックと参照ブロックとが同一の領域に属している
かを示すフラグである同一領域所属フラグ１１９をマッ
チング誤差計算部１２０に出力する。The belonging area determination unit 118 determines which of the areas indicated by the reference image area shape information 117 the block (reference block) to be referred to when the encoding target block signal 109 is encoded is included. Along with the determination, the area and the encoding target block belonging area flag 11
It is determined whether the areas indicated by 2 are the same, and the same area belonging flag 119, which is a flag indicating whether the encoding target block and the reference block belong to the same area, is output to the matching error calculation unit 120. .

【００３９】例えば図４に示す動画像の場合では，今，
図４（ｂ）の符号化対象であるブロックｂ５を符号化す
るとき，図４（ｄ）によると，ブロックｂ５は領域αに
属しているので，符号化対象ブロック所属領域フラグ１
１２は“α”を示すフラグとなる。For example, in the case of the moving image shown in FIG.
When encoding the block b5 to be encoded in FIG. 4B, according to FIG. 4D, since the block b5 belongs to the area α, the encoding target block belonging area flag 1
12 is a flag indicating "α".

【００４０】このとき，図４（ａ）のブロックｂ６を参
照ブロックとして探索しているときには，図４（ｃ）に
よると，ブロックｂ６は領域αに属しているので，同一
領域所属フラグ１１９は“同一の領域”を示すフラグと
なる。また，図４（ａ）のブロックｂ７を参照ブロック
として探索しているときには，図４（ｃ）によると，ブ
ロックｂ７は領域βに属しているので，同一領域所属フ
ラグ１１９は“異なる領域”を示すフラグとなる。At this time, when the block b6 of FIG. 4 (a) is searched as a reference block, according to FIG. 4 (c), the block b6 belongs to the area α, so that the same area belonging flag 119 is set to " It becomes a flag indicating "the same area". Further, when the block b7 of FIG. 4A is searched as a reference block, according to FIG. 4C, since the block b7 belongs to the region β, the same region belonging flag 119 indicates “different region”. It becomes the flag shown.

【００４１】マッチング誤差計算部１２０は，同一領域
所属フラグ１１９の値をチェックする部分と，演算器か
ら構成される。マッチング誤差計算部１２０では，まず
始めに同一領域所属フラグ１１９の値を判定する。The matching error calculation unit 120 is composed of a unit for checking the value of the same region belonging flag 119 and an arithmetic unit. The matching error calculation unit 120 first determines the value of the same area belonging flag 119.

【００４２】（ｉ）フラグが“同一の領域”の場合もし，同一領域所属フラグ１１９が“同一の領域”を示
すフラグであった場合には，二つの小領域分割部１０
８，１１５から出力される符号化対象ブロック信号１０
９と参照フレームブロック信号１１６の対応する各画素
値の差分自乗値を演算し，それを加算したものを誤差評
価値１２１として比較器１２２に出力する。(I) When the flag is "same area" If the same area belonging flag 119 is a flag indicating "same area", the two small area dividing units 10
Encoding target block signal 10 output from 8 and 115
9 and the difference square value of each corresponding pixel value of the reference frame block signal 116 are calculated, and the sum of them is output to the comparator 122 as an error evaluation value 121.

【００４３】（ii）フラグが“異なる領域”の場合一方，同一領域所属フラグ１１９が“異なる領域”を示
すフラグであった場合には，この時点でこの参照フレー
ムブロックに関する処理を中止し，処理中止フラグ１２
６を小領域分割部１１５に伝送し，次の参照フレームブ
ロックの処理に移行する。この処理により，動ベクトル
検出の精度を落とすことなく演算量を大きく低減するこ
とが可能となる。(Ii) When the flag is "different area" On the other hand, when the same area belonging flag 119 is a flag indicating "different area", the processing relating to this reference frame block is stopped at this point and the processing Cancel flag 12
6 is transmitted to the small area dividing unit 115, and the process proceeds to the processing of the next reference frame block. By this processing, it is possible to greatly reduce the amount of calculation without lowering the accuracy of motion vector detection.

【００４４】また，マッチング誤差計算部１２０では，
１つの符号化対象ブロック信号１０９に対応して，探索
範囲内の複数の参照フレームブロック信号１１６と前記
フラグ判定および計算を行う。なお差分値演算は差分自
乗和に限定されることはなく，差分絶対値和などを用い
ることも可能である。Further, in the matching error calculation unit 120,
Corresponding to one block signal to be encoded 109, a plurality of reference frame block signals 116 within the search range and the flag determination and calculation are performed. The difference value calculation is not limited to the sum of squared differences, and it is also possible to use the sum of absolute differences.

【００４５】比較器１２２は，最小値メモリ１２３を有
しており，入力された誤差評価値１２１が最小値メモリ
１２３に蓄えられている値より小さければ，最小値メモ
リ１２３の値を更新し，それぞれｘ，ｙの２座標成分か
らなる参照ブロックの位置を示すアドレス信号（ｘ_rf，
ｙ_rf）１１４から，符号化対象ブロックの位置を示すア
ドレス信号（ｘ_cf，ｙ_cf）１０７を減じた値である動ベ
クトル（ｖ_x，ｖ_y）１２４をベクトルメモリ１２５に
記憶する。The comparator 122 has a minimum value memory 123. If the input error evaluation value 121 is smaller than the value stored in the minimum value memory 123, the value in the minimum value memory 123 is updated, An address signal (x _rf , which indicates the position of a reference block composed of two coordinate components x and y, respectively)
The motion vector (v _x , v _y ) 124, which is a value obtained by subtracting the address signal (x _cf , y _cf ) 107 indicating the position of the target block from (y _rf ) 114, is stored in the vector memory 125.

【００４６】[0046]

【数２】 [Equation 2]

【００４７】探索範囲に対する上記操作が終了したと
き，ベクトルメモリ１２５には検出された動ベクトル１
２４が蓄えられている。また，従来の固定サイズのブロ
ックを用いる動画像符号化方式においては，一つのブロ
ックに対して一つの動ベクトルのみを用いて動き補償を
行なっているため，一方の物体の動きにしか追従でき
ず，動き補償がうまく作用しない部分が存在し，予測誤
差が増加し符号化効率が低下することがすでに指摘され
ている。When the above operation for the search range is completed, the detected motion vector 1 is stored in the vector memory 125.
24 are stored. In addition, in the conventional moving image coding method using fixed-size blocks, motion compensation is performed using only one motion vector for one block, so only the motion of one object can be followed. It has already been pointed out that there is a portion where motion compensation does not work well, the prediction error increases and the coding efficiency decreases.

【００４８】これを避けるため，図５のように符号化対
象であるブロックｂ８に領域形状の輪郭線を含む場合，
領域形状に沿ってブロックを複数個の小領域に分割し，
各小領域を新たな符号化単位とし，各小領域毎に動ベク
トルを求めて符号化する方法がすでに提案されている。In order to avoid this, when the block b8 to be encoded includes the contour line of the area shape as shown in FIG.
Divide the block into multiple small areas according to the area shape,
A method has already been proposed in which each small area is used as a new coding unit and a motion vector is calculated for each small area and then coded.

【００４９】また，図６のように，該ブロックｂ８を小
ブロック（符号化単位）に再分割し，同様に各小ブロッ
ク毎に動ベクトルを求めて符号化する手法も提案されて
いる。Also, as shown in FIG. 6, a method has been proposed in which the block b8 is subdivided into small blocks (encoding units), and similarly, a motion vector is obtained for each small block and encoded.

【００５０】このような場合でも，ブロックを分割する
ことで生成された各符号化単位ごとの動ベクトルを検出
するにあたり，本発明を用いて，映像信号の領域形状情
報をもとに，動ベクトル探索範囲を，参照画像のうち，
動ベクトルを求める対象となっている符号化単位が属す
る領域と同じ領域に制限することで対応でき，ブロック
内に領域形状の境界を含まない場合と同様に計算量を大
きく改善することが可能となる。Even in such a case, in detecting the motion vector for each coding unit generated by dividing the block, the present invention is used to detect the motion vector based on the area shape information of the video signal. The search range is
It can be dealt with by restricting to the same area as the area to which the coding unit for which the motion vector is obtained belongs, and it is possible to greatly improve the calculation amount as in the case where the boundary of the area shape is not included in the block. Become.

【００５１】具体的には，図５（ｄ）に示すように領域
形状を含む符号化対象ブロックが符号化対象小領域
α₁，符号化対象小領域β₁に再分割された場合，符号
化対象小領域α₁は領域αに属しているので，参照画像
である図５（ａ）のうち，図５（ｃ）の領域α内に対応
する部分のみを探索する。Specifically, as shown in FIG. 5D, when the coding target block including the region shape is subdivided into the coding target small area α ₁ and the coding target small area β ₁ , Since the target small area α ₁ belongs to the area α, only the portion corresponding to the area α of FIG. 5C in the reference image of FIG. 5A is searched.

【００５２】同様に，符号化対象小領域β₁は領域βに
属しているので，参照画像である図５（ａ）のうち，図
５（ｃ）の領域β内に対応する部分のみを探索する。ま
た，この実施例に示した方式は，入力画像の領域形状情
報が既知な場合であるが，クロマキーなどを用いずに撮
影された映像の場合のように，入力画像信号の領域形状
情報が既知でない場合には，符号化対象画像領域形状情
報１１０，参照画像領域形状情報１１７はＫ平均クラス
タリングなどを用いて前もって検出しておく必要があ
る。Ｋ平均クラスタリングなどの領域分割の手法につい
ては，「画像解析ハンドブック」（高木幹雄，下田陽久
監修．東京大学出版会，1991年１月）に詳しい。Similarly, since the encoding target small area β ₁ belongs to the area β, only the portion corresponding to the area β of FIG. 5C in the reference image of FIG. 5A is searched. To do. Further, in the method shown in this embodiment, the area shape information of the input image is known, but the area shape information of the input image signal is known as in the case of a video image captured without using a chroma key or the like. If not, the encoding target image area shape information 110 and the reference image area shape information 117 need to be detected in advance using K-means clustering or the like. For details on region segmentation techniques such as K-means clustering, see "Image Analysis Handbook" (supervised by Mikio Takagi and Yohisa Shimoda. The University of Tokyo Press, January 1991).

【００５３】[0053]

【発明の効果】以上説明したように，本発明によれば，
領域形状情報に基づいて動ベクトルの検出を行うことに
より，動ベクトル検出の演算量を減少させることが可能
となり，動ベクトルの検出効率および速度が改善し，ソ
フトウェアによる映像のリアルタイム符号化実現のため
の基盤技術となる。As described above, according to the present invention,
By detecting the motion vector based on the area shape information, it is possible to reduce the calculation amount of the motion vector detection, improve the motion vector detection efficiency and speed, and realize the real-time coding of video by software. It becomes the basic technology of.

【００５４】さらに本発明を用いることで，動ベクトル
の検出精度が向上し，かつ隣接する符号化単位（ブロッ
ク）の動ベクトルが比較的近い値になるため，低レート
であることが要求される場合でも効率的に映像を符号化
することが可能となる。Further, by using the present invention, the detection accuracy of the motion vector is improved, and the motion vectors of adjacent coding units (blocks) have relatively close values, so that the low rate is required. Even in this case, the video can be efficiently coded.

[Brief description of drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】本発明の実施の形態における領域形状情報を示
す図である。FIG. 2 is a diagram showing area shape information according to the embodiment of the present invention.

【図３】本発明の実施の一形態における動画像符号化装
置の動ベクトル検出器の構成例を示す図である。[Fig. 3] Fig. 3 is a diagram showing a configuration example of a motion vector detector of the moving picture coding device according to the embodiment of the present invention.

【図４】符号化対象ブロックと参照ブロックが同じ領域
に含まれているかどうかの説明図である。FIG. 4 is an explanatory diagram of whether a coding target block and a reference block are included in the same area.

【図５】符号化対象ブロック内に領域形状の境界が存在
した場合の対処法の例を示す図である。[Fig. 5] Fig. 5 is a diagram illustrating an example of a coping method when a boundary of a region shape exists in a block to be encoded.

【図６】符号化対象ブロック内に領域形状の境界が存在
した場合の小ブロック分割による対処法を示す図であ
る。[Fig. 6] Fig. 6 is a diagram illustrating a coping method by small block division when a boundary of a region shape exists in a coding target block.

【図７】従来の動ベクトル検出法を説明する図である。FIG. 7 is a diagram illustrating a conventional motion vector detection method.

【図８】得られた動ベクトルに基づく動き補償を説明す
る図である。FIG. 8 is a diagram illustrating motion compensation based on the obtained motion vector.

【図９】従来の動ベクトル検出方式のブロック図であ
る。FIG. 9 is a block diagram of a conventional motion vector detection method.

[Explanation of symbols]

１小領域分割手段２所属領域判定手段３小領域分割手段４所属領域判定手段５動ベクトル検出手段１０１符号化対象画像１０２フレームメモリ１０３１フレーム遅延１０４参照画像１０５フレームメモリ１０６現フレームの画像１０７アドレス信号１０８小領域分割部１０９符号化対象ブロック信号１１０符号化対象画像領域形状情報１１１所属領域判定部１１２符号化対象ブロック所属領域フラグ１１３参照フレームの画像１１４アドレス信号１１５小領域分割部１１６参照フレームブロック信号１１７参照画像領域形状情報１１８所属領域判定部１１９同一領域所属フラグ１２０マッチング誤差計算部１２１誤差評価値１２２比較器１２３最小値メモリ１２４動ベクトル１２５ベクトルメモリ１２６処理中止フラグ 1 small area dividing means 2 belonging area judging means 3 small area dividing means 4 belonging area judging means 5 motion vector detecting means 101 image to be encoded 102 frame memory 103 1 frame delay 104 reference image 105 frame memory 106 current frame image 107 address Signal 108 Small area dividing unit 109 Encoding target block signal 110 Encoding target image area shape information 111 Belonging region determination unit 112 Encoding target block Belonging region flag 113 Reference frame image 114 Address signal 115 Small region dividing unit 116 Reference frame block Signal 117 Reference image region shape information 118 Belonging region determination unit 119 Same region belonging flag 120 Matching error calculation unit 121 Error evaluation value 122 Comparator 123 Minimum value memory 124 Motion vector 125 Vector memory 26 treatment discontinuation flag

───────────────────────────────────────────────────── フロントページの続き (72)発明者渡辺裕東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内 ────────────────────────────────────────────────── ─── Continuing on the front page (72) Inventor Hiroshi Watanabe Nippon Telegraph and Telephone Corporation, 3-19-2 Nishishinjuku, Shinjuku-ku, Tokyo

Claims

[Claims]

1. An image to be encoded is divided into encoding units,
The amount of motion between the target image and the reference image is detected for each coding unit, and the motion vector obtained from the amount of motion is used to perform motion compensation to create a predicted image. In the moving picture coding method for coding the difference from the picture to be coded, when detecting the motion amount for each coding unit, if the area shape information of the input video signal is known, it is used, If the area shape information of the input video signal is not known, the area shape of the input video signal is detected, and based on the area shape, the range in which the motion vector is searched is found in the reference image and the motion is obtained. A moving picture coding method, characterized in that it is only in the same area as the area to which the target coding unit belongs.

2. A prediction image is created by detecting a motion amount between an encoding target image and a reference image and performing motion compensation using a motion vector obtained from the motion amount. In a moving picture coding apparatus including means for coding a difference from an encoding target image, means for dividing the encoding target image into encoding units, and known region shape information of the encoding target image or an input video Based on the area shape information detected from the signal, means for detecting the area to which the coding unit that is the target of the motion belongs, means for dividing the reference image into small areas, and area shape information for the reference image Based on
Means for detecting an area to which a small area of a reference image as a reference object belongs, and detecting whether or not the area is the same as an area to which the coding unit as a target of the motion belongs, If the area to which the small area belongs to is not the same as the area to which the coding unit for which the motion is to be calculated belongs, a means for changing the search range of the motion vector with respect to other small areas of the reference image. , In detecting the motion amount for each coding unit, the range for searching the motion vector based on the area shape information is set in the reference image,
A moving picture coding apparatus, characterized in that it is set in the same area as an area to which a coding unit for which motion is to be obtained belongs.