JPH0662388A

JPH0662388A - Dynamic image compressor

Info

Publication number: JPH0662388A
Application number: JP16417893A
Authority: JP
Inventors: Hiroyasu Ide; 博康井手
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 1992-06-08
Filing date: 1993-06-07
Publication date: 1994-03-04
Anticipated expiration: 2019-11-17
Also published as: JP3590976B2

Abstract

PURPOSE:To obtain a high predicting efficiency for any picture, and to improve a picture quality and a compressibility. CONSTITUTION:A dynamic image compressor is equipped with a predicted structure deciding device 2 which judges which correlation is stronger; a field correction or time correlation for the picture based on picture data imparted to decide a predicted structure, and decides the predicted structure by selecting the predicted structure from among a predicted picture group stored in a predicted structure group memory(PM) 3, predicted structure group memory(PM) 3 which stores plural predicted structure groups set based on the field correlation and the time correlation, and compressor 4 which compresses picture data according to the predicted structure selected by the predicted structure deciding device 2. Then, the predicted structure in which an error can be reduced is selected by predicting a P4 picture predicted close in time from an 12 picture and a P3 picture whose field is the same on the same condition, and the picture data are compressed according to the predicted structure.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、動画像圧縮処理等に用
いられる動画像圧縮装置に係り、詳細には、時間軸方向
の予測を伴う動画像圧縮装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a moving picture compression apparatus used for moving picture compression processing and the like, and more particularly to a moving picture compression apparatus with prediction in the time axis direction.

【０００２】[0002]

【従来の技術】画像圧縮の国際標準としてＪＰＥＧ（Jo
int Photograghic Expert Group）やＭＰＥＧ（Moving
Picture Expert Group）がある。2. Description of the Related Art As an international standard for image compression, JPEG (Jo
int Photograghic Expert Group) and MPEG (Moving
There is a Picture Expert Group).

【０００３】ＭＰＥＧは、ＭＰＥＧI，ＭＰＥＧII，Ｍ
ＰＥＧIIIの３レベルの規格案が検討されている。ＭＰ
ＥＧIでは、１．５Ｍｂｐｓの通信回線で伝送できる動
画像圧縮を目的としており、おもにテレビ電話やテレビ
会議などで使用することが考えられている。ＭＰＥＧI
では、現行のＮＴＳＣ方式のビデオ画像を３２０×２４
０ピクセルの解像度として扱い、１フレームを構成する
２フィールドのうち１フィールドのみのデータを用い
る。ＭＰＥＧIIでは、１０Ｍｂｐｓの通信回線で伝送で
きる圧縮が目標で、ＩＳＤＮなどによる動画像伝送やデ
ィジタル・ビデオがターゲットとされている。そして、
ＭＰＥＧIIIは、ハイビジョンなどによる次世代テレビ
が対象となっている。MPEG is MPEGI, MPEGII, M
A three-level draft of PEGIII is under consideration. MP
EGI aims to compress a moving image that can be transmitted through a communication line of 1.5 Mbps, and is considered to be used mainly in videophones and videoconferences. MPEG I
Then, the current NTSC format video image is 320 x 24
It is treated as a resolution of 0 pixel, and data of only one field is used out of two fields constituting one frame. In MPEGII, compression is a goal that can be transmitted through a 10 Mbps communication line, and moving image transmission by ISDN and digital video are targets. And
MPEGIII is targeted for next-generation televisions such as HDTV.

【０００４】ＭＰＥＧの特徴は、ＤＣＴ（Discrete Cos
ine Transform：離散コサイン変換）による静止画像圧
縮に加えて、時間軸方向の圧縮のためのフレーム間予測
処理を行なうことであるが、動画像圧縮の前提条件とし
てフレームのランダム・アクセスができること、早送り
による再生や巻戻し再生（逆方向）ができることがあげ
られている。従って、ＭＰＥＧにおけるフレーム間予測
は、前向きと後向きの両方向を採用している。ＭＰＥＧ
にあっても、基本的にはＭＣ（動き補償）＋ＤＣＴを用
いる。動き補償を行なうブロックサイズは１６×１６
（但し８×８のモードもある）、ＤＣＴは８×８ブロッ
クに対して行なう。また、この動き補償は１／２画素精
度で行なう。１／２画素精度の動き補償は、予測に用い
る参照フレーム上において画素単位でずらした位置を調
べるのみならず、画素と画素の間の位置を補間によって
生成し、マッチングをとることによって行なう。The feature of MPEG is that DCT (Discrete Cos
ine Transform: Discrete cosine transform) is used to perform inter-frame prediction processing for compression in the time axis direction in addition to still image compression. Random access of frames and fast-forwarding are prerequisites for moving image compression. It has been mentioned that playback and rewind playback (reverse direction) can be performed. Therefore, inter-frame prediction in MPEG employs both forward and backward directions. MPEG
However, basically, MC (motion compensation) + DCT is used. The block size for motion compensation is 16 × 16.
(However, there is also an 8 × 8 mode), DCT is performed on 8 × 8 blocks. Also, this motion compensation is performed with 1/2 pixel precision. Motion compensation with 1/2 pixel accuracy is performed not only by checking the position shifted in pixel units on the reference frame used for prediction, but also by generating the position between pixels by interpolation and performing matching.

【０００５】通常の動き補償＋ＤＣＴとの最も大きな違
いは、周期的なフレーム内符号化フレームを基本とした
動き補償予測である。図８及び図９はＭＰＥＧ動画符号
化時の画面の予測構造を示す図であり、図中の四角形は
動画のフレームを意味する１枚１枚の画像（ピクチャ）
を示し、フレームから伸びる矢印は、根元のフレームが
予測に用いられることを示す。The biggest difference from the normal motion compensation + DCT is the motion compensation prediction based on the periodic intra-frame coded frame. 8 and 9 are diagrams showing a prediction structure of a screen at the time of MPEG moving picture coding, and a quadrangle in the drawing means an image (picture) one by one which means a frame of a moving picture.
The arrow extending from the frame indicates that the root frame is used for prediction.

【０００６】上記ピクチャは、符号化される方式に従っ
て以下のタイプに分類される。The above pictures are classified into the following types according to the encoding method.

【０００７】Ｉピクチャ（Intra-coded picture：イ
ントラ符号化画像）符号化されるときその画像１枚の中だけで閉じた情報の
みを使う。換言すれば、復号化するときＩピクチャ自身
の情報のみで画像が再構成できる。実際には、差分をと
らずそのままＤＣＴして符号化する。この符号化方式
は、一般に効率が悪いが、これを随所に入れてＩピクチ
ャだけを復号すればランダムアクセスや高速再生が可能
となる。さらに、Ｉピクチャを復号してメモリに蓄え、
逆方向に読み出すことを繰り返せば逆転再生をも可能と
なる。I-picture (Intra-coded picture) When encoded, only the information closed in one image is used. In other words, when decoding, an image can be reconstructed using only the information of the I picture itself. Actually, the DCT is encoded as it is without taking the difference. This encoding method is generally inefficient, but random access and high-speed reproduction are possible by inserting this encoding everywhere and decoding only the I picture. Furthermore, I picture is decoded and stored in memory,
Reverse reading can be performed by repeating reading in the reverse direction.

【０００８】Ｐピクチャ（Predictive-coded pictur
e：前方予測符号化画像）Ｐピクチャは、予測画像（差分をとる基準となる画像）
として、入力で時間的に前に位置し既に復号化されたＩ
ピクチャまたはＰピクチャを使う。すなわち、図９に示
すように過去から現在の一方向に予測されるフレームで
ある。実際には動き補償された予測画像との差を符号化
するか差分をとらずに符号化する（イントラ符号化）か
効率のよい方をマクロブロック単位で選択できる。P picture (Predictive-coded pictur
e: Forward predictive encoded image) P picture is a predictive image (image that serves as a reference for taking a difference)
As I, which was previously decoded in time at the input
Use picture or P-picture. That is, as shown in FIG. 9, it is a frame predicted in one direction from the past. In actuality, it is possible to select, in macroblock units, whichever is more efficient, whether the difference from the motion-compensated predicted image is coded or the difference is not taken (intra-coding).

【０００９】Ｂピクチャ（Bidirectionally predicti
ve-coded picture：両方向予測符号化画像）Ｂピクチャは、予測画像として時間的に前に位置し既に
復号化されＩピクチャまたはＰピクチャ、時間的に後ろ
に位置するすでに復号化されたＩピクチャまたはＰピク
チャ、およびその両方から作られた補間画像の３種類を
使う。ここで、補間フレームの場合は両方向から予測を
行なうが、動き補償の予測モードは大きく分類して３つ
ある。過去から現在を予測する順方向動き補償、未来か
ら現在を予測する逆方向動き補償、過去と未来の両方か
ら現在を予測する補間動き補償である。上記順方向動き
補償と逆方向動き補償とは、一つの参照フレームから読
み出したブロックとマッチングをとるという点で、通常
の動き補償（ＭＣ）と同じ処理である。また、上記補間
動き補償は、２つの参照フレームから読み出したブロッ
クを、現在のフレームと参照フレームとの時間距離を考
慮した重みづけをして合成し、予測信号を得るものであ
る。B-picture (Bidirectionally predicti
ve-coded picture: bidirectional predictive-coded image) A B-picture is a predicted image, which is a temporally preceding and already decoded I picture or P picture, and a temporally posterior already decoded I picture, or Three types of P-pictures and interpolated images made from both are used. Here, in the case of an interpolation frame, prediction is performed from both directions, but there are roughly three prediction modes for motion compensation. These are forward motion compensation that predicts the present from the past, backward motion compensation that predicts the present from the future, and interpolation motion compensation that predicts the present from both the past and the future. The forward motion compensation and the backward motion compensation are the same processes as the normal motion compensation (MC) in that they match a block read from one reference frame. In the interpolation motion compensation, the blocks read from the two reference frames are weighted and combined in consideration of the time distance between the current frame and the reference frame to obtain a prediction signal.

【００１０】上記３種類の動き補償後の差分の符号化と
イントラ符号化の中で一番効率のよいものをマクロブロ
ック単位に選択できる。The most efficient one of the above three types of difference encoding and intra-encoding after motion compensation can be selected in macroblock units.

【００１１】一方、上記Ｉピクチャ、Ｐピクチャ及びＢ
ピクチャを含んで構成されるＧＯＰ（グループ・オブ・
ピクチャ）は、１または複数枚のＩピクチャと０または
複数枚の非Ｉピクチャから構成される。Ｂピクチャを符
号化または復号化するには、その予測画像となる時間的
には後方にあるＩピクチャまたはＰピクチャが先に符号
化されていなくてはならないため、ＧＯＰを構成するに
はＩピクチャ、Ｐピクチャ、Ｂピクチャは所定の順序が
必要であるが、Ｉピクチャの間隔、及びＰピクチャの間
隔は自由でＧＯＰの内部でも変わってもよい。On the other hand, the I picture, P picture and B picture
GOP (Group of
A picture) is composed of one or more I-pictures and zero or more non-I-pictures. In order to encode or decode a B picture, an I picture or P picture that is a temporally backward image that becomes a predicted image of the B picture must be encoded first. , P pictures, and B pictures need to be in a predetermined order, but the intervals between I pictures and the intervals between P pictures are arbitrary and may change within the GOP.

【００１２】[0012]

【発明が解決しようとする課題】上述したように、従来
の動画像における時間方向予測には図６に示す直前の画
像からの片側予測や図９に示す過去及び未来の画像から
の双方向予測等がある。As described above, in the conventional temporal direction prediction for a moving image, one-sided prediction from the immediately preceding image shown in FIG. 6 and bidirectional prediction from past and future images shown in FIG. Etc.

【００１３】一般に、蓄積メディア（ＭＰＥＧ）系にお
ける動画像圧縮は、上記図７の双方向予測が有効である
と考えられており、ＭＰＥＧによる両方向予測画像（Ｂ
ピクチャ）の予測元となる画像は、その画像単体で再生
可能な（予測を伴わない）画像（Ｉピクチャ）若しくは
Ｂピクチャ以外からの片側予測を元にする画像（Ｐピク
チャ）である。ところが、フル動画像データは、前述し
たフィールド構造を持っているため、圧縮対象の画像が
予測元と同一フィールドではなく、時間の経過と共に垂
直方向の変化がない場合には予測効率がかなり悪化して
しまうという欠点がある。すなわち、例えばテレビ画像
では１／６０秒毎に１フィールドを表示し、２フィール
ド（奇数フィールドと偶数フィールド）で１フレームと
なっており、空間軸方向に１ドットのずれが存在する。
従って、隣り合った画像同士は縦軸が合わず、予測を行
なうおうとしたとき一番近い画像が必ずしも一番似てい
るとは限らないため、時間経過と共に垂直方向の変化が
ない画像では予測効率が極端に悪化してしまう場合があ
る。It is generally considered that the bidirectional prediction shown in FIG. 7 is effective for moving picture compression in a storage medium (MPEG) system.
An image serving as a prediction source of a picture is an image (I picture) that can be reproduced by the image alone (I picture) or an image (P picture) based on one-sided prediction other than a B picture. However, since full moving image data has the field structure described above, if the image to be compressed is not in the same field as the prediction source and there is no vertical change over time, the prediction efficiency will deteriorate considerably. There is a drawback that it will end up. That is, for example, in a television image, one field is displayed every 1/60 seconds, and two fields (odd field and even field) form one frame, and there is a shift of one dot in the spatial axis direction.
Therefore, the ordinates of adjacent images do not match, and the closest image is not always the most similar when trying to make a prediction. May become extremely worse.

【００１４】ところで、前述したＰピクチャにおいて
は、予測元画像からの時間経過量の点から予測効率はそ
れほど高くない場合が多いと考えられる。仮に、Ｐピク
チャが予測元と違うフィールドであった場合には画像に
よっては著しく予測効率が下がり、他の画像にまで影響
を及ぼしかねないことになる。しかし、Ｐピクチャが予
測元と同一フィールドであった場合には、ある１つのＩ
ピクチャを予測元とした全てのＰピクチャの系列が同一
フィールドとなり、それらのＩピクチャ及びＰピクチャ
を予測元とする両方向予測画面（Ｂピクチャ）におい
て、その両予測元とも違うフィールドとなる場合が発生
し、画像によっては予測効率が減少してしまうという欠
点があった。By the way, in the P picture described above, it is considered that the prediction efficiency is not so high in many cases in terms of the amount of time elapsed from the prediction source image. If the P picture is in a field different from that of the prediction source, the prediction efficiency will be significantly reduced depending on the image, which may affect other images. However, if the P picture is in the same field as the prediction source, one I
A series of all P pictures with a picture as a prediction source may be the same field, and in a bidirectional prediction screen (B picture) with those I pictures and P pictures as prediction sources, a field different from both the prediction sources may occur. However, there is a drawback that the prediction efficiency is reduced depending on the image.

【００１５】このように従来のＩピクチャ、Ｐピクチャ
等に基づく予測構造は設計者が時間相関を基に一義的に
決定していたため、圧縮対象の画像が予測元と同一フィ
ールドではなく時間の経過と共に垂直方向の変化が無い
場合には予測効率がかなり悪化してしまうことがある反
面、逆に、Ｐピクチャが予測元と同一フィールドである
場合であっても、上述した理由により画像によっては予
測効率の低下を招いてしまうことがある。As described above, since the conventional predictive structure based on I-pictures, P-pictures, etc. is uniquely determined by the designer based on the time correlation, the image to be compressed is not in the same field as the predictor and the time elapses. In addition, when there is no change in the vertical direction, the prediction efficiency may be considerably deteriorated. On the contrary, even when the P picture is in the same field as the prediction source, the prediction may not be possible depending on the image for the above reason. This may lead to a decrease in efficiency.

【００１６】そこで本発明は、どのような画像において
も高い予測効率を得ることができる動画像圧縮装置を提
供することを目的とする。Therefore, an object of the present invention is to provide a moving picture compression apparatus capable of obtaining high prediction efficiency for any picture.

【００１７】[0017]

【課題を解決するための手段】請求項１記載の発明は、
上記目的達成のため、複数の予測画像から構成される予
測構造を複数記憶する予測構造記憶手段と、符号化すべ
き画像データに基づいて前記予測構造記憶手段に記憶さ
れた予測構造を選択する予測構造選択手段と、前記予測
構造選択手段により選択された予測構造に従って画像デ
ータを圧縮するデータ圧縮手段とを備えている。The invention according to claim 1 is
To achieve the above object, a prediction structure storage unit that stores a plurality of prediction structures configured from a plurality of prediction images, and a prediction structure that selects a prediction structure stored in the prediction structure storage unit based on image data to be encoded The selection means and the data compression means for compressing the image data according to the prediction structure selected by the prediction structure selection means are provided.

【００１８】前記予測構造記憶手段に記憶される複数の
予測構造は、例えば請求項２に記載されているように、
フィールド相関が高い場合に使用される予測構造と、時
間相関が高い場合に使用される予測構造とを含んで構成
されるものであってもよく、また、前記予測構造は、例
えば請求項３に記載されているように、独立して符号化
可能な符号化画像と、過去から現在を予測する片側予測
画像と、過去及び未来の両方向から現在を予測する両方
向予測画像とを含む複数の画像により構成され、上記各
画像を予測元の画像として所定の順序で用いることによ
り予測を行うようにした構造であってもよい。また、前
記予測構造は、例えば請求項４に記載されているように
フレームを奇数フィールドと偶数フィールドの２つに分
割した一方のフィールドにおいて、独立して符号化可能
な第１の符号化画像と、前記他方のフィールドにおい
て、独立して符号化可能な第２の符号化画像とを含んで
構成されるものであってもよく、さらに前記予測構造
は、例えば請求項６に記載されているように、時間軸方
向の圧縮のためのフレーム間予測処理を行なうものであ
ってもよい。また、前記両方向予測画像は、例えば請求
項５に記載されているように、前記符号化画像又は前記
片側予測画像と前記符号化画像又は前記片側予測画像と
の間に位置する隣り合う２つの画像で構成するようにし
てもよい。The plurality of prediction structures stored in the prediction structure storage means are, for example, as described in claim 2.
It may be configured to include a prediction structure used when the field correlation is high and a prediction structure used when the time correlation is high, and the prediction structure is defined in, for example, claim 3. As described, a plurality of images including a coded image that can be independently coded, a one-sided prediction image that predicts the present from the past, and a bidirectional prediction image that predicts the present from both past and future directions. The structure may be configured such that prediction is performed by using each of the images as a prediction source image in a predetermined order. In addition, the prediction structure has a first coded image that can be coded independently in one field obtained by dividing a frame into two fields, an odd field and an even field, as described in claim 4. , The other field may be configured to include a second coded image that can be coded independently, and the prediction structure may be, for example, as described in claim 6. In addition, inter-frame prediction processing for compression in the time axis direction may be performed. Further, the bidirectional predicted image is, for example, as described in claim 5, two adjacent images located between the coded image or the one-sided predicted image and the coded image or the one-sided predicted image. You may make it comprise.

【００１９】また、前記予測構造選択手段は、例えば請
求項７に記載されているように、符号化しようとする画
像データがフィールド相関と時間相関とで何れの相関性
が強いかを判別し、該判別結果に基づいて前記予測構造
記憶手段に記憶された複数の予測構造のうち、相関性の
強い予測構造を選択するようにしてもよく、また、前記
予測構造選択手段は、例えば請求項８に記載されている
ように、前記符号化画像を予測元の画像として用いて予
測したとときに時間的に近い前記片側予測画像を予測し
たときの予測誤差と、前記符号化画像を予測の元の画像
として用いて予測したときにフィールドが同じ前記片側
予測画像を予測したときの予測誤差とを比較することに
より相関性を判別するものであってもよい。Further, the predictive structure selecting means determines, for example, which of image correlation to be coded is stronger, field correlation and time correlation, as described in claim 7, A prediction structure having a strong correlation may be selected from a plurality of prediction structures stored in the prediction structure storage means based on the determination result, and the prediction structure selection means may be, for example, As described in, the prediction error when predicting the one-sided prediction image that is temporally close when the coded image is used as a prediction source image, and the coded image is the source of the prediction. It is also possible to determine the correlation by comparing with the prediction error when the one-sided prediction image having the same field when used as the image of 1.

【００２０】[0020]

【作用】本発明の手段の作用は次の通りである。The operation of the means of the present invention is as follows.

【００２１】請求項１、２、３、４、５、６、７及び８
記載の発明では、予測構造記憶手段にはフィールド相関
が高い場合に使用される予測構造と、時間相関が高い場
合に使用される予測構造とが記憶されているものとす
る。Claims 1, 2, 3, 4, 5, 6, 7 and 8
In the described invention, it is assumed that the prediction structure storage means stores a prediction structure used when the field correlation is high and a prediction structure used when the time correlation is high.

【００２２】この状態において、符号化すべき画像デー
タが予測構造選択手段に入力されると、予測構造選択手
段によってこの画像データがフィールド相関と時間相関
とで何れの相関性が強いかが判別され、その判別結果に
基づいて予測構造記憶手段に記憶された複数の予測構造
のうち、相関性の強い予測構造が選択される。そして、
予測構造選択手段により選択された予測構造に従って画
像データが圧縮される。In this state, when the image data to be encoded is input to the prediction structure selecting means, the prediction structure selecting means determines which of the field correlation and the time correlation the image data has, and A prediction structure having a strong correlation is selected from the plurality of prediction structures stored in the prediction structure storage means based on the determination result. And
The image data is compressed according to the prediction structure selected by the prediction structure selecting means.

【００２３】従って、どのような画像においても高い予
測効率を得ることができる。Therefore, high prediction efficiency can be obtained in any image.

【００２４】[0024]

【実施例】以下、本発明を図面に基づいて説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be described below with reference to the drawings.

【００２５】原理説明本発明では、圧縮対象の画像が予測元と同一フィールド
ではなく時間の経過と共に垂直方向の変化が無い場合
と、予測元と同一フィールドであった場合の２つの状態
があることに着目し、圧縮しようとする画像を２つ用意
し、画像情報によりフィールド相関と時間相関のうち相
関性の高い方を利用した予測構造を選択するようにして
予測構造を変化させることで前述のような欠点を補い、
どのような画像においても高い予測効率を得るようにす
るものである。Description of Principle According to the present invention, there are two states, that is, the image to be compressed is not in the same field as the prediction source, and there is no change in the vertical direction with the passage of time, and if it is in the same field as the prediction source. Focusing on, the two images to be compressed are prepared, and by changing the prediction structure by selecting the prediction structure that uses the higher correlation between the field correlation and the time correlation according to the image information, To make up for such drawbacks,
This is to obtain high prediction efficiency in any image.

【００２６】図１は本発明に係る動画像圧縮装置の機能
ブロック図であり、図中、白抜き矢印はデータの流れを
示し、矢印は制御の流れを示す。FIG. 1 is a functional block diagram of a moving picture compression apparatus according to the present invention. In the figure, a white arrow indicates a data flow and an arrow indicates a control flow.

【００２７】図１において、動画像圧縮装置は、データ
圧縮すべき原画像データを記憶するフレームメモリ（Ｆ
Ｍ）１と、予測構造を決定するためにフレームメモリ
（ＦＭ）１から読出された画像データに基づいて画像が
フィールド相関（図５参照）と時間相関（図６参照）の
うち何れの相関性が強いかを判断し、予測構造群メモリ
（ＰＭ）３に記憶された予測画像群から予測構造を選ん
で予測構造を決定する予測構造決定装置２と、フィール
ド相関と時間相関に基づいて設定される複数の予測構造
群を記憶する予測構造群メモリ（ＰＭ）３と、予測構造
決定装置２により選ばれた予測構造に従ってフレームメ
モリ（ＦＭ）１から読出した画像データを圧縮する圧縮
器４と、上記各部の動作を制御する制御装置５とにより
構成されている。In FIG. 1, the moving image compression apparatus is a frame memory (F) for storing original image data to be data-compressed.
M) 1 and which of the field correlation (see FIG. 5) and the time correlation (see FIG. 6) the image has based on the image data read from the frame memory (FM) 1 to determine the prediction structure. Is set, based on the field correlation and the time correlation, and the prediction structure determination device 2 that determines the prediction structure by selecting the prediction structure from the prediction image group stored in the prediction structure group memory (PM) 3. A prediction structure group memory (PM) 3 that stores a plurality of prediction structure groups, and a compressor 4 that compresses the image data read from the frame memory (FM) 1 according to the prediction structure selected by the prediction structure determination device 2. The control device 5 controls the operation of each of the above parts.

【００２８】以上の構成において、まず、フレームメモ
リ（ＦＭ）１から読出した画像データを予測構造決定装
置２に出力する。予測構造決定装置２は、予測構造を決
定するために与えられた画像データに基づいて画像フィ
ールド相関と時間相関のうち何れの相関性が強いかを判
別して予測構造群メモリ（ＰＭ）３から予測構造を選ん
で予測構造を決定する。予測構造の決定結果は、制御装
置５に送られ、制御装置５はどの位置からどのように予
測するかという情報を圧縮器４に与える。圧縮器４は決
定された相関に対応する予測構造に従ってフレームメモ
リ（ＦＭ）１から読み出した画像データを圧縮し圧縮画
像データとして出力する。上記手順が後述する図４に示
すような画像グループ毎に繰り返される。In the above structure, first, the image data read from the frame memory (FM) 1 is output to the prediction structure determination device 2. The prediction structure determination device 2 determines which one of the image field correlation and the time correlation is stronger based on the image data given for determining the prediction structure, and determines from the prediction structure group memory (PM) 3 Select the predictive structure to determine the predictive structure. The result of determination of the prediction structure is sent to the control device 5, and the control device 5 gives the compressor 4 information from which position and how to predict. The compressor 4 compresses the image data read from the frame memory (FM) 1 according to the prediction structure corresponding to the determined correlation, and outputs it as compressed image data. The above procedure is repeated for each image group as shown in FIG. 4 described later.

【００２９】このように、上記原理説明に基づく動画像
圧縮装置は、予測構造を決定するために与えられた画像
データに基づいて画像がフィールド相関と時間相関のう
ち何れの相関性が強いかを判断して予測構造群メモリ
（ＰＭ）３に記憶された予測画像群から予測構造を選ん
で予測構造を決定する予測構造決定装置２と、フィール
ド相関と時間相関に基づいて設定される複数の予測構造
群を記憶する予測構造群メモリ（ＰＭ）３と、予測構造
決定装置２により選ばれた予測構造に従って画像データ
を圧縮する圧縮器４を設け、Ｉピクチャから予測される
時間的に近いＰピクチャとフィールドが同じＰピクチャ
をそれぞれ同一条件下で予測して誤差の少ない予測構造
を選択し、その予測構造で画像データを圧縮しているの
で、どのような画像においても高い予測効率を得ること
ができる。また、予測効率を上げることができるので同
一圧縮率の場合には画質の向上を図ることができ、同一
画質の場合には圧縮率を向上させることができる。As described above, the moving picture compression apparatus based on the above explanation of the principle determines which of the field correlation and the time correlation the image has, based on the image data given for determining the prediction structure. Prediction structure determination device 2 for determining and selecting a prediction structure from a prediction image group stored in the prediction structure group memory (PM) 3, and a plurality of predictions set based on field correlation and time correlation A predictive structure group memory (PM) 3 for storing a structure group and a compressor 4 for compressing image data according to the predictive structure selected by the predictive structure determining device 2 are provided, and a temporally predicted P picture is predicted from an I picture. P-pictures with the same field are predicted under the same conditions, a prediction structure with a small error is selected, and the image data is compressed with this prediction structure. Oite also it is possible to obtain high prediction efficiency. Further, since the prediction efficiency can be increased, the image quality can be improved when the compression rate is the same, and the compression rate can be improved when the image quality is the same.

【００３０】実施例図２〜図７は本発明に係る動画像圧縮装置の一実施例を
示す図である。Embodiment FIG. 2 to FIG. 7 are views showing an embodiment of a moving picture compression apparatus according to the present invention.

【００３１】先ず、構成を説明する。図２は動画像圧縮
装置のブロック図であり、この図において、動画像圧縮
装置の符号化器は、画像モード、予測モード、動きベク
トル及び各種制御信号を出力して、システム全体の制御
を行なうコントローラ３０と、データ圧縮すべき画像デ
ータを記憶する画像メモリ３１と、画像メモリ３１から
読み出した画像データに動き補償フレーム間予測処理に
よる予測結果を減算する減算器３２と、減算器３２によ
り減算された画像データをコントローラ３０に出力する
とともに、該画像データに対しＤＣＴ演算を行なうＤＣ
Ｔ演算部３３と、コントローラ３０で決定された量子化
幅に従ってＤＣＴ演算の出力データを一定の誤差の範囲
内で量子化する量子化部３４と、量子化部３４により量
子化された画像データに対し画像データのほか各種ブロ
ック属性信号を可変長符号化した後、定められたデータ
構造の符号列に多重化するＶＬＣ（Ｖａｒｉａｂｌｅ
ＬｅｎｇｔｈＣｏｄｅ：可変長符号化）３５と、変動
する情報発生を一定レートに平滑化するバッファ３６
と、周期的なフレーム内符号化フレームを基本とした動
き補償予測を行なう動き補償フレーム間予測部３７と、
により構成されている。First, the structure will be described. FIG. 2 is a block diagram of a moving image compression apparatus. In this figure, an encoder of the moving image compression apparatus outputs an image mode, a prediction mode, a motion vector and various control signals to control the entire system. The controller 30, an image memory 31 for storing image data to be data-compressed, a subtracter 32 for subtracting the prediction result of the motion-compensated inter-frame prediction process from the image data read from the image memory 31, and a subtractor 32 for subtraction. DC which performs DCT operation on the image data while outputting the image data to the controller 30.
The T calculation unit 33, the quantization unit 34 that quantizes the output data of the DCT calculation within a certain error range according to the quantization width determined by the controller 30, and the image data quantized by the quantization unit 34. On the other hand, in addition to image data, various block attribute signals are variable-length coded, and then VLC (Variable) which is multiplexed into a code string of a predetermined data structure.
Length Code (variable length coding) 35 and a buffer 36 for smoothing fluctuating information generation to a constant rate.
And a motion-compensated inter-frame prediction unit 37 that performs motion-compensated prediction based on periodic intra-frame coded frames,
It is composed by.

【００３２】上記動き補償フレーム間予測部３７は、量
子化部３４により量子化された画像データを逆量子化す
る逆量子化部３８と、逆量子化部３８により量子化前の
画像データに戻されたデータに対し逆ＤＣＴ（ＩＤＣ
Ｔ）演算を施すＩＤＣＴ演算部３９と、ＩＤＣＴ演算部
３９によりＤＣＴ処理される前の画像データに戻された
データに動き補償を加算する加算器４０と、コントロー
ラ３０からの画像モード、予測モードに従って信号経路
を切り換えるスイッチ４１、４２、４３と、コントロー
ラ３０で演算処理（図７参照）された動きベクトルによ
り動き補償予測を行なう予測器４４、４５とから構成さ
れる。The motion-compensated inter-frame prediction unit 37 dequantizes the image data quantized by the quantization unit 34 and the dequantization unit 38, and the dequantization unit 38 restores the unquantized image data. Inverse DCT (IDC
T) According to the image mode and prediction mode from the controller 30, the IDCT calculation unit 39 that performs the calculation, the adder 40 that adds motion compensation to the data returned to the image data before the DCT processing by the IDCT calculation unit 39. It is composed of switches 41, 42 and 43 for switching signal paths, and predictors 44 and 45 for performing motion compensation prediction based on motion vectors calculated by the controller 30 (see FIG. 7).

【００３３】上記画像メモリ３１は、前記図１の機能ブ
ロック図のフレームメモリ（ＦＭ）１に対応し、上記Ｄ
ＣＴ演算部３３及び量子化部３４は全体として図１の圧
縮器４に対応している。また、コントローラ３０は全体
として図１の予測構造決定装置２、予測構造群メモリ
（ＰＭ）３及び制御装置５に対応しており、コントロー
ラ３０は、内蔵された予測構造群メモリ（ＰＭ）を用い
て予測構造決定処理を実行し、決定された予測構造を基
にシステム全体の動画像圧縮制御を行なう。The image memory 31 corresponds to the frame memory (FM) 1 in the functional block diagram of FIG.
The CT calculator 33 and the quantizer 34 generally correspond to the compressor 4 of FIG. The controller 30 generally corresponds to the prediction structure determination device 2, the prediction structure group memory (PM) 3 and the control device 5 of FIG. 1, and the controller 30 uses the built-in prediction structure group memory (PM). Then, the predictive structure determining process is executed, and the moving image compression control of the entire system is performed based on the determined predictive structure.

【００３４】また、動画像圧縮装置の復号器は、上記符
号化器とは逆の動作を行なうものであり、具体的には、
図２に示すように、変動する情報発生を一定レートに平
滑するバッファ４６と、バッファ４６に記憶された復号
化すべき画像データを前記ビデオマルチプレックス符号
化部（ＶＬＣ）３５の処理と逆の処理を行なって復号化
する逆ビデオマルチプレックス復号化部（ＶＬＣ-1）４
７と、ＶＬＣ-1４７で決定された量子化幅に従ってＶＬ
Ｃ-1４７の出力に対し逆量子化する逆量子化部４８と、
逆量子化部４８で逆量子化されたデータに対し逆ＤＣＴ
演算を施すＩＤＣＴ演算部４９と、ＩＤＣＴ演算部４９
の出力に予測結果を加算する加算器５０と、ＶＬＣ-1４
７からの画像モード、予測モードに従って信号経路を切
り換えるスイッチ５１、５２、５３と、ＶＬＣ-1４７で
算出された動きベクトルにより動き補償予測を行なう予
測器５４、５５とから構成される。Further, the decoder of the moving picture compression apparatus performs an operation reverse to that of the above-mentioned encoder, and more specifically,
As shown in FIG. 2, a buffer 46 for smoothing fluctuating information generation to a constant rate, and image data to be decoded stored in the buffer 46, which is the reverse of the processing of the video multiplex encoding unit (VLC) 35. Inverse video multiplex decoding unit (VLC-1) 4 for performing decoding by performing
7 and VL according to the quantization width determined by VLC-147
An inverse quantizer 48 for inverse quantizing the output of C-147;
Inverse DCT is performed on the data inversely quantized by the inverse quantizer 48.
IDCT calculation unit 49 for performing calculation, and IDCT calculation unit 49
VLC-14 and an adder 50 for adding the prediction result to the output of
It is composed of switches 51, 52 and 53 for switching signal paths according to the image mode and prediction mode from 7 and predictors 54 and 55 for performing motion compensation prediction based on the motion vector calculated by VLC-147.

【００３５】図４は上記動画像圧縮装置で用いられる画
像グループを説明するための図であり、図中の四角形は
画面を、Ｉ，Ｐ，Ｂは夫々Ｉピクチャ、Ｐピクチャ、Ｂ
ピクチャを示している。FIG. 4 is a diagram for explaining an image group used in the above-mentioned moving image compression apparatus. In the figure, a rectangle is a screen, and I, P and B are I picture, P picture and B respectively.
Shows a picture.

【００３６】一般に、時間予測を伴う圧縮では、予測元
となる画像が再生済でなければ、その圧縮データから画
像を展開することができない。現在有効であると考えら
れている画像圧縮には、画像展開時における特殊再生
（途中再生、高速再生等）や、誤差の蓄積による画像の
乱れ等に対応するため、時間予測を伴わない圧縮画像
（Ｉピクチャ）が存在している。この場合、あるＩピク
チャから次のＩピクチャまでの画像データは、それらの
外の画像情報に左右されないため、図４に示すようにそ
れらの画面群を１つのグループ（画像グループ）として
処理することができる。In general, in compression involving temporal prediction, an image as a prediction source cannot be expanded from the compressed data unless the image has been reproduced. Image compression, which is currently considered to be effective, is a compressed image that does not involve time prediction in order to deal with special playback (intermediate playback, high-speed playback, etc.) during image expansion, and image distortion due to error accumulation. (I picture) exists. In this case, the image data from one I picture to the next I picture is not affected by the image information outside them, so that those screen groups should be processed as one group (image group) as shown in FIG. You can

【００３７】図５及び図６は２つの相関による予測構造
を示す図であり、図５がフィールド相関が高い場合の予
測構造を、図６が時間相関が高い場合の予測構造を夫々
示している。前記図１に示した複数の予測構造群を記憶
する予測構造群メモリ（ＰＭ）３は、図５及び図６に示
すような複数の予測構造から構成されている。FIGS. 5 and 6 are diagrams showing a prediction structure based on two correlations. FIG. 5 shows a prediction structure when the field correlation is high, and FIG. 6 shows a prediction structure when the time correlation is high. . The prediction structure group memory (PM) 3 for storing the plurality of prediction structure groups shown in FIG. 1 is composed of a plurality of prediction structures as shown in FIGS. 5 and 6.

【００３８】また、本動動画像圧縮装置の予測構造は、
画像情報により予測構造を変化させるために以下のよう
なＧＯＰ（Group Of Picture）を構成するようにする。The predictive structure of the main moving image compression apparatus is
The following GOP (Group Of Picture) is configured to change the prediction structure according to the image information.

【００３９】まず、過去と未来の双方向から予測される
Ｂピクチャの数を、例えば“２”に固定する。すなわ
ち、基本的にはＢピクチャはＰピクチャ及びＩピクチャ
の間に幾つ存在してもよいが、本実施例ではこのＢピク
チャを２に固定することによって２つのＢピクチャのう
ち必ず一方の画像はその隣のＰピクチャまたはＩピクチ
ャとで空間軸の縦軸が存在することになり、したがっ
て、上記Ｂピクチャの他方は必ず時間的に一番近い画像
となる。すなわち、ＢピクチャはＰピクチャ（このＰピ
クチャはＩピクチャから予測される）からの予測が入る
ことになるが、図５及び図６に示すようにＢピクチャの
数を２に固定することによってＢピクチャとＩピクチャ
（またはＰピクチャ）とに関しては閉じた関係となり、
ある程度独立したものとみることができるようになる。
換言すれば、Ｂピクチャの数を２にすることによってＢ
ピクチャを独立させて影響を受けないようにすることが
できる。First, the number of B pictures predicted bidirectionally in the past and the future is fixed at, for example, "2". That is, basically, any number of B pictures may exist between P pictures and I pictures, but in the present embodiment, by fixing this B picture to 2, one of the two B pictures must be The vertical axis of the spatial axis exists between the adjacent P picture or I picture, and therefore, the other of the B pictures is always the temporally closest image. That is, a B picture is predicted from a P picture (this P picture is predicted from an I picture), but by fixing the number of B pictures to 2 as shown in FIGS. The picture and I picture (or P picture) have a closed relationship,
It can be seen as something independent.
In other words, by setting the number of B pictures to 2,
The pictures can be independent and unaffected.

【００４０】次に、片側から予測されるＰピクチャを以
下のように決定する。１つ以上のＩピクチャを含むフレ
ームメモリの列をＧＯＰという。このＧＯＰにおいて前
述したようにＢピクチャを２つずつとって、あるＧＯＰ
を構成したときにあるＩピクチャと次のＩピクチャとが
違うフィールドとなるようにＩピクチャとＩピクチャと
の間の画面の数を調整する。具体的には、図４及び図５
に示すように、Ｉ1ピクチャと次のＩ2ピクチャとの間の
Ｐピクチャの数が偶数個（本実施例では、４個）あれば
Ｉ1ピクチャとＩ2ピクチャとは異なるフィールドをとる
ように切り換わるようになる。Next, the P picture predicted from one side is determined as follows. A column of the frame memory including one or more I pictures is called a GOP. As described above, in this GOP, two B pictures are taken to obtain a certain GOP.
The number of screens between I-pictures is adjusted so that one I-picture and the next I-picture are different fields. Specifically, FIG. 4 and FIG.
As shown in FIG. 6, if the number of P pictures between the I1 picture and the next I2 picture is even (4 in this embodiment), the I1 picture and the I2 picture are switched so as to take different fields. become.

【００４１】この場合、上記予測構造をとる上記Ｐピク
チャはある少し離れた片側の画像しか相関しないことに
なるので、このＰピクチャの画像が時間的に近い方が予
測誤差が少ないのか、あるいは場所（フィールド）が同
一の方が予測誤差が少ないのかを判定し、その判定結果
に基づいて図５及び図６の予測構造のうち最適な予測構
造を選択する。すなわち、図５及び図６に示すように、
本動画像圧縮装置はフィールド相関が高い場合の予測構
造と時間相関が高い場合の予測構造をそれぞれ持ち、画
像情報によりどちらかを選択する。In this case, since the P picture having the above prediction structure correlates only with the image on one side that is a little distant from the other, whether the P picture image is temporally closer has less prediction error, or the location It is determined whether the prediction error is smaller when the (fields) are the same, and the optimum prediction structure is selected from the prediction structures of FIGS. 5 and 6 based on the determination result. That is, as shown in FIG. 5 and FIG.
The moving image compression apparatus has a prediction structure when the field correlation is high and a prediction structure when the time correlation is high, and selects either one according to the image information.

【００４２】選択方法は、Ｉ2ピクチャから予測され得
るＰ3ピクチャ，Ｐ4ピクチャを夫々同一条件下で予測
し、誤差の少ない方を予測構造に持つ方を選択する。例
えば、時間相関の方が強いときには一番時間の近いとこ
ろから予測し、また、フィールド（場所）相関が強いと
きには同一フィールドから予測できるように予測構造を
切り換える。図５に示すフィールド相関では各Ｐピクチ
ャを同一フィールドの画像から予測し、図６に示す時間
フィールドでは各Ｐピクチャを時間的に近い画像から予
測している。また、Ｂピクチャは片側が時間的に近い画
像、他方が同一のフィールドの画像となるようにしてい
る。As the selection method, P3 picture and P4 picture which can be predicted from I2 picture are predicted under the same conditions, respectively, and one having a smaller error in the prediction structure is selected. For example, when the time correlation is stronger, the prediction structure is switched so that the prediction is performed from the closest time point and when the field (location) correlation is strong, the prediction can be performed from the same field. In the field correlation shown in FIG. 5, each P picture is predicted from an image in the same field, and in the time field shown in FIG. 6, each P picture is predicted from an image that is temporally close. In addition, the B picture is configured such that one side is an image that is close in time and the other side is an image in the same field.

【００４３】次に、本実施例の動作を説明する。Next, the operation of this embodiment will be described.

【００４４】図７はコントローラ３０で実行される予測
構造決定処理のフローチャートであり、前記図１の予測
構造決定装置２における処理に対応する。FIG. 7 is a flowchart of the predictive structure determining process executed by the controller 30, which corresponds to the process in the predictive structure determining device 2 of FIG.

【００４５】先ず、ステップＳ１でＩ2ピクチャからＰ4
ピクチャを予測し、ステップＳ２でＩ2ピクチャからＰ3
ピクチャを予測する。すなわち、Ｉ1ピクチャとフィー
ルドが異なる次のＩ2ピクチャによって時間的に近いＰ
ピクチャとなるＰ4ピクチャを予測し（図６参照）、ま
た、同Ｉ2ピクチャによりフィールドが同じＰピクチャ
となるＰ3ピクチャを予測する（図５参照）。First, in step S1, I2 picture to P4
The picture is predicted, and the I2 picture is converted to the P3 in step S2.
Predict pictures. That is, a P that is temporally closer to the next I2 picture whose field is different from that of the I1 picture.
A P4 picture to be a picture is predicted (see FIG. 6), and a P3 picture having the same P picture in the same field is predicted from the same I2 picture (see FIG. 5).

【００４６】次いで、ステップＳ３でＰ4ピクチャを予
測した方がＰ3ピクチャを予測するより誤差が少ないか
否かを判別する。この場合、選択方法は、Ｉ2ピクチャ
から予測され得るＰ3ピクチャ，Ｐ4ピクチャをそれぞれ
同一条件下で予測し、誤差の少ない方を予測構造に持つ
方を選択する。ここでは、フィールド相関では各Ｐピク
チャを時間的に近い画像から予測している。また、Ｂピ
クチャは片側が時間的に近い画像、他方が同一のフィー
ルドの画像である。Then, in step S3, it is determined whether or not the prediction of the P4 picture has a smaller error than the prediction of the P3 picture. In this case, the selection method predicts the P3 picture and the P4 picture that can be predicted from the I2 picture under the same conditions, and selects the one having the smaller error in the prediction structure. Here, in the field correlation, each P picture is predicted from images that are temporally close. A B picture is an image in which one side is temporally close and the other is an image in the same field.

【００４７】Ｐ4ピクチャを予測した方がＰ3ピクチャを
予測するより誤差が少ないときには時間相関の方が相関
が強いときであるからステップＳ４で図６に示す時間相
関予測構造を採用して本フローの処理を終え、また、Ｐ
3ピクチャを予測した方がＰ4ピクチャを予測するより誤
差が少ないときにはフィールド相関の方が相関が強いと
きであるからステップＳ５で図５に示すフィールド相関
予測構造を採用して本フローの処理を終える。When the error of the P4 picture is smaller than that of the P3 picture, the time correlation is stronger than that of the P3 picture. Therefore, the time correlation prediction structure shown in FIG. After processing, P
When the error is smaller in predicting 3 pictures than in predicting P4 pictures, the field correlation is stronger, so that the field correlation prediction structure shown in FIG. 5 is adopted in step S5 and the processing of this flow ends. .

【００４８】より詳しく説明すると、いま例えば図５及
び図６のような予測構造が用意されている場合、予測構
造の決定は前記図７のフローに示す手順で決定すること
ができる。More specifically, when the prediction structures shown in FIGS. 5 and 6 are prepared, the prediction structure can be determined by the procedure shown in the flow of FIG.

【００４９】先ず、Ｉ2ピクチャからＰ4ピクチャ及びＰ
3ピクチャへの予測は、動き補償（ＭＣ）を含む予測で
あり、通常の動画像予測と同様に行われ、その予測誤差
の合計を比べることによってＰ4ピクチャ及びＰ3ピクチ
ャの何れかの画像の方がよりＩ2ピクチャの画像に近い
かを決定し、Ｐ4ピクチャの画像の方が近ければ（すな
わち、予測誤差が少なければ）図５の時間相関構造を、
Ｐ3ピクチャの画像の方が近ければ図６のフィールド相
関構造を選択する。そして、どちらが選択されたかを示
す情報を図４の画像グループ毎に与えてやるようにすれ
ばデコード時に正常に行なうこともできる。なお、この
選択情報はＭＰＥＧＩのフォーマットではＧＯＰヘッダ
ーを１bit拡張することで実現できる。First, from I2 picture to P4 picture and P
Prediction to 3 pictures is prediction including motion compensation (MC), and is performed in the same manner as normal moving picture prediction. By comparing the total of the prediction errors, either P4 picture or P3 picture is predicted. Is closer to the image of the I2 picture, and if the image of the P4 picture is closer (that is, if the prediction error is smaller), the temporal correlation structure of FIG.
If the P3 picture image is closer, the field correlation structure of FIG. 6 is selected. If information indicating which is selected is given for each image group in FIG. 4, normal operation can be performed at the time of decoding. It should be noted that this selection information can be realized by expanding the GOP header by 1 bit in the MPEGI format.

【００５０】以上説明したように、本実施例に係る動画
像圧縮装置のコントローラ３０は、Ｉ2ピクチャから予
測される時間的に近いＰ4ピクチャとフィールドが同じ
Ｐ3ピクチャをそれぞれ同一条件下で予測して誤差の少
ない予測構造を選択し、その予測構造で画像データを圧
縮しているので、どのような画像においても高い予測効
率を得ることができる。また、予測効率を上げることが
できるので同一圧縮率の場合には画質の向上を図ること
ができ、同一画質の場合には圧縮率を向上させることが
できる。As described above, the controller 30 of the moving picture compression apparatus according to the present embodiment predicts the P4 picture that is temporally close to the I2 picture and the P3 picture having the same field under the same conditions. Since a prediction structure with a small error is selected and the image data is compressed with the prediction structure, high prediction efficiency can be obtained for any image. Further, since the prediction efficiency can be increased, the image quality can be improved when the compression rate is the same, and the compression rate can be improved when the image quality is the same.

【００５１】なお、本実施例では、複数の予測構造を、
フィールド相関が高い場合に使用される予測構造と時間
相関が高い場合に使用される予測構造の２つの予測構造
としているが、これに限定されず、各相関（フィールド
相関、時間相関）に適した予測構造であればいかなる構
造であっても適用可能である。In this embodiment, a plurality of prediction structures are
There are two prediction structures, a prediction structure used when the field correlation is high and a prediction structure used when the time correlation is high, but the present invention is not limited to this, and is suitable for each correlation (field correlation, time correlation). Any structure can be applied as long as it is a prediction structure.

【００５２】また、本実施例では、符号化しようとする
画像データがフィールド相関と時間相関とで何れの相関
性が強いかをＩ2ピクチャから時間的に近いＰ4ピクチャ
を予測したときの予測誤差と、フィールドが同じＰ3ピ
クチャを予測したときの予測誤差とを比較することによ
り相関性を判別するようにしてるが、相関性を判別でき
るものであれば何でもよく、例えば画像の垂直方向への
動き量を調べる等の方法でもよい。Further, in the present embodiment, which correlation between the image data to be coded is stronger between the field correlation and the time correlation is a prediction error when the P4 picture temporally close to the I2 picture is predicted. , The correlation is determined by comparing with the prediction error when the P3 picture with the same field is predicted, but any correlation can be determined, for example, the amount of movement of the image in the vertical direction. May be used.

【００５３】また、上記予測構造は一例であり、Ｐピク
チャやＢピクチャの数、位置、予測の方向等は前述した
実施例に限られないことは勿論である。The above-described prediction structure is an example, and the number of P pictures and B pictures, positions, prediction directions, etc. are not limited to those in the above embodiments.

【００５４】また、本実施例では、動画像圧縮装置をＭ
ＰＥＧアルゴリズムに基づく動画像圧縮装置に適用した
例であるが、勿論これには限定されず、予測構造に基づ
いて符号化を行なうものであれば全ての装置に適用可能
であることは言うまでもない。Further, in the present embodiment, the moving image compression apparatus is M
This is an example applied to a moving image compression apparatus based on the PEG algorithm, but needless to say, the present invention is not limited to this and can be applied to all apparatuses as long as they perform encoding based on a prediction structure.

【００５５】さらに、上記動画像圧縮装置を構成する回
路や部材の数、種類などは前述した実施例に限られない
ことは言うまでもなく、ソフトウェア（例えば、Ｃ言
語）により実現するようにしてもよい。Further, it goes without saying that the number and types of circuits and members constituting the above-mentioned moving picture compression apparatus are not limited to those in the above-mentioned embodiment, but may be realized by software (for example, C language). .

【００５６】[0056]

【発明の効果】請求項１、２、３、４、５、６、７及び
８記載の発明によれば、複数の予測画像から構成される
予測構造を複数記憶する予測構造記憶手段と、符号化す
べき画像データに基づいて前記予測構造記憶手段に記憶
された予測構造を選択する予測構造選択手段と、前記予
測構造選択手段により選択された予測構造に従って画像
データを圧縮するデータ圧縮手段を備えているので、ど
のような画像においても高い予測効率を得ることがで
き、画質及び圧縮率の向上を図ることができる。According to the first, second, third, fourth, fifth, sixth, seventh and eighth aspects of the present invention, a prediction structure storage means for storing a plurality of prediction structures each composed of a plurality of prediction images, and a code A prediction structure selecting means for selecting a prediction structure stored in the prediction structure storing means based on image data to be converted; and a data compressing means for compressing the image data according to the prediction structure selected by the prediction structure selecting means. Therefore, it is possible to obtain high prediction efficiency for any image, and it is possible to improve image quality and compression rate.

[Brief description of drawings]

【図１】動画像圧縮装置の機能ブロック図である。FIG. 1 is a functional block diagram of a moving image compression apparatus.

【図２】動画像圧縮装置の符号化器のブロック構成を示
す図である。FIG. 2 is a diagram showing a block configuration of an encoder of a moving image compression apparatus.

【図３】動画像圧縮装置の復号化器のブロック構成を示
す図である。FIG. 3 is a diagram showing a block configuration of a decoder of a moving image compression apparatus.

【図４】動画像圧縮装置の画像グループを示す図であ
る。FIG. 4 is a diagram showing an image group of a moving image compression apparatus.

【図５】動画像圧縮装置の予測構造を示す図である。FIG. 5 is a diagram showing a prediction structure of a moving image compression apparatus.

【図６】動画像圧縮装置の予測構造を示す図である。FIG. 6 is a diagram showing a prediction structure of a moving image compression apparatus.

【図７】動画像圧縮装置の予測構造の決定処理を示すフ
ローチャートである。FIG. 7 is a flowchart showing a predictive structure determination process of the moving image compression apparatus.

【図８】動画像圧縮装置の画面の予測構造を示す図であ
る。FIG. 8 is a diagram showing a prediction structure of a screen of the moving image compression apparatus.

【図９】動画像圧縮装置の画面の予測構造を示す図であ
る。FIG. 9 is a diagram showing a prediction structure of a screen of the moving image compression apparatus.

[Explanation of symbols]

１フレームメモリ（ＦＭ）２予測構造決定装置３予測構造群メモリ（ＰＭ）４圧縮器５制御装置３０コントローラ３１画像メモリ３２減算器３３ＤＣＴ演算部３４量子化部３５ＶＬＣ３６，４６バッファ３７動き補償フレーム間予測部３８，４８逆量子化部３９，４９ＩＤＣＴ演算部４０，５０加算器４１，４２，４３，５１，５２，５３スイッチ４４，４５，５４，５５予測器４７逆ＶＬＣ 1 Frame Memory (FM) 2 Prediction Structure Determining Device 3 Prediction Structure Group Memory (PM) 4 Compressor 5 Controller 30 Controller 31 Image Memory 32 Subtractor 33 DCT Calculator 34 Quantizer 35 VLC 36, 46 Buffer 37 Motion Compensation Inter-frame predictor 38,48 Dequantizer 39,49 IDCT calculator 40,50 Adder 41,42,43,51,52,53 Switch 44,45,54,55 Predictor 47 Inverse VLC

Claims

[Claims]

1. A prediction structure storage unit for storing a plurality of prediction structures each composed of a plurality of prediction images, and a prediction structure selection unit for selecting a prediction structure stored in the prediction structure storage unit based on image data to be encoded. And a data compression unit that compresses image data according to the prediction structure selected by the prediction structure selection unit.

2. A plurality of prediction structures stored in the prediction structure storage means include a prediction structure used when field correlation is high and a prediction structure used when time correlation is high. The moving picture compression apparatus according to claim 1, wherein

3. The prediction structure includes a plurality of coded images that can be coded independently, a one-sided prediction image that predicts the present from the past, and a bidirectional prediction image that predicts the present from both past and future directions. 3. The moving image according to claim 1, wherein the moving image has a structure configured to perform prediction by using each image as a prediction source image in a predetermined order. Image compression device.

4. The prediction structure has a first coded image that can be independently coded in one field obtained by dividing a frame into an odd field and an even field, and independently in the other field. 4. The moving image compression apparatus according to claim 1, wherein the moving image compression apparatus is configured to include a second encoded image that can be encoded.

5. The bidirectionally predicted image is two adjacent images located between the coded image or the one-sided predicted image and the coded image or the one-sided predicted image. 3. The moving image compression apparatus according to item 3.

6. The predictive structure performs interframe predictive processing for compression in a time axis direction, claim 1, claim 2, claim 3, claim 4 or claim 6. 5. The moving image compression apparatus according to any one of 5 above.

7. The predictive structure selecting means determines which of image correlation to be coded is stronger, field correlation or time correlation, and the predictive structure storing means stores it in the predictive structure storage means based on the determination result. Of the stored prediction structures,
The moving picture compression apparatus according to claim 1, wherein a prediction structure having a strong correlation is selected.

8. The prediction structure selecting means, a prediction error when predicting the one-sided predicted image that is temporally close when predicted using the coded image as a prediction source image,
When the coded image is used as an original image for prediction, the correlation is determined by comparing with a prediction error when the one-sided predicted image having the same field is predicted. The moving image compression apparatus according to claim 1.