JPH10164594A

JPH10164594A - Compression encoding method for moving image and device therefor

Info

Publication number: JPH10164594A
Application number: JP31907396A
Authority: JP
Inventors: Hiroki Mizozoe; 博樹溝添
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1996-11-29
Filing date: 1996-11-29
Publication date: 1998-06-19

Abstract

PROBLEM TO BE SOLVED: To reduce redundant information which is hardly perceived by the eyes of a human being to a time direction and to improve the compressibility of moving images by forming the difference images of reference images and encoding object images by using motion vector information and performing the batch orthogonal transformation processing of the plural difference images of the different time in a time space. SOLUTION: Inputted moving image signals are stored in a frame memory 11 provided with a storage area worth four frames. In a motion vector detection part 12, bidirectional prediction images are compared with the reference images for each macro-block and a closest position is detected, defined as a motion vector and stored in a memory 13. The difference image is calculated in a difference image calculation part 14 from the image of the frame memory 11 and the motion vector of the motion vector memory 13, and the batch discrete cosine transformation processing of the difference image is performed in a time space DCT part 15. A quantization part 16 divides the DCT-processed data by a quantization coefficient, rounds off a remainder, quantizes them, enlarges the quantization coefficient of a wide-band component, inconspicuous to the eyes of the human being and compresses an information amount while suppressing the degradation of image quality.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は動画像の圧縮符号化
方法およびその装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for compression-encoding a moving image.

【０００２】[0002]

【従来の技術】ディジタル化された映像信号、とりわけ
動画像は膨大な情報量を持つ。そのため、磁気ディスク
などの記録媒体に長時間にわたってそのまま記録しよう
とすると非常に大きい記憶容量が必要になり、コストが
嵩む。あるいは、有線または無線によりリアルタイムで
伝送しようとすると映像データのビットレートが高いこ
とから非常に広帯域な伝送路が必要であり、その実現は
容易ではない。そこで信号処理により映像データを効率
良く符号化し、データ量を削減するための符号化方式が
いくつか提案されている。2. Description of the Related Art Digitized video signals, especially moving images, have a huge amount of information. Therefore, if the recording is to be performed on a recording medium such as a magnetic disk as it is for a long time, a very large storage capacity is required, and the cost is increased. Alternatively, transmission of data in real time by wire or wireless requires a very wide transmission path because the bit rate of video data is high, and the realization is not easy. Therefore, several encoding methods have been proposed for efficiently encoding video data by signal processing and reducing the amount of data.

【０００３】そのような符号化方式の一つとしてＭＰＥ
Ｇ２（「テレビジョン学会誌」；Ｖｏｌ．４８，Ｎｏ．
１，ｐｐ．４４〜４９に記載）がある。ＭＰＥＧ２では
映像データのデータ量を削減するために主に以下に述べ
る複数の方法を組み合わせて用いている。[0003] One of such encoding systems is MPE.
G2 ("The Journal of the Institute of Television Engineers of Japan"; Vol. 48, No.
1, pp. 44-49). In MPEG2, a plurality of methods described below are mainly used in combination in order to reduce the amount of video data.

【０００４】第１に符号化する画像とその前後の画像
（以下参照画像と呼ぶ）との間で差分を取って振幅の小
さい情報に変換することにより、符号化に要するビット
数を少なくする方法（動き補償予測）である。First, a method of reducing the number of bits required for encoding by taking a difference between an image to be encoded and an image before and after the image (hereinafter referred to as a reference image) and converting the difference into information having a small amplitude. (Motion compensation prediction).

【０００５】画像の参照方法によりＩ，Ｐ，Ｂの３種類
のピクチャ（符号化を行う画像の単位）が存在する。こ
れを図２（ａ）に示す。図中の矢印は始点が参照画像、
終点が符号化対象の画像を表している。Ｉピクチャは画
像の参照を行わない。復号化のために必要な情報をすべ
てそのピクチャ内に符号化するからである。Ｐピクチャ
は原画像順で直前に存在するＩピクチャまたはＰピクチ
ャを参照画像として使用する。そしてＢピクチャは直前
と直後に存在するＩまたはＰピクチャを参照画像として
使用する。Ｂピクチャの場合は原画像順で後に来るＰピ
クチャを参照画像として用いるので、符号化データの順
番としては図２（ｂ）のように符号化対象のＢピクチャ
に対応する２枚の参照画像が常に先に来るように並べか
えて符号化する。これは復号時に正しい順番に並べ替え
られて出力される。There are three types of pictures (units of images to be coded) of I, P, and B depending on the image reference method. This is shown in FIG. The starting point of the arrow in the figure is the reference image,
The end point represents the image to be encoded. The I picture does not refer to an image. This is because all information necessary for decoding is encoded in the picture. As a P picture, an I picture or a P picture existing immediately before in the original picture order is used as a reference picture. The B picture uses the I or P picture existing immediately before and immediately after as a reference picture. In the case of a B picture, a P picture that comes later in the order of the original image is used as a reference image. Therefore, as the order of encoded data, two reference images corresponding to the B picture to be encoded as shown in FIG. Encode by rearranging so that it always comes first. This is output after being rearranged in a correct order at the time of decoding.

【０００６】一つのピクチャはマクロブロックという１
６×１６画素の大きさの区画に区切って符号化を行う。
このマクロブロックには画像の左上から順にマクロブロ
ックアドレスと呼ばれる通し番号が付けられている。図
３に示すように、例えば１ピクチャが７２０×４８０画
素のフレーム画像から構成されている場合は０〜１３４
９までの１３５０個のマクロブロックが存在する。One picture is called a macroblock.
Encoding is performed by partitioning into sections of 6 × 16 pixels.
The macroblocks are assigned serial numbers called macroblock addresses in order from the upper left of the image. As shown in FIG. 3, for example, when one picture is composed of frame images of 720 × 480 pixels, 0 to 134
There are up to 9 1350 macroblocks.

【０００７】マクロブロックは動き補償を行って参照画
像との差分を取る最小単位である。参照画像上で参照に
用いる部分の位置を指し示す情報（動きベクトル情報）
を各マクロブロックの符号化データ中にエンコードす
る。[0007] A macroblock is a minimum unit that performs motion compensation and takes a difference from a reference image. Information indicating the position of the part used for reference on the reference image (motion vector information)
Is encoded in the encoded data of each macroblock.

【０００８】さらに、マクロブロックは８画素×８画素
の単位画像（以下ブロック）から構成されている。例え
ばＹ：Ｃｂ：Ｃｒ＝４：２：０の形式の場合、図３の左
側の図に示すように輝度信号はＹ０〜Ｙ３の四つのブロ
ック、色差信号はＣｂ，Ｃｒの各一つずつ、計六つのブ
ロックによってマクロブロックを構成している。Further, the macro block is composed of a unit image (hereinafter, block) of 8 pixels × 8 pixels. For example, in the case of the format of Y: Cb: Cr = 4: 2: 0, as shown in the left side of FIG. 3, the luminance signal is four blocks of Y0 to Y3, and the color difference signal is one each of Cb and Cr. A macro block is composed of a total of six blocks.

【０００９】データ量削減のための第２の方法は、映像
データにＤＣＴ（離散コサイン変換）を施すことにより
空間周波数領域に変換する方法である。図４に示すよう
に、ｘ−ｙ軸という空間上に存在する画素情報を、ＤＣ
Ｔによりμ−ν軸という空間周波数領域のデータに変換
する。映像信号の場合一般に空間周波数の低域成分にエ
ネルギーが集中している割合が高く、高域成分は振幅値
が小さい。それに応じて符号化に割り当てるビット数を
低域成分ほど多く、高域成分ほど少なくすることによっ
てビット数の割り当てを最適化することができ、結果と
して全体のビット数を削減することが可能である。ＭＰ
ＥＧ２では前述のブロック単位でＤＣＴを行った後、高
域成分ほどビット数の割当てを少なくするよう量子化処
理を行うことにより、データの削減を図っている。A second method for reducing the data amount is a method of performing DCT (Discrete Cosine Transform) on video data to convert the video data into a spatial frequency domain. As shown in FIG. 4, pixel information existing on a space called an xy axis is
The data is converted into data in the spatial frequency domain called μ-ν axis by T. In the case of a video signal, generally, the rate at which energy is concentrated in the low frequency component of the spatial frequency is high, and the amplitude value of the high frequency component is small. Accordingly, the number of bits to be allocated to the coding can be optimized by assigning more bits to the low-frequency component and decreasing the number of bits to the high-frequency component. As a result, the overall number of bits can be reduced. . MP
In EG2, data is reduced by performing DCT on a block-by-block basis and then performing a quantization process so that the higher the frequency component, the smaller the number of bits allocated.

【００１０】第３にそれぞれの値の出現確率に応じて異
なる長さの符号を割り当てる可変長符号化である。出現
頻度の高い値ほど短い符号を割り当てておくことによ
り、全体のビット数を削減することが可能である。ＭＰ
ＥＧ２を始めとする従来の技術では前述の量子化された
ＤＣＴ係数のほか、マクロブロックアドレスや動きベク
トルなどのパラメータの符号化に可変長符号化を用いて
いる。Third, variable-length coding in which codes having different lengths are assigned according to the appearance probabilities of the respective values. By assigning a shorter code to a value having a higher appearance frequency, it is possible to reduce the total number of bits. MP
Conventional technologies such as EG2 use variable-length coding for coding parameters such as macroblock addresses and motion vectors in addition to the above-described quantized DCT coefficients.

【００１１】[0011]

【発明が解決しようとする課題】画像圧縮技術には、動
き補償や可変長符号化のように、符号の表現方法を工夫
することによりできるだけ少ない情報量で画像を表現す
る方法と、ＤＣＴおよび量子化のように、人間の目に知
覚されにくい情報を削減することによって冗長度を削減
する方法とがある。そのうち後者に関して、ＭＰＥＧ２
の場合は、ＤＣＴおよび量子化を空間方向、すなわち同
一時刻の画像内でのみ行っている。したがって、冗長度
の削減が行われているのは空間方向に対してのみであ
る。The image compression technique includes a method of expressing an image with a minimum amount of information by devising a method of expressing a code, such as motion compensation and variable-length coding, a DCT and a quantization. There is a method of reducing redundancy by reducing information that is hardly perceived by the human eye as in the case of image processing. Of the latter, MPEG2
In the case of, DCT and quantization are performed only in the spatial direction, that is, only in the image at the same time. Therefore, the reduction of the redundancy is performed only in the spatial direction.

【００１２】時間方向に対しては、動き補償によって符
号量を削減する工夫はなされている。しかし、それは差
分を取ることによって振幅を減らし、符号化に要するビ
ット数を少なくする手法である。したがって、動き補償
は符号の表現方法の工夫に過ぎず、人間の視覚特性を考
慮した冗長度の削減は、時間方向に関しては考慮されて
いない。In the time direction, a scheme has been devised to reduce the code amount by motion compensation. However, it is a technique of reducing the amplitude by taking the difference and reducing the number of bits required for encoding. Therefore, the motion compensation is only a method of expressing the code, and the reduction of the redundancy in consideration of the human visual characteristics is not considered in the time direction.

【００１３】そのため、従来の技術では時間方向に対す
る圧縮が不十分である。符号の量が一定ならば圧縮効率
が低いほど表現できる画像の画質も低くなるため、従来
の技術では得られる画質も十分であるとはいえない。Therefore, the conventional technique is insufficient in the compression in the time direction. If the amount of codes is constant, the lower the compression efficiency, the lower the image quality of the image that can be expressed, so that the image quality obtained with the conventional technology cannot be said to be sufficient.

【００１４】[0014]

【課題を解決するための手段】上記の問題を解決するた
め、本発明では、参照画像上の位置を指し示す動きベク
トル情報を用いて符号化対象の画像を予測する動き補償
と、画素情報を周波数領域のデータに変換する直交変換
処理と、上記直交変換されたデータを所定の値で除算
し、その余りを切捨てることにより情報を削減する量子
化と、符号の発生頻度に応じて異なる長さの符号を割り
当てる可変長符号化とを用いることにより動画像を圧縮
符号化する符号化方法において、上記動きベクトル情報
を用いて参照画像と符号化対象の画像との差分を取るこ
とにより差分画像を作成し、異なる時刻からなる符号化
対象画像の上記差分画像を複数枚用いて、時空間で一括
して直交変換処理を行うこととした。In order to solve the above problems, the present invention provides motion compensation for predicting an image to be encoded by using motion vector information indicating a position on a reference image, and frequency conversion of pixel information. Orthogonal transformation processing to convert to data in the area, quantization to reduce information by dividing the orthogonally transformed data by a predetermined value, and truncating the remainder, and different lengths depending on the frequency of code generation In a coding method for compressing and coding a moving image by using variable-length coding and assigning a code, a difference image is obtained by taking a difference between a reference image and an image to be coded using the motion vector information. The orthogonal transformation process is performed collectively in space and time using a plurality of difference images of the encoding target image created at different times.

【００１５】符号化対象の画像を参照画像と比較して類
似部分を検出し、参照画像上における該類似部分の位置
を指し示す情報である動きベクトルを求める。An image to be coded is compared with a reference image to detect a similar portion, and a motion vector, which is information indicating the position of the similar portion on the reference image, is obtained.

【００１６】次に、求めた動きベクトルを用いて、符号
化対象の画像と参照画像上の対応部分の画像との差分画
像を計算する。Next, a difference image between the image to be encoded and the image of the corresponding part on the reference image is calculated using the obtained motion vector.

【００１７】異なる時刻からなる符号化対象画像のそれ
らの差分画像を複数枚用いて、時空間で一括して直交変
換処理を行う。Using a plurality of differential images of the image to be coded at different times, orthogonal transform processing is performed collectively in space and time.

【００１８】直交変換された時空間周波数領域のデータ
に対し、所定の量子化係数で量子化処理を行う。A quantization process is performed on the orthogonally transformed spatio-temporal frequency domain data using a predetermined quantization coefficient.

【００１９】以上のようにして得られた量子化データ、
量子化係数および動きベクトル情報を、可変長符号によ
って符号化する。さらに、所定のヘッダ情報を付加し、
符号化映像信号として出力する。The quantized data obtained as described above,
The quantization coefficient and the motion vector information are encoded by a variable length code. Furthermore, predetermined header information is added,
Output as an encoded video signal.

【００２０】[0020]

【発明の実施の形態】図１に本発明の実施例である動画
像圧縮符号化装置１を示す。FIG. 1 shows a moving picture compression encoding apparatus 1 according to an embodiment of the present invention.

【００２１】本実施例では、図２（ａ）に示したよう
に、参照画像であるＩまたはＰピクチャの間に２枚のＢ
ピクチャが挟まれた形の符号化を行うものとする。ま
た、参照画像としては、Ｐピクチャ数枚につき１枚Ｉピ
クチャを周期的に使用する。In this embodiment, as shown in FIG. 2A, two B pictures are placed between an I or P picture as a reference picture.
It is assumed that coding is performed with a picture sandwiched therebetween. As a reference image, one I picture is used periodically for every several P pictures.

【００２２】入力された動画像信号は、まずフレームメ
モリ１１に蓄積する。このフレームメモリ１１は内部に
４フレーム分の画像を蓄える領域を持つ。そのうち２フ
レーム分は参照画像であるＩまたはＰピクチャの蓄積
に、残りの２フレーム分はＢピクチャの蓄積に使用す
る。２フレーム分の参照用画像領域のうち一つ（Ｒｅｆ
１）には、符号化された後復号化された、直前の参照用
画像が既に蓄積されている。例えば、これから図２
（ａ）のＢ４，Ｂ５，Ｐ６を符号化する場合を考える
と、直前の参照画像であるＰ３が既に符号化・再復号化
され、Ｒｅｆ１に蓄えられている。次に、新たに符号化
する３フレーム分（Ｂ４，Ｂ５，Ｐ６）の動画像信号が
入力され、それぞれをフレームメモリ１１のＢピクチャ
用領域（Ｂ１，Ｂ２）及びもう一つの参照用画像領域
（Ｒｅｆ２）に蓄積する。The input moving image signal is first stored in the frame memory 11. The frame memory 11 has an area for storing four frames of images therein. Of these, two frames are used for storing I or P pictures as reference images, and the remaining two frames are used for storing B pictures. One of the reference image areas for two frames (Ref
In 1), the immediately preceding reference image that has been encoded and then decoded is already stored. For example, FIG.
Considering the case of encoding B4, B5, and P6 in (a), P3, which is the immediately preceding reference image, has already been encoded and re-decoded and stored in Ref1. Next, moving image signals for three frames (B4, B5, P6) to be newly encoded are input, and each of them is stored in the B memory area (B1, B2) and another reference image area (B) in the frame memory 11. Ref2).

【００２３】次に動きベクトル検出部１２において、動
きベクトルの検出を行う。Ｂ４，Ｂ５は双方向予測画像
であるので、マクロブロック毎に参照画像であるＰ３お
よびＰ６と比較を行い、最も値の近い位置を検出するこ
とによって、動きベクトルを求める。また、Ｐ６は前方
予測画像であるので、参照画像のＰ３とのみ比較を行っ
て、各マクロブロック毎の動きベクトルを求める。Next, the motion vector detecting section 12 detects a motion vector. Since B4 and B5 are bidirectionally predicted images, a motion vector is obtained by comparing each of the macroblocks with the reference images P3 and P6 and detecting a position having the closest value. Further, since P6 is a forward prediction image, a comparison is made only with P3 of the reference image to obtain a motion vector for each macroblock.

【００２４】このようにして求めた動きベクトルは、動
きベクトル（ＭＶ）メモリ１３に記憶する。動きベクト
ルメモリ１３は、内部に参照画像用領域一つとＢピクチ
ャ用領域二つを持ち、それぞれＰ６，Ｂ４，Ｂ５の各ピ
クチャの動きベクトルを記憶する。The motion vector thus obtained is stored in the motion vector (MV) memory 13. The motion vector memory 13 internally has one reference image area and two B picture areas, and stores the motion vectors of P6, B4, and B5, respectively.

【００２５】次に、フレームメモリ１１に蓄えられた画
像と動きベクトルメモリ１３に記憶された動きベクトル
を用いて、差分画像計算部１４で差分画像を計算する。
例えば、Ｂ４，Ｂ５については、各マクロブロックごと
に、対応する動きベクトルを動きベクトルメモリ１３か
ら読み出し、それを用いてフレームメモリ１１の参照画
像Ｐ３とＰ６から、必要な位置の画像を読み出す。その
後、Ｂピクチャ自身から読み出した画像を差し引くこと
により、差分画像を得る。また、Ｐ６の場合も同様に、
各マクロブロックごとに対応する動きベクトルを動きベ
クトルメモリ１３から読み出し、それを用いてフレーム
メモリ１１の参照画像Ｐ３から、必要な位置の画像を読
み出し、Ｐ６自身から読み出した画像を差し引くことに
より、差分画像を得る。Next, a difference image is calculated by the difference image calculator 14 using the image stored in the frame memory 11 and the motion vector stored in the motion vector memory 13.
For example, for B4 and B5, a corresponding motion vector is read from the motion vector memory 13 for each macroblock, and an image at a required position is read from the reference images P3 and P6 of the frame memory 11 using the same. Thereafter, a difference image is obtained by subtracting the image read from the B picture itself. Similarly, in the case of P6,
A motion vector corresponding to each macro block is read from the motion vector memory 13, an image at a required position is read from the reference image P3 of the frame memory 11 using the motion vector, and the image read from P6 itself is subtracted to obtain a difference. Get an image.

【００２６】ここで、差分画像の計算は必ずしもＢ４，
Ｂ５，Ｐ６の各ピクチャ単位で別々に、あるいは順番に
行う必要はない。次の時空間ＤＣＴ部１５での計算の必
要に応じて必要なピクチャ、必要なマクロブロック位置
の差分画像データをその都度計算して出力する。Here, the calculation of the difference image is not necessarily B4
It is not necessary to perform the processing separately or in order for each picture of B5 and P6. When necessary in the next spatiotemporal DCT unit 15, the necessary picture and the difference image data of the required macroblock position are calculated and output each time.

【００２７】時空間ＤＣＴ部１５では、上記差分画像計
算部１４から出力されるＢ４，Ｂ５，Ｐ６の各差分画像
を一括して離散コサイン変換処理する。すなわち、図５
に示すように、空間ｘ−ｙ方向に加え時間ｔ方向を持つ
３次元のデータを、μ−ν−ｆの時間・空間周波数領域
のデータに変換する。The spatio-temporal DCT unit 15 performs discrete cosine transform processing on each of the difference images B4, B5, and P6 output from the difference image calculation unit 14. That is, FIG.
, Three-dimensional data having the time t direction in addition to the spatial xy direction is converted to data in the time-space frequency domain of μ-ν-f.

【００２８】量子化部１６では、ＤＣＴ処理されたデー
タに対し、各周波数成分毎に定められた量子化係数を用
いて除算し、その余りを切捨てることによって量子化を
行う。一般に人間の視覚特性上目につきにくい高域成分
において量子化係数の値を大きくし、画質の劣化を最小
限にとどめつつ情報量の圧縮を図る。The quantization unit 16 divides the data subjected to the DCT processing by using a quantization coefficient determined for each frequency component, and cuts off the remainder to perform quantization. Generally, the value of the quantization coefficient is increased in a high-frequency component that is hardly noticeable due to human visual characteristics, and the amount of information is compressed while minimizing image quality deterioration.

【００２９】量子化されたデータおよびその際に使用し
た量子化係数は、可変長符号化部１７（ＶＬＣ）におい
て、符号の出現頻度に応じて異なる長さをもつ可変長符
号に符号化される。また、動きベクトルメモリ１３に記
憶された動きベクトルについても、可変長符号化部１７
で可変長符号に符号化される。The quantized data and the quantized coefficients used at that time are encoded by the variable length encoding unit 17 (VLC) into variable length codes having different lengths according to the frequency of appearance of the code. . Further, the motion vectors stored in the motion vector memory 13 are also stored in the variable length coding unit 17.
In the variable length code.

【００３０】最後に、ヘッダ付加部１８において、各種
符号化パラメータがヘッダとして付加され、符号化映像
信号として出力される。Finally, in the header adding section 18, various coding parameters are added as a header and output as a coded video signal.

【００３１】２はローカルデコード部である。この部分
では、次の入力画像の符号化に備えて、今符号化した３
枚の画像のうち参照画像（Ｐ６）を復号化する。Reference numeral 2 denotes a local decoding unit. In this part, in preparation for encoding the next input image, the currently encoded 3
The reference image (P6) among the images is decoded.

【００３２】まず逆量子化部２１において、量子化部１
６から出力される量子化データに各周波数成分毎の量子
化係数を乗じることにより、逆量子化を行う。さらに時
空間ＩＤＣＴ部２２において逆離散コサイン変換処理を
行い、周波数領域から時間・空間領域の画素データに戻
す。First, in the inverse quantization unit 21, the quantization unit 1
Inverse quantization is performed by multiplying the quantized data output from 6 by a quantized coefficient for each frequency component. Further, an inverse discrete cosine transform process is performed in the spatiotemporal IDCT unit 22 to return the pixel data from the frequency domain to the time / space domain.

【００３３】動き補償部２３では、動きベクトルメモリ
１３からマクロブロック毎に動きベクトルを読み出し、
それを用いてフレームメモリ１１の参照画像（Ｐ３）か
ら対応する位置の画像データを読み出し、それを時空間
ＩＤＣＴ部２２の出力データと加算することによって、
復号化を行う。なお、ローカルデコード部２で復号化す
るのは参照画像のＰ６だけであるので、動き補償部２３
においてもＰ６のみ復号化する。The motion compensator 23 reads a motion vector from the motion vector memory 13 for each macroblock,
The image data at the corresponding position is read out from the reference image (P3) in the frame memory 11 using this, and is added to the output data from the spatiotemporal IDCT unit 22, thereby obtaining
Perform decryption. Since the local decoding unit 2 decodes only the reference image P6, the motion compensation unit 23
Also decrypts only P6.

【００３４】復号化された参照画像Ｐ６は、次の入力画
像の符号化に備えて、再びフレームメモリ１１にマクロ
ブロック毎に書き込まれていく。その際、フレームメモ
リ１１内の参照画像領域Ｒｅｆ１には、前の参照画像で
あるＰ３が蓄えられており、Ｐ６全体の復号化が完全に
終了するまで使用される可能性があるので、もう一つの
参照画像領域であるＲｅｆ２に復号化されたＰ６は書き
込まれて行く。The decoded reference image P6 is written again to the frame memory 11 for each macroblock in preparation for encoding the next input image. At this time, P3, which is the previous reference image, is stored in the reference image area Ref1 in the frame memory 11, and may be used until decoding of the entire P6 is completely completed. The decoded P6 is written into one reference image area Ref2.

【００３５】このようにしてＰ６の復号化が完全に終了
すると、次に、新たに符号化する３フレーム分（図２
（ａ）のＢ７，Ｂ８，Ｐ９）の動画像信号が入力され、
以上に述べた一連の符号化のステップが繰り返される。
なお、この次の符号化の際は、フレームメモリ１１の参
照画像用領域Ｒｅｆ２には前の参照画像であるＰ６が蓄
えられているので、Ｐ９はＲｅｆ２ではなく、Ｒｅｆ１
に蓄えられる。このように、参照画像用領域Ｒｅｆ１と
Ｒｅｆ２とは、符号化単位（Ｂ，Ｂ，Ｐ）毎にその役割
を交互に変えて使用される。また、Ｂ７，Ｂ８について
は前と同様にそれぞれフレームメモリ１１中の領域Ｂ
１，Ｂ２に蓄えられる。When the decoding of P6 is completely completed in this way, next, three frames to be newly encoded (FIG. 2)
The moving image signal of (B7, B8, P9) of (a) is input,
A series of encoding steps described above are repeated.
At the time of this next encoding, P6 which is the previous reference image is stored in the reference image area Ref2 of the frame memory 11, so that P9 is not Ref2 but Ref1.
Is stored in As described above, the roles of the reference image regions Ref1 and Ref2 are alternately used for each coding unit (B, B, P). As for B7 and B8, the area B in the frame memory 11 is the same as before.
1 and B2.

【００３６】以上の説明で、参照画像をＰピクチャでは
なくＩピクチャとして符号化する場合には、動き補償処
理は不要であるので、符号化における動きベクトル検出
や、ローカルデコードにおける動き補償は行わない。In the above description, when the reference picture is coded as an I-picture instead of a P-picture, no motion compensation processing is necessary. Therefore, motion vector detection in coding and motion compensation in local decoding are not performed. .

【００３７】なお、本実施例では参照画像であるＩまた
はＰピクチャの間に２枚のＢピクチャが挟まれた形の符
号化を行うことを想定しているが、このＢピクチャの枚
数は２枚に限定されるものではない。Ｂピクチャの枚数
に応じてフレームメモリ１１、動きベクトルメモリ１３
に必要な領域を用意することにより、Ｂピクチャの枚数
が任意の場合でも本発明と同等の効果を得ることが可能
である。In this embodiment, it is assumed that encoding is performed in such a manner that two B pictures are sandwiched between I or P pictures, which are reference pictures, but the number of B pictures is 2 It is not limited to sheets. Frame memory 11 and motion vector memory 13 according to the number of B pictures
By preparing an area necessary for the above, the same effect as that of the present invention can be obtained even when the number of B pictures is arbitrary.

【００３８】[0038]

【発明の効果】本発明によれば、従来の空間方向に加
え、時間方向に対しても人間の目に知覚されにくい冗長
な情報を削減することが可能になり、動画像の圧縮率を
更に高めることができる。また、圧縮効率が高まること
により一定の符号量に対しては画質の向上を実現するこ
とが可能になる。According to the present invention, in addition to the conventional spatial direction, it is possible to reduce redundant information that is hardly perceived by the human eye in the time direction, thereby further reducing the moving image compression rate. Can be enhanced. In addition, an increase in compression efficiency makes it possible to realize an improvement in image quality for a fixed code amount.

[Brief description of the drawings]

【図１】動画像圧縮符号化装置を示すブロック図。FIG. 1 is a block diagram showing a moving image compression encoding apparatus.

【図２】原画像と符号化データの順番の説明図。FIG. 2 is an explanatory diagram of the order of an original image and encoded data.

【図３】マクロブロックとブロックの説明図。FIG. 3 is an explanatory diagram of macro blocks and blocks.

【図４】従来の技術における空間方向のＤＣＴを示す説
明図。FIG. 4 is an explanatory diagram showing DCT in a spatial direction in a conventional technique.

【図５】本発明における時空間方向の３次元ＤＣＴを示
す説明図。FIG. 5 is an explanatory diagram showing a three-dimensional DCT in the spatiotemporal direction according to the present invention.

[Explanation of symbols]

１…動画像圧縮符号化装置、１１…フレームメモリ、１
２…動きベクトル検出部、１３…動きベクトルメモリ、
１４…差分画像計算部、１５…時空間ＤＣＴ部、１６…
量子化部、１７…可変長符号化部、１８…ヘッダ付加
部、２…ローカルデコード部、２１…逆量子化部、２２
…時空間ＩＤＣＴ部、２３…動き補償部。DESCRIPTION OF SYMBOLS 1 ... Video compression coding apparatus, 11 ... Frame memory, 1
2 ... Motion vector detection unit, 13 ... Motion vector memory,
14: difference image calculation unit, 15: spatio-temporal DCT unit, 16 ...
Quantization unit, 17: variable length coding unit, 18: header addition unit, 2: local decoding unit, 21: inverse quantization unit, 22
... a spatiotemporal IDCT unit, 23 ... a motion compensation unit.

Claims

[Claims]

1. A motion compensation for predicting an image to be encoded using motion vector information indicating a position on a reference image,
Orthogonal transformation processing for converting the pixel information into frequency domain data, dividing the orthogonally transformed data by a predetermined value, and quantizing to reduce the information by truncating the remainder,
In a coding method for compressing and coding a moving image by using variable length coding that allocates codes of different lengths according to the frequency of occurrence of the code, the reference image and the image to be coded using the motion vector information A difference image is created by taking a difference from the moving image, and a plurality of the difference images of the encoding target image formed at different times are used, and the orthogonal transformation process is collectively performed in a time and space. Compression encoding method.

2. An encoding apparatus for compressing and encoding a moving image, comprising: a frame memory for storing a plurality of input images;
A motion vector detecting unit that compares the image to be encoded among the images stored in the frame memory with a reference image to detect a similar portion, and outputs motion vector information indicating the position of the similar portion on the reference image. A difference image calculating means for calculating a difference image between the image to be coded and the reference image, an orthogonal transform means for performing an orthogonal transform on an output of the difference image calculating means, and an output of the orthogonal transform means. Quantizing means for quantizing with the quantization coefficient of the following, a motion vector storing means for storing the motion vector information, an output of the quantizing means, a quantized coefficient and / or motion vector information stored in the motion vector storing means. Variable-length coding means for coding the video data with a variable-length code, and predetermined header information is added to the output of the variable-length coding means, and output as a coded video signal. The orthogonal transforming means, wherein the orthogonal transforming means performs orthogonal transform processing in a time and space collectively by using a plurality of difference images composed of different times of the original moving picture. Compression encoding device.