JP2000013791A

JP2000013791A - Image encoding device, image encoding method, image decoding device, image decoding method, and providing medium

Info

Publication number: JP2000013791A
Application number: JP17350098A
Authority: JP
Inventors: Teruhiko Suzuki; 輝彦鈴木; Yoichi Yagasaki; 陽一矢ヶ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-06-19
Filing date: 1998-06-19
Publication date: 2000-01-14
Anticipated expiration: 2018-06-19
Also published as: JP3380981B2

Abstract

PROBLEM TO BE SOLVED: To enable normal decoding even when decoding starts partway through a coded bit stream, by encoding an image, including the header information of an upper layer in the header of a lower layer, and outputting the coded bit stream. SOLUTION: A VLC (variable length coding) unit 6 forms a coded bit stream by arranging the information that should normally be placed in the headers of the VS, VISO, VO, VOL, GOV, and VOP, followed by the variable-length-coded image data supplied from a quantizer 5, and outputs the stream to a transmission buffer 7. The VLC unit 6 also outputs the information placed in the headers of the VS, VISO, VO, and VOL, which are the layers above the GOV, to a buffer 16, where it is stored. Thereafter, when outputting a GOV header, the VLC unit 6 reads the VS, VISO, VO, and VOL header information stored in the buffer 16, inserts it at a prescribed position in the GOV header, and outputs the result.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an image encoding device and an image encoding method, an image decoding device and an image decoding method, and a providing medium. In particular, it relates to an image encoding device and method, an image decoding device and method, and a providing medium suitable for use when, for example, moving image data is recorded on a recording medium such as a magneto-optical disk or magnetic tape and then reproduced and displayed on a display, or when moving image data is transmitted from a transmitting side to a receiving side over a transmission path, as in video conference systems, videophone systems, broadcasting equipment, and multimedia database search systems, and is received and displayed, or edited and recorded, on the receiving side.

[0002]

[Description of the Related Art] In systems that transmit moving image data to a remote location, such as video conference systems and videophone systems, the image data is compression-coded by exploiting its line correlation and inter-frame correlation so that the transmission path is used efficiently.

[0003] A representative high-efficiency coding scheme for moving images is the MPEG (Moving Picture Experts Group) scheme, a moving image coding scheme for storage media. It was discussed in ISO-IEC/JTC1/SC2/WG11 and proposed as a draft standard, and it adopts a hybrid scheme that combines motion-compensated predictive coding with DCT (Discrete Cosine Transform) coding.

[0004] MPEG defines several profiles and levels to support a variety of applications and functions. The most basic of these is Main Profile at Main Level (MP@ML).

[0005] FIG. 24 shows an example configuration of an MP@ML encoder in the MPEG scheme.

[0006] Image data to be encoded is input to a frame memory 31 and temporarily stored. A motion vector detector 32 then reads the image data stored in the frame memory 31 in units of macroblocks, each composed of, for example, 16 × 16 pixels, and detects their motion vectors.

[0007] Here, the motion vector detector 32 processes the image data of each frame as an I picture, a P picture, or a B picture. Which picture type is used for each sequentially input frame is determined in advance (for example, the frames are processed as I, B, P, B, P, ..., B, P).

[0008] That is, the motion vector detector 32 refers to a predetermined reference frame among the images stored in the frame memory 31 and detects the motion vector of a macroblock by pattern matching (block matching) between that reference frame and a small block (macroblock) of 16 pixels × 16 lines in the frame currently being encoded.
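
The block matching described above can be illustrated with a minimal sketch. The text does not specify the matching criterion or the search range, so the sum of absolute differences (SAD), the exhaustive search, and the names `sad` and `find_motion_vector` are all assumptions made for illustration. Frames are plain lists of rows of pixel values.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at
    (bx, by) and the block of `ref` displaced by (dx, dy)."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def find_motion_vector(cur, ref, bx, by, n=16, search=4):
    """Exhaustive search over a +/- `search` pixel window (assumed range).
    Returns ((dx, dy), error) for the best-matching displacement."""
    height, width = len(ref), len(ref[0])
    best_mv, best_err = None, None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates that fall outside the reference frame.
            if bx + dx < 0 or bx + dx + n > width:
                continue
            if by + dy < 0 or by + dy + n > height:
                continue
            err = sad(cur, ref, bx, by, dx, dy, n)
            if best_err is None or err < best_err:
                best_mv, best_err = (dx, dy), err
    return best_mv, best_err
```

A practical encoder would normally replace the exhaustive search with a faster strategy; the sketch only shows what "pattern matching against a reference frame" amounts to.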

[0009] In MPEG, there are four image prediction modes: intra coding (intra-frame coding), forward predictive coding, backward predictive coding, and bidirectional predictive coding. An I picture is intra-coded; a P picture is intra-coded or forward-predictive-coded; and a B picture is intra-coded, forward-predictive-coded, backward-predictive-coded, or bidirectionally predictive-coded.

[0010] That is, for an I picture the motion vector detector 32 sets the intra coding mode as the prediction mode. In this case, the motion vector detector 32 does not detect a motion vector and outputs the prediction mode (intra prediction mode) to a VLC (variable length coding) unit 36 and a motion compensator 42.

[0011] For a P picture, the motion vector detector 32 performs forward prediction and detects the motion vector. It then compares the prediction error produced by forward prediction with, for example, the variance of the macroblock to be encoded (a macroblock of the P picture). If the variance of the macroblock is smaller than the prediction error, it sets the intra coding mode as the prediction mode and outputs it to the VLC unit 36 and the motion compensator 42. If the prediction error produced by forward prediction is smaller, it sets the forward predictive coding mode as the prediction mode and outputs it, together with the detected motion vector, to the VLC unit 36 and the motion compensator 42.

[0012] For a B picture, the motion vector detector 32 performs forward prediction, backward prediction, and bidirectional prediction, and detects the motion vector for each. It then finds the smallest of the prediction errors for forward, backward, and bidirectional prediction (hereinafter referred to as the minimum prediction error where appropriate) and compares that minimum prediction error with, for example, the variance of the macroblock to be encoded (a macroblock of the B picture). If the variance of the macroblock is smaller than the minimum prediction error, the motion vector detector 32 sets the intra coding mode as the prediction mode and outputs it to the VLC unit 36 and the motion compensator 42. If the minimum prediction error is smaller, it sets the prediction mode that yielded the minimum prediction error and outputs it, together with the corresponding motion vector, to the VLC unit 36 and the motion compensator 42.
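
The mode decision for P and B pictures can be sketched as follows. This is only an illustration under stated assumptions: the text does not say how the prediction error is measured, so the sketch assumes it is on a scale comparable to the block variance (for example a mean squared error), and it resolves the unspecified tie case in favour of inter coding. The function names and the mode labels are hypothetical.

```python
def variance(block):
    """Variance of the pixel values in a block (list of rows)."""
    pixels = [p for row in block for p in row]
    mean = sum(pixels) / len(pixels)
    return sum((p - mean) ** 2 for p in pixels) / len(pixels)


def choose_prediction_mode(block_variance, prediction_errors):
    """Pick a prediction mode for one macroblock.

    prediction_errors maps a mode label to its prediction error:
    {"forward": e} for a P picture, or forward/backward/bidirectional
    entries for a B picture.
    """
    best_mode = min(prediction_errors, key=prediction_errors.get)
    min_error = prediction_errors[best_mode]
    # Intra coding wins when the block itself varies less than the best
    # prediction residual; ties go to inter coding (an assumption).
    if block_variance < min_error:
        return "intra"
    return best_mode
```

For a P picture the dictionary holds a single forward entry, so the same function covers both picture types.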

[0013] On receiving both the prediction mode and the motion vector from the motion vector detector 32, the motion compensator 42 reads, in accordance with that prediction mode and motion vector, the image data stored in a frame memory 41 that has been encoded and already locally decoded, and supplies it as a predicted image to arithmetic units 33 and 40.

[0014] The arithmetic unit 33 reads from the frame memory 31 the same macroblock as the image data that the motion vector detector 32 read from the frame memory 31, and computes the difference between that macroblock and the predicted image from the motion compensator 42. This difference value is supplied to a DCT unit 34.

[0015] On the other hand, when the motion compensator 42 receives only the prediction mode from the motion vector detector 32, that is, when the prediction mode is the intra coding mode, it does not output a predicted image. In this case, the arithmetic unit 33 (and likewise the arithmetic unit 40 described later) performs no particular processing and outputs the macroblock read from the frame memory 31 to the DCT unit 34 as is.

[0016] The DCT unit 34 applies DCT processing to the output of the arithmetic unit 33 and supplies the resulting DCT coefficients to a quantizer 35. The quantizer 35 sets a quantization step (quantization scale) in accordance with the amount of data stored in a buffer 37 (buffer feedback) and quantizes the DCT coefficients from the DCT unit 34 with that quantization step. The quantized DCT coefficients (hereinafter referred to as quantized coefficients where appropriate) are supplied to the VLC unit 36 together with the quantization step that was set.

[0017] The VLC unit 36 converts the quantized coefficients supplied from the quantizer 35 into variable length codes such as Huffman codes and outputs them to the buffer 37. The VLC unit 36 also variable-length-codes the quantization step from the quantizer 35, as well as the prediction mode (the mode indicating which of intra coding (intra-picture predictive coding), forward predictive coding, backward predictive coding, and bidirectional predictive coding has been set) and the motion vector from the motion vector detector 32, and outputs them to the buffer 37.

[0018] The buffer 37 temporarily stores the data from the VLC unit 36, smooths the data rate, and, for example, outputs the data to a transmission path (not shown) or records it on a recording medium.

[0019] The buffer 37 also reports its data storage amount to the quantizer 35, and the quantizer 35 sets the quantization step in accordance with that amount. That is, when the buffer 37 is about to overflow, the quantizer 35 increases the quantization step, thereby reducing the amount of quantized coefficient data. When the buffer 37 is about to underflow, the quantizer 35 decreases the quantization step, thereby increasing the amount of quantized coefficient data. In this way, overflow and underflow of the buffer 37 are prevented.
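
The buffer feedback can be pictured with a small sketch. The text states only the direction of the adjustment, so the fullness thresholds, the unit increment, and the step range used here are illustrative assumptions, and `next_quantization_step` is a hypothetical name.

```python
def next_quantization_step(step, buffer_fill, buffer_size,
                           high=0.8, low=0.2, min_step=1, max_step=31):
    """Return the quantization step to use next, given buffer occupancy.

    high/low are assumed fullness thresholds; min_step/max_step are an
    assumed legal range for the step.
    """
    fullness = buffer_fill / buffer_size
    if fullness > high:
        step += 1      # close to overflow: coarser quantization, fewer bits
    elif fullness < low:
        step -= 1      # close to underflow: finer quantization, more bits
    return max(min_step, min(max_step, step))
```

Calling this once per macroblock or per picture, with the current occupancy of the buffer, gives the kind of closed loop the paragraph describes.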

[0020] The quantized coefficients and quantization step output by the quantizer 35 are supplied not only to the VLC unit 36 but also to an inverse quantizer 38. The inverse quantizer 38 inversely quantizes the quantized coefficients from the quantizer 35 in accordance with the quantization step, also from the quantizer 35, thereby converting them back into DCT coefficients. These DCT coefficients are supplied to an IDCT unit (inverse DCT unit) 39, which applies inverse DCT processing and supplies the result to the arithmetic unit 40.

[0021] In addition to the output of the IDCT unit 39, the arithmetic unit 40 receives from the motion compensator 42, as described above, the same data as the predicted image supplied to the arithmetic unit 33. The arithmetic unit 40 locally decodes the original image by adding the signal (prediction residual) from the IDCT unit 39 to the predicted image from the motion compensator 42. (When the prediction mode is intra coding, however, the output of the IDCT unit 39 passes through the arithmetic unit 40 unchanged and is supplied to the frame memory 41.) This decoded image is identical to the decoded image obtained on the receiving side.
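
The local decoding loop (difference, DCT, quantization, then inverse quantization, inverse DCT, and addition of the predicted image) can be sketched on a single 8 × 8 block. This is a simplified illustration, not the MPEG quantizer: it assumes a single uniform quantization step with no quantization matrix, an orthonormal DCT written out in plain Python, and hypothetical function names. An intra block can be modelled by passing an all-zero prediction.

```python
import math

N = 8  # block size used for this sketch


def dct_matrix(n):
    """Orthonormal DCT-II basis matrix (row k, column i)."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


C = dct_matrix(N)
CT = [list(col) for col in zip(*C)]  # transpose of C


def encode_block(block, prediction, step):
    """Residual -> 2-D DCT -> uniform quantization (integer levels)."""
    residual = [[block[y][x] - prediction[y][x] for x in range(N)]
                for y in range(N)]
    coeffs = matmul(matmul(C, residual), CT)
    return [[round(c / step) for c in row] for row in coeffs]


def local_decode(levels, prediction, step):
    """Inverse quantization -> inverse DCT -> add the predicted image."""
    coeffs = [[level * step for level in row] for row in levels]
    residual = matmul(matmul(CT, coeffs), C)
    return [[prediction[y][x] + residual[y][x] for x in range(N)]
            for y in range(N)]
```

Because the encoder runs `local_decode` on exactly the levels it transmits, its reference image carries the same quantization error as the one a decoder reconstructs, which is why the two stay in step.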

[0022] The decoded image (locally decoded image) obtained by the arithmetic unit 40 is supplied to and stored in the frame memory 41, and is thereafter used as a reference image (reference frame) for images that are inter-coded (forward predictive coded, backward predictive coded, or bidirectionally predictive coded).

[0023] Next, FIG. 25 shows an example configuration of an MP@ML decoder in MPEG that decodes the encoded data output from the encoder of FIG. 24.

[0024] Encoded data transmitted over a transmission path is received by a receiving device (not shown), or encoded data recorded on a recording medium is reproduced by a reproducing device (not shown), and is supplied to and stored in a buffer 101.

[0025] An IVLC unit (inverse VLC unit, that is, a variable length decoder) 102 reads the encoded data stored in the buffer 101 and variable-length-decodes it, thereby separating the encoded data into motion vectors, prediction modes, quantization steps, and quantized coefficients. Of these, the motion vectors and prediction modes are supplied to a motion compensator 107, and the quantization steps and quantized coefficients are supplied to an inverse quantizer 103.

[0026] The inverse quantizer 103 inversely quantizes the quantized coefficients supplied from the IVLC unit 102 in accordance with the quantization step, also supplied from the IVLC unit 102, and outputs the resulting DCT coefficients to an IDCT unit 104. The IDCT unit 104 applies an inverse DCT to the DCT coefficients from the inverse quantizer 103 and supplies the result to an arithmetic unit 105.

[0027] In addition to the output of the IDCT unit 104, the arithmetic unit 105 also receives the output of the motion compensator 107. That is, as with the motion compensator 42 of FIG. 24, the motion compensator 107 reads an already decoded image stored in a frame memory 106 in accordance with the motion vector and prediction mode from the IVLC unit 102, and supplies it to the arithmetic unit 105 as a predicted image. The arithmetic unit 105 decodes the original image by adding the signal (prediction residual) from the IDCT unit 104 to the predicted image from the motion compensator 107. This decoded image is supplied to and stored in the frame memory 106. When the output of the IDCT unit 104 is intra-coded data, it passes through the arithmetic unit 105 unchanged and is supplied to and stored in the frame memory 106.

[0028] The decoded image stored in the frame memory 106 is used as a reference image for images decoded subsequently, and is also read out as appropriate and supplied to, for example, a display (not shown) for display.

[0029] In MPEG1 and MPEG2, a B picture is not used as a reference image, and is therefore not stored in the frame memory 41 (FIG. 24) of the encoder or the frame memory 106 (FIG. 25) of the decoder.

[0030]

[Problems to Be Solved by the Invention] The encoder and decoder shown in FIGS. 24 and 25 conform to the MPEG1/2 standards. Currently, however, ISO-IEC/JTC1/SC29/WG11 is standardizing, as MPEG (Moving Picture Experts Group) 4, a scheme that encodes in units of VOs (Video Objects), each of which is a sequence of an object, such as a physical body, that makes up an image.

[0031] Because the standardization of MPEG4 proceeded mainly with use in the communications field in mind, the GOP (Group Of Picture) defined in MPEG1/2 is not defined in MPEG4. Consequently, efficient random access is expected to be difficult when MPEG4 is used for storage media.

[0032] To enable efficient random access, the present applicant therefore previously proposed, in Japanese Patent Application No. Hei 10-80758, the introduction of a GOV (Group Of VOP) layer corresponding to the GOP defined in MPEG1/2, and this GOV layer has been introduced into MPEG4.

[0033] An encoded bit stream obtained by encoding in conformance with a standard such as MPEG1, MPEG2, MPEG4, or H.263 has a hierarchical structure made up of multiple layers. On the encoder side, the information needed for decoding is placed in the header of each layer; on the decoder side, the necessary information is extracted from the header of each layer and the encoded bit stream is decoded.

[0034] Accordingly, because randomly accessing a GOP in MPEG1/2 may require upper-layer header information in order to decode that GOP, the MPEG1/2 standards allow the header information of an upper layer to be retransmitted as appropriate after that upper layer has been transmitted.

[0035] MPEG4, however, does not allow the header information of an upper layer to be retransmitted after that upper layer has been transmitted. Consequently, even though the introduction of the GOV layer makes efficient random access possible, the upper-layer header information needed to decode the GOV may be unavailable, and a normal decoding result may not be obtained.

[0036] When an MPEG4 encoded bit stream is recorded on a storage medium, the upper-layer header information can be obtained by accessing that medium. When the encoded bit stream is broadcast, however, the upper-layer header information cannot be obtained unless the stream is received from its beginning. Consequently, if reception of the encoded bit stream starts partway through, a normal decoding result may not be obtained.

[0037] The present invention has been made in view of these circumstances, and makes it possible to perform normal decoding even from partway through an encoded bit stream.

[0038]

[Means for Solving the Problems] The image encoding device of the present invention comprises encoding means for encoding an image and outputting an encoded bit stream in which the header of a lower layer includes the header information of an upper layer.
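
As a rough illustration of such encoding means, the following sketch keeps a copy of the upper-layer header information and repeats it inside every GOV header, in the manner the abstract describes for the VLC unit 6 and buffer 16. The bit stream is modelled as a list of `(tag, payload)` tuples rather than real bits, and the class name, method names, and the `upper_headers` field are hypothetical.

```python
UPPER_LAYERS = ("VS", "VISO", "VO", "VOL")


class HeaderRepeatingEncoder:
    def __init__(self):
        self.saved_upper_headers = {}  # plays the role of a header buffer
        self.stream = []               # output units as (tag, payload)

    def write_upper_header(self, layer, info):
        """Emit an upper-layer header and remember its contents."""
        if layer not in UPPER_LAYERS:
            raise ValueError("not an upper layer: " + layer)
        self.saved_upper_headers[layer] = dict(info)
        self.stream.append((layer, dict(info)))

    def write_gov_header(self, gov_info):
        """Emit a GOV header that also carries the saved upper headers."""
        header = dict(gov_info)
        header["upper_headers"] = {
            layer: dict(info)
            for layer, info in self.saved_upper_headers.items()
        }
        self.stream.append(("GOV", header))

    def write_vop(self, data):
        self.stream.append(("VOP", data))
```

Each GOV then becomes a self-describing entry point, at the cost of repeating the upper-layer information once per GOV.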

[0039] The image encoding method of the present invention encodes an image and outputs an encoded bit stream in which the header of a lower layer includes the header information of an upper layer.

[0040] The image decoding device of the present invention comprises decoding means for extracting, from an encoded bit stream in which the header of a lower layer includes the header information of an upper layer, the information contained in the lower-layer header, and decoding the encoded bit stream on the basis of that information.

[0041] The image decoding method of the present invention extracts, from an encoded bit stream in which the header of a lower layer includes the header information of an upper layer, the information contained in the lower-layer header, and decodes the encoded bit stream on the basis of that information.
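
A decoder that joins such a stream partway through can be sketched as follows. The representation is an assumption made for illustration: the stream is a list of `(tag, payload)` tuples, a GOV payload may carry an `upper_headers` dictionary holding the repeated VS, VISO, VO, and VOL information, and "decoding" a VOP is reduced to pairing its data with the VOL information in force. `decode_from` is a hypothetical name.

```python
def decode_from(stream, start):
    """Process units from index `start`; return (vop_data, vol_info) pairs
    for every VOP that could be decoded."""
    context = {}   # upper-layer header information gathered so far
    decoded = []
    for tag, payload in stream[start:]:
        if tag in ("VS", "VISO", "VO", "VOL"):
            context[tag] = payload
        elif tag == "GOV":
            # Recover upper-layer information repeated in the GOV header.
            context.update(payload.get("upper_headers", {}))
        elif tag == "VOP":
            if "VOL" not in context:
                continue  # essential header information still missing
            decoded.append((payload, context["VOL"]))
    return decoded
```

With a stream whose GOV headers carry no upper-layer information, a start index past the initial headers yields nothing, which is the failure the invention is meant to avoid.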

[0042] The providing medium of the present invention provides an encoded bit stream obtained by encoding an image and including the header information of an upper layer in the header of a lower layer.

[0043] In the image encoding device of the present invention, the encoding means encodes an image and outputs an encoded bit stream in which the header of a lower layer includes the header information of an upper layer.

[0044] In the image encoding method of the present invention, an image is encoded and an encoded bit stream is output in which the header of a lower layer includes the header information of an upper layer.

[0045] In the image decoding device of the present invention, the decoding means extracts, from an encoded bit stream in which the header of a lower layer includes the header information of an upper layer, the information contained in the lower-layer header, and decodes the encoded bit stream on the basis of that information.

[0046] In the image decoding method of the present invention, the information contained in the lower-layer header is extracted from an encoded bit stream in which the header of a lower layer includes the header information of an upper layer, and the encoded bit stream is decoded on the basis of that information.

[0047] In the providing medium of the present invention, an encoded bit stream obtained by encoding an image and including the header information of an upper layer in the header of a lower layer is provided.

[0048]

[Embodiments of the Invention] Embodiments of the present invention are described below. Before that, however, the encoded bit stream defined in MPEG4 is described. The description here concerns the encoded bit stream in the FCD (Final Committee Draft), a draft of the MPEG4 standard.

[0049] FIG. 1 shows the structure of the encoded bit stream defined in the FCD.

[0050] As shown in the figure, the encoded bit stream has a hierarchical structure made up of multiple layers, including a VS (Visual Object Sequence) layer, a VISO (Visual Object) layer, a VO (Video Object) layer, a VOL (Video Object Layer) layer, a GOV (Group of VOP) layer, and a VOP (Video Object Plane) layer (in FIG. 1, a layer positioned higher up is a higher layer).

[0051] That is, the encoded bit stream is organized in units of VSs. A VS is an image sequence and corresponds to, for example, a single program or movie.

[0052] Each VS is composed of one or more VISOs. There are several types of VISO: for example, a Still Texture Object, which is a still image; a Face Object, which is composed of a face image; and a VO (Video Object), which is a moving image object. Accordingly, when the encoded bit stream is that of a moving image, the VISO is composed of a VO.

[0053] A VO is composed of one or more VOLs (Video Object Layers): a single VOL when the image is not layered (hierarchically coded), and as many VOLs as there are layers when it is.

[0054] A VOL is composed of as many GOVs (Groups of VOP) as needed, and a GOV is composed of a sequence of one or more VOPs (Video Object Planes). The GOV may be omitted, in which case the VOL is composed of one or more VOPs.
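
The containment relationships among the layers can be summarized as a small data model. This is a simplified sketch, not the FCD syntax: the class and field names are hypothetical, header fields are omitted, and only the video case (a VISO holding VOs) is modelled.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class VOP:
    time: int  # one object at one instant


@dataclass
class GOV:
    vops: List[VOP] = field(default_factory=list)


@dataclass
class VOL:
    govs: List[GOV] = field(default_factory=list)
    vops: List[VOP] = field(default_factory=list)  # used only when no GOV is present

    def all_vops(self) -> List[VOP]:
        """VOPs of this layer, whether or not they are grouped into GOVs."""
        if self.govs:
            return [vop for gov in self.govs for vop in gov.vops]
        return list(self.vops)


@dataclass
class VO:
    vols: List[VOL] = field(default_factory=list)  # one VOL unless layered


@dataclass
class VISO:
    vos: List[VO] = field(default_factory=list)


@dataclass
class VS:
    visos: List[VISO] = field(default_factory=list)
```

The optional `govs` list reflects the statement that the GOV layer may be absent, in which case the VOPs hang directly off the VOL.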

[0055] A VOP corresponds to a conventional frame.

[0056] To elaborate on the relationship among the VS, VO, and VOP: as described above, a VS is an image sequence and corresponds to, for example, a single program. A VO is the sequence of one of the objects that make up a composite image when a sequence of that composite image exists, and a VOP is a VO at a given time. For example, if a composite image F3 is formed by compositing images F1 and F2, the time series of image F1 and the time series of image F2 are each a VO, and image F1 or F2 at a given time is a VOP. A VO can therefore be regarded as the set of VOPs of the same object at different times.

[0057] FIGS. 2 to 4 show the syntax of the VS, VISO, and VO, respectively. FIGS. 5 to 7 show the syntax of the VOL, FIG. 8 shows the syntax of the GOV, and FIGS. 9 to 11 show the syntax of the VOP. The semantics of the flags appearing in the syntax of each layer are described in the MPEG4 FCD standard (14496-2), to which the reader is referred.

[0058] The VS, VISO, VO, and VOL header information in the MPEG4 FCD standard includes information essential for decoding the encoded bit stream. Without it, as noted above, it is difficult to decode the encoded bit stream correctly.

[0059] That is, when performing random access or trick play such as FF/FR (fast forward/rewind) on an encoded bit stream recorded on a recording medium, or when accessing a broadcast encoded bit stream partway through, the VS, VISO, VO, and VOL header information must first be decoded before decoding of the encoded bit stream can begin.

【００６０】しかしながら、MPEG4のFCD規格において
は、符号化ビットストリームの先頭に、一度だけＶＳ，
ＶＩＳＯ，ＶＯ，ＶＯＬヘッダを伝送することしか許さ
れておらず、この場合、特に、放送されてくる符号化ビ
ットストリームの途中から復号を始めることは難しい。However, the MPEG4 FCD standard permits the VS, VISO, VO, and VOL headers to be transmitted only once, at the head of the encoded bit stream. In this case it is difficult to start decoding from the middle of the encoded bit stream, particularly one that is broadcast.

【００６１】さらに、例えば、ＶＯＬヘッダには、量子
化マトリクスを始め、符号化モードを指定するフラグが
記述される。これらの符号化モードは、符号化対象の画
像から得られる符号化ビットストリームの性質に依存し
て、即ち、符号化対象の画像の統計的性質が最適になる
ように設定される。しかしながら、例えば、長時間の画
像シーケンスなどに関しては、画像の性質は時刻によっ
て大きく変化することがあるため、ＶＯＬヘッダに、最
初に設定した値が、必ずしも常に最適であるとは限らな
い。それにもかかわらず、画像シーケンスの先頭（ここ
では、符号化モードを指定するフラグが記述されるＶＯ
Ｌの先頭）でしか、符号化モードを指定するフラグを設
定することができないということは、効率の良い符号化
の妨げとなる。Further, for example, the VOL header describes flags that specify the encoding mode, including the quantization matrix. These encoding modes are set according to the properties of the encoded bit stream obtained from the image to be encoded, that is, so as to be optimal for the statistical properties of the image to be encoded. However, for a long image sequence, for example, the properties of the image may change greatly over time, so the values initially set in the VOL header are not always optimal. Nevertheless, the flags specifying the encoding mode can be set only at the head of the image sequence (here, the head of the VOL, in which those flags are described), and this hinders efficient encoding.

【００６２】そこで、図１２は、本発明を適用したエン
コーダの一実施の形態の構成例を示している。なお、こ
のエンコーダを構成するフレームメモリ１、動きベクト
ル検出器２、演算器３，ＤＣＴ器４、量子化器５，ＶＬ
Ｃ器６、バッファ７、逆量子化器８，ＩＤＣＴ器９，演
算器１０、フレームメモリ１１、動き補償器１２は、図
２４に示したエンコーダを構成するフレームメモリ３
１、動きベクトル検出器３２、演算器３３，ＤＣＴ器３
４、量子化器３５，ＶＬＣ器３６、バッファ３７、逆量
子化器３８，ＩＤＣＴ器３９，演算器４０、フレームメ
モリ４１、動き補償器４２にそれぞれ対応している。従
って、フレームメモリ１乃至動き補償器１２それぞれで
は、フレームメモリ３１乃至動き補償器４２それぞれの
処理と同一の処理が行われる場合があり、そのような同
一の処理についての説明は、適宜省略する。FIG. 12 shows a configuration example of an embodiment of an encoder to which the present invention is applied. The frame memory 1, motion vector detector 2, arithmetic unit 3, DCT unit 4, quantizer 5, VLC unit 6, buffer 7, inverse quantizer 8, IDCT unit 9, arithmetic unit 10, frame memory 11, and motion compensator 12 constituting this encoder correspond, respectively, to the frame memory 31, motion vector detector 32, arithmetic unit 33, DCT unit 34, quantizer 35, VLC unit 36, buffer 37, inverse quantizer 38, IDCT unit 39, arithmetic unit 40, frame memory 41, and motion compensator 42 constituting the encoder shown in FIG. 24. Accordingly, the frame memory 1 through the motion compensator 12 may perform the same processing as the frame memory 31 through the motion compensator 42, and description of such identical processing is omitted where appropriate.

【００６３】符号化対象のディジタル画像信号を構成す
るＶＯＰは、フレームメモリ１（受信手段）に順次供給
され、そこで受信されて一時記憶される。さらに、フレ
ームメモリ１には、そこに供給されるＶＯＰの、所定の
絶対座標系における大きさを示すフラグFSZと、位置を
示すフラグFPOSも供給されるようになされており、フレ
ームメモリ１は、これらのフラグFSZおよびFPOSも一時
記憶する。The VOPs constituting the digital image signal to be encoded are sequentially supplied to the frame memory 1 (receiving means), where they are received and temporarily stored. The frame memory 1 is also supplied with a flag FSZ indicating the size, and a flag FPOS indicating the position, of each supplied VOP in a predetermined absolute coordinate system, and temporarily stores these flags as well.

【００６４】フレームメモリ１に記憶されたＶＯＰは、
動きベクトル検出器２によって、マクロブロック単位で
読み出される。そして、動きベクトル検出回路２は、予
め設定されている所定のシーケンスに従って、各ＶＯＰ
を、Ｉ（Intra）−ＶＯＰ，Ｐ（Predictive）−ＶＯ
Ｐ、またはＢ（Biderectionally Predictive）−ＶＯＰ
として処理する。シーケンシャルに入力される各ＶＯＰ
を、Ｉ，Ｐ，ＢのいずれのＶＯＰとして処理するかは、
予め定められている（例えば、Ｉ，Ｂ，Ｐ，Ｂ，Ｐ，・
・・Ｂ，Ｐとして処理される）。The VOPs stored in the frame memory 1 are read out by the motion vector detector 2 in macroblock units. The motion vector detector 2 then processes each VOP as an I (Intra)-VOP, a P (Predictive)-VOP, or a B (Bidirectionally Predictive)-VOP according to a predetermined sequence set in advance. Whether each sequentially input VOP is processed as an I-, P-, or B-VOP is determined in advance (for example, the VOPs are processed as I, B, P, B, P, ..., B, P).

【００６５】動きベクトル検出器２は、処理対象のマク
ロブロックに対して、予め定められた所定の参照画像
（ＶＯＰ）を参照して、動き補償を施し、そのマクロブ
ロックの動きベクトルを検出する。The motion vector detector 2 performs motion compensation on a macroblock to be processed with reference to a predetermined reference image (VOP), and detects a motion vector of the macroblock.

【００６６】ここで、動き補償（フレーム間予測）に
は、前方予測、後方予測、両方向予測の３種類の予測モ
ードがあり、Ｐ−ＶＯＰは、前方予測のみでのみ動き補
償が施され、動きベクトル検出器２は、その予測誤差を
最小にする動きベクトルを検出する。また、Ｂ−ＶＯＰ
は、前方予測、後方予測、両方向予測の３種類で動き補
償が施され、動きベクトル検出器２は、各予測モードに
おいて、その予測誤差を最小にする動きベクトルを検出
する。さらに、動きベクトル検出器２は、３つの予測モ
ードのうち、最小の予測誤差が得られたものを選択し、
その予測モードにおける動きベクトルも選択する。Here, motion compensation (inter-frame prediction) has three prediction modes: forward prediction, backward prediction, and bidirectional prediction. A P-VOP is motion-compensated by forward prediction only, and the motion vector detector 2 detects the motion vector that minimizes the prediction error. A B-VOP is motion-compensated by all three modes (forward, backward, and bidirectional prediction), and the motion vector detector 2 detects, in each prediction mode, the motion vector that minimizes the prediction error. The motion vector detector 2 then selects, from the three prediction modes, the one that yields the smallest prediction error, together with the motion vector for that mode.
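The mode selection described above can be sketched as follows. This is a minimal illustration only: the function name, the mode names, and the `candidates` dictionary are hypothetical, and the per-mode motion search is assumed to have been done already.

```python
def select_prediction_mode(vop_type, candidates):
    """Pick the prediction mode with the smallest prediction error.

    candidates maps a mode name to (prediction_error, motion_vector),
    where each motion vector already minimizes the error for its mode.
    """
    # A P-VOP may use forward prediction only; a B-VOP may use all three.
    allowed = {
        "P": ("forward",),
        "B": ("forward", "backward", "bidirectional"),
    }[vop_type]
    best_mode = min(allowed, key=lambda mode: candidates[mode][0])
    return best_mode, candidates[best_mode][1]


candidates = {
    "forward": (120, (1, 0)),
    "backward": (95, (-2, 1)),
    "bidirectional": (101, ((1, 0), (-2, 1))),
}
print(select_prediction_mode("B", candidates))
print(select_prediction_mode("P", candidates))
```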

【００６７】そして、動きベクトル検出器２は、動き補
償の結果得られた予測誤差と、符号化対象のマクロブロ
ックの分散とを比較する。その結果、マクロブロックの
分散の方が小さい場合は、そのマクロブロックについて
はフレーム間予測は行われず、フレーム内符号化が行わ
れる。この場合、予測モードは、画像内符号化（イント
ラ）となり、そのような予測モードが、動きベクトル検
出器２からＶＬＣ器６および動き補償器１２に供給され
る。一方、予測誤差の方が小さい場合には、その予測誤
差が得られた予測モードと動きベクトルとが、動きベク
トル検出器２からＶＬＣ器６および動き補償器１２に供
給される。なお、Ｉ−ＶＯＰについての予測モードは、
必ず画像内符号化にされる。Then, the motion vector detector 2 compares the prediction error obtained as a result of the motion compensation with the variance of the macroblock to be encoded. If the variance of the macroblock is smaller, inter-frame prediction is not performed on that macroblock and intra-frame coding is performed instead. In this case, the prediction mode is intra-picture coding (intra), and this prediction mode is supplied from the motion vector detector 2 to the VLC unit 6 and the motion compensator 12. If, on the other hand, the prediction error is smaller, the prediction mode and motion vector that yielded that prediction error are supplied from the motion vector detector 2 to the VLC unit 6 and the motion compensator 12. The prediction mode for an I-VOP is always intra-picture coding.
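The intra/inter decision above might be expressed as in the following sketch. The function and parameter names are hypothetical, and the handling of a tie (variance equal to prediction error) is not stated in the text; this sketch assumes a tie falls to inter-frame prediction.

```python
def choose_coding_mode(vop_type, block_variance, best_error, best_mode, best_vector):
    """Return (prediction_mode, motion_vector) for one macroblock."""
    # An I-VOP is always intra-coded; otherwise intra coding is chosen
    # when the macroblock variance is smaller than the prediction error.
    if vop_type == "I" or block_variance < best_error:
        return "intra", None
    # Assumption: on a tie, inter-frame prediction is kept.
    return best_mode, best_vector


print(choose_coding_mode("P", 40, 95, "forward", (1, 0)))
print(choose_coding_mode("P", 200, 95, "forward", (1, 0)))
```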

【００６８】ここで、符号化対象となるVOPのシーケン
スは、それぞれ、大きさや位置が異なることがある。従
って、動きベクトルを検出する場合には、基準となる座
標系を設定し、その座標系において、動きベクトルの検
出を行う必要がある。そこで、ここでは、ある１つの絶
対座標を仮定し、その絶対座標における動きベクトルが
算出されるようになされている。即ち、動きベクトル検
出器２には、ＶＯＰの絶対座標系における大きさを示す
フラグFSZと、位置を示すフラグFPOSとが供給されるよ
うになされており、動きベクトル検出器２は、このフラ
グFSZおよびフラグFPOSに基づき、処理対象のＶＯＰ
と、参照画像となるＶＯＰとを、絶対座標系に配置し、
処理対象のＶＯＰ（のマクロブロック）の動きベクトル
を算出するようになされている。Here, the sequences of VOPs to be encoded may differ from one another in size and position. Therefore, to detect a motion vector, a reference coordinate system must be set and the motion vector must be detected in that coordinate system. For this purpose, a single absolute coordinate system is assumed here, and motion vectors are calculated in that absolute coordinate system. That is, the motion vector detector 2 is supplied with the flag FSZ indicating the size of the VOP in the absolute coordinate system and the flag FPOS indicating its position. Based on the flags FSZ and FPOS, the motion vector detector 2 places the VOP to be processed and the VOP serving as the reference image in the absolute coordinate system, and calculates the motion vector of (each macroblock of) the VOP to be processed.

【００６９】一方、動き補償器１２は、動きベクトル検
出器２からの動きベクトルおよび予測モードに基づい
て、フレームメモリ１１に記憶されているＶＯＰに対し
て動き補償を施すことで、予測画像を生成する。この予
測画像は、演算器３に供給される。演算器３には、さら
に、動きベクトル検出器２がフレームメモリ１から読み
出した符号化対象のマクロブロックも、フレームメモリ
１から供給される。そして、演算器３は、符号化対象の
マクロブロックを構成する各画素の画素値それぞれと、
予測画像を構成する画素の画素値それぞれの差分を演算
し、その差分信号を、DCT器４に出力する。なお、符号
化対象のマクロブロックが、イントラマクロブロックの
場合には、演算器３は、その符号化対象のマクロブロッ
クをそのままDCT器４に出力する。On the other hand, the motion compensator 12 generates a predicted image by performing motion compensation on the VOP stored in the frame memory 11, based on the motion vector and prediction mode from the motion vector detector 2. This predicted image is supplied to the arithmetic unit 3. The arithmetic unit 3 is also supplied, from the frame memory 1, with the macroblock to be encoded that the motion vector detector 2 read from the frame memory 1. The arithmetic unit 3 then calculates the difference between the pixel value of each pixel of the macroblock to be encoded and the pixel value of the corresponding pixel of the predicted image, and outputs the difference signal to the DCT unit 4. When the macroblock to be encoded is an intra macroblock, the arithmetic unit 3 outputs it to the DCT unit 4 unchanged.
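As a rough illustration of what the arithmetic unit 3 passes to the DCT unit 4, the sketch below computes the per-pixel difference for an inter macroblock and passes an intra macroblock through unchanged. The function name and the list-of-rows block layout are assumptions made for the example.

```python
def residual_macroblock(target, predicted, intra):
    """Return the block handed to the DCT stage.

    target and predicted are equally sized lists of pixel rows.
    """
    if intra:
        # Intra macroblocks are passed on as they are.
        return [row[:] for row in target]
    # Inter macroblocks: per-pixel difference from the predicted image.
    return [
        [t - p for t, p in zip(t_row, p_row)]
        for t_row, p_row in zip(target, predicted)
    ]


print(residual_macroblock([[10, 20], [30, 40]], [[8, 25], [30, 35]], intra=False))
```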

【００７０】DCT器４では、演算器３の出力に対して、D
CT（離散コサイン変換）処理が施され、DCT係数に変換
される。このＤＣＴ係数は、量子化器５に入力され、送
信バッファ７のデータ蓄積量（バッファ蓄積量）に対応
した量子化ステップで量子化された後、ＶＬＣ（可変長
符号化）器６に入力される。In the DCT unit 4, the output of the arithmetic unit 3 is subjected to DCT (discrete cosine transform) processing and converted into DCT coefficients. The DCT coefficients are input to the quantizer 5, quantized with a quantization step corresponding to the amount of data stored in the transmission buffer 7 (buffer storage amount), and then input to the VLC (variable length coding) unit 6.

【００７１】ＶＬＣ器６は、量子化器５より供給される
画像データを、例えばハフマン符号などの可変長符号に
変換し、その結果得られる符号化ビットストリームを、
送信バッファ７に出力する。The VLC unit 6 converts the image data supplied from the quantizer 5 into a variable-length code such as a Huffman code, and outputs the resulting encoded bit stream to the transmission buffer 7.

【００７２】ＶＬＣ器６には、また、量子化器５より量
子化ステップ（スケール）が、動きベクトル検出器２よ
り予測モード（画像内予測、前方予測、後方予測、また
は両方向予測のいずれが設定されたかを示すモード）、
および動きベクトルが、後述するキー信号符号化器１３
よりキー信号の符号化結果が、それぞれ供給されるよう
になされている。さらに、ＶＬＣ器６には、フラグFSZ
およびFPOSも供給されるようになされている。ＶＬＣ器
６は、これらの情報、さらには、バッファ１６に記憶さ
れた情報を、図１に示したように構成される符号化ビッ
トストリームの所定の階層のヘッダに挿入（配置）して
出力する。The VLC unit 6 is also supplied with the quantization step (scale) from the quantizer 5; with the prediction mode (a mode indicating which of intra-picture prediction, forward prediction, backward prediction, or bidirectional prediction has been set) and the motion vector from the motion vector detector 2; and with the encoding result of the key signal from a key signal encoder 13 described later. The flags FSZ and FPOS are also supplied to the VLC unit 6. The VLC unit 6 inserts (arranges) this information, together with the information stored in the buffer 16, into the headers of the predetermined layers of the encoded bit stream configured as shown in FIG. 1, and outputs the result.

【００７３】なお、ＶＬＣ器６は、各階層のヘッダに配
置された情報を、バッファ１６に出力するようになされ
ており、バッファ１６は、ＶＬＣ器６から供給される情
報を記憶するようになされている。The VLC unit 6 also outputs the information arranged in the header of each layer to the buffer 16, and the buffer 16 stores the information supplied from the VLC unit 6.

【００７４】送信バッファ７は、ＶＬＣ器３６からの符
号化ビットストリームを一時蓄積し、その蓄積量に対応
する量子化制御信号を量子化器５に出力する。即ち、送
信バッファ７は、その蓄積量が許容上限値まで増量する
と、量子化スケールを大きくする量子化制御信号を、量
子化器５に供給し、量子化スケールを大きくさせること
で、量子化器５の出力するデータ量を低下させる。ま
た、送信バッファ７は、その蓄積残量が許容下限値まで
減少すると、量子化スケールを小さくする量子化制御信
号を、量子化器５に供給し、量子化スケールを小さくさ
せることで、量子化器５の出力するデータ量を増大させ
る。このようにして、送信バッファ７のオーバフローお
よびアンダフローが防止されるようになされている。The transmission buffer 7 temporarily stores the encoded bit stream from the VLC unit 6 and outputs a quantization control signal corresponding to the stored amount to the quantizer 5. That is, when the stored amount increases to the allowable upper limit, the transmission buffer 7 supplies the quantizer 5 with a quantization control signal that increases the quantization scale, thereby reducing the amount of data output by the quantizer 5. When the remaining stored amount decreases to the allowable lower limit, the transmission buffer 7 supplies the quantizer 5 with a quantization control signal that decreases the quantization scale, thereby increasing the amount of data output by the quantizer 5. In this way, overflow and underflow of the transmission buffer 7 are prevented.
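The buffer feedback described above can be illustrated with a small sketch. The function name, the step size of 1, and the scale range of 1 to 31 are assumptions chosen for the example; the text specifies only the direction in which the quantization scale is changed.

```python
def next_quantization_scale(stored, upper_limit, lower_limit, scale,
                            step=1, min_scale=1, max_scale=31):
    """Return the quantization scale requested by the transmission buffer."""
    if stored >= upper_limit:
        # Buffer nearly full: coarser quantization, less output data.
        return min(scale + step, max_scale)
    if stored <= lower_limit:
        # Buffer nearly empty: finer quantization, more output data.
        return max(scale - step, min_scale)
    return scale


print(next_quantization_scale(950, 900, 100, 10))
print(next_quantization_scale(50, 900, 100, 10))
```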

【００７５】そして、送信バッファ７に蓄積された符号
化ビットストリームは、所定のタイミングで読み出さ
れ、例えば、磁気テープや、磁気ディスク、光磁気ディ
スク、相変化ディスクなどの記録媒体２０１に供給され
て記録され、あるいは、アナログ公衆網や、ＩＳＤＮ、
衛星回線、ＣＡＴＶ網、地上波などの伝送媒体２０２を
介して伝送される。これにより、記録媒体２０１や伝送
媒体２０２を媒介して、符号化ビットストリームが、後
述する図１６のデコーダに提供される。The encoded bit stream stored in the transmission buffer 7 is read out at a predetermined timing and is either supplied to and recorded on a recording medium 201 such as a magnetic tape, magnetic disk, magneto-optical disk, or phase change disk, or transmitted via a transmission medium 202 such as an analog public network, ISDN, a satellite link, a CATV network, or terrestrial broadcasting. The encoded bit stream is thereby provided, via the recording medium 201 or the transmission medium 202, to the decoder of FIG. 16 described later.

【００７６】ここで、上述したように、ＶＯは、ある合
成画像のシーケンスが存在する場合の、その合成画像を
構成する各物体のシーケンスであり、ＶＯＰは、ある時
刻におけるＶＯを意味する。即ち、例えば、いま、画像
Ｆ１およびＦ２を合成して構成される合成画像Ｆ３があ
る場合、画像Ｆ１またはＦ２が時系列に並んだものが、
それぞれＶＯであり、ある時刻における画像Ｆ１または
Ｆ２が、それぞれＶＯＰである。従って、例えば、画像
Ｆ１を背景とするとともに、画像Ｆ２を前景とすると、
合成画像Ｆ３を得るためには、画像Ｆ２を抜くためのキ
ー信号を用いて、画像Ｆ１およびＦ２を合成する必要が
ある。即ち、合成画像Ｆ３を得るには、画像Ｆ２を抜く
ためのキー信号が必要となる。Here, as described above, when there is a sequence of composite images, a VO is the sequence of each object constituting the composite image, and a VOP is a VO at a certain time. That is, for example, if there is a composite image F3 formed by combining images F1 and F2, the images F1 or F2 arranged in time series are each a VO, and the image F1 or F2 at a certain time is a VOP. Accordingly, if, for example, the image F1 is the background and the image F2 is the foreground, the images F1 and F2 must be combined using a key signal for extracting the image F2 in order to obtain the composite image F3. In other words, a key signal for extracting the image F2 is required to obtain the composite image F3.

【００７７】このため、各ＶＯＰを抜くためのキー信号
が、キー信号符号化器１３に供給されるようになされて
おり、キー信号符号化器１３では、そこに供給されるキ
ー信号が、例えばDPCMなどの所定の手法によって符号化
される。このキー信号の符号化結果は、ＶＬＣ器６およ
びキー信号復号器１４に供給されるようになされてい
る。For this reason, a key signal for extracting each VOP is supplied to the key signal encoder 13, which encodes it by a predetermined method such as DPCM. The encoding result of the key signal is supplied to the VLC unit 6 and the key signal decoder 14.

【００７８】キー信号復号器１４では、キー信号符号化
器１３からのキー信号の符号化結果が復号され、動きベ
クトル検出器２、ＤＣＴ器４、ＩＤＣＴ器９、動き補償
器１２、および画素置換器１５に供給され、これらのブ
ロックでは、キー信号の復号結果を必要に応じて用いて
処理が行われる。The key signal decoder 14 decodes the encoding result of the key signal from the key signal encoder 13 and supplies the decoded key signal to the motion vector detector 2, DCT unit 4, IDCT unit 9, motion compensator 12, and pixel replacement unit 15. These blocks use the decoded key signal as needed in their processing.

【００７９】このように、動きベクトル検出器２には、
キー信号復号器１４で局所復号されたキー信号も供給さ
れるようになされているが、このキー信号は、動きベク
トル検出器２が、マクロブロックの予測誤差を計算する
際に用いられる。Thus, the key signal locally decoded by the key signal decoder 14 is also supplied to the motion vector detector 2, which uses it when calculating the prediction error of a macroblock.

【００８０】即ち、ＶＯＰは、ある時刻の、ある物体の
画像であるから、その形状は、基本的に任意形状であ
り、この場合、符号化対象のマクロブロックに画像（物
体を構成する画素）が存在しない領域が含まれることが
ある。そのような場合に、動きベクトル検出器２は、符
号化対象のマクロブロックにおいて画像が存在しない画
素を除外して、予測誤差を計算するようになされてお
り、即ち、画像が存在する画素の予測誤差のみを用い
て、符号化対象のマクロブロックの予測誤差を計算し、
それを最小とする動きベクトルを検出するようになされ
ており、符号化対象のマクロブロック内の各画素につい
て、画像が存在するかどうかを認識するために、符号化
対象のマクロブロックの、局所復号されたキー信号が参
照される。That is, since a VOP is an image of a certain object at a certain time, its shape is basically arbitrary, and the macroblock to be encoded may therefore contain a region in which no image (no pixel constituting the object) exists. In such a case, the motion vector detector 2 calculates the prediction error while excluding the pixels of the macroblock to be encoded at which no image exists. That is, it calculates the prediction error of the macroblock using only the prediction errors of pixels at which an image exists, and detects the motion vector that minimizes it. To recognize, for each pixel in the macroblock to be encoded, whether an image exists, the locally decoded key signal of that macroblock is referred to.

【００８１】具体的には、動きベクトル検出器２では、
キー信号が０である画素については、画像が存在しな
い、物体（画像オブジェクト）の外側の領域に属する画
素であると認識され、キー信号が０以外である画素につ
いては、画像が存在する、物体（画像オブジェクト）の
内側の領域にある画素であると認識される。そして、動
きベクトル検出器２は、キー信号が０である画素につい
ては、予測画像を求めるための、参照画像との差分を計
算しない。Specifically, the motion vector detector 2 recognizes a pixel whose key signal is 0 as a pixel at which no image exists, belonging to the region outside the object (image object), and a pixel whose key signal is other than 0 as a pixel at which an image exists, lying in the region inside the object (image object). For a pixel whose key signal is 0, the motion vector detector 2 does not calculate the difference from the reference image that is used to obtain the predicted image.

【００８２】なお、ＶＯＰの形状が長方形状である場合
には、キー信号は常に０以外の値（バイナリ（binary）
キー（ハードキー）では１、グレイスケール（gray sca
le）キー（ソフトキー）では１乃至２５５のいずれか）
となるため、マクロブロックのすべての画素を用いて予
測誤差が計算される。When the VOP has a rectangular shape, the key signal always has a value other than 0 (1 for a binary key (hard key), or any value from 1 to 255 for a gray scale key (soft key)), so the prediction error is calculated using all the pixels of the macroblock.
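A minimal sketch of a key-masked prediction error follows. The text does not name the error measure; the sum of absolute differences used here is an assumption, as are the function name and the list-of-rows block layout. Pixels whose key signal is 0 are skipped, so an all-nonzero key (a rectangular VOP) uses every pixel.

```python
def masked_prediction_error(current, reference, key):
    """Sum of absolute differences over pixels whose key signal is not 0."""
    total = 0
    for c_row, r_row, k_row in zip(current, reference, key):
        for c, r, k in zip(c_row, r_row, k_row):
            if k != 0:  # key 0 means the pixel lies outside the object
                total += abs(c - r)
    return total


current = [[10, 20], [30, 40]]
reference = [[12, 0], [30, 45]]
print(masked_prediction_error(current, reference, [[1, 0], [0, 255]]))
print(masked_prediction_error(current, reference, [[1, 1], [1, 1]]))
```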

【００８３】一方、量子化器５が出力するデータは、逆
量子化器８にも供給され、逆量子化器８では、そのデー
タが、量子化器５より供給される量子化ステップに対応
して逆量子化され、ＤＣＴ係数とされる。このＤＣＴ係
数は、ＩＤＣＴ（逆ＤＣＴ）器９に入力され、逆ＤＣＴ
処理された後、演算器１０に供給される。On the other hand, the data output from the quantizer 5 is also supplied to the inverse quantizer 8, which inversely quantizes it in accordance with the quantization step supplied from the quantizer 5 to obtain DCT coefficients. These DCT coefficients are input to the IDCT (inverse DCT) unit 9, subjected to inverse DCT processing, and then supplied to the arithmetic unit 10.

【００８４】予測モードが、前方予測、後方予測、両方
向予測のうちのいずれかである場合、演算器１０には、
ＩＤＣＴ器９の出力の他、動き補償器１２が出力する予
測画像も供給される。演算器１０は、ＩＤＣＴ器９の出
力に、動き補償器１２が出力する予測画像を加算するこ
とで、画像を復号し、画素置換器１５に供給する。When the prediction mode is forward prediction, backward prediction, or bidirectional prediction, the arithmetic unit 10 is supplied with the predicted image output by the motion compensator 12 in addition to the output of the IDCT unit 9. The arithmetic unit 10 decodes the image by adding the predicted image output by the motion compensator 12 to the output of the IDCT unit 9, and supplies the decoded image to the pixel replacement unit 15.

【００８５】なお、演算器１０は、予測モードが画像内
符号化である場合には、ＩＤＣＴ器９の出力を、そのま
ま画素置換器１５に供給するようになされている。When the prediction mode is intra-picture coding, the arithmetic unit 10 supplies the output of the IDCT unit 9 to the pixel replacing unit 15 as it is.

【００８６】画素置換器１５では、演算器１０の出力に
対して、後述するパディング処理が施され、フレームメ
モリ１１に供給される。フレームメモリ１１では、画素
置換器１５の出力が記憶され、この記憶値、即ち、復号
画像は、動き補償器１２による動き補償のために用いら
れる。なお、フレームメモリ１１には、フラグFSZおよ
びFPOSも供給されるようになされており、フレームメモ
リ１１は、これらのフラグFSZおよびFPOSも記憶するよ
うになされている。In the pixel replacement unit 15, the output of the arithmetic unit 10 is subjected to padding processing described later, and is supplied to the frame memory 11. In the frame memory 11, the output of the pixel replacement unit 15 is stored, and the stored value, that is, the decoded image is used for motion compensation by the motion compensator 12. Note that the frame memory 11 is also supplied with the flags FSZ and FPOS, and the frame memory 11 stores these flags FSZ and FPOS.

【００８７】次に、図１３のフローチャートを参照し
て、図１２の画素置換器１５が行うパディング（padding）処理について説明する。Next, the padding processing performed by the pixel replacement unit 15 of FIG. 12 will be described with reference to the flowchart of FIG. 13.

【００８８】パディング処理では、まず最初に、ステッ
プＳ１において、演算器１０から画素置換器１５に供給
されたマクロブロックを構成する画素の１つを注目画素
として、その注目画素についてのキー信号が０であるか
否かが判定される。ステップＳ１において、注目画素に
ついてのキー信号が０でないと判定された場合、即ち、
注目画素が、画像オブジェクトの内側を構成するもので
ある場合、ステップＳ２に進み、画素置換器１５は、そ
の注目画素に対して、何も処理を施さず、そのままフレ
ームメモリ１１に出力し、ステップＳ４に進む。In the padding processing, first, in step S1, one of the pixels constituting the macroblock supplied from the arithmetic unit 10 to the pixel replacement unit 15 is taken as the pixel of interest, and it is determined whether the key signal for that pixel is 0. If it is determined in step S1 that the key signal for the pixel of interest is not 0, that is, if the pixel of interest lies inside the image object, the process proceeds to step S2, where the pixel replacement unit 15 outputs the pixel of interest to the frame memory 11 unchanged, without processing it, and the process proceeds to step S4.

【００８９】ここで、符号化対象のＶＯＰの形状が長方
形状である場合、上述したように、キー信号は常に０以
外の値となるため、画素置換器１５では、そのＶＯＰ中
の全ての画素が何も処理されずそのまま出力されること
になる。Here, when the VOP to be encoded has a rectangular shape, the key signal always takes a value other than 0, as described above, so the pixel replacement unit 15 outputs all the pixels in the VOP unchanged, without processing them.

【００９０】一方、ステップＳ１において、注目画素に
ついてのキー信号が０であると判定された場合、即ち、
注目画素が、画像オブジェクトの外側を構成するもので
ある場合、ステップＳ３に進み、注目画素の画素値が、
例えば０とされ、ステップＳ４に進む。ステップＳ４で
は、演算器１０からのマクロブロックを構成する画素す
べてについて処理を行ったかどうかが判定され、まだ、
すべての画素について処理を行っていないと判定された
場合、ステップＳ１に戻り、まだ注目画素とされていな
い画素を、新たに注目画素として、同様の処理が繰り返
される。On the other hand, if it is determined in step S1 that the key signal for the pixel of interest is 0, that is, if the pixel of interest lies outside the image object, the process proceeds to step S3, where the pixel value of the pixel of interest is set to, for example, 0, and the process proceeds to step S4. In step S4, it is determined whether all the pixels constituting the macroblock from the arithmetic unit 10 have been processed. If not all the pixels have been processed, the process returns to step S1, a pixel that has not yet been the pixel of interest is taken as the new pixel of interest, and the same processing is repeated.
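Steps S1 to S4 amount to a per-pixel pass over the macroblock, which could look like the following sketch. The function name and the list-of-rows layout are assumptions; the returned mask of pixels already written to the frame memory is a bookkeeping device added for the example.

```python
def zero_outside_pixels(block, key):
    """Steps S1 to S4: keep inside pixels, set outside pixels to 0.

    Returns (processed_block, output_mask), where output_mask marks the
    pixels that step S2 writes to the frame memory unchanged.
    """
    processed, output_mask = [], []
    for p_row, k_row in zip(block, key):
        processed.append([p if k != 0 else 0 for p, k in zip(p_row, k_row)])
        output_mask.append([k != 0 for k in k_row])
    return processed, output_mask


block = [[7, 9, 11], [5, 6, 8]]
key = [[0, 1, 1], [0, 0, 255]]
print(zero_outside_pixels(block, key))
```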

【００９１】また、ステップＳ４において、演算器１０
からの画素すべてについて処理を行ったと判定された場
合、ステップＳ５に進み、演算器１０からのマクロブロ
ックのある水平ラインが、注目水平ラインとして選択さ
れ、ステップＳ６に進む。ステップＳ６では、注目水平
ラインの両端の画素の画素値が判定される。If it is determined in step S4 that all the pixels from the arithmetic unit 10 have been processed, the process proceeds to step S5, where a horizontal line of the macroblock from the arithmetic unit 10 is selected as the horizontal line of interest, and the process proceeds to step S6. In step S6, the pixel values of the pixels at both ends of the horizontal line of interest are examined.

【００９２】即ち、ステップＳ１乃至Ｓ４の処理が施さ
れた後のマクロブロックの、ある水平ラインに注目した
場合には、その注目水平ラインについては、その両端の
画素値が、いずれも０のケース（両端の画素が画像オブ
ジェクトの外側にあるケース）、いずれか一端の画素値
が０でないケース（一端の画素だけが画像オブジェクト
の内側にあるケース）、および両端の画素値がいずれも
０でないケース（両端の画素が画像オブジェクトの内側
にあるケース）の３通りのケースが生じる。ステップＳ
６では、注目水平ラインが、これらの３つのケースのう
ちのいずれに属するのかが判定される。That is, for a given horizontal line of the macroblock after the processing of steps S1 to S4, three cases can arise: the pixel values at both ends are 0 (both end pixels lie outside the image object); the pixel value at only one end is not 0 (only one end pixel lies inside the image object); or the pixel values at both ends are not 0 (both end pixels lie inside the image object). In step S6, it is determined which of these three cases applies to the horizontal line of interest.

【００９３】ステップＳ６において、注目水平ラインの
両端の画素値が、いずれも０であると判定された場合、
ステップＳ７に進み、その注目水平ラインについて確保
された変数Ｃに、０がセットされ、ステップＳ１０に進
む。また、ステップＳ６において、注目水平ラインの両
端の画素値が、いずれも０でないと判定された場合、ス
テップＳ８に進み、その注目水平ラインについて確保さ
れた変数Ｃに、注目水平ラインの両端の画素値の平均値
がセットされ、ステップＳ１０に進む。さらに、ステッ
プＳ６において、注目水平ラインの両端の画素値のうち
のいずれか一方だけが０でないと判定された場合、ステ
ップＳ９に進み、その注目水平ラインについて確保され
た変数Ｃに、注目水平ラインの両端の画素値のうちの０
でない方の値がセットされ、ステップＳ１０に進む。If it is determined in step S6 that the pixel values at both ends of the horizontal line of interest are both 0, the process proceeds to step S7, where the variable C reserved for that horizontal line is set to 0, and the process proceeds to step S10. If it is determined in step S6 that neither of the pixel values at the two ends is 0, the process proceeds to step S8, where the variable C is set to the average of the pixel values at the two ends, and the process proceeds to step S10. If it is determined in step S6 that only one of the pixel values at the two ends is not 0, the process proceeds to step S9, where the variable C is set to the non-zero one of the two end pixel values, and the process proceeds to step S10.
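The three-way rule for the variable C can be sketched as below. The function names are hypothetical, and the text does not say how the average is rounded; this sketch assumes integer floor division.

```python
def line_fill_value(line):
    """Steps S6 to S9: derive the per-line variable from the two end pixels."""
    first, last = line[0], line[-1]
    if first == 0 and last == 0:
        return 0                      # both ends outside the object
    if first != 0 and last != 0:
        return (first + last) // 2    # both ends inside: average (rounding assumed)
    return first if first != 0 else last  # exactly one end inside


def horizontal_fill_values(block):
    """Steps S5 to S10: one value C per horizontal line of the macroblock."""
    return [line_fill_value(row) for row in block]


print(horizontal_fill_values([[0, 5, 0], [10, 0, 20], [0, 0, 8]]))
```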

【００９４】ステップＳ１０では、演算器１０からのマ
クロブロックのすべての水平ラインを注目水平ラインと
して処理を行ったかどうかが判定され、まだ、すべての
水平ラインを注目水平ラインとして処理を行っていない
と判定された場合、ステップＳ５に戻り、まだ、注目水
平ラインとして選択されていない水平ラインが、新たな
注目水平ラインとして選択され、以下、同様の処理が繰
り返される。In step S10, it is determined whether all the horizontal lines of the macroblock from the arithmetic unit 10 have been processed as the horizontal line of interest. If not, the process returns to step S5, a horizontal line not yet selected is selected as the new horizontal line of interest, and the same processing is repeated.

【００９５】また、ステップＳ１０において、すべての
水平ラインを注目水平ラインとして処理を行ったと判定
された場合、ステップＳ１１に進む。If it is determined in step S10 that all horizontal lines have been processed as the horizontal line of interest, the process proceeds to step S11.

【００９６】ステップＳ１１乃至ステップＳ１６では、
演算器１０からのマクロブロックの水平ラインではな
く、垂直ラインを対象として、ステップＳ５乃至Ｓ１０
における場合とそれぞれ同様の処理が行われる。In steps S11 to S16, the same processing as in steps S5 to S10 is performed on the vertical lines, rather than the horizontal lines, of the macroblock from the arithmetic unit 10.

【００９７】即ち、ステップＳ１１では、演算器１０か
らのマクロブロックのある垂直ラインが、注目垂直ライ
ンとして選択され、ステップＳ１２に進む。ステップＳ
１２では、注目垂直ラインの両端の画素の画素値が判定
される。That is, in step S11, a vertical line of the macroblock from the arithmetic unit 10 is selected as the vertical line of interest, and the process proceeds to step S12. In step S12, the pixel values of the pixels at both ends of the vertical line of interest are examined.

【００９８】即ち、ステップＳ１乃至Ｓ４の処理が施さ
れた後のマクロブロックの、ある垂直ラインに注目した
場合にも、その注目垂直ラインについては、その両端の
画素値が、いずれも０のケース（両端の画素が画像オブ
ジェクトの外側にあるケース）、いずれか一端の画素値
が０でないケース（一端の画素だけが画像オブジェクト
の内側にあるケース）、および両端の画素値がいずれも
０でないケース（両端の画素が画像オブジェクトの内側
にあるケース）の３通りのケースが生じる。ステップＳ
１２では、注目垂直ラインが、これらの３つのケースの
うちのいずれに属するのかが判定される。That is, for a given vertical line of the macroblock after the processing of steps S1 to S4, the same three cases can arise: the pixel values at both ends are 0 (both end pixels lie outside the image object); the pixel value at only one end is not 0 (only one end pixel lies inside the image object); or the pixel values at both ends are not 0 (both end pixels lie inside the image object). In step S12, it is determined which of these three cases applies to the vertical line of interest.

【００９９】ステップＳ１２において、注目垂直ライン
の両端の画素値が、いずれも０であると判定された場
合、ステップＳ１３に進み、その注目垂直ラインについ
て確保された変数Ｂに、０がセットされ、ステップＳ１
６に進む。また、ステップＳ１２において、注目垂直ラ
インの両端の画素値が、いずれも０でないと判定された
場合、ステップＳ１４に進み、その注目垂直ラインにつ
いて確保された変数Ｂに、注目垂直ラインの両端の画素
値の平均値がセットされ、ステップＳ１６に進む。さら
に、ステップＳ１２において、注目垂直ラインの両端の
画素値のうちのいずれか一方だけが０でないと判定され
た場合、ステップＳ１５に進み、その注目垂直ラインに
ついて確保された変数Ｂに、注目垂直ラインの両端の画
素値のうちの０でない方の値がセットされ、ステップＳ
１６に進む。If it is determined in step S12 that the pixel values at both ends of the vertical line of interest are both 0, the process proceeds to step S13, where the variable B reserved for that vertical line is set to 0, and the process proceeds to step S16. If it is determined in step S12 that neither of the pixel values at the two ends is 0, the process proceeds to step S14, where the variable B is set to the average of the pixel values at the two ends, and the process proceeds to step S16. If it is determined in step S12 that only one of the pixel values at the two ends is not 0, the process proceeds to step S15, where the variable B is set to the non-zero one of the two end pixel values, and the process proceeds to step S16.

【０１００】ステップＳ１６では、演算器１０からのマ
クロブロックのすべての垂直ラインを注目垂直ラインと
して処理を行ったかどうかが判定され、まだ、すべての
垂直ラインを注目垂直ラインとして処理を行っていない
と判定された場合、ステップＳ１１に戻り、まだ、注目
垂直ラインとして選択されていない垂直ラインが、新た
な注目垂直ラインとして選択され、以下、同様の処理が
繰り返される。In step S16, it is determined whether all the vertical lines of the macroblock from the arithmetic unit 10 have been processed as the vertical line of interest. If not, the process returns to step S11, a vertical line not yet selected is selected as the new vertical line of interest, and the same processing is repeated.

【０１０１】また、ステップＳ１６において、すべての
垂直ラインを注目垂直ラインとして処理を行ったと判定
された場合、ステップＳ１７に進み、演算器１０からの
マクロブロックを構成する画素のうち、ステップＳ２で
そのままフレームメモリ１１に出力した画素を除いたも
のの中から、ある画素が、注目画素として選択され、ス
テップＳ１８に進む。If it is determined in step S16 that all the vertical lines have been processed as the vertical line of interest, the process proceeds to step S17, where a pixel is selected as the pixel of interest from among the pixels of the macroblock from the arithmetic unit 10 other than those output unchanged to the frame memory 11 in step S2, and the process proceeds to step S18.

【０１０２】ステップＳ１８では、注目画素上で交差す
る垂直ラインと水平ラインそれぞれについての変数Ｂと
Ｃのセット（Ｂ，Ｃ）の値が判定される。In step S18, the value of the set (B, C) of the variables B and C for each of the vertical and horizontal lines intersecting on the target pixel is determined.

【０１０３】ステップＳ１８において、変数Ｂが０で、
Ｃが０でないと判定された場合、ステップＳ１９に進
み、変数Ｃの値が、注目画素の画素値として、フレーム
メモリ１１に出力され、ステップＳ２３に進む。また、
ステップＳ１８において、変数ＢおよびＣのいずれも０
でないと判定された場合、ステップＳ２０に進み、変数
ＢとＣの値の平均値が、注目画素の画素値として、フレ
ームメモリ１１に出力され、ステップＳ２３に進む。さ
らに、ステップＳ１８において、変数ＢおよびＣのいず
れも０であると判定された場合、ステップＳ２１に進
み、注目画素の画素値が０のままとされ、ステップＳ２
３に進む。If it is determined in step S18 that the variable B is 0 and C is not 0, the process proceeds to step S19, where the value of the variable C is output to the frame memory 11 as the pixel value of the pixel of interest, and the process proceeds to step S23. If it is determined in step S18 that neither B nor C is 0, the process proceeds to step S20, where the average of B and C is output to the frame memory 11 as the pixel value of the pixel of interest, and the process proceeds to step S23. If it is determined in step S18 that both B and C are 0, the process proceeds to step S21, where the pixel value of the pixel of interest is left at 0, and the process proceeds to step S23.

【０１０４】一方、ステップＳ１８において、変数Ｂが
０でなく、Ｃが０であると判定された場合、ステップＳ
１９に進み、変数Ｂの値が、注目画素の画素値として、
フレームメモリ１１に出力され、ステップＳ２３に進
む。ステップＳ２３では、演算器１０からのマクロブロ
ックを構成する画素のうち、ステップＳ２でそのまま出
力した画素を除いたものすべてについて処理を行ったか
どうかが判定され、まだ行っていないと判定された場
合、ステップＳ１７に戻り、まだ、注目画素とされてい
ない画素が、新たに注目画素として選択され、以下、同
様の処理が繰り返される。On the other hand, if it is determined in step S18 that the variable B is not 0 and C is 0, the process proceeds to step S19, where the value of the variable B is output to the frame memory 11 as the pixel value of the pixel of interest, and the process proceeds to step S23. In step S23, it is determined whether all the pixels of the macroblock from the arithmetic unit 10, other than those output unchanged in step S2, have been processed. If not, the process returns to step S17, a pixel that has not yet been the pixel of interest is newly selected, and the same processing is repeated.
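The four-way branch on the pair (B, C) in step S18 can be summarized in a short sketch. The function name is hypothetical, `None` is used here to mark a pixel that stays un-output, and the rounding of the average (integer floor division) is an assumption.

```python
def padded_value(b, c):
    """Step S18 onward: value written for an outside pixel, or None.

    b is the variable B of the vertical line through the pixel and
    c is the variable C of the horizontal line through it.
    """
    if b == 0 and c != 0:
        return c              # only the horizontal line gives a value
    if b != 0 and c != 0:
        return (b + c) // 2   # both lines give a value: average (rounding assumed)
    if b != 0 and c == 0:
        return b              # only the vertical line gives a value
    return None               # both 0: pixel stays 0 and is not output yet


print([padded_value(0, 12), padded_value(10, 20), padded_value(9, 0), padded_value(0, 0)])
```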

【０１０５】また、ステップＳ２３において、演算器１
０からのマクロブロックを構成する画素のうち、ステッ
プＳ２でそのまま出力した画素を除いたものすべてにつ
いて処理を行ったと判定された場合、ステップＳ２４に
進み、既にフレームメモリ１１に出力された画素のう
ち、まだフレームメモリ１１に出力されていない各画素
（以下、適宜、未出力画素という）に最も近いものが検
出される。さらに、ステップＳ２４では、その検出され
た画素の画素値が、未出力画素の画素値として、フレー
ムメモリ１１に出力され、パディング処理を終了する。
なお、既に、フレームメモリ１１に出力された画素の中
で、未出力画素に最も近いものが、２個以上検出された
場合には、それらの画素値の平均値が、未出力画素の画
素値として出力される。If it is determined in step S23 that all of the pixels constituting the macroblock from the arithmetic unit 10, except for the pixels output as they are in step S2, have been processed, the process proceeds to step S24. There, for each pixel that has not yet been output to the frame memory 11 (hereinafter referred to as an unoutput pixel where appropriate), the closest pixel among those already output to the frame memory 11 is detected. The pixel value of the detected pixel is then output to the frame memory 11 as the pixel value of the unoutput pixel, and the padding process ends. If two or more already-output pixels are equally closest to an unoutput pixel, the average of their pixel values is output as the pixel value of the unoutput pixel.
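
The per-pixel decision of steps S18 to S21 and the nearest-pixel fill of step S24 might be sketched as follows. This is only an illustrative sketch: the function names, the use of None to mark an unoutput pixel, the unrounded average, and the Manhattan distance used to decide which pixel is closest are all assumptions that do not appear in the text, and the derivation of the variables B and C is not reproduced here.

```python
def select_value(b, c):
    """Steps S18 to S21: choose the target pixel's value from B and C."""
    if b == 0 and c == 0:
        return None            # S21: left at 0, treated here as "not yet output"
    if b == 0:
        return c               # only C is non-zero
    if c == 0:
        return b               # only B is non-zero
    return (b + c) / 2         # S20: neither is zero, use the average


def fill_unoutput(block):
    """Step S24: give every unoutput pixel (None) the value of the nearest
    already-output pixel, averaging when several are equally near."""
    h, w = len(block), len(block[0])
    done = [(y, x) for y in range(h) for x in range(w) if block[y][x] is not None]
    result = [row[:] for row in block]
    for y in range(h):
        for x in range(w):
            if block[y][x] is not None or not done:
                continue

            def dist(p):
                # Manhattan distance is an assumption; the text names no metric.
                return abs(p[0] - y) + abs(p[1] - x)

            best = min(dist(p) for p in done)
            nearest = [block[py][px] for (py, px) in done if dist((py, px)) == best]
            result[y][x] = sum(nearest) / len(nearest)
    return result
```

For example, under these assumptions a row holding 10, an unoutput pixel, and 20 is filled with the average of its two equally near neighbours.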

【０１０６】以上のようなパディング処理を行うこと
で、画像オブジェクトの外側を構成する画素が、いわば
補間され、これにより、モスキートノイズの低減化およ
び動き補償の効率化を図ることができる。By performing the above-described padding processing, pixels constituting the outside of the image object are interpolated, so to speak, so that mosquito noise can be reduced and motion compensation can be made more efficient.

【０１０７】次に、図１のＶＬＣ器６（符号化手段）の
処理について、さらに説明する。Next, the processing of the VLC unit 6 (encoding means) in FIG. 1 will be further described.

【０１０８】ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，ＶＯ，Ｖ
ＯＬ，ＧＯＶ，ＶＯＰそれぞれのヘッダに、本来配置す
べき情報を配置し、さらに、量子化器５の出力の可変長
符号化結果を配置することで、符号化ビットストリーム
を構成し、送信バッファ７に出力する。The VLC unit 6 places, in the headers of VS, VISO, VO, VOL, GOV, and VOP, the information that originally belongs there, further places the variable-length-coded output of the quantizer 5, and thereby forms an encoded bit stream, which it outputs to the transmission buffer 7.

【０１０９】また、ＶＬＣ器６は、ＧＯＶより上位の階
層であるＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダに配
置した情報を、バッファ１６に出力して記憶させる。The VLC unit 6 also outputs the information placed in the headers of VS, VISO, VO, and VOL, which are layers higher than GOV, to the buffer 16, where it is stored.

【０１１０】その後、ＶＬＣ器６は、ＧＯＶヘッダを出
力するとき、バッファ１６に記憶されている、ＧＯＶよ
り上位の階層のＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報を読み出し、ＧＯＶヘッダの所定の位置に挿入
して（含めて）出力する。従って、この場合、ＧＯＶヘ
ッダには、そこに本来配置すべき情報の他、ＶＳ，ＶＩ
ＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報も配置される。Thereafter, when outputting a GOV header, the VLC unit 6 reads from the buffer 16 the stored header information of VS, VISO, VO, and VOL, the layers higher than GOV, inserts (includes) it at a predetermined position in the GOV header, and outputs the result. In this case, therefore, the GOV header carries the header information of VS, VISO, VO, and VOL in addition to the information that originally belongs there.

【０１１１】図１４は、以上のような処理を行うＶＬＣ
器６が出力するＧＯＶのシンタクスを示している。な
お、図１４において影を付してある部分が、図８に示し
たＦＣＤにおけるシンタクスと異なる部分となってい
る。FIG. 14 shows the syntax of the GOV output by the VLC unit 6 that performs the processing described above. The shaded portions in FIG. 14 are the portions that differ from the syntax in the FCD shown in FIG. 8.

【０１１２】group_VOP_start_codeは、GOVの開始位置
を示す32ビットのユニークなコードである。time_code
（時刻情報）は、１８bitで構成され、GOVにおいて、最
初に表示されるＶＯＰの秒精度の表示時刻を表す。この
time_codeは、IEC standardpublication 461で規定され
ている「time and control codes for video tape reco
rders」に相当する。group_VOP_start_code is a unique 32-bit code indicating the start position of the GOV. time_code (time information) consists of 18 bits and represents, with one-second precision, the display time of the VOP displayed first in the GOV. This time_code corresponds to the "time and control codes for video tape recorders" specified in IEC standard publication 461.

【０１１３】closed_gopおよびbroken_linkについて
は、MPEG4VideoFCD規格(ISO/IEC 14496-2)を参照された
い。For closed_gop and broken_link, refer to the MPEG4 Video FCD standard (ISO/IEC 14496-2).

【０１１４】is_extension（ヘッダ情報有無フラグ）
は、本実施の形態で導入した１ビットのフラグで、GOV
ヘッダに、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダ
の、デコーダの初期化を行うための情報、その他の情報
を含めるかどうかを表す。ＶＬＣ器６では、例えば、フ
ラグis_extensionが１の場合、ＶＳ，ＶＩＳＯ，ＶＯ，
ＶＯＬの各ヘッダの情報（VisualObjectSequence(), Vi
sualObject(), VideoObject(), VideoObjectLayer()）
が、GOVヘッダに含められる。即ち、フラグis_extensio
nが１の場合、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報は、group_VOP_start_code，time_code，close
d_gop，broken_link,is_extensionの後に続けて配置さ
れる。is_extension (header information presence flag) is a 1-bit flag introduced in the present embodiment. It indicates whether the GOV header includes the information in the VS, VISO, VO, and VOL headers that is used to initialize the decoder, together with other information from those headers. In the VLC unit 6, for example, when the flag is_extension is 1, the header information of VS, VISO, VO, and VOL (VisualObjectSequence(), VisualObject(), VideoObject(), VideoObjectLayer()) is included in the GOV header. That is, when the flag is_extension is 1, the header information of VS, VISO, VO, and VOL is placed immediately after group_VOP_start_code, time_code, closed_gop, broken_link, and is_extension.
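
The field order just described can be illustrated with a small bit-string builder. This is a sketch under stated assumptions, not the syntax of FIG. 14 itself: the function name, the start code value 0x000001B3, the 1-bit widths of closed_gop and broken_link, and the representation of the upper-layer headers as an opaque string of bits are all assumptions; only the 32-bit start code, the 18-bit time_code, and the 1-bit is_extension come from the text. Any alignment before the next start code is omitted.

```python
GROUP_VOP_START_CODE = 0x000001B3  # assumed value; the text only says "unique 32-bit code"


def gov_header_bits(time_code, closed_gop, broken_link, is_extension, upper_headers=""):
    """Return the GOV header fields as a string of '0'/'1' characters."""
    if not 0 <= time_code < (1 << 18):
        raise ValueError("time_code must fit in 18 bits")
    bits = format(GROUP_VOP_START_CODE, "032b")   # 32-bit group_VOP_start_code
    bits += format(time_code, "018b")             # 18-bit time_code
    bits += "1" if closed_gop else "0"            # closed_gop (assumed 1 bit)
    bits += "1" if broken_link else "0"           # broken_link (assumed 1 bit)
    bits += "1" if is_extension else "0"          # is_extension (1 bit)
    if is_extension:
        bits += upper_headers                     # VS, VISO, VO, VOL header bits
    return bits
```

With is_extension set to 0 the header built this way is 53 bits long; with is_extension set to 1 the upper-layer header bits follow directly after the flag.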

【０１１５】さらに、フラグis_extensionが１の場合
は、ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各
ヘッダの情報を、GOVヘッダに含めた後、その含めたＶ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報を、バッ
ファ１６に供給し、いままで記憶されていた情報に替え
て記憶させる。Further, when the flag is_extension is 1, the VLC unit 6, after including the header information of VS, VISO, VO, and VOL in the GOV header, supplies that included header information to the buffer 16, where it replaces the information stored until then.

【０１１６】なお、ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，Ｖ
Ｏ，ＶＯＬの各ヘッダを、その後に出力したときも、そ
のヘッダの情報をバッファ１６に供給して記憶させる。Note that whenever the VLC unit 6 subsequently outputs a VS, VISO, VO, or VOL header, it likewise supplies the information of that header to the buffer 16 for storage.

【０１１７】従って、バッファ１６には、常に最新のＶ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬのヘッダの情報が記憶され
ていることになる。Therefore, the buffer 16 always holds the latest header information of VS, VISO, VO, and VOL.

【０１１８】ここで、フラグis_extensionが１の場合
に、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報
を、GOVヘッダに含めた後、その含めたＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報を、バッファ１６に
供給して記憶させるのは、次のような理由による。The reason that, when the flag is_extension is 1, the header information of VS, VISO, VO, and VOL included in the GOV header is then supplied to and stored in the buffer 16 is as follows.

【０１１９】即ち、ＶＬＣ器６には、符号化効率を向上
させる等のため、GOVヘッダに含めさせるＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報を変更させることが
できる。この場合、その変更後の情報が最新の情報とい
うことになるので、その最新の情報を、バッファ１６に
記憶させておくために、GOVヘッダに含めたＶＳ，ＶＩ
ＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報を、バッファ１６
に供給して記憶させるようになされている。Namely, the VLC unit 6 can be made to change the header information of VS, VISO, VO, and VOL that is included in the GOV header, for example in order to improve coding efficiency. In that case the changed information is the latest information, so the header information of VS, VISO, VO, and VOL included in the GOV header is supplied to the buffer 16 and stored there in order to keep the latest information in the buffer 16.

【０１２０】一方、フラグis_extensionが０の場合、Ｖ
ＬＣ器６では、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報は、GOVヘッダに含められない。On the other hand, when the flag is_extension is 0, the VLC unit 6 does not include the header information of VS, VISO, VO, and VOL in the GOV header.

【０１２１】なお、バッファ１６の記憶値は、外部から
変更することが可能なようになっている。即ち、ＶＳ，
ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報の一部または
全部を、符号化ビットストリームの途中で変化させたい
場合がある。即ち、例えば、デコードに用いる量子化マ
トリクスを、符号化ビットストリームの復号の途中で変
更したい場合などがある。このような場合、ユーザは、
バッファ１６に記憶されているＶＳ，ＶＩＳＯ，ＶＯ，
ＶＯＬの各ヘッダの情報を、適宜、所望の情報に変更す
ることができる。この変更後の情報は、フラグis_exten
sionが１になっているGOVヘッダに配置されて出力され
るから、デコーダでは、そのGOVヘッダを受信した後
に、その変更後の情報に基づいて、デコードが行われる
ことになる。The values stored in the buffer 16 can be changed from outside. That is, there are cases where it is desired to change part or all of the header information of VS, VISO, VO, and VOL partway through the encoded bit stream, for example to change the quantization matrix used for decoding in the middle of decoding the encoded bit stream. In such a case, the user can change the header information of VS, VISO, VO, and VOL stored in the buffer 16 to the desired information as appropriate. The changed information is placed in and output with a GOV header whose flag is_extension is 1, so that, after receiving that GOV header, the decoder performs decoding based on the changed information.

【０１２２】次に、図１５のフローチャートを参照し
て、図１４に示したようなシンタクスのGOVを出力する
ためのＶＬＣ器６の処理について説明する。Next, the processing of the VLC unit 6 for outputting a GOV with the syntax shown in FIG. 14 will be described with reference to the flowchart of FIG. 15.

【０１２３】ＶＬＣ器６は、上述したように、ＶＳ，Ｖ
ＩＳＯ，ＶＯ，ＶＯＬ，ＧＯＶ，ＶＯＰそれぞれのヘッ
ダに、本来配置すべき情報を配置し、さらに、量子化器
５の出力の可変長符号化結果を配置することで、符号化
ビットストリームを構成し、送信バッファ７に出力して
いる。As described above, the VLC unit 6 places, in the headers of VS, VISO, VO, VOL, GOV, and VOP, the information that originally belongs there, further places the variable-length-coded output of the quantizer 5, and thereby forms an encoded bit stream, which it outputs to the transmission buffer 7.

【０１２４】さらに、ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，
ＶＯ，ＶＯＬの各ヘッダを出力するごとに、各ヘッダに
配置した情報を、バッファ１６に出力して記憶させてい
る（上書きしている）。Further, each time the VLC unit 6 outputs a VS, VISO, VO, or VOL header, it outputs the information placed in that header to the buffer 16, where it is stored (overwriting the previous contents).

【０１２５】そして、ＶＬＣ器６は、ＧＯＶヘッダを出
力する場合には、ステップＳ３１において、そのＧＯＶ
ヘッダについてのフラグis_extensionが１であるかどう
かを判定する。ステップＳ３１において、フラグis_exten
sionが１でない（０である）と判定された場合、ＶＬＣ
器６は、ＧＯＶヘッダに、本来配置すべき情報（図８に
示した情報）およびフラグis_extensionを配置し（Ｖ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報は配置し
ない）、その結果得られるＧＯＶヘッダを出力する。そ
して、次のＧＯＶヘッダを出力するタイミングまで待っ
て、ステップＳ３１に戻る。When outputting a GOV header, the VLC unit 6 determines in step S31 whether the flag is_extension for that GOV header is 1. If it is determined in step S31 that the flag is_extension is not 1 (that is, it is 0), the VLC unit 6 places in the GOV header the information that originally belongs there (the information shown in FIG. 8) and the flag is_extension (the header information of VS, VISO, VO, and VOL is not placed), and outputs the resulting GOV header. It then waits until the timing for outputting the next GOV header and returns to step S31.

【０１２６】一方、ステップＳ３１において、フラグis
_extensionが１であると判定された場合、ステップＳ３
２に進み、バッファ１６に記憶されているＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの最新の情報を読み出し、
その最近の情報およびフラグis_extension、並びに本来
配置すべき情報を、ＧＯＶヘッダに配置して出力する。
そして、次のＧＯＶヘッダを出力するタイミングまで待
って、ステップＳ３１に戻る。On the other hand, if it is determined in step S31 that the flag is_extension is 1, the process proceeds to step S32, where the latest header information of VS, VISO, VO, and VOL stored in the buffer 16 is read out, and that latest information, the flag is_extension, and the information that originally belongs in the GOV header are placed in the GOV header and output. The process then waits until the timing for outputting the next GOV header and returns to step S31.
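
The interplay between the buffer 16 and steps S31 and S32 can be sketched as follows. This is a simplified illustration: the class HeaderBuffer, the function emit_gov_header, and the representation of headers as Python dicts are hypothetical and stand in for the buffer 16 and the VLC unit 6 only loosely.

```python
class HeaderBuffer:
    """Stands in for the buffer 16: keeps the latest VS, VISO, VO, VOL header info."""

    def __init__(self):
        self.latest = {}

    def store(self, layer, info):
        self.latest[layer] = info          # overwrite with the newest value

    def read_all(self):
        return dict(self.latest)


def emit_gov_header(own_info, is_extension, buffer16):
    """Steps S31 and S32: build one GOV header, represented as a dict."""
    header = {"gov": own_info, "is_extension": int(bool(is_extension))}
    if is_extension:                               # S31: the flag is 1
        header["upper"] = buffer16.read_all()      # S32: latest upper-layer headers
    return header
```

Because store() overwrites, a GOV header emitted with is_extension equal to 1 always carries the most recent upper-layer information, which matches the behaviour the text attributes to the buffer 16.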

【０１２７】なお、各ＧＯＶに配置されるフラグis_ext
ensionの値は、例えば、エンコーダの管理者側におい
て、あらかじめ、ＶＬＣ器に設定されている。The value of the flag is_extension placed in each GOV is set in the VLC unit in advance, for example by the administrator of the encoder.

【０１２８】次に、図１６は、記録媒体２０１または伝
送媒体２０２を介して提供される符号化ビットストリー
ムを復号するデコーダの一実施の形態の構成例を示して
いる。このデコーダを構成するバッファ２１、ＩＶＬＣ
器２２，逆量子化器２３，ＩＤＣＴ器２４，演算器２
５、フレームメモリ２６、動き補償器２７は、図２５に
示したデコーダを構成するバッファ１０１、ＩＶＬＣ器
１０２，逆量子化器１０３，ＩＤＣＴ器１０４，演算器
１０５、フレームメモリ１０６、動き補償器１０７にそ
れぞれ対応している。従って、バッファ２１乃至動き補
償器２７それぞれでは、バッファ１０１乃至動き補償器
１０７それぞれの処理と同一の処理が行われる場合があ
り、そのような同一の処理についての説明は、適宜省略
する。FIG. 16 shows a configuration example of an embodiment of a decoder that decodes an encoded bit stream provided via the recording medium 201 or the transmission medium 202. The buffer 21, IVLC unit 22, inverse quantizer 23, IDCT unit 24, arithmetic unit 25, frame memory 26, and motion compensator 27 constituting this decoder correspond respectively to the buffer 101, IVLC unit 102, inverse quantizer 103, IDCT unit 104, arithmetic unit 105, frame memory 106, and motion compensator 107 constituting the decoder shown in FIG. 25. Accordingly, the buffer 21 through the motion compensator 27 may perform the same processing as the buffer 101 through the motion compensator 107, and the description of such identical processing is omitted where appropriate.

【０１２９】記録媒体２０１または伝送媒体２０２を介
して提供される符号化ビットストリームは、受信バッフ
ァ２１（受信手段）で受信されて一時記憶される。そし
て、受信バッファ２１に記憶された符号化ビットストリ
ームは、適宜、ＩＶＬＣ（可変長復号）器２２によって
読み出される。The coded bit stream provided via the recording medium 201 or the transmission medium 202 is received by the reception buffer 21 (receiving means) and is temporarily stored. Then, the encoded bit stream stored in the reception buffer 21 is read out by an IVLC (variable length decoding) unit 22 as appropriate.

【０１３０】ＩＶＬＣ器２２（復号手段）は、受信バッ
ファ２１から読み出した符号化ビットストリームを可変
長復号し、動きベクトルおよび予測モードを、動き補償
器２７に、また、量子化ステップを逆量子化器２３に、
それぞれ出力するとともに、可変長復号された画像デー
タ（量子化されたＤＣＴ係数）を、逆量子化器２３に出
力する。なお、ＩＶＬＣ器２２は、その他、各階層のヘ
ッダに含まれている、デコーダのデコード処理に用いら
れるパラメータの初期化に必要な情報、その他の情報
（例えば、オーバラップ動き補償を行うかどうかを示す
フラグや、量子化マトリクスなど）を、適宜、必要なブ
ロックに供給する（例えば、オーバラップ動き補償を行
うかどうかを示すフラグは動き補償器２７に、量子化マ
トリクスは逆量子化器２３に、それぞれ供給される）The IVLC unit 22 (decoding means) variable-length decodes the encoded bit stream read from the reception buffer 21, outputs the motion vector and the prediction mode to the motion compensator 27 and the quantization step to the inverse quantizer 23, and outputs the variable-length-decoded image data (quantized DCT coefficients) to the inverse quantizer 23. In addition, the IVLC unit 22 supplies the information contained in the header of each layer that is needed to initialize the parameters used in the decoding process, as well as other information (for example, a flag indicating whether to perform overlap motion compensation, or a quantization matrix), to the blocks that need it as appropriate (for example, the flag indicating whether to perform overlap motion compensation is supplied to the motion compensator 27, and the quantization matrix to the inverse quantizer 23).

【０１３１】さらに、ＩＶＬＣ器２２は、ＧＯＶヘッダ
については、フラグis_extensionを復号し、フラグis_e
xtensionが1である場合、即ち、ＧＯＶヘッダに、Ｖ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報が含まれ
ている場合、その情報も、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯ
Ｌの各ヘッダと同様に可変長復号し、その復号結果を、
必要なブロックに供給する。具体的には、例えば、動き
ベクトル、予測モード、オーバーラップ動き補償を行う
かどうかを示すフラグなどは動き補償器２７に、量子化
ステップおよび量子化マトリクスなどは逆量子化器２３
に、それぞれ供給される。Further, for a GOV header, the IVLC unit 22 decodes the flag is_extension. When the flag is_extension is 1, that is, when the GOV header contains the header information of VS, VISO, VO, and VOL, the IVLC unit 22 variable-length decodes that information in the same way as the VS, VISO, VO, and VOL headers themselves and supplies the decoded result to the blocks that need it. Specifically, for example, the motion vector, the prediction mode, and the flag indicating whether to perform overlap motion compensation are supplied to the motion compensator 27, and the quantization step and the quantization matrix are supplied to the inverse quantizer 23.

【０１３２】また、ＩＶＬＣ器２２は、符号化ビットス
トリームに含まれるフラグFSZおよびFPOSを復号し、フ
レームメモリ２６、動き補償器２７、およびキー信号復
号器２９に供給する。さらに、ＩＶＬＣ器２２は、符号
化ビットストリームに含まれる、符号化されたキー信号
（キー信号ビットストリーム）を抽出し、キー信号復号
器２９に供給する。The IVLC unit 22 decodes the flags FSZ and FPOS included in the encoded bit stream, and supplies them to the frame memory 26, the motion compensator 27, and the key signal decoder 29. Further, the IVLC unit 22 extracts an encoded key signal (key signal bit stream) included in the encoded bit stream, and supplies the extracted key signal to the key signal decoder 29.

【０１３３】キー信号復号器２９は、ＩＶＬＣ器２２よ
り供給されるキー信号ビットストリームを復号する。こ
の復号されたキー信号は、IDCT器２４、動き補償器２
７、および画素置換器２８に供給される。The key signal decoder 29 decodes the key signal bit stream supplied from the IVLC unit 22. The decoded key signal is supplied to the IDCT unit 24, the motion compensator 27, and the pixel replacement unit 28.

【０１３４】逆量子化器２３は、ＩＶＬＣ器２２より供
給される画像データを、同じくＩＶＬＣ器２２より供給
される量子化ステップに従って逆量子化し、IDCT器２４
に出力する。ＩＤＣＴ器２４は、逆量子化器２３より出
力されたデータ（DCT係数）に対して、逆DCT処理を施
し、演算器２５に供給する。The inverse quantizer 23 inversely quantizes the image data supplied from the IVLC unit 22 in accordance with the quantization step also supplied from the IVLC unit 22, and outputs the result to the IDCT unit 24. The IDCT unit 24 applies inverse DCT processing to the data (DCT coefficients) output from the inverse quantizer 23 and supplies the result to the arithmetic unit 25.

【０１３５】演算器２５は、IDCT器２４より供給された
画像データが、Ｉ−ＶＯＰのデータである場合、そのデ
ータを、その後に入力される画像データ（ＰまたはＢ−
ＶＯＰのデータ）の予測画像の生成のために、そのま
ま、画素置換器２８を介してフレームメモリ２６に供給
して記憶させる。When the image data supplied from the IDCT unit 24 is I-VOP data, the arithmetic unit 25 supplies that data as it is, via the pixel replacement unit 28, to the frame memory 26 for storage, so that it can be used to generate predicted images for image data input later (P-VOP or B-VOP data).

【０１３６】なお、画素置換器２８では、図１２の画素
置換器１５と同様の処理が行われる。Note that the pixel replacement unit 28 performs the same processing as the pixel replacement unit 15 in FIG. 12.

【０１３７】一方、演算器２５に供給されるデータが、
ＰまたはＢ−ＶＯＰのデータである場合、動き補償器２
７は、ＩＶＬＣ器２２より供給される動きベクトルおよ
び予測モードに従って、フレームメモリ２６に記憶され
た、既に復号されている画像を読み出すことで、予測画
像を生成し、演算器２５に出力する。演算器２５ではID
CT器２４より供給される画像データ（差分データ）と、
動き補償器２７より供給される予測画像データを加算
し、復号画像とする。この復号画像は、画素置換器２８
を介してフレームメモリ２６に供給されて記憶され、後
に復号する画像の参照画像（予測画像を生成するために
参照される画像）として、適宜用いられる。また、フレ
ームメモリ２６に記憶された復号画像は、上述したよう
に参照画像として用いられる他、適宜読み出され、例え
ば、図示せぬディスプレイなどに供給されて表示され
る。On the other hand, when the data supplied to the arithmetic unit 25 is P-VOP or B-VOP data, the motion compensator 27 reads an already-decoded image stored in the frame memory 26 in accordance with the motion vector and the prediction mode supplied from the IVLC unit 22, thereby generates a predicted image, and outputs it to the arithmetic unit 25. The arithmetic unit 25 adds the image data (difference data) supplied from the IDCT unit 24 and the predicted image data supplied from the motion compensator 27 to obtain a decoded image. This decoded image is supplied via the pixel replacement unit 28 to the frame memory 26 and stored, and is used as appropriate as a reference image (an image referred to in order to generate a predicted image) for images decoded later. Besides being used as a reference image as described above, the decoded image stored in the frame memory 26 is read out as appropriate and supplied to, for example, a display (not shown) for display.

【０１３８】次に、図１７のフローチャートを参照し
て、図１６のＩＶＬＣ器２２がＧＯＶヘッダに関して行
う処理について、さらに説明する。Next, the processing that the IVLC unit 22 of FIG. 16 performs on the GOV header will be further described with reference to the flowchart of FIG. 17.

【０１３９】ＩＶＬＣ器２２は、ＧＯＶヘッダを受信す
ると、そのＧＯＶヘッダについて、通常行うべき処理
（図８に示したＧＯＶヘッダが送信されてきたときに行
うべき処理）を行い、さらに、ステップＳ４１におい
て、ＧＯＶヘッダ（図１４）に配置されているフラグis
_extensionが1であるかどうかを判定する。ステップＳ
４１において、フラグis_extensionが1でない（０であ
る）と判定された場合、即ち、ＧＯＶヘッダに、ＶＳ，
ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報が含まれてい
ない場合、次のＧＯＶヘッダが送信されてくるのを待っ
て、ステップＳ４１に戻る。Upon receiving a GOV header, the IVLC unit 22 performs the processing normally performed on a GOV header (the processing performed when the GOV header shown in FIG. 8 is received) and then, in step S41, determines whether the flag is_extension placed in the GOV header (FIG. 14) is 1. If it is determined in step S41 that the flag is_extension is not 1 (that is, it is 0), in other words, if the GOV header does not contain the header information of VS, VISO, VO, and VOL, the process waits for the next GOV header to arrive and returns to step S41.

【０１４０】また、ステップＳ４１において、フラグis
_extensionが1であると判定された場合、即ち、ＧＯＶ
ヘッダに、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの
情報が含まれている場合、ステップＳ４２に進み、ＩＶ
ＬＣ器２２は、そのＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各
ヘッダの情報を、必要なブロックに供給し、次のＧＯＶ
ヘッダが送信されてくるのを待って、ステップＳ４１に
戻る。If it is determined in step S41 that the flag is_extension is 1, that is, if the GOV header contains the header information of VS, VISO, VO, and VOL, the process proceeds to step S42, where the IVLC unit 22 supplies that header information of VS, VISO, VO, and VOL to the blocks that need it. The process then waits for the next GOV header to arrive and returns to step S41.
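
The decoder-side handling in steps S41 and S42 might look like the following sketch, which also illustrates why a decoder that joins the stream partway through can start decoding at the first GOV header whose flag is 1. The function names and the dict layout of a parsed header are hypothetical and chosen only for illustration.

```python
def handle_gov_header(header, state):
    """Steps S41 and S42: update the decoder parameters from one GOV header."""
    if header.get("is_extension") == 1:                      # S41
        for layer, info in header.get("upper", {}).items():
            state[layer] = dict(info)                        # S42: hand on the info
    return state


def can_decode(state):
    """True once information for every upper layer has been received."""
    return all(layer in state for layer in ("VS", "VISO", "VO", "VOL"))
```

A decoder starting with an empty state stays uninitialised while it sees GOV headers with is_extension equal to 0 and becomes ready as soon as one with is_extension equal to 1 arrives.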

【０１４１】次に、ＧＯＶは、図１４に示したシンタク
スの他、例えば、図１８に示すシンタクスのように構成
することも可能である。Besides the syntax shown in FIG. 14, the GOV can also be configured with, for example, the syntax shown in FIG. 18.

【０１４２】即ち、図１８は、ＧＯＶのシンタクスの他
の例を示している。なお、図１４と図１８とでは、brok
en_linkの下からnext_start_codeの上までの間が異なっ
ている。また、図１８において影を付してある部分が、
図８に示したＦＣＤにおけるシンタクスと異なる部分と
なっている。That is, FIG. 18 shows another example of the syntax of the GOV. FIG. 14 and FIG. 18 differ in the part from below broken_link to above next_start_code. The shaded portions in FIG. 18 are the portions that differ from the syntax in the FCD shown in FIG. 8.

【０１４３】図１４の実施の形態では、フラグis_exten
sionにより、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダ
の情報を、ＧＯＶヘッダにおいて伝送するかどうかだけ
が設定可能であったが、図１８の実施の形態では、フラ
グload_data_typeを採用することにより、ＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報すべてを、ＧＯＶヘ
ッダにおいて伝送するかどうかだけでなく、それらの情
報の一部のみを伝送するような設定も可能になってい
る。即ち、図１８の実施の形態では、ＶＳ，ＶＩＳＯ，
ＶＯ，ＶＯＬの各ヘッダの情報の一部だけを、ＧＯＶヘ
ッダに含ませることが可能であり、フラグload_data_ty
peによれば、そのＧＯＶヘッダに含ませる一部の情報を
識別することができるようになされている。In the embodiment of FIG. 14, the flag is_extension could only specify whether or not the header information of VS, VISO, VO, and VOL is transmitted in the GOV header. In the embodiment of FIG. 18, by adopting the flag load_data_type, it is possible to specify not only whether all of the header information of VS, VISO, VO, and VOL is transmitted in the GOV header, but also that only part of that information is transmitted. That is, in the embodiment of FIG. 18, only part of the header information of VS, VISO, VO, and VOL can be included in the GOV header, and the flag load_data_type identifies which part is included.

【０１４４】具体的には、図１８において、load_data_
typeは、可変長符号で、この直後に、ランダムアクセス
時にデコーダを初期化するための情報等を伝送するかど
うかと、伝送する場合には、その伝送する情報の種類を
示す。即ち、例えば、図１９に示すように、load_data_
typeが'1'のときには、ＧＯＶ層より上位階層のヘッダ
の情報は、ＧＯＶには含められない。また、load_data_
typeが'01'のときには、図１４の実施の形態においてフ
ラグis_extensionが１である場合と同様に、ＶＳ，ＶＩ
ＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報（VisualObjectSe
quence(), VisualObject(), VideoObject(), VideoObje
ctLayer()）のすべてが、ＧＯＶに含められる。さら
に、load_data_typeが'001'のときには、ＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報のうち、予め定めら
れた所定のパラメータの情報が、ＧＯＶに含められる。Specifically, in FIG. 18, load_data_type is a variable-length code that indicates whether information such as that for initializing the decoder at random access is transmitted immediately after it and, if so, the type of information transmitted. For example, as shown in FIG. 19, when load_data_type is '1', the header information of the layers higher than the GOV layer is not included in the GOV. When load_data_type is '01', all of the header information of VS, VISO, VO, and VOL (VisualObjectSequence(), VisualObject(), VideoObject(), VideoObjectLayer()) is included in the GOV, as when the flag is_extension is 1 in the embodiment of FIG. 14. Further, when load_data_type is '001', the information of predetermined parameters among the header information of VS, VISO, VO, and VOL is included in the GOV.
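
Since '1', '01', and '001' form a prefix code, load_data_type can be read bit by bit without knowing its length in advance. The following sketch shows one way to do so; the function name, the table of meanings, and the use of a '0'/'1' character string as the bit stream are assumptions made for illustration.

```python
LOAD_DATA_TYPES = {
    "1": "none",                    # no upper-layer header information in the GOV
    "01": "all",                    # all VS, VISO, VO and VOL header information
    "001": "download_parameters",   # only the predetermined parameters
}


def read_load_data_type(bits, pos=0):
    """Read one load_data_type code from a '0'/'1' string starting at pos.

    Returns (meaning, next_pos). The codes form a prefix code, so the first
    match is unambiguous.
    """
    code = ""
    while pos < len(bits) and len(code) < 3:
        code += bits[pos]
        pos += 1
        if code in LOAD_DATA_TYPES:
            return LOAD_DATA_TYPES[code], pos
    raise ValueError("unknown or truncated load_data_type code: %r" % code)
```

Defining a fourth case would amount to adding a longer code such as '0001' to the table and raising the length limit accordingly.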

【０１４５】即ち、図１８の実施の形態において、load
_data_typeが'001'のときにＧＯＶに含められる情報
は、download_parameters()として規定されている。That is, in the embodiment of FIG. 18, the information included in the GOV when load_data_type is '001' is defined as download_parameters().

【０１４６】ここで、本実施の形態では、download_par
ameters()は、例えば、図２０に示すように規定されて
いる。In the present embodiment, download_parameters() is defined, for example, as shown in FIG. 20.

【０１４７】図２０において、フラグobmc_disableは、
オーバーラップ動き補償を用いるかどうかを示す１ビッ
トのフラグである。この値が、'1'である場合には、オ
ーバーラップ動き補償は用いられず、'0'である場合に
は、オーバーラップ動き補償が用いられる。フラグquan
t_typeは、逆量子化の方法を示す１ビットのフラグであ
る。この値が'0'である場合には、H.263に規定されてい
る逆量子化方法を用いて逆量子化が行われ、'1'である
場合には、MPEG2に規定されている逆量子化方法を用い
て逆量子化が行われる。MPEG2に規定されている逆量子
化方法を用いる場合には、さらに、量子化マトリクスを
ダウンロードするかどうかを示すフラグが伝送される。
また、量子化マトリクスをダウンロードする場合には、
そのダウンロードする量子化マトリクスも伝送される。In FIG. 20, the flag obmc_disable is a 1-bit flag indicating whether overlap motion compensation is used. When its value is '1', overlap motion compensation is not used; when it is '0', overlap motion compensation is used. The flag quant_type is a 1-bit flag indicating the inverse quantization method. When its value is '0', inverse quantization is performed using the inverse quantization method specified in H.263; when it is '1', inverse quantization is performed using the inverse quantization method specified in MPEG2. When the MPEG2 inverse quantization method is used, a flag indicating whether a quantization matrix is downloaded is also transmitted, and when a quantization matrix is downloaded, that quantization matrix is transmitted as well.

【０１４８】その他、図２０のdownload_parameters()
において規定されているload_intra_quant_mat, intra_
quant_mat, load_nonintra_quant_mat, nonintra_quant
_mat,load_intra_quant_mat_grayscale, iontra_quant_
mat_grayscale, load_nonintra_quant_mat_grayscale,
nonintra_quant_mat_grayscaleのセマンティクスは、Ｆ
ＣＤにおけるＶＯＬ（図５乃至図７）で規定されている
同名のフラグのセマンティクスと同様である。In addition, the semantics of load_intra_quant_mat, intra_quant_mat, load_nonintra_quant_mat, nonintra_quant_mat, load_intra_quant_mat_grayscale, intra_quant_mat_grayscale, load_nonintra_quant_mat_grayscale, and nonintra_quant_mat_grayscale defined in download_parameters() of FIG. 20 are the same as the semantics of the flags of the same names defined for the VOL in the FCD (FIGS. 5 to 7).
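
As a rough illustration of how the fields of download_parameters() depend on one another, the following sketch parses a simplified version from a '0'/'1' string. It is not the syntax of FIG. 20: the field order (obmc_disable, then quant_type, then the matrix flags), the fixed size of 64 eight-bit entries per matrix, and the omission of the grayscale fields are all simplifying assumptions, and the function name is hypothetical.

```python
def parse_download_parameters(bits, pos=0):
    """Parse a simplified download_parameters() from a '0'/'1' string.

    Returns (params, next_pos).
    """
    def take(n):
        nonlocal pos
        if pos + n > len(bits):
            raise ValueError("bit string too short")
        value = int(bits[pos:pos + n], 2)
        pos += n
        return value

    params = {"obmc_disable": take(1), "quant_type": take(1)}
    # obmc_disable == 1 means overlap motion compensation is NOT used.
    params["use_obmc"] = params["obmc_disable"] == 0
    # quant_type 0: H.263 inverse quantization, 1: MPEG2 inverse quantization.
    params["inverse_quant"] = "MPEG2" if params["quant_type"] else "H.263"
    if params["quant_type"] == 1:
        for name in ("intra_quant_mat", "nonintra_quant_mat"):
            if take(1):                                       # load_<name> flag
                params[name] = [take(8) for _ in range(64)]   # assumed fixed size
    return params, pos
```

The point of the sketch is the conditional structure: the matrix flags are present only when quant_type selects the MPEG2 method, and a matrix is present only when its load flag is 1.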

【０１４９】なお、図１９の実施の形態では、フラグlo
ad_data_typeについて、３通りの場合しか規定していな
いが、４通り以上の場合を規定することも可能である。
この場合、図２０のdownload_parameters()で規定され
る情報の組み合わせとは異なる組み合わせの情報を、Ｇ
ＯＶヘッダに配置することが可能となる。In the embodiment of FIG. 19, only three cases are defined for the flag load_data_type, but four or more cases can also be defined. In that case, a combination of information different from the combination defined by download_parameters() in FIG. 20 can be placed in the GOV header.

【０１５０】図２１は、図１８に示したＧＯＶヘッダを
出力するエンコーダの一実施の形態の構成例を示してい
る。なお、図中、図１２における場合と対応する部分に
ついては、同一の符号を付してある。即ち、図２１のエ
ンコーダは、パーサ１７（選択手段）が新たに設けられ
ている他は、図１２における場合と同様に構成されてい
る。FIG. 21 shows a configuration example of an embodiment of an encoder that outputs the GOV header shown in FIG. 18. In the figure, portions corresponding to those in FIG. 12 are given the same reference numerals. That is, the encoder of FIG. 21 is configured in the same way as that of FIG. 12 except that a parser 17 (selection means) is newly provided.

【０１５１】パーサ（フラグ識別器）１７は、ＶＬＣ器
６が出力しようとしているＧＯＶヘッダについてのフラ
グload_data_typeを参照し、そのフラグload_data_type
にしたがって、バッファ１６から情報を読み出し、ＶＬ
Ｃ器６に供給する。ＶＬＣ器６では、パーサ１７から供
給される情報が、フラグload_data_typeとともに、ＧＯ
Ｖヘッダの図１８に示した所定の位置に配置されて出力
される。The parser (flag discriminator) 17 refers to the flag load_data_type for the GOV header that the VLC unit 6 is about to output, reads information from the buffer 16 in accordance with that flag, and supplies it to the VLC unit 6. The VLC unit 6 places the information supplied from the parser 17, together with the flag load_data_type, at the predetermined position of the GOV header shown in FIG. 18 and outputs it.

【０１５２】次に、図２２のフローチャートを参照し
て、図１８に示したようなシンタクスのGOVをＶＬＣ器
６に出力させるためのパーサ１７の処理について説明す
る。Next, the processing of the parser 17 for causing the VLC unit 6 to output a GOV with the syntax shown in FIG. 18 will be described with reference to the flowchart of FIG. 22.

【０１５３】ＶＬＣ器６は、ＧＯＶヘッダを出力するタ
イミングで、そのＧＯＶヘッダについてのフラグload_d
ata_typeを、パーサ１７に供給する。パーサ１７は、Ｖ
ＬＣ器６からのフラグload_data_typeを受信し、ステッ
プＳ５１において、その値を判定する。ステップＳ５１
において、フラグload_data_typeが１であると判定され
た場合、パーサ１７は、ＶＬＣ器６に対して、何も出力
せず、次のＧＯＶヘッダに配置されたフラグload_data_
typeが、ＶＬＣ器６から送信されてくるのを待って、ス
テップＳ５１に戻る。この場合、ＶＬＣ器６では、ＧＯ
Ｖヘッダに、本来配置すべき情報およびload_data_type
を配置し、その結果得られるＧＯＶヘッダを出力する。At the timing of outputting a GOV header, the VLC unit 6 supplies the flag load_data_type for that GOV header to the parser 17. The parser 17 receives the flag load_data_type from the VLC unit 6 and determines its value in step S51. If it is determined in step S51 that the flag load_data_type is 1, the parser 17 outputs nothing to the VLC unit 6, waits for the flag load_data_type to be placed in the next GOV header to arrive from the VLC unit 6, and returns to step S51. In this case, the VLC unit 6 places in the GOV header the information that originally belongs there and load_data_type, and outputs the resulting GOV header.

【０１５４】また、ステップＳ５１において、フラグlo
ad_data_typeが０１であると判定された場合、ステップ
Ｓ５２に進み、パーサ１７は、バッファ１６から、Ｖ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの最新の情報を
読み出し、ＶＬＣ器６に供給する。そして、次のＧＯＶ
ヘッダに配置されたフラグload_data_typeが、ＶＬＣ器
６から送信されてくるのを待って、ステップＳ５１に戻
る。従って、この場合、ＶＬＣ器６では、ＧＯＶヘッダ
に、本来配置すべき情報およびload_data_typeの他に、
ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報も配置
される。If it is determined in step S51 that the flag load_data_type is 01, the process proceeds to step S52, where the parser 17 reads the latest header information of VS, VISO, VO, and VOL from the buffer 16 and supplies it to the VLC unit 6. The parser 17 then waits for the flag load_data_type to be placed in the next GOV header to arrive from the VLC unit 6, and returns to step S51. In this case, therefore, the VLC unit 6 places in the GOV header the header information of VS, VISO, VO, and VOL in addition to the information that originally belongs there and load_data_type.

【０１５５】一方、ステップＳ５１において、フラグlo
ad_data_typeが００１であると判定された場合、ステッ
プＳ５３に進み、パーサ１７は、バッファ１６に記憶さ
れている情報のうち、図２０に示したdownload_paramet
ers()に含まれるものを選択して読み出し、ＶＬＣ器６
に供給する。そして、次のＧＯＶヘッダに配置されたフ
ラグload_data_typeが、ＶＬＣ器６から送信されてくる
のを待って、ステップＳ５１に戻る。従って、この場
合、ＶＬＣ器６では、ＧＯＶヘッダに、本来配置すべき
情報およびload_data_typeの他に、図２０に示したdown
load_parameters()も配置される。On the other hand, if it is determined in step S51 that the flag load_data_type is 001, the process proceeds to step S53, where the parser 17 selects and reads, from the information stored in the buffer 16, the items contained in download_parameters() shown in FIG. 20, and supplies them to the VLC unit 6. The parser 17 then waits for the flag load_data_type to be placed in the next GOV header to arrive from the VLC unit 6, and returns to step S51. In this case, therefore, the VLC unit 6 places in the GOV header download_parameters() shown in FIG. 20 in addition to the information that originally belongs there and load_data_type.
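
The selection performed by the parser 17 in steps S51 to S53 can be sketched as a single function. The function name, the dict layout assumed for the contents of the buffer 16, and the list of field names treated as belonging to download_parameters() are illustrative assumptions only.

```python
# Illustrative subset of the fields in download_parameters(); an assumption.
DOWNLOAD_PARAMETER_NAMES = ("obmc_disable", "quant_type",
                            "intra_quant_mat", "nonintra_quant_mat")


def parser17_select(load_data_type, buffer16):
    """Steps S51 to S53: choose what to hand to the VLC unit for one GOV header.

    buffer16 maps a layer name to a dict of that layer's latest header fields.
    """
    if load_data_type == "1":          # S51: nothing extra is supplied
        return None
    if load_data_type == "01":         # S52: all upper-layer header information
        return {layer: dict(fields) for layer, fields in buffer16.items()}
    if load_data_type == "001":        # S53: only the download_parameters() fields
        selected = {}
        for fields in buffer16.values():
            for name in DOWNLOAD_PARAMETER_NAMES:
                if name in fields:
                    selected[name] = fields[name]
        return selected
    raise ValueError("unsupported load_data_type: %r" % load_data_type)
```

Supporting a further load_data_type value would mean adding another branch that selects a different subset of the buffered fields.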

【０１５６】次に、図２１のエンコーダから、記録媒体
２０１または伝送媒体２０２を介して提供される符号化
ビットストリームは、図１６に示した構成のデコーダに
よってデコードすることができる。The encoded bit stream provided from the encoder of FIG. 21 via the recording medium 201 or the transmission medium 202 can be decoded by a decoder having the configuration shown in FIG. 16.

【０１５７】図２３は、図１６に示した構成のデコーダ
のＩＶＬＣ器２２が、図１８に示したシンタクスのＧＯ
Ｖヘッダに関して行う処理を説明するためのフローチャ
ートである。FIG. 23 is a flowchart for explaining the processing that the IVLC unit 22 of the decoder having the configuration shown in FIG. 16 performs on a GOV header with the syntax shown in FIG. 18.

【０１５８】ＩＶＬＣ器２２は、ＧＯＶヘッダを受信す
ると、そのＧＯＶヘッダについて、通常行うべき処理
（図８に示したＧＯＶヘッダが送信されてきたときに行
うべき処理）を行い、さらに、ステップＳ６１におい
て、ＧＯＶヘッダ（図１８）に配置されているフラグlo
ad_data_typeの値を判定する。ステップＳ６１におい
て、フラグload_data_typeが1であると判定された場
合、次のＧＯＶヘッダが送信されてくるのを待って、ス
テップＳ６１に戻る。Upon receiving a GOV header, the IVLC unit 22 performs the processing normally performed on a GOV header (the processing performed when the GOV header shown in FIG. 8 is received) and then, in step S61, determines the value of the flag load_data_type placed in the GOV header (FIG. 18). If it is determined in step S61 that the flag load_data_type is 1, the process waits for the next GOV header to arrive and returns to step S61.

【０１５９】また、ステップＳ６１において、フラグlo
ad_data_typeが０１であると判定された場合、即ち、Ｇ
ＯＶヘッダに、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報が含まれている場合、ステップＳ６２に進み、
ＩＶＬＣ器２２は、フラグload_data_typeに基づいて、
そのＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報
を、符号化ビットストリームから抽出し、必要なブロッ
クに供給する。即ち、その情報を可変長復号し、その結
果得られる、例えば、動きベクトル、予測モード、およ
びオーバーラップ動き補償を行うかどうかを示すフラグ
を、動き補償器２７に、また、量子化ステップおよび量
子化マトリクスを、逆量子化器２３に、それぞれ供給す
る。そして、次のＧＯＶヘッダが送信されてくるのを待
って、ステップＳ６１に戻る。If it is determined in step S61 that the flag load_data_type is 01, that is, if the GOV header contains the header information of VS, VISO, VO, and VOL, the process proceeds to step S62, where the IVLC unit 22, based on the flag load_data_type, extracts that header information of VS, VISO, VO, and VOL from the encoded bit stream and supplies it to the blocks that need it. That is, it variable-length decodes the information and supplies, for example, the resulting motion vector, prediction mode, and flag indicating whether to perform overlap motion compensation to the motion compensator 27, and the quantization step and quantization matrix to the inverse quantizer 23. The process then waits for the next GOV header to arrive and returns to step S61.

[0160] On the other hand, if load_data_type is determined in step S61 to be 001, that is, if the GOV header includes download_parameters() (parameter update information), the process proceeds to step S63, where the IVLC unit 22, based on the flag load_data_type, extracts download_parameters() from the encoded bit stream and supplies it to the blocks that need it. That is, download_parameters() is variable-length decoded, and of the results, for example, the flag indicating whether to perform overlap motion compensation is supplied to the motion compensator 27, while the quantization step and the quantization matrix are supplied to the inverse quantizer 23. The IVLC unit 22 then waits for the next GOV header to be transmitted and returns to step S61.
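The branching of steps S61 to S63 can be illustrated with a minimal Python sketch. It is not the patented implementation: the GOV header is assumed to be already variable-length decoded into a dictionary, and the names `handle_gov_header`, `DecoderState`, `payload`, and the field names in `MC_FIELDS` and `IQ_FIELDS` are hypothetical. Only the three load_data_type codes and the routing to the motion compensator 27 and the inverse quantizer 23 come from the text.

```python
from dataclasses import dataclass, field

# load_data_type codes as quoted in the text; modelled here as strings.
NO_EXTRA_DATA = "1"          # nothing embedded in the GOV header
UPPER_HEADERS = "01"         # VS, VISO, VO and VOL header information
DOWNLOAD_PARAMETERS = "001"  # download_parameters() (parameter updates)

# Hypothetical field names for the decoded values.
MC_FIELDS = ("motion_vector", "prediction_mode", "obmc_flag")
IQ_FIELDS = ("quant_step", "quant_matrix")


@dataclass
class DecoderState:
    motion_compensator: dict = field(default_factory=dict)  # stands in for block 27
    inverse_quantizer: dict = field(default_factory=dict)   # stands in for block 23


def handle_gov_header(gov: dict, state: DecoderState) -> str:
    """Route the extra data of one GOV header; return the step that handled it."""
    kind = gov["load_data_type"]
    if kind == NO_EXTRA_DATA:
        return "S61"  # nothing to extract; wait for the next GOV header
    if kind == UPPER_HEADERS:
        step = "S62"
    elif kind == DOWNLOAD_PARAMETERS:
        step = "S63"
    else:
        raise ValueError(f"unknown load_data_type: {kind!r}")
    payload = gov.get("payload", {})
    # Motion-related values go to the motion compensator.
    for name in MC_FIELDS:
        if name in payload:
            state.motion_compensator[name] = payload[name]
    # Quantization values go to the inverse quantizer.
    for name in IQ_FIELDS:
        if name in payload:
            state.inverse_quantizer[name] = payload[name]
    return step
```

Calling `handle_gov_header` once per received GOV header mirrors the loop back to step S61; a header whose code is 1 leaves the decoder state untouched.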

[0161] As described above, the GOV header includes all or part of the information of the headers of the higher layers VS, VISO, VO, and VOL (in this embodiment, download_parameters() shown in FIG. 20), so the encoded bit stream can be accessed at random, for example, and decoded normally from a midway point. Furthermore, the quantization step and the quantization matrix can be changed at the start of a GOV, and as a result efficient encoding can be performed.
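As an illustration of the random-access benefit, the following Python sketch looks for the first GOV at or after a seek position whose header carries embedded parameters. It assumes, as a simplification, that either kind of embedded data (code 01 or 001) is enough to initialize the decoder; the function name `first_decodable_gov` and the list-of-dictionaries model of the stream are hypothetical.

```python
from typing import Optional

# load_data_type codes that signal embedded upper-layer information.
CODES_WITH_PARAMETERS = ("01", "001")


def first_decodable_gov(govs: list, start: int) -> Optional[int]:
    """Return the index of the first GOV at or after `start` whose header
    carries embedded parameters, or None if there is no such GOV."""
    for index in range(start, len(govs)):
        if govs[index]["load_data_type"] in CODES_WITH_PARAMETERS:
            return index
    return None
```

A player that seeks into the middle of a stream could skip GOVs until this function finds an entry point, then start decoding there without having seen the VS, VISO, VO, or VOL headers at the start of the stream.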

[0162] The present invention has been described above as applied to an encoder/decoder that performs encoding/decoding based on MPEG4; however, the scope of the invention is not limited to encoding/decoding based on MPEG4.

[0163] In this embodiment, the information shown in FIG. 20 is included in the GOV header as download_parameters(); however, the information included in the GOV header as download_parameters() is not limited to that shown in FIG. 20.

[0164] Further, the encoders shown in FIGS. 12 and 21 and the decoder shown in FIG. 16 can be realized in hardware, or by having a computer or the like execute a program.

[0165] MPEG4 allows hierarchical coding for realizing scalability, but the present invention is applicable whether or not hierarchical coding is performed.

【０１６６】[0166]

[Effects of the Invention] As described above, according to the image encoding device and the image encoding method of the present invention, information of an upper layer header is included in a lower layer header of the encoded bit stream obtained by encoding an image. Efficient encoding therefore becomes possible.

[0167] According to the image decoding device and the image decoding method of the present invention, the information included in a lower layer header is extracted from an encoded bit stream in which information of an upper layer header is included in the lower layer header, and the encoded bit stream is decoded based on that information. Normal decoding can therefore be performed even from the middle of the encoded bit stream.

[0168] Further, according to the providing medium of the present invention, an encoded bit stream obtained by encoding an image and including information of an upper layer header in a lower layer header is provided. Normal decoding can therefore be performed even from the middle of that encoded bit stream.

[Brief description of the drawings]

FIG. 1 is a diagram showing the structure of an encoded bit stream defined by the MPEG4 standard FCD.

FIG. 2 is a diagram showing the syntax of the VS defined by the MPEG4 standard FCD.

FIG. 3 is a diagram showing the syntax of the VISO defined by the MPEG4 standard FCD.

FIG. 4 is a diagram showing the syntax of the VO defined by the MPEG4 standard FCD.

FIG. 5 is a diagram showing the syntax of the VOL defined by the MPEG4 standard FCD.

FIG. 6 is a diagram showing the syntax of the VOL defined by the MPEG4 standard FCD.

FIG. 7 is a diagram showing the syntax of the VOL defined by the MPEG4 standard FCD.

FIG. 8 is a diagram showing the syntax of the GOV defined by the MPEG4 standard FCD.

FIG. 9 is a diagram showing the syntax of the VOP defined by the MPEG4 standard FCD.

FIG. 10 is a diagram showing the syntax of the VOP defined by the MPEG4 standard FCD.

FIG. 11 is a diagram showing the syntax of the VOP defined by the MPEG4 standard FCD.

FIG. 12 is a block diagram showing a configuration example of an embodiment of an encoder to which the present invention is applied.

FIG. 13 is a flowchart for explaining the processing of the pixel replacement unit 15 of FIG. 12.

FIG. 14 is a diagram showing the syntax of the GOV output by the VLC unit 6 of FIG. 12.

FIG. 15 is a flowchart for explaining the processing of the VLC unit 6 of FIG. 12.

FIG. 16 is a block diagram showing a configuration example of an embodiment of a decoder to which the present invention is applied.

FIG. 17 is a flowchart for explaining the processing of the IVLC unit 22 of FIG. 16.

FIG. 18 is a diagram showing the syntax of the GOV output by the VLC unit 6 of FIG. 21.

FIG. 19 is a diagram for explaining load_data_type.

FIG. 20 is a diagram showing the syntax of download_parameters() of FIG. 18.

FIG. 21 is a block diagram showing a configuration example of another embodiment of an encoder to which the present invention is applied.

FIG. 22 is a flowchart for explaining the processing of the parser 17 of FIG. 21.

FIG. 23 is a flowchart for explaining the processing of the IVLC unit 22 of FIG. 16.

FIG. 24 is a block diagram showing the configuration of an example of a conventional encoder.

FIG. 25 is a block diagram showing the configuration of an example of a conventional decoder.

[Explanation of symbols]

1: frame memory (receiving means); 2: motion vector detector; 3: arithmetic unit; 4: DCT unit; 5: quantizer; 6: VLC unit (encoding means); 7: buffer; 8: inverse quantizer; 9: IDCT unit; 10: arithmetic unit; 11: frame memory; 12: motion compensator; 13: key signal encoder; 14: key signal decoder; 15: pixel replacement unit; 16: buffer; 17: parser (selection means); 21: buffer (receiving means); 22: IVLC unit (decoding means); 23: inverse quantizer; 24: IDCT unit; 25: arithmetic unit; 26: frame memory; 27: motion compensator; 28: pixel replacement unit; 29: key signal decoder; 201: recording medium; 202: transmission medium

Continued from front page. F-terms (reference): 5C053 FA21 FA23 FA27 GA11 GB19 GB21 GB26 GB29 GB32 GB38 KA04 LA14; 5C059 KK03 MA00 MA04 MA05 MA23 MA31 MB01 MB11 MB22 MB27 MC11 MC14 MC38 ME02 NN01 NN28 PP05 PP06 PP07 RC04 RC24 RC38 SS01 SS07 SS11 UA02 UA05 UA33 UA34 UA39

Claims

[Claims]

1. An image encoding device that encodes an image and outputs an encoded bit stream having a hierarchical structure including a plurality of layers, comprising: receiving means for receiving the image; and encoding means for encoding the image and outputting the encoded bit stream in which information of an upper layer header is included in a lower layer header.

2. The image encoding device according to claim 1, wherein the encoding means includes, in the lower layer header, that part of the information of the upper layer header which is needed to initialize parameters for decoding the encoded bit stream.

3. The image encoding device according to claim 1, wherein the encoding means places, in the lower layer header, a header information presence/absence flag indicating whether the information of the upper layer header is included, and includes the information of the upper layer header in the lower layer header only when the header information presence/absence flag indicates that the information of the upper layer header is included.

4. The image encoding device according to claim 1, wherein the encoding means places, in the lower layer header, an identification flag for identifying the information of the upper layer header included therein, the device further comprising selection means for selecting, according to the identification flag, the information of the upper layer header to be included in the lower layer header.

5. The image encoding device according to claim 1, wherein the lower layer is a layer that defines a group composed of one or more images, and, when its header includes time information on the display time of the image displayed first in the group, the encoding means places the information of the upper layer header after the time information.

6. The image encoding device according to claim 1, wherein, when the image is encoded by a method conforming to the MPEG (Moving Picture Experts Group) 4 standard, the encoding means includes, in the header of the GOV (Group of VOP (Video Object Plane)) layer, information of one or more of the headers of the VS (Visual Object Sequence) layer, the VISO (Visual Object) layer, the VO (Video Object) layer, and the VOL (Video Object Layer) layer.

7. The image encoding device according to claim 6, wherein the header of the GOV layer includes a flag indicating whether to use overlap motion compensation, a flag indicating the inverse quantization method, or a quantization matrix.

8. An image encoding method for encoding an image and outputting an encoded bit stream having a hierarchical structure including a plurality of layers, comprising: a receiving step of receiving the image; and an encoding step of encoding the image and outputting the encoded bit stream in which information of an upper layer header is included in a lower layer header.

9. An image decoding device that decodes an encoded bit stream obtained by encoding an image and having a hierarchical structure including a plurality of layers, comprising: receiving means for receiving the encoded bit stream in which information of an upper layer header is included in a lower layer header; and decoding means for extracting the information included in the lower layer header and decoding the encoded bit stream based on that information.

10. The image decoding device according to claim 9, wherein the lower layer header includes that part of the information of the upper layer header which is needed to initialize parameters for decoding the encoded bit stream.

11. The image decoding device according to claim 9, wherein the lower layer header includes a header information presence/absence flag indicating whether the information of the upper layer header is included, and the decoding means extracts the information of the upper layer header included in the lower layer header when the header information presence/absence flag indicates that the information of the upper layer header is included.

12. The image decoding device according to claim 9, wherein the lower layer header includes an identification flag for identifying the information of the upper layer header included therein, and the decoding means extracts, based on the identification flag, the information of the upper layer header included in the lower layer header.

13. The image decoding device according to claim 9, wherein the lower layer is a layer that defines a group composed of one or more images, and, when its header includes time information on the display time of the image displayed first in the group, the information of the upper layer header is placed after the time information.

14. The image decoding device according to claim 9, wherein, when the encoded bit stream is obtained by encoding the image by a method conforming to the MPEG (Moving Picture Experts Group) 4 standard, the header of the GOV (Group of VOP (Video Object Plane)) layer includes information of one or more of the headers of the VS (Visual Object Sequence) layer, the VISO (Visual Object) layer, the VO (Video Object) layer, and the VOL (Video Object Layer) layer.

15. The image decoding device according to claim 14, wherein the header of the GOV layer includes a flag indicating whether to use overlap motion compensation, a flag indicating the inverse quantization method, or a quantization matrix.

16. An image decoding method for decoding an encoded bit stream obtained by encoding an image and having a hierarchical structure including a plurality of layers, comprising: a receiving step of receiving the encoded bit stream in which information of an upper layer header is included in a lower layer header; and a decoding step of extracting the information included in the lower layer header and decoding the encoded bit stream based on that information.

17. A providing medium for providing an encoded bit stream obtained by encoding an image and having a hierarchical structure including a plurality of layers, the providing medium providing the encoded bit stream obtained by encoding the image and including information of an upper layer header in a lower layer header.

18. The providing medium according to claim 17, wherein the lower layer header includes that part of the information of the upper layer header which is needed to initialize parameters for decoding the encoded bit stream.

19. The providing medium according to claim 17, wherein the lower layer header includes a header information presence / absence flag indicating whether to include information of the upper layer header.

20. The providing medium according to claim 17, wherein the lower layer header includes an identification flag for identifying information of the upper layer header included therein.

21. The providing medium according to claim 17, wherein the lower layer is a layer that defines a group composed of one or more images, and, when its header includes time information on the display time of the image displayed first in the group, the information of the upper layer header is placed after the time information.

22. The providing medium according to claim 17, wherein, when the encoded bit stream is obtained by encoding the image by a method conforming to the MPEG (Moving Picture Experts Group) 4 standard, the header of the GOV (Group of VOP (Video Object Plane)) layer includes information of one or more of the headers of the VS (Visual Object Sequence) layer, the VISO (Visual Object) layer, the VO (Video Object) layer, and the VOL (Video Object Layer) layer.

23. The providing medium according to claim 22, wherein the header of the GOV layer includes a flag indicating whether to use overlap motion compensation, a flag indicating the inverse quantization method, or a quantization matrix.