JP3380981B2

JP3380981B2 - Image encoding device and image encoding method, image decoding device and image decoding method, and recording medium

Info

Publication number: JP3380981B2
Application number: JP17350098A
Authority: JP
Inventors: 輝彦鈴木; 陽一矢ヶ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-06-19
Filing date: 1998-06-19
Publication date: 2003-02-24
Anticipated expiration: 2018-06-19
Also published as: JP2000013791A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、画像符号化装置お
よび画像符号化方法、画像復号装置および画像復号方
法、並びに記録媒体に関する。特に、例えば、動画像デ
ータを、光磁気ディスクや磁気テープなどの記録媒体に
記録し、これを再生してディスプレイなどに表示した
り、テレビ会議システム、テレビ電話システム、放送用
機器、マルチメディアデータベース検索システムなどの
ように、動画像データを伝送路を介して送信側から受信
側に伝送し、受信側において、これを受信し、表示する
場合や、編集して記録する場合などに用いて好適な画像
符号化装置および画像符号化方法、画像復号装置および
画像復号方法、並びに記録媒体に関する。TECHNICAL FIELD The present invention relates to an image coding apparatus and an image coding method, an image decoding apparatus and an image decoding method, and a recording medium . In particular, for example, moving image data is recorded on a recording medium such as a magneto-optical disk or a magnetic tape and is reproduced and displayed on a display or the like, or a video conference system, a video telephone system, broadcasting equipment, a multimedia database. Suitable for transmitting moving image data from the transmitting side to the receiving side via a transmission line, such as a search system, and for receiving and displaying this at the receiving side, or for editing and recording. Image encoding device and image encoding method, image decoding device and image decoding method, and recording medium .

【０００２】[0002]

【従来の技術】例えば、テレビ会議システム、テレビ電
話システムなどのように、動画像データを遠隔地に伝送
するシステムにおいては、伝送路を効率良く利用するた
め、画像データを、そのライン相関やフレーム間相関を
利用して圧縮符号化するようになされている。2. Description of the Related Art In a system for transmitting moving image data to a remote place, such as a video conference system or a video telephone system, for example, the image data is line-correlated or framed in order to use the transmission line efficiently. The compression coding is performed by using the inter-correlation.

【０００３】動画像の高能率符号化方式として代表的な
ものとしては、MPEG（Moving Picture Experts Group）
（蓄積用動画像符号化）方式がある。これはＩＳＯ−Ｉ
ＥＣ／ＪＴＣ１／ＳＣ２／ＷＧ１１において議論され、
標準案として提案されたものであり、動き補償予測符号
化とＤＣＴ（Discrete Cosine Transform）符号化を組
み合わせたハイブリッド方式が採用されている。A typical example of a high-efficiency coding method for moving images is MPEG (Moving Picture Experts Group).
There is a (video encoding for storage) system. This is ISO-I
Discussed in EC / JTC1 / SC2 / WG11,
It has been proposed as a standard proposal, and a hybrid method in which motion compensation predictive coding and DCT (Discrete Cosine Transform) coding are combined is adopted.

【０００４】ＭＰＥＧでは、様々なアプリケーションや
機能に対応するために、いくつかのプロファイルおよび
レベルが定義されている。最も基本となるのが、メイン
プロファイルメインレベル（ＭＰ＠ＭＬ（Main Profile
at Main Level））である。In MPEG, several profiles and levels are defined in order to support various applications and functions. The most basic is the main profile main level (MP @ ML (Main Profile
at Main Level)).

【０００５】図２４は、ＭＰＥＧ方式におけるＭＰ＠Ｍ
Ｌのエンコーダの一例の構成を示している。FIG. 24 shows MP @ M in the MPEG system.
The structure of an example of the encoder of L is shown.

【０００６】符号化すべき画像データは、フレームメモ
リ３１に入力され、一時記憶される。そして、動きベク
トル検出器３２は、フレームメモリ３１に記憶された画
像データを、例えば、１６画素×１６画素などで構成さ
れるマクロブロック単位で読み出し、その動きベクトル
を検出する。Image data to be encoded is input to the frame memory 31 and temporarily stored. Then, the motion vector detector 32 reads out the image data stored in the frame memory 31, for example, in units of macroblocks composed of 16 pixels × 16 pixels, and detects the motion vector.

【０００７】ここで、動きベクトル検出器３２において
は、各フレームの画像データを、Ｉピクチャ、Ｐピクチ
ャ、またはＢピクチャのうちのいずれかとして処理す
る。なお、シーケンシャルに入力される各フレームの画
像を、Ｉ，Ｐ，Ｂピクチャのいずれのピクチャとして処
理するかは、予め定められている（例えば、Ｉ，Ｂ，
Ｐ，Ｂ，Ｐ，・・・Ｂ，Ｐとして処理される）。Here, the motion vector detector 32 processes the image data of each frame as one of an I picture, a P picture, and a B picture. It should be noted that which of I, P, and B pictures is to be processed as an image of each frame that is sequentially input is predetermined (for example, I, B, or B).
Processed as P, B, P, ... B, P).

【０００８】即ち、動きベクトル検出器３２は、フレー
ムメモリ３１に記憶された画像の中の、予め定められた
所定の参照フレームを参照し、その参照フレームと、現
在符号化の対象となっているフレームの１６画素×１６
ラインの小ブロック（マクロブロック）とをパターンマ
ッチング（ブロックマッチング）することにより、その
マクロブロックの動きベクトルを検出する。That is, the motion vector detector 32 refers to a predetermined reference frame in the image stored in the frame memory 31, and the reference frame and the current encoding target. 16 pixels of frame x 16
By performing pattern matching (block matching) with a small block (macroblock) of a line, the motion vector of that macroblock is detected.

【０００９】ここで、ＭＰＥＧにおいては、画像の予測
モードには、イントラ符号化（フレーム内符号化）、前
方予測符号化、後方予測符号化、両方向予測符号化の４
種類があり、Ｉピクチャはイントラ符号化され、Ｐピク
チャはイントラ符号化または前方予測符号化され、Ｂピ
クチャはイントラ符号化、前方予測符号化、後方予測符
号化、または両方法予測符号化される。Here, in MPEG, there are four image prediction modes: intra coding (intra-frame coding), forward predictive coding, backward predictive coding, and bidirectional predictive coding.
There are types, I pictures are intra coded, P pictures are intra coded or forward predictive coded, and B pictures are intra coded, forward predictive coded, backward predictive coded, or both methods predictive coded. .

【００１０】即ち、動きベクトル検出器３２は、Ｉピク
チャについては、予測モードとしてイントラ符号化モー
ドを設定する。この場合、動きベクトル検出器３２は、
動きベクトルの検出は行わず、予測モード（イントラ予
測モード）を、ＶＬＣ（可変長符号化）器３６および動
き補償器４２に出力する。That is, the motion vector detector 32 sets the intra coding mode as the prediction mode for the I picture. In this case, the motion vector detector 32
The motion vector is not detected, and the prediction mode (intra prediction mode) is output to the VLC (variable length coding) unit 36 and the motion compensator 42.

【００１１】また、動きベクトル検出器３２は、Ｐピク
チャについては、前方予測を行い、その動きベクトルを
検出する。さらに、動きベクトル検出器３２は、前方予
測を行うことにより生じる予測誤差と、符号化対象のマ
クロブロック（Ｐピクチャのマクロブロック）の、例え
ば分散とを比較し、マクロブロックの分散の方が予測誤
差より小さい場合、予測モードとしてイントラ符号化モ
ードを設定し、ＶＬＣ器３６および動き補償器４２に出
力する。また、動きベクトル検出器３２は、前方予測を
行うことにより生じる予測誤差の方が小さければ、予測
モードとして前方予測符号化モードを設定し、検出した
動きベクトルとともに、ＶＬＣ器３６および動き補償器
４２に出力する。The motion vector detector 32 also performs forward prediction for P pictures to detect the motion vector. Further, the motion vector detector 32 compares a prediction error caused by performing forward prediction with, for example, the variance of the macroblock to be encoded (macroblock of P picture), and the variance of the macroblock is predicted. If the difference is smaller than the error, the intra coding mode is set as the prediction mode and output to the VLC unit 36 and the motion compensator 42. Further, if the prediction error caused by performing the forward prediction is smaller, the motion vector detector 32 sets the forward prediction coding mode as the prediction mode, and the detected motion vector, the VLC unit 36, and the motion compensator 42 are set. Output to.

【００１２】さらに、動きベクトル検出器３２は、Ｂピ
クチャについては、前方予測、後方予測、および両方向
予測を行い、それぞれの動きベクトルを検出する。そし
て、動きベクトル検出器３２は、前方予測、後方予測、
および両方向予測についての予測誤差の中の最小のもの
（以下、適宜、最小予測誤差という）を検出し、その最
小予測誤差と、符号化対象のマクロブロック（Ｂピクチ
ャのマクロブロック）の、例えば分散とを比較する。そ
の比較の結果、マクロブロックの分散の方が最小予測誤
差より小さい場合、動きベクトル検出器３２は、予測モ
ードとしてイントラ符号化モードを設定し、ＶＬＣ器３
６および動き補償器４２に出力する。また、動きベクト
ル検出器３２は、最小予測誤差の方が小さければ、予測
モードとして、その最小予測誤差が得られた予測モード
を設定し、対応する動きベクトルとともに、ＶＬＣ器３
６および動き補償器４２に出力する。Further, the motion vector detector 32 performs forward prediction, backward prediction, and bidirectional prediction for the B picture, and detects each motion vector. Then, the motion vector detector 32 uses forward prediction, backward prediction,
And a minimum prediction error (hereinafter, appropriately referred to as a minimum prediction error) among the prediction errors for bidirectional prediction, and the minimum prediction error and, for example, the variance of the macroblock to be encoded (the macroblock of the B picture). Compare with. As a result of the comparison, when the variance of the macroblock is smaller than the minimum prediction error, the motion vector detector 32 sets the intra coding mode as the prediction mode, and the VLC unit 3
6 and the motion compensator 42. If the minimum prediction error is smaller, the motion vector detector 32 sets the prediction mode in which the minimum prediction error is obtained as the prediction mode, and the VLC unit 3 with the corresponding motion vector.
6 and the motion compensator 42.

【００１３】動き補償器４２は、動きベクトル検出器３
２から予測モードと動きベクトルの両方を受信すると、
その予測モードおよび動きベクトルにしたがって、フレ
ームメモリ４１に記憶されている、符号化され、既に局
所復号された画像データを読み出し、これを、予測画像
として、演算器３３および４０に供給する。The motion compensator 42 is a motion vector detector 3
When receiving both the prediction mode and the motion vector from 2,
According to the prediction mode and the motion vector, the encoded and already locally decoded image data stored in the frame memory 41 is read out, and this is supplied to the computing units 33 and 40 as a predicted image.

【００１４】演算器３３は、動きベクトル検出器３２が
フレームメモリ３１から読み出した画像データと同一の
マクロブロックを、フレームメモリ３１から読み出し、
そのマクロブロックと、動き補償器４２からの予測画像
との差分を演算する。この差分値は、ＤＣＴ器３４に供
給される。The calculator 33 reads from the frame memory 31 the same macroblock as the image data read from the frame memory 31 by the motion vector detector 32,
The difference between the macroblock and the predicted image from the motion compensator 42 is calculated. This difference value is supplied to the DCT device 34.

【００１５】一方、動き補償器４２は、動きベクトル検
出器３２から予測モードのみを受信した場合、即ち、予
測モードがイントラ符号化モードである場合には、予測
画像を出力しない。この場合、演算器３３（後述する演
算器４０も同様）は、特に処理を行わず、フレームメモ
リ３１から読み出したマクロブロックを、そのままＤＣ
Ｔ器３４に出力する。On the other hand, the motion compensator 42 does not output the predicted image when it receives only the prediction mode from the motion vector detector 32, that is, when the prediction mode is the intra coding mode. In this case, the arithmetic unit 33 (similarly to the arithmetic unit 40 to be described later) does not perform any processing, and the macroblock read from the frame memory 31 is directly processed by the DC
Output to the T unit 34.

【００１６】ＤＣＴ器３４では、演算器３３の出力に対
して、ＤＣＴ処理が施され、その結果得られるＤＣＴ係
数が、量子化器３５に供給される。量子化器３５では、
バッファ３７のデータ蓄積量（バッファ３７に記憶され
ているデータの量）（バッファフィードバック）に対応
して量子化ステップ（量子化スケール）が設定され、そ
の量子化ステップで、ＤＣＴ器３４からのＤＣＴ係数が
量子化される。この量子化されたＤＣＴ係数（以下、適
宜、量子化係数という）は、設定された量子化ステップ
とともに、ＶＬＣ器３６に供給される。In the DCT unit 34, the output of the arithmetic unit 33 is subjected to DCT processing, and the resulting DCT coefficient is supplied to the quantizer 35. In the quantizer 35,
A quantization step (quantization scale) is set corresponding to the amount of data accumulated in the buffer 37 (the amount of data stored in the buffer 37) (buffer feedback), and the DCT from the DCT unit 34 is set at the quantization step. The coefficients are quantized. The quantized DCT coefficient (hereinafter, appropriately referred to as a quantized coefficient) is supplied to the VLC unit 36 together with the set quantization step.

【００１７】ＶＬＣ器３６では、量子化器３５より供給
される量子化係数が、例えばハフマン符号などの可変長
符号に変換され、バッファ３７に出力される。さらに、
ＶＬＣ器３６は、量子化器３５からの量子化ステップ、
動きベクトル検出器３２からの予測モード（イントラ符
号化（画像内予測符号化）、前方予測符号化、後方予測
符号化、または両方向予測符号化のうちのいずれが設定
されたかを示すモード）および動きベクトルも可変長符
号化し、バッファ３７に出力する。In the VLC unit 36, the quantized coefficient supplied from the quantizer 35 is converted into a variable length code such as Huffman code, and output to the buffer 37. further,
The VLC unit 36 uses the quantization step from the quantizer 35,
Prediction mode from motion vector detector 32 (mode indicating which of intra coding (intra-picture predictive coding), forward predictive coding, backward predictive coding, or bidirectional predictive coding) is set and motion The vector is also variable-length coded and output to the buffer 37.

【００１８】バッファ３７は、ＶＬＣ器３６からのデー
タを一時蓄積し、そのデータ量を平滑化して、例えば、
図示せぬ伝送路に出力し、または記録媒体に記録する。The buffer 37 temporarily stores the data from the VLC device 36, smooths the data amount, and, for example,
The data is output to a transmission path (not shown) or recorded on a recording medium.

【００１９】また、バッファ３７は、そのデータ蓄積量
を、量子化器３５に出力しており、量子化器３５は、こ
のバッファ３７からのデータ蓄積量にしたがって量子化
ステップを設定する。即ち、量子化器３５は、バッファ
３７がオーバーフローしそうなとき、量子化ステップを
大きくし、これにより、量子化係数のデータ量を低下さ
せる。また、量子化器３５は、バッファ３７がアンダー
フローしそうなとき、量子化ステップを小さくし、これ
により、量子化係数のデータ量を増大させる。このよう
にして、バッファ３７のオーバフローとアンダフローを
防止するようになっている。The buffer 37 also outputs the data storage amount to the quantizer 35, and the quantizer 35 sets the quantization step in accordance with the data storage amount from the buffer 37. That is, the quantizer 35 increases the quantization step when the buffer 37 is about to overflow, thereby reducing the data amount of the quantization coefficient. The quantizer 35 reduces the quantization step when the buffer 37 is likely to underflow, thereby increasing the data amount of the quantization coefficient. In this way, overflow and underflow of the buffer 37 are prevented.

【００２０】量子化器３５が出力する量子化係数と量子
化ステップは、ＶＬＣ器３６だけでなく、逆量子化器３
８にも供給されるようになされている。逆量子化器３８
では、量子化器３５からの量子化係数が、同じく量子化
器３５からの量子化ステップにしたがって逆量子化さ
れ、これによりＤＣＴ係数に変換される。このＤＣＴ係
数は、ＩＤＣＴ器（逆ＤＣＴ器）３９に供給される。Ｉ
ＤＣＴ器３９では、ＤＣＴ係数が逆ＤＣＴ処理され、演
算器４０に供給される。The quantizing coefficient and the quantizing step output by the quantizer 35 are not limited to those of the VLC unit 36, but also of the inverse quantizer 3.
It is designed to be supplied to 8 as well. Inverse quantizer 38
Then, the quantized coefficient from the quantizer 35 is inversely quantized in accordance with the quantization step from the quantizer 35, and converted into the DCT coefficient. The DCT coefficient is supplied to the IDCT device (inverse DCT device) 39. I
In the DCT unit 39, the DCT coefficient is subjected to inverse DCT processing and supplied to the arithmetic unit 40.

【００２１】演算器４０には、ＩＤＣＴ器３９の出力の
他、上述したように、動き補償器４２から、演算器３３
に供給されている予測画像と同一のデータが供給されて
おり、演算器４０は、ＩＤＣＴ器３９からの信号（予測
残差）と、動き補償器４２からの予測画像とを加算する
ことで、元の画像を、局所復号する（但し、予測モード
がイントラ符号化である場合には、ＩＤＣＴ器３９の出
力は、演算器４０をスルーして、フレームメモリ４１に
供給される）。なお、この復号画像は、受信側において
得られる復号画像と同一のものである。In addition to the output of the IDCT device 39, the operation device 40 outputs the motion compensator 42 to the operation device 33 as described above.
Is supplied with the same data as the predicted image that is supplied to the arithmetic unit 40, and the arithmetic unit 40 adds the signal (prediction residual) from the IDCT unit 39 and the predicted image from the motion compensator 42, The original image is locally decoded (however, when the prediction mode is intra coding, the output of the IDCT unit 39 is passed through the arithmetic unit 40 and supplied to the frame memory 41). The decoded image is the same as the decoded image obtained on the receiving side.

【００２２】演算器４０において得られた復号画像（局
所復号画像）は、フレームメモリ４１に供給されて記憶
され、その後、インター符号化（前方予測符号化、後方
予測符号化、量方向予測符号化）される画像に対する参
照画像（参照フレーム）として用いられる。The decoded image (locally decoded image) obtained by the arithmetic unit 40 is supplied to and stored in the frame memory 41, and then inter-coded (forward predictive coding, backward predictive coding, quantity direction predictive coding). ) Is used as a reference image (reference frame) for the image.

【００２３】次に、図２５は、図２４のエンコーダから
出力される符号化データを復号する、ＭＰＥＧにおける
ＭＰ＠ＭＬのデコーダの一例の構成を示している。Next, FIG. 25 shows an example of the structure of an MP @ ML decoder in MPEG for decoding the encoded data output from the encoder of FIG.

【００２４】伝送路を介して伝送されてきた符号化デー
タが図示せぬ受信装置で受信され、または記録媒体に記
録された符号化データが図示せぬ再生装置で再生され、
バッファ１０１に供給されて記憶される。The encoded data transmitted through the transmission path is received by a receiving device (not shown), or the encoded data recorded on the recording medium is reproduced by a reproducing device (not shown),
The data is supplied to and stored in the buffer 101.

【００２５】ＩＶＬＣ器（逆ＶＬＣ器）（可変長復号
器）１０２は、バッファ１０１に記憶された符号化デー
タを読み出し、可変長復号することで、その符号化デー
タを、動きベクトル、予測モード、量子化ステップ、お
よび量子化係数に分離する。これらのうち、動きベクト
ルおよび予測モードは動き補償器１０７に供給され、量
子化ステップおよび量子化係数は逆量子化器１０３に供
給される。The IVLC unit (inverse VLC unit) (variable length decoder) 102 reads the coded data stored in the buffer 101 and performs variable length decoding to convert the coded data into a motion vector, a prediction mode, The quantization step and the quantization coefficient are separated. Of these, the motion vector and the prediction mode are supplied to the motion compensator 107, and the quantization step and the quantization coefficient are supplied to the inverse quantizer 103.

【００２６】逆量子化器１０３は、ＩＶＬＣ器１０２よ
り供給された量子化係数を、同じくＩＶＬＣ器１０２よ
り供給された量子化ステップにしたがって逆量子化し、
その結果得られるＤＣＴ係数を、ＩＤＣＴ器１０４に出
力する。ＩＤＣＴ器１０４は、逆量子化器１０３からの
ＤＣＴ係数を逆ＤＣＴし、演算器１０５に供給する。The inverse quantizer 103 inversely quantizes the quantized coefficient supplied from the IVLC unit 102 in accordance with the quantization step also supplied from the IVLC unit 102,
The resulting DCT coefficient is output to the IDCT device 104. The IDCT unit 104 performs inverse DCT on the DCT coefficient from the inverse quantizer 103, and supplies it to the arithmetic unit 105.

【００２７】演算器１０５には、ＩＤＣＴ器１０４の出
力の他、動き補償器１０７の出力も供給されている。即
ち、動き補償器１０７は、フレームメモリ１０６に記憶
されている、既に復号された画像を、図２４の動き補償
器４１における場合と同様に、ＩＶＬＣ器１０２からの
動きベクトルおよび予測モードにしたがって読み出し、
予測画像として、演算器１０５に供給する。演算器１０
５は、ＩＤＣＴ器１０４からの信号（予測残差）と、動
き補償器１０７からの予測画像とを加算することで、元
の画像を復号する。この復号画像は、フレームメモリ１
０６に供給されて記憶される。なお、ＩＤＣＴ器１０４
の出力が、イントラ符号化されたものである場合には、
その出力は、演算器１０５をスルーして、そのままフレ
ームメモリ１０６に供給されて記憶される。The output of the IDCT unit 104 and the output of the motion compensator 107 are also supplied to the arithmetic unit 105. That is, the motion compensator 107 reads the already-decoded image stored in the frame memory 106 according to the motion vector and the prediction mode from the IVLC unit 102, as in the case of the motion compensator 41 of FIG. ,
The prediction image is supplied to the arithmetic unit 105. Calculator 10
5 decodes the original image by adding the signal (prediction residual) from the IDCT unit 104 and the predicted image from the motion compensator 107. This decoded image is stored in the frame memory 1
06 and stored. The IDCT device 104
If the output of is intra-coded, then
The output passes through the arithmetic unit 105, is supplied to the frame memory 106 as it is, and is stored therein.

【００２８】フレームメモリ１０６に記憶された復号画
像は、その後に復号される画像の参照画像として用いら
れるとともに、適宜読み出され、例えば、図示せぬディ
スプレイなどに供給されて表示される。The decoded image stored in the frame memory 106 is used as a reference image of an image to be subsequently decoded, and is appropriately read and supplied to, for example, a display not shown for display.

【００２９】なお、ＭＰＥＧ１および２では、Ｂピクチ
ャは、参照画像として用いられないため、エンコーダま
たはデコーダそれぞれにおいて、フレームメモリ４１
（図２４）または１０６（図２５）には記憶されない。In MPEG1 and MPEG2, the B picture is not used as a reference image, so that the frame memory 41 is used in each of the encoder and the decoder.
(FIG. 24) or 106 (FIG. 25).

【００３０】[0030]

【発明が解決しようとする課題】以上の図２４、図２５
に示したエンコーダ、デコーダは、ＭＰＥＧ１／２の規
格に準拠したものであるが、現在、画像を構成する物体
などのオブジェクトのシーケンスであるＶＯ（Video Ob
ject）単位で符号化を行う方式につき、ＩＳＯ−ＩＥＣ
／ＪＴＣ１／ＳＣ２９／ＷＧ１１において、ＭＰＥＧ
（Moving Picture Experts Group）４として標準化作業
が進められている。[Problems to be Solved by the Invention]
The encoders and decoders shown in (1) are compliant with the MPEG1 / 2 standard, but are currently VO (Video Ob) which is a sequence of objects such as objects forming an image.
ISO-IEC for the method of encoding in units of
In / JTC1 / SC29 / WG11, MPEG
(Moving Picture Experts Group) 4 is in the process of standardization.

【００３１】ところで、ＭＰＥＧ４については、主とし
て、通信の分野で利用されるものとして、標準化作業が
進められていたため、ＭＰＥＧ１／２において規定され
ているＧＯＰ（Group Of Picture）は、ＭＰＥＧ４では
規定されておらず、従って、ＭＰＥＧ４が蓄積メディア
に利用された場合には、効率的なランダムアクセスが困
難になることが予想される。Meanwhile, with regard to MPEG4, since standardization work has been advanced mainly for use in the field of communication, GOP (Group Of Picture) defined in MPEG1 / 2 is defined in MPEG4. Therefore, if MPEG4 is used as a storage medium, it is expected that efficient random access will be difficult.

【００３２】このため、本件出願人は、効率的なランダ
ムアクセスを可能とするために、ＭＰＥＧ１／２で規定
されているＧＯＰに相当するＧＯＶ（Group Of VOP)層
の導入を、特願平１０−８０７５８号において先に提案
しており、また、ＭＰＥＧ４において、このＧＯＶ層が
導入された。Therefore, in order to enable efficient random access, the applicant of the present invention has introduced a GOV (Group Of VOP) layer corresponding to GOP defined by MPEG1 / 2 in Japanese Patent Application No. -80758, previously proposed, and in MPEG4 this GOV layer was introduced.

【００３３】ところで、例えば、ＭＰＥＧ１，２，４，
Ｈ．２６３などの規格に準拠して符号化を行うことによ
り得られる符号化ビットストリームは、複数の階層から
なる階層構造を有している。そして、エンコーダ側で
は、各階層には、デコードに必要な情報が、ヘッダに配
置され、デコーダ側では、各階層のヘッダから必要な情
報が抽出され、符号化ビットストリームの復号が行われ
る。By the way, for example, MPEG 1, 2, 4,
H. An encoded bit stream obtained by performing encoding in accordance with a standard such as H.263 has a hierarchical structure including a plurality of layers. Then, on the encoder side, the information necessary for decoding is arranged in the header in each layer, and on the decoder side, the necessary information is extracted from the header of each layer, and the encoded bit stream is decoded.

【００３４】従って、ＭＰＥＧ１／２では、ＧＯＰにラ
ンダムアクセスした場合に、そのＧＯＰの復号を行うた
めに上位階層のヘッダの情報が必要となることがあるこ
とから、上位階層の送信後に、適宜、その上位階層のヘ
ッダの情報を再送することが可能な規格となっている。Therefore, in MPEG1 / 2, when the GOP is randomly accessed, the information of the header of the upper layer may be necessary for decoding the GOP. It is a standard that can retransmit the information of the header of the upper layer.

【００３５】しかしながら、ＭＰＥＧ４では、上位階層
の送信後に、適宜、その上位階層のヘッダの情報を再送
することか可能な規格になっておらず、このため、ＧＯ
Ｖ層の導入により、効率的なランダムアクセスが可能と
なっても、そのＧＯＶの復号を行うために必要な上位階
層のヘッダの情報が得られず、これにより、正常な復号
結果を得られないおそれがある。However, in MPEG4, there is no standard that it is possible to appropriately retransmit the information of the header of the upper layer after the transmission of the upper layer. Therefore, GO
Even if efficient random access becomes possible by introducing the V layer, the information of the upper layer header necessary for decoding the GOV cannot be obtained, and thus the normal decoding result cannot be obtained. There is a risk.

【００３６】ここで、ＭＰＥＧ４の符号化ビットストリ
ームが、蓄積メディアに記録されている場合には、その
記録メディアにアクセスすることで、上位階層のヘッダ
の情報を得ることが可能であるが、符号化ビットストリ
ームが放送等される場合には、その符号化ビットストリ
ームを最初から受信しない限りは、上位階層のヘッダの
情報が得られないことになり、従って、符号化ビットス
トリームの受信を、その途中から開始した場合には、正
常な復号結果を得られないおそれがある。Here, when the MPEG4 coded bit stream is recorded in the storage medium, it is possible to obtain the information of the header of the upper layer by accessing the recording medium. When a coded bitstream is broadcast, etc., unless the coded bitstream is received from the beginning, the information of the upper layer header cannot be obtained. If it starts in the middle, there is a possibility that a normal decoding result cannot be obtained.

【００３７】本発明は、このような状況に鑑みてなされ
たものであり、符号化ビットストリームの途中からで
も、正常な復号を行うことができるようにするものであ
る。The present invention has been made in view of such a situation, and makes it possible to perform normal decoding even in the middle of an encoded bit stream.

【００３８】[0038]

【課題を解決するための手段】本発明の画像符号化装置
は、符号化ビットストリームの上位階層のヘッダ情報を
記憶部に記憶させる記憶手段と、符号化ビットストリー
ムの下位階層のヘッダに配置されている識別フラグに応
じた上位階層のヘッダ情報を記憶部から読み出し、それ
を下位階層のヘッダに配置する配置手段と、配置手段に
より、記憶部から読み出された上位階層のヘッダ情報が
下位階層のヘッダに配置された符号化ビットストリーム
を出力する出力手段とを備え、識別フラグは少なくと
も、上位階層のヘッダ情報を何も伝送しない、上位階層
のヘッダ情報を全て伝送する、若しくは、上位階層のヘ
ッダ情報のうち所定のヘッダ情報のみを伝送する、の３
通りを規定することを特徴とする。An image coding apparatus according to the present invention is arranged in a storage unit for storing header information of an upper layer of a coded bitstream in a storage unit and a header of a lower layer of the coded bitstream. The header information of the upper layer corresponding to the identification flag is read from the storage unit and arranged in the header of the lower layer, and the header information of the upper layer read from the storage unit by the arrangement unit is the lower layer. and output means for outputting disposed in the header encoded bit stream, the identification flag is small
Also transmits no header information of the upper layer, the upper layer
All header information of the
Of the header information, only certain header information is transmitted, 3
It is characterized by defining a street .

【００３９】ヘッダ情報には、オーバーラップ動き補償
を用いるか否かを示すパラメータ、または逆量子化の方
法を表すパラメーを含ませることができる。 Overlap motion compensation is included in the header information.
Parameter indicating whether to use or dequantization
You can include parameters that represent the law.

【００４０】記憶手段は、上位階層の最新のヘッダ情報
を記憶部に記憶させることができる。 The storage means stores the latest header information of the upper hierarchy.
Can be stored in the storage unit.

【００４１】記憶部に記憶されている上位階層のヘッダ
情報を変更する変更手段をさらに設けることができる。 Upper layer header stored in the storage unit
Change means for changing the information can be further provided.

【００４２】本発明の画像符号化方法は、符号化ビット
ストリームの上位階層のヘッダ情報を記憶部に記憶させ
る記憶ステップと、符号化ビットストリームの下位階層
のヘッダに配置されている識別フラグに応じた上位階層
のヘッダ情報を記憶部から読み出し、それを下位階層の
ヘッダに配置する配置ステップと、配置ステップの処理
で記憶部から読み出された上位階層のヘッダ情報が下位
階層のヘッダに配置された符号化ビットストリームを出
力する出力ステップとを含み、識別フラグは少なくと
も、上位階層のヘッダ情報を何も伝送しない、上位階層
のヘッダ情報を全て伝送する、若しくは、上位階層のヘ
ッダ情報のうち所定のヘッダ情報のみを伝送する、の３
通りを規定することを特徴とする。The image coding method of the present invention is responsive to the storage step of storing the header information of the upper layer of the coded bitstream in the storage section and the identification flag arranged in the header of the lower layer of the coded bitstream. The placement step of reading the header information of the upper layer from the storage unit and placing it in the header of the lower layer, and the header information of the upper layer read from the storage unit in the processing of the placement step is placed in the header of the lower layer. saw including an output step of outputting a coded bit stream, the identification flag is small
Also transmits no header information of the upper layer, the upper layer
All header information of the
Of the header information, only certain header information is transmitted, 3
It is characterized by defining a street .

【００４３】本発明の画像符号化装置および方法におい
ては、符号化ビットストリームの上位階層のヘッダ情報
が記憶部に記憶され、符号化ビットストリームの下位階
層のヘッダに配置されている識別フラグに応じた上位階
層のヘッダ情報が記憶部から読み出され、それが下位階
層のヘッダに配置され、記憶部から読み出された上位階
層のヘッダ情報が下位階層のヘッダに配置された符号化
ビットストリームが出力され、識別フラグには少なくと
も、上位階層のヘッダ情報を何も伝送しない、上位階層
のヘッダ情報を全て伝送する、若しくは、上位階層のヘ
ッダ情報のうち所定のヘッダ情報のみを伝送する、の３
通りが規定されている。In the image coding apparatus and method of the present invention, the header information of the upper layer of the coded bitstream is stored in the storage unit, and the header information of the lower layer of the coded bitstream is stored according to the identification flag. The upper-layer header information read from the storage unit is placed in the lower-layer header, and the upper-layer header information read from the storage unit is placed in the lower-layer header. Output and at least the identification flag
Also transmits no header information of the upper layer, the upper layer
All header information of the
Of the header information, only certain header information is transmitted, 3
The street is regulated .

【００４４】本発明の画像復号装置は、識別フラグに基
づいて、符号化ビットストリームの下位階層のヘッダに
含まれる上位階層のヘッダ情報を抽出する抽出手段と、
抽出手段により抽出されたヘッダ情報に基づいて、符号
化ビットストリームを復号する復号手段とを備え、識別
フラグは少なくとも、上位階層のヘッダ情報を何も伝送
しない、上位階層のヘッダ情報を全て伝送する、若しく
は、上位階層のヘッダ情報のうち所定のヘッダ情報のみ
を伝送する、の３通りを規定することを特徴とする。The image decoding apparatus of the present invention comprises an extracting means for extracting the header information of the upper layer contained in the header of the lower layer of the encoded bitstream based on the identification flag,
Decoding means for decoding the coded bit stream based on the header information extracted by the extracting means ,
The flag transmits at least no upper layer header information
No, all upper layer header information is transmitted,
Is only the specified header information in the upper-layer header information
Is specified .

【００４５】ヘッダ情報には、オーバーラップ動き補償
を用いるか否かを示すパラメータ、または逆量子化の方
法を表すパラメータが含ませることができる。The header information can include a parameter indicating whether or not the overlap motion compensation is used, or a parameter indicating a dequantization method.

【００４６】本発明の画像復号方法は、識別フラグに基
づいて、符号化ビットストリームの下位階層のヘッダに
含まれる上位階層のヘッダ情報を抽出する抽出ステップ
と、抽出ステップの処理で抽出されたヘッダ情報に基づ
いて、符号化ビットストリームを復号する復号ステップ
とを含み、識別フラグは少なくとも、上位階層のヘッダ
情報を何も伝送しない、上位階層のヘッダ情報を全て伝
送する、若しくは、上位階層のヘッダ情報のうち所定の
ヘッダ情報のみを伝送する、の３通りを規定することを
特徴とする。According to the image decoding method of the present invention, the extraction step of extracting the header information of the upper layer included in the header of the lower layer of the encoded bitstream based on the identification flag, and the header extracted by the processing of the extraction step. based on the information, see contains a decoding step of decoding the coded bit stream, the identification flag is at least, the upper layer header
Does not transmit any information, transmits all upper-layer header information
To send, or a predetermined number of upper-layer header information
It is characterized by defining three types of transmitting only header information .

【００４７】本発明の画像復号装置および方法において
は、識別フラグに基づいて、符号化ビットストリームの
下位階層のヘッダに含まれる上位階層のヘッダ情報が抽
出され、抽出されたヘッダ情報に基づいて、符号化ビッ
トストリームが復号され、識別フラグには少なくとも、
上位階層のヘッダ情報を何も伝送しない、上位階層のヘ
ッダ情報を全て伝送する、若しくは、上位階層のヘッダ
情報のうち所定のヘッダ情報のみを伝送する、の３通り
が規定されている。In the image decoding apparatus and method of the present invention, the upper layer header information included in the lower layer header of the coded bitstream is extracted based on the identification flag, and based on the extracted header information, Encoding bit
Stream is decoded and the identification flag contains at least
The header information of the upper layer that does not transmit any header information of the upper layer
All header information is transmitted, or the upper layer header
Of the information, only the predetermined header information is transmitted.
Is specified .

【００４８】[0048]

【発明の実施の形態】以下に、本発明の実施の形態につ
いて説明するが、その前に、ＭＰＥＧ４において規定さ
れている符号化ビットストリームについて説明する。な
お、ここでは、MPEG4規格DraftであるFCD(Final Comitt
ee Draft)における符号化ビットストリームについて説
明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below, but before that, a coded bit stream defined in MPEG4 will be described. In addition, here, FCD (Final Comitt) which is a Draft of MPEG4 standard is used.
An encoded bitstream in ee Draft) will be described.

【００４９】図１は、FCDで規定されている符号化ビッ
トストリームの構成を示している。FIG. 1 shows the structure of a coded bit stream specified by FCD.

【００５０】符号化ビットストリームは、同図に示すよ
うに、ＶＳ（Visual Object Sequence）層、ＶＩＳＯ(V
isual Object)層、ＶＯ（video Object）層、ＶＯＬ
（Video Object Layer）、ＧＯＶ(Group of VOP)層、Ｖ
ＯＰ（Video Object Plane）層などの、複数の階層から
なる階層構造を有している（図１において、上方に位置
している階層ほど、上位の階層を構成する）。As shown in the figure, the coded bit stream has a VS (Visual Object Sequence) layer, a VISO (V
isual Object) layer, VO (video Object) layer, VOL
(Video Object Layer), GOV (Group of VOP) layer, V
It has a hierarchical structure composed of a plurality of layers such as OP (Video Object Plane) layer (the upper layer in FIG. 1 constitutes a higher layer).

【００５１】即ち、符号化ビットストリームは、ＶＳを
単位として構成される。ここで、ＶＳは、画像シーケン
スであり、例えば、一本の番組や映画などに相当する。That is, the coded bit stream is configured in units of VS. Here, VS is an image sequence and corresponds to, for example, one program or movie.

【００５２】各ＶＳは、１以上のＶＩＳＯから構成され
る。ここで、ＶＩＳＯには、幾つかの種類がある。即
ち、ＶＩＳＯには、例えば、静止画であるスチルテクス
チャオブジェクト（Still Texture Object）や、顔画像
から構成されるフェイスオブジェクト（Face Objec
t）、動画像のオブジェクトであるＶＯ（Video Objec
t）などがある。従って、符号化ビットストリームが動
画像のものである場合、ＶＩＳＯは、ＶＯから構成され
る。Each VS is composed of one or more VISOs. Here, there are several types of VISO. That is, in the VISO, for example, a still texture object (Still Texture Object) that is a still image and a face object (Face Objec) composed of a face image are included.
t), VO (Video Objec) that is a moving image object
t) etc. Therefore, if the encoded bitstream is of a moving image, the VISO is composed of VOs.

【００５３】ＶＯは、１以上のＶＯＬ（Video Object L
ayer）から構成される（画像を階層化（階層符号化）し
ないときは１のＶＯＬで構成され、画像を階層化する場
合には、その階層数だけのＶＯＬで構成される）。VO is one or more VOLs (Video Object L
ayer) (composed of one VOL when the image is not layered (layered encoding), and is composed of the number of VOLs when the image is layered).

【００５４】ＶＯＬは、必要な数のＧＯＶ（Group of V
OP）で構成され、ＧＯＶは、１以上のＶＯＰ（Video Ob
ject Plane）のシーケンスで構成される。なお、ＧＯＶ
はなくても良く、この場合、ＶＯＬは、１以上のＶＯＰ
で構成されることになる。VOL is a required number of GOVs (Group of V
OP), and GOV is one or more VOP (Video Ob
ject Plane) sequence. In addition, GOV
May be omitted, in which case the VOL is one or more VOPs.
Will be composed of.

【００５５】ＶＯＰは、従来のフレームに相当する。A VOP corresponds to a conventional frame.

【００５６】なお、ＶＳ，ＶＯ，ＶＯＰの関係につい
て、さらに説明すると、ＶＳは、上述したように、画像
シーケンスであり、例えば、一本の番組に相当する。そ
して、ＶＯは、ある合成画像のシーケンスが存在する場
合の、その合成画像を構成する各物体のシーケンスであ
り、ＶＯＰは、ある時刻におけるＶＯを意味する。即
ち、例えば、いま、画像Ｆ１およびＦ２を合成して構成
される合成画像Ｆ３がある場合、画像Ｆ１またはＦ２が
時系列に並んだものが、それぞれＶＯであり、ある時刻
における画像Ｆ１またはＦ２が、それぞれＶＯＰであ
る。従って、ＶＯは、異なる時刻の、同一物体のＶＯＰ
の集合ということができる。The relationship between VS, VO, and VOP will be further described. VS is an image sequence as described above, and corresponds to, for example, one program. Then, VO is a sequence of each object forming a composite image when a sequence of the composite image exists, and VOP means VO at a certain time. That is, for example, when there is a composite image F3 configured by combining the images F1 and F2, the images F1 or F2 arranged in time series are VOs, respectively, and the image F1 or F2 at a certain time is , And each is a VOP. Therefore, VOs are VOPs of the same object at different times.
Can be said to be a set of.

【００５７】ここで、図２乃至図４それぞれに、ＶＳ，
ＶＩＳＯ，ＶＯのシンタクスを示す。また、図５乃至図
７に、ＶＯＬのシンタクスを、図８に、ＧＯＶのシンタ
クスを、図９乃至図１１に、ＶＯＰのシンタクスを、そ
れぞれ示す。なお、各層のシンタクスに記載されている
フラグのセマンティクスは、ＭＰＥＧ４ＦＣＤ規格（14
496-2）に記載されているので、それを参照されたい。Here, in each of FIGS. 2 to 4, VS,
The syntax of VISO and VO is shown. 5 to 7 show VOL syntax, FIG. 8 shows GOV syntax, and FIGS. 9 to 11 show VOP syntax. The flag semantics described in the syntax of each layer are defined in the MPEG4 FCD standard (14
496-2), refer to it.

【００５８】MPEG4のFCD規格におけるＶＳ，ＶＩＳＯ，
ＶＯ，ＶＯＬヘッダの情報は、符号化ビットストリーム
を復号するために必要な必須情報を含んでおり、これら
の情報がなければ、前述したように、その符号化ビット
ストリームを正確に復号することは困難である。VS, VISO, in the FCD standard of MPEG4,
The information of the VO and VOL headers includes essential information necessary for decoding the coded bitstream, and without this information, as described above, it is impossible to correctly decode the coded bitstream. Have difficulty.

【００５９】即ち、例えば、記録媒体に記録された符号
化ビットストリームに対して、ランダムアクセスや、Ｆ
Ｆ／ＦＲ（早送り／巻き戻し）等のような特殊再生を行
う場合、または放送されている符号化ビットストリーム
に途中からアクセスする場合、その符号化ビットストリ
ームの復号を開始するためには、まず、ＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬヘッダの情報を復号することが必要で
ある。That is, for example, random access to the coded bit stream recorded on the recording medium or F
When performing special playback such as F / FR (fast forward / rewind) or accessing a coded bit stream being broadcast from the middle, first of all, in order to start decoding of the coded bit stream, , VS, VIS
It is necessary to decode the information in the O, VO, and VOL headers.

【００６０】しかしながら、MPEG4のFCD規格において
は、符号化ビットストリームの先頭に、一度だけＶＳ，
ＶＩＳＯ，ＶＯ，ＶＯＬヘッダを伝送することしか許さ
れておらず、この場合、特に、放送されてくる符号化ビ
ットストリームの途中から復号を始めることは難しい。However, according to the FCD standard of MPEG4, the VS,
Only VISO, VO, and VOL headers are allowed to be transmitted, and in this case, it is particularly difficult to start decoding from the middle of the coded bit stream that is broadcast.

【００６１】さらに、例えば、ＶＯＬヘッダには、量子
化マトリクスを始め、符号化モードを指定するフラグが
記述される。これらの符号化モードは、符号化対象の画
像から得られる符号化ビットストリームの性質に依存し
て、即ち、符号化対象の画像の統計的性質が最適になる
ように設定される。しかしながら、例えば、長時間の画
像シーケンスなどに関しては、画像の性質は時刻によっ
て大きく変化することがあるため、ＶＯＬヘッダに、最
初に設定した値が、必ずしも常に最適であるとは限らな
い。それにもかかわらず、画像シーケンスの先頭（ここ
では、符号化モードを指定するフラグが記述されるＶＯ
Ｌの先頭）でしか、符号化モードを指定するフラグを設
定することができないということは、効率の良い符号化
の妨げとなる。Further, for example, the VOL header describes a quantization matrix and a flag designating a coding mode. These coding modes are set depending on the property of the coded bitstream obtained from the image to be coded, that is, the statistical property of the image to be coded is optimized. However, for example, with respect to a long-time image sequence or the like, the nature of the image may change greatly depending on the time, so that the value initially set in the VOL header is not always optimal. Nevertheless, at the beginning of the image sequence (here, the VO in which the flag designating the coding mode is described
The fact that the flag that specifies the encoding mode can be set only at the beginning of L) impedes efficient encoding.

【００６２】そこで、図１２は、本発明を適用したエン
コーダの一実施の形態の構成例を示している。なお、こ
のエンコーダを構成するフレームメモリ１、動きベクト
ル検出器２、演算器３，ＤＣＴ器４、量子化器５，ＶＬ
Ｃ器６、バッファ７、逆量子化器８，ＩＤＣＴ器９，演
算器１０、フレームメモリ１１、動き補償器１２は、図
２４に示したエンコーダを構成するフレームメモリ３
１、動きベクトル検出器３２、演算器３３，ＤＣＴ器３
４、量子化器３５，ＶＬＣ器３６、バッファ３７、逆量
子化器３８，ＩＤＣＴ器３９，演算器４０、フレームメ
モリ４１、動き補償器４２にそれぞれ対応している。従
って、フレームメモリ１乃至動き補償器１２それぞれで
は、フレームメモリ３１乃至動き補償器４２それぞれの
処理と同一の処理が行われる場合があり、そのような同
一の処理についての説明は、適宜省略する。Therefore, FIG. 12 shows a configuration example of an embodiment of an encoder to which the present invention is applied. A frame memory 1, a motion vector detector 2, an arithmetic unit 3, a DCT unit 4, a quantizer 5, and VL which constitute this encoder.
The C unit 6, the buffer 7, the inverse quantizer 8, the IDCT unit 9, the arithmetic unit 10, the frame memory 11, and the motion compensator 12 are the frame memory 3 which constitutes the encoder shown in FIG.
1, motion vector detector 32, calculator 33, DCT device 3
4, a quantizer 35, a VLC device 36, a buffer 37, an inverse quantizer 38, an IDCT device 39, a calculator 40, a frame memory 41, and a motion compensator 42, respectively. Therefore, the frame memory 1 to the motion compensator 12 may perform the same process as the frame memory 31 to the motion compensator 42, respectively, and the description of the same process will be appropriately omitted.

【００６３】符号化対象のディジタル画像信号を構成す
るＶＯＰは、フレームメモリ１（受信手段）に順次供給
され、そこで受信されて一時記憶される。さらに、フレ
ームメモリ１には、そこに供給されるＶＯＰの、所定の
絶対座標系における大きさを示すフラグFSZと、位置を
示すフラグFPOSも供給されるようになされており、フレ
ームメモリ１は、これらのフラグFSZおよびFPOSも一時
記憶する。The VOPs forming the digital image signal to be coded are sequentially supplied to the frame memory 1 (reception means), where they are received and temporarily stored. Further, the frame memory 1 is also supplied with a flag FSZ indicating the size of the VOP supplied thereto in a predetermined absolute coordinate system and a flag FPOS indicating the position. These flags FSZ and FPOS are also temporarily stored.

【００６４】フレームメモリ１に記憶されたＶＯＰは、
動きベクトル検出器２によって、マクロブロック単位で
読み出される。そして、動きベクトル検出回路２は、予
め設定されている所定のシーケンスに従って、各ＶＯＰ
を、Ｉ（Intra）−ＶＯＰ，Ｐ（Predictive）−ＶＯ
Ｐ、またはＢ（Biderectionally Predictive）−ＶＯＰ
として処理する。シーケンシャルに入力される各ＶＯＰ
を、Ｉ，Ｐ，ＢのいずれのＶＯＰとして処理するかは、
予め定められている（例えば、Ｉ，Ｂ，Ｐ，Ｂ，Ｐ，・
・・Ｂ，Ｐとして処理される）。The VOP stored in the frame memory 1 is
The motion vector detector 2 reads the data in macroblock units. Then, the motion vector detection circuit 2 follows each VOP according to a preset predetermined sequence.
, I (Intra) -VOP, P (Predictive) -VO
P or B (Biderectionally Predictive) -VOP
Process as. Each VOP input sequentially
Is processed as a VOP of I, P, or B,
Predetermined (for example, I, B, P, B, P, ...
.. are treated as B and P).

【００６５】動きベクトル検出器２は、処理対象のマク
ロブロックに対して、予め定められた所定の参照画像
（ＶＯＰ）を参照して、動き補償を施し、そのマクロブ
ロックの動きベクトルを検出する。The motion vector detector 2 refers to a predetermined reference image (VOP) that is predetermined for the macroblock to be processed, performs motion compensation, and detects the motion vector of the macroblock.

【００６６】ここで、動き補償（フレーム間予測）に
は、前方予測、後方予測、両方向予測の３種類の予測モ
ードがあり、Ｐ−ＶＯＰは、前方予測のみでのみ動き補
償が施され、動きベクトル検出器２は、その予測誤差を
最小にする動きベクトルを検出する。また、Ｂ−ＶＯＰ
は、前方予測、後方予測、両方向予測の３種類で動き補
償が施され、動きベクトル検出器２は、各予測モードに
おいて、その予測誤差を最小にする動きベクトルを検出
する。さらに、動きベクトル検出器２は、３つの予測モ
ードのうち、最小の予測誤差が得られたものを選択し、
その予測モードにおける動きベクトルも選択する。Here, there are three types of prediction modes of motion compensation (inter-frame prediction): forward prediction, backward prediction, and bidirectional prediction. In P-VOP, motion compensation is performed only in forward prediction, and motion compensation is performed. The vector detector 2 detects a motion vector that minimizes the prediction error. Also, B-VOP
Is subjected to motion compensation in three types of prediction, forward prediction, and bidirectional prediction, and the motion vector detector 2 detects a motion vector that minimizes the prediction error in each prediction mode. Furthermore, the motion vector detector 2 selects one of the three prediction modes in which the minimum prediction error is obtained,
The motion vector in the prediction mode is also selected.

【００６７】そして、動きベクトル検出器２は、動き補
償の結果得られた予測誤差と、符号化対象のマクロブロ
ックの分散とを比較する。その結果、マクロブロックの
分散の方が小さい場合は、そのマクロブロックについて
はフレーム間予測は行われず、フレーム内符号化が行わ
れる。この場合、予測モードは、画像内符号化（イント
ラ）となり、そのような予測モードが、動きベクトル検
出器２からＶＬＣ器６および動き補償器１２に供給され
る。一方、予測誤差の方が小さい場合には、その予測誤
差が得られた予測モードと動きベクトルとが、動きベク
トル検出器２からＶＬＣ器６および動き補償器１２に供
給される。なお、Ｉ−ＶＯＰについての予測モードは、
必ず画像内符号化にされる。Then, the motion vector detector 2 compares the prediction error obtained as a result of motion compensation with the variance of the macroblock to be coded. As a result, if the variance of the macroblock is smaller, interframe prediction is not performed for that macroblock, and intraframe coding is performed. In this case, the prediction mode is intra-picture coding (intra), and such a prediction mode is supplied from the motion vector detector 2 to the VLC unit 6 and the motion compensator 12. On the other hand, when the prediction error is smaller, the prediction mode and the motion vector for which the prediction error is obtained are supplied from the motion vector detector 2 to the VLC unit 6 and the motion compensator 12. The prediction mode for I-VOP is
Be sure to use intra-picture coding.

【００６８】ここで、符号化対象となるVOPのシーケン
スは、それぞれ、大きさや位置が異なることがある。従
って、動きベクトルを検出する場合には、基準となる座
標系を設定し、その座標系において、動きベクトルの検
出を行う必要がある。そこで、ここでは、ある１つの絶
対座標を仮定し、その絶対座標における動きベクトルが
算出されるようになされている。即ち、動きベクトル検
出器２には、ＶＯＰの絶対座標系における大きさを示す
フラグFSZと、位置を示すフラグFPOSとが供給されるよ
うになされており、動きベクトル検出器２は、このフラ
グFSZおよびフラグFPOSに基づき、処理対象のＶＯＰ
と、参照画像となるＶＯＰとを、絶対座標系に配置し、
処理対象のＶＯＰ（のマクロブロック）の動きベクトル
を算出するようになされている。Here, the VOP sequences to be encoded may differ in size and position. Therefore, when detecting a motion vector, it is necessary to set a reference coordinate system and detect the motion vector in the coordinate system. Therefore, here, one certain absolute coordinate is assumed, and the motion vector at the absolute coordinate is calculated. That is, the motion vector detector 2 is supplied with the flag FSZ indicating the size of the VOP in the absolute coordinate system and the flag FPOS indicating the position, and the motion vector detector 2 receives this flag FSZ. And VOP to be processed based on flag FPOS
And the VOP to be the reference image are arranged in the absolute coordinate system,
The motion vector of (the macroblock of) the VOP to be processed is calculated.

【００６９】一方、動き補償器１２は、動きベクトル検
出器２からの動きベクトルおよび予測モードに基づい
て、フレームメモリ１１に記憶されているＶＯＰに対し
て動き補償を施すことで、予測画像を生成する。この予
測画像は、演算器３に供給される。演算器３には、さら
に、動きベクトル検出器２がフレームメモリ１から読み
出した符号化対象のマクロブロックも、フレームメモリ
１から供給される。そして、演算器３は、符号化対象の
マクロブロックを構成する各画素の画素値それぞれと、
予測画像を構成する画素の画素値それぞれの差分を演算
し、その差分信号を、DCT器４に出力する。なお、符号
化対象のマクロブロックが、イントラマクロブロックの
場合には、演算器３は、その符号化対象のマクロブロッ
クをそのままDCT器４に出力する。On the other hand, the motion compensator 12 generates a predicted image by performing motion compensation on the VOP stored in the frame memory 11 based on the motion vector from the motion vector detector 2 and the prediction mode. To do. This predicted image is supplied to the arithmetic unit 3. The encoding unit macroblock read from the frame memory 1 by the motion vector detector 2 is also supplied to the arithmetic unit 3 from the frame memory 1. Then, the computing unit 3 sets the pixel value of each pixel forming the encoding target macroblock,
The difference between the pixel values of the pixels forming the predicted image is calculated, and the difference signal is output to the DCT unit 4. If the macroblock to be encoded is an intra macroblock, the arithmetic unit 3 outputs the macroblock to be encoded as it is to the DCT unit 4.

【００７０】DCT器４では、演算器３の出力に対して、D
CT（離散コサイン変換）処理が施され、DCT係数に変換
される。このＤＣＴ係数は、量子化器５に入力され、送
信バッファ７のデータ蓄積量（バッファ蓄積量）に対応
した量子化ステップで量子化された後、ＶＬＣ（可変長
符号化）器６に入力される。In the DCT unit 4, D
CT (discrete cosine transform) processing is performed and converted into DCT coefficients. The DCT coefficient is input to the quantizer 5, quantized in a quantization step corresponding to the data storage amount (buffer storage amount) of the transmission buffer 7, and then input to the VLC (variable length coding) unit 6. It

【００７１】ＶＬＣ器６は、量子化器５より供給される
画像データを、例えばハフマン符号などの可変長符号に
変換し、その結果得られる符号化ビットストリームを、
送信バッファ７に出力する。The VLC device 6 converts the image data supplied from the quantizer 5 into a variable length code such as Huffman code, and the resulting encoded bit stream is
Output to the transmission buffer 7.

【００７２】ＶＬＣ器６には、また、量子化器５より量
子化ステップ（スケール）が、動きベクトル検出器２よ
り予測モード（画像内予測、前方予測、後方予測、また
は両方向予測のいずれが設定されたかを示すモード）、
および動きベクトルが、後述するキー信号符号化器１３
よりキー信号の符号化結果が、それぞれ供給されるよう
になされている。さらに、ＶＬＣ器６には、フラグFSZ
およびFPOSも供給されるようになされている。ＶＬＣ器
６は、これらの情報、さらには、バッファ１６に記憶さ
れた情報を、図１に示したように構成される符号化ビッ
トストリームの所定の階層のヘッダに挿入（配置）して
出力する。In the VLC unit 6, the quantizer 5 sets a quantization step (scale), and the motion vector detector 2 sets a prediction mode (intra-picture prediction, forward prediction, backward prediction, or bidirectional prediction). Mode),
And the motion vector is a key signal encoder 13 described later.
More key signal encoding results are supplied. Further, the VLC unit 6 has a flag FSZ.
And FPOS is also being supplied. The VLC unit 6 inserts (arranges) these pieces of information, and further, the information stored in the buffer 16 into the header of a predetermined layer of the encoded bitstream configured as shown in FIG. 1 and outputs it. .

【００７３】なお、ＶＬＣ器６は、各階層のヘッダに配
置された情報を、バッファ１６に出力するようになされ
ており、バッファ１６は、ＶＬＣ器６から供給される情
報を記憶するようになされている。The VLC unit 6 outputs the information arranged in the header of each layer to the buffer 16, and the buffer 16 stores the information supplied from the VLC unit 6. ing.

【００７４】送信バッファ７は、ＶＬＣ器３６からの符
号化ビットストリームを一時蓄積し、その蓄積量に対応
する量子化制御信号を量子化器５に出力する。即ち、送
信バッファ７は、その蓄積量が許容上限値まで増量する
と、量子化スケールを大きくする量子化制御信号を、量
子化器５に供給し、量子化スケールを大きくさせること
で、量子化器５の出力するデータ量を低下させる。ま
た、送信バッファ７は、その蓄積残量が許容下限値まで
減少すると、量子化スケールを小さくする量子化制御信
号を、量子化器５に供給し、量子化スケールを小さくさ
せることで、量子化器５の出力するデータ量を増大させ
る。このようにして、送信バッファ７のオーバフローお
よびアンダフローが防止されるようになされている。The transmission buffer 7 temporarily stores the coded bit stream from the VLC unit 36 and outputs a quantization control signal corresponding to the stored amount to the quantizer 5. That is, the transmission buffer 7 supplies the quantization control signal for increasing the quantization scale to the quantizer 5 when the accumulated amount increases to the allowable upper limit value, and increases the quantization scale, thereby increasing the quantization scale. 5 reduces the amount of data output. Further, when the remaining storage amount decreases to the allowable lower limit value, the transmission buffer 7 supplies the quantizer 5 with a quantization control signal that reduces the quantization scale, and reduces the quantization scale, thereby performing quantization. The amount of data output from the device 5 is increased. In this way, overflow and underflow of the transmission buffer 7 are prevented.

【００７５】そして、送信バッファ７に蓄積された符号
化ビットストリームは、所定のタイミングで読み出さ
れ、例えば、磁気テープや、磁気ディスク、光磁気ディ
スク、相変化ディスクなどの記録媒体２０１に供給され
て記録され、あるいは、アナログ公衆網や、ＩＳＤＮ、
衛星回線、ＣＡＴＶ網、地上波などの伝送媒体２０２を
介して伝送される。これにより、記録媒体２０１や伝送
媒体２０２を媒介して、符号化ビットストリームが、後
述する図１６のデコーダに提供される。Then, the encoded bit stream accumulated in the transmission buffer 7 is read out at a predetermined timing and supplied to a recording medium 201 such as a magnetic tape, a magnetic disk, a magneto-optical disk, a phase change disk or the like. Recorded, or analog public network, ISDN,
It is transmitted via a transmission medium 202 such as a satellite line, a CATV network, or a terrestrial wave. As a result, the coded bitstream is provided to the decoder of FIG. 16 described later via the recording medium 201 and the transmission medium 202.

【００７６】ここで、上述したように、ＶＯは、ある合
成画像のシーケンスが存在する場合の、その合成画像を
構成する各物体のシーケンスであり、ＶＯＰは、ある時
刻におけるＶＯを意味する。即ち、例えば、いま、画像
Ｆ１およびＦ２を合成して構成される合成画像Ｆ３があ
る場合、画像Ｆ１またはＦ２が時系列に並んだものが、
それぞれＶＯであり、ある時刻における画像Ｆ１または
Ｆ２が、それぞれＶＯＰである。従って、例えば、画像
Ｆ１を背景とするとともに、画像Ｆ２を前景とすると、
合成画像Ｆ３を得るためには、画像Ｆ２を抜くためのキ
ー信号を用いて、画像Ｆ１およびＦ２を合成する必要が
ある。即ち、合成画像Ｆ３を得るには、画像Ｆ２を抜く
ためのキー信号が必要となる。Here, as described above, VO is a sequence of each object forming a synthetic image when a sequence of the synthetic image exists, and VOP means VO at a certain time. That is, for example, when there is a composite image F3 that is composed by combining the images F1 and F2, the image F1 or F2 arranged in time series is
Each is a VO, and the image F1 or F2 at a certain time point is a VOP. Therefore, for example, if the image F1 is the background and the image F2 is the foreground,
In order to obtain the combined image F3, it is necessary to combine the images F1 and F2 using the key signal for removing the image F2. That is, in order to obtain the composite image F3, a key signal for extracting the image F2 is required.

【００７７】このため、各ＶＯＰを抜くためのキー信号
が、キー信号符号化器１３に供給されるようになされて
おり、キー信号符号化器１３では、そこに供給されるキ
ー信号が、例えばDPCMなどの所定の手法によって符号化
される。このキー信号の符号化結果は、ＶＬＣ器６およ
びキー信号復号器１４に供給されるようになされてい
る。Therefore, the key signal for removing each VOP is supplied to the key signal encoder 13. In the key signal encoder 13, the key signal supplied thereto is, for example, It is encoded by a predetermined method such as DPCM. The encoded result of the key signal is supplied to the VLC unit 6 and the key signal decoder 14.

【００７８】キー信号復号器１４では、キー信号符号化
器１３からのキー信号の符号化結果が復号され、動きベ
クトル検出器２、ＤＣＴ器４、ＩＤＣＴ器９、動き補償
器１２、および画素置換器１５に供給され、これらのブ
ロックでは、キー信号の復号結果を必要に応じて用いて
処理が行われる。The key signal decoder 14 decodes the encoded result of the key signal from the key signal encoder 13, and the motion vector detector 2, the DCT unit 4, the IDCT unit 9, the motion compensator 12, and the pixel replacement. The blocks are supplied to the device 15, and the blocks are processed by using the decryption result of the key signal as needed.

【００７９】このように、動きベクトル検出器２には、
キー信号復号器１４で局所復号されたキー信号も供給さ
れるようになされているが、このキー信号は、動きベク
トル検出器２が、マクロブロックの予測誤差を計算する
際に用いられる。Thus, the motion vector detector 2 has
Although the key signal locally decoded by the key signal decoder 14 is also supplied, this key signal is used when the motion vector detector 2 calculates the prediction error of the macroblock.

【００８０】即ち、ＶＯＰは、ある時刻の、ある物体の
画像であるから、その形状は、基本的に任意形状であ
り、この場合、符号化対象のマクロブロックに画像（物
体を構成する画素）が存在しない領域が含まれることが
ある。そのような場合に、動きベクトル検出器２は、符
号化対象のマクロブロックにおいて画像が存在しない画
素を除外して、予測誤差を計算するようになされてお
り、即ち、画像が存在する画素の予測誤差のみを用い
て、符号化対象のマクロブロックの予測誤差を計算し、
それを最小とする動きベクトルを検出するようになされ
ており、符号化対象のマクロブロック内の各画素につい
て、画像が存在するかどうかを認識するために、符号化
対象のマクロブロックの、局所復号されたキー信号が参
照される。That is, since the VOP is an image of a certain object at a certain time, its shape is basically an arbitrary shape. In this case, the image (pixels forming the object) is displayed in the macroblock to be encoded. May include areas where no. In such a case, the motion vector detector 2 is adapted to calculate a prediction error by excluding pixels in the macroblock to be encoded in which an image does not exist, that is, prediction of a pixel in which an image exists. Calculate the prediction error of the macroblock to be encoded using only the error,
It is designed to detect a motion vector that minimizes it, and for each pixel in the macroblock to be encoded, in order to recognize whether or not an image exists, local decoding of the macroblock to be encoded is performed. The generated key signal is referred to.

【００８１】具体的には、動きベクトル検出器２では、
キー信号が０である画素については、画像が存在しな
い、物体（画像オブジェクト）の外側の領域に属する画
素であると認識され、キー信号が０以外である画素につ
いては、画像が存在する、物体（画像オブジェクト）の
内側の領域にある画素であると認識される。そして、動
きベクトル検出器２は、キー信号が０である画素につい
ては、予測画像を求めるための、参照画像との差分を計
算しない。Specifically, in the motion vector detector 2,
A pixel having a key signal of 0 is recognized as a pixel belonging to a region outside the object (image object) in which an image does not exist, and a pixel having a key signal other than 0 indicates that an image exists. It is recognized as a pixel in the area inside the (image object). Then, the motion vector detector 2 does not calculate the difference between the pixel having the key signal of 0 and the reference image for obtaining the predicted image.

【００８２】なお、ＶＯＰの形状が長方形状である場合
には、キー信号は常に０以外の値（バイナリ（binary）
キー（ハードキー）では１、グレイスケール（gray sca
le）キー（ソフトキー）では１乃至２５５のいずれか）
となるため、マクロブロックのすべての画素を用いて予
測誤差が計算される。When the VOP has a rectangular shape, the key signal is always a value other than 0 (binary).
1 for the key (hard key), gray scale (gray sca
le) key (soft key) from 1 to 255)
Therefore, the prediction error is calculated using all the pixels of the macroblock.

【００８３】一方、量子化器５が出力するデータは、逆
量子化器８にも供給され、逆量子化器８では、そのデー
タが、量子化器５より供給される量子化ステップに対応
して逆量子化され、ＤＣＴ係数とされる。このＤＣＴ係
数は、ＩＤＣＴ（逆ＤＣＴ）器９に入力され、逆ＤＣＴ
処理された後、演算器１０に供給される。On the other hand, the data output from the quantizer 5 is also supplied to the inverse quantizer 8, and in the inverse quantizer 8, the data corresponds to the quantization step supplied from the quantizer 5. Inversely quantized to obtain DCT coefficients. This DCT coefficient is input to the IDCT (inverse DCT) unit 9 and the inverse DCT
After being processed, it is supplied to the arithmetic unit 10.

【００８４】予測モードが、前方予測、後方予測、両方
向予測のうちのいずれかである場合、演算器１０には、
ＩＤＣＴ器９の出力の他、動き補償器１２が出力する予
測画像も供給される。演算器１０は、ＩＤＣＴ器９の出
力に、動き補償器１２が出力する予測画像を加算するこ
とで、画像を復号し、画素置換器１５に供給する。When the prediction mode is any of forward prediction, backward prediction, and bidirectional prediction, the arithmetic unit 10
In addition to the output of the IDCT device 9, the predicted image output by the motion compensator 12 is also supplied. The arithmetic unit 10 adds the predicted image output from the motion compensator 12 to the output of the IDCT unit 9 to decode the image, and supplies the image to the pixel replacer 15.

【００８５】なお、演算器１０は、予測モードが画像内
符号化である場合には、ＩＤＣＴ器９の出力を、そのま
ま画素置換器１５に供給するようになされている。When the prediction mode is intra-picture coding, the arithmetic unit 10 supplies the output of the IDCT unit 9 to the pixel replacing unit 15 as it is.

【００８６】画素置換器１５では、演算器１０の出力に
対して、後述するパディング処理が施され、フレームメ
モリ１１に供給される。フレームメモリ１１では、画素
置換器１５の出力が記憶され、この記憶値、即ち、復号
画像は、動き補償器１２による動き補償のために用いら
れる。なお、フレームメモリ１１には、フラグFSZおよ
びFPOSも供給されるようになされており、フレームメモ
リ１１は、これらのフラグFSZおよびFPOSも記憶するよ
うになされている。In the pixel replacer 15, the output of the calculator 10 is subjected to padding processing, which will be described later, and the result is supplied to the frame memory 11. The output of the pixel replacer 15 is stored in the frame memory 11, and the stored value, that is, the decoded image is used for motion compensation by the motion compensator 12. The flags FSZ and FPOS are also supplied to the frame memory 11, and the frame memory 11 also stores these flags FSZ and FPOS.

【００８７】次に、図１３のフローチャートを参照し
て、図１２の画素置換器１５が行うパディング（paddin
g）処理について説明する。Next, referring to the flowchart of FIG. 13, the padding (paddin) performed by the pixel replacer 15 of FIG.
g) Describe the processing.

【００８８】パディング処理では、まず最初に、ステッ
プＳ１において、演算器１０から画素置換器１５に供給
されたマクロブロックを構成する画素の１つを注目画素
として、その注目画素についてのキー信号が０であるか
否かが判定される。ステップＳ１において、注目画素に
ついてのキー信号が０でないと判定された場合、即ち、
注目画素が、画像オブジェクトの内側を構成するもので
ある場合、ステップＳ２に進み、画素置換器１５は、そ
の注目画素に対して、何も処理を施さず、そのままフレ
ームメモリ１１に出力し、ステップＳ４に進む。In the padding process, first, in step S1, one of the pixels forming the macroblock supplied from the calculator 10 to the pixel replacer 15 is set as a target pixel, and the key signal for the target pixel is set to 0. Is determined. When it is determined in step S1 that the key signal for the pixel of interest is not 0, that is,
If the pixel of interest constitutes the inside of the image object, the process proceeds to step S2, where the pixel replacer 15 does not perform any processing on the pixel of interest and outputs it to the frame memory 11 as it is. Proceed to S4.

【００８９】ここで、符号化対象のＶＯＰの形状が長方
形状である場合、上述したように、キー信号は常に０以
外の値となるため、画素置換器１５では、そのＶＯＰ中
の全ての画素が何も処理されずそのまま出力されること
になる。Here, when the VOP to be encoded has a rectangular shape, the key signal always has a value other than 0 as described above, so that the pixel replacer 15 selects all the pixels in the VOP. Will be output without any processing.

【００９０】一方、ステップＳ１において、注目画素に
ついてのキー信号が０であると判定された場合、即ち、
注目画素が、画像オブジェクトの外側を構成するもので
ある場合、ステップＳ３に進み、注目画素の画素値が、
例えば０とされ、ステップＳ４に進む。ステップＳ４で
は、演算器１０からのマクロブロックを構成する画素す
べてについて処理を行ったかどうかが判定され、まだ、
すべての画素について処理を行っていないと判定された
場合、ステップＳ１に戻り、まだ注目画素とされていな
い画素を、新たに注目画素として、同様の処理が繰り返
される。On the other hand, if it is determined in step S1 that the key signal for the pixel of interest is 0, that is,
If the pixel of interest constitutes the outside of the image object, the process proceeds to step S3, and the pixel value of the pixel of interest is
For example, it is set to 0, and the process proceeds to step S4. In step S4, it is determined whether or not the processing has been performed for all the pixels forming the macroblock from the arithmetic unit 10, and
If it is determined that the processing has not been performed for all the pixels, the process returns to step S1 and the same processing is repeated with the pixel not yet set as the target pixel as a new target pixel.

【００９１】また、ステップＳ４において、演算器１０
からの画素すべてについて処理を行ったと判定された場
合、ステップＳ５に進み、演算器１０からのマクロブロ
ックのある水平ラインが、注目水平ラインとして選択さ
れ、ステップＳ６に進む。ステップＳ６では、注目水平
ラインの両端の画素の画素値が判定される。In step S4, the arithmetic unit 10
If it is determined that the processing has been performed for all the pixels from, the process proceeds to step S5, the horizontal line having the macroblock from the arithmetic unit 10 is selected as the target horizontal line, and the process proceeds to step S6. In step S6, the pixel values of the pixels at both ends of the horizontal line of interest are determined.

【００９２】即ち、ステップＳ１乃至Ｓ４の処理が施さ
れた後のマクロブロックの、ある水平ラインに注目した
場合には、その注目水平ラインについては、その両端の
画素値が、いずれも０のケース（両端の画素が画像オブ
ジェクトの外側にあるケース）、いずれか一端の画素値
が０でないケース（一端の画素だけが画像オブジェクト
の内側にあるケース）、および両端の画素値がいずれも
０でないケース（両端の画素が画像オブジェクトの内側
にあるケース）の３通りのケースが生じる。ステップＳ
６では、注目水平ラインが、これらの３つのケースのう
ちのいずれに属するのかが判定される。That is, when attention is paid to a certain horizontal line of the macroblock after the processing of steps S1 to S4, the pixel values at both ends of the horizontal line of interest are both 0. (The pixels at both ends are outside the image object), the pixel value at one end is not 0 (only the pixels at one end are inside the image object), and the pixel values at both ends are not 0 There are three cases (cases where pixels at both ends are inside the image object). Step S
At 6, it is determined which of these three cases the horizontal line of interest belongs to.

【００９３】ステップＳ６において、注目水平ラインの
両端の画素値が、いずれも０であると判定された場合、
ステップＳ７に進み、その注目水平ラインについて確保
された変数Ｃに、０がセットされ、ステップＳ１０に進
む。また、ステップＳ６において、注目水平ラインの両
端の画素値が、いずれも０でないと判定された場合、ス
テップＳ８に進み、その注目水平ラインについて確保さ
れた変数Ｃに、注目水平ラインの両端の画素値の平均値
がセットされ、ステップＳ１０に進む。さらに、ステッ
プＳ６において、注目水平ラインの両端の画素値のうち
のいずれか一方だけが０でないと判定された場合、ステ
ップＳ９に進み、その注目水平ラインについて確保され
た変数Ｃに、注目水平ラインの両端の画素値のうちの０
でない方の値がセットされ、ステップＳ１０に進む。If it is determined in step S6 that the pixel values at both ends of the horizontal line of interest are both 0,
The process proceeds to step S7, 0 is set to the variable C secured for the horizontal line of interest, and the process proceeds to step S10. If it is determined in step S6 that the pixel values at both ends of the target horizontal line are not 0, the process proceeds to step S8 and the variable C secured for the target horizontal line is set to the pixels at both ends of the target horizontal line. The average value is set, and the process proceeds to step S10. Further, when it is determined in step S6 that only one of the pixel values at both ends of the target horizontal line is not 0, the process proceeds to step S9, and the variable C secured for the target horizontal line is set to the target horizontal line. 0 of the pixel values at both ends of
The other value is set, and the process proceeds to step S10.

【００９４】ステップＳ１０では、演算器１０からのマ
クロブロックのすべての水平ラインを注目水平ラインと
して処理を行ったかどうかが判定され、まだ、すべての
水平ラインを注目水平ラインとして処理を行っていない
と判定された場合、ステップＳ５に戻り、まだ、注目水
平ラインとして選択されていない水平ラインが、新たな
注目水平ラインとして選択され、以下、同様の処理が繰
り返される。In step S10, it is determined whether or not all the horizontal lines of the macroblock from the arithmetic unit 10 have been processed as the noticeable horizontal lines, and all the horizontal lines have not been processed as the noticeable horizontal lines. When it is determined, the process returns to step S5, a horizontal line that is not yet selected as the horizontal line of interest is selected as a new horizontal line of interest, and the same processing is repeated thereafter.

【００９５】また、ステップＳ１０において、すべての
水平ラインを注目水平ラインとして処理を行ったと判定
された場合、ステップＳ１１に進む。If it is determined in step S10 that all the horizontal lines have been processed as the noticeable horizontal lines, the process proceeds to step S11.

【００９６】ステップＳ１１乃至ステップＳ１６では、
演算器１０からのマクロブロックの水平ラインではな
く、垂直ラインを対象として、ステップＳ５乃至Ｓ１０
における場合とそれぞれ同様の処理が行われる。In steps S11 to S16,
Steps S5 to S10 are performed on the vertical line, not the horizontal line of the macroblock from the arithmetic unit 10.
The same processing as in the case of is performed.

【００９７】即ち、ステップＳ１１では、演算器１０か
らのマクロブロックのある垂直ラインが、注目垂直ライ
ンとして選択され、ステップＳ１２に進む。ステップＳ
１２では、注目垂直ラインの両端の画素の画素値が判定
される。That is, in step S11, the vertical line with the macroblock from the arithmetic unit 10 is selected as the vertical line of interest, and the process proceeds to step S12. Step S
At 12, the pixel values of the pixels at both ends of the vertical line of interest are determined.

【００９８】即ち、ステップＳ１乃至Ｓ４の処理が施さ
れた後のマクロブロックの、ある垂直ラインに注目した
場合にも、その注目垂直ラインについては、その両端の
画素値が、いずれも０のケース（両端の画素が画像オブ
ジェクトの外側にあるケース）、いずれか一端の画素値
が０でないケース（一端の画素だけが画像オブジェクト
の内側にあるケース）、および両端の画素値がいずれも
０でないケース（両端の画素が画像オブジェクトの内側
にあるケース）の３通りのケースが生じる。ステップＳ
１２では、注目垂直ラインが、これらの３つのケースの
うちのいずれに属するのかが判定される。That is, even when attention is paid to a certain vertical line of the macroblock after the processing of steps S1 to S4, the pixel values at both ends of the vertical line of interest are both 0. (The pixels at both ends are outside the image object), the pixel value at one end is not 0 (only the pixels at one end are inside the image object), and the pixel values at both ends are not 0 There are three cases (cases where pixels at both ends are inside the image object). Step S
At 12, it is determined which of these three cases the vertical line of interest belongs to.

【００９９】ステップＳ１２において、注目垂直ライン
の両端の画素値が、いずれも０であると判定された場
合、ステップＳ１３に進み、その注目垂直ラインについ
て確保された変数Ｂに、０がセットされ、ステップＳ１
６に進む。また、ステップＳ１２において、注目垂直ラ
インの両端の画素値が、いずれも０でないと判定された
場合、ステップＳ１４に進み、その注目垂直ラインにつ
いて確保された変数Ｂに、注目垂直ラインの両端の画素
値の平均値がセットされ、ステップＳ１６に進む。さら
に、ステップＳ１２において、注目垂直ラインの両端の
画素値のうちのいずれか一方だけが０でないと判定され
た場合、ステップＳ１５に進み、その注目垂直ラインに
ついて確保された変数Ｂに、注目垂直ラインの両端の画
素値のうちの０でない方の値がセットされ、ステップＳ
１６に進む。When it is determined in step S12 that the pixel values at both ends of the target vertical line are both 0, the process proceeds to step S13, and 0 is set to the variable B secured for the target vertical line. Step S1
Go to 6. If it is determined in step S12 that the pixel values at both ends of the vertical line of interest are not 0, the process proceeds to step S14 and the variable B secured for the vertical line of interest is set to the pixels at both ends of the vertical line of interest. The average value is set, and the process proceeds to step S16. Further, when it is determined in step S12 that only one of the pixel values at both ends of the target vertical line is not 0, the process proceeds to step S15, and the variable B secured for the target vertical line is set to the target vertical line. One of the pixel values at both ends of the non-zero value is set, and step S
Proceed to 16.

【０１００】ステップＳ１６では、演算器１０からのマ
クロブロックのすべての垂直ラインを注目垂直ラインと
して処理を行ったかどうかが判定され、まだ、すべての
垂直ラインを注目垂直ラインとして処理を行っていない
と判定された場合、ステップＳ１１に戻り、まだ、注目
垂直ラインとして選択されていない垂直ラインが、新た
な注目垂直ラインとして選択され、以下、同様の処理が
繰り返される。In step S16, it is determined whether or not all the vertical lines of the macroblock from the arithmetic unit 10 have been processed as the target vertical lines, and all the vertical lines have not been processed as the target vertical lines. If determined, the process returns to step S11, a vertical line that has not yet been selected as the vertical line of interest is selected as a new vertical line of interest, and the same processing is repeated thereafter.

【０１０１】また、ステップＳ１６において、すべての
垂直ラインを注目垂直ラインとして処理を行ったと判定
された場合、ステップＳ１７に進み、演算器１０からの
マクロブロックを構成する画素のうち、ステップＳ２で
そのままフレームメモリ１１に出力した画素を除いたも
のの中から、ある画素が、注目画素として選択され、ス
テップＳ１８に進む。If it is determined in step S16 that all the vertical lines have been processed as the vertical lines of interest, the process proceeds to step S17, and among the pixels forming the macro block from the arithmetic unit 10, the same as in step S2. A pixel is selected as a pixel of interest from the pixels excluding the pixel output to the frame memory 11, and the process proceeds to step S18.

【０１０２】ステップＳ１８では、注目画素上で交差す
る垂直ラインと水平ラインそれぞれについての変数Ｂと
Ｃのセット（Ｂ，Ｃ）の値が判定される。In step S18, the value of the set (B, C) of variables B and C for each of the vertical line and the horizontal line intersecting on the target pixel is determined.

【０１０３】ステップＳ１８において、変数Ｂが０で、
Ｃが０でないと判定された場合、ステップＳ１９に進
み、変数Ｃの値が、注目画素の画素値として、フレーム
メモリ１１に出力され、ステップＳ２３に進む。また、
ステップＳ１８において、変数ＢおよびＣのいずれも０
でないと判定された場合、ステップＳ２０に進み、変数
ＢとＣの値の平均値が、注目画素の画素値として、フレ
ームメモリ１１に出力され、ステップＳ２３に進む。さ
らに、ステップＳ１８において、変数ＢおよびＣのいず
れも０であると判定された場合、ステップＳ２１に進
み、注目画素の画素値が０のままとされ、ステップＳ２
３に進む。In step S18, the variable B is 0,
When it is determined that C is not 0, the process proceeds to step S19, the value of the variable C is output to the frame memory 11 as the pixel value of the pixel of interest, and the process proceeds to step S23. Also,
In step S18, both variables B and C are 0
If it is determined that it is not, the process proceeds to step S20, the average value of the values of the variables B and C is output to the frame memory 11 as the pixel value of the pixel of interest, and the process proceeds to step S23. Furthermore, when it is determined in step S18 that both the variables B and C are 0, the process proceeds to step S21, in which the pixel value of the target pixel remains 0, and step S2
Go to 3.

【０１０４】一方、ステップＳ１８において、変数Ｂが
０でなく、Ｃが０であると判定された場合、ステップＳ
１９に進み、変数Ｂの値が、注目画素の画素値として、
フレームメモリ１１に出力され、ステップＳ２３に進
む。ステップＳ２３では、演算器１０からのマクロブロ
ックを構成する画素のうち、ステップＳ２でそのまま出
力した画素を除いたものすべてについて処理を行ったか
どうかが判定され、まだ行っていないと判定された場
合、ステップＳ１７に戻り、まだ、注目画素とされてい
ない画素が、新たに注目画素として選択され、以下、同
様の処理が繰り返される。On the other hand, if it is determined in step S18 that the variable B is not 0 and C is 0, then step S
In step 19, the value of the variable B is the pixel value of the target pixel,
It is output to the frame memory 11, and the process proceeds to step S23. In step S23, it is determined whether or not all the pixels forming the macroblock from the arithmetic unit 10 except the pixels output as they are in step S2 have been processed. If it is determined that the processing has not been performed, Returning to step S17, a pixel that has not yet been set as a target pixel is newly selected as a target pixel, and the same processing is repeated thereafter.

【０１０５】また、ステップＳ２３において、演算器１
０からのマクロブロックを構成する画素のうち、ステッ
プＳ２でそのまま出力した画素を除いたものすべてにつ
いて処理を行ったと判定された場合、ステップＳ２４に
進み、既にフレームメモリ１１に出力された画素のう
ち、まだフレームメモリ１１に出力されていない各画素
（以下、適宜、未出力画素という）に最も近いものが検
出される。さらに、ステップＳ２４では、その検出され
た画素の画素値が、未出力画素の画素値として、フレー
ムメモリ１１に出力され、パディング処理を終了する。
なお、既に、フレームメモリ１１に出力された画素の中
で、未出力画素に最も近いものが、２個以上検出された
場合には、それらの画素値の平均値が、未出力画素の画
素値として出力される。In step S23, the arithmetic unit 1
When it is determined that all the pixels forming the macro block from 0 except the pixel output as it is in step S2 are processed, the process proceeds to step S24, and among the pixels already output to the frame memory 11. The pixel closest to each pixel that has not been output to the frame memory 11 (hereinafter, appropriately referred to as a non-output pixel) is detected. Further, in step S24, the pixel value of the detected pixel is output to the frame memory 11 as the pixel value of the non-output pixel, and the padding process ends.
In addition, when two or more pixels that are closest to the non-output pixel have already been detected among the pixels output to the frame memory 11, the average value of those pixel values is the pixel value of the non-output pixel. Is output as.

【０１０６】以上のようなパディング処理を行うこと
で、画像オブジェクトの外側を構成する画素が、いわば
補間され、これにより、モスキートノイズの低減化およ
び動き補償の効率化を図ることができる。By performing the padding processing as described above, the pixels constituting the outside of the image object are interpolated, so to speak, whereby the mosquito noise can be reduced and the motion compensation efficiency can be improved.

【０１０７】次に、図１のＶＬＣ器６（符号化手段）の
処理について、さらに説明する。Next, the processing of the VLC unit 6 (encoding means) of FIG. 1 will be further described.

【０１０８】ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，ＶＯ，Ｖ
ＯＬ，ＧＯＶ，ＶＯＰそれぞれのヘッダに、本来配置す
べき情報を配置し、さらに、量子化器５の出力の可変長
符号化結果を配置することで、符号化ビットストリーム
を構成し、送信バッファ７に出力する。The VLC device 6 has VS, VISO, VO, V
Information to be originally arranged is arranged in the headers of OL, GOV, and VOP, and further, the variable-length coding result of the output of the quantizer 5 is arranged to form a coded bit stream, and the transmission buffer 7 Output to.

【０１０９】また、ＶＬＣ器６は、ＧＯＶより上位の階
層であるＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダに配
置した情報を、バッファ１６に出力して記憶させる。Further, the VLC unit 6 outputs the information arranged in the headers of VS, VISO, VO, and VOL, which are higher layers than the GOV, to the buffer 16 and stores it therein.

【０１１０】その後、ＶＬＣ器６は、ＧＯＶヘッダを出
力するとき、バッファ１６に記憶されている、ＧＯＶよ
り上位の階層のＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報を読み出し、ＧＯＶヘッダの所定の位置に挿入
して（含めて）出力する。従って、この場合、ＧＯＶヘ
ッダには、そこに本来配置すべき情報の他、ＶＳ，ＶＩ
ＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報も配置される。After that, when outputting the GOV header, the VLC unit 6 reads the information of the VS, VISO, VO, and VOL headers of the hierarchy higher than the GOV, which is stored in the buffer 16, and determines the predetermined GOV header. Insert (include) in the position of and output. Therefore, in this case, in addition to the information that should be originally placed in the GOV header, VS, VI
Information of each header of SO, VO, and VOL is also arranged.

【０１１１】図１４は、以上のような処理を行うＶＬＣ
器６が出力するＧＯＶのシンタクスを示している。な
お、図１４において影を付してある部分が、図８に示し
たＦＣＤにおけるシンタクスと異なる部分となってい
る。FIG. 14 shows a VLC which performs the above processing.
The syntax of GOV output by the device 6 is shown. The shaded area in FIG. 14 is different from the syntax in the FCD shown in FIG.

【０１１２】group_VOP_start_codeは、GOVの開始位置
を示す32ビットのユニークなコードである。time_code
（時刻情報）は、１８bitで構成され、GOVにおいて、最
初に表示されるＶＯＰの秒精度の表示時刻を表す。この
time_codeは、IEC standardpublication 461で規定され
ている「time and control codes for video tape reco
rders」に相当する。Group_VOP_start_code is a 32-bit unique code indicating the start position of GOV. time_code
The (time information) is composed of 18 bits and represents the display time with the second precision of the VOP first displayed in the GOV. this
time_code is `` time and control codes for video tape reco '' specified in IEC standard publication 461.
"rders".

【０１１３】closed_gopおよびbroken_linkについて
は、MPEG4VideoFCD規格(ISO/IEC 14496-2)を参照された
い。For closed_gop and broken_link, refer to the MPEG4 Video FCD standard (ISO / IEC 14496-2).

【０１１４】is_extension（ヘッダ情報有無フラグ）
は、本実施の形態で導入した１ビットのフラグで、GOV
ヘッダに、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダ
の、デコーダの初期化を行うための情報、その他の情報
を含めるかどうかを表す。ＶＬＣ器６では、例えば、フ
ラグis_extensionが１の場合、ＶＳ，ＶＩＳＯ，ＶＯ，
ＶＯＬの各ヘッダの情報（VisualObjectSequence(), Vi
sualObject(), VideoObject(), VideoObjectLayer()）
が、GOVヘッダに含められる。即ち、フラグis_extensio
nが１の場合、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報は、group_VOP_start_code，time_code，close
d_gop，broken_link,is_extensionの後に続けて配置さ
れる。Is_extension (header information presence / absence flag)
Is a 1-bit flag introduced in this embodiment.
It indicates whether or not the header includes information for initializing the decoder of the VS, VISO, VO, and VOL headers and other information. In the VLC device 6, for example, when the flag is_extension is 1, VS, VISO, VO,
Information of each VOL header (VisualObjectSequence (), Vi
sualObject (), VideoObject (), VideoObjectLayer ())
Is included in the GOV header. That is, the flag is_extensio
When n is 1, the information of each header of VS, VISO, VO, and VOL is group_VOP_start_code, time_code, close.
Placed after d_gop, broken_link, is_extension.

【０１１５】さらに、フラグis_extensionが１の場合
は、ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各
ヘッダの情報を、GOVヘッダに含めた後、その含めたＶ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報を、バッ
ファ１６に供給し、いままで記憶されていた情報に替え
て記憶させる。Further, when the flag is_extension is 1, the VLC unit 6 includes the information of each header of VS, VISO, VO, and VOL in the GOV header, and then includes the included V.
The information of each header of S, VISO, VO, and VOL is supplied to the buffer 16 and stored in place of the information stored so far.

【０１１６】なお、ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，Ｖ
Ｏ，ＶＯＬの各ヘッダを、その後に出力したときも、そ
のヘッダの情報をバッファ１６に供給して記憶させる。The VLC device 6 is provided with VS, VISO, V
Even when the O and VOL headers are subsequently output, the information of the headers is supplied to the buffer 16 and stored therein.

【０１１７】従って、バッファ１６には、常に最新のＶ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬのヘッダの情報が記憶され
ていることになる。Therefore, the buffer 16 always has the latest V.
The information on the headers of S, VISO, VO, and VOL is stored.

【０１１８】ここで、フラグis_extensionが１の場合
に、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報
を、GOVヘッダに含めた後、その含めたＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報を、バッファ１６に
供給して記憶させるのは、次のような理由による。Here, when the flag is_extension is 1, after the information of each header of VS, VISO, VO, and VOL is included in the GOV header, the included VS, VIS is included.
The information of each header of O, VO, and VOL is supplied to the buffer 16 and stored therein for the following reason.

【０１１９】即ち、ＶＬＣ器６には、符号化効率を向上
させる等のため、GOVヘッダに含めさせるＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報を変更させることが
できる。この場合、その変更後の情報が最新の情報とい
うことになるので、その最新の情報を、バッファ１６に
記憶させておくために、GOVヘッダに含めたＶＳ，ＶＩ
ＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報を、バッファ１６
に供給して記憶させるようになされている。That is, the VLC unit 6 includes VS and VIS to be included in the GOV header in order to improve coding efficiency.
The information of each header of O, VO, and VOL can be changed. In this case, since the changed information is the latest information, the VS and VI included in the GOV header are stored in order to store the latest information in the buffer 16.
Information of each header of SO, VO, and VOL is stored in the buffer 16
It is designed to be supplied to and stored in.

【０１２０】一方、フラグis_extensionが０の場合、Ｖ
ＬＣ器６では、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報は、GOVヘッダに含められない。On the other hand, when the flag is_extension is 0, V
In the LC device 6, the information of each VS, VISO, VO, and VOL header is not included in the GOV header.

【０１２１】なお、バッファ１６の記憶値は、外部から
変更することが可能なようになっている。即ち、ＶＳ，
ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報の一部または
全部を、符号化ビットストリームの途中で変化させたい
場合がある。即ち、例えば、デコードに用いる量子化マ
トリクスを、符号化ビットストリームの復号の途中で変
更したい場合などがある。このような場合、ユーザは、
バッファ１６に記憶されているＶＳ，ＶＩＳＯ，ＶＯ，
ＶＯＬの各ヘッダの情報を、適宜、所望の情報に変更す
ることができる。この変更後の情報は、フラグis_exten
sionが１になっているGOVヘッダに配置されて出力され
るから、デコーダでは、そのGOVヘッダを受信した後
に、その変更後の情報に基づいて、デコードが行われる
ことになる。The value stored in the buffer 16 can be changed externally. That is, VS,
There are cases where it is desired to change some or all of the information in each header of VISO, VO, and VOL in the middle of the encoded bitstream. That is, for example, there is a case where it is desired to change the quantization matrix used for decoding during the decoding of the encoded bitstream. In such cases, the user
VS, VISO, VO stored in the buffer 16
The information in each header of the VOL can be appropriately changed to desired information. The information after this change is the flag is_exten
Since the sion is placed in the GOV header and is output, the decoder receives the GOV header and then decodes it based on the changed information.

【０１２２】次に、図１５のフローチャートを参照し
て、図１４に示したようなシンタクスのGOVを出力する
ためのＶＬＣ器６の処理について説明する。Next, the processing of the VLC unit 6 for outputting the GOV of the syntax shown in FIG. 14 will be described with reference to the flowchart of FIG.

【０１２３】ＶＬＣ器６は、上述したように、ＶＳ，Ｖ
ＩＳＯ，ＶＯ，ＶＯＬ，ＧＯＶ，ＶＯＰそれぞれのヘッ
ダに、本来配置すべき情報を配置し、さらに、量子化器
５の出力の可変長符号化結果を配置することで、符号化
ビットストリームを構成し、送信バッファ７に出力して
いる。As described above, the VLC device 6 has VS, V
An encoded bit stream is formed by arranging information to be originally arranged in the headers of ISO, VO, VOL, GOV, and VOP, and further arranging the variable-length coding result of the output of the quantizer 5. , To the transmission buffer 7.

【０１２４】さらに、ＶＬＣ器６は、ＶＳ，ＶＩＳＯ，
ＶＯ，ＶＯＬの各ヘッダを出力するごとに、各ヘッダに
配置した情報を、バッファ１６に出力して記憶させてい
る（上書きしている）。Further, the VLC device 6 has VS, VISO,
Each time each header of VO and VOL is output, the information arranged in each header is output to the buffer 16 and stored (overwritten).

【０１２５】そして、ＶＬＣ器６は、ＧＯＶヘッダを出
力する場合には、ステップＳ３１において、そのＧＯＶ
ヘッダについてのフラグis_extensionが１であるかどう
かを判定する。ステップＳ１において、フラグis_exten
sionが１でない（０である）と判定された場合、ＶＬＣ
器６は、ＧＯＶヘッダに、本来配置すべき情報（図８に
示した情報）およびフラグis_extensionを配置し（Ｖ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報は配置し
ない）、その結果得られるＧＯＶヘッダを出力する。そ
して、次のＧＯＶヘッダを出力するタイミングまで待っ
て、ステップＳ３１に戻る。When outputting the GOV header, the VLC unit 6 determines the GOV header in step S31.
It is determined whether the flag is_extension for the header is 1. In step S1, the flag is_exten
If it is determined that sion is not 1 (0), VLC
The device 6 arranges the information that should be originally arranged (information shown in FIG. 8) and the flag is_extension in the GOV header (V
The information of each header of S, VISO, VO, and VOL is not arranged), and the GOV header obtained as a result is output. Then, after waiting for the timing of outputting the next GOV header, the process returns to step S31.

【０１２６】一方、ステップＳ３１において、フラグis
_extensionが１であると判定された場合、ステップＳ３
２に進み、バッファ１６に記憶されているＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの最新の情報を読み出し、
その最近の情報およびフラグis_extension、並びに本来
配置すべき情報を、ＧＯＶヘッダに配置して出力する。
そして、次のＧＯＶヘッダを出力するタイミングまで待
って、ステップＳ３１に戻る。On the other hand, in step S31, the flag is
If it is determined that _extension is 1, step S3
2 proceeds to VS, VIS stored in the buffer 16
Read the latest information of each header of O, VO, VOL,
The latest information, the flag is_extension, and the information to be originally placed are placed in the GOV header and output.
Then, after waiting for the timing of outputting the next GOV header, the process returns to step S31.

【０１２７】なお、各ＧＯＶに配置されるフラグis_ext
ensionの値は、例えば、エンコーダの管理者側におい
て、あらかじめ、ＶＬＣ器に設定されている。The flag is_ext allocated to each GOV
The value of the tension is set in the VLC device in advance on the administrator side of the encoder, for example.

【０１２８】次に、図１６は、記録媒体２０１または伝
送媒体２０２を介して提供される符号化ビットストリー
ムを復号するデコーダの一実施の形態の構成例を示して
いる。このデコーダを構成するバッファ２１、ＩＶＬＣ
器２２，逆量子化器２３，ＩＤＣＴ器２４，演算器２
５、フレームメモリ２６、動き補償器２７は、図２５に
示したデコーダを構成するバッファ１０１、ＩＶＬＣ器
１０２，逆量子化器１０３，ＩＤＣＴ器１０４，演算器
１０５、フレームメモリ１０６、動き補償器１０７にそ
れぞれ対応している。従って、バッファ２１乃至動き補
償器２７それぞれでは、バッファ１０１乃至動き補償器
１０７それぞれの処理と同一の処理が行われる場合があ
り、そのような同一の処理についての説明は、適宜省略
する。Next, FIG. 16 shows an example of the configuration of an embodiment of a decoder for decoding a coded bit stream provided via the recording medium 201 or the transmission medium 202. Buffer 21 and IVLC that constitute this decoder
Device 22, inverse quantizer 23, IDCT device 24, calculator 2
5, a frame memory 26, and a motion compensator 27 are a buffer 101, an IVLC unit 102, an inverse quantizer 103, an IDCT unit 104, an arithmetic unit 105, a frame memory 106, and a motion compensator 107 which constitute the decoder shown in FIG. It corresponds to each. Therefore, the buffer 21 to the motion compensator 27 may perform the same processing as the buffer 101 to the motion compensator 107, respectively, and the description of the same processing will be appropriately omitted.

【０１２９】記録媒体２０１または伝送媒体２０２を介
して提供される符号化ビットストリームは、受信バッフ
ァ２１（受信手段）で受信されて一時記憶される。そし
て、受信バッファ２１に記憶された符号化ビットストリ
ームは、適宜、ＩＶＬＣ（可変長復号）器２２によって
読み出される。The coded bit stream provided via the recording medium 201 or the transmission medium 202 is received by the reception buffer 21 (reception means) and temporarily stored. Then, the coded bit stream stored in the reception buffer 21 is read by the IVLC (variable length decoding) unit 22 as appropriate.

【０１３０】ＩＶＬＣ器２２（復号手段）は、受信バッ
ファ２１から読み出した符号化ビットストリームを可変
長復号し、動きベクトルおよび予測モードを、動き補償
器２７に、また、量子化ステップを逆量子化器２３に、
それぞれ出力するとともに、可変長復号された画像デー
タ（量子化されたＤＣＴ係数）を、逆量子化器２３に出
力する。なお、ＩＶＬＣ器２２は、その他、各階層のヘ
ッダに含まれている、デコーダのデコード処理に用いら
れるパラメータの初期化に必要な情報、その他の情報
（例えば、オーバラップ動き補償を行うかどうかを示す
フラグや、量子化マトリクスなど）を、適宜、必要なブ
ロックに供給する（例えば、オーバラップ動き補償を行
うかどうかを示すフラグは動き補償器２７に、量子化マ
トリクスは逆量子化器２３に、それぞれ供給される）The IVLC unit 22 (decoding means) performs variable length decoding of the coded bit stream read from the reception buffer 21, the motion vector and the prediction mode to the motion compensator 27, and the quantization step to the inverse quantization. In the vessel 23,
The image data (quantized DCT coefficient) subjected to variable length decoding is output to the inverse quantizer 23 while being output. The IVLC unit 22 also determines other information necessary for initializing the parameters included in the header of each layer and used in the decoding process of the decoder, and other information (for example, whether to perform overlap motion compensation). A flag indicating a quantization matrix or the like is appropriately supplied to a necessary block (for example, a flag indicating whether or not to perform overlap motion compensation is supplied to the motion compensator 27, and a quantization matrix is supplied to the inverse quantizer 23). , Each supplied)

【０１３１】さらに、ＩＶＬＣ器２２は、ＧＯＶヘッダ
については、フラグis_extensionを復号し、フラグis_e
xtensionが1である場合、即ち、ＧＯＶヘッダに、Ｖ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報が含まれ
ている場合、その情報も、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯ
Ｌの各ヘッダと同様に可変長復号し、その復号結果を、
必要なブロックに供給する。具体的には、例えば、動き
ベクトル、予測モード、オーバーラップ動き補償を行う
かどうかを示すフラグなどは動き補償器２７に、量子化
ステップおよび量子化マトリクスなどは逆量子化器２３
に、それぞれ供給される。Furthermore, the IVLC unit 22 decodes the flag is_extension for the GOV header, and the flag is_e.
When xtension is 1, that is, in the GOV header, V
When the information of each header of S, VISO, VO, and VOL is included, the information is also VS, VISO, VO, and VO.
Variable length decoding is performed in the same way as for each header of L, and the decoding result is
Supply the required blocks. Specifically, for example, a motion vector, a prediction mode, a flag indicating whether or not to perform overlap motion compensation, and the like are provided to the motion compensator 27, and a quantization step, a quantization matrix, and the like are provided to the dequantizer 23.
, Respectively.

【０１３２】また、ＩＶＬＣ器２２は、符号化ビットス
トリームに含まれるフラグFSZおよびFPOSを復号し、フ
レームメモリ２６、動き補償器２７、およびキー信号復
号器２９に供給する。さらに、ＩＶＬＣ器２２は、符号
化ビットストリームに含まれる、符号化されたキー信号
（キー信号ビットストリーム）を抽出し、キー信号復号
器２９に供給する。The IVLC unit 22 also decodes the flags FSZ and FPOS contained in the encoded bit stream and supplies them to the frame memory 26, motion compensator 27, and key signal decoder 29. Further, the IVLC unit 22 extracts the encoded key signal (key signal bit stream) included in the encoded bit stream and supplies it to the key signal decoder 29.

【０１３３】キー信号復号器２９は、ＩＶＬＣ器２２よ
り供給されるキー信号ビットストリームを復号する。こ
の復号されたキー信号は、IDCT器２４、動き補償器２
７、および画素置換器２８に供給される。The key signal decoder 29 decodes the key signal bit stream supplied from the IVLC unit 22. This decoded key signal is used for the IDCT unit 24 and the motion compensator 2
7 and the pixel replacer 28.

【０１３４】逆量子化器２３は、ＩＶＬＣ器２２より供
給される画像データを、同じくＩＶＬＣ器２２より供給
される量子化ステップに従って逆量子化し、IDCT器２４
に出力する。ＩＤＣＴ器２４は、逆量子化器２３より出
力されたデータ（DCT係数）に対して、逆DCT処理を施
し、演算器２５に供給する。The inverse quantizer 23 inversely quantizes the image data supplied from the IVLC unit 22 in accordance with the quantization step also supplied from the IVLC unit 22, and the IDCT unit 24
Output to. The IDCT device 24 performs inverse DCT processing on the data (DCT coefficient) output from the inverse quantizer 23, and supplies the data to the calculator 25.

【０１３５】演算器２５は、IDCT器２４より供給された
画像データが、Ｉ−ＶＯＰのデータである場合、そのデ
ータを、その後に入力される画像データ（ＰまたはＢ−
ＶＯＰのデータ）の予測画像の生成のために、そのま
ま、画素置換器２８を介してフレームメモリ２６に供給
して記憶させる。When the image data supplied from the IDCT device 24 is I-VOP data, the calculator 25 converts the image data into image data (P or B-
In order to generate a predicted image of (VOP data), it is directly supplied to the frame memory 26 via the pixel replacer 28 and stored therein.

【０１３６】なお、画素置換器２８では、図１２の画素
置換器１５と同様の処理が行われる。The pixel replacer 28 performs the same processing as the pixel replacer 15 of FIG.

【０１３７】一方、演算器２５に供給されるデータが、
ＰまたはＢ−ＶＯＰのデータである場合、動き補償器２
７は、ＩＶＬＣ器２２より供給される動きベクトルおよ
び予測モードに従って、フレームメモリ２６に記憶され
た、既に復号されている画像を読み出すことで、予測画
像を生成し、演算器２５に出力する。演算器２５ではID
CT器２４より供給される画像データ（差分データ）と、
動き補償器２７より供給される予測画像データを加算
し、復号画像とする。この復号画像は、画素置換器２８
を介してフレームメモリ２６に供給されて記憶され、後
に復号する画像の参照画像（予測画像を生成するために
参照される画像）として、適宜用いられる。また、フレ
ームメモリ２６に記憶された復号画像は、上述したよう
に参照画像として用いられる他、適宜読み出され、例え
ば、図示せぬディスプレイなどに供給されて表示され
る。On the other hand, the data supplied to the arithmetic unit 25 is
If it is P or B-VOP data, the motion compensator 2
7 reads out the already decoded image stored in the frame memory 26 according to the motion vector and the prediction mode supplied from the IVLC unit 22, thereby generating a predicted image and outputting it to the calculator 25. ID in the calculator 25
Image data (difference data) supplied from the CT device 24,
The predicted image data supplied from the motion compensator 27 is added to obtain a decoded image. This decoded image is used as the pixel replacer 28.
The image is supplied to and stored in the frame memory 26 via, and is appropriately used as a reference image (image referred to for generating a predicted image) of an image to be decoded later. The decoded image stored in the frame memory 26 is used as a reference image as described above, and is also read as appropriate and supplied to, for example, a display (not shown) for display.

【０１３８】次に、図１７のフローチャートを参照し
て、図１６のＩＶＬＣ器２２がＧＯＶヘッダに関して行
う処理について、さらに説明する。Next, with reference to the flowchart of FIG. 17, the processing performed by the IVLC unit 22 of FIG. 16 with respect to the GOV header will be further described.

【０１３９】ＩＶＬＣ器２２は、ＧＯＶヘッダを受信す
ると、そのＧＯＶヘッダについて、通常行うべき処理
（図８に示したＧＯＶヘッダが送信されてきたときに行
うべき処理）を行い、さらに、ステップＳ４１におい
て、ＧＯＶヘッダ（図１４）に配置されているフラグis
_extensionが1であるかどうかを判定する。ステップＳ
４１において、フラグis_extensionが1でない（０であ
る）と判定された場合、即ち、ＧＯＶヘッダに、ＶＳ，
ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報が含まれてい
ない場合、次のＧＯＶヘッダが送信されてくるのを待っ
て、ステップＳ４１に戻る。Upon receiving the GOV header, the IVLC unit 22 performs the processing that should normally be performed on the GOV header (the processing that should be performed when the GOV header shown in FIG. 8 is transmitted), and at step S41. , Flag is located in the GOV header (Fig. 14)
Determine whether _extension is 1. Step S
In 41, when it is determined that the flag is_extension is not 1 (is 0), that is, in the GOV header, VS,
When the information of each header of VISO, VO, and VOL is not included, it waits for the next GOV header to be transmitted, and returns to step S41.

【０１４０】また、ステップＳ４１において、フラグis
_extensionが1であると判定された場合、即ち、ＧＯＶ
ヘッダに、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの
情報が含まれている場合、ステップＳ４２に進み、ＩＶ
ＬＣ器２２は、そのＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各
ヘッダの情報を、必要なブロックに供給し、次のＧＯＶ
ヘッダが送信されてくるのを待って、ステップＳ４１に
戻る。In step S41, the flag is
When it is determined that _extension is 1, that is, GOV
When the header includes information of each header of VS, VISO, VO, and VOL, the process proceeds to step S42 and IV
The LC unit 22 supplies the information of the respective headers of VS, VISO, VO, and VOL to the necessary block, and the next GOV.
After waiting for the header to be transmitted, the process returns to step S41.

【０１４１】次に、ＧＯＶは、図１４に示したシンタク
スの他、例えば、図１８に示すシンタクスのように構成
することも可能である。Next, the GOV can be configured as the syntax shown in FIG. 18, for example, in addition to the syntax shown in FIG.

【０１４２】即ち、図１８は、ＧＯＶのシンタクスの他
の例を示している。なお、図１４と図１８とでは、brok
en_linkの下からnext_start_codeの上までの間が異なっ
ている。また、図１８において影を付してある部分が、
図８に示したＦＣＤにおけるシンタクスと異なる部分と
なっている。That is, FIG. 18 shows another example of the syntax of GOV. In addition, in FIG. 14 and FIG.
There is a difference between the bottom of en_link and the top of next_start_code. In addition, the shaded portion in FIG.
This is a part different from the syntax in the FCD shown in FIG.

【０１４３】図１４の実施の形態では、フラグis_exten
sionにより、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダ
の情報を、ＧＯＶヘッダにおいて伝送するかどうかだけ
が設定可能であったが、図１８の実施の形態では、フラ
グload_data_typeを採用することにより、ＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報すべてを、ＧＯＶヘ
ッダにおいて伝送するかどうかだけでなく、それらの情
報の一部のみを伝送するような設定も可能になってい
る。即ち、図１８の実施の形態では、ＶＳ，ＶＩＳＯ，
ＶＯ，ＶＯＬの各ヘッダの情報の一部だけを、ＧＯＶヘ
ッダに含ませることが可能であり、フラグload_data_ty
peによれば、そのＧＯＶヘッダに含ませる一部の情報を
識別することができるようになされている。In the embodiment shown in FIG. 14, the flag is_exten is used.
It was possible to set only whether or not the information of each header of VS, VISO, VO, and VOL is transmitted in the GOV header by sion, but in the embodiment of FIG. 18, by adopting the flag load_data_type, VS , VIS
It is possible to set not only whether all the information in each of the O, VO, and VOL headers is transmitted in the GOV header, but also to transmit only a part of the information. That is, in the embodiment of FIG. 18, VS, VISO,
It is possible to include only a part of the information of each header of VO and VOL in the GOV header, and the flag load_data_ty
According to pe, some information included in the GOV header can be identified.

【０１４４】具体的には、図１８において、load_data_
typeは、可変長符号で、この直後に、ランダムアクセス
時にデコーダを初期化するための情報等を伝送するかど
うかと、伝送する場合には、その伝送する情報の種類を
示す。即ち、例えば、図１９に示すように、load_data_
typeが'1'のときには、ＧＯＶ層より上位階層のヘッダ
の情報は、ＧＯＶには含められない。また、load_data_
typeが'01'のときには、図１４の実施の形態においてフ
ラグis_extensionが１である場合と同様に、ＶＳ，ＶＩ
ＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報（VisualObjectSe
quence(), VisualObject(), VideoObject(), VideoObje
ctLayer()）のすべてが、ＧＯＶに含められる。さら
に、load_data_typeが'001'のときには、ＶＳ，ＶＩＳ
Ｏ，ＶＯ，ＶＯＬの各ヘッダの情報のうち、予め定めら
れた所定のパラメータの情報が、ＧＯＶに含められる。Specifically, in FIG. 18, load_data_
The type is a variable length code, and immediately after this, indicates whether or not to transmit information for initializing the decoder at the time of random access and, if so, the type of information to be transmitted. That is, for example, as shown in FIG. 19, load_data_
When the type is "1", the information of the header in the upper hierarchy than the GOV layer is not included in the GOV. Also, load_data_
When type is “01”, VS and VI are the same as when the flag is_extension is 1 in the embodiment of FIG.
Information of each header of SO, VO, VOL (VisualObjectSe
quence (), VisualObject (), VideoObject (), VideoObje
All of ctLayer ()) are included in the GOV. Furthermore, when load_data_type is '001', VS, VIS
Of the information of each header of O, VO, and VOL, information of a predetermined parameter that is determined in advance is included in the GOV.

【０１４５】即ち、図１８の実施の形態において、load
_data_typeが'001'のときにＧＯＶに含められる情報
は、download_parameters()として規定されている。That is, in the embodiment of FIG. 18, load
Information included in GOV when _data_type is “001” is defined as download_parameters ().

【０１４６】ここで、本実施の形態では、download_par
ameters()は、例えば、図２０に示すように規定されて
いる。Here, in the present embodiment, download_par
ameters () is defined as shown in FIG. 20, for example.

【０１４７】図２０において、フラグobmc_disableは、
オーバーラップ動き補償を用いるかどうかを示す１ビッ
トのフラグである。この値が、'1'である場合には、オ
ーバーラップ動き補償は用いられず、'0'である場合に
は、オーバーラップ動き補償が用いられる。フラグquan
t_typeは、逆量子化の方法を示す１ビットのフラグであ
る。この値が'0'である場合には、H.263に規定されてい
る逆量子化方法を用いて逆量子化が行われ、'1'である
場合には、MPEG2に規定されている逆量子化方法を用い
て逆量子化が行われる。MPEG2に規定されている逆量子
化方法を用いる場合には、さらに、量子化マトリクスを
ダウンロードするかどうかを示すフラグが伝送される。
また、量子化マトリクスをダウンロードする場合には、
そのダウンロードする量子化マトリクスも伝送される。In FIG. 20, the flag obmc_disable is
This is a 1-bit flag indicating whether to use overlap motion compensation. If this value is '1', overlap motion compensation is not used, and if it is '0', overlap motion compensation is used. Flag quan
t_type is a 1-bit flag indicating the method of inverse quantization. If this value is '0', inverse quantization is performed using the inverse quantization method specified in H.263, and if it is '1', the inverse quantization specified in MPEG2 is performed. Inverse quantization is performed using the quantization method. When using the inverse quantization method defined in MPEG2, a flag indicating whether or not to download the quantization matrix is further transmitted.
Also, when downloading the quantization matrix,
The downloaded quantization matrix is also transmitted.

【０１４８】その他、図２０のdownload_parameters()
において規定されているload_intra_quant_mat, intra_
quant_mat, load_nonintra_quant_mat, nonintra_quant
_mat,load_intra_quant_mat_grayscale, iontra_quant_
mat_grayscale, load_nonintra_quant_mat_grayscale,
nonintra_quant_mat_grayscaleのセマンティクスは、Ｆ
ＣＤにおけるＶＯＬ（図５乃至図７）で規定されている
同名のフラグのセマンティクスと同様である。In addition, download_parameters () shown in FIG.
Load_intra_quant_mat, intra_ specified in
quant_mat, load_nonintra_quant_mat, nonintra_quant
_mat, load_intra_quant_mat_grayscale, iontra_quant_
mat_grayscale, load_nonintra_quant_mat_grayscale,
The semantics of nonintra_quant_mat_grayscale are F
This is the same as the semantics of the flag of the same name defined in the VOL (FIGS. 5 to 7) on the CD.

【０１４９】なお、図１９の実施の形態では、フラグlo
ad_data_typeについて、３通りの場合しか規定していな
いが、４通り以上の場合を規定することも可能である。
この場合、図２０のdownload_parameters()で規定され
る情報の組み合わせとは異なる組み合わせの情報を、Ｇ
ＯＶヘッダに配置することが可能となる。In the embodiment of FIG. 19, the flag lo
Regarding ad_data_type, only three cases are specified, but it is also possible to specify four or more cases.
In this case, the information of the combination different from the combination of the information defined by download_parameters () in FIG.
It becomes possible to arrange in the OV header.

【０１５０】図２１は、図１８に示したＧＯＶヘッダを
出力するエンコーダの一実施の形態の構成例を示してい
る。なお、図中、図１２における場合と対応する部分に
ついては、同一の符号を付してある。即ち、図２１のエ
ンコーダは、パーサ１７（選択手段）が新たに設けられ
ている他は、図１２における場合と同様に構成されてい
る。FIG. 21 shows a configuration example of an embodiment of an encoder for outputting the GOV header shown in FIG. In addition, in the figure, the same reference numerals are given to the portions corresponding to the case in FIG. That is, the encoder of FIG. 21 is configured similarly to the case of FIG. 12, except that the parser 17 (selection means) is newly provided.

【０１５１】パーサ（フラグ識別器）１７は、ＶＬＣ器
６が出力しようとしているＧＯＶヘッダについてのフラ
グload_data_typeを参照し、そのフラグload_data_type
にしたがって、バッファ１６から情報を読み出し、ＶＬ
Ｃ器６に供給する。ＶＬＣ器６では、パーサ１７から供
給される情報が、フラグload_data_typeとともに、ＧＯ
Ｖヘッダの図１８に示した所定の位置に配置されて出力
される。The parser (flag discriminator) 17 refers to the flag load_data_type regarding the GOV header which the VLC unit 6 is about to output, and the flag load_data_type.
Information is read from the buffer 16 according to
It is supplied to the C unit 6. In the VLC unit 6, the information supplied from the parser 17 together with the flag load_data_type is GO.
The V header is arranged and output at a predetermined position shown in FIG.

【０１５２】次に、図２２のフローチャートを参照し
て、図１８に示したようなシンタクスのGOVをＶＬＣ器
６に出力させるためのパーサ１７の処理について説明す
る。Next, the processing of the parser 17 for outputting the GOV having the syntax shown in FIG. 18 to the VLC device 6 will be described with reference to the flowchart of FIG.

【０１５３】ＶＬＣ器６は、ＧＯＶヘッダを出力するタ
イミングで、そのＧＯＶヘッダについてのフラグload_d
ata_typeを、パーサ１７に供給する。パーサ１７は、Ｖ
ＬＣ器６からのフラグload_data_typeを受信し、ステッ
プＳ５１において、その値を判定する。ステップＳ５１
において、フラグload_data_typeが１であると判定され
た場合、パーサ１７は、ＶＬＣ器６に対して、何も出力
せず、次のＧＯＶヘッダに配置されたフラグload_data_
typeが、ＶＬＣ器６から送信されてくるのを待って、ス
テップＳ５１に戻る。この場合、ＶＬＣ器６では、ＧＯ
Ｖヘッダに、本来配置すべき情報およびload_data_type
を配置し、その結果得られるＧＯＶヘッダを出力する。The VLC unit 6 outputs the GOV header flag load_d at the timing of outputting the GOV header.
The ata_type is supplied to the parser 17. Parser 17 is V
The flag load_data_type is received from the LC device 6, and its value is determined in step S51. Step S51
When it is determined that the flag load_data_type is 1, the parser 17 outputs nothing to the VLC unit 6 and the flag load_data_ arranged in the next GOV header.
After waiting for the type to be transmitted from the VLC unit 6, the process returns to step S51. In this case, in the VLC device 6, GO
Information that should be placed in the V header and load_data_type
And output the resulting GOV header.

【０１５４】また、ステップＳ５１において、フラグlo
ad_data_typeが０１であると判定された場合、ステップ
Ｓ５２に進み、パーサ１７は、バッファ１６から、Ｖ
Ｓ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの最新の情報を
読み出し、ＶＬＣ器６に供給する。そして、次のＧＯＶ
ヘッダに配置されたフラグload_data_typeが、ＶＬＣ器
６から送信されてくるのを待って、ステップＳ５１に戻
る。従って、この場合、ＶＬＣ器６では、ＧＯＶヘッダ
に、本来配置すべき情報およびload_data_typeの他に、
ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報も配置
される。In step S51, the flag lo
When it is determined that ad_data_type is 01, the process proceeds to step S52, where the parser 17 reads V from the buffer 16.
The latest information in each header of S, VISO, VO, and VOL is read and supplied to the VLC unit 6. And the next GOV
After waiting for the flag load_data_type arranged in the header to be transmitted from the VLC unit 6, the process returns to step S51. Therefore, in this case, in the VLC unit 6, in addition to the information and load_data_type that should be originally arranged in the GOV header,
Information on each header of VS, VISO, VO, and VOL is also arranged.

【０１５５】一方、ステップＳ５１において、フラグlo
ad_data_typeが００１であると判定された場合、ステッ
プＳ５３に進み、パーサ１７は、バッファ１６に記憶さ
れている情報のうち、図２０に示したdownload_paramet
ers()に含まれるものを選択して読み出し、ＶＬＣ器６
に供給する。そして、次のＧＯＶヘッダに配置されたフ
ラグload_data_typeが、ＶＬＣ器６から送信されてくる
のを待って、ステップＳ５１に戻る。従って、この場
合、ＶＬＣ器６では、ＧＯＶヘッダに、本来配置すべき
情報およびload_data_typeの他に、図２０に示したdown
load_parameters()も配置される。On the other hand, in step S51, the flag lo
If it is determined that ad_data_type is 001, the parser 17 proceeds to step S53, and the parser 17 download_paramet shown in FIG. 20 among the information stored in the buffer 16.
Select the one included in ers () and read it out.
Supply to. Then, after waiting for the flag load_data_type arranged in the next GOV header to be transmitted from the VLC unit 6, the process returns to step S51. Therefore, in this case, in the VLC unit 6, in addition to the information and load_data_type to be originally arranged in the GOV header, down shown in FIG.
load_parameters () is also placed.

【０１５６】次に、図２１のエンコーダから、記録媒体
２０１または伝送媒体２０２を介して提供される符号化
ビットストリームは、図１６に示した構成のデコーダに
よってデコードすることができる。Next, the encoded bit stream provided from the encoder of FIG. 21 via the recording medium 201 or the transmission medium 202 can be decoded by the decoder having the configuration shown in FIG.

【０１５７】図２３は、図１６に示した構成のデコーダ
のＩＶＬＣ器２２が、図１８に示したシンタクスのＧＯ
Ｖヘッダに関して行う処理を説明するためのフローチャ
ートである。FIG. 23 shows that the IVLC unit 22 of the decoder having the configuration shown in FIG. 16 has the syntax GO shown in FIG.
It is a flow chart for explaining processing performed about a V header.

【０１５８】ＩＶＬＣ器２２は、ＧＯＶヘッダを受信す
ると、そのＧＯＶヘッダについて、通常行うべき処理
（図８に示したＧＯＶヘッダが送信されてきたときに行
うべき処理）を行い、さらに、ステップＳ６１におい
て、ＧＯＶヘッダ（図１８）に配置されているフラグlo
ad_data_typeの値を判定する。ステップＳ６１におい
て、フラグload_data_typeが1であると判定された場
合、次のＧＯＶヘッダが送信されてくるのを待って、ス
テップＳ６１に戻る。Upon receiving the GOV header, the IVLC unit 22 performs the processing that should normally be performed on the GOV header (the processing that should be performed when the GOV header shown in FIG. 8 is transmitted), and at step S61. , The flag lo located in the GOV header (FIG. 18)
Determine the value of ad_data_type. When it is determined in step S61 that the flag load_data_type is 1, the process waits for the next GOV header to be transmitted, and the process returns to step S61.

【０１５９】また、ステップＳ６１において、フラグlo
ad_data_typeが０１であると判定された場合、即ち、Ｇ
ＯＶヘッダに、ＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッ
ダの情報が含まれている場合、ステップＳ６２に進み、
ＩＶＬＣ器２２は、フラグload_data_typeに基づいて、
そのＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬの各ヘッダの情報
を、符号化ビットストリームから抽出し、必要なブロッ
クに供給する。即ち、その情報を可変長復号し、その結
果得られる、例えば、動きベクトル、予測モード、およ
びオーバーラップ動き補償を行うかどうかを示すフラグ
を、動き補償器２７に、また、量子化ステップおよび量
子化マトリクスを、逆量子化器２３に、それぞれ供給す
る。そして、次のＧＯＶヘッダが送信されてくるのを待
って、ステップＳ６１に戻る。Further, in step S61, the flag lo
When it is determined that ad_data_type is 01, that is, G
If the OV header includes the information of the VS, VISO, VO, and VOL headers, the process proceeds to step S62.
The IVLC unit 22 uses the flag load_data_type to
The information of the VS, VISO, VO, and VOL headers is extracted from the encoded bitstream and supplied to the necessary blocks. That is, the information is subjected to variable length decoding, and a flag obtained as a result thereof, for example, a motion vector, a prediction mode, and whether or not to perform overlap motion compensation is provided to the motion compensator 27, and the quantization step and the The respective quantization matrices are supplied to the inverse quantizer 23. Then, after waiting for the next GOV header to be transmitted, the process returns to step S61.

【０１６０】一方、ステップＳ６１において、フラグlo
ad_data_typeが００１であると判定された場合、即ち、
ＧＯＶヘッダに、download_parameters()（パラメータ
更新情報）が含まれている場合、ステップＳ６３に進
み、ＩＶＬＣ器２２は、フラグload_data_typeに基づい
て、そのdownload_parameters()を、符号化ビットスト
リームから抽出し、必要なブロックに供給する。即ち、
そのdownload_parameters()を可変長復号し、その結果
得られる、例えば、オーバーラップ動き補償を行うかど
うかを示すフラグを、動き補償器２７に、また、量子化
ステップおよび量子化マトリクスを、逆量子化器２３
に、それぞれ供給する。そして、次のＧＯＶヘッダが送
信されてくるのを待って、ステップＳ６１に戻る。On the other hand, in step S61, the flag lo
When it is determined that ad_data_type is 001, that is,
If the GOV header includes download_parameters () (parameter update information), the IVLC unit 22 extracts the download_parameters () from the encoded bitstream based on the flag load_data_type, and proceeds to step S63. Supply to the block. That is,
The download_parameters () is subjected to variable length decoding, and the resulting flag, for example, indicating whether or not to perform overlap motion compensation is provided to the motion compensator 27, and the quantization step and the quantization matrix are inversely quantized. Bowl 23
, Respectively. Then, after waiting for the next GOV header to be transmitted, the process returns to step S61.

【０１６１】以上のように、ＧＯＶのヘッダに、それに
より上位階層のＶＳ，ＶＩＳＯ，ＶＯ，ＶＯＬのヘッダ
の情報の全部または一部（本実施の形態では、図２０に
示したdownload_parameters()）を含めるようにしたの
で、符号化ビットストリームに対して、ランダムアクセ
ス等し、その途中から、正常な復号を行うことが可能と
なる。さらに、ＧＯＶの先頭で、量子化ステップや量子
化マトリクスを変更することが可能となり、その結果、
効率の良い符号化を行うことができるようになる。As described above, in the GOV header, all or part of the information of the VS, VISO, VO, and VOL headers of the upper layer, according to the header (download_parameters () shown in FIG. 20 in this embodiment). Since it has been included, it is possible to perform random access or the like to the encoded bit stream and perform normal decoding from the middle. Furthermore, it becomes possible to change the quantization step and the quantization matrix at the beginning of the GOV, and as a result,
It becomes possible to perform efficient encoding.

【０１６２】以上、本発明を、ＭＰＥＧ４に基づいた符
号化／復号を行うエンコーダ／デコーダに適用した場合
について説明したが、本発明の適用範囲は、ＭＰＥＧ４
に基づいた符号化／復号に限定されるものではない。The case where the present invention is applied to the encoder / decoder which performs encoding / decoding based on MPEG4 has been described above. However, the applicable range of the present invention is MPEG4.
It is not limited to encoding / decoding based on

【０１６３】また、本実施の形態では、download_param
eters()として、図２０に示した情報を、ＧＯＶのヘッ
ダに含めるようにしたが、download_parameters()とし
てＧＯＶのヘッダに含める情報は、図２０に示したもの
に限定されるものではない。Further, in the present embodiment, download_param
The information shown in FIG. 20 as the eters () is included in the GOV header, but the information included in the GOV header as the download_parameters () is not limited to that shown in FIG.

【０１６４】さらに、図１２および図２１に示したエン
コーダ、並びに図１６に示したデコーダは、ハードウェ
アによって実現することも可能であるし、また、コンピ
ュータなどにプログラムを実行させることによって実現
することも可能である。Further, the encoder shown in FIGS. 12 and 21 and the decoder shown in FIG. 16 can be realized by hardware, or can be realized by causing a computer or the like to execute a program. Is also possible.

【０１６５】また、ＭＰＥＧ４では、スケーラビリティ
を実現するための階層符号化が可能であるが、本発明
は、階層符号化を行うか否かにかかわらず適用可能であ
る。Also, with MPEG4, hierarchical coding for achieving scalability is possible, but the present invention is applicable regardless of whether hierarchical coding is performed or not.

【０１６６】[0166]

【発明の効果】第１の本発明によれば、効率的な符号化
が可能となる。 According to the first aspect of the present invention, efficient coding is possible.

【０１６７】第２の本発明によれば、符号化ビットスト
リームの途中からでも、正常な復号を行うことが可能と
なる。 According to the second aspect of the present invention , normal decoding can be performed even in the middle of the encoded bit stream.

【０１６８】[0168]

[Brief description of drawings]

【図１】MPEG４規格FCDで規定されている符号化ビット
ストリームの構成を示す図である。FIG. 1 is a diagram showing a configuration of a coded bitstream defined by MPEG4 standard FCD.

【図２】MPEG４規格FCDで規定されているＶＳのシンタ
クスを示す図である。[Fig. 2] Fig. 2 is a diagram illustrating the syntax of VS defined by the MPEG4 standard FCD.

【図３】MPEG４規格FCDで規定されているＶＩＳＯのシ
ンタクスを示す図である。[Fig. 3] Fig. 3 is a diagram illustrating the syntax of VISO defined by the MPEG4 standard FCD.

【図４】MPEG４規格FCDで規定されているＶＯのシンタ
クスを示す図である。FIG. 4 is a diagram showing the syntax of VO defined by the MPEG4 standard FCD.

【図５】MPEG４規格FCDで規定されているＶＯＬのシン
タクスを示す図である。[Fig. 5] Fig. 5 is a diagram illustrating the syntax of a VOL defined by the MPEG4 standard FCD.

【図６】MPEG４規格FCDで規定されているＶＯＬのシン
タクスを示す図である。FIG. 6 is a diagram showing the syntax of a VOL defined by the MPEG4 standard FCD.

【図７】MPEG４規格FCDで規定されているＶＯＬのシン
タクスを示す図である。[Fig. 7] Fig. 7 is a diagram illustrating the syntax of a VOL defined by the MPEG4 standard FCD.

【図８】MPEG４規格FCDで規定されているＧＯＶのシン
タクスを示す図である。[Fig. 8] Fig. 8 is a diagram illustrating the syntax of GOV defined by the MPEG4 standard FCD.

【図９】MPEG４規格FCDで規定されているＶＯＰのシン
タクスを示す図である。[Fig. 9] Fig. 9 is a diagram illustrating the syntax of a VOP defined by the MPEG4 standard FCD.

【図１０】MPEG４規格FCDで規定されているＶＯＰのシ
ンタクスを示す図である。[Fig. 10] Fig. 10 is a diagram illustrating the syntax of a VOP defined by the MPEG4 standard FCD.

【図１１】MPEG４規格FCDで規定されているＶＯＰのシ
ンタクスを示す図である。FIG. 11 is a diagram showing the syntax of VOP defined by the MPEG4 standard FCD.

【図１２】本発明を適用したエンコーダの一実施の形態
の構成例を示すブロック図である。FIG. 12 is a block diagram illustrating a configuration example of an embodiment of an encoder to which the present invention has been applied.

【図１３】図１２の画素置換器１５の処理を説明するた
めのフローチャートである。FIG. 13 is a flowchart for explaining the process of the pixel replacer 15 in FIG.

【図１４】図１２のＶＬＣ器６が出力するＧＯＶのシン
タクスを示す図である。14 is a diagram showing the syntax of GOV output by the VLC unit 6 of FIG.

【図１５】図１２のＶＬＣ器６の処理を説明するための
フローチャートである。FIG. 15 is a flowchart for explaining the processing of the VLC device 6 of FIG.

【図１６】本発明を適用したデコーダの一実施の形態の
構成例を示すブロック図である。FIG. 16 is a block diagram showing a configuration example of an embodiment of a decoder to which the present invention has been applied.

【図１７】図１６のＩＶＬＣ器２２の処理を説明するた
めのフローチャートである。FIG. 17 is a flowchart for explaining the processing of the IVLC device 22 of FIG.

【図１８】図２１のＶＬＣ器６が出力するＧＯＶのシン
タクスを示す図である。FIG. 18 is a diagram showing the syntax of GOV output by the VLC unit 6 of FIG. 21.

【図１９】load_data_typeを説明するための図である。FIG. 19 is a diagram for explaining load_data_type.

【図２０】図１８のdownload_parameters()のシンタク
スを示す図である。[Fig. 20] Fig. 20 is a diagram illustrating the syntax of download_parameters () in Fig. 18.

【図２１】本発明を適用したエンコーダの他の実施の形
態の構成例を示すブロック図である。FIG. 21 is a block diagram showing a configuration example of another embodiment of an encoder to which the present invention has been applied.

【図２２】図２１のパーサ１７の処理を説明するための
フローチャートである。22 is a flowchart for explaining the process of the parser 17 in FIG.

【図２３】図１６のＩＶＬＣ器２２の処理を説明するた
めのフローチャートである。23 is a flow chart for explaining the processing of the IVLC device 22 of FIG.

【図２４】従来のエンコーダの一例の構成を示すブロッ
ク図である。FIG. 24 is a block diagram showing a configuration of an example of a conventional encoder.

【図２５】従来のデコーダの一例の構成を示すブロック
図である。FIG. 25 is a block diagram showing a configuration of an example of a conventional decoder.

[Explanation of symbols]

１フレームメモリ（受信手段），２動きベクトル
検出器，３演算器，４ＤＣＴ器，５量子化
器，６ＶＬＣ器（符号化手段），７バッファ，
８逆量子化器，９ＩＤＣＴ器，１０演算
器，１１フレームメモリ，１２動き補償器，
１３キー信号符号化器，１４キー信号復号器，
１５画素置換器，１６バッファ，１７パーサ
（選択手段），２１バッファ（受信手段），２２
ＩＶＬＣ器（復号手段），２３逆量子化器，２４
ＩＤＣＴ器，２５演算器，２６フレームメモ
リ，２７動き補償器，２８画素置換器，２９
キー信号復号器，２０１記録媒体，２０２伝
送媒体1 frame memory (reception means), 2 motion vector detectors, 3 arithmetic units, 4 DCT units, 5 quantizers, 6 VLC units (encoding means), 7 buffers,
8 inverse quantizer, 9 IDCT device, 10 arithmetic unit, 11 frame memory, 12 motion compensator,
13 key signal encoder, 14 key signal decoder,
15 pixel replacer, 16 buffer, 17 parser (selecting means), 21 buffer (receiving means), 22
IVLC device (decoding means), 23 inverse quantizer, 24
IDCT device, 25 arithmetic unit, 26 frame memory, 27 motion compensator, 28 pixel replacer, 29
Key signal decoder, 201 recording medium, 202 transmission medium

フロントページの続き (56)参考文献特開平８−125966（ＪＰ，Ａ) インダーフェース，，日本，1992 年，Ｖｏｌ．18 Ｎｏ．８，ｐ．124− 146 コンピュータサイエンス誌ｂｉｔｖｏｌ．29 ｎｏ．６共立出版社（1997．６．１），日本，ｐ．86−96 (58)調査した分野(Int.Cl.⁷，ＤＢ名) H04N 5/91 - 5/95 H04N 7/24 - 7/68 Continuation of the front page (56) References JP-A-8-125966 (JP, A) Underface, Japan, 1992, Vol. 18 No. 8, p. 124-146 Computer Science Magazine bit vol. 29 no. 6 Kyoritsu Publishing Co. (1997.6.1), Japan, p. 86-96 (58) Fields investigated (Int.Cl. ⁷ , DB name) H04N ^5/ 91-5/95 H04N ^7/ 24-7/68

Claims

(57) [Claims]

1. An image encoding apparatus for encoding an image and outputting an encoded bitstream having a hierarchical structure composed of a plurality of layers, wherein header information of an upper layer of the encoded bitstream is stored in a storage unit. Storage means, and arrangement means for reading header information of the higher layer according to an identification flag arranged in the header of the lower layer of the encoded bitstream from the storage unit and arranging it in the header of the lower layer. , by the arrangement means, and output means for header information of the upper layer which is read from the storage unit to output the encoded bit stream which is disposed in the header of the lower layer, the identification flag is at least, Header information of the upper layer
No information is transmitted, all header information of the upper layer is transmitted.
Of the header information of the upper layer
An image coding apparatus characterized in that it defines three types of transmitting only predetermined header information .

To wherein said header information, picture coding according to claim 1, characterized in that it includes a parameter representing the process parameters or inverse quantization, indicating whether to use overlapping motion compensation apparatus.

3. The image coding apparatus according to claim 1, wherein the storage unit stores the latest header information of the upper layer in the storage unit.

4. The image coding apparatus according to claim 1, further comprising a changing unit that changes the header information of the upper layer stored in the storage unit.

5. An image encoding method for an image encoding apparatus, which encodes an image and outputs an encoded bitstream having a hierarchical structure composed of a plurality of layers, wherein header information of an upper layer of the encoded bitstream is provided. A storage step of storing in the storage unit, read the header information of the upper layer according to the identification flag arranged in the header of the lower layer of the encoded bitstream from the storage unit, in the header of the lower layer seen including a placement step of placing, and an output step of header information of said placement step of processing the upper read from the storage unit in a hierarchical outputs the coded bit stream arranged in a header of the lower layer , The identification flag is at least the header information of the upper layer.
No information is transmitted, all header information of the upper layer is transmitted.
Of the header information of the upper layer
An image coding method characterized in that it defines three types of transmitting only predetermined header information .

6. The header information of the upper layer of the encoded bitstream read from the storage unit according to the identification flag arranged in the header of the lower layer of the encoded bitstream of the hierarchical structure is the lower layer. An image decoding apparatus for decoding the coded bitstream arranged in the header of the extracting unit, which extracts header information of the upper layer included in the header of the lower layer based on the identification flag; Decoding means for decoding the encoded bitstream based on the header information extracted by the means , wherein the identification flag is at least the header information of the upper layer.
No information is transmitted, all header information of the upper layer is transmitted.
Of the header information of the upper layer
An image decoding device characterized in that it defines three types of transmitting only predetermined header information .

To wherein said header information, the image decoding apparatus according to claim 6, characterized in that it includes a parameter representing the process parameters or inverse quantization, indicating whether to use overlapping motion compensation .

8. The header information of the upper layer of the coded bitstream, which is read from the storage unit and corresponds to the identification flag arranged in the header of the lower layer of the coded bitstream of the hierarchical structure, is the lower layer. In the image decoding method of the image decoding device for decoding the coded bitstream arranged in the header, the extracting step of extracting header information of the higher layer included in the header of the lower layer based on the identification flag. If, on the basis of the header information extracted by the processing of the extracting step, seen including a decoding step of decoding the coded bit stream, the identification flag is at least, the header information of the upper layer
No information is transmitted, all header information of the upper layer is transmitted.
Of the header information of the upper layer
An image decoding method characterized by defining three types of transmitting only predetermined header information .