JP2021052429A

JP2021052429A - Moving picture prediction decoding method

Info

Publication number: JP2021052429A
Application number: JP2020213666A
Authority: JP
Inventors: ブン　チュンセン; Chunsen Bun; チュンセンブン; 順也瀧上; Junya Takigami; 暁藤林; Akira Fujibayashi; タン　ティオ　ケン; Tio Ken Tang; ティオケンタン
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2019-12-19
Filing date: 2020-12-23
Publication date: 2021-04-01
Anticipated expiration: 2032-06-28
Also published as: JP6967653B2

Abstract

To contribute to improved prediction performance by making a decodable picture available as a reference picture for subsequent pictures. SOLUTION: A moving picture prediction decoding method includes: an input step of inputting a bitstream including compressed image data that has, for each picture, NAL unit type information identifying the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; a restoration step of decoding the compressed image data on the basis of the picture type and restoring it to a reproduced image; and an output step of outputting the reproduced image. In the restoration step, when decoding is started from a random access picture, the method decodes non-RAS leading pictures and skips decoding of RAS leading pictures. SELECTED DRAWING: Figure 2

Description

The present invention relates to a moving picture prediction decoding method, and more particularly to a moving picture prediction decoding method involving inter-picture prediction that is effective for random access.

Compression coding technology is used to transmit and store moving picture data efficiently. For moving pictures, the MPEG-1 to MPEG-4 and H.261 to H.264 schemes are widely used.

In these coding schemes, the picture to be coded is divided into a plurality of blocks, and encoding and decoding are performed block by block. The following predictive coding methods are used to improve coding efficiency. In intra-picture predictive coding, a prediction signal is generated from an adjacent, already reproduced image signal (a signal restored from previously compressed image data) in the same picture as the target block, and the difference signal obtained by subtracting the prediction signal from the signal of the target block is encoded. In inter-picture predictive coding, an already reproduced image signal in a picture different from that of the target block is referenced, the displacement of the signal is searched for, the prediction signal is generated by compensating for that motion, and the difference signal obtained by subtracting the prediction signal from the signal of the target block is encoded. An already reproduced image that is referenced for motion search and compensation is called a reference image.

In bidirectional inter-picture prediction, not only pictures that precede the target picture in output order but also future pictures that are output after the target picture may be referenced (such a future picture must, however, be encoded before the target picture and reproduced in advance). Averaging the prediction signals obtained from the past picture and the future picture is effective for predicting the signal of an object that was hidden and newly appears, and at the same time reduces the noise contained in both prediction signals.
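The averaging step in bidirectional prediction can be illustrated with a minimal sketch. The function name `bi_predict`, the nested-list block representation, and the round-half-up integer rounding are assumptions made for illustration; the text above only states that the two prediction signals are averaged.

```python
def bi_predict(past_block, future_block):
    """Average two prediction blocks sample by sample.

    Both blocks are lists of rows of integer samples with the same shape.
    The "+ 1" before the integer division rounds halves upward; this
    rounding rule is an assumption of the sketch.
    """
    return [
        [(p + f + 1) // 2 for p, f in zip(past_row, future_row)]
        for past_row, future_row in zip(past_block, future_block)
    ]


# Prediction blocks taken from a past and a future reference image.
past = [[100, 102], [98, 101]]
future = [[104, 100], [100, 99]]
prediction = bi_predict(past, future)
print(prediction)
```

Because each output sample is the mean of two independently noisy predictions, uncorrelated noise in the two references partly cancels, which is the noise-reduction effect described above.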

Furthermore, in H.264 inter-picture predictive coding, the prediction signal for the target block is selected by referring to a plurality of reference images that were previously encoded and then reproduced, and choosing, through motion search, the image signal with the smallest error as the optimum prediction signal. The difference between the pixel signal of the target block and this optimum prediction signal is computed, subjected to a discrete cosine transform, quantized, and then entropy-coded. At the same time, information indicating from which reference image, and from which region of it, the optimum prediction signal for the target block is obtained (called the reference index and the motion vector, respectively) is also encoded. In H.264, four or five reproduced images are stored as reference images in a frame memory or decoded picture buffer.

Although inter-picture predictive coding achieves efficient compression by exploiting the correlation between pictures, the dependency between pictures must be cut so that, for example, a video program can be viewed from the middle when a television channel is switched. A point in a compressed moving picture bitstream at which there is no dependency between pictures is called a random access point. Besides channel switching, random access points are also needed when editing a moving picture or splicing together the compressed data of different moving pictures. In the prior art, a "clean random access point" is provided as a random access point. A clean random access point is identified by a NAL unit type of clean random access picture (hereinafter "CRA picture"). A single bitstream may contain a plurality of CRA pictures, and a moving picture prediction decoding device may start decoding from any clean random access point.

For the purposes of this disclosure, the picture types of pictures associated with a CRA picture are defined as follows (see FIG. 10).
a) Past picture: a picture that is decoded before the CRA picture and output before the CRA picture.
b) Lagging picture: a picture that is decoded before the CRA picture but output after the CRA picture.
c) Leading picture: a picture that is decoded after the CRA picture but output before the CRA picture.
d) Normal picture: a picture that is decoded after the CRA picture and output after the CRA picture.
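The four definitions above depend only on where a picture sits relative to the CRA picture in decoding order and in output order. The following is a minimal sketch of that classification; the function name `classify_picture` and the use of integer positions for the two orders are assumptions made for illustration.

```python
def classify_picture(decode_pos, output_pos, cra_decode_pos, cra_output_pos):
    """Classify a picture relative to a CRA picture.

    Positions are integer indices in decoding order and in output order.
    Returns "cra", "past", "lagging", "leading" or "normal".
    """
    if decode_pos == cra_decode_pos:
        return "cra"
    decoded_before = decode_pos < cra_decode_pos
    output_before = output_pos < cra_output_pos
    if decoded_before and output_before:
        return "past"
    if decoded_before:
        return "lagging"   # decoded earlier, output later
    if output_before:
        return "leading"   # decoded later, output earlier
    return "normal"


# CRA picture at decoding position 5 and output position 8.
print(classify_picture(6, 7, 5, 8))
```

In this example the picture is decoded after the CRA picture (6 > 5) but output before it (7 < 8), so it is a leading picture.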

Since a CRA picture is defined as a picture that uses only intra-picture prediction, it contains all the information needed for decoding and can be decoded correctly without reference to any other picture. Every normal picture following a CRA picture is defined as being prohibited from using inter-picture prediction from a past picture, a lagging picture, or a leading picture.

When a bitstream is decoded starting from a CRA picture, the CRA picture and the normal pictures are guaranteed to be decoded correctly, without inter-picture prediction errors. Leading pictures, however, are decoded after the CRA picture with no guarantee that they will be decoded correctly without inter-picture prediction errors. That is, some leading pictures are decoded correctly, while others may not be.

Here, "decoded correctly" means that the decoded picture is identical to the picture that would be obtained if decoding had started from the beginning of the bitstream rather than from the CRA picture. When decoding starts from a CRA picture, pictures that precede that CRA picture in decoding order (for example, lagging pictures) are not decoded and are not present in the decoded picture buffer. Consequently, a subsequent picture that directly or indirectly uses, for inter-picture prediction, a picture that precedes the CRA picture in decoding order may contain decoding errors.
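The direct-or-indirect dependency described above can be traced with a short sketch. It assumes a simplified model in which each picture lists the identifiers of the pictures it references; the function name `correctly_decodable`, the dictionary layout, and the picture identifiers are hypothetical.

```python
def correctly_decodable(pictures, start_index):
    """Return the ids of pictures that decode correctly from start_index.

    `pictures` is a list in decoding order of dicts with keys "id" and
    "refs" (ids of the pictures used for inter-picture prediction).
    A picture is correct only if every reference is itself correct, so a
    missing picture taints everything that depends on it, directly or not.
    """
    correct = set()
    for picture in pictures[start_index:]:
        if all(ref in correct for ref in picture["refs"]):
            correct.add(picture["id"])
    return correct


stream = [
    {"id": "past", "refs": []},
    {"id": "lagging", "refs": ["past"]},
    {"id": "cra", "refs": []},
    {"id": "lead_a", "refs": ["lagging", "cra"]},  # needs a picture before the CRA
    {"id": "lead_b", "refs": ["cra"]},             # needs only the CRA
    {"id": "lead_c", "refs": ["lead_a"]},          # indirectly needs "lagging"
    {"id": "normal", "refs": ["cra"]},
]
print(sorted(correctly_decodable(stream, start_index=2)))
```

Starting at the CRA picture, `lead_a` fails because `lagging` was never decoded, and `lead_c` fails because it depends on `lead_a`; `lead_b` and `normal` remain decodable.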

Benjamin Bross et al., "High efficiency video coding (HEVC) text specification draft 7", Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 9th Meeting: Geneva, CH, 27 April - 7 May 2012

As described above, when a moving picture prediction decoding device starts decoding from a random access point, there may be pictures that cannot be decoded correctly, and such pictures cannot be used for subsequent decoding. On the other hand, there are also pictures that can be decoded correctly, and those pictures can be used for subsequent decoding. In the prior art, there is no way to identify which of the pictures that follow the random access point in decoding order should be discarded, so all leading pictures are treated as pictures that cannot be decoded correctly and are discarded. However, some of these pictures are in fact decodable and can contribute to improving the prediction performance of subsequent pictures, so discarding all leading pictures as incorrectly decodable is not a desirable approach.

Accordingly, an object of the present invention is to make decodable pictures identifiable, so that they can be used as reference pictures for subsequent pictures, thereby contributing to improved prediction performance.

A moving picture prediction decoding method according to the present invention is a moving picture prediction decoding method executed by a moving picture prediction decoding device, comprising: an input step of inputting a bitstream including compressed image data for a plurality of pictures constituting a moving picture, the compressed image data having, for each picture, NAL unit type information identifying the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; a restoration step of decoding the compressed image data on the basis of the picture type and restoring it to a reproduced image; and an output step of outputting the reproduced image, wherein
1) a random access picture is the first picture of the bitstream in decoding order when a decoding process, which may be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and that cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and that can be decoded;
the reference picture set of a non-RAS leading picture, which includes the reference pictures used for inter-picture prediction of that non-RAS leading picture, includes neither a RAS leading picture nor a picture that precedes the associated random access picture in decoding order; when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order; and, in the restoration step, when the decoding process is started from a random access picture, non-RAS leading pictures are decoded and decoding of RAS leading pictures is skipped.
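The restoration step described above can be sketched as a filter over the pictures in decoding order. This is an illustration under stated assumptions rather than the claimed implementation: the string labels stand in for NAL unit type values and are hypothetical, the "associated" random access picture is taken to be the most recent one in decoding order, and RAS leading pictures are skipped only when that picture is the one at which decoding started, following definition 2).

```python
# Hypothetical labels standing in for NAL unit type values (not the real codes).
RANDOM_ACCESS = "RAP"
RAS_LEADING = "RAS_LEADING"
NON_RAS_LEADING = "NON_RAS_LEADING"
OTHER = "OTHER"


def select_pictures_to_decode(stream):
    """Return the ids of the pictures that are decoded, in decoding order.

    `stream` is a list of (picture_id, picture_type) pairs in decoding order
    whose first entry is the random access picture where decoding starts.
    """
    if not stream or stream[0][1] != RANDOM_ACCESS:
        raise ValueError("decoding must start from a random access picture")

    decoded = []
    associated_rap_index = 0  # index of the most recent random access picture
    for index, (picture_id, picture_type) in enumerate(stream):
        if picture_type == RANDOM_ACCESS:
            associated_rap_index = index
        # Assumption: only RAS leading pictures tied to the starting random
        # access picture lack their references and are skipped.
        if picture_type == RAS_LEADING and associated_rap_index == 0:
            continue
        decoded.append(picture_id)
    return decoded


stream = [
    ("cra0", RANDOM_ACCESS),
    ("lead1", RAS_LEADING),
    ("lead2", NON_RAS_LEADING),
    ("norm1", OTHER),
    ("cra1", RANDOM_ACCESS),
    ("lead3", RAS_LEADING),
    ("norm2", OTHER),
]
print(select_pictures_to_decode(stream))
```

Here `lead1` is skipped because its references precede the starting picture `cra0`, while `lead2` is kept and stays available as a reference picture for later pictures.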

According to the present invention, making decodable pictures identifiable allows them to be used as reference pictures for subsequent pictures, which contributes to improved prediction performance. More specifically, when decoding is started from a CRA picture at the head of the bitstream, the moving picture prediction decoding device can detect (by using a label or by checking against the reference picture set) whether a given picture can be decoded correctly. The device can therefore select and discard only the pictures that cannot be decoded, instead of discarding all leading pictures, so that decodable pictures remain available as reference pictures for subsequent pictures and contribute to improved prediction performance.

FIG. 1 is a block diagram showing a moving picture prediction encoding device according to an embodiment of the present invention.
FIG. 2 is a block diagram showing a moving picture prediction decoding device according to an embodiment of the present invention.
FIG. 3 is a diagram for explaining syntax elements according to an embodiment of the present invention.
FIG. 4 is a flowchart showing a moving picture prediction encoding method according to an embodiment of the present invention.
FIG. 5 is a flowchart showing a moving picture prediction decoding method according to an embodiment of the present invention.
FIG. 6 is a diagram showing the hardware configuration of a computer for executing a program recorded on a recording medium.
FIG. 7 is an overview of a computer for executing a program stored on a recording medium.
FIG. 8 is a block diagram showing a configuration example of a moving picture prediction encoding program.
FIG. 9 is a block diagram showing a configuration example of a moving picture prediction decoding program.
FIG. 10 is a diagram for explaining the background of the present invention.

Embodiments of the present invention are described below with reference to FIGS. 1 to 9.

[Moving picture prediction encoding device]
FIG. 1 is a functional block diagram showing the configuration of a moving picture prediction encoding device 100 according to the present embodiment. As shown in FIG. 1, the moving picture prediction encoding device 100 comprises, as its functional configuration, an input terminal 101, a block divider 102, a prediction signal generator 103, a frame memory 104, a subtractor 105, a transformer 106, a quantizer 107, an inverse quantizer 108, an inverse transformer 109, an adder 110, an entropy encoder 111, an output terminal 112, an input terminal 113, and a frame memory manager (also called a buffer manager) 114. The operation of each functional block is explained in the description of the operation of the moving picture prediction encoding device 100 below. The transformer 106 and the quantizer 107 correspond to the encoding means, and the inverse quantizer 108, the inverse transformer 109, and the adder 110 correspond to the decoding means.

The operation of the moving picture prediction encoding device 100 configured as above is described below. A moving picture signal consisting of a plurality of images is input to the input terminal 101. The image to be encoded is divided into a plurality of regions by the block divider 102. In the present embodiment, the image is divided into blocks of 8 × 8 pixels, but it may be divided into blocks of other sizes or shapes. Next, a prediction signal is generated for the region to be encoded (hereinafter the "target block"). In the present embodiment, two prediction methods are used: inter-picture prediction and intra-picture prediction.

In inter-picture prediction, a reproduced image that was previously encoded and then restored is used as a reference image, and motion information that gives the prediction signal with the smallest error relative to the target block is determined from this reference image. This process is called motion detection. Depending on the case, the target block may be subdivided, and the inter-picture prediction method may be determined for each of the resulting sub-regions. In that case, the subdivision method that is most efficient for the target block as a whole, together with the motion information for each sub-region, is selected from among the various subdivision methods. In this embodiment of the invention, this is done in the prediction signal generator 103, to which the target block is input via line L102 and the reference image via line L104. A plurality of previously encoded and restored images are used as reference images. The details are the same as in any of the conventional MPEG-2, MPEG-4, and H.264 methods. The motion information and sub-region subdivision method determined in this way are sent via line L112 to the entropy encoder 111, encoded, and sent out from the output terminal 112. Information indicating from which of the plurality of reference images the prediction signal is obtained (the reference index) is also sent to the entropy encoder 111 via line L112. In this embodiment of the invention, four or five reproduced images are stored in the frame memory 104 and used as reference images. The prediction signal generator 103 obtains a reference image signal from the frame memory 104 on the basis of the subdivision method and the reference image and motion information corresponding to each sub-region, and generates the prediction signal. The inter-picture prediction signal generated in this way is sent to the subtractor 105 via line L103.

In intra-picture prediction, an intra-picture prediction signal is generated from already reproduced pixel values that are spatially adjacent to the target block. Specifically, the prediction signal generator 103 obtains already reproduced pixel signals in the same picture from the frame memory 104 and extrapolates them to generate the intra-picture prediction signal. Information on the extrapolation method is sent via line L112 to the entropy encoder 111, encoded, and sent out from the output terminal 112. The intra-picture prediction signal generated in this way is sent to the subtractor 105. The method of generating the intra-picture prediction signal in the prediction signal generator 103 is the same as the conventional H.264 method. Of the inter-picture and intra-picture prediction signals obtained as described above, the one with the smaller error is selected and sent to the subtractor 105.

For the first image, there is no preceding image, so all target blocks are processed by intra-picture prediction. In addition, to allow for television channel switching, the target blocks of images periodically designated as random access points are processed by intra-picture prediction. Such an image is called an intra frame, and in H.264 it is called an IDR picture.

The subtractor 105 subtracts the prediction signal (via line L103) from the signal of the target block (via line L102) to generate a residual signal. This residual signal is subjected to a discrete cosine transform in the transformer 106, and each of the resulting coefficients is quantized by the quantizer 107. Finally, the entropy encoder 111 encodes the quantized transform coefficients, which are sent out from the output terminal 112 together with the information on the prediction method.

To perform intra-picture or inter-picture prediction for subsequent target blocks, the compressed signal of the target block is inversely processed and restored. That is, the quantized transform coefficients are inverse-quantized by the inverse quantizer 108 and then subjected to an inverse discrete cosine transform in the inverse transformer 109 to restore the residual signal. The adder 110 adds the restored residual signal to the prediction signal sent via line L103 to reproduce the signal of the target block, which is stored in the frame memory 104. Although the transformer 106 and the inverse transformer 109 are used in the present embodiment, other transform processes may be used in their place. In some cases, the transformer 106 and the inverse transformer 109 may be omitted.
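The subtract, quantize, inverse-quantize, and add path can be illustrated with a small sketch. For brevity it takes the variant mentioned above in which the transformer and inverse transformer are omitted, so the residual is quantized directly; the function names, the uniform quantization step, and the sample values are assumptions made for illustration.

```python
def encode_block(block, prediction, step):
    """Subtract the prediction and quantize the residual (no transform)."""
    return [
        [round((b - p) / step) for b, p in zip(block_row, pred_row)]
        for block_row, pred_row in zip(block, prediction)
    ]


def reconstruct_block(levels, prediction, step):
    """Inverse-quantize the levels and add the prediction back."""
    return [
        [p + level * step for p, level in zip(pred_row, level_row)]
        for pred_row, level_row in zip(prediction, levels)
    ]


block = [[53, 59], [47, 55]]        # signal of the target block
prediction = [[50, 50], [50, 50]]   # prediction signal
step = 4                            # quantization step (assumed)

levels = encode_block(block, prediction, step)
reproduced = reconstruct_block(levels, prediction, step)
print(levels, reproduced)
```

The reproduced block differs slightly from the input because quantization is lossy. The encoder stores this reproduced block, not the original, so that its reference images match what the decoder will reconstruct.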

The frame memory 104 is finite and cannot store every reproduced image. Only the reproduced images used for encoding subsequent images are stored in the frame memory 104. The frame memory 104 is controlled by the frame memory manager 114. Via line L115, the frame memory manager 114 performs control so that, of the N reproduced images in the frame memory 104 (N = 4 here, but N may be any predetermined integer), unneeded ones (for example, the oldest) are deleted and the most recent reproduced image to be used as a reference image can be stored. The output order information of each image and the type with which the image is encoded (intra-picture predictive coding, inter-picture predictive coding, or bidirectional predictive coding) are input from the input terminal 113, and the reference index is input via line L112; the frame memory manager 114 operates on the basis of this information.
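The simplest policy mentioned above, deleting the oldest of N stored images, can be sketched with a bounded queue. The class name `FrameMemory` and its methods are hypothetical, and the sketch covers only the "oldest first" example; a real manager would also use the output order, coding type, and reference index information.

```python
from collections import deque


class FrameMemory:
    """Holds at most `capacity` reproduced images, dropping the oldest."""

    def __init__(self, capacity=4):
        # A deque with maxlen discards the oldest entry on overflow.
        self._frames = deque(maxlen=capacity)

    def store(self, frame_id):
        """Store the most recent reproduced image."""
        self._frames.append(frame_id)

    def reference_ids(self):
        """Return the stored images, oldest first."""
        return list(self._frames)


memory = FrameMemory(capacity=4)
for frame_id in range(6):
    memory.store(frame_id)
print(memory.reference_ids())
```

After six images have been stored with N = 4, images 0 and 1 have been deleted and images 2 to 5 remain available as reference images.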

At the same time, the output order information of each image and the NAL unit type information described later are sent, as necessary, via line L114 to the entropy encoder 111, encoded, and output together with the compressed image data. The output order information accompanies each image; it may indicate the order of the images, the time at which the image is output, or the output reference time (temporal reference) of the image. In the present embodiment, the value of the output order information is converted directly into a binary code. The operation of the frame memory manager 114 of the present embodiment is described later.

[Moving picture prediction decoding device]
Next, the moving picture prediction decoding device according to the present invention is described. FIG. 2 is a functional block diagram showing the configuration of a moving picture prediction decoding device 200 according to the present embodiment. As shown in FIG. 2, the moving picture prediction decoding device 200 comprises, as its functional configuration, an input terminal 201, a data analyzer 202, an inverse quantizer 203, an inverse transformer 204, an adder 205, a prediction signal generator 208, a frame memory 207, an output terminal 206, a frame memory manager 209, a controller 210, and a switch 211. The operation of each functional block is explained in the description of the operation of the moving picture prediction decoding device 200 below. The inverse quantizer 203 and the inverse transformer 204 correspond to the decoding means. The decoding means is not limited to the inverse quantizer 203 and the inverse transformer 204; other means may be used. The decoding means may also consist of the inverse quantizer 203 alone, without the inverse transformer 204.

The operation of the moving picture prediction decoding device 200 is described below. Compressed data that has been compression-encoded by the moving picture prediction encoding device 100 using the method described above is input from the input terminal 201. This compressed data contains a residual signal obtained by predicting and encoding each target block, where the target blocks result from dividing an image into a plurality of blocks, together with information related to the generation of the prediction signal. For inter-picture prediction, the information related to the generation of the prediction signal includes information on block division (block size), motion information, the reference index described above, and information on the NAL unit type; for intra-picture prediction, it includes information on the method of extrapolation from surrounding already reproduced pixels.

The data analyzer 202 extracts from the compressed data the residual signal of the target block, the information related to the generation of the prediction signal, the quantization parameter, and the output order information of the image. The residual signal of the target block is inverse-quantized by the inverse quantizer 203 on the basis of the quantization parameter (via lines L202 and L211). The result is subjected to an inverse discrete cosine transform in the inverse transformer 204.

Next, the information related to the generation of the prediction signal is sent via line L206b to the prediction signal generator 208. On the basis of this information, the prediction signal generator 208 accesses the frame memory 207, obtains a reference signal from among the plurality of reference images (via line L207), and generates the prediction signal. The prediction signal is sent via line L208 to the adder 205 and added to the restored residual signal to reproduce the target block signal, which is output from the output terminal 206 via line L205 and at the same time stored in the frame memory 207.

The frame memory 207 stores reproduced images used for decoding and reproducing subsequent images. The frame memory manager 209 controls the frame memory 207 via line L209a. The frame memory 207 is controlled so that an unnecessary image (for example, the oldest one) is erased from the N stored reproduced images (N = 4 here, but N may be any predetermined integer), making room to store the most recent reproduced image to be used as a reference image.
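A minimal Python sketch of this buffer management is shown below. The class name `FrameMemory` and its methods are hypothetical, and evicting the oldest image is only the example policy mentioned in the text; a real manager could select the unnecessary image differently.

```python
from collections import deque

class FrameMemory:
    """Hypothetical model of a frame memory holding at most N reproduced images."""

    def __init__(self, capacity=4):
        self.capacity = capacity
        self._frames = deque()

    def store(self, frame_id):
        """Store a new image; return the evicted image id, or None."""
        evicted = None
        if len(self._frames) == self.capacity:
            # Example policy from the text: erase the oldest image.
            evicted = self._frames.popleft()
        self._frames.append(frame_id)
        return evicted

    def contents(self):
        return list(self._frames)

memory = FrameMemory(capacity=4)
evicted = [memory.store(i) for i in range(6)]
```

After six images are stored with N = 4, the two oldest images have been erased and the four most recent remain.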

The controller 210 operates based on the output order information of the target image and the information on the image coding type and the NAL unit type, all of which are sent via line L206a. Alternatively, it operates based on the reference index sent via line L206a and the information, sent via line L209b, on the frames stored in the frame memory. The operation of the controller 210 according to the present invention is described later.

The switch 211 is controlled by the controller 210 via line L210 and operates so as to skip the decoding of specific frames depending on conditions. The operation of the switch 211 according to the present invention is described later.

FIG. 3 shows the syntax elements 500 of the bitstream. The bitstream syntax elements 500 consist of the plurality of syntax elements needed to decode each picture (510, 520, and so on). Within the syntax of a picture, attention is paid to the following three elements.
1) Network adaptation layer unit type (NUT), or NAL unit type (530)
2) Picture output count (POC) (540)
3) Reference picture set (RPS) (550)

1) The NUT contains information on the picture type. Note that other means of signaling the picture type may also be used in the present invention. In this embodiment, each picture is labeled with one of three NAL unit types: RAS, CRA, and non-RAS, described below.

A picture labeled as a RAS (random access skip) picture is skipped and not output when decoding starts from the CRA picture associated with that RAS picture. On the other hand, if that CRA picture is not the first picture of the bitstream (or decoding does not start from that CRA picture), the moving image prediction decoding device 200 treats the RAS picture as a non-RAS picture, and decodes and outputs it according to the output instruction of the picture.

A picture labeled as a CRA (clean random access) picture indicates that, when decoding of the bitstream starts from that CRA picture, the pictures associated with the CRA picture other than RAS pictures can be decoded without error.

A picture labeled as a non-RAS picture is decoded by the moving image prediction decoding device 200 and output according to the output instruction of the picture. Unless otherwise noted, a CRA picture is regarded as a non-RAS picture.

2) The POC contains information on the order in which pictures are output.

3) The RPS contains information on the reference pictures used for inter-screen prediction of the current picture. A reference picture in the reproduced image buffer (DPB) that is not present in the RPS cannot be used as a reference picture for predictive decoding by the current picture or by any other picture.

In this embodiment, the RPS has the following features to guarantee that all non-RAS pictures can be decoded correctly when decoding of the bitstream starts from a CRA picture.
Feature 1: Regarding the RPS used by a leading picture, if at least one reference picture is a RAS picture or precedes the associated CRA picture in decoding order, that leading picture shall be a RAS picture.
Feature 2: Every reference picture in the RPS used by a non-RAS picture shall be a non-RAS picture and shall be decoded at or after the associated CRA picture.
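Feature 2 can be read as a conformance check on a bitstream. The Python sketch below lists the non-RAS pictures whose RPS breaks it. The `Picture` fields (`decode_index`, `cra_index`, and an RPS given as decoding-order indices) are hypothetical simplifications chosen for illustration and do not reflect the actual syntax.

```python
from dataclasses import dataclass, field

@dataclass
class Picture:
    decode_index: int          # position in decoding order (hypothetical field)
    nal_type: str              # "CRA", "RAS" or "non-RAS"
    cra_index: int             # decode_index of the associated CRA picture
    rps: list = field(default_factory=list)  # decode indices of reference pictures

def feature2_violations(pictures):
    """Return decode indices of non-RAS pictures that violate Feature 2."""
    by_index = {p.decode_index: p for p in pictures}
    violations = []
    for pic in pictures:
        if pic.nal_type == "RAS":
            continue  # Feature 2 constrains only non-RAS pictures (CRA included)
        for ref in pic.rps:
            # A reference must be non-RAS and decoded at or after the CRA picture.
            if by_index[ref].nal_type == "RAS" or ref < pic.cra_index:
                violations.append(pic.decode_index)
                break
    return violations

stream = [
    Picture(0, "CRA", 0),
    Picture(1, "non-RAS", 0, [0]),
    Picture(2, "CRA", 2),
    Picture(3, "RAS", 2, [1, 2]),      # may reference a picture before its CRA
    Picture(4, "non-RAS", 2, [2]),
    Picture(5, "non-RAS", 2, [3]),     # references a RAS picture
    Picture(6, "non-RAS", 2, [1]),     # references a picture before its CRA
]
violations = feature2_violations(stream)
```

In this example, pictures 5 and 6 would not be permitted as non-RAS pictures.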

In this embodiment, normal pictures are treated as non-RAS pictures, so pictures that do not satisfy Features 1 and 2 are not permitted in the bitstream. However, the present invention is not limited to the leading pictures mentioned in Feature 1 and can be applied equally to all pictures. Regarding Feature 2, the present invention can also be applied when the reference pictures are restricted to leading pictures only.

[Characteristic operation of the moving image prediction coding device 100]
With reference to FIG. 4, the operation of the moving image prediction coding device 100 for generating a bitstream with the above-described features, which is the key point of the present invention, is described. To enable random access, the moving image prediction coding device 100 inserts CRA pictures at regular intervals. All pictures that follow an inserted CRA picture in coding order are associated with that CRA picture until the next CRA picture is inserted, and are encoded according to the following steps.

In step 620, it is determined whether one or more of the reference pictures included in the RPS of the picture is a RAS picture. If so (YES), the process proceeds to step 650; otherwise (NO), it proceeds to step 630.

In step 630, it is determined whether one or more of the reference pictures included in the RPS of the picture precedes, in coding order, the CRA picture associated with the picture. If so (YES), the process proceeds to step 650; otherwise (NO), it proceeds to step 640.

In step 650, the POC of the picture is compared with the POC of the CRA picture associated with the picture to check whether the picture is a leading picture. If the POC of the picture is smaller than that of the associated CRA picture, the picture is determined to be a leading picture (YES), and the process proceeds to step 670. Otherwise, the picture is determined not to be a leading picture (NO). Because the determinations in steps 620 and 630 should be YES only for leading pictures, a result that the picture is not a leading picture is abnormal; the process therefore proceeds to step 660, outputs an error message, and proceeds to step 680. After the error message is output in step 660, the process of FIG. 4 may instead be terminated abnormally.

In step 670, the picture is encoded as a RAS picture, and information indicating that the picture is a RAS picture (NAL unit type: RAS) is encoded. The process then proceeds to step 680.

In step 640, the picture is encoded as a non-RAS picture, and information indicating that the picture is a non-RAS picture (NAL unit type: non-RAS) is encoded. The process then proceeds to step 680. Here, unless otherwise noted, CRA pictures are included among non-RAS pictures.

In steps 640 and 670, the information on whether the picture is a RAS picture or a non-RAS picture does not necessarily have to be encoded. Instead of encoding that information, whether the picture is a RAS picture or a non-RAS picture may be determined by comparing the reference picture list of each picture with the pictures stored in the frame memory 104.

In step 680, the moving image prediction coding device 100 determines whether there are more pictures to be encoded. If there are (YES), the process returns to step 620 and repeats; if not (NO), the process of FIG. 4 ends.

The series of processes described above corresponds to the processing of the moving image prediction coding device 100 of FIG. 1 as a whole; in particular, the determinations in steps 620, 630, and 650 are made by the frame memory manager 114.
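The labeling decision of steps 620 to 670 can be sketched in Python as follows. The `Ref` record, the function `label_picture`, and the use of an exception for step 660 are hypothetical choices for illustration; only the order of the checks follows the flow described above.

```python
from dataclasses import dataclass

@dataclass
class Ref:
    is_ras: bool        # whether the reference picture is labeled RAS
    decode_index: int   # position of the reference picture in coding order

def label_picture(poc, rps, cra_poc, cra_decode_index):
    """Return the NAL unit type label for one picture after its CRA picture."""
    # Step 620: does the RPS contain at least one RAS picture?
    needs_ras = any(ref.is_ras for ref in rps)
    # Step 630: does any reference precede the associated CRA in coding order?
    if not needs_ras:
        needs_ras = any(ref.decode_index < cra_decode_index for ref in rps)
    if not needs_ras:
        return "non-RAS"                      # step 640
    # Step 650: a leading picture has a smaller POC than its CRA picture.
    if poc < cra_poc:
        return "RAS"                          # step 670
    # Step 660: abnormal result, reported here as an exception (assumption).
    raise ValueError("non-leading picture depends on undecodable references")

leading_old_ref = label_picture(6, [Ref(False, 3)], cra_poc=8, cra_decode_index=5)
leading_cra_ref = label_picture(7, [Ref(False, 5)], cra_poc=8, cra_decode_index=5)
```

A leading picture that refers only to the CRA picture keeps the non-RAS label, so it remains decodable after random access.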

[Characteristic operation of the moving image prediction decoding device 200]
The moving image prediction decoding device 200 according to this embodiment operates differently depending on whether the decoding process starts from a CRA picture as the first picture of the bitstream or the first picture of the bitstream is not a CRA picture. This decoding process returns to the normal decoding process when the next CRA picture is decoded.

With reference to FIG. 5, the operation of the moving image prediction decoding device 200 for decoding a bitstream with the above-described features, which is the key point of the present invention, is described.

In step 710, the moving image prediction decoding device 200 determines, based on the NAL unit type, whether the first picture of the bitstream (that is, the first picture at which decoding of the bitstream starts) is a CRA picture. If the first picture is not a CRA picture (NO), the process proceeds to step 780, and the moving image prediction decoding device 200 decodes each picture as usual. That is, in step 780, RAS pictures are regarded as non-RAS pictures and are decoded and output as usual according to the instructions in the picture. If, on the other hand, the first picture of the bitstream is a CRA picture (YES), the process proceeds to step 720.

The processing of steps 720 to 770 is executed repeatedly for every picture until just before decoding of the next CRA picture starts, after which the process returns to normal decoding in step 780. Steps 720 to 770 are described below.

In step 720, at the start of decoding the picture, the moving image prediction decoding device 200 determines whether the picture can be decoded correctly. Because the bitstream in this embodiment has Features 1 and 2 described above, the device can make this determination using at least one of the following two methods. The first method is to look at the NAL unit type label of the picture: if the picture is labeled as a RAS picture, it can be determined that the picture cannot be decoded correctly. The second method is for the device to compare the reference pictures present in the DPB with the reference picture list in the RPS of the picture: if any reference picture in the RPS of the picture is not present in the DPB, it can be determined that the picture cannot be decoded correctly. If, using at least one of these methods, the device determines that the picture can be decoded correctly (YES), the process proceeds to step 730; if it determines that the picture cannot be decoded correctly (NO), the process proceeds to step 750.
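The two methods of step 720 can be sketched in Python as follows. Both function names are hypothetical, and identifying pictures by their POC values in the RPS and DPB is an assumption made to keep the example small.

```python
def decodable_by_label(nal_type):
    """First method: a picture labeled RAS is treated as not decodable."""
    return nal_type != "RAS"

def decodable_by_dpb(rps, dpb):
    """Second method: every reference picture in the RPS must be in the DPB."""
    return all(ref in dpb for ref in rps)

dpb = {8}                                    # only the CRA picture (POC 8) is decoded
ok_label = decodable_by_label("non-RAS")
ok_dpb = decodable_by_dpb([8], dpb)          # refers only to the CRA picture
missing_ref = decodable_by_dpb([4, 8], dpb)  # POC 4 precedes the access point
```

The second method needs no label in the bitstream, at the cost of a lookup in the DPB for each reference picture.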

In step 730, the moving image prediction decoding device 200 decodes and outputs the picture according to the instructions in the picture. This also applies to CRA pictures. The process then proceeds to step 740.

In step 750, the POC of the picture is compared with the POC of the CRA picture associated with the picture to determine whether the picture is a leading picture. If the POC of the picture is smaller than that of the associated CRA picture (YES), the picture is determined to be a leading picture, and the process proceeds to step 770, described below. Otherwise (NO), the picture is not a leading picture, which indicates an error; the process therefore proceeds to step 760, where the moving image prediction decoding device 200 outputs an error message, and then proceeds to step 740. After the error message is output in step 760, the process of FIG. 5 may instead be terminated abnormally. As noted above, the determination in step 750 is needed only when Feature 1 is limited to leading pictures.

In step 770, the moving image prediction decoding device 200 skips decoding of the picture and performs the necessary housekeeping. An example of such housekeeping is labeling the skipped picture to indicate that it is "unavailable as a reference picture and not output". The process then proceeds to step 740.

In step 740, the moving image prediction decoding device 200 determines whether the next picture to be decoded is a CRA picture. If it is not (NO), the process returns to step 720 and repeats. If it is (YES), the decoding process according to the present invention (the random access decoding process) is no longer needed from that CRA picture onward, so the process proceeds to step 780 and moves to the normal decoding process (in which all pictures are decoded and output according to the output instruction information).

The series of processes described above corresponds to the processing of the moving image prediction decoding device 200 of FIG. 2 as a whole; in particular, the determinations in steps 720 and 750 and the control in steps 730 and 770 are performed by the controller 210.
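The flow of steps 710 to 780 can be summarized by the following Python sketch, which only records the action taken for each picture. The `Picture` record and the function `plan_decoding` are hypothetical, and step 720 is implemented with the label-based method only.

```python
from dataclasses import dataclass

@dataclass
class Picture:
    poc: int        # output order
    nal_type: str   # "CRA", "RAS" or "non-RAS"

def plan_decoding(pictures):
    """Return (poc, action) pairs for pictures given in decoding order."""
    actions = []
    if not pictures:
        return actions
    # Step 710: is the first picture a CRA picture?
    random_access = pictures[0].nal_type == "CRA"
    cra_poc = pictures[0].poc
    for i, pic in enumerate(pictures):
        # Step 740: the next CRA picture ends the random access process.
        if random_access and i > 0 and pic.nal_type == "CRA":
            random_access = False
        if not random_access:
            actions.append((pic.poc, "decode"))   # step 780
        elif pic.nal_type != "RAS":               # step 720, label method
            actions.append((pic.poc, "decode"))   # step 730
        elif pic.poc < cra_poc:                   # step 750
            actions.append((pic.poc, "skip"))     # step 770
        else:
            actions.append((pic.poc, "error"))    # step 760
    return actions

stream = [Picture(8, "CRA"), Picture(6, "RAS"), Picture(7, "non-RAS"),
          Picture(9, "non-RAS"), Picture(16, "CRA"), Picture(14, "RAS")]
plan = plan_decoding(stream)
```

In this example the RAS picture with POC 6 is skipped, while the RAS picture with POC 14 is decoded because it belongs to the second CRA picture.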

According to this embodiment, when decoding starts from a CRA picture at the head of the bitstream, the moving image prediction decoding device 200 can detect (by using the label or by comparison with the reference picture set) whether a given picture can be decoded correctly. Instead of discarding all leading pictures, the device can therefore select and discard only the pictures that cannot be decoded, so that decodable pictures remain available as reference pictures for subsequent pictures, which contributes to improved prediction performance.

When the NAL unit type RAS is assigned to pictures, the moving image prediction coding device 100 produces both pictures that can be decoded correctly and pictures that cannot. The moving image prediction decoding device 200, for its part, does not output pictures that cannot be decoded correctly. This creates temporal gaps in the output pictures and may affect the frame output rate. For some systems, gaps in the output are undesirable. In this embodiment, the moving image prediction coding device 100 indicates to the moving image prediction decoding device 200, as additional information, whether these RAS pictures cause a gap, using a flag in the CRA picture header or in the video usability syntax (VUI). On receiving this flag, the moving image prediction decoding device 200 can choose whether to output leading pictures that can be decoded correctly but have gaps between them.

As an alternative, a further restriction may be placed on the bitstream so that no gap arises from the RAS pictures that precede the CRA picture in output order. That is, the bitstream may be arranged so that it is output continuously, with no gap at the RAS pictures.

As yet another alternative, the moving image prediction decoding device 200 may itself decide to decode non-RAS leading pictures without outputting them, independently of any other additional information from the moving image prediction coding device 100 and of the output instruction information of the pictures.

In this embodiment, the NAL unit type labels (RAS, CRA, non-RAS) are detected and used by the moving image prediction decoding device 200. However, they may also be detected and used by other devices in the network (for example, servers or suitable network elements) to discard RAS pictures when decoding is to start from a random access point. This saves network bandwidth.
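A network element that drops RAS pictures might look like the following Python sketch. The function `forward_units`, the `(nal_type, payload)` tuple format, and the choice to drop only the RAS units associated with the first CRA picture are assumptions for illustration; the description states only that RAS pictures may be discarded when decoding starts from a random access point.

```python
def forward_units(units, starts_at_cra):
    """Filter (nal_type, payload) units that a receiver will decode.

    Assumes that, when starts_at_cra is True, the first CRA unit in `units`
    is the random access point at which the receiver begins decoding.
    """
    if not starts_at_cra:
        return list(units)
    forwarded = []
    cra_seen = 0
    for nal_type, payload in units:
        if nal_type == "CRA":
            cra_seen += 1
        # Drop only RAS units tied to the first CRA picture (assumed policy);
        # RAS units after a later CRA picture are decodable and are kept.
        if nal_type == "RAS" and cra_seen == 1:
            continue
        forwarded.append((nal_type, payload))
    return forwarded

units = [("CRA", b"a"), ("RAS", b"b"), ("non-RAS", b"c"),
         ("CRA", b"d"), ("RAS", b"e")]
kept = [nal_type for nal_type, _ in forward_units(units, starts_at_cra=True)]
```

Dropping the units before transmission avoids sending data that the receiver would skip anyway.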

In this embodiment, a bitstream may contain many CRA pictures, and each CRA picture has RAS pictures associated with it. When a second CRA picture follows a first CRA picture in decoding order, the RPS of the second CRA picture is not permitted to include a reference picture that is decoded before the first CRA picture. This guarantees that the RAS pictures of the second CRA picture are decoded when the first CRA picture is the first picture of the bitstream.

[Moving image prediction coding program and moving image prediction decoding program]
The invention relating to the moving image prediction coding device 100 can be regarded as an invention relating to a moving image prediction coding program that causes a computer to function as the moving image prediction coding device 100. Similarly, the invention relating to the moving image prediction decoding device 200 can be regarded as an invention relating to a moving image prediction decoding program that causes a computer to function as the moving image prediction decoding device 200.

The moving image prediction coding program and the moving image prediction decoding program are provided, for example, stored on a recording medium. Examples of the recording medium include flexible disks, CD-ROMs, USB memories, DVDs, and semiconductor memories.

FIG. 8 shows the modules of the moving image prediction coding program that causes a computer to function as the moving image prediction coding device 100. As shown in FIG. 8, the moving image prediction coding program P100 includes an input module P101, a coding module P102, a restoration module P103, an image storage module P104, and a control module P105.

FIG. 9 shows the modules of the moving image prediction decoding program that causes a computer to function as the moving image prediction decoding device 200. As shown in FIG. 9, the moving image prediction decoding program P200 includes an input module P201, a restoration module P202, an image storage module P203, and a control module P204.

The moving image prediction coding program P100 and the moving image prediction decoding program P200 configured as described above can be stored on the recording medium 10 shown in FIGS. 6 and 7 and are executed by the computer 30 described below.

FIG. 6 shows the hardware configuration of a computer for executing a program recorded on a recording medium, and FIG. 7 is an external view of a computer for executing a program stored on a recording medium. The term computer here also covers DVD players, set-top boxes, mobile phones, and similar devices that have a CPU and perform processing and control by software.

As shown in FIG. 6, the computer 30 includes a reading device 12 such as a flexible disk drive, CD-ROM drive, or DVD drive; a working memory (RAM) 14 in which the operating system is resident; a memory 16 that stores the program stored on the recording medium 10; a display device 18 such as a display; a mouse 20 and a keyboard 22 as input devices; a communication device 24 for transmitting and receiving data and the like; and a CPU 26 that controls execution of the program. When the recording medium 10 is inserted into the reading device 12, the computer 30 can access, through the reading device 12, the moving image prediction coding program stored on the recording medium 10, and by executing that program it can operate as the moving image prediction coding device according to the present invention. Similarly, when the recording medium 10 is inserted into the reading device 12, the computer 30 can access, through the reading device 12, the moving image prediction decoding program stored on the recording medium 10, and by executing that program it can operate as the moving image prediction decoding device according to the present invention.

As shown in FIG. 7, the moving image prediction coding program or the moving image prediction decoding program may be provided over a network as a computer data signal 40 superimposed on a carrier wave. In this case, the computer 30 stores the moving image prediction coding program or the moving image prediction decoding program received by the communication device 24 in the memory 16 and can then execute it.

The moving image prediction coding device according to the present invention comprises: input means for inputting a plurality of images constituting a moving image; coding means for encoding the images by either in-screen prediction or inter-screen prediction to generate compressed image data that includes images serving as random access points, and for encoding output order information of each image and information on the picture type of each image; restoration means for decoding the compressed image data and restoring it to reproduced images; image storage means for storing one or more of the reproduced images as reference images used for encoding subsequent images; and control means for determining the picture type and controlling the image storage means based on the determination result, wherein the control means labels each image as one of the following three types:
1) CRA picture: a picture characterized in that the type 2 pictures following the CRA picture can be decoded correctly when decoding starts from that CRA picture;
2) Type 1 picture: a picture that is decoded after the CRA picture associated with it and output before that CRA picture, and that has, for inter-screen prediction, a list of reference pictures including at least one reference picture labeled as a type 1 picture or at least one reference picture that precedes the associated CRA picture in decoding order;
3) Type 2 picture: a picture that has a list of reference pictures for inter-screen prediction, in which every reference picture in the list is labeled as a type 2 picture or a CRA picture and is decoded at or after the CRA picture associated with the picture.

本発明に係る動画像予測符号化方法は、動画像予測符号化装置により実行される動画像予測符号化方法であって、動画像を構成する複数の画像を入力する入力ステップと、前記画像を画面内予測もしくは画面間予測のいずれかの方法で符号化することで、ランダム・アクセス・ポイントとなる画像を含む圧縮画像データを生成するとともに、各画像の出力順番情報と各画像のピクチャ・タイプに関する情報とを符号化する符号化ステップと、前記圧縮画像データを復号し、再生画像に復元する復元ステップと、前記再生画像を、後続の画像を符号化するために用いられる参照画像として１つ以上格納する画像格納ステップと、前記ピクチャ・タイプを判定し、判定結果に基づいて前記画像格納ステップを制御する制御ステップと、を具備し、前記制御ステップにおいて、前記動画像予測符号化装置は、
１）ＣＲＡピクチャ：ＣＲＡピクチャに続くタイプ２ピクチャが、前記ＣＲＡピクチャから復号が開始された場合に正しく復号することができる、ことを特徴とするピクチャ、
２）タイプ１ピクチャ：当該ピクチャに関連付けされたＣＲＡピクチャの後に復号され、前記ＣＲＡピクチャよりも先に出力されるピクチャであって、画面間予測を行うために、タイプ１ピクチャとしてラベル付けされた少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つピクチャ、
３）タイプ２ピクチャ：画面間予測を行うために、参照ピクチャのリストを持つピクチャであって、当該参照ピクチャのリストにある全ての参照ピクチャが、タイプ２ピクチャあるいはＣＲＡピクチャとしてラベル付けされ、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つのタイプのうちひとつとして、前記画像をラベル付けする、ことを特徴とする。 The moving image prediction coding method according to the present invention is a moving image prediction coding method executed by a moving image prediction coding device, comprising: an input step of inputting a plurality of images constituting a moving image; a coding step of encoding the images by either in-screen prediction or inter-screen prediction to generate compressed image data including an image serving as a random access point, and of encoding output order information of each image and information on the picture type of each image; a restoration step of decoding the compressed image data and restoring it to a reproduced image; an image storage step of storing one or more of the reproduced images as reference images used for encoding a subsequent image; and a control step of determining the picture type and controlling the image storage step based on the determination result, wherein, in the control step, the moving image prediction coding device handles the following picture types:
1) CRA picture: a picture such that a type 2 picture following the CRA picture can be decoded correctly when decoding is started from that CRA picture;
2) Type 1 picture: a picture that is decoded after its associated CRA picture and output before that CRA picture, and that has, for inter-screen prediction, a list of reference pictures including at least one reference picture labeled as a type 1 picture or at least one reference picture that precedes the associated CRA picture in decoding order;
3) Type 2 picture: a picture that has a list of reference pictures for inter-screen prediction, in which every reference picture in the list is labeled as a type 2 picture or a CRA picture and is decoded at or after the CRA picture associated with the picture;
and labels each image as one of these three types.

本発明に係る動画像予測符号化プログラムは、コンピュータを、動画像を構成する複数の画像を入力する入力手段と、前記画像を画面内予測もしくは画面間予測のいずれかの方法で符号化することで、ランダム・アクセス・ポイントとなる画像を含む圧縮画像データを生成するとともに、各画像の出力順番情報と各画像のピクチャ・タイプに関する情報とを符号化する符号化手段と、前記圧縮画像データを復号し、再生画像に復元する復元手段と、前記再生画像を、後続の画像を符号化するために用いられる参照画像として１つ以上格納する画像格納手段と、前記ピクチャ・タイプを判定し、判定結果に基づいて前記画像格納手段を制御する制御手段、として機能させ、前記制御手段は、
１）ＣＲＡピクチャ：ＣＲＡピクチャに続くタイプ２ピクチャが、前記ＣＲＡピクチャから復号が開始された場合に正しく復号することができる、ことを特徴とするピクチャ、
２）タイプ１ピクチャ：当該ピクチャに関連付けされたＣＲＡピクチャの後に復号され、前記ＣＲＡピクチャよりも先に出力されるピクチャであって、画面間予測を行うために、タイプ１ピクチャとしてラベル付けされた少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つピクチャ、
３）タイプ２ピクチャ：画面間予測を行うために、参照ピクチャのリストを持つピクチャであって、当該参照ピクチャのリストにある全ての参照ピクチャが、タイプ２ピクチャあるいはＣＲＡピクチャとしてラベル付けされ、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つのタイプのうちひとつとして、前記画像をラベル付けする、ことを特徴とする。 The moving image prediction coding program according to the present invention causes a computer to function as: an input means for inputting a plurality of images constituting a moving image; a coding means for encoding the images by either in-screen prediction or inter-screen prediction to generate compressed image data including an image serving as a random access point, and for encoding output order information of each image and information on the picture type of each image; a restoration means for decoding the compressed image data and restoring it to a reproduced image; an image storage means for storing one or more of the reproduced images as reference images used for encoding a subsequent image; and a control means for determining the picture type and controlling the image storage means based on the determination result, wherein the control means handles the following picture types:
1) CRA picture: a picture such that a type 2 picture following the CRA picture can be decoded correctly when decoding is started from that CRA picture;
2) Type 1 picture: a picture that is decoded after its associated CRA picture and output before that CRA picture, and that has, for inter-screen prediction, a list of reference pictures including at least one reference picture labeled as a type 1 picture or at least one reference picture that precedes the associated CRA picture in decoding order;
3) Type 2 picture: a picture that has a list of reference pictures for inter-screen prediction, in which every reference picture in the list is labeled as a type 2 picture or a CRA picture and is decoded at or after the CRA picture associated with the picture;
and labels each image as one of these three types.

本発明に係る動画像予測復号装置は、動画像を構成する複数の画像に対し画面内予測もしくは画面間予測のいずれかによる符号化が行われることで得られた、ランダム・アクセス画像を含む圧縮画像データと、各画像の出力順番および各画像のピクチャ・タイプを示す符号化データとを入力する入力手段と、前記圧縮画像データと前記符号化データとを復号し、再生画像と出力順番情報とピクチャ・タイプ情報とに復元する復元手段と、前記再生画像を、後続の画像を復号するために用いられる参照画像として１つ以上格納する画像格納手段と、前記ピクチャ・タイプに基づいて、前記復元手段を制御する制御手段と、を具備し、前記ピクチャ・タイプは、
１）ＣＲＡピクチャ：ＣＲＡピクチャに続くタイプ２ピクチャが、当該ＣＲＡピクチャから復号が開始された場合に正しく復号することができる、ことを特徴とするピクチャ、
２）タイプ１ピクチャ：当該ピクチャに関連付けされたＣＲＡピクチャの後に復号され、前記ＣＲＡピクチャよりも先に出力されるピクチャであって、画面間予測を行うために、タイプ１ピクチャとしてラベル付けされた少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つピクチャ、
３）タイプ２ピクチャ：画面間予測を行うために、参照ピクチャのリストを持つピクチャであって、当該参照ピクチャのリストにある全ての参照ピクチャが、タイプ２ピクチャあるいはＣＲＡピクチャとしてラベル付けされ、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つのタイプのうちひとつとして各画像にラベル付けされており、前記復元手段は、符号化データの復号がＣＲＡピクチャから開始された場合、タイプ２ピクチャとしてラベル付けされた画像を復号し、タイプ１ピクチャとしてラベル付けされた画像の復号をスキップする、といった復号処理を、次のＣＲＡピクチャの処理直前までの期間、続ける、ことを特徴とする。 The moving image prediction decoding device according to the present invention comprises: an input means for inputting compressed image data including a random access image, obtained by encoding a plurality of images constituting a moving image by either in-screen prediction or inter-screen prediction, and encoded data indicating the output order of each image and the picture type of each image; a restoration means for decoding the compressed image data and the encoded data and restoring them to a reproduced image, output order information, and picture type information; an image storage means for storing one or more of the reproduced images as reference images used for decoding a subsequent image; and a control means for controlling the restoration means based on the picture type, wherein the picture types are:
1) CRA picture: a picture such that a type 2 picture following the CRA picture can be decoded correctly when decoding is started from that CRA picture;
2) Type 1 picture: a picture that is decoded after its associated CRA picture and output before that CRA picture, and that has, for inter-screen prediction, a list of reference pictures including at least one reference picture labeled as a type 1 picture or at least one reference picture that precedes the associated CRA picture in decoding order;
3) Type 2 picture: a picture that has a list of reference pictures for inter-screen prediction, in which every reference picture in the list is labeled as a type 2 picture or a CRA picture and is decoded at or after the CRA picture associated with the picture;
each image being labeled as one of these three types, and wherein, when decoding of the encoded data is started from a CRA picture, the restoration means decodes images labeled as type 2 pictures and skips decoding of images labeled as type 1 pictures, and continues this decoding process until immediately before the next CRA picture is processed.
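The decoder-side rule in this paragraph (decode type 2, skip type 1, until the next CRA picture) can be sketched as follows. This is a minimal sketch under assumptions: the function name `pictures_to_decode` and the string labels are hypothetical, the stream is a list of `(id, label)` pairs in decoding order, and actual picture reconstruction is left out.

```python
def pictures_to_decode(stream, start):
    """Return the ids of pictures that get decoded when decoding starts at
    stream[start], which must be a CRA picture.

    stream: list of (picture_id, label) in decoding order, where label is
    one of "CRA", "TYPE1", "TYPE2" (hypothetical label strings).
    """
    if stream[start][1] != "CRA":
        raise ValueError("decoding must start from a CRA picture")
    decoded = [stream[start][0]]
    skipping = True  # type 1 pictures are skipped only until the next CRA picture
    for pid, label in stream[start + 1:]:
        if label == "CRA":
            skipping = False  # from here on all reference pictures are available
        if skipping and label == "TYPE1":
            continue  # its reference pictures were never decoded, so skip it
        decoded.append(pid)
    return decoded


# Hypothetical stream in decoding order.
stream = [
    ("I0", "CRA"), ("P1", "TYPE2"),
    ("CRA8", "CRA"), ("B6", "TYPE1"), ("B5", "TYPE1"),
    ("B7", "TYPE2"), ("P12", "TYPE2"),
    ("CRA16", "CRA"), ("B14", "TYPE1"),
]
from_cra8 = pictures_to_decode(stream, 2)
from_start = pictures_to_decode(stream, 0)
```

Starting at `CRA8`, the sketch skips `B6` and `B5` but keeps `B7`, and it stops skipping once `CRA16` is reached, so `B14` is decoded even though it carries the type 1 label. The decision needs only the label, not an inspection of each reference list.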

本発明に係る動画像予測復号方法は、動画像予測復号装置により実行される動画像予測復号方法であって、動画像を構成する複数の画像に対し画面内予測もしくは画面間予測のいずれかによる符号化が行われることで得られた、ランダム・アクセス画像を含む圧縮画像データと、各画像の出力順番および各画像のピクチャ・タイプを示す符号化データとを入力する入力ステップと、前記圧縮画像データと前記符号化データとを復号し、再生画像と出力順番情報とピクチャ・タイプ情報とに復元する復元ステップと、前記再生画像を、後続の画像を復号するために用いられる参照画像として１つ以上格納する画像格納ステップと、前記ピクチャ・タイプに基づいて、前記復元ステップを制御する制御ステップと、を具備し、前記ピクチャ・タイプは、
１）ＣＲＡピクチャ：ＣＲＡピクチャに続くタイプ２ピクチャが、当該ＣＲＡピクチャから復号が開始された場合に正しく復号することができる、ことを特徴とするピクチャ、
２）タイプ１ピクチャ：当該ピクチャに関連付けされたＣＲＡピクチャの後に復号され、前記ＣＲＡピクチャよりも先に出力されるピクチャであって、画面間予測を行うために、タイプ１ピクチャとしてラベル付けされた少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つピクチャ、
３）タイプ２ピクチャ：画面間予測を行うために、参照ピクチャのリストを持つピクチャであって、当該参照ピクチャのリストにある全ての参照ピクチャが、タイプ２ピクチャあるいはＣＲＡピクチャとしてラベル付けされ、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つのタイプのうちひとつとして各画像にラベル付けされており、前記復元ステップにおいて、前記動画像予測復号装置は、符号化データの復号がＣＲＡピクチャから開始された場合、タイプ２ピクチャとしてラベル付けされた画像を復号し、タイプ１ピクチャとしてラベル付けされた画像の復号をスキップする、といった復号処理を、次のＣＲＡピクチャの処理直前までの期間、続ける、ことを特徴とする。 The moving image prediction decoding method according to the present invention is a moving image prediction decoding method executed by a moving image prediction decoding device, comprising: an input step of inputting compressed image data including a random access image, obtained by encoding a plurality of images constituting a moving image by either in-screen prediction or inter-screen prediction, and encoded data indicating the output order of each image and the picture type of each image; a restoration step of decoding the compressed image data and the encoded data and restoring them to a reproduced image, output order information, and picture type information; an image storage step of storing one or more of the reproduced images as reference images used for decoding a subsequent image; and a control step of controlling the restoration step based on the picture type, wherein the picture types are:
1) CRA picture: a picture such that a type 2 picture following the CRA picture can be decoded correctly when decoding is started from that CRA picture;
2) Type 1 picture: a picture that is decoded after its associated CRA picture and output before that CRA picture, and that has, for inter-screen prediction, a list of reference pictures including at least one reference picture labeled as a type 1 picture or at least one reference picture that precedes the associated CRA picture in decoding order;
3) Type 2 picture: a picture that has a list of reference pictures for inter-screen prediction, in which every reference picture in the list is labeled as a type 2 picture or a CRA picture and is decoded at or after the CRA picture associated with the picture;
each image being labeled as one of these three types, and wherein, in the restoration step, when decoding of the encoded data is started from a CRA picture, the moving image prediction decoding device decodes images labeled as type 2 pictures and skips decoding of images labeled as type 1 pictures, and continues this decoding process until immediately before the next CRA picture is processed.

本発明に係る動画像予測復号プログラムは、コンピュータを、動画像を構成する複数の画像に対し画面内予測もしくは画面間予測のいずれかによる符号化が行われることで得られた、ランダム・アクセス画像を含む圧縮画像データと、各画像の出力順番および各画像のピクチャ・タイプを示す符号化データとを入力する入力手段と、前記圧縮画像データと前記符号化データとを復号し、再生画像と出力順番情報とピクチャ・タイプ情報とに復元する復元手段と、前記再生画像を、後続の画像を復号するために用いられる参照画像として１つ以上格納する画像格納手段と、前記ピクチャ・タイプに基づいて、前記復元手段を制御する制御手段、として機能させ、前記ピクチャ・タイプは、
１）ＣＲＡピクチャ：ＣＲＡピクチャに続くタイプ２ピクチャが、当該ＣＲＡピクチャから復号が開始された場合に正しく復号することができる、ことを特徴とするピクチャ、
２）タイプ１ピクチャ：当該ピクチャに関連付けされたＣＲＡピクチャの後に復号され、前記ＣＲＡピクチャよりも先に出力されるピクチャであって、画面間予測を行うために、タイプ１ピクチャとしてラベル付けされた少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つピクチャ、
３）タイプ２ピクチャ：画面間予測を行うために、参照ピクチャのリストを持つピクチャであって、当該参照ピクチャのリストにある全ての参照ピクチャが、タイプ２ピクチャあるいはＣＲＡピクチャとしてラベル付けされ、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つのタイプのうちひとつとして各画像にラベル付けされており、前記復元手段は、符号化データの復号がＣＲＡピクチャから開始された場合、タイプ２ピクチャとしてラベル付けされた画像を復号し、タイプ１ピクチャとしてラベル付けされた画像の復号をスキップする、といった復号処理を、次のＣＲＡピクチャの処理直前までの期間、続ける、ことを特徴とする。 The moving image prediction decoding program according to the present invention causes a computer to function as: an input means for inputting compressed image data including a random access image, obtained by encoding a plurality of images constituting a moving image by either in-screen prediction or inter-screen prediction, and encoded data indicating the output order of each image and the picture type of each image; a restoration means for decoding the compressed image data and the encoded data and restoring them to a reproduced image, output order information, and picture type information; an image storage means for storing one or more of the reproduced images as reference images used for decoding a subsequent image; and a control means for controlling the restoration means based on the picture type, wherein the picture types are:
1) CRA picture: a picture such that a type 2 picture following the CRA picture can be decoded correctly when decoding is started from that CRA picture;
2) Type 1 picture: a picture that is decoded after its associated CRA picture and output before that CRA picture, and that has, for inter-screen prediction, a list of reference pictures including at least one reference picture labeled as a type 1 picture or at least one reference picture that precedes the associated CRA picture in decoding order;
3) Type 2 picture: a picture that has a list of reference pictures for inter-screen prediction, in which every reference picture in the list is labeled as a type 2 picture or a CRA picture and is decoded at or after the CRA picture associated with the picture;
each image being labeled as one of these three types, and wherein, when decoding of the encoded data is started from a CRA picture, the restoration means decodes images labeled as type 2 pictures and skips decoding of images labeled as type 1 pictures, and continues this decoding process until immediately before the next CRA picture is processed.

さて、本発明に係る動画像予測符号化装置、方法及びプログラム、並びに、動画像予測復号装置、方法及びプログラムは、以下の態様も採用することができる。 The moving image prediction coding device, method, and program, and the moving image prediction decoding device, method, and program according to the present invention can also adopt the following aspects.

本発明に係る動画像予測符号化装置は、動画像を構成する複数の画像を入力する入力手段と、前記画像を画面内予測もしくは画面間予測のいずれかの方法で符号化することで、ランダム・アクセス・ポイントとなる画像を含む圧縮画像データを生成するとともに、各画像の出力順番情報を符号化する符号化手段と、前記圧縮画像データを復号し、再生画像に復元する復元手段と、前記再生画像を、後続の画像を符号化するために用いられる参照画像として１つ以上格納する画像格納手段と、前記画像格納手段を制御する制御手段と、を具備し、前記制御手段は、
１）符号化データの復号が開始されるＣＲＡピクチャ、
２）当該ピクチャに関連付けされたＣＲＡピクチャよりも後に復号され、前記関連付けされたＣＲＡピクチャよりも先に出力され、前記復元手段による復号処理がスキップされ、前記画像格納手段に格納されず出力されないピクチャであって、画面間予測を行うために、前記復元手段による復号処理がスキップされる少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つ、当該ピクチャ、
３）前記復元手段により復号され、必要に応じて参照のために前記画像格納手段に格納されるピクチャであって、当該ピクチャは画面間予測を行うために参照ピクチャのリストを持ち、当該参照ピクチャのリストにある全ての参照ピクチャが、前記復元手段により復号され、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つに、前記画像を分類し制御する、ことを特徴とする。 The moving image prediction coding device according to the present invention comprises: an input means for inputting a plurality of images constituting a moving image; a coding means for encoding the images by either in-screen prediction or inter-screen prediction to generate compressed image data including an image serving as a random access point, and for encoding output order information of each image; a restoration means for decoding the compressed image data and restoring it to a reproduced image; an image storage means for storing one or more of the reproduced images as reference images used for encoding a subsequent image; and a control means for controlling the image storage means, wherein the control means distinguishes the following:
1) a CRA picture from which decoding of the encoded data is started;
2) a picture that is decoded after its associated CRA picture and output before the associated CRA picture, for which the decoding process by the restoration means is skipped, and which is neither stored in the image storage means nor output, the picture having, for inter-screen prediction, a list of reference pictures including at least one reference picture for which the decoding process by the restoration means is skipped or at least one reference picture that precedes the associated CRA picture in decoding order;
3) a picture that is decoded by the restoration means and stored in the image storage means for reference as needed, the picture having a list of reference pictures for inter-screen prediction, in which every reference picture in the list is decoded by the restoration means and is decoded at or after the CRA picture associated with the picture;
and classifies each image into one of these three categories and controls it accordingly.

本発明に係る動画像予測符号化方法は、動画像予測符号化装置により実行される動画像予測符号化方法であって、動画像を構成する複数の画像を入力する入力ステップと、前記画像を画面内予測もしくは画面間予測のいずれかの方法で符号化することで、ランダム・アクセス・ポイントとなる画像を含む圧縮画像データを生成するとともに、各画像の出力順番情報を符号化する符号化ステップと、前記圧縮画像データを復号し、再生画像に復元する復元ステップと、前記再生画像を、後続の画像を符号化するために用いられる参照画像として１つ以上格納する画像格納ステップと、前記画像格納ステップを制御する制御ステップと、を具備し、前記制御ステップにおいて、前記動画像予測符号化装置は、
１）符号化データの復号が開始されるＣＲＡピクチャ、
２）当該ピクチャに関連付けされたＣＲＡピクチャよりも後に復号され、前記関連付けされたＣＲＡピクチャよりも先に出力され、前記復元ステップによる復号処理がスキップされ、前記画像格納ステップにて格納されず出力されないピクチャであって、画面間予測を行うために、前記復元ステップによる復号処理がスキップされる少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つ、当該ピクチャ、
３）前記復元ステップにより復号され、必要に応じて参照のために前記画像格納ステップにて格納されるピクチャであって、当該ピクチャは画面間予測を行うために参照ピクチャのリストを持ち、当該参照ピクチャのリストにある全ての参照ピクチャが、前記復元ステップにより復号され、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つに、前記画像を分類し制御する、ことを特徴とする。 The moving image prediction coding method according to the present invention is a moving image prediction coding method executed by a moving image prediction coding device, comprising: an input step of inputting a plurality of images constituting a moving image; a coding step of encoding the images by either in-screen prediction or inter-screen prediction to generate compressed image data including an image serving as a random access point, and of encoding output order information of each image; a restoration step of decoding the compressed image data and restoring it to a reproduced image; an image storage step of storing one or more of the reproduced images as reference images used for encoding a subsequent image; and a control step of controlling the image storage step, wherein, in the control step, the moving image prediction coding device distinguishes the following:
1) a CRA picture from which decoding of the encoded data is started;
2) a picture that is decoded after its associated CRA picture and output before the associated CRA picture, for which the decoding process by the restoration step is skipped, and which is neither stored in the image storage step nor output, the picture having, for inter-screen prediction, a list of reference pictures including at least one reference picture for which the decoding process by the restoration step is skipped or at least one reference picture that precedes the associated CRA picture in decoding order;
3) a picture that is decoded by the restoration step and stored in the image storage step for reference as needed, the picture having a list of reference pictures for inter-screen prediction, in which every reference picture in the list is decoded by the restoration step and is decoded at or after the CRA picture associated with the picture;
and classifies each image into one of these three categories and controls it accordingly.

本発明に係る動画像予測符号化プログラムは、コンピュータを、動画像を構成する複数の画像を入力する入力手段と、前記画像を画面内予測もしくは画面間予測のいずれかの方法で符号化することで、ランダム・アクセス・ポイントとなる画像を含む圧縮画像データを生成するとともに、各画像の出力順番情報を符号化する符号化手段と、前記圧縮画像データを復号し、再生画像に復元する復元手段と、前記再生画像を、後続の画像を符号化するために用いられる参照画像として１つ以上格納する画像格納手段と、前記画像格納手段を制御する制御手段、として機能させ、前記制御手段は、
１）符号化データの復号が開始されるＣＲＡピクチャ、
２）当該ピクチャに関連付けされたＣＲＡピクチャよりも後に復号され、前記関連付けされたＣＲＡピクチャよりも先に出力され、前記復元手段による復号処理がスキップされ、前記画像格納手段に格納されず出力されないピクチャであって、画面間予測を行うために、前記復元手段による復号処理がスキップされる少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つ、当該ピクチャ、
３）前記復元手段により復号され、必要に応じて参照のために前記画像格納手段に格納されるピクチャであって、当該ピクチャは画面間予測を行うために参照ピクチャのリストを持ち、当該参照ピクチャのリストにある全ての参照ピクチャが、前記復元手段により復号され、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つに、前記画像を分類し制御する、ことを特徴とする。 The moving image prediction coding program according to the present invention causes a computer to function as: an input means for inputting a plurality of images constituting a moving image; a coding means for encoding the images by either in-screen prediction or inter-screen prediction to generate compressed image data including an image serving as a random access point, and for encoding output order information of each image; a restoration means for decoding the compressed image data and restoring it to a reproduced image; an image storage means for storing one or more of the reproduced images as reference images used for encoding a subsequent image; and a control means for controlling the image storage means, wherein the control means distinguishes the following:
1) a CRA picture from which decoding of the encoded data is started;
2) a picture that is decoded after its associated CRA picture and output before the associated CRA picture, for which the decoding process by the restoration means is skipped, and which is neither stored in the image storage means nor output, the picture having, for inter-screen prediction, a list of reference pictures including at least one reference picture for which the decoding process by the restoration means is skipped or at least one reference picture that precedes the associated CRA picture in decoding order;
3) a picture that is decoded by the restoration means and stored in the image storage means for reference as needed, the picture having a list of reference pictures for inter-screen prediction, in which every reference picture in the list is decoded by the restoration means and is decoded at or after the CRA picture associated with the picture;
and classifies each image into one of these three categories and controls it accordingly.

本発明に係る動画像予測復号装置は、動画像を構成する複数の画像に対し画面内予測もしくは画面間予測のいずれかによる符号化が行われることで得られた、ランダム・アクセス画像を含む圧縮画像データと、各画像の出力順番を示す符号化データとを入力する入力手段と、前記圧縮画像データと前記符号化データとを復号し、再生画像と出力順番情報とに復元する復元手段と、前記再生画像を、後続の画像を復号するために用いられる参照画像として１つ以上格納する画像格納手段と、前記復元手段を制御する制御手段と、を具備し、前記制御手段は、
１）符号化データの復号が開始されるＣＲＡピクチャ、
２）当該ピクチャに関連付けされたＣＲＡピクチャよりも後に復号され、前記関連付けされたＣＲＡピクチャよりも先に出力され、前記復元手段による復号処理がスキップされ、前記画像格納手段に格納されず出力されないピクチャであって、画面間予測を行うために、前記復元手段による復号処理がスキップされる少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つ、当該ピクチャ、
３）前記復元手段により復号され、必要に応じて参照のために前記画像格納手段に格納されるピクチャであって、当該ピクチャは画面間予測を行うために参照ピクチャのリストを持ち、当該参照ピクチャのリストにある全ての参照ピクチャが、前記復元手段により復号され、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つに、前記画像を分類して制御し、前記復元手段は、符号化データの復号が当該ピクチャに関連付けされたＣＲＡピクチャで開始された場合、当該ピクチャに関する参照ピクチャのリストにある全ての参照ピクチャが前記画像格納手段に格納されているか否かを判定し、参照ピクチャのリストにある全ての参照ピクチャが格納されていれば当該ピクチャを復号し、参照ピクチャのリストにある１つ以上の参照ピクチャが格納されていなければ当該ピクチャの復号をスキップする、といった復号処理を、次のＣＲＡピクチャの処理直前までの期間、続ける、ことを特徴とする。 The moving image prediction decoding device according to the present invention comprises: an input means for inputting compressed image data including a random access image, obtained by encoding a plurality of images constituting a moving image by either in-screen prediction or inter-screen prediction, and encoded data indicating the output order of each image; a restoration means for decoding the compressed image data and the encoded data and restoring them to a reproduced image and output order information; an image storage means for storing one or more of the reproduced images as reference images used for decoding a subsequent image; and a control means for controlling the restoration means, wherein the control means distinguishes the following:
1) a CRA picture from which decoding of the encoded data is started;
2) a picture that is decoded after its associated CRA picture and output before the associated CRA picture, for which the decoding process by the restoration means is skipped, and which is neither stored in the image storage means nor output, the picture having, for inter-screen prediction, a list of reference pictures including at least one reference picture for which the decoding process by the restoration means is skipped or at least one reference picture that precedes the associated CRA picture in decoding order;
3) a picture that is decoded by the restoration means and stored in the image storage means for reference as needed, the picture having a list of reference pictures for inter-screen prediction, in which every reference picture in the list is decoded by the restoration means and is decoded at or after the CRA picture associated with the picture;
and classifies each image into one of these three categories and controls it accordingly, and wherein, when decoding of the encoded data is started at the CRA picture associated with a picture, the restoration means determines whether all the reference pictures in the list of reference pictures for that picture are stored in the image storage means, decodes the picture if all the reference pictures in the list are stored, and skips decoding of the picture if one or more reference pictures in the list are not stored, and continues this decoding process until immediately before the next CRA picture is processed.
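In this aspect the decoder receives no picture-type label and instead checks whether every picture in the reference list is present in the image storage means. The following is a rough sketch of that check, with assumptions stated plainly: `decode_from_cra` is a hypothetical name, the stream is a list of `(id, is_cra, refs)` tuples in decoding order, and the frame memory is modeled as a set that never evicts pictures, which a real decoder would not do.

```python
def decode_from_cra(stream, start):
    """Return the ids of pictures decoded when decoding starts at stream[start].

    stream: list of (picture_id, is_cra, refs) in decoding order, where refs
    lists the ids of the pictures in the reference picture list.
    """
    frame_memory = set()   # stands in for the image storage means (no eviction)
    decoded = []
    checking = False
    for offset, (pid, is_cra, refs) in enumerate(stream[start:]):
        if offset == 0:
            if not is_cra:
                raise ValueError("decoding must start from a CRA picture")
            checking = True    # apply the availability check from here on
        elif is_cra:
            checking = False   # the check period ends at the next CRA picture
        if checking and not all(r in frame_memory for r in refs):
            continue           # a reference picture is missing: skip this picture
        frame_memory.add(pid)  # decoded pictures become available as references
        decoded.append(pid)
    return decoded


# Hypothetical stream in decoding order.
stream = [
    ("I0", True, []), ("P1", False, ["I0"]),
    ("CRA8", True, []), ("B6", False, ["P1", "CRA8"]),
    ("B5", False, ["B6", "CRA8"]), ("B7", False, ["CRA8"]),
    ("P12", False, ["CRA8"]),
    ("CRA16", True, []), ("B14", False, ["P12", "CRA16"]),
]
from_cra8 = decode_from_cra(stream, 2)
from_start = decode_from_cra(stream, 0)
```

Skipping propagates on its own here: `B5` is dropped because `B6` was never stored, without any explicit marking of `B6` as undecodable.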

本発明に係る動画像予測復号方法は、動画像予測復号装置により実行される動画像予測復号方法であって、動画像を構成する複数の画像に対し画面内予測もしくは画面間予測のいずれかによる符号化が行われることで得られた、ランダム・アクセス画像を含む圧縮画像データと、各画像の出力順番を示す符号化データとを入力する入力ステップと、前記圧縮画像データと前記符号化データとを復号し、再生画像と出力順番情報とに復元する復元ステップと、前記再生画像を、後続の画像を復号するために用いられる参照画像として１つ以上格納する画像格納ステップと、前記復元ステップを制御する制御ステップと、を具備し、前記制御ステップにおいて、前記動画像予測復号装置は、
１）符号化データの復号が開始されるＣＲＡピクチャ、
２）当該ピクチャに関連付けされたＣＲＡピクチャよりも後に復号され、前記関連付けされたＣＲＡピクチャよりも先に出力され、前記復元ステップによる復号処理がスキップされ、前記画像格納ステップにて格納されず出力されないピクチャであって、画面間予測を行うために、前記復元ステップによる復号処理がスキップされる少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つ、当該ピクチャ、
３）前記復元ステップにより復号され、必要に応じて参照のために前記画像格納ステップにて格納されるピクチャであって、当該ピクチャは画面間予測を行うために参照ピクチャのリストを持ち、当該参照ピクチャのリストにある全ての参照ピクチャが、前記復元ステップにより復号され、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つに、前記画像を分類して制御し、前記復元ステップにおいて、前記動画像予測復号装置は、符号化データの復号が当該ピクチャに関連付けされたＣＲＡピクチャで開始された場合、当該ピクチャに関する参照ピクチャのリストにある全ての参照ピクチャが前記画像格納ステップにて格納されているか否かを判定し、参照ピクチャのリストにある全ての参照ピクチャが格納されていれば当該ピクチャを復号し、参照ピクチャのリストにある１つ以上の参照ピクチャが格納されていなければ当該ピクチャの復号をスキップする、といった復号処理を、次のＣＲＡピクチャの処理直前までの期間、続ける、ことを特徴とする。 The moving image prediction decoding method according to the present invention is a moving image prediction decoding method executed by a moving image prediction decoding device, comprising: an input step of inputting compressed image data including a random access image, obtained by encoding a plurality of images constituting a moving image by either in-screen prediction or inter-screen prediction, and encoded data indicating the output order of each image; a restoration step of decoding the compressed image data and the encoded data and restoring them to a reproduced image and output order information; an image storage step of storing one or more of the reproduced images as reference images used for decoding a subsequent image; and a control step of controlling the restoration step, wherein, in the control step, the moving image prediction decoding device distinguishes the following:
1) a CRA picture from which decoding of the encoded data is started;
2) a picture that is decoded after its associated CRA picture and output before the associated CRA picture, for which the decoding process by the restoration step is skipped, and which is neither stored in the image storage step nor output, the picture having, for inter-screen prediction, a list of reference pictures including at least one reference picture for which the decoding process by the restoration step is skipped or at least one reference picture that precedes the associated CRA picture in decoding order;
3) a picture that is decoded by the restoration step and stored in the image storage step for reference as needed, the picture having a list of reference pictures for inter-screen prediction, in which every reference picture in the list is decoded by the restoration step and is decoded at or after the CRA picture associated with the picture;
and classifies each image into one of these three categories and controls it accordingly, and wherein, in the restoration step, when decoding of the encoded data is started at the CRA picture associated with a picture, the moving image prediction decoding device determines whether all the reference pictures in the list of reference pictures for that picture have been stored in the image storage step, decodes the picture if all the reference pictures in the list are stored, and skips decoding of the picture if one or more reference pictures in the list are not stored, and continues this decoding process until immediately before the next CRA picture is processed.

本発明に係る動画像予測復号プログラムは、コンピュータを、動画像を構成する複数の画像に対し画面内予測もしくは画面間予測のいずれかによる符号化が行われることで得られた、ランダム・アクセス画像を含む圧縮画像データと、各画像の出力順番を示す符号化データとを入力する入力手段と、前記圧縮画像データと前記符号化データとを復号し、再生画像と出力順番情報とに復元する復元手段と、前記再生画像を、後続の画像を復号するために用いられる参照画像として１つ以上格納する画像格納手段と、前記復元手段を制御する制御手段、として機能させ、前記制御手段は、
１）符号化データの復号が開始されるＣＲＡピクチャ、
２）当該ピクチャに関連付けされたＣＲＡピクチャよりも後に復号され、前記関連付けされたＣＲＡピクチャよりも先に出力され、前記復元手段による復号処理がスキップされ、前記画像格納手段に格納されず出力されないピクチャであって、画面間予測を行うために、前記復元手段による復号処理がスキップされる少なくとも１つの参照ピクチャ、あるいは、前記関連付けされたＣＲＡピクチャに復号順で先行する少なくとも１つの参照ピクチャ、を含む参照ピクチャのリストを持つ、当該ピクチャ、
３）前記復元手段により復号され、必要に応じて参照のために前記画像格納手段に格納されるピクチャであって、当該ピクチャは画面間予測を行うために参照ピクチャのリストを持ち、当該参照ピクチャのリストにある全ての参照ピクチャが、前記復元手段により復号され、当該ピクチャに関連付けされたＣＲＡピクチャ以降に復号されることを特徴とする、当該ピクチャ、
の計３つに、前記画像を分類して制御し、前記復元手段は、符号化データの復号が当該ピクチャに関連付けされたＣＲＡピクチャで開始された場合、当該ピクチャに関する参照ピクチャのリストにある全ての参照ピクチャが前記画像格納手段に格納されているか否かを判定し、参照ピクチャのリストにある全ての参照ピクチャが格納されていれば当該ピクチャを復号し、参照ピクチャのリストにある１つ以上の参照ピクチャが格納されていなければ当該ピクチャの復号をスキップする、といった復号処理を、次のＣＲＡピクチャの処理直前までの期間、続ける、ことを特徴とする。 The moving image prediction decoding program according to the present invention causes a computer to function as: an input means for inputting compressed image data including a random access image, obtained by encoding a plurality of images constituting a moving image by either in-screen prediction or inter-screen prediction, and encoded data indicating the output order of each image; a restoration means for decoding the compressed image data and the encoded data and restoring them to a reproduced image and output order information; an image storage means for storing one or more of the reproduced images as reference images used for decoding a subsequent image; and a control means for controlling the restoration means, wherein the control means distinguishes the following:
1) a CRA picture from which decoding of the encoded data is started;
2) a picture that is decoded after its associated CRA picture and output before the associated CRA picture, for which the decoding process by the restoration means is skipped, and which is neither stored in the image storage means nor output, the picture having, for inter-screen prediction, a list of reference pictures including at least one reference picture for which the decoding process by the restoration means is skipped or at least one reference picture that precedes the associated CRA picture in decoding order;
3) a picture that is decoded by the restoration means and stored in the image storage means for reference as needed, the picture having a list of reference pictures for inter-screen prediction, in which every reference picture in the list is decoded by the restoration means and is decoded at or after the CRA picture associated with the picture;
and classifies each image into one of these three categories and controls it accordingly, and wherein, when decoding of the encoded data is started at the CRA picture associated with a picture, the restoration means determines whether all the reference pictures in the list of reference pictures for that picture are stored in the image storage means, decodes the picture if all the reference pictures in the list are stored, and skips decoding of the picture if one or more reference pictures in the list are not stored, and continues this decoding process until immediately before the next CRA picture is processed.

本発明に係る動画像予測復号装置は、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力手段と、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元手段と、前記再生画像を出力する出力手段と、を具備し、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
ｎｏｎ−ＲＡＳリーディングピクチャの画面間予測のために使用される参照ピクチャを含む前記ｎｏｎ−ＲＡＳリーディングピクチャのリファレンス・ピクチャ・セットは、ＲＡＳリーディングピクチャと、関連するランダム・アクセスピクチャに復号順で先行するピクチャとのいずれも含まず、
第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まない、ことを特徴とする。 The moving image prediction decoding device according to the present invention comprises: input means for inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; restoration means for decoding the compressed image data based on the picture type and restoring it to a reproduced image; and output means for outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded;
the reference picture set of a non-RAS leading picture, which includes the reference pictures used for inter-screen prediction of that non-RAS leading picture, includes neither a RAS leading picture nor a picture that precedes the associated random access picture in decoding order; and
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order.
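
The two reference picture set constraints stated above can be expressed as a small conformance check. The following Python sketch is only an illustration under assumptions: the `Pic` class, the string labels for picture types, and the function `rps_violations` are hypothetical names, and pictures are identified by their position in decoding order.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Hypothetical labels for the picture types named in the text.
RAP, RAS, NON_RAS, OTHER = "RAP", "RAS", "NON_RAS", "OTHER"


@dataclass
class Pic:
    order: int                        # position in decoding order
    kind: str                         # one of RAP, RAS, NON_RAS, OTHER
    rap_order: Optional[int] = None   # decoding-order position of the associated random access picture
    rps: List[int] = field(default_factory=list)  # decoding-order positions of reference pictures


def rps_violations(pictures: List[Pic]) -> List[Tuple[int, str]]:
    """Return (decoding-order position, reason) for every constraint violation."""
    by_order = {p.order: p for p in pictures}
    violations = []
    prev_rap = None  # position of the most recent random access picture seen so far
    for p in sorted(pictures, key=lambda q: q.order):
        if p.kind == NON_RAS:
            for r in p.rps:
                if by_order[r].kind == RAS:
                    violations.append((p.order, "references a RAS leading picture"))
                elif r < p.rap_order:
                    violations.append((p.order, "references a picture before its random access picture"))
        elif p.kind == RAP:
            if prev_rap is not None:
                for r in p.rps:
                    if r < prev_rap:
                        violations.append((p.order, "references a picture before the previous random access picture"))
            prev_rap = p.order
    return violations
```

Checking each random access picture against the immediately preceding one is the strictest form of the second constraint, because any picture that precedes an earlier random access picture also precedes the later ones.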

また、第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャに復号順で後続する他のピクチャの画面間予測に使用される参照ピクチャを含む、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まない、ことを特徴としてもよい。 Also, when the second random access picture is decoded after the first random access picture, the reference picture set of the second random access picture, which includes the reference pictures used for inter-screen prediction of other pictures that follow the second random access picture in decoding order, may be characterized in that it does not include any picture that precedes the first random access picture in decoding order.

本発明に係る動画像予測復号方法は、動画像予測復号装置により実行される動画像予測復号方法であって、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力ステップと、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元ステップと、前記再生画像を出力する出力ステップと、を具備し、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
ｎｏｎ−ＲＡＳリーディングピクチャの画面間予測のために使用される参照ピクチャを含む前記ｎｏｎ−ＲＡＳリーディングピクチャのリファレンス・ピクチャ・セットは、ＲＡＳリーディングピクチャと、関連するランダム・アクセスピクチャに復号順で先行するピクチャとのいずれも含まず、
第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まない、ことを特徴とする。 The moving image prediction decoding method according to the present invention is a moving image prediction decoding method executed by a moving image prediction decoding device, comprising: an input step of inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; a restoration step of decoding the compressed image data based on the picture type and restoring it to a reproduced image; and an output step of outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded;
the reference picture set of a non-RAS leading picture, which includes the reference pictures used for inter-screen prediction of that non-RAS leading picture, includes neither a RAS leading picture nor a picture that precedes the associated random access picture in decoding order; and
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order.

本発明に係る動画像予測復号プログラムは、コンピュータを、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力手段と、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元手段と、前記再生画像を出力する出力手段と、として機能させ、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
ｎｏｎ−ＲＡＳリーディングピクチャの画面間予測のために使用される参照ピクチャを含む前記ｎｏｎ−ＲＡＳリーディングピクチャのリファレンス・ピクチャ・セットは、ＲＡＳリーディングピクチャと、関連するランダム・アクセスピクチャに復号順で先行するピクチャとのいずれも含まず、
第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まない、ことを特徴とする。 The moving image prediction decoding program according to the present invention causes a computer to function as: input means for inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; restoration means for decoding the compressed image data based on the picture type and restoring it to a reproduced image; and output means for outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded;
the reference picture set of a non-RAS leading picture, which includes the reference pictures used for inter-screen prediction of that non-RAS leading picture, includes neither a RAS leading picture nor a picture that precedes the associated random access picture in decoding order; and
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order.

本発明に係る動画像予測復号装置は、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力手段と、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元手段と、前記再生画像を出力する出力手段と、を具備し、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まず、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には、ＲＡＳリーディングピクチャは出力されない、ことを特徴とする。 The moving image prediction decoding device according to the present invention comprises: input means for inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; restoration means for decoding the compressed image data based on the picture type and restoring it to a reproduced image; and output means for outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded; and
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order, and a RAS leading picture is not output when its associated random access picture is the first picture of the bitstream in decoding order.

本発明に係る動画像予測復号方法は、動画像予測復号装置により実行される動画像予測復号方法であって、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力ステップと、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元ステップと、前記再生画像を出力する出力ステップと、を具備し、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まず、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には、ＲＡＳリーディングピクチャは出力されない、ことを特徴とする。 The moving image prediction decoding method according to the present invention is a moving image prediction decoding method executed by a moving image prediction decoding device, comprising: an input step of inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; a restoration step of decoding the compressed image data based on the picture type and restoring it to a reproduced image; and an output step of outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded; and
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order, and a RAS leading picture is not output when its associated random access picture is the first picture of the bitstream in decoding order.

本発明に係る動画像予測復号方法は、動画像予測復号装置により実行される動画像予測復号方法であって、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力ステップと、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元ステップと、前記再生画像を出力する出力ステップと、を具備し、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
ｎｏｎ−ＲＡＳリーディングピクチャの画面間予測のために使用される参照ピクチャを含む前記ｎｏｎ−ＲＡＳリーディングピクチャのリファレンス・ピクチャ・セットは、ＲＡＳリーディングピクチャと、関連するランダム・アクセスピクチャに復号順で先行するピクチャとのいずれも含まず、第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まず、前記復元ステップにおいて、前記動画像予測復号装置は、当該ピクチャの復号開始時に、当該ピクチャが正しく復号されるか否かを判定する、ことを特徴とする。 The moving image prediction decoding method according to the present invention is a moving image prediction decoding method executed by a moving image prediction decoding device, comprising: an input step of inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; a restoration step of decoding the compressed image data based on the picture type and restoring it to a reproduced image; and an output step of outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded;
the reference picture set of a non-RAS leading picture, which includes the reference pictures used for inter-screen prediction of that non-RAS leading picture, includes neither a RAS leading picture nor a picture that precedes the associated random access picture in decoding order; when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order; and in the restoration step, the moving image prediction decoding device determines, at the start of decoding each picture, whether or not that picture will be correctly decoded.

前記復元ステップの判定において、前記動画像予測復号装置は、当該ピクチャが前記ＲＡＳリーディングピクチャであるか否かを判定し、当該ピクチャが前記ＲＡＳリーディングピクチャであれば当該ピクチャが正しく復号されないと判定し、当該ピクチャが前記ＲＡＳリーディングピクチャでなければ当該ピクチャが正しく復号されると判定する、ことを特徴としてもよい。 In the determination of the restoration step, the moving image prediction decoding device may determine whether or not the picture is a RAS leading picture, determining that the picture will not be correctly decoded if it is a RAS leading picture and that the picture will be correctly decoded if it is not.

前記復元ステップの判定において、前記動画像予測復号装置は、当該ピクチャに関する参照ピクチャのリストにある全ての参照ピクチャが格納されているか否かを判定し、参照ピクチャのリストにある全ての参照ピクチャが格納されていれば当該ピクチャが正しく復号されると判定し、参照ピクチャのリストにある１つ以上の参照ピクチャが格納されていなければ当該ピクチャが正しく復号されないと判定する、ことを特徴としてもよい。 In the determination of the restoration step, the moving image prediction decoding device may determine whether or not all the reference pictures in the list of reference pictures for the picture are stored, determining that the picture will be correctly decoded if all of them are stored and that the picture will not be correctly decoded if one or more of them are not stored.
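
The two alternative determinations described above, one based on the picture type and one based on the stored reference pictures, can be placed side by side in a short Python sketch. The function names and the string label `"RAS_LEADING"` are assumptions made for illustration and do not appear in this publication.

```python
from typing import Iterable


def decodable_by_type(picture_type: str) -> bool:
    """Type-based variant: only a RAS leading picture is judged not correctly decodable."""
    return picture_type != "RAS_LEADING"  # hypothetical label for the picture type


def decodable_by_references(ref_list: Iterable[int], stored_ids: Iterable[int]) -> bool:
    """List-based variant: correctly decodable only if every reference picture is stored."""
    return set(ref_list) <= set(stored_ids)
```

The type-based variant needs only the NAL unit type of the picture, so the decision can be made without inspecting the reference picture list, whereas the list-based variant requires the contents of the frame memory at the time decoding of the picture starts.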

本発明に係る動画像予測復号方法は、動画像予測復号装置により実行される動画像予測復号方法であって、動画像を構成する複数のピクチャのための圧縮画像データであって、ピクチャごとに、ランダム・アクセスピクチャと、ランダム・アクセス・スキップ（ＲＡＳ）リーディングピクチャと、ｎｏｎ−ＲＡＳリーディングピクチャとを含む複数のピクチャ・タイプのうちのひとつとして当該ピクチャを識別するＮＡＬユニット・タイプの情報を有する圧縮画像データを含むビットストリームを入力する入力ステップと、前記ピクチャ・タイプに基づいて、前記圧縮画像データを復号し、再生画像に復元する復元ステップと、前記再生画像を出力する出力ステップと、を具備し、
１）ランダム・アクセスピクチャは、前記ビットストリームの任意のランダム・アクセスピクチャから開始される復号プロセスが当該ランダム・アクセスピクチャから開始される場合、復号順で前記ビットストリームの最初のピクチャであり、
２）ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には復号できないピクチャであり、
３）ｎｏｎ−ＲＡＳリーディングピクチャは、関連するランダム・アクセスピクチャに出力順で先行するピクチャであって、復号できるピクチャであり、
第２のランダム・アクセスピクチャが第１のランダム・アクセスピクチャの後に復号される場合、前記第２のランダム・アクセスピクチャのリファレンス・ピクチャ・セットは、前記第１のランダム・アクセスピクチャに復号順で先行するいずれのピクチャも含まず、関連するランダム・アクセスピクチャが復号順で前記ビットストリームの最初のピクチャである場合には、ＲＡＳリーディングピクチャは出力されず、前記復元ステップにおいて、復号プロセスがランダム・アクセスピクチャから開始される場合、ｎｏｎ−ＲＡＳリーディングピクチャを復号し、ＲＡＳリーディングピクチャの復号をスキップする、ことを特徴とする。 The moving image prediction decoding method according to the present invention is a moving image prediction decoding method executed by a moving image prediction decoding device, comprising: an input step of inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture; a restoration step of decoding the compressed image data based on the picture type and restoring it to a reproduced image; and an output step of outputting the reproduced image, wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded; and
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order; a RAS leading picture is not output when its associated random access picture is the first picture of the bitstream in decoding order; and in the restoration step, when the decoding process is started from a random access picture, non-RAS leading pictures are decoded and decoding of RAS leading pictures is skipped.
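
As a rough illustration of the restoration and output steps described above, the following Python sketch starts decoding at a random access picture, skips RAS leading pictures, and emits the remaining pictures in output order. The `Pic` class, the picture-type labels, and the function `decode_and_output` are hypothetical, and actual decoding is replaced by simply collecting the pictures.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Pic:
    name: str
    kind: str        # "RAP", "RAS_LEADING", "NON_RAS_LEADING" or "TRAILING" (hypothetical labels)
    output_pos: int  # position in output order


def decode_and_output(pictures_in_decoding_order: List[Pic]) -> List[str]:
    """Decode from a random access picture and return picture names in output order."""
    if not pictures_in_decoding_order or pictures_in_decoding_order[0].kind != "RAP":
        raise ValueError("decoding is assumed to start from a random access picture")
    decoded = []
    for pic in pictures_in_decoding_order:
        if pic.kind == "RAS_LEADING":
            continue  # decoding is skipped, so the picture is never output
        decoded.append(pic)  # stands in for the actual decoding of the picture
    return [p.name for p in sorted(decoded, key=lambda p: p.output_pos)]
```

In this sketch a non-RAS leading picture is output before its random access picture even though it follows that picture in decoding order, which is the case the picture-type distinction is meant to preserve.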

１０…記録媒体、３０…コンピュータ、１００…動画像予測符号化装置、１０１…入力端子、１０２…ブロック分割器、１０３…予測信号生成器、１０４…フレームメモリ、１０５…減算器、１０６…変換器、１０７…量子化器、１０８…逆量子化器、１０９…逆変換器、１１０…加算器、１１１…エントロピー符号化器、１１２…出力端子、１１３…入力端子、１１４…フレームメモリ管理器、２００…動画像予測復号装置、２０１…入力端子、２０２…データ解析器、２０３…逆量子化器、２０４…逆変換器、２０５…加算器、２０６…出力端子、２０７…フレームメモリ、２０８…予測信号生成器、２０９…フレームメモリ管理器、２１０…制御器、Ｐ１００…動画像予測符号化プログラム、Ｐ１０１…入力モジュール、Ｐ１０２…符号化モジュール、Ｐ１０３…復元モジュール、Ｐ１０４…画像格納モジュール、Ｐ１０５…制御モジュール、Ｐ２００…動画像予測復号プログラム、Ｐ２０１…入力モジュール、Ｐ２０２…復元モジュール、Ｐ２０３…画像格納モジュール、Ｐ２０４…制御モジュール。 10 ... recording medium, 30 ... computer, 100 ... moving image prediction encoding device, 101 ... input terminal, 102 ... block divider, 103 ... prediction signal generator, 104 ... frame memory, 105 ... subtractor, 106 ... transformer, 107 ... quantizer, 108 ... inverse quantizer, 109 ... inverse transformer, 110 ... adder, 111 ... entropy encoder, 112 ... output terminal, 113 ... input terminal, 114 ... frame memory manager, 200 ... moving image prediction decoding device, 201 ... input terminal, 202 ... data analyzer, 203 ... inverse quantizer, 204 ... inverse transformer, 205 ... adder, 206 ... output terminal, 207 ... frame memory, 208 ... prediction signal generator, 209 ... frame memory manager, 210 ... controller, P100 ... moving image prediction encoding program, P101 ... input module, P102 ... encoding module, P103 ... restoration module, P104 ... image storage module, P105 ... control module, P200 ... moving image prediction decoding program, P201 ... input module, P202 ... restoration module, P203 ... image storage module, P204 ... control module.

Claims

A moving image prediction decoding method executed by a moving image prediction decoding device, comprising:
an input step of inputting a bitstream containing compressed image data for a plurality of pictures constituting a moving image, the compressed image data having, for each picture, NAL unit type information that identifies the picture as one of a plurality of picture types including a random access picture, a random access skip (RAS) leading picture, and a non-RAS leading picture;
a restoration step of decoding the compressed image data based on the picture type and restoring it to a reproduced image; and
an output step of outputting the reproduced image,
wherein:
1) a random access picture is the first picture of the bitstream in decoding order when the decoding process, which can be started from any random access picture of the bitstream, is started from that random access picture;
2) a RAS leading picture is a picture that precedes its associated random access picture in output order and cannot be decoded when the associated random access picture is the first picture of the bitstream in decoding order;
3) a non-RAS leading picture is a picture that precedes its associated random access picture in output order and can be decoded;
the reference picture set of a non-RAS leading picture, which includes the reference pictures used for inter-screen prediction of that non-RAS leading picture, includes neither a RAS leading picture nor a picture that precedes the associated random access picture in decoding order;
when a second random access picture is decoded after a first random access picture, the reference picture set of the second random access picture does not include any picture that precedes the first random access picture in decoding order; and
in the restoration step, when the decoding process is started from a random access picture, non-RAS leading pictures are decoded and decoding of RAS leading pictures is skipped.