JP2004242286A

JP2004242286A - Motion picture encoding method and motion picture decoding method

Info

Publication number: JP2004242286A
Application number: JP2003403183A
Authority: JP
Inventors: Shinya Sumino; 眞也角野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-01-17
Filing date: 2003-12-02
Publication date: 2004-08-26

Abstract

<P>PROBLEM TO BE SOLVED: To provide a motion picture encoding method and motion picture decoding method or the like with which a picture to be decoded for variable speed reproduction can be easily specified and encoding and decoding suited to variable speed reproduction can be performed. <P>SOLUTION: A motion picture encoding apparatus 1 comprises a map creation part 113 for creating information Map for variable speed reproduction that is information required for variable speed reproduction in accordance with a picture type Ptype, a variable length encoding part 112 for encoding and disposing the information Map for variable speed reproduction in an encoded stream Str, a detection part 114 for detecting whether or not to encode a picture parameter set PPS that a picture of an encoding targets refers to, and a common information adding part 115 for adding the picture parameter set PPS that is detected to be encoded by the detection part 114, to the encoding target picture. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

本発明は、動画像を可変速再生できるように符号化する動画像符号化方法と、そのように符号化されたストリームを復号化する動画像復号化方法、およびそのストリームに関する。 The present invention relates to a moving image encoding method for encoding a moving image so that the moving image can be reproduced at a variable speed, a moving image decoding method for decoding a stream encoded as such, and a stream thereof.

近年、音声、画像、その他の画素値を統合的に扱うマルチメディア時代を迎え、従来からの情報メディア、つまり新聞、雑誌、テレビ、ラジオ、電話等の情報を人に伝達する手段がマルチメディアの対象として取り上げられるようになってきた。一般に、マルチメディアとは、文字だけでなく、図形、音声、特に画像等を同時に関連づけて表すことをいうが、上記従来の情報メディアをマルチメディアの対象とするには、その情報をディジタル形式にして表すことが必須条件となる。 In recent years, the multimedia era, in which audio, images, and other pixel values are integrated, has been approached, and the traditional information media, that is, means for transmitting information such as newspapers, magazines, televisions, radios, and telephones to humans, has It has been taken up as an object. Generally, multimedia means not only characters, but also figures, sounds, and especially images, etc., that are simultaneously associated with each other. Is an essential condition.

ところが、上記各情報メディアの持つ情報量をディジタル情報量として見積もってみると、文字の場合１文字当たりの情報量は１〜２バイトであるのに対し、音声の場合１秒当たり64Kbits（電話品質）、さらに動画については１秒当たり100Mbits（現行テレビ受信品質）以上の情報量が必要となり、上記情報メディアでその膨大な情報をディジタル形式でそのまま扱うことは現実的では無い。例えば、テレビ電話は、64Kbit/s〜1.5Mbits/sの伝送速度を持つサービス総合ディジタル網（ISDN : Integrated Services Digital Network）によってすでに実用化されているが、テレビ・カメラの映像をそのままISDNで送ることは不可能である。 However, when the amount of information of each information medium is estimated as a digital information amount, the amount of information per character is 1-2 bytes in the case of characters, whereas 64 Kbits per second in the case of voice (telephone quality). In addition, for a moving image, an information amount of 100 Mbits per second (current television reception quality) or more is required, and it is not realistic to handle the vast amount of information in the above-mentioned information medium as it is in a digital format. For example, videophones have already been put into practical use by the Integrated Services Digital Network (ISDN), which has a transmission speed of 64 Kbit / s to 1.5 Mbits / s. It is impossible.

そこで、必要となってくるのが情報の圧縮技術であり、例えば、テレビ電話の場合、ITU-T（国際電気通信連合電気通信標準化部門）で勧告されたH.261やH.263規格の動画圧縮技術が用いられている。また、MPEG-１規格の情報圧縮技術によると、通常の音楽用CD（コンパクト・ディスク）に音声情報とともに画像情報を入れることも可能となる。
ここで、MPEG（Moving Picture Experts Group）とは、ISO/IEC（国際標準化機構国際電気標準会議）で標準化された動画像信号圧縮の国際規格であり、MPEG-１は、動画像信号を１．５Mbpsまで、つまりテレビ信号の情報を約１００分の１にまで圧縮する規格である。また、MPEG-１規格では対象とする品質を伝送速度が主として約１．５Mbpsで実現できる程度の中程度の品質としたことから、さらなる高画質化の要求をみたすべく規格化されたMPEG-２では、動画像信号を２〜１５MbpsでＴＶ放送品質を実現する。 Therefore, it is necessary to use information compression technology. For example, in the case of videophones, videos based on the H.261 and H.263 standards recommended by the ITU-T (International Telecommunication Union Telecommunication Standardization Sector) Compression technology is used. Further, according to the information compression technology of the MPEG-1 standard, it is possible to store image information together with audio information in a normal music CD (compact disc).
Here, MPEG (Moving Picture Experts Group) is an international standard for moving image signal compression standardized by ISO / IEC (International Organization for Standardization), and MPEG-1 is used for moving image signals. It is a standard that compresses information of television signals to 5 Mbps, that is, to about 1/100. In addition, since the target quality is set to a medium quality that can be realized at a transmission speed of about 1.5 Mbps mainly in the MPEG-1 standard, MPEG-2 standardized to meet the demand for higher image quality. Then, the TV broadcast quality of the moving image signal is realized at 2 to 15 Mbps.

さらに現状では、MPEG-１、MPEG-２と標準化を進めてきた作業グループ（ISO/IEC JTC1/SC29/WG11）によって、MPEG-１、MPEG-２を上回る圧縮率を達成し、更に物体単位で符号化・復号化・操作を可能とし、マルチメディア時代に必要な新しい機能を実現するMPEG-４が規格化された。MPEG-４では、当初、低ビットレートの符号化方法の標準化を目指して進められたが、現在はインタレース画像も含む高ビットレートも含む、より汎用的な符号化に拡張されている。更に、現在は、ISO/IECとITU-Tが共同でより高圧縮率の次世代画像符号化方式として、MPEG-4 AVCおよびITU H.264の標準化活動が進んでいる。2002年5月の時点で、次世代画像符号化方式はコミッティー・ドラフト（CD）と呼ばれるものが発行されている（例えば、非特許文献１参照。）。 Furthermore, at present, the working group (ISO / IEC JTC1 / SC29 / WG11), which has been working on the standardization of MPEG-1 and MPEG-2, has achieved compression ratios exceeding those of MPEG-1 and MPEG-2, and furthermore, on an object-by-object basis. MPEG-4, which enables encoding, decoding, and operation, and realizes new functions required in the multimedia age, has been standardized. MPEG-4 was initially aimed at standardizing a low bit rate coding method, but is now expanded to more general-purpose coding including high bit rates including interlaced images. In addition, ISO / IEC and ITU-T are currently working together to standardize MPEG-4 AVC and ITU H.264 as next-generation image coding systems with higher compression ratios. As of May 2002, a so-called committee draft (CD) has been issued as a next-generation image coding method (for example, see Non-Patent Document 1).

一般に動画像の符号化では、時間方向および空間方向の冗長性を削減することによって情報量の圧縮を行う。そこで時間的な冗長性の削減を目的とする画面間予測符号化では、前方または後方のピクチャを参照してブロック単位で動きの検出および予測画像の作成を行い、得られた予測画像と符号化対象ピクチャとの差分値に対して符号化を行う。ここで、ピクチャとは１枚の画面を表す用語であり、プログレッシブ画像ではフレームを意味し、インタレース画像ではフレームもしくはフィールドを意味する。ここで、インタレース画像とは、１つのフレームが時刻の異なる２つのフィールドから構成される画像である。インタレース画像の符号化や復号化処理においては、１つのフレームをフレームのまま処理したり、２つのフィールドとして処理したり、フレーム内のブロック毎にフレーム構造またはフィールド構造として処理したりすることができる。 Generally, in coding of a moving image, the amount of information is compressed by reducing redundancy in the time direction and the space direction. Therefore, in inter-picture predictive coding for the purpose of reducing temporal redundancy, motion detection and a predicted image are created in block units with reference to a forward or backward picture, and the obtained predicted image and coding are performed. Encoding is performed on the difference value from the target picture. Here, a picture is a term representing one screen, and means a frame in a progressive image and a frame or a field in an interlaced image. Here, an interlaced image is an image in which one frame is composed of two fields at different times. In encoding and decoding of an interlaced image, one frame may be processed as a frame, processed as two fields, or processed as a frame structure or a field structure for each block in the frame. it can.

参照画像を持たず画面内予測符号化を行うものをＩピクチャと呼ぶ。また、１枚のピクチャのみを参照し画面間予測符号化を行うものをＰピクチャと呼ぶ。また、同時に２枚のピクチャを参照して画面間予測符号化を行うことのできるものをＢピクチャと呼ぶ。Ｂピクチャは表示時間が前方もしくは後方から任意の組み合わせとして２枚のピクチャを参照することが可能である。参照画像（参照ピクチャ）は符号化および復号化の基本単位であるブロックごとに指定することができるが、符号化を行ったビットストリーム中に先に記述される方の参照ピクチャを第１参照ピクチャ、後に記述される方を第２参照ピクチャとして区別する。ただし、これらのピクチャを符号化および復号化する場合の条件として、参照するピクチャが既に符号化および復号化されている必要がある。 A picture that does not have a reference picture and performs intra-picture prediction coding is called an I picture. A picture that performs inter-picture predictive encoding by referring to only one picture is called a P picture. A picture that can perform inter-picture predictive encoding by simultaneously referring to two pictures is called a B picture. The B picture can refer to two pictures as an arbitrary combination of display time from the front or the rear. The reference picture (reference picture) can be specified for each block, which is a basic unit of encoding and decoding, but the reference picture described earlier in the encoded bit stream is replaced with the first reference picture. , The one described later is distinguished as the second reference picture. However, as a condition for encoding and decoding these pictures, the picture to be referred to must be already encoded and decoded.

Ｐピクチャ又はＢピクチャの符号化には、動き補償画面間予測符号化が用いられている。動き補償画面間予測符号化とは、画面間予測符号化に動き補償を適用した符号化方式である。動き補償とは、単純に参照ピクチャの画素値から予測するのではなく、ピクチャ内の各部の動き量（以下、これを動きベクトルと呼ぶ）を検出し、当該動き量を考慮した予測を行うことにより予測精度を向上すると共に、データ量を減らす方式である。例えば、符号化対象ピクチャの動きベクトルを検出し、その動きベクトルの分だけシフトした予測値と符号化対象ピクチャとの予測残差を符号化することによりデータ量を減らしている。この方式の場合には、復号化の際に動きベクトルの情報が必要になるため、動きベクトルも符号化されて記録又は伝送される。 Motion-compensated inter-picture prediction coding is used for coding a P picture or a B picture. The motion compensated inter-picture predictive coding is a coding method in which motion compensation is applied to the inter-picture predictive coding. Motion compensation refers to detecting a motion amount (hereinafter, referred to as a motion vector) of each part in a picture and performing prediction in consideration of the motion amount, instead of simply predicting from a pixel value of a reference picture. Is a method for improving the prediction accuracy and reducing the amount of data. For example, a data amount is reduced by detecting a motion vector of a current picture to be coded and coding a prediction residual between a predicted value shifted by the motion vector and the current picture to be coded. In the case of this method, since information of a motion vector is required at the time of decoding, the motion vector is also encoded and recorded or transmitted.

動きベクトルはマクロブロックもしくはブロック単位で検出されており、具体的には、符号化対象ピクチャ側のマクロブロックもしくはブロックを固定しておき、参照ピクチャ側のマクロブロックもしくはブロックを探索範囲内で移動させ、基準マクロブロックもしくはブロックと最も似通った参照ブロックの位置を見つけることにより、動きベクトルが検出される。 The motion vector is detected in macroblock or block units. Specifically, the macroblock or block on the encoding target picture side is fixed, and the macroblock or block on the reference picture side is moved within the search range. , The motion vector is detected by finding the position of the reference block most similar to the reference macroblock or block.

図１９は従来のMPEG2のストリームの構成を示す図であり、（ａ）ピクチャの流れ、（ｂ）ストリームの階層構造を示す図である。図１９（ａ）、（ｂ）に示すようにMPEG2のストリームは以下のような階層構造を有している。ストリーム（Stream）は複数のグループ・オブ・ピクチャ（GOP：Group Of Picture）から構成されており、これを符号化処理の基本単位とすることで動画像の編集やランダムアクセスが可能になっている。グループ・オブ・ピクチャは、複数のピクチャから構成され、各ピクチャは、Ｉピクチャ、Ｐピクチャ又はＢピクチャがある。ストリーム、GOPおよびピクチャはさらにそれぞれの単位の区切りを示す同期信号（sync）と当該単位に共通のデータであるヘッダ（header）から構成されている。 FIG. 19 is a diagram showing the configuration of a conventional MPEG2 stream, in which (a) shows the flow of pictures and (b) shows the hierarchical structure of the stream. As shown in FIGS. 19A and 19B, the MPEG2 stream has the following hierarchical structure. The stream (Stream) is composed of a plurality of Group Of Pictures (GOPs), and by using this as a basic unit of the encoding process, editing of moving images and random access are possible. . The group of pictures is composed of a plurality of pictures, and each picture includes an I picture, a P picture, or a B picture. The stream, GOP, and picture further include a synchronization signal (sync) indicating a break of each unit and a header (header) which is data common to the unit.

図２０は他の従来のストリームの階層構造を示す図である。このストリームは現在ITU-TとISO/IECが共同で標準化中のJVT（H.264/MPEG-4 AVC）に対応する。JVTでは、ヘッダという概念は無く、共通データはストリームの先頭にパラメータセットPSという名称で配置される。また、GOPに相当する概念は無いが他のピクチャに依存せずに復号化できる特別なピクチャ単位でデータを分割すればGOPに相当するランダムアクセス可能な単位が構成できるので、これをランダムアクセスユニットRAUと呼ぶことにする。 FIG. 20 is a diagram showing a hierarchical structure of another conventional stream. This stream supports JVT (H.264 / MPEG-4 AVC), which is currently being jointly standardized by ITU-T and ISO / IEC. In JVT, there is no concept of a header, and common data is arranged at the head of a stream under the name of a parameter set PS. Also, if there is no concept equivalent to a GOP, but data is divided into special picture units that can be decoded without depending on other pictures, a random accessible unit equivalent to the GOP can be configured. Let's call it RAU.

パラメータセットPSは各ピクチャのヘッダに相当するデータであるピクチャパラメータセットPPSとMPEG-2のGOPもしくはシーケンス単位のヘッダに相当するシーケンスパラメータセットSPSがある。各ピクチャには前記ピクチャパラメータセットPPSおよびシーケンスパラメータセットSPSの複数の候補の中から何れを参照するかを示す識別子が付与される。即ち、ピクチャパラメータセットPPSおよびシーケンスパラメータセットSPSは複数のセットを1回だけ符号化し、各ピクチャではそのセットの中のどれを参照するかを識別子で示すことで、MPEG-2のように各ピクチャ毎に同じ値のヘッダ（パラメータセット）を何回も符号化する無駄を省き圧縮率を向上している。 The parameter set PS includes a picture parameter set PPS, which is data corresponding to the header of each picture, and a sequence parameter set SPS, which corresponds to a GOP of MPEG-2 or a header in sequence units. Each picture is provided with an identifier indicating which of the plurality of candidates for the picture parameter set PPS and the sequence parameter set SPS is to be referred to. That is, the picture parameter set PPS and the sequence parameter set SPS encode a plurality of sets only once, and in each picture, which identifier is used to indicate which of the sets is to be referred to, each picture can be encoded as in MPEG-2. The compression ratio is improved by eliminating the need to encode the header (parameter set) having the same value many times for each time.

ピクチャ番号PNはピクチャを識別するための識別番号である。シーケンスパラメータセットSPSには、最大参照可能ピクチャ数、画像サイズ等が含まれており、ピクチャパラメータセットPPSには、可変長符号化のタイプ（ハフマン符号化と算術符号化の切替）、量子化ステップの初期値、参照ピクチャ数等が含まれている。
図２１は従来の動画像符号化方法を実現する動画像符号化装置の構成を示すブロック図である。 The picture number PN is an identification number for identifying a picture. The sequence parameter set SPS includes the maximum number of pictures that can be referred to, the image size, and the like. The picture parameter set PPS includes the type of variable-length coding (switching between Huffman coding and arithmetic coding) and the quantization step. , The number of reference pictures, and the like.
FIG. 21 is a block diagram showing a configuration of a moving picture coding apparatus that realizes a conventional moving picture coding method.

動画像符号化装置３は、入力される画像信号Vin を圧縮符号化して可変長符号化等のビットストリームに変換した符号化ストリームStr を出力する装置であり、動き検出部１０１、動き補償部１０２、減算部１０３、直交変換部１０４、量子化部１０５、逆量子化部１０６、逆直交変換部１０７、加算部１０８、ピクチャメモリ１０９、スイッチ１１０、予測構造決定部１１１、および可変長符号化部３０１を備えている。 The video encoding device 3 is a device that outputs an encoded stream Str obtained by compression-encoding an input image signal Vin and converting it into a bit stream such as variable-length encoding. The motion detection unit 101 and the motion compensation unit 102 , Subtractor 103, orthogonal transformer 104, quantizer 105, inverse quantizer 106, inverse orthogonal transformer 107, adder 108, picture memory 109, switch 110, predictive structure determiner 111, and variable length encoder 301 is provided.

画像信号Vin は、減算部１０３および動き検出部１０１に入力される。減算部１０３は、入力された画像信号Vin と予測画像の差分値を計算し、直交変換部１０４に出力する。直交変換部１０４は、差分値を周波数係数に変換し、量子化部１０５に出力する。量子化部１０５は、入力された周波数係数を量子化し、量子化値Qcoefを可変長符号化部３０１に出力する。 The image signal Vin is input to the subtraction unit 103 and the motion detection unit 101. The subtraction unit 103 calculates a difference value between the input image signal Vin and the predicted image, and outputs the difference value to the orthogonal transformation unit 104. The orthogonal transform unit 104 converts the difference value into a frequency coefficient, and outputs the frequency coefficient to the quantization unit 105. The quantization unit 105 quantizes the input frequency coefficient and outputs a quantized value Qcoef to the variable length coding unit 301.

逆量子化部１０６は、量子化値Qcoefを逆量子化して周波数係数に復元し、逆直交変換部１０７に出力する。逆直交変換部１０７は、周波数係数から画素差分値に逆周波数変換し、加算部１０８に出力する。加算部１０８は、画素差分値と動き補償部１０２から出力される予測画像とを加算して復号化画像とする。スイッチ１１０は、当該復号化画像の保存が指示された場合にＯＮになり、復号化画像はピクチャメモリ１０９に保存される。 The inverse quantization unit 106 inversely quantizes the quantized value Qcoef to restore the frequency coefficient, and outputs the frequency coefficient to the inverse orthogonal transform unit 107. The inverse orthogonal transform unit 107 performs an inverse frequency transform from the frequency coefficient to the pixel difference value, and outputs the result to the adding unit 108. The addition unit 108 adds the pixel difference value and the predicted image output from the motion compensation unit 102 to obtain a decoded image. The switch 110 is turned on when the storage of the decoded image is instructed, and the decoded image is stored in the picture memory 109.

一方、画像信号Vin がマクロブロック単位で入力された動き検出部１０１は、ピクチャメモリ１０９に格納されている復号化画像を探索対象とし、最も入力画像信号に近い画像領域を検出してその位置を指し示す動きベクトルＭＶを決定する。動きベクトル検出はマクロブロックをさらに分割したブロック単位で行われる。このとき、複数のピクチャを参照ピクチャとして使用することができるため、参照するピクチャを指定するための識別番号（参照インデックスIndex）がブロックごとに必要となる。参照インデックスIndexによって、ピクチャメモリ１０９中の各ピクチャが有するピクチャ番号との対応を取ることにより参照ピクチャを指定することが可能となる。 On the other hand, the motion detection unit 101 to which the image signal Vin is input in units of macroblocks sets the decoded image stored in the picture memory 109 as a search target, detects an image area closest to the input image signal, and determines the position. The pointing motion vector MV is determined. Motion vector detection is performed for each block obtained by further dividing a macroblock. At this time, since a plurality of pictures can be used as reference pictures, an identification number (reference index Index) for specifying a picture to be referenced is required for each block. By using the reference index Index, it is possible to specify a reference picture by associating it with the picture number of each picture in the picture memory 109.

動き補償部１０２では、上記処理によって検出された動きベクトルＭＶおよび参照インデックスIndexを用いて、ピクチャメモリ１０９に格納されている復号化画像から予測画像に最適な画像領域を取り出す。
予測構造決定部１１１は、対象ピクチャがランダムアクセスユニットRAUの開始位置を示すランダムアクセスユニット開始ピクチャRAUinであれば、この対象ピクチャをランダムアクセスが可能な特別なピクチャとして符号化（画面内符号化）するように、ピクチャタイプPtypeにより動き検出部１０１および動き補償部１０２に指示する。また、予測構造決定部１１１は、そのピクチャタイプPtypeを可変長符号化部３０１に出力する。 The motion compensation unit 102 extracts an optimal image area for a predicted image from the decoded image stored in the picture memory 109, using the motion vector MV detected by the above process and the reference index Index.
If the target picture is a random access unit start picture RAUin indicating the start position of the random access unit RAU, the prediction structure determination unit 111 codes the target picture as a special picture that can be randomly accessed (intra-screen coding). To the motion detection unit 101 and the motion compensation unit 102 according to the picture type Ptype. Further, the prediction structure determination unit 111 outputs the picture type Ptype to the variable length coding unit 301.

可変長符号化部３０１は量子化値Qcoef、参照インデックスIndex、ピクチャタイプPtypeおよび動きベクトルＭＶを可変長符号化して符号化ストリームStrとする。
図２２は従来の動画像復号化方法を実現する動画像復号化装置の構成を示すブロック図である。同図において、図２１の従来の動画像符号化方法を実現する動画像符号化装置と同じ動作をする機器は同じ記号を付し、説明を省略する。 The variable-length coding unit 301 performs variable-length coding on the quantization value Qcoef, the reference index Index, the picture type Ptype, and the motion vector MV to obtain a coded stream Str.
FIG. 22 is a block diagram showing a configuration of a moving picture decoding apparatus that realizes a conventional moving picture decoding method. In the figure, devices that perform the same operations as those of the moving picture coding apparatus that realizes the conventional moving picture coding method of FIG. 21 are denoted by the same reference numerals, and description thereof is omitted.

可変長復号化部４０１は符号化ストリームStrを復号化し、量子化値Qcoef、参照インデックスIndex、ピクチャタイプPtypeおよび動きベクトルＭＶを出力する。量子化値Qcoef、参照インデックスIndexおよび動きベクトルＭＶは、ピクチャメモリ２０８、動き補償部２０４および逆量子化部２０５に入力され復号化処理が行われるが、その動作は図２１の従来の動画像符号化装置と同様である。
Text of Committee Draft of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC) 0.4 'Overview of the syntax' 0.4.1 'Temporal processing' 8.2.2 'Parameter set decoding' The variable length decoding unit 401 decodes the coded stream Str, and outputs a quantized value Qcoef, a reference index Index, a picture type Ptype, and a motion vector MV. The quantization value Qcoef, the reference index Index, and the motion vector MV are input to the picture memory 208, the motion compensation unit 204, and the inverse quantization unit 205 to perform decoding processing. It is the same as the chemical conversion device.
Text of Committee Draft of Joint Video Specification (ITU-T Rec.H.264 | ISO / IEC 14496-10 AVC) 0.4 'Overview of the syntax' 0.4.1 'Temporal processing' 8.2.2 'Parameter set decoding'

しかしながら、ランダムアクセスユニットRAUはそのユニットのピクチャだけで復号化が可能であるが、従来のJVTの符号化方法およびストリームではVTRやディスクレコーダ等の蓄積装置で重要な可変速再生のための情報を得ることができない。これは、JVTが符号化効率（圧縮率）を大きく向上させるために、非常に柔軟なピクチャ間の予測構造を導入したことによる。 However, the random access unit RAU can decode only the picture of the unit, but the conventional JVT encoding method and stream can store information for variable speed playback that is important in storage devices such as VTRs and disk recorders. I can't get it. This is because JVT introduced a very flexible inter-picture prediction structure in order to greatly improve the coding efficiency (compression rate).

図２３はピクチャの参照関係の例を示す模式図である。図２３（ａ）はMPEG-2で使用されているピクチャ間の予測構造である。同図で斜線をつけたピクチャは他のピクチャから参照されるピクチャである。MPEG-2ではPピクチャ（P4、P7）は表示時刻が直前１枚のIピクチャもしくはPピクチャのみ参照した予測符号化が可能である。また、Bピクチャ（B1、B2、B3、B5、B6）は表示時刻が直前１枚と直後１枚のIピクチャもしくはPピクチャを参照した予測符号化が可能である。更に、ストリームに配置される順序も決まっており、IピクチャおよびPピクチャは表示時刻の順序、Bピクチャは直後に表示されるIピクチャもしくはPピクチャの直後に配置される。従って、１）全てのピクチャを復号化、２）IピクチャとPピクチャのストリームのみ復号化してIピクチャとPピクチャのみ表示、３）Iピクチャのストリームのみ復号化して表示、の３通りで復号化できるため、１）の通常の再生から２）の中速再生、３）の高速再生の３通りが容易に実現できる。 FIG. 23 is a schematic diagram showing an example of a picture reference relationship. FIG. 23A shows a prediction structure between pictures used in MPEG-2. In the figure, the hatched pictures are pictures referred to by other pictures. In MPEG-2, P pictures (P4, P7) can be predictively coded with reference to only one I picture or P picture immediately before the display time. The B pictures (B1, B2, B3, B5, B6) can be predictively coded with reference to one I picture or one P picture immediately before and one immediately after the display time. Furthermore, the order in which the pictures are arranged in the stream is also determined. The I picture and the P picture are arranged in the order of the display time, and the B picture is arranged immediately after the I picture or the P picture displayed immediately after. Therefore, decoding is performed in three ways: 1) decoding all pictures, 2) decoding only I and P picture streams and displaying only I and P pictures, and 3) decoding and displaying only I picture streams. Therefore, three modes, i.e., 1) normal reproduction, 2) medium speed reproduction, and 3) high speed reproduction, can be easily realized.

JVTではBピクチャからBピクチャを参照した予測も可能である。図２３（ｂ）はJVTの予測の例であり、Bピクチャ（B1、B3）はBピクチャ（B2）を参照している。この例では、１）全てのピクチャを復号化、２）Iピクチャ、Pピクチャ、Bピクチャの参照されるストリームのみ復号化して表示、３）IピクチャとPピクチャのストリームのみ復号化してIピクチャとPピクチャのみ表示、４）Iピクチャのストリームのみ復号化して表示、の４通りが実現できる。 In JVT, prediction that refers to a B picture from a B picture is also possible. FIG. 23B shows an example of JVT prediction, in which B pictures (B1, B3) refer to B pictures (B2). In this example, 1) decoding all pictures, 2) decoding and displaying only the referenced streams of I-pictures, P-pictures, and B-pictures, 3) decoding only the I-picture and P-picture streams, and Only the P picture is displayed, and 4) only the I picture stream is decoded and displayed.

しかしながら、JVTでは更にPピクチャからBピクチャを参照することも可能になっており、図２４に示すように、Pピクチャ（P7）がBピクチャ（B2）を参照することもできる。この場合は、Pピクチャ（P7）はBピクチャ（B2）が復号化できていなければ復号化ができないため、１）全てのピクチャを復号化、２）Iピクチャ、Pピクチャ、Bピクチャの参照されるストリームのみ復号化して表示、３）Iピクチャのストリームのみ復号化して表示、の３通りが実現できる。 However, in the JVT, it is also possible to refer to a B picture from a P picture, and as shown in FIG. 24, a P picture (P7) can also refer to a B picture (B2). In this case, since the P picture (P7) cannot be decoded unless the B picture (B2) has been decoded, 1) all pictures are decoded, and 2) I pictures, P pictures, and B pictures are referred to. 3) decoding and displaying only the I-picture stream, and 3) decoding and displaying only the I-picture stream.

このようにJVTでは非常に柔軟な予測構造が許容されるため、実際の予測構造がわからなければどのような可変速再生が可能か不明である。そこで、図２３および図２４の例から予測構造に拠らず実現できるのは、１）全てのピクチャを復号化、２）Iピクチャのストリームのみ復号化して表示、の高々２通りにすぎない。これでは、MPEG-2で実現できる可変速再生と比較してあまりにも実現できる速度の選択肢が少なすぎる。 As described above, since a very flexible prediction structure is allowed in JVT, it is unknown what kind of variable speed playback is possible unless the actual prediction structure is known. Therefore, only two ways of 1) decoding all pictures and 2) decoding and displaying only a stream of I pictures can be realized irrespective of the prediction structure from the examples of FIGS. This means that there are too few speed options that can be achieved compared to the variable speed playback that can be achieved with MPEG-2.

そこで、本発明は上記の事情に鑑みてなされたものであり、可変速再生のために復号化が必要なピクチャを容易に特定することができ、可変速再生に適した符号化および復号化を行うことができる動画像符号化方法および動画像復号化方法等を提供することを目的とする。 The present invention has been made in view of the above circumstances, and can easily specify a picture that needs to be decoded for variable-speed playback, and can perform encoding and decoding suitable for variable-speed playback. It is an object of the present invention to provide a moving image encoding method, a moving image decoding method, and the like that can be performed.

上記目的を達成するために、本発明に係る動画像符号化方法は、動画像信号をピクチャ単位で符号化して符号化ストリームを生成する動画像符号化方法であって、可変速再生の対象とするピクチャを特定するための可変速再生用情報を作成する情報作成ステップと、前記可変速再生用情報を符号化して前記符号化ストリームに付加する符号化ステップとを含むことを特徴とする。 In order to achieve the above object, a moving picture coding method according to the present invention is a moving picture coding method that generates a coded stream by coding a moving picture signal in picture units, And a coding step of coding the variable-speed playback information to add the variable-speed playback information to the coded stream.

これによって、可変速再生時に可変速再生用情報に基づいて所望の可変速再生の対象とするピクチャを特定することができ、この特定したピクチャだけを復号化することで、不要なピクチャの復号化を省略して容易に可変速再生が可能になる。
ここで、前記情報作成ステップでは、複数のピクチャから構成されるランダムアクセスユニットであり、当該ランダムアクセスユニット内のピクチャだけを参照して当該ランダムアクセスユニット内の復号化対象ピクチャを復号化することが可能である前記ランダムアクセスユニット単位で、前記可変速再生用情報を作成してもよい。 This makes it possible to specify a desired picture to be subjected to variable-speed playback based on the information for variable-speed playback at the time of variable-speed playback, and to decode only the specified picture to decode unnecessary pictures. And the variable speed reproduction can be easily performed.
Here, in the information creating step, the decoding target picture in the random access unit is a random access unit composed of a plurality of pictures, with reference to only the picture in the random access unit. The variable speed reproduction information may be created for each possible random access unit.

これによって、ランダムアクセスユニット単位でピクチャの参照関係の構造が相違していても、可変速再生時に可変速再生用情報に基づいて所望の可変速再生の対象とするピクチャを特定することができる。
また、前記動画像符号化方法は、さらに、前記可変速再生の対象とするピクチャが参照する共通情報の符号化の必要性を検知する検知ステップと、前記検知ステップにより前記共通情報の符号化の必要性が検知されたピクチャに対して前記共通情報を付加する共通情報付加ステップとを含んでもよい。 As a result, even if the structure of the reference relation of pictures differs in random access unit units, it is possible to specify a desired picture to be subjected to variable speed reproduction based on variable speed reproduction information during variable speed reproduction.
Further, the moving picture coding method further includes a detecting step of detecting the necessity of coding of the common information referred to by the picture to be subjected to the variable speed reproduction, and a detecting step of coding the common information by the detecting step. A common information adding step of adding the common information to the picture whose necessity is detected.

これによって、例えばＮ倍速未満で再生されるピクチャに既に付加された共通情報であっても、Ｎ倍速で再生されるピクチャに必要とされる場合には、Ｎ倍速で再生されるピクチャにも共通情報を必ず付加することができるので、可変速再生時に、参照すべき共通情報がないという状況を回避することができる。
また、本発明に係る動画像復号化方法は、符号化ストリームをピクチャ単位で復号化する動画像復号化方法であって、可変速再生の対象とするピクチャを特定するための可変速再生用情報を抽出する情報抽出ステップと、前記可変速再生用情報を復号化するとともに、前記可変速再生用情報に基づいて可変速再生の対象とするピクチャを特定し復号化する復号化ステップとを含むことを特徴とする。 Thus, for example, even if the common information is already added to a picture played back at a speed lower than N times, if it is needed for a picture played back at N times speed, the common information is also applied to a picture played back at N times speed. Since information can always be added, it is possible to avoid a situation where there is no common information to be referred to at the time of variable speed reproduction.
Further, the moving picture decoding method according to the present invention is a moving picture decoding method for decoding an encoded stream in units of pictures, and includes information for variable speed reproduction for specifying a picture to be subjected to variable speed reproduction. And a decoding step of decoding the variable-speed playback information, and identifying and decoding a picture to be subjected to variable-speed playback based on the variable-speed playback information. It is characterized by.

これによって、可変速再生時に可変速再生用情報に基づいて所望の可変速再生の対象とするピクチャを特定することができ、この特定したピクチャだけを復号化することで、不要なピクチャの復号化を省略して容易に可変速再生が可能になる。
ここで、前記情報抽出ステップでは、複数のピクチャから構成されるランダムアクセスユニットであり、当該ランダムアクセスユニット内のピクチャだけを参照して当該ランダムアクセスユニット内の復号化対象ピクチャを復号化することが可能である前記ランダムアクセスユニット単位で、前記可変速再生用情報を抽出してもよい。 This makes it possible to specify a desired picture to be subjected to variable-speed playback based on the information for variable-speed playback at the time of variable-speed playback, and to decode only the specified picture to decode unnecessary pictures. And the variable speed reproduction can be easily performed.
Here, in the information extracting step, a decoding target picture in the random access unit may be decoded with reference to only a picture in the random access unit, which is a random access unit including a plurality of pictures. The variable speed reproduction information may be extracted for each possible random access unit.

これによって、ランダムアクセスユニット単位でピクチャの参照関係の構造が相違していても、可変速再生時に可変速再生用情報に基づいて所望の可変速再生の対象とするピクチャを特定することができる。
また、前記動画像復号化方法は、さらに、可変速再生時には、他のピクチャに参照されるピクチャについては、前記可変速再生の対象でないピクチャであっても、ピクチャメモリに保存しているものとして制御するメモリ制御ステップを含んでもよい。 As a result, even if the structure of the reference relation of pictures differs in random access unit units, it is possible to specify a desired picture to be subjected to variable speed reproduction based on variable speed reproduction information during variable speed reproduction.
In addition, the moving picture decoding method may further include storing, in variable-speed playback, a picture referred to by another picture in a picture memory, even if the picture is not a target of the variable-speed playback. A memory control step for controlling may be included.

これによって、相対的に参照するピクチャを指定されていても、可変速再生時に復号化対象ピクチャの参照ピクチャとして指定されたピクチャが符号化時の参照ピクチャと相違することを防止することができる。
なお、本発明は、このような動画像符号化方法および動画像復号化方法として実現することができるだけでなく、このような動画像符号化方法および動画像復号化方法が含む特徴的なステップを手段として備える動画像符号化装置および動画像復号化装置として実現することもできる。また、それらのステップをコンピュータに実行させるプログラムとして実現したり、前記動画像符号化方法により符号化した符号化ストリームとして実現したりすることもできる。そして、そのようなプログラムおよび符号化ストリームは、ＣＤ−ＲＯＭ等の記録媒体やインターネット等の伝送媒体を介して配信することができるのは言うまでもない。 This makes it possible to prevent a picture designated as a reference picture of a current picture to be decoded from being different from a reference picture at the time of encoding even when a picture to be relatively referenced is designated.
It should be noted that the present invention can be realized not only as such a moving picture coding method and a moving picture decoding method, but also includes characteristic steps included in such a moving picture coding method and a moving picture decoding method. The present invention can also be realized as a moving image encoding device and a moving image decoding device provided as means. Further, these steps may be realized as a program for causing a computer to execute the steps, or as an encoded stream encoded by the moving image encoding method. Needless to say, such a program and an encoded stream can be distributed via a recording medium such as a CD-ROM or a transmission medium such as the Internet.

以上の様に本発明に係る動画像符号化方法および動画像復号化方法によれば、ランダムアクセスユニットRAUの中で可変速再生のために復号化が必要なピクチャが容易に特定できるので、可変速再生に適した符号化および復号化を容易に実現することができ、その実用的価値が高い。 As described above, according to the moving picture coding method and the moving picture decoding method according to the present invention, pictures that need to be decoded for variable-speed playback in the random access unit RAU can be easily specified. Encoding and decoding suitable for variable speed reproduction can be easily realized, and its practical value is high.

JVTの予測構造が柔軟すぎるために可変速再生が困難になるのであるが、ランダムアクセスユニットRAUを復号化する前にどのような予測構造がそのランダムアクセスユニットRAUのピクチャで使用されているかを知ることができれば、図２３および図２４の各例で示したように２通り以上の速度の可変速再生が実現できる。
以下、本発明の実施の形態について、図１から図１８を用いて説明する。 Variable speed playback becomes difficult because the JVT prediction structure is too flexible, but before decoding the random access unit RAU, we know what prediction structure is used in the picture of the random access unit RAU If this is possible, variable speed reproduction of two or more speeds can be realized as shown in the examples of FIGS.
Hereinafter, embodiments of the present invention will be described with reference to FIGS.

なお、実施の形態の説明で用いるランダムアクセスユニットRAUは必ずしもＪＶＴの特別な単位である必要は無く、パラメータセットPSをランダムアクセスユニットRAU毎に配置することから単に画面内符号化（Ｉピクチャ）で始まるピクチャの集合であってもよい。 Note that the random access unit RAU used in the description of the embodiment does not necessarily need to be a special unit of the JVT, and the parameter set PS is arranged for each random access unit RAU. It may be a set of pictures that start.

（実施の形態１）
図１は本発明に係る動画像符号化方法を用いた動画像符号化装置の一実施の形態の構成を示すブロック図である。なお、図２１に示す従来の動画像符号化装置３の各部と同じ動作をする機器は同じ符号を付し、説明を省略する。 (Embodiment 1)
FIG. 1 is a block diagram showing the configuration of an embodiment of a moving picture coding apparatus using the moving picture coding method according to the present invention. Note that the same reference numerals are given to devices that perform the same operations as the respective units of the conventional video encoding device 3 shown in FIG. 21 and description thereof is omitted.

動画像符号化装置１は、入力される画像信号Vin を圧縮符号化して可変長符号化等のビットストリームに変換した符号化ストリームStr を出力する装置であり、動き検出部１０１、動き補償部１０２、減算部１０３、直交変換部１０４、量子化部１０５、逆量子化部１０６、逆直交変換部１０７、加算部１０８、ピクチャメモリ１０９、スイッチ１１０、予測構造決定部１１１、可変長符号化部１１２、マップ作成部１１３、検知部１１４、および共通情報付加部１１５を備えている。 The moving image encoding device 1 is a device that outputs an encoded stream Str obtained by compression-encoding an input image signal Vin and converting it into a bit stream such as variable-length encoding. The motion detecting unit 101 and the motion compensating unit 102 , Subtractor 103, orthogonal transformer 104, quantizer 105, inverse quantizer 106, inverse orthogonal transformer 107, adder 108, picture memory 109, switch 110, predictive structure determiner 111, variable length encoder 112 , A map creating unit 113, a detecting unit 114, and a common information adding unit 115.

マップ作成部１１３は、ピクチャタイプPtypeに対応して可変速再生に必要な情報である可変速再生用情報Map（例えば後述する可変速再生マップRAMや可変速再生マップテーブルRAMTBL、可変速再生マップ識別子RAMID）を作成し、可変長符号化部１１２に出力する。可変長符号化部１１２は、符号化ストリームStrに可変速再生用情報Mapを符号化して配置する。 The map creator 113 generates a variable-speed reproduction information Map (for example, a variable-speed reproduction map RAM, a variable-speed reproduction map table RAMTBL, and a variable-speed reproduction map identifier, which will be described later), which are information necessary for variable-speed reproduction corresponding to the picture type Ptype. RAMID) and outputs it to the variable-length encoding unit 112. The variable-length encoding unit 112 encodes and arranges the variable-speed reproduction information Map in the encoded stream Str.

検知部１１４は、符号化対象のピクチャがＮ倍速以上で再生（復号）されるピクチャである時、対象ピクチャが参照する共通情報であるピクチャパラメータセットPPSが、Ｎ倍速以上で再生されるピクチャで既に符号化されているか否かを判定することで、ピクチャパラメータセットPPSの符号化の必要性を検知する。共通情報付加部１１５は、検知部１１４により符号化の必要性が検知されたピクチャに対してピクチャパラメータセットPPSを付加する。 When the picture to be coded is a picture to be reproduced (decoded) at N times speed or higher, the detection unit 114 determines that the picture parameter set PPS, which is the common information referred to by the current picture, is a picture to be reproduced at N times speed or higher. The necessity of encoding the picture parameter set PPS is detected by determining whether or not the encoding is already performed. The common information adding unit 115 adds the picture parameter set PPS to the picture for which the necessity of encoding has been detected by the detecting unit 114.

図２は実施の形態１のストリームの構成図である。図２０の従来のストリームの構成図との違いは、ランダムアクセスユニットRAUに可変速再生マップRAMを配置したことである。
図２（ａ）はランダムアクセスユニットRAUの構造例を示す。この構造例では、可変速再生マップRAMをランダムアクセスユニットRAU内のピクチャの前に配置し、可変速再生マップRAMにどのピクチャのストリームを復号化すれば所望の可変速で再生できるかの情報を記載している。動画像復号化装置では可変速再生マップRAMに記載された情報に従って所望の可変速再生に必要なピクチャのみを復号化することで、不要なピクチャの復号化を省略して容易に可変速再生が可能になる。 FIG. 2 is a configuration diagram of a stream according to the first embodiment. The difference from the conventional stream configuration diagram in FIG. 20 is that a variable speed reproduction map RAM is arranged in the random access unit RAU.
FIG. 2A shows an example of the structure of the random access unit RAU. In this structure example, a variable speed reproduction map RAM is arranged before a picture in the random access unit RAU, and information on which picture stream can be decoded and reproduced at a desired variable speed is stored in the variable speed reproduction map RAM. It has been described. The moving picture decoding apparatus decodes only the pictures necessary for the desired variable-speed playback in accordance with the information described in the variable-speed playback map RAM, so that variable-speed playback can be easily performed without decoding unnecessary pictures. Will be possible.

図２（ｂ）は可変速再生マップRAMの例である。ランダムアクセスユニットRAU内のピクチャの識別子であるピクチャ番号PN毎に何倍速再生でそのピクチャの復号化が必要かを可変速再生マップRAMに記載している。すなわち、各ピクチャ番号PNの後位置にそのピクチャが何倍速再生で復号化が必要か示す情報（Speed）を記載している。このようにすることで、所望の再生速度でどのピクチャの復号化が必要か容易に知ることができる。なお、ピクチャ番号PNをランダムアクセスユニットRAUの先頭にまとめて配置し、その後に何倍速再生で復号化が必要か示す情報（Speed）をまとめて配置しても良い。 FIG. 2B shows an example of the variable speed reproduction map RAM. For each picture number PN that is an identifier of a picture in the random access unit RAU, the variable speed playback map RAM describes how many times the playback speed is required to decode the picture. That is, information (Speed) indicating how many times the picture is reproduced at a required speed and decoded is described at a position after each picture number PN. By doing so, it is possible to easily know which picture needs to be decoded at a desired reproduction speed. Note that the picture number PN may be arranged collectively at the head of the random access unit RAU, and thereafter information (Speed) indicating how many times speed reproduction is required for decoding may be arranged collectively.

図２（ｃ）は可変速再生マップRAMの他の例である。ランダムアクセスユニットRAU内の各ピクチャが何倍速の再生で必要かを順番に記載している。これにより、必ずしもピクチャ番号PNを可変速再生マップRAMに配置する必要が無く、ピクチャ番号PNを配置するために必要な領域を節約できる。
図２（ｄ）は可変速再生マップRAMの他の例である。最初に何倍速再生で復号化が必要か示す情報（Speed）を配置し、その後にその速度の再生のために必要なピクチャのピクチャ番号PNを記載している。 FIG. 2C shows another example of the variable speed reproduction map RAM. The number of times at which each picture in the random access unit RAU is required for reproduction is described in order. As a result, it is not always necessary to arrange the picture number PN in the variable speed reproduction map RAM, and it is possible to save an area necessary for arranging the picture number PN.
FIG. 2D shows another example of the variable speed reproduction map RAM. First, information (Speed) indicating how many times speed reproduction is required for decoding is arranged, and then a picture number PN of a picture required for reproduction at that speed is described.

何倍速再生で復号化が必要か示す情報（Speed）としては、例えばピクチャMがN倍速で必要と記載されている場合にSpeedはNとする。このとき、再生速度がK倍速の場合に、N＜KであればピクチャMは復号化が不要であるが、N＞＝KであればピクチャMの復号化が必要である。
図３はピクチャの参照関係の例を示す模式図である。ここで、斜線をつけたピクチャは他のピクチャから参照されるピクチャである。 As the information (Speed) indicating how many times the reproduction speed requires decoding, for example, the speed is set to N when the picture M is described as required at N times speed. At this time, when the reproduction speed is K times, if N <K, the picture M does not need to be decoded, but if N> = K, the picture M needs to be decoded.
FIG. 3 is a schematic diagram showing an example of a picture reference relationship. Here, the hatched pictures are pictures referred to by other pictures.

図３（ａ）に示す例では、３倍速の場合にピクチャI0、P3、P6を再生する。図３（ｂ）に示す例では、４倍速の場合にピクチャI0、P4を再生し、２倍速の場合にピクチャI0、B2、P4、B6を再生する。図３（ｃ）に示す例では、３倍速の場合にピクチャI0、P3、P6を再生し１．５倍速の場合にピクチャI0、P1、P3、P4、P6、P7を再生する。
従って、何倍速再生で復号化が必要か示す情報（Speed）は、図３（ａ）に示す例ではピクチャI0、P3、P6が「３」、他のピクチャは「１」となり、図３（ｂ）に示す例ではピクチャI0、P4が「４」、ピクチャB2、B6が「２」、他のピクチャは「１」となり、図３（ｃ）に示す例ではピクチャI0、P3、P6が「３」、ピクチャP1、P4、P7が「１．５」、他のピクチャは「１」となる。 In the example shown in FIG. 3A, the pictures I0, P3, and P6 are reproduced at the triple speed. In the example shown in FIG. 3B, the pictures I0 and P4 are reproduced at 4 × speed, and the pictures I0, B2, P4 and B6 are reproduced at 2 × speed. In the example shown in FIG. 3C, the pictures I0, P3, and P6 are reproduced at 3 × speed, and the pictures I0, P1, P3, P4, P6 and P7 are reproduced at 1.5 × speed.
Accordingly, in the example shown in FIG. 3A, the information (Speed) indicating how many times the reproduction speed is required for decoding is "3" for the pictures I0, P3, and P6, and "1" for the other pictures. In the example shown in b), the pictures I0 and P4 are "4", the pictures B2 and B6 are "2", and the other pictures are "1". In the example shown in FIG. 3C, the pictures I0, P3 and P6 are "4". 3 ", pictures P1, P4, and P7 are" 1.5 ", and other pictures are" 1 ".

次に、上記のように構成された動画像符号化装置１の動作について説明する。図４は可変速再生マップRAMを作成する際の動作を示すフローチャートである。
マップ作成部１１３は、符号化対象のピクチャがランダムアクセスポイント即ちランダムアクセスユニットRAUの最初のピクチャであるか否かを判定する（ステップＳ１０）。この判定の結果、ランダムアクセスポイントのピクチャであれば（ステップＳ１０でＹＥＳ）、マップ作成部１１３は、可変速再生マップRAMを作成し、可変長符号化部１１２に出力する（ステップＳ１１）。次に、可変長符号化部１１２は、可変速再生マップRAMを符号化する（ステップＳ１２）。さらに、可変長符号化部１１２は、対象ピクチャを符号化する（ステップＳ１３）。 Next, the operation of the moving picture coding apparatus 1 configured as described above will be described. FIG. 4 is a flowchart showing the operation when creating the variable speed reproduction map RAM.
The map creating unit 113 determines whether the picture to be encoded is a random access point, that is, the first picture of the random access unit RAU (step S10). If the result of this determination is that the picture is a random access point (YES in step S10), the map creation unit 113 creates a variable speed reproduction map RAM and outputs it to the variable length encoding unit 112 (step S11). Next, the variable length encoding unit 112 encodes the variable speed reproduction map RAM (step S12). Further, the variable-length encoding unit 112 encodes the current picture (Step S13).

一方、上記判定の結果、ランダムアクセスポイントのピクチャでなければ（ステップＳ１０でＮＯ）、可変長符号化部１１２は、対象ピクチャを符号化する（ステップＳ１３）。
次に、未符号化ピクチャがあるか否かを判定（ステップＳ１４）し、未符号化ピクチャがあれば上記動作（ステップＳ１０〜Ｓ１４）を繰り返し、未符号化ピクチャが無ければ処理を終了する。 On the other hand, if the result of the determination is that the picture is not a picture of a random access point (NO in step S10), variable-length coding section 112 codes the current picture (step S13).
Next, it is determined whether or not there is an uncoded picture (step S14). If there is an uncoded picture, the above operation (steps S10 to S14) is repeated, and if there is no uncoded picture, the process ends.

ところで、符号化ストリームStrの復号化時には、上記のようにピクチャパラメータセットPPSを参照して復号化を行っており、このピクチャパラメータセットPPSが変更された場合には、この新しいピクチャパラメータセットPPSは、参照するピクチャのデータ以前に符号化ストリームStrとして送られている。しかしながら、可変速再生を行うと、変更された新しいピクチャパラメータセットPPSを参照すべきピクチャを復号化する際に、このピクチャパラメータセットPPSがないという状況が発生する。例えば、図３（ｂ）に示す例では、ピクチャI0、P4、B2、B1、B3、P8、B6、B5、B7の順に符号化される。このとき、ピクチャB2においてピクチャパラメータセットPPSが変更され、変更された新しいピクチャパラメータセットPPSを参照するピクチャB2のデータ中にこの新しいピクチャパラメータセットPPSが付加されたとする。この場合、通常および２倍速の再生では問題はないが、例えば、４倍速再生ではピクチャI0、P4、P8を再生することになり、ピクチャB2は復号化されないので、変更された新しいピクチャパラメータセットPPSも復号化されない。よって、ピクチャP8が参照すべきピクチャパラメータセットPPSが復号化されていないため、ピクチャP8を復号化できないという問題が生じる。 By the way, when decoding the encoded stream Str, decoding is performed with reference to the picture parameter set PPS as described above, and when the picture parameter set PPS is changed, the new picture parameter set PPS is , Before the data of the picture to be referred to. However, when performing variable-speed playback, when decoding a picture that should refer to the changed new picture parameter set PPS, a situation occurs in which there is no picture parameter set PPS. For example, in the example shown in FIG. 3B, the pictures are coded in the order of I0, P4, B2, B1, B3, P8, B6, B5, B7. At this time, it is assumed that the picture parameter set PPS is changed in the picture B2, and the new picture parameter set PPS is added to the data of the picture B2 that refers to the changed new picture parameter set PPS. In this case, there is no problem in normal and double speed playback, but for example, in quadruple speed playback, pictures I0, P4, and P8 are played, and picture B2 is not decoded. Is not decrypted. Therefore, there is a problem that the picture P8 cannot be decoded because the picture parameter set PPS to be referred to by the picture P8 has not been decoded.

そこで、本実施の形態では、ピクチャパラメータセットPPSは、Ｎ倍速で再生されるピクチャに必要とされる場合には、Ｎ倍速未満で再生されるピクチャに既に付加されていても、Ｎ倍速で再生されるピクチャにも必ず付加する。すなわち、上記の例では、ピクチャB2において変更された新しいピクチャパラメータセットPPSがピクチャP8のデータ中にも付加されることになる。 Therefore, in the present embodiment, if the picture parameter set PPS is required for a picture reproduced at N times speed, the picture parameter set PPS is reproduced at N times speed even if it is already added to the picture reproduced at less than N times speed. Is always added to the picture to be copied. That is, in the above example, the new picture parameter set PPS changed in the picture B2 is also added to the data of the picture P8.

図５はピクチャパラメータセットPPSを付加する際の動作を示すフローチャートである。
検知部１１４は、符号化対象のピクチャがＮ倍速以上で再生（復号）されるか否かを判定する（ステップＳ２０）。この判定の結果、Ｎ倍速以上で再生されるピクチャであれば（ステップＳ２０でＹＥＳ）、対象ピクチャが参照するピクチャパラメータセットPPSは、Ｎ倍速以上で再生されるピクチャの符号化時に既に符号化されているか否かを判定する（ステップＳ２１）。ここで、Ｎ倍速以上で再生されるピクチャの符号化時にまだ符号化されていない場合（ステップＳ２１でＮＯ）、共通情報付加部１１５は、対象ピクチャに対してピクチャパラメータセットPPSを付加するために、可変長符号化部１１２に出力する（ステップＳ２２）。次に、可変長符号化部１１２は、ピクチャパラメータセットPPSを符号化する（ステップＳ２３）。さらに、可変長符号化部１１２は、対象ピクチャを符号化する（ステップＳ２４）。 FIG. 5 is a flowchart showing the operation when the picture parameter set PPS is added.
The detecting unit 114 determines whether or not the picture to be coded is reproduced (decoded) at N times speed or higher (step S20). If the result of this determination is that the picture is to be played back at N times speed or higher (YES in step S20), the picture parameter set PPS referred to by the target picture has already been coded at the time of coding the picture played back at N times speed or higher. It is determined whether or not it has been performed (step S21). Here, if the picture to be reproduced at the N-times speed or higher has not been encoded yet (NO in step S21), the common information adding unit 115 adds the picture parameter set PPS to the target picture. , To the variable length coding unit 112 (step S22). Next, the variable length coding unit 112 codes the picture parameter set PPS (Step S23). Further, the variable-length encoding unit 112 encodes the current picture (Step S24).

一方、上記判定の結果、Ｎ倍速以上で再生されるピクチャでない場合（ステップＳ２０でＮＯ）、および対象ピクチャが参照するピクチャパラメータセットPPSは、Ｎ倍速以上で再生されるピクチャの符号化時に既に符号化されている（ステップＳ２１でＹＥＳ）場合、可変長符号化部１１２は、対象ピクチャを符号化する（ステップＳ２４）。
次に、未符号化ピクチャがあるか否かを判定（ステップＳ２５）し、未符号化ピクチャがあれば上記動作（ステップＳ２０〜Ｓ２５）を繰り返し、未符号化ピクチャが無ければ処理を終了する。 On the other hand, as a result of the above determination, if the picture is not reproduced at the N-fold speed or higher (NO in step S20), and the picture parameter set PPS referred to by the target picture is already encoded at the time of encoding the picture reproduced at the N-fold speed or higher. If the current picture is encoded (YES in step S21), the variable-length encoding unit 112 encodes the current picture (step S24).
Next, it is determined whether or not there is an uncoded picture (step S25). If there is an uncoded picture, the above operation (steps S20 to S25) is repeated, and if there is no uncoded picture, the process ends.

図６は対象ピクチャを符号化する際の動作を示すフローチャートである。
検知部１１４は、符号化対象のピクチャがＮ倍速以上で再生（復号）されるか否かを判定する（ステップＳ３０）。この判定の結果、Ｎ倍速以上で再生されるピクチャであれば（ステップＳ３０でＹＥＳ）、Ｎ倍速以上で再生されるピクチャを参照して対象ピクチャの符号化を行う（ステップＳ３１）。一方、Ｎ倍速以上で再生されるピクチャでなければ（ステップＳ３０でＮＯ）、任意のピクチャを参照して対象ピクチャの符号化を行う（ステップＳ３２）。 FIG. 6 is a flowchart showing the operation when encoding the current picture.
The detecting unit 114 determines whether or not the picture to be coded is reproduced (decoded) at N-times speed or higher (step S30). If the result of this determination is that the picture is to be played back at N times speed or higher (YES in step S30), the current picture is coded with reference to the picture played back at N times speed or higher (step S31). On the other hand, if the picture is not a picture reproduced at the N-fold speed or higher (NO in step S30), the target picture is encoded with reference to an arbitrary picture (step S32).

次に、対象ピクチャが他のピクチャを符号化する際に参照されるか否かを判定する（ステップＳ３３）。この判定の結果、他のピクチャに参照されるピクチャであれば（ステップＳ３３でＹＥＳ）、対象ピクチャをピクチャメモリ１０９に保存する（ステップＳ３４）。一方、他のピクチャに参照されるピクチャでなければ（ステップＳ３３でＮＯ）、対象ピクチャのピクチャメモリ１０９への保存は行わない。 Next, it is determined whether or not the current picture is referred to when encoding another picture (step S33). If the result of this determination is that the picture is referenced by another picture (YES in step S33), the current picture is stored in the picture memory 109 (step S34). On the other hand, if the picture is not a picture referred to by another picture (NO in step S33), the current picture is not stored in the picture memory 109.

次に、未符号化ピクチャがあるか否かを判定（ステップＳ３５）し、未符号化ピクチャがあれば上記動作（ステップＳ３０〜Ｓ３５）を繰り返し、未符号化ピクチャが無ければ処理を終了する。
以上のように、可変速再生マップRAMをランダムアクセスユニットRAU内のピクチャの前に配置し、可変速再生マップRAMにどのピクチャのストリームを復号化すれば所望の可変速で再生できるかの情報を記載している。また、Ｎ倍速未満で再生されるピクチャに既に付加されたピクチャパラメータセットPPSであっても、Ｎ倍速で再生されるピクチャに必要とされる場合には、Ｎ倍速で再生されるピクチャにも必ず付加している。これよって、動画像復号化装置では可変速再生マップRAMに記載された情報に従って所望の可変速再生に必要なピクチャのみを復号化することで、不要なピクチャの復号化を省略して容易に可変速再生が可能になる。 Next, it is determined whether or not there is an uncoded picture (step S35). If there is an uncoded picture, the above operation (steps S30 to S35) is repeated, and if there is no uncoded picture, the process ends.
As described above, the variable-speed reproduction map RAM is arranged before the pictures in the random access unit RAU, and the variable-speed reproduction map RAM stores information on which picture stream can be decoded and reproduced at a desired variable speed. It has been described. Further, even if the picture parameter set PPS is already added to a picture played back at a speed lower than N times, if the picture played back at N times speed is required, the picture played back at N times speed is always used. Has been added. Thus, the moving picture decoding apparatus can easily decode unnecessary pictures by decoding only pictures necessary for the desired variable-speed playback in accordance with the information described in the variable-speed playback map RAM. Variable-speed reproduction becomes possible.

なお、本実施の形態において、可変速再生マップRAMを作成する際に、他のピクチャから参照されないピクチャは他のピクチャの復号化に影響を与えないため、他のピクチャに影響を与える参照されるピクチャにのみ対応する可変速再生マップRAMを作成してもよい。また可変速再生マップRAMに相当する機能のもの、例えば各ピクチャの情報を配置する代わりに他から参照されないピクチャ、Iピクチャ、Pピクチャ、Bピクチャ等のカテゴリごとの動作情報を配置してもよい。 In the present embodiment, when creating a variable-speed playback map RAM, a picture that is not referred to from other pictures does not affect decoding of other pictures, and thus is referred to affect other pictures. A variable speed reproduction map RAM corresponding only to a picture may be created. A function corresponding to the variable speed reproduction map RAM, for example, instead of arranging information of each picture, operation information for each category such as a picture, an I picture, a P picture, and a B picture which is not referred to by others may be arranged. .

更に、ストリームの全てがこのような可変速再生が容易なランダムアクセスユニットRAUで構成されていることを示す識別情報を付与してもよい。
また、本実施の形態では、例えばピクチャMのSpeed（何倍速再生で復号化が必要か示す情報）がNで、再生速度がK倍速の場合に、N＜KであればピクチャMは復号化が不要であるが、N＞＝KであればピクチャMの復号化が必要であるとして説明したが、N＜KであってもNとKの差が小さい場合には可変速再生の画質を向上するために、ピクチャMを復号化してもよい。 Further, identification information indicating that all of the streams are composed of such random access units RAU that can easily perform variable speed reproduction may be added.
Also, in the present embodiment, for example, if the speed of a picture M (information indicating how many times the speed is to be reproduced and decoding is necessary) is N and the reproduction speed is K times, if N <K, the picture M is decoded. Is not necessary, but it has been described that the decoding of the picture M is necessary if N> = K. However, even if N <K, if the difference between N and K is small, the image quality of the variable speed playback is reduced. For improvement, the picture M may be decoded.

また、上述した何倍速再生で復号化が必要か示す情報（Speed）は実際の再生速度を表す値ではなく、再生速度の度合いを示す値をつけてもよい。例えば、通常速度の再生でのみ必要なピクチャは「１」、その次に速い速度の再生で必要なピクチャは「２」、更にそれよりも速い速度の再生で必要なピクチャは「３」というようにすればよい。 In addition, the information (Speed) indicating how many times the reproduction speed is required for decoding may not be a value indicating the actual reproduction speed, but may be a value indicating the degree of the reproduction speed. For example, "1" is a picture required only for normal-speed playback, "2" is a picture required for next-fast playback, and "3" is a picture required for faster-speed playback. What should I do?

（実施の形態２）
実施の形態１ではランダムアクセスユニットRAUに可変速再生マップRAMを配置する例を示したが、各ランダムアクセスユニットRAUの可変速再生マップRAMが同じ内容であればランダムアクセスユニットRAUで可変速再生マップRAMを配置する必要は無い。 (Embodiment 2)
In the first embodiment, an example in which the variable speed reproduction map RAM is arranged in the random access unit RAU has been described. However, if the variable speed reproduction map RAM of each random access unit RAU has the same contents, the variable speed reproduction map RAM is There is no need to allocate RAM.

図７は実施の形態２のストリームの構成図である。
本実施の形態では、例えば図３に示すようなピクチャの参照関係に基づく複数の可変速再生マップRAMを含む可変速再生マップテーブルRAMTBLを図７（ｂ）に示すように作成する。そして、各ランダムアクセスユニットRAUには、図７（ａ）に示すように可変速再生マップテーブルRAMTBLのどの可変速再生マップRAMに対応するかを示す可変速再生マップ識別子RAMIDを配置する。 FIG. 7 is a configuration diagram of a stream according to the second embodiment.
In the present embodiment, for example, a variable speed reproduction map table RAMTBL including a plurality of variable speed reproduction map RAMs based on the picture reference relationship as shown in FIG. 3 is created as shown in FIG. 7B. Then, in each random access unit RAU, as shown in FIG. 7A, a variable speed reproduction map identifier RAMID indicating which variable speed reproduction map RAM of the variable speed reproduction map table RAMTBL corresponds is arranged.

可変速再生マップテーブルRAMTBLはストリームの先頭に配置しても良いし、また付加情報として別のストリームで符号化したり、予め所定の値を決めておいて機器に備えておいても良い。
図８は可変速再生マップ識別子RAMIDを作成する際の動作を示すフローチャートである。ここでは、実施の形態１における可変速再生マップRAMの作成および符号化（図４、ステップＳ１１〜Ｓ１２）を行う代わりに、可変速再生マップ識別子RAMIDの作成および符号化を行っている（ステップＳ４１〜Ｓ４２）。 The variable speed reproduction map table RAMTBL may be arranged at the head of the stream, may be encoded as additional information with another stream, or may be provided in a device with a predetermined value determined in advance.
FIG. 8 is a flowchart showing the operation when creating the variable speed reproduction map identifier RAMID. Here, instead of creating and encoding the variable speed reproduction map RAM in the first embodiment (FIG. 4, steps S11 to S12), the variable speed reproduction map identifier RAMID is created and encoded (step S41). To S42).

以上のようにすることで、実施の形態１と同様のことを可変速再生マップ識別子RAMIDで実現できる。
なお、可変速再生マップテーブルRAMTBLに含む可変速再生マップRAMを作成する際に、他のピクチャから参照されないピクチャは他のピクチャの復号化に影響を与えないため、他のピクチャに影響を与える参照されるピクチャにのみ対応する可変速再生マップRAMを作成してもよい。また可変速再生マップRAMに相当する機能のもの、例えば各ピクチャの情報を配置する代わりに他から参照されないピクチャ、Iピクチャ、Pピクチャ、Bピクチャ等のカテゴリごとの動作情報を配置してもよい。 By doing as described above, the same thing as the first embodiment can be realized by the variable speed reproduction map identifier RAMID.
When creating a variable speed playback map RAM included in the variable speed playback map table RAMTBL, a picture that is not referred to from other pictures does not affect decoding of other pictures, and therefore, a reference that affects other pictures. A variable speed reproduction map RAM corresponding to only the picture to be reproduced may be created. A function corresponding to the variable speed reproduction map RAM, for example, instead of arranging information of each picture, operation information for each category such as a picture, an I picture, a P picture, and a B picture which is not referred to by others may be arranged. .

更に、ストリームの全てがこのような可変速再生が容易なランダムアクセスユニットRAUで構成されていることを示す識別情報を付与してもよい。
また、実施の形態１、２では、可変速再生マップRAMおよび可変速再生マップ識別子RAMIDをランダムアクセスユニットRAU単位に付与しているが、これに限られるものではない。例えば、ストリーム全体で構造が同じであれば、可変速再生マップRAMおよび可変速再生マップ識別子RAMIDをストリーム単位に付与しても構わない。 Further, identification information indicating that all of the streams are composed of such random access units RAU that can easily perform variable speed reproduction may be added.
Further, in the first and second embodiments, the variable-speed reproduction map RAM and the variable-speed reproduction map identifier RAMID are assigned to each random access unit RAU. However, the present invention is not limited to this. For example, if the entire stream has the same structure, the variable-speed reproduction map RAM and the variable-speed reproduction map identifier RAMID may be assigned to each stream.

また、実施の形態１及び２では少なくとも１つの可変速再生マップRAMを符号化する場合のみを述べたが、運用によって可変速再生マップRAMを固定にする場合も考えられ、その場合は、可変速再生マップRAMの符号化が不要になる。このような運用を可能にするためには、可変速再生の対象とするピクチャが参照する共通情報の符号化の必要性を予め検知し、共通情報の符号化の必要性が検知されたピクチャに対して共通情報を付加するようにすればよい。 Further, in the first and second embodiments, only the case where at least one variable speed reproduction map RAM is encoded has been described. However, a case where the variable speed reproduction map RAM is fixed by operation may be considered. The encoding of the reproduction map RAM becomes unnecessary. In order to enable such an operation, it is necessary to detect in advance the necessity of encoding the common information referred to by the picture to be subjected to variable-speed playback, and to determine the necessity of encoding the common information in the detected picture. In this case, common information may be added.

（実施の形態３）
図９は本発明に係る動画像復号化方法を用いた動画像復号化装置の構成を示すブロック図である。なお、図２２に示す従来の動画像復号化装置４の各部と同じ動作をする機器は同じ符号を付し、説明を省略する。
動画像復号化装置２は、上記のように動画像符号化装置１により符号化された符号化ストリームStr を復号化する装置であり、ストリーム抽出部２０１、可変長復号化部２０２、抽出ピクチャ選択部２０３、動き補償部２０４、逆量子化部２０５、逆直交変換部２０６、加算部２０７、ピクチャメモリ２０８、およびメモリ制御部２０９を備えている。 (Embodiment 3)
FIG. 9 is a block diagram showing the configuration of a video decoding device using the video decoding method according to the present invention. Note that the same reference numerals are given to devices that perform the same operations as the respective units of the conventional video decoding device 4 shown in FIG. 22, and description thereof is omitted.
The video decoding device 2 is a device that decodes the encoded stream Str encoded by the video encoding device 1 as described above, and includes a stream extraction unit 201, a variable-length decoding unit 202, and an extracted picture selection. A section 203, a motion compensation section 204, an inverse quantization section 205, an inverse orthogonal transform section 206, an addition section 207, a picture memory 208, and a memory control section 209.

抽出ピクチャ選択部２０３は、外部から入力される再生速度情報PlaySpeedで指示された再生速度で再生するために復号化が必要なピクチャを、可変長復号化部２０２で復号化された可変速再生用情報Mapに基づいて決定し、ストリーム抽出部２０１に通知する。ストリーム抽出部２０１は、抽出ピクチャ選択部２０３で復号化が必要と判断されたピクチャに対応するストリームのみを抽出して可変長復号化部２０２に伝送する。メモリ制御部２０９は、可変速再生時には他のピクチャに参照されるピクチャについては復号化していなくても、ピクチャメモリ２０８に保存しているものとして制御を行う。 The extracted picture selection unit 203 converts the picture that needs to be decoded to be played back at the playback speed indicated by the playback speed information PlaySpeed input from the outside for the variable speed playback decoded by the variable length decoding unit 202. It is determined based on the information Map and notified to the stream extraction unit 201. The stream extracting unit 201 extracts only a stream corresponding to a picture determined to be required for decoding by the extracted picture selecting unit 203, and transmits the extracted stream to the variable length decoding unit 202. The memory control unit 209 performs control assuming that the picture referred to by another picture is stored in the picture memory 208 at the time of variable-speed playback, even if the picture is not decoded.

次に、上記のように構成された動画像復号化装置２の動作について説明する。図１０は動画像復号化装置２での動作を示すフローチャートであり、（ａ）は実施の形態１に示すストリームの構成に対応し、（ｂ）は実施の形態２に示すストリームの構成に対応する。
実施の形態１に示すストリームの構成で符号化された符号化ストリームStr を復号化する場合、ストリーム抽出部２０１は、復号化対象のピクチャがランダムアクセスポイント即ちランダムアクセスユニットRAUの最初のピクチャであるか否かを判定する（ステップＳ５０）。この判定の結果、ランダムアクセスポイントのピクチャであれば（ステップＳ５０でＹＥＳ）、ストリーム抽出部２０１は、可変速再生マップRAMを抽出し、可変長復号化部２０２に出力する（ステップＳ５１）。そして、可変長復号化部２０２は、可変速再生マップRAMを復号化し、抽出ピクチャ選択部２０３へ出力する（ステップＳ５２）。 Next, the operation of the video decoding device 2 configured as described above will be described. FIGS. 10A and 10B are flowcharts showing the operation of the video decoding device 2, where FIG. 10A corresponds to the stream configuration shown in the first embodiment, and FIG. 10B corresponds to the stream configuration shown in the second embodiment. I do.
When decoding the encoded stream Str encoded with the stream configuration shown in Embodiment 1, the stream extraction unit 201 determines that the picture to be decoded is a random access point, that is, the first picture of the random access unit RAU. It is determined whether or not (step S50). If the result of this determination is that the picture is at a random access point (YES in step S50), the stream extraction unit 201 extracts the variable speed reproduction map RAM and outputs it to the variable length decoding unit 202 (step S51). Then, the variable-length decoding unit 202 decodes the variable-speed reproduction map RAM and outputs it to the extracted picture selection unit 203 (Step S52).

次に、抽出ピクチャ選択部２０３は、外部から入力される再生速度情報PlaySpeedで指示された再生速度で再生するために復号化が必要なピクチャを、可変長復号化部２０２で復号化された可変速再生マップRAMに基づいて決定し、ストリーム抽出部２０１に通知する（ステップＳ５３）。ストリーム抽出部２０１は、対象ピクチャが抽出ピクチャ選択部２０３で復号化が不要と決定されたピクチャであるか否かを判定する（ステップＳ５４）。ここで、対象ピクチャが復号化不要と決定されたピクチャでなければ（ステップＳ５４でＮＯ）、対象ピクチャに対応するストリームのみを抽出して可変長復号化部２０２に出力する。可変長復号化部２０２は、入力されたピクチャに対応するストリームを復号化する（ステップＳ５５）。 Next, the extracted picture selection unit 203 determines whether the variable-length decoding unit 202 has decoded the picture that needs to be decoded in order to play at the playback speed indicated by the externally input playback speed information PlaySpeed. It is determined based on the variable speed reproduction map RAM, and is notified to the stream extraction unit 201 (step S53). The stream extracting unit 201 determines whether or not the current picture is a picture for which decoding has been determined to be unnecessary by the extracted picture selecting unit 203 (step S54). Here, if the current picture is not a picture determined not to need to be decoded (NO in step S54), only the stream corresponding to the current picture is extracted and output to the variable length decoding unit 202. The variable length decoding unit 202 decodes the stream corresponding to the input picture (Step S55).

次に、未復号化ピクチャがあるか否かを判定（ステップＳ５６）し、未復号化ピクチャがあれば上記動作（ステップＳ５０〜Ｓ５６）を繰り返し、未復号化ピクチャが無ければ処理を終了する。
また、実施の形態２に示すストリームの構成で符号化された符号化ストリームStr を復号化する場合には、図１０（ｂ）に示すように復号化対象のピクチャがランダムアクセスポイントのピクチャであれば（ステップＳ５０でＹＥＳ）、ストリーム抽出部２０１は、可変速再生マップ識別子RAMIDを抽出し、可変長復号化部２０２に出力する（ステップＳ６１）。そして、可変長復号化部２０２は、可変速再生マップ識別子RAMIDを復号化し、抽出ピクチャ選択部２０３へ出力する（ステップＳ６２）。 Next, it is determined whether or not there is an undecoded picture (step S56). If there is an undecoded picture, the above operation (steps S50 to S56) is repeated, and if there is no undecoded picture, the process ends.
When decoding the coded stream Str encoded by the stream configuration shown in the second embodiment, as shown in FIG. 10B, the decoding target picture is a picture of a random access point. If it is (YES in step S50), the stream extraction unit 201 extracts the variable speed reproduction map identifier RAMID and outputs it to the variable length decoding unit 202 (step S61). Then, the variable-length decoding unit 202 decodes the variable-speed reproduction map identifier RAMID and outputs it to the extracted picture selection unit 203 (Step S62).

次に、抽出ピクチャ選択部２０３は、外部から入力される再生速度情報PlaySpeedで指示された再生速度で再生するために復号化が必要なピクチャを、可変長復号化部２０２で復号化された可変速再生マップ識別子RAMIDおよび可変速再生マップテーブルRAMTBLに基づいて決定し、ストリーム抽出部２０１に通知する（ステップＳ６３）。以降の動作は、実施の形態１に示すストリームの構成で符号化された符号化ストリームStr を復号化する場合と同様である。なお、ここでは可変速再生マップテーブルRAMTBLはあらかじめ復号化され、抽出ピクチャ選択部２０３が有しているものとしている。 Next, the extracted picture selection unit 203 determines whether the variable-length decoding unit 202 has decoded the picture that needs to be decoded in order to play at the playback speed indicated by the externally input playback speed information PlaySpeed. It is determined based on the variable speed reproduction map identifier RAMID and the variable speed reproduction map table RAMTBL, and is notified to the stream extraction unit 201 (step S63). Subsequent operations are the same as the case of decoding the coded stream Str coded with the stream configuration shown in the first embodiment. Here, it is assumed that the variable speed reproduction map table RAMTBL is decoded in advance and is included in the extracted picture selection unit 203.

ところで、対象ピクチャを復号化する際には、相対的な参照インデックスIndexを用いて参照するピクチャを指定しているため、可変速再生を行う場合に参照ピクチャとして指定されたピクチャが符号化時の参照ピクチャと相違することがある。例えば、図３（ｂ）に示すストリーム例では、すべてのピクチャを通常速度で再生する場合、ピクチャP8の復号化時には図１１（ａ）に示すように参照されるすべてのピクチャI0、P4、B2がピクチャメモリ２０８に保存されている。これに対して、４倍速再生する場合には、図１１（ｂ）に示すように４倍速再生で再生（復号化）され、かつ参照されるピクチャI0、P4がピクチャメモリ２０８に保存されている。このため、ピクチャP8が参照するピクチャP4を指定する参照インデックスIndex（ピクチャメモリ２０８中の２つ前のピクチャを指定）を、４倍速再生する場合にも用いると、図１１（ｂ）に示すようにピクチャI0を指定することになり不都合が生じることになる。 By the way, when decoding the current picture, since the picture to be referred to is specified using the relative reference index Index, the picture specified as the reference picture when performing variable-speed playback is It may be different from the reference picture. For example, in the stream example shown in FIG. 3B, when all pictures are reproduced at the normal speed, all pictures I0, P4, B2 referred to as shown in FIG. Are stored in the picture memory 208. On the other hand, in the case of quadruple speed reproduction, pictures I0 and P4 which are reproduced (decoded) at quadruple speed reproduction and are referred to are stored in the picture memory 208 as shown in FIG. . For this reason, if the reference index Index (designating the two-preceding picture in the picture memory 208) that designates the picture P4 referenced by the picture P8 is also used in the case of quadruple-speed playback, as shown in FIG. , The picture I0 is specified, and a problem occurs.

そこで、本実施の形態では、可変速再生を行う場合に再生されないピクチャについても、他のピクチャに参照されるピクチャは必ずピクチャメモリ２０８に保存されているものとして扱う。すなわち、上記の例では、図１１（ｃ）に示すようにピクチャP4の次にはデータが保存されているものとして制御する。
図１２は対象ピクチャを復号化する際の動作を示すフローチャートである。なお、このフローチャートで示す動作は、図１０に示すフローチャートにおける復号化が不要であるか否かの判定処理、対象ピクチャの復号化処理、および未復号化ピクチャがあるか否かを判定処理（ステップＳ５３〜Ｓ５６）の部分に相当する。 Therefore, in the present embodiment, even for a picture that is not reproduced when performing variable-speed reproduction, a picture referred to by another picture is always treated as being stored in the picture memory 208. That is, in the above example, as shown in FIG. 11C, control is performed assuming that data is stored after the picture P4.
FIG. 12 is a flowchart showing the operation when decoding the current picture. Note that the operation shown in this flowchart is a process of determining whether or not decoding is unnecessary in the flowchart of FIG. 10, a process of decoding the current picture, and a process of determining whether or not there is an undecoded picture (step S53 to S56).

ストリーム抽出部２０１は、対象ピクチャがＮ倍速で復号化が必要と決定されたピクチャであるか否かを判定する（ステップＳ９０）。この判定の結果、対象ピクチャが復号化必要と決定されたピクチャであれば（ステップＳ９０でＹＥＳ）、可変長復号化部２０２は、Ｎ倍速以上のピクチャを参照して対象ピクチャを復号化する（ステップＳ９１）。次に、メモリ制御部２０９は、Ｎ倍速で復号化されていないが他のピクチャに参照されるピクチャが存在するか否かを判定する（ステップＳ９２）。 The stream extracting unit 201 determines whether or not the current picture is a picture determined to be required to be decoded at N × speed (Step S90). As a result of this determination, if the current picture is a picture that is determined to need to be decoded (YES in step S90), the variable-length decoding unit 202 decodes the current picture with reference to a picture at N-times speed or higher ( Step S91). Next, the memory control unit 209 determines whether or not there is a picture that has not been decoded at N times speed but is referred to by another picture (step S92).

この判定の結果、このようなピクチャが存在すれば（ステップＳ９２でＹＥＳ）、メモリ制御部２０９は、このＮ倍速で復号化されていないが他のピクチャに参照されるピクチャをピクチャメモリ２０８に保存しているものとする（ステップＳ９３）。次に、メモリ制御部２０９は、復号化した対象ピクチャをピクチャメモリ２０８に保存する（ステップＳ９４）。 As a result of this determination, if such a picture exists (YES in step S92), the memory control unit 209 stores the picture that has not been decoded at N times speed but is referred to by another picture in the picture memory 208. (Step S93). Next, the memory control unit 209 stores the decoded target picture in the picture memory 208 (Step S94).

一方、上記のようなピクチャが存在しなければ（ステップＳ９２でＮＯ）、メモリ制御部２０９は、単に復号化した対象ピクチャをピクチャメモリ２０８に保存する（ステップＳ９４）。
次に、未復号化ピクチャがあるか否かを判定（ステップＳ９５）し、未復号化ピクチャがあれば上記動作（ステップＳ９０〜Ｓ９５）を繰り返し、未復号化ピクチャが無ければ処理を終了する。また、対象ピクチャがＮ倍速で復号化が必要ない場合（ステップＳ９０でＮＯ）も、同様に未復号化ピクチャがあるか否かを判定（ステップＳ９５）し、未復号化ピクチャがあれば上記動作（ステップＳ９０〜Ｓ９５）を繰り返し、未復号化ピクチャが無ければ処理を終了する。 On the other hand, if no such picture exists (NO in step S92), the memory control unit 209 simply stores the decoded target picture in the picture memory 208 (step S94).
Next, it is determined whether or not there is an undecoded picture (step S95). If there is an undecoded picture, the above operation (steps S90 to S95) is repeated, and if there is no undecoded picture, the process ends. Also, if the current picture is N times faster and does not need to be decoded (NO in step S90), it is similarly determined whether there is an undecoded picture (step S95). (Steps S90 to S95) are repeated, and if there is no undecoded picture, the process ends.

以上のように、実施の形態１および実施の形態２に示すように符号化された符号化ストリームStr を、可変速再生用情報Mapに基づいて所望の可変速再生に必要なピクチャのみを復号化することで、不要なピクチャの復号化を省略して容易に可変速再生を行うことができる。 As described above, the coded stream Str encoded as shown in the first and second embodiments is decoded from only the pictures necessary for the desired variable speed reproduction based on the variable speed reproduction information Map. By doing so, variable-speed playback can be easily performed without omitting unnecessary decoding of pictures.

（実施の形態４）
本実施の形態では、実施の形態３に示す動画像復号化装置２の動作が一部相違する場合について説明する。 (Embodiment 4)
In the present embodiment, a case will be described in which the operation of moving image decoding apparatus 2 shown in Embodiment 3 is partially different.

図１３は動画像復号化装置２での動作を示すフローチャートであり、（ａ）は実施の形態１に示すストリームの構成に対応し、（ｂ）は実施の形態２に示すストリームの構成に対応する。図１３において、図１０に示すフローチャートの各処理と同じ処理は同じステップ番号を付し、説明を省略する。
実施の形態３では、図１０（ａ）に示すようにストリーム抽出部２０１は、復号化対象のピクチャがランダムアクセスユニットRAUの最初のピクチャであるか否かを判定（ステップＳ５０）しているが、本実施の形態では、図１３（ａ）に示すように単純に可変速再生マップRAMが配置されているか否かを判定（ステップＳ７０）している。この結果、可変速再生マップRAMが配置されていれば（ステップＳ７０でＹＥＳ）、ストリーム抽出部２０１は、可変速再生マップRAMを抽出し、可変長復号化部２０２に出力する（ステップＳ５１）。 FIGS. 13A and 13B are flowcharts showing the operation of the video decoding device 2. FIG. 13A corresponds to the stream configuration shown in the first embodiment, and FIG. 13B corresponds to the stream configuration shown in the second embodiment. I do. 13, the same processes as those in the flowchart shown in FIG. 10 are denoted by the same step numbers, and description thereof will be omitted.
In the third embodiment, as shown in FIG. 10A, the stream extraction unit 201 determines whether the picture to be decoded is the first picture of the random access unit RAU (step S50). In the present embodiment, it is simply determined whether or not the variable speed reproduction map RAM is arranged as shown in FIG. 13A (step S70). As a result, if the variable speed reproduction map RAM is arranged (YES in step S70), the stream extraction unit 201 extracts the variable speed reproduction map RAM and outputs it to the variable length decoding unit 202 (step S51).

同様に、図１０（ｂ）では、ストリーム抽出部２０１は、復号化対象のピクチャがランダムアクセスユニットRAUの最初のピクチャであるか否かを判定（ステップＳ５０）しているが、本実施の形態では、図１３（ｂ）に示すように単純に可変速再生マップ識別子RAMIDが配置されているか否かを判定（ステップＳ８０）している。この結果、可変速再生マップ識別子RAMIDが配置されていれば（ステップＳ８０でＹＥＳ）、ストリーム抽出部２０１は、可変速再生マップ識別子RAMIDを抽出し、可変長復号化部２０２に出力する（ステップＳ６１）。 Similarly, in FIG. 10B, the stream extraction unit 201 determines whether the picture to be decoded is the first picture of the random access unit RAU (step S50). Then, as shown in FIG. 13B, it is determined whether or not the variable speed reproduction map identifier RAMID is simply arranged (step S80). As a result, if the variable speed reproduction map identifier RAMID is arranged (YES in step S80), the stream extraction unit 201 extracts the variable speed reproduction map identifier RAMID and outputs it to the variable length decoding unit 202 (step S61). ).

以上のように、可変速再生マップRAMまたは可変速再生マップ識別子RAMIDが配置されているか否かを判定することでも、実施の形態１および実施の形態２に示すように符号化された符号化ストリームStr を、可変速再生用情報Mapに基づいて所望の可変速再生に必要なピクチャのみを復号化することで、不要なピクチャの復号化を省略して容易に可変速再生を行うことができる。 As described above, whether or not the variable-speed reproduction map RAM or the variable-speed reproduction map identifier RAMID is arranged can also determine whether the coded stream encoded as described in the first and second embodiments is used. By decoding only the pictures necessary for the desired variable-speed playback based on the variable-speed playback information Map, the variable-speed playback can be easily performed without decoding unnecessary pictures.

（実施の形態５）
さらに、上記各実施の形態で示した動画像符号化方法および動画像復号化方法を実現するためのプログラムを、フレキシブルディスク等の記録媒体に記録するようにすることにより、上記各実施の形態で示した処理を、独立したコンピュータシステムにおいて簡単に実施することが可能となる。 (Embodiment 5)
Furthermore, by recording a program for realizing the moving picture encoding method and the moving picture decoding method described in each of the above-described embodiments on a recording medium such as a flexible disk, the above-described embodiments can be used. The illustrated processing can be easily performed in an independent computer system.

図１４は、上記各実施の形態の動画像符号化方法および動画像復号化方法を、フレキシブルディスク等の記録媒体に記録されたプログラムを用いて、コンピュータシステムにより実施する場合の説明図である。
図１４(b) は、フレキシブルディスクの正面からみた外観、断面構造、及びフレキシブルディスクを示し、図１４(a) は、記録媒体本体であるフレキシブルディスクの物理フォーマットの例を示している。フレキシブルディスクＦＤはケースＦ内に内蔵され、該ディスクの表面には、同心円状に外周からは内周に向かって複数のトラックＴｒが形成され、各トラックは角度方向に１６のセクタＳｅに分割されている。従って、上記プログラムを格納したフレキシブルディスクでは、上記フレキシブルディスクＦＤ上に割り当てられた領域に、上記プログラムが記録されている。 FIG. 14 is an explanatory diagram of a case where the moving picture encoding method and the moving picture decoding method of each of the above embodiments are implemented by a computer system using a program recorded on a recording medium such as a flexible disk.
FIG. 14B shows the appearance, cross-sectional structure, and flexible disk of the flexible disk viewed from the front, and FIG. 14A shows an example of the physical format of the flexible disk which is a recording medium body. The flexible disk FD is built in the case F, and a plurality of tracks Tr are formed concentrically from the outer circumference toward the inner circumference on the surface of the disk, and each track is divided into 16 sectors Se in an angular direction. ing. Therefore, in the flexible disk storing the program, the program is recorded in an area allocated on the flexible disk FD.

また、図１４(c) は、フレキシブルディスクＦＤに上記プログラムの記録再生を行うための構成を示す。動画像符号化方法および動画像復号化方法を実現する上記プログラムをフレキシブルディスクＦＤに記録する場合は、コンピュータシステムＣｓから上記プログラムをフレキシブルディスクドライブを介して書き込む。また、フレキシブルディスク内の動画像符号化方法および動画像復号化方法を実現するプログラムにより上記動画像符号化方法および動画像復号化方法をコンピュータシステム中に構築する場合は、フレキシブルディスクドライブによりプログラムをフレキシブルディスクから読み出し、コンピュータシステムに転送する。 FIG. 14C shows a configuration for recording and reproducing the program on the flexible disk FD. When the above-mentioned program for realizing the moving picture encoding method and the moving picture decoding method is recorded on the flexible disk FD, the program is written from the computer system Cs via the flexible disk drive. When the moving picture coding method and the moving picture decoding method are implemented in a computer system by a program for realizing the moving picture coding method and the moving picture decoding method in a flexible disk, the program is executed by a flexible disk drive. Read from flexible disk and transfer to computer system.

なお、上記説明では、記録媒体としてフレキシブルディスクを用いて説明を行ったが、光ディスクを用いても同様に行うことができる。また、記録媒体はこれに限らず、ＩＣカード、ＲＯＭカセット等、プログラムを記録できるものであれば同様に実施することができる。 In the above description, the description has been made using a flexible disk as a recording medium. However, the same description can be made using an optical disk. Further, the recording medium is not limited to this, and the present invention can be similarly implemented as long as the program can be recorded, such as an IC card or a ROM cassette.

（実施の形態６）
さらにここで、上記実施の形態で示した動画像符号化方法や動画像復号化方法の応用例とそれを用いたシステムを説明する。 (Embodiment 6)
Further, here, application examples of the moving picture coding method and the moving picture decoding method described in the above embodiment and a system using the same will be described.

図１５は、コンテンツ配信サービスを実現するコンテンツ供給システムex１００の全体構成を示すブロック図である。通信サービスの提供エリアを所望の大きさに分割し、各セル内にそれぞれ固定無線局である基地局ex１０７〜ex１１０が設置されている。
このコンテンツ供給システムex１００は、例えば、インターネットex１０１にインターネットサービスプロバイダex１０２および電話網ex１０４、および基地局ex１０７〜ex１１０を介して、コンピュータex１１１、ＰＤＡ（personal digital assistant）ex１１２、カメラex１１３、携帯電話ex１１４、カメラ付きの携帯電話ｅｘ１１５などの各機器が接続される。 FIG. 15 is a block diagram illustrating an overall configuration of a content supply system ex100 that realizes a content distribution service. A communication service providing area is divided into desired sizes, and base stations ex107 to ex110, which are fixed wireless stations, are installed in each cell.
The content supply system ex100 includes, for example, a computer ex111, a PDA (personal digital assistant) ex112, a camera ex113, a mobile phone ex114, and a camera on the Internet ex101 via the Internet service provider ex102 and the telephone network ex104, and the base stations ex107 to ex110. Each device such as a mobile phone ex115 with a tag is connected.

しかし、コンテンツ供給システムex１００は図１５のような組合せに限定されず、いずれかを組み合わせて接続するようにしてもよい。また、固定無線局である基地局ex１０７〜ex１１０を介さずに、各機器が電話網ex１０４に直接接続されてもよい。
カメラex１１３はデジタルビデオカメラ等の動画撮影が可能な機器である。また、携帯電話は、ＰＤＣ（Personal Digital Communications）方式、ＣＤＭＡ（Code Division Multiple Access）方式、Ｗ−ＣＤＭＡ（Wideband-Code Division Multiple Access）方式、若しくはＧＳＭ（Global System for Mobile Communications）方式の携帯電話機、またはＰＨＳ（Personal Handyphone System）等であり、いずれでも構わない。 However, the content supply system ex100 is not limited to the combination as shown in FIG. 15, and may be connected in any combination. Further, each device may be directly connected to the telephone network ex104 without going through the base stations ex107 to ex110 which are fixed wireless stations.
The camera ex113 is a device capable of shooting moving images, such as a digital video camera. In addition, a mobile phone is a PDC (Personal Digital Communications) system, a CDMA (Code Division Multiple Access) system, a W-CDMA (Wideband-Code Division Multiple Access) system, or a GSM (Global System for Mobile Communications) system. Or PHS (Personal Handyphone System) or the like, and either may be used.

また、ストリーミングサーバex１０３は、カメラex１１３から基地局ex１０９、電話網ex１０４を通じて接続されており、カメラex１１３を用いてユーザが送信する符号化処理されたデータに基づいたライブ配信等が可能になる。撮影したデータの符号化処理はカメラex１１３で行っても、データの送信処理をするサーバ等で行ってもよい。また、カメラex１１６で撮影した動画データはコンピュータex１１１を介してストリーミングサーバex１０３に送信されてもよい。カメラex１１６はデジタルカメラ等の静止画、動画が撮影可能な機器である。この場合、動画データの符号化はカメラex１１６で行ってもコンピュータex１１１で行ってもどちらでもよい。また、符号化処理はコンピュータex１１１やカメラex１１６が有するＬＳＩex１１７において処理することになる。なお、画像符号化・復号化用のソフトウェアをコンピュータex１１１等で読み取り可能な記録媒体である何らかの蓄積メディア（ＣＤ−ＲＯＭ、フレキシブルディスク、ハードディスクなど）に組み込んでもよい。さらに、カメラ付きの携帯電話ex１１５で動画データを送信してもよい。このときの動画データは携帯電話ex１１５が有するＬＳＩで符号化処理されたデータである。 The streaming server ex103 is connected from the camera ex113 to the base station ex109 and the telephone network ex104, and enables live distribution and the like based on the encoded data transmitted by the user using the camera ex113. The encoding process of the captured data may be performed by the camera ex113, or may be performed by a server or the like that performs the data transmission process. Also, moving image data captured by the camera ex116 may be transmitted to the streaming server ex103 via the computer ex111. The camera ex116 is a device such as a digital camera capable of shooting still images and moving images. In this case, encoding of the moving image data may be performed by the camera ex116 or the computer ex111. The encoding process is performed by the LSI ex117 included in the computer ex111 and the camera ex116. The image encoding / decoding software may be incorporated in any storage medium (CD-ROM, flexible disk, hard disk, or the like) that is a recording medium readable by the computer ex111 or the like. Further, the moving image data may be transmitted by a mobile phone with camera ex115. The moving image data at this time is data that has been encoded by the LSI included in the mobile phone ex115.

このコンテンツ供給システムex１００では、ユーザがカメラex１１３、カメラex１１６等で撮影しているコンテンツ（例えば、音楽ライブを撮影した映像等）を上記実施の形態同様に符号化処理してストリーミングサーバex１０３に送信する一方で、ストリーミングサーバex１０３は要求のあったクライアントに対して上記コンテンツデータをストリーム配信する。クライアントとしては、上記符号化処理されたデータを復号化することが可能な、コンピュータex１１１、ＰＤＡex１１２、カメラex１１３、携帯電話ex１１４等がある。このようにすることでコンテンツ供給システムex１００は、符号化されたデータをクライアントにおいて受信して再生することができ、さらにクライアントにおいてリアルタイムで受信して復号化し、再生することにより、個人放送をも実現可能になるシステムである。 In the content supply system ex100, the content (for example, a video of a live music shot) captured by the user with the camera ex113, the camera ex116, or the like is encoded and transmitted to the streaming server ex103 in the same manner as in the above embodiment. On the other hand, the streaming server ex103 stream-distributes the content data to the requesting client. Examples of the client include a computer ex111, a PDA ex112, a camera ex113, a mobile phone ex114, and the like, which are capable of decoding the encoded data. In this way, the content providing system ex100 can receive and reproduce the encoded data on the client, and further, can receive, decode, and reproduce the encoded data on the client in real time, thereby realizing personal broadcasting. It is a system that becomes possible.

このシステムを構成する各機器の符号化、復号化には上記各実施の形態で示した動画像符号化装置あるいは動画像復号化装置を用いるようにすればよい。
その一例として携帯電話について説明する。
図１６は、上記実施の形態で説明した動画像符号化方法と動画像復号化方法を用いた携帯電話ex１１５を示す図である。携帯電話ex１１５は、基地局ex１１０との間で電波を送受信するためのアンテナex２０１、ＣＣＤカメラ等の映像、静止画を撮ることが可能なカメラ部ex２０３、カメラ部ex２０３で撮影した映像、アンテナex２０１で受信した映像等が復号化されたデータを表示する液晶ディスプレイ等の表示部ex２０２、操作キーｅｘ２０４群から構成される本体部、音声出力をするためのスピーカ等の音声出力部ex２０８、音声入力をするためのマイク等の音声入力部ex２０５、撮影した動画もしくは静止画のデータ、受信したメールのデータ、動画のデータもしくは静止画のデータ等、符号化されたデータまたは復号化されたデータを保存するための記録メディアex２０７、携帯電話ex１１５に記録メディアex２０７を装着可能とするためのスロット部ex２０６を有している。記録メディアex２０７はＳＤカード等のプラスチックケース内に電気的に書換えや消去が可能な不揮発性メモリであるＥＥＰＲＯＭ（Electrically Erasable and Programmable Read Only Memory）の一種であるフラッシュメモリ素子を格納したものである。 The encoding and decoding of each device constituting this system may be performed using the video encoding device or the video decoding device described in each of the above embodiments.
A mobile phone will be described as an example.
FIG. 16 is a diagram illustrating the mobile phone ex115 using the moving picture coding method and the moving picture decoding method described in the above embodiment. The mobile phone ex115 includes an antenna ex201 for transmitting and receiving radio waves to and from the base station ex110, a video image of a CCD camera or the like, a camera unit ex203 capable of taking a still image, a video image captured by the camera unit ex203, and an antenna ex201. A display unit ex202 such as a liquid crystal display for displaying data obtained by decoding a received video or the like, a main unit including a group of operation keys ex204, an audio output unit ex208 such as a speaker for outputting audio, and audio input. Input unit ex205 such as a microphone for storing encoded or decoded data, such as data of captured moving images or still images, received mail data, moving image data or still image data, etc. Recording medium ex207, and a slot ex20 for allowing the recording medium ex207 to be attached to the mobile phone ex115. The it has. The recording medium ex207 stores a flash memory element which is a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory) which is a nonvolatile memory which can be electrically rewritten and erased, in a plastic case such as an SD card.

さらに、携帯電話ex１１５について図１７を用いて説明する。携帯電話ex１１５は表示部ex２０２及び操作キーｅｘ２０４を備えた本体部の各部を統括的に制御するようになされた主制御部ex３１１に対して、電源回路部ex３１０、操作入力制御部ex３０４、画像符号化部ex３１２、カメラインターフェース部ex３０３、ＬＣＤ（Liquid Crystal Display）制御部ex３０２、画像復号化部ex３０９、多重分離部ex３０８、記録再生部ex３０７、変復調回路部ex３０６及び音声処理部ex３０５が同期バスex３１３を介して互いに接続されている。 Further, the mobile phone ex115 will be described with reference to FIG. The mobile phone ex115 controls a power supply circuit unit ex310, an operation input control unit ex304, an image encoding unit, and a main control unit ex311 that integrally controls each unit of a main unit including a display unit ex202 and operation keys ex204. Unit ex312, camera interface unit ex303, LCD (Liquid Crystal Display) control unit ex302, image decoding unit ex309, demultiplexing unit ex308, recording / reproducing unit ex307, modulation / demodulation circuit unit ex306, and audio processing unit ex305 via a synchronous bus ex313. Connected to each other.

電源回路部ex３１０は、ユーザの操作により終話及び電源キーがオン状態にされると、バッテリパックから各部に対して電力を供給することによりカメラ付ディジタル携帯電話ex１１５を動作可能な状態に起動する。
携帯電話ex１１５は、ＣＰＵ、ＲＯＭ及びＲＡＭ等でなる主制御部ex３１１の制御に基づいて、音声通話モード時に音声入力部ex２０５で集音した音声信号を音声処理部ex３０５によってディジタル音声データに変換し、これを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。また携帯電話機ex１１５は、音声通話モード時にアンテナex２０１で受信した受信データを増幅して周波数変換処理及びアナログディジタル変換処理を施し、変復調回路部ex３０６でスペクトラム逆拡散処理し、音声処理部ex３０５によってアナログ音声データに変換した後、これを音声出力部ex２０８を介して出力する。 The power supply circuit unit ex310 activates the camera-equipped digital mobile phone ex115 in an operable state by supplying power to each unit from the battery pack when the call end and the power key are turned on by a user operation. .
The mobile phone ex115 converts an audio signal collected by the audio input unit ex205 into digital audio data by the audio processing unit ex305 in the audio communication mode based on the control of the main control unit ex311 including a CPU, a ROM, a RAM, and the like. This is spread-spectrum processed by a modulation / demodulation circuit unit ex306, subjected to digital-analog conversion processing and frequency conversion processing by a transmission / reception circuit unit ex301, and then transmitted via an antenna ex201. The mobile phone ex115 amplifies received data received by the antenna ex201 in the voice call mode, performs frequency conversion processing and analog-to-digital conversion processing, performs spectrum despreading processing in the modulation / demodulation circuit unit ex306, and performs analog voice decoding in the voice processing unit ex305. After the data is converted, the data is output via the audio output unit ex208.

さらに、データ通信モード時に電子メールを送信する場合、本体部の操作キーｅｘ２０４の操作によって入力された電子メールのテキストデータは操作入力制御部ex３０４を介して主制御部ex３１１に送出される。主制御部ex３１１は、テキストデータを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して基地局ex１１０へ送信する。 Further, when an e-mail is transmitted in the data communication mode, text data of the e-mail input by operating the operation key ex204 of the main body is sent to the main control unit ex311 via the operation input control unit ex304. The main control unit ex311 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex306, performs digital / analog conversion processing and frequency conversion processing on the transmission / reception circuit unit ex301, and transmits the text data to the base station ex110 via the antenna ex201.

データ通信モード時に画像データを送信する場合、カメラ部ex２０３で撮像された画像データをカメラインターフェース部ex３０３を介して画像符号化部ex３１２に供給する。また、画像データを送信しない場合には、カメラ部ex２０３で撮像した画像データをカメラインターフェース部ex３０３及びＬＣＤ制御部ex３０２を介して表示部ex２０２に直接表示することも可能である。 When transmitting image data in the data communication mode, the image data captured by the camera unit ex203 is supplied to the image encoding unit ex312 via the camera interface unit ex303. When the image data is not transmitted, the image data captured by the camera unit ex203 can be directly displayed on the display unit ex202 via the camera interface unit ex303 and the LCD control unit ex302.

画像符号化部ex３１２は、本願発明で説明した動画像符号化装置を備えた構成であり、カメラ部ex２０３から供給された画像データを上記実施の形態で示した動画像符号化装置に用いた符号化方法によって圧縮符号化することにより符号化画像データに変換し、これを多重分離部ex３０８に送出する。また、このとき同時に携帯電話機ex１１５は、カメラ部ex２０３で撮像中に音声入力部ex２０５で集音した音声を音声処理部ex３０５を介してディジタルの音声データとして多重分離部ex３０８に送出する。 The image encoding unit ex312 includes the moving image encoding device described in the present invention, and encodes the image data supplied from the camera unit ex203 using the moving image encoding device described in the above embodiment. The image data is converted into encoded image data by performing compression encoding according to a demultiplexing method, and is transmitted to the demultiplexing unit ex308. Further, at this time, the mobile phone ex115 simultaneously transmits the voice collected by the voice input unit ex205 during imaging by the camera unit ex203 to the demultiplexing unit ex308 as digital voice data via the voice processing unit ex305.

多重分離部ex３０８は、画像符号化部ex３１２から供給された符号化画像データと音声処理部ex３０５から供給された音声データとを所定の方式で多重化し、その結果得られる多重化データを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。 The demultiplexing unit ex308 multiplexes the encoded image data supplied from the image encoding unit ex312 and the audio data supplied from the audio processing unit ex305 by a predetermined method, and modulates and outputs the resulting multiplexed data. The signal is subjected to spread spectrum processing in ex306 and subjected to digital / analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex301, and then transmitted via the antenna ex201.

データ通信モード時にホームページ等にリンクされた動画像ファイルのデータを受信する場合、アンテナex２０１を介して基地局ex１１０から受信した受信データを変復調回路部ex３０６でスペクトラム逆拡散処理し、その結果得られる多重化データを多重分離部ex３０８に送出する。
また、アンテナex２０１を介して受信された多重化データを復号化するには、多重分離部ex３０８は、多重化データを分離することにより画像データのビットストリームと音声データのビットストリームとに分け、同期バスex３１３を介して当該符号化画像データを画像復号化部ex３０９に供給すると共に当該音声データを音声処理部ex３０５に供給する。 When data of a moving image file linked to a homepage or the like is received in the data communication mode, the data received from the base station ex110 via the antenna ex201 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex306, and the resulting multiplexed data is obtained. The demultiplexed data is sent to the demultiplexing unit ex308.
To decode the multiplexed data received via the antenna ex201, the demultiplexing unit ex308 separates the multiplexed data into a bit stream of image data and a bit stream of audio data, and performs synchronization. The coded image data is supplied to the image decoding unit ex309 via the bus ex313, and the audio data is supplied to the audio processing unit ex305.

次に、画像復号化部ex３０９は、本願発明で説明した動画像復号化装置を備えた構成であり、画像データのビットストリームを上記実施の形態で示した符号化方法に対応した復号化方法で復号することにより再生動画像データを生成し、これをＬＣＤ制御部ex３０２を介して表示部ex２０２に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まれる動画データが表示される。このとき同時に音声処理部ex３０５は、音声データをアナログ音声データに変換した後、これを音声出力部ex２０８に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まる音声データが再生される。 Next, the image decoding unit ex309 has a configuration including the moving image decoding device described in the present invention, and converts a bit stream of image data into a decoding method corresponding to the encoding method described in the above embodiment. By decoding, reproduced moving image data is generated and supplied to the display unit ex202 via the LCD control unit ex302, whereby, for example, moving image data included in a moving image file linked to a homepage is displayed. At this time, at the same time, the audio processing unit ex305 converts the audio data into analog audio data, and then supplies the analog audio data to the audio output unit ex208, whereby, for example, the audio data included in the moving image file linked to the homepage is reproduced. You.

なお、上記システムの例に限られず、最近は衛星、地上波によるディジタル放送が話題となっており、図１８に示すようにディジタル放送用システムにも上記実施の形態の少なくとも動画像符号化装置または動画像復号化装置のいずれかを組み込むことができる。具体的には、放送局ex４０９では映像情報のビットストリームが電波を介して通信または放送衛星ex４１０に伝送される。これを受けた放送衛星ex４１０は、放送用の電波を発信し、この電波を衛星放送受信設備をもつ家庭のアンテナex４０６で受信し、テレビ（受信機）ex４０１またはセットトップボックス（ＳＴＢ）ex４０７などの装置によりビットストリームを復号化してこれを再生する。また、記録媒体であるCDやDVD等の蓄積メディアex４０２に記録したビットストリームを読み取り、復号化する再生装置ex４０３にも上記実施の形態で示した動画像復号化装置を実装することが可能である。この場合、再生された映像信号はモニタex４０４に表示される。また、ケーブルテレビ用のケーブルex４０５または衛星／地上波放送のアンテナex４０６に接続されたセットトップボックスex４０７内に動画像復号化装置を実装し、これをテレビのモニタex４０８で再生する構成も考えられる。このときセットトップボックスではなく、テレビ内に動画像復号化装置を組み込んでも良い。また、アンテナex４１１を有する車ex４１２で衛星ex４１０からまたは基地局ex１０７等から信号を受信し、車ex４１２が有するカーナビゲーションex４１３等の表示装置に動画を再生することも可能である。 It should be noted that the present invention is not limited to the example of the system described above, and digital broadcasting using satellites and terrestrial waves has recently become a topic. As shown in FIG. Any of the video decoding devices can be incorporated. Specifically, at the broadcasting station ex409, the bit stream of the video information is transmitted to the communication or the broadcasting satellite ex410 via radio waves. The broadcast satellite ex410 receiving this transmits a broadcast radio wave, receives this radio wave with a home antenna ex406 having a satellite broadcast reception facility, and outputs the radio wave to a television (receiver) ex401 or a set-top box (STB) ex407. The device decodes the bit stream and reproduces it. In addition, the moving picture decoding apparatus described in the above embodiment can be mounted on a reproducing apparatus ex403 that reads and decodes a bit stream recorded on a storage medium ex402 such as a CD or DVD that is a recording medium. . In this case, the reproduced video signal is displayed on the monitor ex404. Further, a configuration is also conceivable in which a moving image decoding apparatus is mounted in a set-top box ex407 connected to a cable ex405 for cable television or an antenna ex406 for satellite / terrestrial broadcasting, and this is reproduced on a monitor ex408 of the television. At this time, the moving picture decoding device may be incorporated in the television instead of the set-top box. In addition, a car ex412 having an antenna ex411 can receive a signal from the satellite ex410 or a base station ex107 or the like, and can reproduce a moving image on a display device such as a car navigation ex413 included in the car ex412.

更に、画像信号を上記実施の形態で示した動画像符号化装置で符号化し、記録媒体に記録することもできる。具体例としては、DVDディスクｅｘ４２１に画像信号を記録するDVDレコーダや、ハードディスクに記録するディスクレコーダなどのレコーダｅx４２０がある。更にSDカードｅｘ４２２に記録することもできる。レコーダｅｘ４２０が上記実施の形態で示した動画像復号化装置を備えていれば、DVDディスクｅｘ４２１やSDカードｅｘ４２２に記録した画像信号を再生し、モニタｅｘ４０８で表示することができる。 Furthermore, an image signal can be encoded by the moving image encoding device described in the above embodiment and recorded on a recording medium. As a specific example, there is a recorder ex420 such as a DVD recorder for recording an image signal on a DVD disc ex421 or a disc recorder for recording on a hard disk. Furthermore, it can be recorded on the SD card ex422. If the recorder ex420 includes the moving picture decoding device described in the above embodiment, the video signal recorded on the DVD disc ex421 or the SD card ex422 can be reproduced and displayed on the monitor ex408.

なお、カーナビゲーションex４１３の構成は例えば図１７に示す構成のうち、カメラ部ex２０３とカメラインターフェース部ex３０３、画像符号化部ｅｘ３１２を除いた構成が考えられ、同様なことがコンピュータex１１１やテレビ（受信機）ex４０１等でも考えられる。
また、上記携帯電話ex１１４等の端末は、符号化器・復号化器を両方持つ送受信型の端末の他に、符号化器のみの送信端末、復号化器のみの受信端末の３通りの実装形式が考えられる。 The configuration of the car navigation system ex413 may be, for example, a configuration excluding the camera unit ex203, the camera interface unit ex303, and the image encoding unit ex312 from the configuration illustrated in FIG. 17, and the same applies to the computer ex111 and the television (receiver). ) Ex401 and the like are also conceivable.
In addition, the terminal such as the above-mentioned mobile phone ex114 has three mounting formats, in addition to a transmitting / receiving terminal having both an encoder and a decoder, a transmitting terminal having only an encoder and a receiving terminal having only a decoder. Can be considered.

このように、上記実施の形態で示した動画像符号化方法あるいは動画像復号化方法を上述したいずれの機器・システムに用いることは可能であり、そうすることで、上記実施の形態で説明した効果を得ることができる。
また、本発明はかかる上記実施形態に限定されるものではなく、本発明の範囲を逸脱することなく種々の変形または修正が可能である。 As described above, the moving picture coding method or the moving picture decoding method described in the above embodiment can be used for any of the devices and systems described above. The effect can be obtained.
Further, the present invention is not limited to the above embodiment, and various changes or modifications can be made without departing from the scope of the present invention.

以上のように、本発明に係る動画像符号化方法および動画像復号化方法は、例えば携帯電話、ＤＶＤ装置、およびパーソナルコンピュータ等で、動画像を構成する各ピクチャを符号化して符号列を生成したり、生成された符号列を復号化したりするための方法として有用である。 As described above, according to the moving picture coding method and the moving picture decoding method according to the present invention, for example, a mobile phone, a DVD device, a personal computer or the like codes each picture constituting a moving picture to generate a code sequence. Or a method for decoding the generated code string.

本発明の動画像符号化方法を実現する動画像符号化装置の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a video encoding device that implements the video encoding method of the present invention. 本発明のストリームの構成図（実施の形態１）であり、（ａ）ランダムアクセスユニットRAUの構造例、（ｂ）可変速再生マップRAMの一例、（ｃ）可変速再生マップRAMの他の例、（ｄ）可変速再生マップRAMの他の例である。It is a block diagram of a stream of the present invention (Embodiment 1), (a) a structural example of a random access unit RAU, (b) an example of a variable speed reproduction map RAM, (c) another example of a variable speed reproduction map RAM , (D) is another example of a variable speed reproduction map RAM. ピクチャの参照関係（実施の形態１）を示す模式図であり、（ａ）ピクチャの参照関係の一例、（ｂ）ピクチャの参照関係の他の例、（ｃ）ピクチャの参照関係の他の例である。It is a schematic diagram which shows the reference relationship of a picture (Embodiment 1), (a) An example of a reference relationship of a picture, (b) Another example of a reference relationship of a picture, (c) Another example of a reference relationship of a picture It is. 可変速再生マップRAMを作成する際の動作を示すフローチャートである。5 is a flowchart showing an operation when creating a variable speed reproduction map RAM. ピクチャパラメータセットPPSを付加する際の動作を示すフローチャートである。9 is a flowchart illustrating an operation when a picture parameter set PPS is added. 対象ピクチャを符号化する際の動作を示すフローチャートである。9 is a flowchart illustrating an operation when encoding a target picture. 本発明のストリームの構成図（実施の形態２）であり、（ａ）ランダムアクセスユニットRAUの構造例、（ｂ）可変速再生マップテーブルRAMTBLの一例である。It is a block diagram (Embodiment 2) of a stream of the present invention, (a) is an example of the structure of the random access unit RAU, (b) is an example of the variable speed reproduction map table RAMTBL. 可変速再生マップ識別子RAMIDを作成する際の動作を示すフローチャートである。9 is a flowchart showing an operation when creating a variable speed reproduction map identifier RAMID. 本発明の動画像復号化方法を実現する動画像復号化装置の構成を示すブロック図である。It is a block diagram showing the composition of the video decoding device which realizes the video decoding method of the present invention. 動画像復号化装置での動作を示すフローチャートであり、（ａ）実施の形態１に示すストリームの構成、（ｂ）実施の形態２に示すストリームの構成に対応するフローチャートである。It is a flowchart which shows operation | movement in a moving image decoding apparatus, Comprising: (a) It is a flowchart corresponding to the stream structure shown in Embodiment 1, and (b) the stream structure shown in Embodiment 2. ピクチャメモリの保存状態を示す模式図であり、（ａ）通常再生時、（ｂ）従来の４倍速再生時、（ｃ）本発明の４倍速再生時、の保存状態を示す模式図である。It is a schematic diagram which shows the preservation | save state of a picture memory, It is a schematic diagram which shows the preservation | save state at the time of (a) at the time of normal reproduction | regeneration, (b) at the time of 4 times speed reproduction of the conventional, and (c) at the time of 4 times speed reproduction of this invention. 対象ピクチャを復号化する際の動作を示すフローチャートである。9 is a flowchart illustrating an operation when decoding a target picture. 動画像復号化装置での他の動作を示すフローチャートであり、（ａ）実施の形態１に示すストリームの構成、（ｂ）実施の形態２に示すストリームの構成に対応するフローチャートである。10 is a flowchart illustrating another operation of the video decoding device, and corresponds to (a) a stream configuration described in Embodiment 1 and (b) a flowchart corresponding to the stream configuration described in Embodiment 2. 各実施の形態の動画像符号化方法および動画像復号化方法をコンピュータシステムにより実現するためのプログラムを格納するための記録媒体についての説明図であり、（ａ）記録媒体本体であるフレキシブルディスクの物理フォーマットの例を示した説明図、（ｂ）フレキシブルディスクの正面からみた外観、断面構造、及びフレキシブルディスクを示した説明図、（ｃ）フレキシブルディスクＦＤに上記プログラムの記録再生を行うための構成を示した説明図である。It is an explanatory view of a recording medium for storing a program for realizing the moving picture encoding method and the moving picture decoding method of each embodiment by a computer system, and (a) a flexible disk as a recording medium main body. Explanatory diagram showing an example of a physical format, (b) an external view from the front of a flexible disk, a cross-sectional structure, and an explanatory diagram showing a flexible disk, and (c) a configuration for recording and reproducing the program on the flexible disk FD. FIG. コンテンツ配信サービスを実現するコンテンツ供給システムの全体構成を示すブロック図である。FIG. 1 is a block diagram illustrating an overall configuration of a content supply system that realizes a content distribution service. 携帯電話の一例を示す図である。It is a figure which shows an example of a mobile telephone. 携帯電話の内部構成を示すブロック図である。FIG. 2 is a block diagram illustrating an internal configuration of the mobile phone. ディジタル放送用システムの全体構成を示すブロック図である。FIG. 1 is a block diagram illustrating an overall configuration of a digital broadcasting system. 従来のMPEG2のストリームの構成図であり、（ａ）ピクチャの流れ、（ｂ）ストリームの階層構造を示す図である。It is a block diagram of a conventional MPEG2 stream, and is a diagram showing (a) a picture flow and (b) a hierarchical structure of a stream. 従来の他のストリームの階層構造を示す図である。FIG. 11 is a diagram illustrating a hierarchical structure of another conventional stream. 従来の動画像符号化方法を実現する動画像符号化装置の構成を示すブロック図である。FIG. 11 is a block diagram illustrating a configuration of a moving image encoding device that realizes a conventional moving image encoding method. 従来の動画像復号化方法を実現する動画像復号化装置の構成を示すブロック図である。FIG. 11 is a block diagram illustrating a configuration of a video decoding device that implements a conventional video decoding method. ピクチャの参照関係の例を示す模式図であり、（ａ）MPEG-2におけるピクチャ間の予測構造、（ｂ）JVTにおける予測構造の例である。FIG. 3 is a schematic diagram illustrating an example of a picture reference relationship, in which (a) is a prediction structure between pictures in MPEG-2, and (b) is an example of a prediction structure in JVT. ピクチャの参照関係の例を示す模式図である。FIG. 4 is a schematic diagram illustrating an example of a picture reference relationship.

Explanation of reference numerals

１動画像符号化装置
２動画像復号化装置
１０１動き検出部
１０２、２０４動き補償部
１０３減算部
１０４直交変換部
１０５量子化部
１０６、２０５逆量子化部
１０７、２０６逆直交変換部
１０８、２０７加算部
１０９、２０８ピクチャメモリ
１１０スイッチ
１１１予測構造決定部
１１２可変長符号化部
１１３マップ作成部
１１４検知部
１１５共通情報付加部
２０１ストリーム抽出部
２０２可変長復号化部
２０３抽出ピクチャ選択部
２０９メモリ制御部 REFERENCE SIGNS LIST 1 moving picture coding apparatus 2 moving picture decoding apparatus 101 motion detecting section 102, 204 motion compensating section 103 subtracting section 104 orthogonal transform section 105 quantizing section 106, 205 inverse quantizing section 107, 206 inverse orthogonal transform section 108, 207 Addition unit 109, 208 Picture memory 110 Switch 111 Prediction structure determination unit 112 Variable length coding unit 113 Map creation unit 114 Detection unit 115 Common information addition unit 201 Stream extraction unit 202 Variable length decoding unit 203 Extracted picture selection unit 209 Memory control Department

Claims

A moving picture coding method for coding a moving picture signal on a picture basis to generate a coded stream,
An information creating step of creating variable speed playback information for specifying a picture to be subjected to variable speed playback,
Encoding the variable speed playback information and adding it to the encoded stream.

In the information creating step, the random access unit is composed of a plurality of pictures, and the decoding target picture in the random access unit can be decoded with reference to only the pictures in the random access unit. The moving picture encoding method according to claim 1, wherein the variable speed reproduction information is created for each of the random access units.

2. The information creating step according to claim 1, wherein the variable-speed playback information is created by associating a variable-speed playback speed with information specifying a picture to be played back at the speed. Video encoding method.

The moving picture encoding method according to claim 3, wherein the variable-speed playback information describes the speed at which decoding of the picture is required for each picture number.

The moving picture encoding method according to claim 3, wherein the variable-speed reproduction information describes, in order, the rates at which decoding is required for each picture.

The moving picture encoding method according to claim 3, wherein the variable speed reproduction information describes a picture number of a picture necessary for reproduction at the speed for each of the speeds.

The moving picture encoding method according to claim 1, wherein, in the information creating step, an index for identifying a corresponding reference relation from a list of predetermined reference relations is created as the variable speed reproduction information.

The list of the predetermined reference relationship is a map table having a plurality of sets of map information in which the speed of variable-speed playback and information for specifying a picture to be played at the speed are associated with each other,
The moving picture encoding method according to claim 7, wherein, in the information creating step, an identifier for selecting corresponding map information from the map table is created as the variable speed reproduction information.

In the information creating step, the map table is created,
The moving image encoding method according to claim 8, wherein, in the encoding step, the map table is encoded and added to the encoded stream.

The video encoding method further includes:
A detecting step of detecting the necessity of encoding the common information referred to by the picture targeted for the variable-speed playback,
2. The moving picture encoding method according to claim 1, further comprising: adding a common information to the picture for which the necessity of encoding the common information is detected in the detecting step.

A moving image decoding method for decoding an encoded stream in units of pictures,
An information extraction step of extracting variable speed reproduction information for specifying a picture to be subjected to variable speed reproduction,
Decoding the variable-speed playback information, and identifying and decoding a picture to be subjected to variable-speed playback based on the variable-speed playback information. Method.

The information extracting step is a random access unit composed of a plurality of pictures, and is capable of decoding a decoding target picture in the random access unit with reference to only the pictures in the random access unit. The moving picture decoding method according to claim 11, wherein the variable speed reproduction information is extracted for each of the random access units.

The variable speed reproduction information is associated with a variable speed reproduction speed and information specifying a picture to be reproduced at the speed,
The moving picture decoding method according to claim 11, wherein in the decoding step, a picture to be reproduced at the specified speed is specified based on the specified speed and the variable speed reproduction information. .

The moving picture decoding method according to claim 13, wherein the variable-speed playback information describes the speed at which decoding of the picture is required for each picture number.

The moving picture decoding method according to claim 13, wherein the variable speed reproduction information describes the speeds at which decoding is required for each picture in order.

The moving picture decoding method according to claim 13, wherein the variable speed reproduction information describes a picture number of a picture necessary for reproduction at the speed for each of the speeds.

The variable-speed playback information is an index that specifies a corresponding reference relationship from a predetermined list of reference relationships,
The moving picture decoding method according to claim 11, wherein, in the decoding step, a corresponding reference relation is specified from the predetermined reference relation list based on the index.

The list of the predetermined reference relationship is a map table having a plurality of sets of map information in which the speed of variable-speed playback and information for specifying a picture to be played at the speed are associated with each other,
The index is an identifier for selecting corresponding map information from the map table,
The moving picture decoding according to claim 17, wherein, in the decoding step, a picture to be reproduced at the designated speed is specified based on the designated speed, the map table, and the identifier. Method.

In the information extracting step, the map table is extracted,
19. The moving picture decoding method according to claim 18, wherein, in the decoding step, the map table is decoded.

The video decoding method may further include:
At the time of variable speed playback, a memory control step of controlling a picture referred to by another picture as being stored in a picture memory, even if the picture is not a target of the variable speed playback, is included. The moving picture decoding method according to claim 11.

A moving image encoding apparatus that encodes a moving image signal on a picture basis to generate an encoded stream,
Information creating means for creating information for variable speed playback for specifying a picture to be subjected to variable speed playback,
Encoding means for encoding the variable speed reproduction information and adding the encoded information to the encoded stream.

A moving picture decoding apparatus for decoding an encoded stream in units of pictures,
Information extracting means for extracting information for variable-speed reproduction for specifying a picture to be subjected to variable-speed reproduction,
Decoding means for decoding the information for variable-speed playback and for specifying and decoding a picture to be subjected to variable-speed playback based on the information for variable-speed playback. apparatus.

A program for encoding a moving image signal on a picture basis to generate an encoded stream,
An information creating step of creating variable speed playback information for specifying a picture to be subjected to variable speed playback,
A step of encoding the variable speed reproduction information and adding the encoded information to the encoded stream.

A program for decoding an encoded stream in units of pictures,
An information extraction step of extracting variable speed reproduction information for specifying a picture to be subjected to variable speed reproduction,
Decoding the variable-speed playback information, and identifying and decoding a picture to be subjected to variable-speed playback based on the variable-speed playback information. .

A coded stream obtained by coding a moving image signal on a picture basis,
A coded stream including variable-speed playback information for specifying a picture to be subjected to variable-speed playback.