JP2015002495A

JP2015002495A - Image decoder, and image encoder

Info

Publication number: JP2015002495A
Application number: JP2013127103A
Authority: JP
Inventors: 山本　智幸; Tomoyuki Yamamoto; 智幸山本; 健史筑波; Kenji Tsukuba; 知宏猪飼; Tomohiro Igai
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2013-06-18
Filing date: 2013-06-18
Publication date: 2015-01-05

Abstract

PROBLEM TO BE SOLVED: To reduce the memory in coding or decoding.SOLUTION: Prior to decode picture derivation processing of a target picture in an upper layer, a hierarchy moving picture decoder adds to a decode picture buffer an interlayer reference picture that is generated on the basis of a decode picture of a lower layer, sets the reference mark of the interlayer reference picture to reference use, and sets the output mark of the interlayer reference picture to output unnecessary. With reference to the output mark and reference mark of each picture of the decode picture buffer, a picture management section performs bumping for specifying a picture not requiring decoding and output after the target picture, and discarding the picture thus specified from the decode picture buffer.

Description

本発明は、画像が階層的に符号化された階層符号化データを復号する画像復号装置、および画像を階層的に符号化することによって階層符号化データを生成する画像符号化装置に関する。 The present invention relates to an image decoding apparatus that decodes hierarchically encoded data in which an image is hierarchically encoded, and an image encoding apparatus that generates hierarchically encoded data by hierarchically encoding an image.

通信システムで伝送される情報、あるいは蓄積装置に記録される情報の１つに画像あるいは動画像がある。従来、これらの画像（以降、動画像を含む）の伝送・蓄積のため、画像を符号化する技術が知られている。 One of information transmitted in the communication system or information recorded in the storage device is an image or a moving image. 2. Description of the Related Art Conventionally, a technique for encoding an image for transmitting and storing these images (hereinafter including moving images) is known.

動画像符号化方式としては、AVC（H.264/MPEG-4 Advanced Video Coding）や、その後継コーデックであるHEVC（High-Efficiency Video Coding）が知られている（非特許文献１）。 As a moving picture coding system, AVC (H.264 / MPEG-4 Advanced Video Coding) and its successor codec HEVC (High-Efficiency Video Coding) are known (Non-patent Document 1).

これらの動画像符号化方式では、通常、入力画像を符号化／復号することによって得られる局所復号画像に基づいて予測画像が生成され、当該予測画像を入力画像（原画像）から減算して得られる予測残差（「差分画像」または「残差画像」と呼ぶこともある）が符号化される。また、予測画像の生成方法としては、画面間予測（インター予測）、および、画面内予測（イントラ予測）が挙げられる。 In these moving image encoding methods, a predicted image is usually generated based on a local decoded image obtained by encoding / decoding an input image, and obtained by subtracting the predicted image from the input image (original image). Prediction residuals (sometimes referred to as “difference images” or “residual images”) are encoded. In addition, examples of the method for generating a predicted image include inter-screen prediction (inter prediction) and intra-screen prediction (intra prediction).

イントラ予測では、同一ピクチャ内の局所復号画像に基づいて、当該ピクチャにおける予測画像が順次生成される。 In intra prediction, predicted images in a picture are sequentially generated based on a locally decoded image in the same picture.

インター予測では、ピクチャ間の動き補償により予測画像が生成される。インター予測で予測画像生成に用いられる復号済のピクチャは参照ピクチャと呼ばれる。 In inter prediction, a predicted image is generated by motion compensation between pictures. A decoded picture used for predictive image generation in inter prediction is called a reference picture.

HEVCでは、復号済のピクチャは復号ピクチャバッファ（DPB: Decoded Picture Buffer）に記録されている。DPB上の各ピクチャには、参照ピクチャとして利用される可能性の有無の情報（参照マーク）と、出力ピクチャとして出力する必要の有無の情報（出力マーク）が関連付けられている。参照マークおよび出力マークの情報に基づいて不要なピクチャをDPBから取り除くことで、DPBに記録されているピクチャの枚数が過剰にならないように制御できる。 In HEVC, decoded pictures are recorded in a decoded picture buffer (DPB). Each picture on the DPB is associated with information on whether or not there is a possibility of being used as a reference picture (reference mark) and information on whether or not there is a need to output as an output picture (output mark). By removing unnecessary pictures from the DPB based on the reference mark and output mark information, the number of pictures recorded in the DPB can be controlled so as not to be excessive.

また、近年、必要なデータレートに従って、画像を階層的に符号化する階層符号化技術が提案されている。代表的な階層符号化方式の一つとしてSHVC（Scalable HEVC）が知られている（非特許文献２）。 In recent years, a hierarchical encoding technique for hierarchically encoding an image according to a necessary data rate has been proposed. SHVC (Scalable HEVC) is known as one of representative hierarchical coding methods (Non-patent Document 2).

SHVCでは、空間スケーラビリティ、時間スケーラビリティ、SNRスケーラビリティをサポートする。例えば空間スケーラビリティの場合、原画像から所望の解像度にダウンサンプリングした画像を下位レイヤとして符号化する。次に上位レイヤではレイヤ間の冗長性を除去するためにレイヤ間予測を行う。 SHVC supports spatial scalability, temporal scalability, and SNR scalability. For example, in the case of spatial scalability, an image downsampled from an original image to a desired resolution is encoded as a lower layer. Next, the upper layer performs inter-layer prediction in order to remove redundancy between layers.

レイヤ間予測には、レイヤ間画像予測とレイヤ間動き予測がある。レイヤ間画像予測では、下位レイヤの復号画像を利用して、予測画像を生成する。レイヤ間動き予測では、下位レイヤの動き情報を利用して、動き情報の予測値を導出する。 Inter-layer prediction includes inter-layer image prediction and inter-layer motion prediction. In inter-layer image prediction, a predicted image is generated using a decoded image of a lower layer. In inter-layer motion prediction, motion information prediction values are derived using motion information of lower layers.

SHVCでは、予測画像の生成に、インター予測、イントラ予測、レイヤ間画像予測のいずれかを利用できる。 In SHVC, any one of inter prediction, intra prediction, and inter-layer image prediction can be used to generate a predicted image.

SHVCのレイヤ間画像予測で使用される下位レイヤの復号画像は、所定の変換（例えばスケーリング）を適用した後に、レイヤ間参照ピクチャとしてバッファに記録される。SHVCでは、復号された対象レイヤのピクチャについては、HEVC同様に復号済のピクチャをDPBに記録して管理する仕組みが適用される。 The decoded image of the lower layer used in the inter-layer image prediction of SHVC is recorded in the buffer as an inter-layer reference picture after applying a predetermined transformation (for example, scaling). In SHVC, a mechanism for recording and managing a decoded picture in the DPB is applied to the decoded picture of the target layer, as in HEVC.

「Recommendation H.265 (04/13)」, ITU-T (２０１３年６月７日公開)"Recommendation H.265 (04/13)", ITU-T (released June 7, 2013) JCTVC-M1008_v1 「SHVC Working Draft 2」, Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11 13th Meeting: Incheon, KR, 18-26 Apr. 2013 (２０１３年５月２１日公開)JCTVC-M1008_v1 `` SHVC Working Draft 2 '', Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG 16 WP 3 and ISO / IEC JTC 1 / SC 29 / WG 11 13th Meeting: Incheon, KR, 18- 26 Apr. 2013 (May 21, 2013)

しかしながら、従来技術として挙げたSHVCでは、レイヤ間予測画像生成に利用される下位レイヤの復号ピクチャを、拡張レイヤにおけるDPBで正しく管理する仕組みが提供されていない。より具体的には、レイヤ間予測で参照した下位レイヤの復号ピクチャが不要となるタイミングが明らかではない。そのため、拡張レイヤの符号化または復号処理時に下位レイヤの復号ピクチャが適切なタイミングで破棄できないという課題があった。 However, the SHVC cited as the prior art does not provide a mechanism for correctly managing lower layer decoded pictures used for inter-layer prediction image generation by DPB in the enhancement layer. More specifically, the timing at which the decoded picture of the lower layer referenced in the inter-layer prediction becomes unnecessary is not clear. For this reason, there is a problem that the decoded picture of the lower layer cannot be discarded at an appropriate timing during the encoding or decoding process of the enhancement layer.

本発明は、上記問題に鑑みてされたものであり、その目的は、階層符号化方式において、下位レイヤの復号ピクチャをDPBに含めて管理することにより、下位レイヤの復号ピクチャがバッファ上で不要となった時点で破棄できる仕組みを提供することにある。また、それにより、より少ないメモリで符号化データを符号化／復号する画像符号化装置および画像復号装置を実現することにある。 The present invention has been made in view of the above problems, and its purpose is to eliminate the need for decoded pictures in the lower layer on the buffer by managing the decoded pictures in the lower layer in the DPB in the hierarchical coding scheme. It is to provide a mechanism that can be discarded when it becomes. It is also intended to realize an image encoding device and an image decoding device that encode / decode encoded data with less memory.

上記課題を解決するために、本発明に係る画像復号装置は、階層符号化された符号化データに含まれる上位レイヤの符号化データを復号し、上位レイヤの復号ピクチャを復元する画像復号装置であって、上位レイヤの復号ピクチャバッファを管理するピクチャ管理部を備え、上記ピクチャ管理部は、上位レイヤ内の対象ピクチャの復号ピクチャ導出処理に先立って、下位レイヤの復号ピクチャに基づき生成されるレイヤ間参照ピクチャを上記復号ピクチャバッファに追加し、当該レイヤ間参照ピクチャの参照マークを参照使用に設定し、当該レイヤ間参照ピクチャの出力マークを出力不要に設定することを特徴としている。 In order to solve the above problem, an image decoding apparatus according to the present invention is an image decoding apparatus that decodes higher layer encoded data included in hierarchically encoded data and restores a decoded picture of the upper layer. A picture management unit for managing a decoded picture buffer of an upper layer, wherein the picture management unit is a layer generated based on a decoded picture of a lower layer prior to a decoded picture derivation process of a target picture in the upper layer An inter-reference picture is added to the decoded picture buffer, the reference mark of the inter-layer reference picture is set to be used for reference, and the output mark of the inter-layer reference picture is set to be output unnecessary.

また、上記画像復号装置において、上記ピクチャ管理部は、上記復号ピクチャ導出処理の完了後、上記対象ピクチャが復号順である次の対象ピクチャに移る前のタイミングで、上記レイヤ間参照ピクチャの参照マークを参照不使用に設定する、ことが好ましい。 In the image decoding device, the picture management unit may perform the reference mark of the inter-layer reference picture at a timing after the decoded picture derivation process is completed and before the target picture moves to the next target picture in decoding order. Is preferably set to non-reference use.

また、上記画像復号装置において、上記ピクチャ管理部は、上記復号ピクチャバッファの各ピクチャの出力マークと参照マークとを参照して、上記対象ピクチャ以降の復号及び出力が不要であるピクチャを特定し、当該特定したピクチャを上記復号ピクチャバッファから破棄するバンピング処理を実行する、ことが好ましい。 Further, in the image decoding device, the picture management unit refers to an output mark and a reference mark of each picture in the decoded picture buffer, identifies a picture that does not need to be decoded and output after the target picture, It is preferable to execute a bumping process for discarding the identified picture from the decoded picture buffer.

また、上記画像復号装置において、参照レイヤのピクチャの復号ピクチャであるベース復号ピクチャを生成するベース復号部と、上記ベース復号部は、参照レイヤの復号ピクチャバッファであるベース復号ピクチャバッファを管理するベース復号ピクチャ管理部とを備え、上記ベースピクチャ管理部は、上記ピクチャ管理部で実行されるバンピング処理と同一のバンピング処理により、復号ピクチャバッファから破棄するピクチャを決定する、ことが好ましい。 Further, in the image decoding apparatus, a base decoding unit that generates a base decoded picture that is a decoded picture of a reference layer picture, and a base decoding unit that manages a base decoded picture buffer that is a decoded picture buffer of a reference layer Preferably, the base picture management unit determines a picture to be discarded from the decoded picture buffer by the same bumping process as the bumping process executed by the picture management unit.

上記課題を解決するために、本発明に係る画像復号装置は、階層符号化された符号化データに含まれる上位レイヤの符号化データを復号し、上位レイヤの復号ピクチャを復元する画像復号装置であって、複数レイヤ共通の復号ピクチャバッファを管理するピクチャ管理部を備え、上記ピクチャ管理部は、対象ピクチャの属するサブレイヤにおける同一レイヤ内ピクチャ間のインター予測適用有無を判定し、当該判定結果が当該同一レイヤ内ピクチャ間のインター予測適用不可である場合には、当該対象ピクチャの参照マークを参照不使用に設定することを特徴としている。 In order to solve the above problem, an image decoding apparatus according to the present invention is an image decoding apparatus that decodes higher layer encoded data included in hierarchically encoded data and restores a decoded picture of the upper layer. A picture management unit for managing a decoded picture buffer common to a plurality of layers, wherein the picture management unit determines whether or not inter prediction is applied between pictures in the same layer in a sublayer to which the target picture belongs, and the determination result is When inter prediction between the pictures in the same layer is not applicable, the reference mark of the target picture is set to non-reference use.

また、上記画像復号装置において、上記同一レイヤ内ピクチャ間のインター予測適用有無の判定は、最高次サブレイヤに属するピクチャに対してのみ適用される、ことが好ましい。 In the image decoding apparatus, it is preferable that the determination of whether to apply inter prediction between pictures in the same layer is applied only to a picture belonging to the highest sublayer.

上記課題を解決するために、本発明に係る画像復号装置は、階層符号化された符号化データに含まれる上位レイヤの符号化データを復号し、上位レイヤの復号ピクチャを復元する画像復号装置であって、複数レイヤ共通の復号ピクチャバッファを管理するピクチャ管理部と、階層符号化データからシンタックス値を復号する可変長復号部とを備え、上記可変長復号部は、シンタックス値として廃棄可能フラグを復号し、上記廃棄可能フラグは、対象ピクチャが復号順で後続のピクチャのインター予測またはレイヤ間画像予測で参照されない場合に１の値をとり、対象ピクチャが復号順で後続のピクチャのインター予測またはレイヤ間画像予測で参照される場合に０の値をとり、上記ピクチャ管理部は、上記廃棄可能フラグの値に応じて上記対象ピクチャの参照マークを更新する、ことが好ましい。 In order to solve the above problem, an image decoding apparatus according to the present invention is an image decoding apparatus that decodes higher layer encoded data included in hierarchically encoded data and restores a decoded picture of the upper layer. A variable length decoding unit that decodes a syntax value from hierarchically encoded data, and the variable length decoding unit can be discarded as a syntax value. The flag is decoded, and the discardable flag takes a value of 1 when the target picture is not referred to in inter prediction of the subsequent picture or inter-layer image prediction in the decoding order, and the target picture is interlaced of the subsequent picture in the decoding order. It takes a value of 0 when referred to in prediction or inter-layer image prediction, and the picture management unit sets the pair according to the value of the discardable flag. Updating a reference mark of the picture, it is preferable.

上記課題を解決するために、本発明に係る画像符号化装置は、入力画像から上位レイヤの符号化データを生成する画像符号化装置であって、上位レイヤの復号ピクチャバッファを管理するピクチャ管理部を備え、上記ピクチャ管理部は、上位レイヤ内の対象ピクチャの復号ピクチャ導出処理に先立って、下位レイヤの復号ピクチャに基づき生成されるレイヤ間参照ピクチャを上記復号ピクチャバッファに追加し、当該レイヤ間参照ピクチャの参照マークを参照使用に設定し、当該レイヤ間参照ピクチャの出力マークを出力不要に設定することを特徴としている。 In order to solve the above problems, an image encoding device according to the present invention is an image encoding device that generates encoded data of an upper layer from an input image, and a picture management unit that manages a decoded picture buffer of an upper layer The picture management unit adds an inter-layer reference picture generated based on the decoded picture of the lower layer to the decoded picture buffer prior to the decoding picture derivation process of the target picture in the upper layer, and The reference mark of the reference picture is set to be used for reference, and the output mark of the inter-layer reference picture is set to be output unnecessary.

本発明に係る画像復号装置はピクチャ管理部を備えおり、前記ピクチャ管理部は、レイヤ間参照ピクチャをDPBに追加し、続いて、当該レイヤ間参照ピクチャの参照マークを「参照使用」に設定するとともに、当該レイヤ間参照ピクチャの出力マークを「出力不要」に設定するベース参照ピクチャ制御部を備えている。したがって、階層動画像復号装置１は、ピクチャ復号に用いたレイヤ間参照ピクチャを、レイヤ間参照ピクチャとインター予測の参照ピクチャを区別しないバンピング処理を用いて、不要になった時点でDPBから除去できるため、少ないメモリ平均使用量で入力符号化データを復号できる。 The image decoding apparatus according to the present invention includes a picture management unit, and the picture management unit adds an inter-layer reference picture to the DPB, and subsequently sets a reference mark of the inter-layer reference picture to “use reference”. In addition, a base reference picture control unit for setting the output mark of the inter-layer reference picture to “output unnecessary” is provided. Therefore, the hierarchical video decoding device 1 can remove the inter-layer reference picture used for picture decoding from the DPB when it becomes unnecessary by using a bumping process that does not distinguish between the inter-layer reference picture and the inter-prediction reference picture. Therefore, input encoded data can be decoded with a small average memory usage.

本発明の一実施形態に係る階層動画像復号装置に含まれる復号ピクチャ管理部におけるピクチャ管理処理を例示するフロー図である。It is a flowchart which illustrates the picture management process in the decoding picture management part contained in the hierarchy moving image decoding apparatus which concerns on one Embodiment of this invention. 本発明の実施形態に係る階層符号化データのレイヤ構造を説明するための図であって、（ａ）は、階層動画像符号化装置側について示しており、（ｂ）は、階層動画像復号装置側について示している。It is a figure for demonstrating the layer structure of the hierarchy coding data which concerns on embodiment of this invention, Comprising: (a) has shown about the hierarchy moving image encoder side, (b) is a hierarchy moving image decoding. The device side is shown. 本発明の実施形態に係る階層符号化データの構成を説明するための図であって、（ａ）は、シーケンスＳＥＱを規定するシーケンスレイヤを示しており、（ｂ）は、ピクチャＰＩＣＴを規定するピクチャレイヤを示しており、（ｃ）は、スライスＳを規定するスライスレイヤを示しており、（ｄ）は、符号化ツリーユニットＣＴＵを規定するＣＴＵレイヤを示しており、（ｅ）は、符号化ツリーユニットＣＴＵに含まれる符号化単位（Coding Unit；ＣＵ）を規定するＣＵレイヤを示している。It is a figure for demonstrating the structure of the hierarchy coding data which concerns on embodiment of this invention, Comprising: (a) has shown the sequence layer which prescribes | regulates sequence SEQ, (b) has prescribed | regulated picture PICT. (C) shows the slice layer that defines the slice S, (d) shows the CTU layer that defines the coding tree unit CTU, and (e) shows the code layer 3 illustrates a CU layer that defines a coding unit (CU) included in a coding tree unit CTU. 上記階層動画像復号装置の概略的構成を示す機能ブロック図である。It is a functional block diagram which shows the schematic structure of the said hierarchy moving image decoding apparatus. 上記階層動画像復号装置に含まれる復号ピクチャ管理部の構成を例示した機能ブロック図である。It is the functional block diagram which illustrated the structure of the decoded picture management part contained in the said hierarchy moving image decoding apparatus. 復号処理の特定のタイミングでのDPBの状態を例示した図である。It is the figure which illustrated the state of DPB in the specific timing of a decoding process. 上記階層動画像復号装置の可変長復号部からＳＰＳ復号時に利用されるシンタックス表の一部であって、参照ピクチャセットおよび参照ピクチャリストに係る部分を示す図である。It is a part of the syntax table utilized at the time of SPS decoding from the variable length decoding part of the said hierarchy moving image decoding apparatus, Comprising: It is a figure which shows the part which concerns on a reference picture set and a reference picture list. 上記階層動画像復号装置の可変長復号部から短期参照ピクチャセット情報の復号時に利用されるシンタックス表を示す図である。It is a figure which shows the syntax table utilized at the time of decoding of short-term reference picture set information from the variable-length decoding part of the said hierarchy moving image decoding apparatus. 上記階層動画像復号装置の可変長復号部からスライスヘッダ復号時に利用されるシンタックス表の一部であって、参照ピクチャセットに係る部分を示す図である。It is a figure which is a part of syntax tables used at the time of slice header decoding from the variable length decoding part of the said hierarchy moving image decoding apparatus, Comprising: It is a figure which shows the part which concerns on a reference picture set. スライス復号時に参照されるシンタックス表の一部であって、ILRP情報に相当する部分を示す図である。It is a part of syntax table referred at the time of slice decoding, and is a figure which shows the part corresponded to ILRP information. 上記階層動画像復号装置の可変長復号部からスライスヘッダ復号時に利用されるシンタックス表の一部であって、参照ピクチャリストに係る部分を示す図である。It is a figure which is a part of syntax tables used at the time of slice header decoding from the variable length decoding part of the said hierarchy moving image decoding apparatus, Comprising: It is a figure which shows the part which concerns on a reference picture list. 上記階層動画像復号装置の可変長復号部から参照ピクチャリスト修正情報の復号時に利用されるシンタックス表を示す図である。It is a figure which shows the syntax table utilized at the time of the decoding of reference picture list correction information from the variable-length decoding part of the said hierarchy moving image decoding apparatus. 対象レイヤ上で復号順の連続する２枚のピクチャ（順にピクチャＭ、ピクチャＮ）でのピクチャ管理処理の具体例を示す図である。It is a figure which shows the specific example of the picture management process in two pictures (a picture M and a picture N in order) with which the decoding order continues on the object layer. 上記階層動画像復号装置に含まれるベース復号部の構成を例示する機能ブロック図である。It is a functional block diagram which illustrates the structure of the base decoding part contained in the said hierarchy moving image decoding apparatus. 共通DPMモデル利用時のピクチャ管理処理の一例において処理の一部を表すフロー図である。It is a flowchart showing a part of process in an example of the picture management process at the time of common DPM model utilization. 共通DPMモデル利用時のピクチャ管理処理の別の一例において処理の一部を表すフロー図である。It is a flowchart showing a part of process in another example of the picture management process at the time of common DPM model use. 本発明の一実施形態に係る階層動画像符号化装置の概略的構成を示す機能ブロック図である。It is a functional block diagram which shows schematic structure of the hierarchy moving image encoder which concerns on one Embodiment of this invention. 上記階層動画像符号化装置を搭載した送信装置、および、上記階層動画像復号装置を搭載した受信装置の構成を示した図である。（ａ）は、階層動画像符号化装置を搭載した送信装置を示しており、（ｂ）は、階層動画像復号装置を搭載した受信装置を示している。It is the figure which showed the structure of the transmitter which mounts the said hierarchy moving image encoder, and the receiver which mounts the said hierarchy moving image decoder. (A) shows a transmission device equipped with a hierarchical video encoding device, and (b) shows a reception device equipped with a hierarchical video decoding device. 上記階層動画像符号化装置を搭載した記録装置、および、上記階層動画像復号装置を搭載した再生装置の構成を示した図である。（ａ）は、階層動画像符号化装置を搭載した記録装置を示しており、（ｂ）は、階層動画像復号装置を搭載した再生装置を示している。It is the figure which showed the structure of the recording device carrying the said hierarchy moving image encoder, and the reproducing | regenerating apparatus carrying the said hierarchy moving image decoding apparatus. (A) shows a recording device equipped with a hierarchical video encoding device, and (b) shows a playback device equipped with a hierarchical video decoding device.

図１〜図１９に基づいて、本発明の一実施形態に係る階層動画像復号装置１および階層動画像符号化装置２を説明すれば以下のとおりである。 The hierarchical moving picture decoding apparatus 1 and the hierarchical moving picture encoding apparatus 2 according to an embodiment of the present invention will be described as follows based on FIGS.

〔概要〕
本実施の形態に係る階層動画像復号装置（画像復号装置）１は、階層動画像符号化装置（画像符号化装置）２によって階層符号化された符号化データを復号する。階層符号化とは、動画像を低品質のものから高品質のものにかけて階層的に符号化する符号化方式のことである。階層符号化は、例えば、SVCやSHVCにおいて標準化されている。なお、ここでいう動画像の品質とは、主観的および客観的な動画像の見栄えに影響する要素のことを広く意味する。動画像の品質には、例えば、“解像度”、“フレームレート”、“画質”、および、“画素の表現精度”が含まれる。よって、以下、動画像の品質が異なるといえば、例示的には、“解像度”等が異なることを指すが、これに限られない。例えば、異なる量子化ステップで量子化された動画像の場合（すなわち、異なる符号化雑音により符号化された動画像の場合）も互いに動画像の品質が異なるといえる。〔Overview〕
A hierarchical video decoding device (image decoding device) 1 according to the present embodiment decodes encoded data that has been hierarchically encoded by a hierarchical video encoding device (image encoding device) 2. Hierarchical coding is a coding scheme that hierarchically encodes moving images from low quality to high quality. Hierarchical coding is standardized in SVC and SHVC, for example. Note that the quality of a moving image here widely means an element that affects the appearance of a subjective and objective moving image. The quality of the moving image includes, for example, “resolution”, “frame rate”, “image quality”, and “pixel representation accuracy”. Therefore, hereinafter, if the quality of the moving image is different, it means that, for example, “resolution” is different, but it is not limited thereto. For example, in the case of moving images quantized in different quantization steps (that is, moving images encoded with different encoding noises), it can be said that the quality of moving images is different from each other.

また、階層符号化技術は、階層化される情報の種類の観点から、（１）空間スケーラビリティ、（２）時間スケーラビリティ、（３）ＳＮＲ（Signal to Noise Ratio）スケーラビリティ、および（４）ビュースケーラビリティに分類されることもある。空間スケーラビリティとは、解像度や画像のサイズにおいて階層化する技術である。時間スケーラビリティとは、フレームレート（単位時間のフレーム数）において階層化する技術である。ＳＮＲスケーラビリティは、符号化雑音において階層化する技術である。また、ビュースケーラビリティは、各画像に対応付けられた視点位置において階層化する技術である。 In addition, the hierarchical coding technology is (1) spatial scalability, (2) temporal scalability, (3) SNR (Signal to Noise Ratio) scalability, and (4) view scalability from the viewpoint of the type of information to be layered. May be classified. Spatial scalability is a technique for hierarchizing resolution and image size. Time scalability is a technique for layering at a frame rate (number of frames per unit time). SNR scalability is a technique for layering in coding noise. Also, view scalability is a technique for hierarchizing at the viewpoint position associated with each image.

本実施形態に係る階層動画像符号化装置２及び階層動画像復号装置１の詳細な説明に先立って、まず（１）階層動画像符号化装置２によって生成され、階層動画像復号装置１によって復号される階層符号化データのレイヤ構造を説明し、次いで（２）各レイヤで採用できるデータ構造の具体例について説明を行う。 Prior to detailed description of the hierarchical video encoding device 2 and the hierarchical video decoding device 1 according to the present embodiment, first, (1) the hierarchical video encoding device 2 generates and the hierarchical video decoding device 1 performs decoding. The layer structure of the hierarchically encoded data to be performed will be described, and then (2) a specific example of the data structure that can be adopted in each layer will be described.

〔階層符号化データのレイヤ構造〕
ここで、図２を用いて、階層符号化データの符号化および復号について説明すると次のとおりである。図２は、動画像を、下位階層Ｌ３、中位階層Ｌ２、および上位階層Ｌ１の３階層により階層的に符号化／復号する場合について模式的に表す図である。つまり、図２（ａ）および（ｂ）に示す例では、３階層のうち、上位階層Ｌ１が最上位層となり、下位階層Ｌ３が最下位層となる。 [Layer structure of hierarchically encoded data]
Here, encoding and decoding of hierarchically encoded data will be described with reference to FIG. FIG. 2 is a diagram schematically illustrating a case where a moving image is hierarchically encoded / decoded by three layers of a lower layer L3, a middle layer L2, and an upper layer L1. That is, in the example shown in FIGS. 2A and 2B, of the three layers, the upper layer L1 is the highest layer and the lower layer L3 is the lowest layer.

以下において、階層符号化データから復号され得る特定の品質に対応する復号画像は、特定の階層の復号画像（または、特定の階層に対応する復号画像）と称される（例えば、上位階層Ｌ１の復号画像ＰＯＵＴ＃Ａ）。 In the following, a decoded image corresponding to a specific quality that can be decoded from hierarchically encoded data is referred to as a decoded image of a specific hierarchy (or a decoded image corresponding to a specific hierarchy) (for example, an upper layer L1). Decoded image POUT # A).

図２（ａ）は、入力画像ＰＩＮ＃Ａ〜ＰＩＮ＃Ｃをそれぞれ階層的に符号化して符号化データＤＡＴＡ＃Ａ〜ＤＡＴＡ＃Ｃを生成する階層動画像符号化装置２＃Ａ〜２＃Ｃを示している。図２（ｂ）は、階層的に符号化された符号化データＤＡＴＡ＃Ａ〜ＤＡＴＡ＃Ｃをそれぞれ復号して復号画像ＰＯＵＴ＃Ａ〜ＰＯＵＴ＃Ｃを生成する階層動画像復号装置１＃Ａ〜１＃Ｃを示している。 FIG. 2A shows a hierarchical video encoding device 2 # A-2 # C that generates encoded data DATA # A-DATA # C by hierarchically encoding input images PIN # A-PIN # C, respectively. Is shown. FIG. 2B illustrates a hierarchical video decoding device 1 # A that generates decoded images POUT # A to POUT # C by decoding the hierarchically encoded data DATA # A to DATA # C, respectively. 1 # C is shown.

まず、図２（ａ）を用いて、符号化装置側について説明する。符号化装置側の入力となる入力画像ＰＩＮ＃Ａ、ＰＩＮ＃Ｂ、およびＰＩＮ＃Ｃは、原画は同じだが、画像の品質（解像度、フレームレート、および画質等）が異なる。画像の品質は、入力画像ＰＩＮ＃Ａ、ＰＩＮ＃Ｂ、およびＰＩＮ＃Ｃの順に低くなる。 First, the encoding device side will be described with reference to FIG. The input images PIN # A, PIN # B, and PIN # C that are input on the encoding device side have the same original image but different image quality (resolution, frame rate, image quality, and the like). The image quality decreases in the order of the input images PIN # A, PIN # B, and PIN # C.

下位階層Ｌ３の階層動画像符号化装置２＃Ｃは、下位階層Ｌ３の入力画像ＰＩＮ＃Ｃを符号化して下位階層Ｌ３の符号化データＤＡＴＡ＃Ｃを生成する。下位階層Ｌ３の復号画像ＰＯＵＴ＃Ｃを復号するのに必要な基本情報が含まれる（図２において“Ｃ”にて示している）。下位階層Ｌ３は、最下層の階層であるため、下位階層Ｌ３の符号化データＤＡＴＡ＃Ｃは、基本符号化データとも称される。 The hierarchical video encoding apparatus 2 # C of the lower hierarchy L3 encodes the input image PIN # C of the lower hierarchy L3 to generate encoded data DATA # C of the lower hierarchy L3. Basic information necessary for decoding the decoded image POUT # C of the lower layer L3 is included (indicated by “C” in FIG. 2). Since the lower layer L3 is the lowest layer, the encoded data DATA # C of the lower layer L3 is also referred to as basic encoded data.

また、中位階層Ｌ２の階層動画像符号化装置２＃Ｂは、中位階層Ｌ２の入力画像ＰＩＮ＃Ｂを、下位階層の符号化データＤＡＴＡ＃Ｃを参照しながら符号化して中位階層Ｌ２の符号化データＤＡＴＡ＃Ｂを生成する。中位階層Ｌ２の符号化データＤＡＴＡ＃Ｂには、符号化データＤＡＴＡ＃Ｃに含まれる基本情報“Ｃ”に加えて、中位階層の復号画像ＰＯＵＴ＃Ｂを復号するのに必要な付加的情報（図２において“Ｂ”にて示している）が含まれる。 Further, the hierarchical video encoding apparatus 2 # B of the middle hierarchy L2 encodes the input image PIN # B of the middle hierarchy L2 with reference to the encoded data DATA # C of the lower hierarchy, and performs the middle hierarchy L2 Encoded data DATA # B is generated. In addition to the basic information “C” included in the encoded data DATA # C, additional data necessary for decoding the decoded image POUT # B of the intermediate hierarchy is added to the encoded data DATA # B of the intermediate hierarchy L2. Information (indicated by “B” in FIG. 2) is included.

また、上位階層Ｌ１の階層動画像符号化装置２＃Ａは、上位階層Ｌ１の入力画像ＰＩＮ＃Ａを、中位階層Ｌ２の符号化データＤＡＴＡ＃Ｂを参照しながら符号化して上位階層Ｌ１の符号化データＤＡＴＡ＃Ａを生成する。上位階層Ｌ１の符号化データＤＡＴＡ＃Ａには、下位階層Ｌ３の復号画像ＰＯＵＴ＃Ｃを復号するのに必要な基本情報“Ｃ”および中位階層Ｌ２の復号画像ＰＯＵＴ＃Ｂを復号するのに必要な付加的情報“Ｂ”に加えて、上位階層の復号画像ＰＯＵＴ＃Ａを復号するのに必要な付加的情報（図２において“Ａ”にて示している）が含まれる。 Further, the hierarchical video encoding apparatus 2 # A of the upper hierarchy L1 encodes the input image PIN # A of the upper hierarchy L1 with reference to the encoded data DATA # B of the intermediate hierarchy L2 to Encoded data DATA # A is generated. The encoded data DATA # A of the upper layer L1 is used to decode the basic information “C” necessary for decoding the decoded image POUT # C of the lower layer L3 and the decoded image POUT # B of the middle layer L2. In addition to the necessary additional information “B”, additional information (indicated by “A” in FIG. 2) necessary for decoding the decoded image POUT # A of the upper layer is included.

このように上位階層Ｌ１の符号化データＤＡＴＡ＃Ａは、異なる複数の品質の復号画像に関する情報を含む。 As described above, the encoded data DATA # A of the upper layer L1 includes information related to decoded images having a plurality of different qualities.

次に、図２（ｂ）を参照しながら復号装置側について説明する。復号装置側では、上位階層Ｌ１、中位階層Ｌ２、および下位階層Ｌ３それぞれの階層に応じた復号装置１＃Ａ、１＃Ｂ、および１＃Ｃが、符号化データＤＡＴＡ＃Ａ、ＤＡＴＡ＃Ｂ、およびＤＡＴＡ＃Ｃを復号して復号画像ＰＯＵＴ＃Ａ、ＰＯＵＴ＃Ｂ、およびＰＯＵＴ＃Ｃを出力する。 Next, the decoding device side will be described with reference to FIG. On the decoding device side, the decoding devices 1 # A, 1 # B, and 1 # C corresponding to the layers of the upper layer L1, the middle layer L2, and the lower layer L3 are encoded data DATA # A and DATA # B, respectively. , And DATA # C are decoded to output decoded images POUT # A, POUT # B, and POUT # C.

なお、上位の階層符号化データの一部の情報を抽出して、より下位の特定の復号装置において、当該抽出した情報を復号することで特定の品質の動画像を再生することもできる。 It is also possible to reproduce a moving image having a specific quality by extracting a part of the information of the upper layer encoded data and decoding the extracted information in a lower specific decoding device.

例えば、中位階層Ｌ２の階層復号装置１＃Ｂは、上位階層Ｌ１の階層符号化データＤＡＴＡ＃Ａから、復号画像ＰＯＵＴ＃Ｂを復号するのに必要な情報（すなわち、階層符号化データＤＡＴＡ＃Ａに含まれる“Ｂ”および“Ｃ”）を抽出して、復号画像ＰＯＵＴ＃Ｂを復号してもよい。言い換えれば、復号装置側では、上位階層Ｌ１の階層符号化データＤＡＴＡ＃Ａに含まれる情報に基づいて、復号画像ＰＯＵＴ＃Ａ、ＰＯＵＴ＃Ｂ、およびＰＯＵＴ＃Ｃを復号できる。 For example, the hierarchy decoding apparatus 1 # B of the middle hierarchy L2 receives information necessary for decoding the decoded image POUT # B from the hierarchy encoded data DATA # A of the upper hierarchy L1 (that is, the hierarchy encoded data DATA # A decoded image POUT # B may be decoded by extracting “B” and “C”) included in A. In other words, on the decoding device side, the decoded images POUT # A, POUT # B, and POUT # C can be decoded based on information included in the hierarchically encoded data DATA # A of the upper hierarchy L1.

なお、以上の３階層の階層符号化データに限られず、階層符号化データは、２階層で階層符号化されていてもよいし、３階層よりも多い階層数にて階層符号化されていてもよい。 The hierarchical encoded data is not limited to the above three-layer hierarchical encoded data, and the hierarchical encoded data may be hierarchically encoded with two layers or may be hierarchically encoded with a number of layers larger than three. Good.

また、特定の階層の復号画像に関する符号化データの一部または全部を他の階層とは独立して符号化し、特定の階層の復号の際に、他の階層の情報を参照しなくても済むように階層符号化データを構成してもよい。例えば、図２（ａ）および（ｂ）を用いて上述した例では、復号画像ＰＯＵＴ＃Ｂの復号に“Ｃ”および“Ｂ”を参照すると説明したが、これに限られない。復号画像ＰＯＵＴ＃Ｂが“Ｂ”だけを用いて復号できるように階層符号化データを構成することも可能である。例えば、復号画像ＰＯＵＴ＃Ｂの復号に、“Ｂ”だけから構成される階層符号化データと、復号画像ＰＯＵＴ＃Ｃを入力とする階層動画像復号装置も構成できる。 Also, a part or all of the encoded data related to the decoded image of a specific hierarchy is encoded independently of the other hierarchy, and it is not necessary to refer to information of the other hierarchy when decoding the specific hierarchy. Hierarchically encoded data may be configured as described above. For example, in the example described above with reference to FIGS. 2A and 2B, it has been described that “C” and “B” are referred to for decoding the decoded image POUT # B, but the present invention is not limited thereto. It is also possible to configure the hierarchically encoded data so that the decoded image POUT # B can be decoded using only “B”. For example, it is possible to configure a hierarchical video decoding apparatus that receives the hierarchically encoded data composed only of “B” and the decoded image POUT # C for decoding the decoded image POUT # B.

なお、ＳＮＲスケーラビリティを実現する場合、入力画像ＰＩＮ＃Ａ、ＰＩＮ＃Ｂ、およびＰＩＮ＃Ｃとして同一の原画を用いた上で、復号画像ＰＯＵＴ＃Ａ、ＰＯＵＴ＃Ｂ、およびＰＯＵＴ＃Ｃが異なる画質となるよう階層符号化データを生成することもできる。その場合、下位階層の階層動画像符号化装置が、上位階層の階層動画像符号化装置に較べて、より大きい量子化幅を用いて予測残差を量子化することで階層符号化データを生成する。 When SNR scalability is realized, the same original image is used as the input images PIN # A, PIN # B, and PIN # C, and the decoded images POUT # A, POUT # B, and POUT # C have different image quality. Hierarchically encoded data can also be generated so that In that case, the lower layer hierarchical video encoding device generates hierarchical encoded data by quantizing the prediction residual using a larger quantization width than the upper layer hierarchical video encoding device. To do.

本書では、説明の便宜上、次のとおり用語を定義する。以下の用語は、特に断りがなければ、下記の技術的事項のことを表わすのに用いる。 In this document, the following terms are defined for convenience of explanation. The following terms are used to indicate the following technical matters unless otherwise specified.

上位レイヤ：ある階層よりも上位に位置する階層のことを、上位レイヤと称する。例えば、図２において、下位階層Ｌ３の上位レイヤは、中位階層Ｌ２および上位階層Ｌ１である。また、上位レイヤの復号画像とは、より品質の高い（例えば、解像度が高い、フレームレートが高い、画質が高い等）復号画像のことをいう。 Upper layer: A layer positioned higher than a certain layer is referred to as an upper layer. For example, in FIG. 2, the upper layers of the lower layer L3 are the middle layer L2 and the upper layer L1. The decoded image of the upper layer means a decoded image with higher quality (for example, high resolution, high frame rate, high image quality, etc.).

下位レイヤ：ある階層よりも下位に位置する階層のことを、下位レイヤと称する。例えば、図２において、上位階層Ｌ１の下位レイヤは、中位階層Ｌ２および下位階層Ｌ３である。また、下位レイヤの復号画像とは、より品質の低い復号画像のことをいう。 Lower layer: A layer located lower than a certain layer is referred to as a lower layer. For example, in FIG. 2, the lower layers of the upper layer L1 are the middle layer L2 and the lower layer L3. Further, the decoded image of the lower layer refers to a decoded image with lower quality.

対象レイヤ：復号または符号化の対象となっている階層のことをいう。 Target layer: A layer that is the target of decoding or encoding.

参照レイヤ（reference layer）：対象レイヤに対応する復号画像を復号するのに参照される特定の下位レイヤのことを参照レイヤと称する。 Reference layer: A specific lower layer referred to for decoding a decoded image corresponding to a target layer is referred to as a reference layer.

図２（ａ）および（ｂ）に示した例では、上位階層Ｌ１の参照レイヤは、中位階層Ｌ２および下位階層Ｌ３である。しかしながら、これに限られず、特定の上記レイヤの復号において、下位レイヤのすべてを参照しなくてもよいように階層符号化データを構成することもできる。例えば、上位階層Ｌ１の参照レイヤが、中位階層Ｌ２および下位階層Ｌ３のいずれか一方となるように階層符号化データを構成することも可能である。 In the example shown in FIGS. 2A and 2B, the reference layers of the upper hierarchy L1 are the middle hierarchy L2 and the lower hierarchy L3. However, the present invention is not limited to this, and the hierarchically encoded data can be configured so that it is not necessary to refer to all of the lower layers in decoding of the specific layer. For example, the hierarchical encoded data can be configured such that the reference layer of the upper hierarchy L1 is either the middle hierarchy L2 or the lower hierarchy L3.

基本レイヤ（base layer）：最下層に位置する階層のことを基本レイヤと称する。基本レイヤの復号画像は、符号化データから復号され得るもっとも低い品質の復号画像であり、基本復号画像と呼称される。別の言い方をすれば、基本復号画像は、最下層の階層に対応する復号画像のことである。基本復号画像の復号に必要な階層符号化データの部分符号化データは基本符号化データと呼称される。例えば、上位階層Ｌ１の階層符号化データＤＡＴＡ＃Ａに含まれる基本情報“Ｃ”が基本符号化データである。 Base layer: A layer located at the lowest layer is referred to as a base layer. The decoded image of the base layer is the lowest quality decoded image that can be decoded from the encoded data, and is referred to as a basic decoded image. In other words, the basic decoded image is a decoded image corresponding to the lowest layer. The partially encoded data of the hierarchically encoded data necessary for decoding the basic decoded image is referred to as basic encoded data. For example, the basic information “C” included in the hierarchically encoded data DATA # A of the upper hierarchy L1 is the basic encoded data.

拡張レイヤ：基本レイヤの上位レイヤは、拡張レイヤと称される。 Enhancement layer: The upper layer of the base layer is referred to as an enhancement layer.

レイヤ識別子：レイヤ識別子は、階層を識別するためのものであり、階層と１対１に対応する。階層符号化データには特定の階層の復号画像の復号に必要な部分符号化データを選択するために用いられる階層識別子が含まれる。特定のレイヤに対応するレイヤ識別子に関連付けられた階層符号化データの部分集合は、レイヤ表現とも呼称される。 Layer identifier: The layer identifier is for identifying a hierarchy, and corresponds to the hierarchy one-to-one. The hierarchically encoded data includes a hierarchical identifier used for selecting partial encoded data necessary for decoding a decoded image of a specific hierarchy. A subset of hierarchically encoded data associated with a layer identifier corresponding to a specific layer is also referred to as a layer representation.

一般に、特定の階層の復号画像の復号には、当該階層のレイヤ表現、および／または、当該階層の下位レイヤに対応するレイヤ表現が用いられる。すなわち、対象レイヤの復号画像の復号においては、対象レイヤのレイヤ表現、および／または、対象レイヤの下位レイヤに含まれる１つ以上階層のレイヤ表現が用いられる。 In general, for decoding a decoded image of a specific hierarchy, a layer expression of the hierarchy and / or a layer expression corresponding to a lower layer of the hierarchy is used. That is, in decoding the decoded image of the target layer, layer representation of the target layer and / or layer representation of one or more layers included in a lower layer of the target layer are used.

レイヤ間予測：レイヤ間予測とは、対象レイヤのレイヤ表現と異なる階層（参照レイヤ）のレイヤ表現に含まれるシンタックス要素値、シンタックス要素値より導出される値、および復号画像に基づいて、対象レイヤのシンタックス要素値や対象レイヤの復号に用いられる符号化パラメータ等を予測することである。動き予測に関する情報を参照レイヤの情報から予測するレイヤ間予測のことを動き情報予測と称することもある。また、下位レイヤの復号画像から予測するレイヤ間予測のことをレイヤ間画像予測（あるいはレイヤ間テクスチャ予測）と称することもある。なお、レイヤ間予測に用いられる階層は、例示的には、対象レイヤの下位レイヤである。また、参照レイヤを用いず対象レイヤ内で予測を行うことをレイヤ内予測と称することもある。 Inter-layer prediction: Inter-layer prediction is based on a syntax element value included in a layer expression of a layer (reference layer) different from the layer expression of the target layer, a value derived from the syntax element value, and a decoded image. It is to predict the syntax element value of the target layer, the encoding parameter used for decoding of the target layer, and the like. Inter-layer prediction that predicts information related to motion prediction from reference layer information is sometimes referred to as motion information prediction. In addition, inter-layer prediction predicted from a lower layer decoded image may be referred to as inter-layer image prediction (or inter-layer texture prediction). Note that the hierarchy used for inter-layer prediction is, for example, a lower layer of the target layer. In addition, performing prediction within a target layer without using a reference layer may be referred to as intra-layer prediction.

なお、以上の用語は、飽くまで説明の便宜上のものであり、上記の技術的事項を別の用語にて表現してもかまわない。 Note that the above terms are for convenience of explanation until they are tired, and the above technical matters may be expressed by other terms.

〔階層符号化データのデータ構造について〕
以下、各階層の符号化データを生成する符号化方式として、HEVCおよびその拡張方式を用いる場合について例示する。しかしながら、これに限られず、各階層の符号化データを、MPEG-2や、H.264/AVCなどの符号化方式により生成してもよい。 [Data structure of hierarchically encoded data]
Hereinafter, a case where HEVC and its extension method are used as an encoding method for generating encoded data of each layer will be exemplified. However, the present invention is not limited to this, and the encoded data of each layer may be generated by an encoding method such as MPEG-2 or H.264 / AVC.

また、下位レイヤと上位レイヤとが異なる符号化方式によって符号化されていてもよい。また、各階層の符号化データは、互いに異なる伝送路を介して階層動画像復号装置１に供給されてもよいし、同一の伝送路を介して階層動画像復号装置１に供給されてもよい。 Further, the lower layer and the upper layer may be encoded by different encoding methods. Also, the encoded data of each layer may be supplied to the hierarchical video decoding device 1 via different transmission paths, or may be supplied to the hierarchical video decoding device 1 via the same transmission path. .

例えば、超高精細映像（動画像、４Ｋ映像データ）を基本レイヤおよび１つの拡張レイヤによりスケーラブル符号化して伝送する場合、基本レイヤは、４Ｋ映像データをダウンスケーリングし、インタレース化した映像データをMPEG-2またはH.264/AVCにより符号化してテレビ放送網で伝送し、拡張レイヤは、４Ｋ映像（プログレッシブ）をHEVCにより符号化して、インターネットで伝送してもよい。 For example, when transmitting ultra-high-definition video (moving image, 4K video data) with a base layer and one extended layer in a scalable encoding, the base layer downscales 4K video data, and interlaced video data. It may be encoded by MPEG-2 or H.264 / AVC and transmitted over a television broadcast network, and the enhancement layer may encode 4K video (progressive) with HEVC and transmit over the Internet.

（基本レイヤ）
図３は、基本レイヤにおいて採用できる符号化データ（図２の例でいえば、階層符号化データＤＡＴＡ＃Ｃ）のデータ構造を例示する図である。階層符号化データＤＡＴＡ＃Ｃは、例示的に、シーケンス、およびシーケンスを構成する複数のピクチャを含む。 (Basic layer)
FIG. 3 is a diagram illustrating a data structure of encoded data (hierarchically encoded data DATA # C in the example of FIG. 2) that can be employed in the base layer. Hierarchically encoded data DATA # C illustratively includes a sequence and a plurality of pictures constituting the sequence.

階層符号化データＤＡＴＡ＃Ｃにおけるデータの階層構造を図３に示す。図３の（ａ）〜（ｅ）は、それぞれ、シーケンスＳＥＱを規定するシーケンスレイヤ、ピクチャＰＩＣＴを規定するピクチャレイヤ、スライスＳを規定するスライスレイヤ、符号化ツリーユニット（Coding Tree Unit；ＣＴＵ）を規定するＣＴＵレイヤ、符号化ツリーユニットＣＴＵに含まれる符号化単位（Coding Unit；ＣＵ）を規定するＣＵレイヤを示す図である。 FIG. 3 shows a hierarchical structure of data in the hierarchically encoded data DATA # C. 3A to 3E respectively show a sequence layer that defines the sequence SEQ, a picture layer that defines the picture PICT, a slice layer that defines the slice S, and a coding tree unit (CTU). It is a figure which shows the CU layer which prescribes | regulates the coding unit (Coding Unit; CU) contained in the CTU layer and coding tree unit CTU to prescribe | regulate.

（シーケンスレイヤ）
シーケンスレイヤでは、処理対象のシーケンスＳＥＱ（以下、対象シーケンスとも称する）を復号するために階層動画像復号装置１が参照するデータの集合が規定されている。シーケンスＳＥＱは、図３の（ａ）に示すように、ビデオパラメータセットＶＰＳ（Video Parameter Set）、シーケンスパラメータセットＳＰＳ（Sequence Parameter Set）、ピクチャパラメータセットＰＰＳ（Picture Parameter Set）、ピクチャＰＩＣＴ_１〜ＰＩＣＴ_NP（ＮＰはシーケンスＳＥＱに含まれるピクチャの総数）、及び、付加拡張情報ＳＥＩ（Supplemental Enhancement Information）を含んでいる。 (Sequence layer)
In the sequence layer, a set of data referred to by the hierarchical video decoding device 1 for decoding a sequence SEQ to be processed (hereinafter also referred to as a target sequence) is defined. As shown in FIG. 3A, the sequence SEQ includes a video parameter set VPS (Video Parameter Set), a sequence parameter set SPS (Sequence Parameter Set), a picture parameter set PPS (Picture Parameter Set), and pictures PICT _{1 to} PICT. _{It includes NP} (NP is the total number of pictures included in the sequence SEQ) and supplemental enhancement information (SEI).

ビデオパラメータセットＶＰＳでは、符号化データに含まれるレイヤ数、レイヤ間の依存関係が規定されている。 In the video parameter set VPS, the number of layers included in the encoded data and the dependency relationship between the layers are defined.

シーケンスパラメータセットＳＰＳでは、対象シーケンスを復号するために階層動画像復号装置１が参照する符号化パラメータの集合が規定されている。ＳＰＳは符号化データ内に複数存在してもよい。その場合、対象シーケンス毎に復号に用いられるＳＰＳが複数の候補から選択される。特定シーケンスの復号に使用されるＳＰＳは、アクティブＳＰＳとも呼ばれる。以下では、特に断りがなければ、対象シーケンスに対するアクティブＳＰＳを意味する。 In the sequence parameter set SPS, a set of encoding parameters referred to by the hierarchical video decoding device 1 for decoding the target sequence is defined. A plurality of SPSs may exist in the encoded data. In that case, an SPS used for decoding is selected from a plurality of candidates for each target sequence. An SPS used for decoding a specific sequence is also called an active SPS. In the following, unless otherwise specified, it means an active SPS for the target sequence.

ピクチャパラメータセットＰＰＳでは、対象シーケンス内の各ピクチャを復号するために階層動画像復号装置１が参照する符号化パラメータの集合が規定されている。なお、ＰＰＳは符号化データ内に複数存在してもよい。その場合、対象シーケンス内の各ピクチャから複数のＰＰＳの何れかを選択する。特定ピクチャの復号に使用されるＰＰＳはアクティブＰＰＳとも呼ばれる。以下では、特に断りがなければ、ＰＰＳは対象ピクチャに対するアクティブＰＰＳを意味する。 In the picture parameter set PPS, a set of encoding parameters referred to by the hierarchical video decoding device 1 for decoding each picture in the target sequence is defined. A plurality of PPS may exist in the encoded data. In that case, one of a plurality of PPSs is selected from each picture in the target sequence. A PPS used for decoding a specific picture is also called an active PPS. In the following, unless otherwise specified, PPS means active PPS for the current picture.

（ピクチャレイヤ）
ピクチャレイヤでは、処理対象のピクチャＰＩＣＴ（以下、対象ピクチャとも称する）を復号するために階層動画像復号装置１が参照するデータの集合が規定されている。ピクチャＰＩＣＴは、図３の（ｂ）に示すように、スライスヘッダＳＨ₁〜ＳＨ_NS、及び、スライスＳ₁〜Ｓ_NSを含んでいる（ＮＳはピクチャＰＩＣＴに含まれるスライスの総数）。 (Picture layer)
In the picture layer, a set of data that is referred to by the hierarchical video decoding device 1 in order to decode a picture PICT to be processed (hereinafter also referred to as a target picture) is defined. As shown in FIG. 3B, the picture PICT includes slice headers SH _{1 to} SH _NS and slices S _{1 to} S _NS (NS is the total number of slices included in the picture PICT).

なお、以下、スライスヘッダＳＨ₁〜ＳＨ_NSやスライスＳ₁〜Ｓ_NSのそれぞれを区別する必要が無い場合、符号の添え字を省略して記述することがある。また、以下に説明する階層符号化データＤＡＴＡ＃Ｃに含まれるデータであって、添え字を付している他のデータも同様である。 Hereinafter, when it is not necessary to distinguish each of the slice headers SH _{1 to} SH _NS and the slices S _{1 to} S _NS , the reference numerals may be omitted. The same applies to other data with subscripts included in hierarchically encoded data DATA # C described below.

スライスヘッダＳＨ_kには、対応するスライスＳ_kの復号方法を決定するために階層動画像復号装置１が参照する符号化パラメータ群が含まれている。例えば、ＳＰＳを指定するＳＰＳ識別子（seq_parameter_set_id）や、ＰＰＳを指定するＰＰＳ識別子（pic_parameter_set_id）が含まれる。また、スライスタイプを指定するスライスタイプ指定情報（slice_type）は、スライスヘッダＳＨに含まれる符号化パラメータの一例である。 The slice header SH _k includes a coding parameter group referred to by the hierarchical video decoding device 1 in order to determine a decoding method for the corresponding slice S _k . For example, an SPS identifier (seq_parameter_set_id) that specifies SPS and a PPS identifier (pic_parameter_set_id) that specifies PPS are included. The slice type designation information (slice_type) for designating the slice type is an example of an encoding parameter included in the slice header SH.

スライスタイプ指定情報により指定可能なスライスタイプとしては、（１）符号化の際にイントラ予測のみを用いるＩスライス、（２）符号化の際に単方向予測、又は、イントラ予測を用いるＰスライス、（３）符号化の際に単方向予測、双方向予測、又は、イントラ予測を用いるＢスライスなどが挙げられる。 As slice types that can be specified by the slice type specification information, (1) I slice that uses only intra prediction at the time of encoding, (2) P slice that uses unidirectional prediction or intra prediction at the time of encoding, (3) B-slice using unidirectional prediction, bidirectional prediction, or intra prediction at the time of encoding may be used.

（スライスレイヤ）
スライスレイヤでは、処理対象のスライスＳ（対象スライスとも称する）を復号するために階層動画像復号装置１が参照するデータの集合が規定されている。スライスＳは、図３の（ｃ）に示すように、符号化ツリーユニットＣＴＵ₁〜ＣＴＵ_NC（ＮＣはスライスＳに含まれるＣＴＵの総数）を含んでいる。 (Slice layer)
In the slice layer, a set of data that is referred to by the hierarchical video decoding device 1 in order to decode a slice S (also referred to as a target slice) to be processed is defined. As shown in FIG. 3C, the slice S includes coding tree units CTU _{1 to} CTU _NC (NC is the total number of CTUs included in the slice S).

（ＣＴＵレイヤ）
ＣＴＵレイヤでは、処理対象の符号化ツリーユニットＣＴＵ（以下、対象ＣＴＵとも称する）を復号するために階層動画像復号装置１が参照するデータの集合が規定されている。なお、符号化ツリーユニットのことを符号化ツリーブロック（CTB: Coding Tree block）、または、最大符号化単位（LCU:Largest Cording Unit）と呼ぶこともある。 (CTU layer)
In the CTU layer, a set of data referred to by the hierarchical video decoding device 1 for decoding a coding tree unit CTU to be processed (hereinafter also referred to as a target CTU) is defined. Note that the coding tree unit may be referred to as a coding tree block (CTB) or a maximum coding unit (LCU).

符号化ツリーユニットＣＴＵは、ＣＴＵヘッダＣＴＵＨと、符号化単位情報ＣＵ_１〜ＣＵ_ＮＬ（ＮＬはＣＴＵに含まれる符号化単位情報の総数）とを含む。ここで、まず、符号化ツリーユニットＣＴＵと、符号化単位情報ＣＵとの関係について説明すると次のとおりである。 The coding tree unit CTU includes a CTU header CTUH and coding unit information CU _{1 to} CU _NL (NL is the total number of coding unit information included in the CTU). Here, first, the relationship between the coding tree unit CTU and the coding unit information CU will be described as follows.

符号化ツリーユニットＣＴＵは、イントラ予測またはインター予測、および、変換の各処理ためのブロックサイズを特定するための単位に分割される。 The coding tree unit CTU is divided into units for specifying a block size for each process of intra prediction or inter prediction and transformation.

符号化ツリーユニットＣＴＵの上記単位は、再帰的な４分木分割により分割されている。この再帰的な４分木分割により得られる木構造のことを以下、符号化ツリー（coding tree）と称する。 The unit of the coding tree unit CTU is divided by recursive quadtree partitioning. The tree structure obtained by this recursive quadtree partitioning is hereinafter referred to as a coding tree.

以下、符号化ツリーの末端のノードであるリーフ（leaf）に対応する単位を、符号化ノード（coding node）として参照する。また、符号化ノードは、符号化処理の基本的な単位となるため、以下、符号化ノードのことを、符号化単位（ＣＵ）とも称する。 Hereinafter, a unit corresponding to a leaf, which is a node at the end of the coding tree, is referred to as a coding node. In addition, since the encoding node is a basic unit of the encoding process, hereinafter, the encoding node is also referred to as an encoding unit (CU).

つまり、符号化単位情報（以下、ＣＵ情報と称する）ＣＵ_１〜ＣＵ_ＮＬは、符号化ツリーユニットＣＴＵを再帰的に４分木分割して得られる各符号化ノード（符号化単位）に対応する情報である。 That is, coding unit information (hereinafter referred to as CU information) CU _{1 to} CU _NL corresponds to each coding node (coding unit) obtained by recursively dividing the coding tree unit CTU into quadtrees. Information.

また、符号化ツリーのルート（root）は、符号化ツリーユニットＣＴＵに対応付けられる。換言すれば、符号化ツリーユニットＣＴＵは、複数の符号化ノードを再帰的に含む４分木分割の木構造の最上位ノードに対応付けられる。 The root of the coding tree is associated with the coding tree unit CTU. In other words, the coding tree unit CTU is associated with the highest node of the tree structure of the quadtree partition that recursively includes a plurality of coding nodes.

なお、各符号化ノードのサイズは、当該符号化ノードの親ノードとなる符号化ノード（すなわち、当該符号化ノードの１階層上位のノード）のサイズの縦横とも半分である。 Note that the size of each coding node is half of the size of the coding node that is the parent node of the coding node (that is, the node one layer higher than the coding node).

また、符号化ツリーユニットＣＴＵのサイズ、および、各符号化ユニットのとり得るサイズは、シーケンスパラメータセットＳＰＳに含まれる、最小符号化ノードのサイズ指定情報、および最大符号化ノードと最小符号化ノードの階層深度の差分に依存する。例えば、最小符号化ノードのサイズが８×８画素であって、最大符号化ノードと最小符号化ノードの階層深度の差分が３である場合、符号化ツリーユニットＣＴＵのサイズが６４×６４画素であって、符号化ノードのサイズは、４種類のサイズ、すなわち、６４×６４画素、３２×３２画素、１６×１６画素、および、８×８画素の何れかをとり得る。 Also, the size of the coding tree unit CTU and the size that each coding unit can take are the size designation information of the minimum coding node and the maximum coding node and the minimum coding node included in the sequence parameter set SPS. Depends on hierarchy depth difference. For example, when the size of the minimum coding node is 8 × 8 pixels and the difference in the layer depth between the maximum coding node and the minimum coding node is 3, the size of the coding tree unit CTU is 64 × 64 pixels. Thus, the size of the encoding node can take any of four sizes, that is, 64 × 64 pixels, 32 × 32 pixels, 16 × 16 pixels, and 8 × 8 pixels.

（ＣＴＵヘッダ）
ＣＴＵヘッダＣＴＵＨには、対象ＣＴＵの復号方法を決定するために階層動画像復号装置１が参照する符号化パラメータが含まれる。具体的には、図３の（ｄ）に示すように、対象ＣＴＵの各ＣＵへの分割パターンを指定するＣＴＵ分割情報ＳＰ＿ＣＴＵ、および、量子化ステップの大きさを指定する量子化パラメータ差分Δｑｐ（qp_delta）が含まれる。 (CTU header)
The CTU header CTUH includes an encoding parameter referred to by the hierarchical video decoding device 1 in order to determine a decoding method of the target CTU. Specifically, as shown in FIG. 3 (d), CTU division information SP_CTU for designating a division pattern of the target CTU into each CU, and a quantization parameter difference Δqp (for designating the quantization step size) qp_delta).

ＣＴＵ分割情報ＳＰ＿ＣＴＵは、ＣＴＵを分割するための符号化ツリーを表す情報であり、具体的には、対象ＣＴＵに含まれる各ＣＵの形状、サイズ、および、対象ＣＴＵ内での位置を指定する情報である。 The CTU division information SP_CTU is information representing a coding tree for dividing the CTU, and specifically, information specifying the shape and size of each CU included in the target CTU and the position in the target CTU. It is.

なお、ＣＴＵ分割情報ＳＰ＿ＣＴＵは、ＣＵの形状やサイズを明示的に含んでいなくてもよい。例えばＣＴＵ分割情報ＳＰ＿ＣＴＵは、対象ＣＴＵ全体またはＣＴＵの部分領域を四分割するか否かを示すフラグの集合であってもよい。その場合、ＣＴＵの形状やサイズを併用することで各ＣＵの形状やサイズを特定できる。 Note that the CTU partition information SP_CTU may not explicitly include the shape or size of the CU. For example, the CTU division information SP_CTU may be a set of flags indicating whether or not the entire target CTU or a partial region of the CTU is to be divided into four. In that case, the shape and size of each CU can be specified by using the shape and size of the CTU together.

また、量子化パラメータ差分Δｑｐは、対象ＣＴＵにおける量子化パラメータｑｐと、当該対象ＣＴＵの直前に符号化されたＣＴＵにおける量子化パラメータｑｐ’との差分ｑｐ−ｑｐ’である。 The quantization parameter difference Δqp is a difference qp−qp ′ between the quantization parameter qp in the target CTU and the quantization parameter qp ′ in the CTU encoded immediately before the target CTU.

（ＣＵレイヤ）
ＣＵレイヤでは、処理対象のＣＵ（以下、対象ＣＵとも称する）を復号するために階層動画像復号装置１が参照するデータの集合が規定されている。 (CU layer)
In the CU layer, a set of data referred to by the hierarchical video decoding device 1 for decoding a CU to be processed (hereinafter also referred to as a target CU) is defined.

ここで、ＣＵ情報ＣＵに含まれるデータの具体的な内容の説明をする前に、ＣＵに含まれるデータの木構造について説明する。符号化ノードは、予測ツリー（prediction tree；ＰＴ）および変換ツリー（transform tree；ＴＴ）のルートのノードとなる。予測ツリーおよび変換ツリーについて説明すると次のとおりである。 Here, before describing specific contents of data included in the CU information CU, a tree structure of data included in the CU will be described. The encoding node is a node at the root of a prediction tree (PT) and a transform tree (TT). The prediction tree and the conversion tree are described as follows.

予測ツリーにおいては、符号化ノードが１または複数の予測ブロックに分割され、各予測ブロックの位置とサイズとが規定される。換言すれば、予測ブロックは、符号化ノードを構成する１または複数の重複しない領域である。また、予測ツリーは、上述の分割により得られた１または複数の予測ブロックを含む。 In the prediction tree, the encoding node is divided into one or a plurality of prediction blocks, and the position and size of each prediction block are defined. In other words, the prediction block is one or a plurality of non-overlapping areas constituting the encoding node. The prediction tree includes one or a plurality of prediction blocks obtained by the above division.

予測処理は、この予測ブロックごとに行われる。以下、予測の単位である予測ブロックのことを、予測単位（prediction unit；ＰＵ）とも称する。 Prediction processing is performed for each prediction block. Hereinafter, a prediction block that is a unit of prediction is also referred to as a prediction unit (PU).

予測ツリーにおける分割（以下、ＰＵ分割と略称する）の種類は、大まかにいえば、イントラ予測の場合と、インター予測の場合との２つがある。 Broadly speaking, there are two types of partitioning in the prediction tree (hereinafter abbreviated as PU partitioning): intra prediction and inter prediction.

イントラ予測の場合、分割方法は、２Ｎ×２Ｎ（符号化ノードと同一サイズ）と、Ｎ×Ｎとがある。 In the case of intra prediction, there are 2N × 2N (the same size as the encoding node) and N × N division methods.

また、インター予測の場合、分割方法は、２Ｎ×２Ｎ（符号化ノードと同一サイズ）、２Ｎ×Ｎ、２Ｎ×ｎＵ、２Ｎ×ｎＤ、Ｎ×２Ｎ、ｎＬ×２Ｎ、および、ｎＲ×２Ｎなどがある。 In the case of inter prediction, 2N × 2N (the same size as the encoding node), 2N × N, 2N × nU, 2N × nD, N × 2N, nL × 2N, nR × 2N, and the like are used as division methods. is there.

また、変換ツリーにおいては、符号化ノードが１または複数の変換ブロックに分割され、各変換ブロックの位置とサイズとが規定される。換言すれば、変換ブロックは、符号化ノードを構成する１または複数の重複しない領域のことである。また、変換ツリーは、上述の分割より得られた１または複数の変換ブロックを含む。 In the transform tree, the encoding node is divided into one or a plurality of transform blocks, and the position and size of each transform block are defined. In other words, the transform block is one or a plurality of non-overlapping areas constituting the encoding node. The conversion tree includes one or a plurality of conversion blocks obtained by the above division.

変換ツリーにおける分割には、符号化ノードと同一のサイズの領域を変換ブロックとして割り付けるものと、上述したツリーブロックの分割と同様、再帰的な４分木分割によるものがある。 There are two types of division in the transformation tree: one in which an area having the same size as that of a coding node is assigned as a transformation block, and the other in division by recursive quadtree division as in the above-described division of a tree block.

変換処理は、この変換ブロックごとに行われる。以下、変換の単位である変換ブロックのことを、変換単位（transform unit；ＴＵ）とも称する。 The conversion process is performed for each conversion block. Hereinafter, a transform block that is a unit of transform is also referred to as a transform unit (TU).

（ＣＵ情報のデータ構造）
続いて、図３（ｅ）を参照しながらＣＵ情報ＣＵに含まれるデータの具体的な内容を説明する。図３（ｅ）に示すように、ＣＵ情報ＣＵは、具体的には、スキップフラグＳＫＩＰ、予測ツリー情報（以下、ＰＴ情報と略称する）ＰＴＩ、および、変換ツリー情報（以下、ＴＴ情報と略称する）ＴＴＩを含む。 (Data structure of CU information)
Next, specific contents of data included in the CU information CU will be described with reference to FIG. As shown in FIG. 3E, the CU information CU specifically includes a skip flag SKIP, prediction tree information (hereinafter abbreviated as PT information) PTI, and conversion tree information (hereinafter abbreviated as TT information). Include TTI).

スキップフラグＳＫＩＰは、対象のＰＵについて、スキップモードが適用されているか否かを示すフラグであり、スキップフラグＳＫＩＰの値が１の場合、すなわち、対象ＣＵにスキップモードが適用されている場合、そのＣＵ情報ＣＵにおけるＰＴ情報ＰＴＩの一部、および、ＴＴ情報ＴＴＩは省略される。なお、スキップフラグＳＫＩＰは、Ｉスライスでは省略される。 The skip flag SKIP is a flag indicating whether or not the skip mode is applied to the target PU. When the value of the skip flag SKIP is 1, that is, when the skip mode is applied to the target CU, A part of the PT information PTI and the TT information TTI in the CU information CU are omitted. Note that the skip flag SKIP is omitted for the I slice.

［ＰＴ情報］
ＰＴ情報ＰＴＩは、ＣＵに含まれる予測ツリー（以下、ＰＴと略称する）に関する情報である。言い換えれば、ＰＴ情報ＰＴＩは、ＰＴに含まれる１または複数のＰＵそれぞれに関する情報の集合であり、階層動画像復号装置１により予測画像を生成する際に参照される。ＰＴ情報ＰＴＩは、図３（ｅ）に示すように、予測タイプ情報ＰＴｙｐｅ、および、予測情報ＰＩｎｆｏを含んでいる。 [PT information]
The PT information PTI is information related to a prediction tree (hereinafter abbreviated as PT) included in the CU. In other words, the PT information PTI is a set of information related to each of one or a plurality of PUs included in the PT, and is referred to when a predicted image is generated by the hierarchical video decoding device 1. As shown in FIG. 3E, the PT information PTI includes prediction type information PType and prediction information PInfo.

予測タイプ情報ＰＴｙｐｅは、対象ＰＵについての予測画像生成方法を指定する情報である。ベースレイヤにおいては、イントラ予測を用いるのか、または、インター予測を用いるのかを指定する情報である。 The prediction type information PType is information that specifies a predicted image generation method for the target PU. In the base layer, it is information that specifies whether intra prediction or inter prediction is used.

予測情報ＰＩｎｆｏは、予測タイプ情報ＰＴｙｐｅで指定される予測方法において用いられる予測情報である。ベースレイヤにおいては、イントラ予測の場合にイントラ予測情報PP_Intraが含まれる。また、インター予測の場合にはインター予測情報PP_Interを含む。 The prediction information PInfo is prediction information used in the prediction method specified by the prediction type information PType. In the base layer, intra prediction information PP_Intra is included in the case of intra prediction. In the case of inter prediction, inter prediction information PP_Inter is included.

インター予測情報PP_Interは、階層動画像復号装置１が、インター予測によってインター予測画像を生成する際に参照される予測情報を含む。より具体的には、インター予測情報PP_Interは、対象ＣＵの各インターＰＵへの分割パターンを指定するインターＰＵ分割情報、および、各インターＰＵについてのインター予測パラメータ（動き補償パラメータ）を含む。インター予測パラメータとしては、例えば、マージフラグ（merge_flag）、マージインデックス（merge_idx）、推定動きベクトルインデックス（mvp_idx）、参照ピクチャインデックス（ref_idx）、インター予測フラグ（inter_pred_flag）、および動きベクトル残差（mvd）を含む。 The inter prediction information PP_Inter includes prediction information that is referred to when the hierarchical video decoding device 1 generates an inter prediction image by inter prediction. More specifically, the inter prediction information PP_Inter includes inter PU division information that specifies a division pattern of the target CU into each inter PU, and inter prediction parameters (motion compensation parameters) for each inter PU. Examples of inter prediction parameters include a merge flag (merge_flag), a merge index (merge_idx), an estimated motion vector index (mvp_idx), a reference picture index (ref_idx), an inter prediction flag (inter_pred_flag), and a motion vector residual (mvd) including.

イントラ予測情報PP_Intraは、階層動画像復号装置１が、イントラ予測によってイントラ予測画像を生成する際に参照される符号化パラメータを含む。より具体的には、イントラ予測情報PP_Intraには、対象ＣＵの各イントラＰＵへの分割パターンを指定するイントラＰＵ分割情報、および、各イントラＰＵについてのイントラ予測パラメータが含まれる。イントラ予測パラメータは、各イントラＰＵについてのイントラ予測方法（予測モード）を指定するためのパラメータである。 The intra prediction information PP_Intra includes an encoding parameter that is referred to when the hierarchical video decoding device 1 generates an intra predicted image by intra prediction. More specifically, the intra prediction information PP_Intra includes intra PU division information that specifies a division pattern of the target CU into each intra PU, and intra prediction parameters for each intra PU. The intra prediction parameter is a parameter for designating an intra prediction method (prediction mode) for each intra PU.

ここで、イントラ予測パラメータは、各イントラＰＵについてのイントラ予測（予測モード）を復元するためのパラメータである。予測モードを復元するためのパラメータには、ＭＰＭ（Most Probable Mode、以下同様）に関するフラグであるmpm_flag、ＭＰＭを選択するためのインデックスであるmpm_idx、および、ＭＰＭ以外の予測モードを指定するためのインデックスであるrem_idxが含まれる。ここで、ＭＰＭとは、対象パーティションで選択される可能性が高い推定予測モードである。例えば、対象パーティションの周辺のパーティションに割り付けられた予測モードに基づいて推定された推定予測モードや、一般的に発生確率の高いＤＣモードやＰｌａｎａｒモードがＭＰＭに含まれ得る。 Here, the intra prediction parameter is a parameter for restoring intra prediction (prediction mode) for each intra PU. The parameters for restoring the prediction mode include mpm_flag which is a flag related to MPM (Most Probable Mode, the same applies hereinafter), mpm_idx which is an index for selecting the MPM, and an index for specifying a prediction mode other than MPM. Rem_idx is included. Here, MPM is an estimated prediction mode that is highly likely to be selected in the target partition. For example, the MPM may include an estimated prediction mode estimated based on prediction modes assigned to partitions around the target partition, and a DC mode or Planar mode that generally has a high probability of occurrence.

また、以下において、単に“予測モード”と表記する場合、特にことわりのない限り、輝度予測モードのことを指すものとする。色差予測モードについては、“色差予測モード”と表記し、輝度予測モードと区別する。また、予測モードを復元するパラメータには、色差予測モードを指定するためのパラメータであるchroma_modeが含まれる。 In the following description, when simply described as “prediction mode”, it means the luminance prediction mode unless otherwise specified. The color difference prediction mode is described as “color difference prediction mode” and is distinguished from the luminance prediction mode. The parameter for restoring the prediction mode includes chroma_mode that is a parameter for designating the color difference prediction mode.

［ＴＴ情報］
ＴＴ情報ＴＴＩは、ＣＵに含まれる変換ツリー（以下、ＴＴと略称する）に関する情報である。言い換えれば、ＴＴ情報ＴＴＩは、ＴＴに含まれる１または複数の変換ブロックそれぞれに関する情報の集合であり、階層動画像復号装置１により残差データを復号する際に参照される。 [TT information]
The TT information TTI is information regarding a conversion tree (hereinafter abbreviated as TT) included in the CU. In other words, the TT information TTI is a set of information regarding each of one or a plurality of transform blocks included in the TT, and is referred to when the hierarchical video decoding device 1 decodes residual data.

ＴＴ情報ＴＴＩは、図３（ｅ）に示すように、対象ＣＵの各変換ブロックへの分割パターンを指定するＴＴ分割情報ＳＰ＿ＴＴ、および、量子化予測残差ＱＤ₁〜ＱＤ_NT（ＮＴは、対象ＣＵに含まれるブロックの総数）を含んでいる。 As shown in FIG. 3 (e), the TT information TTI includes TT division information SP_TT that specifies a division pattern of the target CU into each transform block, and quantized prediction residuals QD _{1 to} QD _NT (NT is the target The total number of blocks included in the CU).

ＴＴ分割情報ＳＰ＿ＴＴは、具体的には、対象ＣＵに含まれる各変換ブロックの形状、および、対象ＣＵ内での位置を決定するための情報である。例えば、ＴＴ分割情報ＳＰ＿ＴＴは、対象ノードの分割を行うのか否かを示す情報（split_transform_unit_flag）と、その分割の深度を示す情報（trafoDepth）とから実現できる。 Specifically, the TT division information SP_TT is information for determining the shape of each transform block included in the target CU and the position in the target CU. For example, the TT division information SP_TT can be realized from information (split_transform_unit_flag) indicating whether or not the target node is divided and information (trafoDepth) indicating the division depth.

また、例えば、ＣＵサイズが、６４×６４の場合、分割により得られる各変換ブロックは、３２×３２画素から４×４画素までのサイズをとり得る。 For example, when the CU size is 64 × 64, each transform block obtained by the division can take a size from 32 × 32 pixels to 4 × 4 pixels.

各量子化予測残差ＱＤは、階層動画像符号化装置２が以下の処理１〜３を、処理対象の変換ブロックである対象ブロックに施すことによって生成した符号化データである。 Each quantization prediction residual QD is encoded data generated by the hierarchical video encoding device 2 performing the following processes 1 to 3 on a target block that is a conversion block to be processed.

処理１：符号化対象画像から予測画像を減算した予測残差を周波数変換（例えば、ＤＣＴ変換（Discrete Cosine Transform）およびＤＳＴ変換（Discrete Sine Transform）等）する；
処理２：処理１にて得られた変換係数を量子化する；
処理３：処理２にて量子化された変換係数を可変長符号化する；
なお、上述した量子化パラメータｑｐは、階層動画像符号化装置２が変換係数を量子化する際に用いた量子化ステップＱＰの大きさを表す（ＱＰ＝２^qp/6）。 Process 1: The prediction residual obtained by subtracting the prediction image from the encoding target image is subjected to frequency conversion (for example, DCT conversion (Discrete Cosine Transform) and DST conversion (Discrete Sine Transform));
Process 2: Quantize the transform coefficient obtained in Process 1;
Process 3: Variable length coding is performed on the transform coefficient quantized in Process 2;
Note that the quantization parameter qp described above represents the size of the quantization step QP used when the hierarchical moving image encoding apparatus 2 quantizes the transform coefficient (QP = 2 ^{qp / 6} ).

（ＰＵ分割情報）
ＰＵ分割情報によって指定されるＰＵ分割タイプには、対象ＣＵのサイズを２Ｎ×２Ｎ画素とすると、次の合計８種類のパターンがある。すなわち、２Ｎ×２Ｎ画素、２Ｎ×Ｎ画素、Ｎ×２Ｎ画素、およびＮ×Ｎ画素の４つの対称的分割（symmetric splittings）、並びに、２Ｎ×ｎＵ画素、２Ｎ×ｎＤ画素、ｎＬ×２Ｎ画素、およびｎＲ×２Ｎ画素の４つの非対称的分割（asymmetric splittings）である。なお、Ｎ＝２^ｍ（ｍは１以上の任意の整数）を意味している。以下、対象ＣＵを分割して得られる予測単位のことを予測ブロック、または、パーティションと称する。 (PU partition information)
The PU partition type specified by the PU partition information includes the following eight patterns in total, assuming that the size of the target CU is 2N × 2N pixels. That is, 4 symmetric splittings of 2N × 2N pixels, 2N × N pixels, N × 2N pixels, and N × N pixels, and 2N × nU pixels, 2N × nD pixels, nL × 2N pixels, And four asymmetric splittings of nR × 2N pixels. N = 2 ^m (m is an arbitrary integer of 1 or more). Hereinafter, a prediction unit obtained by dividing the target CU is referred to as a prediction block or a partition.

（拡張レイヤ）
拡張レイヤのレイヤ表現に含まれる符号化データ（以下、拡張レイヤ符号化データ）についても、例えば、図３に示すデータ構造とほぼ同様のデータ構造を採用できる。ただし、拡張レイヤ符号化データでは、以下のとおり、付加的な情報を追加したり、パラメータを省略できる。 (Enhancement layer)
For encoded data included in the layer representation of the enhancement layer (hereinafter, enhancement layer encoded data), for example, a data structure substantially similar to the data structure shown in FIG. 3 can be adopted. However, in the enhancement layer encoded data, additional information can be added or parameters can be omitted as follows.

スライスレイヤでは、空間スケーラビリティ、時間スケーラビリティ、および、ＳＮＲスケーラビリティ、ビュースケーラビリティの階層の識別情報（それぞれ、dependency_id、temporal_id、quality_id、および、view_id）が符号化されていてもよい。 In the slice layer, spatial scalability, temporal scalability, SNR scalability, and view scalability hierarchy identification information (dependency_id, temporal_id, quality_id, and view_id, respectively) may be encoded.

また、ＣＵ情報ＣＵに含まれる予測タイプ情報ＰＴｙｐｅは、対象ＣＵについての予測画像生成方法がイントラ予測、インター予測、または、レイヤ間画像予測のいずれかを指定する情報である。予測タイプ情報ＰＴｙｐｅには、レイヤ間画像予測モードの適用有無を指定するフラグ（レイヤ間画像予測フラグ）を含む。なお、レイヤ間画像予測フラグは、texture_rl_flag、inter_layer_pred_flag、または、base_mode_flagと呼ばれることもある。 Further, the prediction type information PType included in the CU information CU is information that specifies whether the prediction image generation method for the target CU is intra prediction, inter prediction, or inter-layer image prediction. The prediction type information PType includes a flag (inter-layer image prediction flag) that specifies whether or not to apply the inter-layer image prediction mode. Note that the inter-layer image prediction flag may be referred to as texture_rl_flag, inter_layer_pred_flag, or base_mode_flag.

拡張レイヤにおいて、対象ＣＵのＣＵタイプが、イントラＣＵ、レイヤ間ＣＵ、インターＣＵ、スキップＣＵのいずれであるかが指定されていてもよい。 In the enhancement layer, it may be specified whether the CU type of the target CU is an intra CU, an inter-layer CU, an inter CU, or a skip CU.

イントラＣＵは、ベースレイヤにおけるイントラＣＵと同様に定義できる。イントラＣＵでは、レイヤ間画像予測フラグが“０”に、予測モードフラグが“０”に設定される。 An intra CU can be defined similarly to an intra CU in the base layer. In the intra CU, the inter-layer image prediction flag is set to “0”, and the prediction mode flag is set to “0”.

レイヤ間ＣＵは、参照レイヤのピクチャの復号画像を予測画像生成に用いるＣＵと定義できる。レイヤ間ＣＵでは、レイヤ間画像予測フラグが“１”に、予測モードフラグが“０”に設定される。 An inter-layer CU can be defined as a CU that uses a decoded image of a picture of a reference layer to generate a predicted image. In the inter-layer CU, the inter-layer image prediction flag is set to “1” and the prediction mode flag is set to “0”.

スキップＣＵは、上述のHEVC方式の場合と同様に定義できる。例えば、スキップＣＵでは、スキップフラグに“１”が設定される。 The skip CU can be defined in the same manner as in the HEVC scheme described above. For example, in the skip CU, “1” is set in the skip flag.

インターＣＵは、非スキップかつ動き補償（MC；Motion Compensation）を適用するＣＵと定義されていてもよい。インターＣＵでは、例えば、スキップフラグに“０”が設定され、予測モードフラグに“１”が設定される。 The inter CU may be defined as a CU that applies non-skip and motion compensation (MC). In the inter CU, for example, “0” is set in the skip flag and “1” is set in the prediction mode flag.

また、上述のとおり拡張レイヤの符号化データを、下位レイヤの符号化方式と異なる符号化方式により生成しても構わない。すなわち、拡張レイヤの符号化・復号処理は、下位レイヤのコーデックの種類に依存しない。 Further, as described above, the encoded data of the enhancement layer may be generated by an encoding method different from the encoding method of the lower layer. That is, the encoding / decoding process of the enhancement layer does not depend on the type of the lower layer codec.

下位レイヤが、例えば、MPEG-2や、H.264/AVC方式によって符号化されていてもよい。 The lower layer may be encoded by, for example, MPEG-2 or H.264 / AVC format.

拡張レイヤ符号化データでは、VPSが拡張されて、レイヤ間の参照構造を表すパラメータが含まれていてもよい。 In the enhancement layer encoded data, the VPS may be extended to include a parameter representing a reference structure between layers.

また、拡張レイヤ符号化データでは、SPS、PPS、スライスヘッダが拡張されて、レイヤ間画像予測に用いる参照レイヤの復号画像に係る情報（例えば、後述のレイヤ間参照ピクチャセット、レイヤ間参照ピクチャリスト、ベース制御情報等を直接、または、間接的に導出するためのシンタックス）が含まれていてもよい。 Also, in the enhancement layer encoded data, SPS, PPS, and slice header are extended, and information related to a decoded image of a reference layer used for inter-layer image prediction (for example, an inter-layer reference picture set, an inter-layer reference picture list described later) , Syntax for deriving base control information or the like directly or indirectly).

なお、以上に説明したパラメータは、単独で符号化されていてもよいし、複数のパラメータが複合的に符号化されていてもよい。複数のパラメータが複合的に符号化される場合は、そのパラメータの値の組み合わせに対してインデックスが割り当てられ、割り当てられた当該インデックスが符号化される。また、パラメータが、別のパラメータや、復号済みの情報から導出可能であれば、当該パラメータの符号化を省略できる。 Note that the parameters described above may be encoded independently, or a plurality of parameters may be encoded in combination. When a plurality of parameters are encoded in combination, an index is assigned to the combination of parameter values, and the assigned index is encoded. Also, if the parameter can be derived from another parameter or decoded information, the encoding of the parameter can be omitted.

〔階層動画像復号装置〕
以下では、本実施形態に係る階層動画像復号装置１の構成について、図１〜図１６を参照して説明する。 [Hierarchical video decoding device]
Below, the structure of the hierarchy moving image decoding apparatus 1 which concerns on this embodiment is demonstrated with reference to FIGS.

（階層動画像復号装置の構成）
図４を用いて、階層動画像復号装置１の概略的構成を説明すると次のとおりである。図４は、階層動画像復号装置１の概略的構成を示した機能ブロック図である。階層動画像復号装置１は、階層動画像符号化装置２から供給される階層符号化データＤＡＴＡを復号して、対象レイヤの復号画像ＰＯＵＴ＃Ｔを生成する。なお、以下では、対象レイヤは拡張レイヤであるとして説明する。そのため、対象レイヤは、参照レイヤに対する上位レイヤでもある。逆に、参照レイヤは、対象レイヤに対する下位レイヤでもある。 (Configuration of Hierarchical Video Decoding Device)
The schematic configuration of the hierarchical video decoding device 1 will be described with reference to FIG. FIG. 4 is a functional block diagram showing a schematic configuration of the hierarchical video decoding device 1. The hierarchical video decoding device 1 decodes the hierarchical encoded data DATA supplied from the hierarchical video encoding device 2 to generate a target layer decoded image POUT # T. In the following description, it is assumed that the target layer is an enhancement layer. Therefore, the target layer is also an upper layer with respect to the reference layer. Conversely, the reference layer is also a lower layer with respect to the target layer.

図４に示すように階層動画像復号装置１は、ＮＡＬ逆多重化部１１、可変長復号部１２、ピクチャ復号部１３、復号ピクチャ管理部１４、および、ベース復号部１５を備える。 As shown in FIG. 4, the hierarchical video decoding device 1 includes a NAL demultiplexing unit 11, a variable length decoding unit 12, a picture decoding unit 13, a decoded picture management unit 14, and a base decoding unit 15.

ＮＡＬ逆多重化部１１は、ＮＡＬ（Network Abstraction Layer）におけるＮＡＬユニット単位で伝送される階層符号化データＤＡＴＡを逆多重化する。 The NAL demultiplexing unit 11 demultiplexes hierarchically encoded data DATA transmitted in units of NAL units in NAL (Network Abstraction Layer).

ＮＡＬは、ＶＣＬ（Video Coding Layer）と、符号化データを伝送・蓄積する下位システムとの間における通信を抽象化するために設けられる層である。 The NAL is a layer provided to abstract communication between a VCL (Video Coding Layer) and a lower system that transmits and stores encoded data.

ＶＣＬは、動画像符号化処理を行う層のことであり、ＶＣＬにおいて符号化が行われる。一方、ここでいう、下位システムは、H.264/AVCおよびHEVCのファイルフォーマットや、MPEG-2システムに対応する。 VCL is a layer that performs moving image encoding processing, and encoding is performed in VCL. On the other hand, the lower system here corresponds to the H.264 / AVC and HEVC file formats and the MPEG-2 system.

なお、ＮＡＬでは、ＶＣＬで生成されたビットストリームが、ＮＡＬユニットという単位で区切られて、宛先となる下位システムへ伝送される。ＮＡＬユニットには、ＶＣＬで符号化された符号化データ、および、当該符号化データが宛先の下位システムに適切に届けられるためのヘッダが含まれる。また、各階層における符号化データは、ＮＡＬユニット格納されることでＮＡＬ多重化されて階層動画像復号装置１に伝送される。 In NAL, a bit stream generated by VCL is divided into units called NAL units and transmitted to a destination lower system. The NAL unit includes encoded data encoded by the VCL and a header for appropriately delivering the encoded data to the destination lower system. Also, the encoded data in each layer is stored in the NAL unit, is NAL multiplexed, and is transmitted to the hierarchical moving image decoding apparatus 1.

ＮＡＬ逆多重化部１１は、階層符号化データＤＡＴＡを逆多重化して、対象レイヤ符号化データＤＡＴＡ＃Ｔおよび参照レイヤ符号化データＤＡＴＡ＃Ｒを取り出す。また、ＮＡＬ逆多重化部１１は、対象レイヤ符号化データＤＡＴＡ＃Ｔを可変長復号部１２に供給するとともに、参照レイヤ符号化データＤＡＴＡ＃Ｒをベース復号部１５に供給する。 The NAL demultiplexing unit 11 demultiplexes the hierarchical encoded data DATA, and extracts the target layer encoded data DATA # T and the reference layer encoded data DATA # R. Further, the NAL demultiplexing unit 11 supplies the target layer encoded data DATA # T to the variable length decoding unit 12 and also supplies the reference layer encoded data DATA # R to the base decoding unit 15.

可変長復号部１２は、対象レイヤ符号化データＤＡＴＡ＃Ｔに含まれるバイナリから各種のシンタックス値を復号して出力する。 The variable length decoding unit 12 decodes and outputs various syntax values from the binary included in the target layer encoded data DATA # T.

可変長復号部１２で復号されるシンタックス値には、変換係数や予測情報に係るシンタックス値が含まれる。 The syntax value decoded by the variable length decoding unit 12 includes a syntax value related to transform coefficients and prediction information.

また、可変長復号部１２で復号されるシンタックス値には、対象レイヤの復号に用いるパラメータセット、すなわち、VPS、SPS、PPSに含まれるシンタックス値が含まれる。また、スライスヘッダに含まれるシンタックス値も含まれる。 The syntax value decoded by the variable length decoding unit 12 includes a parameter set used for decoding of the target layer, that is, a syntax value included in VPS, SPS, and PPS. Further, the syntax value included in the slice header is also included.

可変長復号部１２は、スライスヘッダ、予測情報、および、変換係数に係るシンタックス値をピクチャ復号部１３に供給する。また、可変長復号部１２は、パラメータセット、および、スライスヘッダに係るシンタックス値を復号ピクチャ管理部１４に供給する。 The variable length decoding unit 12 supplies a slice header, prediction information, and a syntax value related to the transform coefficient to the picture decoding unit 13. Further, the variable length decoding unit 12 supplies the parameter set and the syntax value related to the slice header to the decoded picture management unit 14.

ピクチャ復号部１３は、対象ピクチャに関するスライスヘッダ、予測情報、および、変換係数に係るシンタックス値に基づいて、対象ピクチャの復号ピクチャを復号する。復号ピクチャは復号ピクチャ管理部１４に出力される。 The picture decoding unit 13 decodes the decoded picture of the target picture based on the slice header related to the target picture, the prediction information, and the syntax value related to the transform coefficient. The decoded picture is output to the decoded picture management unit 14.

なお、ピクチャ復号部１３におけるインター予測による予測画像生成には、復号ピクチャ管理部１４内のDPBに記録されている復号ピクチャであって、入力される参照ピクチャリストに含まれるピクチャが参照ピクチャとして利用される。また、ピクチャ復号部１３におけるレイヤ間画像予測による予測画像生成には、復号ピクチャ管理部１４内のDPBに記録されているレイヤ間参照ピクチャであって、入力される参照ピクチャリストに含まれるピクチャが参照ピクチャとして利用される。 Note that, in the prediction picture generation by inter prediction in the picture decoding unit 13, a decoded picture recorded in the DPB in the decoded picture management unit 14 and included in the input reference picture list is used as a reference picture. Is done. In addition, in the prediction image generation by the inter-layer image prediction in the picture decoding unit 13, an inter-layer reference picture recorded in the DPB in the decoded picture management unit 14 and included in the input reference picture list is used. Used as a reference picture.

復号ピクチャ管理部１４は、入力される復号ピクチャやベース復号ピクチャを内部のDPBに記録するとともに、入力されるシンタックス値に基づいて参照ピクチャリスト生成や出力ピクチャ決定を行う。なお、復号ピクチャ管理部１４は本実施形態における重要な構成要素であるため、詳細を後述する。 The decoded picture management unit 14 records the input decoded picture and the base decoded picture in the internal DPB, and performs reference picture list generation and output picture determination based on the input syntax value. The decoded picture management unit 14 is an important component in this embodiment, and will be described in detail later.

ベース復号部１５は、参照レイヤ符号化データＤＡＴＡ＃Ｒからベース復号ピクチャを復号する。ベース復号ピクチャは、対象レイヤの復号ピクチャ復号時に利用される参照レイヤの復号ピクチャである。なお、ベース復号ピクチャは１レイヤの復号ピクチャとは限らない。すなわち、対象レイヤが複数の参照レイヤを参照する場合は、ベース復号部１５は、参照する複数のレイヤに対応するベース復号ピクチャを復号する。ベース復号部１５は、復号したベース復号ピクチャを復号ピクチャ管理部１４に供給する。なお、ベース復号部１５の詳細は後述する。 The base decoding unit 15 decodes a base decoded picture from the reference layer encoded data DATA # R. The base decoded picture is a decoded picture of the reference layer used when decoding the decoded picture of the target layer. Note that the base decoded picture is not necessarily a one-layer decoded picture. That is, when the target layer refers to a plurality of reference layers, the base decoding unit 15 decodes base decoded pictures corresponding to the plurality of layers to be referenced. The base decoding unit 15 supplies the decoded base decoded picture to the decoded picture management unit 14. Details of the base decoding unit 15 will be described later.

以下では、復号ピクチャ管理部１４とベース復号部１５の詳細を説明する。 Below, the detail of the decoded picture management part 14 and the base decoding part 15 is demonstrated.

（復号ピクチャ管理部）
図５を用いて、復号ピクチャ管理部１４の詳細構成を説明する。図５は、復号ピクチャ管理部１４の構成を例示した機能ブロック図である。 (Decoded picture management unit)
The detailed configuration of the decoded picture management unit 14 will be described with reference to FIG. FIG. 5 is a functional block diagram illustrating the configuration of the decoded picture management unit 14.

図５に示すように、復号ピクチャ管理部１４は、DPB１４１、RPS導出部１４２、参照ピクチャ制御部１４３、ベース参照ピクチャ制御部１４４、RPL導出部１４５、出力制御部１４６を備える。 As shown in FIG. 5, the decoded picture management unit 14 includes a DPB 141, an RPS deriving unit 142, a reference picture control unit 143, a base reference picture control unit 144, an RPL deriving unit 145, and an output control unit 146.

＜DPB１４１＞
DPB１４１は、復号ピクチャバッファ（Decoded Picture Buffer）とも呼ばれ、ピクチャ復号部１３で復号された対象レイヤの各ピクチャの復号ピクチャを記録する。DPBには、対象レイヤの各ピクチャに対応する復号ピクチャが出力順（POC: Picture Order Count）に関連付けて記録されている。加えて、DPBの各ピクチャに対しては、参照マークおよび出力マークを設定できる。 <DPB141>
The DPB 141 is also called a decoded picture buffer, and records the decoded picture of each picture of the target layer decoded by the picture decoding unit 13. In the DPB, a decoded picture corresponding to each picture of the target layer is recorded in association with an output order (POC: Picture Order Count). In addition, a reference mark and an output mark can be set for each DPB picture.

参照マークは、DPB上のピクチャが対象ピクチャ以降の復号処理における予測画像生成処理（例えば、インター予測やレイヤ間画像予測）への利用可否を示す情報である。参照マークは、具体的には、「短期参照使用」（「used for short-term reference」）、「長期参照使用」（「used for long-term reference」）、「参照不使用」（「not used for reference」）のいずれかの値を取る。 The reference mark is information indicating whether or not the picture on the DPB can be used for prediction image generation processing (for example, inter prediction or inter-layer image prediction) in decoding processing after the target picture. Specifically, the reference marks are "short-term reference use" ("used for short-term reference"), "long-term reference use" ("used for long-term reference"), "not used" ("not used for reference ").

なお、参照マークの取り得る値を上記とするがそれに限らない。例えば、「レイヤ間参照使用」（「used for inter-layer reference」）の値に参照マークを設定できてもよい。また、「短期参照使用」と「長期参照使用」の区別をせず、両者の和集合を「参照使用」（「used for reference」）と定義してもよい。 In addition, although the value which a reference mark can take is set as above, it is not restricted to it. For example, a reference mark may be set to the value of “used for inter-layer reference”. Further, without distinguishing between “short-term reference use” and “long-term reference use”, the union of both may be defined as “used for reference”.

出力マークは、DPB上のピクチャを外部へ出力する必要性の有無を示す情報である。出力マークは、具体的には、「出力要」（「needed for output」）、「出力不要」（「not needed for output」）のいずれかの値を取る。 The output mark is information indicating whether or not a picture on the DPB needs to be output to the outside. Specifically, the output mark takes one of the values “needed for output” and “not needed for output”.

なお、参照マークや出力マークは復号処理や符号化処理の特定のタイミングで明示的に設定されていなくてもよく、その場合、参照マークまたは出力マークは「未定義」と判定する。 Note that the reference mark and the output mark may not be explicitly set at a specific timing of the decoding process or the encoding process. In this case, the reference mark or the output mark is determined as “undefined”.

ここでDPBの状態の例を図６を参照して説明する。図６は、復号処理の特定のタイミングでのDPBの状態を例示している。図６において、DPBには4枚のピクチャが記録されており、それぞれ、POC、参照マーク（図中ではRM）、出力マーク（図中ではOM）が関連付けられている。POCがａ、参照マークがb、出力マークがcのピクチャを「a-b-c」と表記すると、DPBには「2-LR-NNfO」、「10-NR-NNfO」、「14-SR-NfO」、「12-unk-NfO」のピクチャが含まれている。なお、参照マークの値は略記号を用いて、「長期参照使用」を“LR”、「長期参照使用」を“SR”、「長期参照使用」を“NR”、「未定義」を“unk”と表記している。また、出力マークの値は略記号を用いて、「出力要」を“NfO”、「出力不要」を“NNfO”と表記している。本発明に関する以降の説明において、特に断りがなければ、DPB内のピクチャの状態を表すために上記の表記方法を用いる。 Here, an example of the state of the DPB will be described with reference to FIG. FIG. 6 illustrates the state of the DPB at a specific timing of the decoding process. In FIG. 6, four pictures are recorded in the DPB, and a POC, a reference mark (RM in the figure), and an output mark (OM in the figure) are associated with each other. If a picture with POC a, reference mark b, and output mark c is written as “abc”, DPB has “2-LR-NNfO”, “10-NR-NNfO”, “14-SR-NfO”, “ The picture “12-unk-NfO” is included. Note that the value of the reference mark is abbreviated. “Long-term reference use” is “LR”, “Long-term reference use” is “SR”, “Long-term reference use” is “NR”, “Undefined” is “unk” ". In addition, the value of the output mark is abbreviated to indicate “output required” as “NfO” and “output unnecessary” as “NNfO”. In the following description regarding the present invention, unless otherwise specified, the above-described notation method is used to represent the state of a picture in the DPB.

このDPB状態の例では、「10-NR-NNfO」のピクチャは「参照不使用」であって「出力不要」であるからDPBに保持しておく必要はなく、出来るだけ早い段階にDPBから削除することで、DPBのメモリ使用量を節約できる。 In this DPB state example, the picture of “10-NR-NNfO” is “reference not used” and “output unnecessary”, so it is not necessary to keep it in the DPB, and it is deleted from the DPB as early as possible. By doing so, you can save DPB memory usage.

＜RPS導出部１４２＞
RPS導出部１４２は、入力されたRPSに係るシンタックス値に基づいて対象ピクチャの復号処理に用いるRPS（参照ピクチャセット）を導出してベース参照ピクチャ制御部１４４、参照ピクチャ制御部１４３、および、RPL導出部１４５に出力する。 <RPS deriving unit 142>
The RPS deriving unit 142 derives an RPS (reference picture set) used for decoding processing of the target picture based on the syntax value related to the input RPS, and a base reference picture control unit 144, a reference picture control unit 143, and The data is output to the RPL deriving unit 145.

以下ではRPSに係るシンタックス値、および、当該シンタックス値に基づくRPS導出処理の詳細をそれぞれ説明する。 Hereinafter, a syntax value related to RPS and details of RPS derivation processing based on the syntax value will be described.

（RPS情報）
RPSに係るシンタックス値（以下、RPS情報）は以下を含む。
１．SPS短期RPS情報：SPSに含まれる短期参照ピクチャセット情報
２．SPS長期RP情報：SPSに含まれる長期参照ピクチャ情報
３．SH短期RPS情報：スライスヘッダに含まれる短期参照ピクチャセット情報
４．SH長期RP情報：スライスヘッダに含まれる長期参照ピクチャ情報
５．ILRP情報：レイヤ間参照ピクチャ情報
（１．SPS短期RPS情報）
SPS短期RPS情報は、SPSを参照する各ピクチャから利用され得る複数の短期RPSの情報を含む。なお、短期RPSは、対象ピクチャに対する相対的な位置（例えば対象ピクチャとのPOC差分）により指定される参照ピクチャ（短期参照ピクチャ）となり得るピクチャの集合である。 (RPS information)
Syntax values related to RPS (hereinafter referred to as RPS information) include the following.
1. 1. SPS short-term RPS information: short-term reference picture set information included in the SPS 2. SPS long-term RP information: long-term reference picture information included in SPS SH short-term RPS information: short-term reference picture set information included in the slice header SH long-term RP information: long-term reference picture information included in the slice header ILRP information: Inter-layer reference picture information (1. SPS short-term RPS information)
The SPS short-term RPS information includes information on a plurality of short-term RPSs that can be used from each picture referring to the SPS. Note that the short-term RPS is a set of pictures that can be a reference picture (short-term reference picture) specified by a relative position with respect to the target picture (for example, a POC difference from the target picture).

SPS短期RPS情報を、図７を参照して説明する。図７は、SPS復号時に利用されるSPSシンタックス表の一部を例示している。図７の（Ａ）の部分がSPS短期RPS情報に相当する。SPS短期RPS情報には、SPSに含まれる短期RPS数（num_short_term_ref_pic_sets）、および、各短期RPSの定義（short_term_ref_pic_set(i)）が含まれる。 The SPS short-term RPS information will be described with reference to FIG. FIG. 7 illustrates a part of the SPS syntax table used at the time of SPS decoding. The portion (A) in FIG. 7 corresponds to SPS short-term RPS information. The SPS short-term RPS information includes the number of short-term RPS included in the SPS (num_short_term_ref_pic_sets) and the definition of each short-term RPS (short_term_ref_pic_set (i)).

短期RPS情報について、図８を参照して説明する。図８は、SPS復号時、および、スライスヘッダ復号時に利用される短期RPSのシンタックス表を例示している。 The short-term RPS information will be described with reference to FIG. FIG. 8 illustrates a short-term RPS syntax table used at the time of SPS decoding and slice header decoding.

短期RPS情報には、対象ピクチャより表示順が早い短期参照ピクチャ数（num_negative_pics）、および、対象ピクチャより表示順が遅い短期参照ピクチャ数（num_positive_pics）が含まれる。なお、以下では、対象ピクチャより表示順が早い短期参照ピクチャを前方短期参照ピクチャ、対象ピクチャより表示順が遅い短期参照ピクチャを後方短期参照ピクチャと呼ぶ。 The short-term RPS information includes the number of short-term reference pictures (num_negative_pics) whose display order is earlier than that of the target picture and the number of short-term reference pictures (num_positive_pics) whose display order is later than that of the target picture. In the following, a short-term reference picture whose display order is earlier than the target picture is referred to as a front short-term reference picture, and a short-term reference picture whose display order is later than the target picture is referred to as a rear short-term reference picture.

また、短期RPS情報には、各前方短期参照ピクチャに対して、対象ピクチャに対するPOC差分の絶対値（delta_poc_s0_minus1[i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_s0_flag[i]）が含まれる。加えて、各後方短期参照ピクチャに対して、対象ピクチャに対するPOC差分の絶対値（delta_poc_s1_minus1[i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_s1_flag[i]）が含まれる。 The short-term RPS information includes, for each forward short-term reference picture, the absolute value of the POC difference with respect to the target picture (delta_poc_s0_minus1 [i]), and the presence / absence of use as a reference picture of the target picture (used_by_curr_pic_s0_flag [ i]). In addition, for each backward short-term reference picture, the absolute value of the POC difference with respect to the target picture (delta_poc_s1_minus1 [i]) and the possibility of being used as the reference picture of the target picture (used_by_curr_pic_s1_flag [i]) are included It is.

（２．SPS長期RP情報）
SPS長期RP情報は、SPSを参照する各ピクチャから利用され得る複数の長期参照ピクチャの情報を含む。なお、長期参照ピクチャとは、シーケンス内の絶対的な位置（例えばPOC）により指定されるピクチャである。 (2. SPS long-term RP information)
The SPS long-term RP information includes information on a plurality of long-term reference pictures that can be used from each picture that references the SPS. A long-term reference picture is a picture specified by an absolute position (for example, POC) in a sequence.

SPS長期RP情報の復号について、図７を再び参照して説明する。図７の（Ｂ）の部分がSPS長期RP情報に相当する。SPS長期RP情報には、SPSで伝送される長期参照ピクチャの有無を示す情報（long_term_ref_pics_present_flag）、SPSに含まれる長期参照ピクチャ数（num_long_term_ref_pics_sps）、および、各長期参照ピクチャの情報が含まれる。長期参照ピクチャの情報には、参照ピクチャのPOC（lt_ref_pic_poc_lsb_sps[i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_lt_sps_flag[i]）が含まれる。 Decoding of SPS long-term RP information will be described with reference to FIG. 7 again. 7B corresponds to the SPS long-term RP information. The SPS long-term RP information includes information (long_term_ref_pics_present_flag) indicating the presence or absence of a long-term reference picture transmitted in the SPS, the number of long-term reference pictures included in the SPS (num_long_term_ref_pics_sps), and information on each long-term reference picture. The long-term reference picture information includes the POC (lt_ref_pic_poc_lsb_sps [i]) of the reference picture and the possibility of being used as the reference picture of the target picture (used_by_curr_pic_lt_sps_flag [i]).

なお、上記参照ピクチャのPOCは、参照ピクチャに関連付けられたPOCの値自体であってもよいし、POCのLSB（Least Significant Bit）、すなわち、POCを既定の２の冪乗の数で割った余りの値を用いてもよい。 The POC of the reference picture may be the POC value associated with the reference picture, or the LSB (Least Significant Bit) of the POC, that is, the POC divided by a predetermined number of powers of 2. The remainder value may be used.

（３．SH短期RPS情報）
SH短期RPS情報は、スライスヘッダを参照するピクチャから利用され得る単一の短期RPSの情報を含む。 (3. SH short-term RPS information)
The SH short-term RPS information includes information of a single short-term RPS that can be used from a picture that references a slice header.

SPS短期RPS情報の復号について、図９を参照して説明する。図９は、スライスヘッダ復号時に利用されるスライスヘッダシンタックス表の一部を例示している。図９の（Ａ）の部分がSH短期RPS情報に相当する。SH短期RPS情報は、短期RPSをSPSで復号済みの短期RPSの中から選択するか、スライスヘッダに明示的に含めるかを示すフラグ（short_term_ref_pic_set_sps_flag）を含む。SPSで復号済の中から選択する場合、復号済の短期RPSを一つ選択する識別子（short_term_ref_pic_set_idx）が含まれる。スライスヘッダに明示的に含める場合は、前述の図８を参照して説明したシンタックス表（short_term_ref_pic_set(idx)）に相当する情報が、SPS短期RPS情報に含まれる。 Decoding of SPS short-term RPS information will be described with reference to FIG. FIG. 9 illustrates a part of a slice header syntax table used at the time of decoding the slice header. 9A corresponds to the SH short-term RPS information. The SH short-term RPS information includes a flag (short_term_ref_pic_set_sps_flag) indicating whether the short-term RPS is selected from the short-term RPS decoded by the SPS or explicitly included in the slice header. When selecting from SPS decoded, an identifier (short_term_ref_pic_set_idx) for selecting one decoded short-term RPS is included. When explicitly included in the slice header, information corresponding to the syntax table (short_term_ref_pic_set (idx)) described with reference to FIG. 8 is included in the SPS short-term RPS information.

（４．SH長期RP情報）
SH長期RP情報は、スライスヘッダを参照するピクチャから利用され得る長期参照ピクチャの情報を含む。 (4. SH long-term RP information)
The SH long-term RP information includes information on a long-term reference picture that can be used from a picture that references a slice header.

SH長期RP情報の復号について、図９を再び参照して説明する。図９の（Ｂ）の部分がSH長期RP情報に相当する。SH長期RP情報は、対象ピクチャで長期参照ピクチャが利用可能（long_term_ref_pic_present_flag）である場合のみスライスヘッダに含まれる。SPSで１以上の長期参照ピクチャを復号済である場合（num_long_term_ref_pics_sps>0）、SPSで復号済の長期参照ピクチャの中で対象ピクチャで参照され得る参照ピクチャの数（num_long_term_sps）がSH長期RP情報に含まれる。また、スライスヘッダで明示的に伝送される長期参照ピクチャ数（num_long_term_pics）がSH長期RP情報に含まれる。加えて、上記num_long_term_spsの数の長期参照ピクチャをSPSで伝送済の長期参照ピクチャの中から選択する情報（lt_idx_sps[i]）がSH長期RP情報に含まれる。さらに、スライスヘッダに明示的に含める長期参照ピクチャの情報として、上記num_long_term_picsの数だけ、参照ピクチャのPOC（poc_lsb_lt [i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_lt_flag[i]）が含まれる。 Decoding of SH long-term RP information will be described with reference to FIG. 9 again. 9B corresponds to the SH long-term RP information. The SH long-term RP information is included in the slice header only when a long-term reference picture is available in the target picture (long_term_ref_pic_present_flag). When one or more long-term reference pictures have been decoded by SPS (num_long_term_ref_pics_sps> 0), the number of reference pictures (num_long_term_sps) that can be referred to by the target picture among the long-term reference pictures decoded by SPS is the SH long-term RP information included. Further, the number of long-term reference pictures (num_long_term_pics) explicitly transmitted in the slice header is included in the SH long-term RP information. In addition, information (lt_idx_sps [i]) for selecting the num_long_term_sps number of long-term reference pictures from among the long-term reference pictures transmitted by the SPS is included in the SH long-term RP information. Furthermore, as information on long-term reference pictures to be explicitly included in the slice header, the number of reference pictures POC (poc_lsb_lt [i]) and the presence / absence of use as a reference picture of the target picture (used_by_curr_pic_lt_flag) [i]) is included.

（５．ILRP情報）
ILRP情報は、スライスヘッダを含むピクチャからレイヤ間予測で参照され得るレイヤ間参照ピクチャの情報を含む。 (5. ILRP information)
The ILRP information includes information on an inter-layer reference picture that can be referred to by inter-layer prediction from a picture including a slice header.

ILRP情報について、図１０を参照して説明する。図１０はスライス復号時に参照されるシンタックス表の一部であって、ILRP情報に相当する部分である。 ILRP information will be described with reference to FIG. FIG. 10 shows a part of a syntax table that is referred to at the time of slice decoding, and corresponds to ILRP information.

ILRP情報は、レイヤ間予測有効フラグ（inter_layer_pred_enabled_flag）を含む。さらに、レイヤ間予測有効フラグが１であり（レイヤ間予測が有効であり）、かつ、対象ピクチャから参照可能な参照レイヤ数（NumDirectRefLayers[nuh_layer_id]）が１より大きい場合、レイヤ間参照ピクチャ数を表すシンタックス（num_inter_layr_ref_pics_minus1）がILRP情報に含まれる。対象ピクチャにおけるレイヤ間参照ピクチャ数（NumActiveRefLayerPics）は“num_inter_layer_ref_pics_minus1 + 1”の値に設定される。加えて、各レイヤ間参照ピクチャの属するレイヤを示すレイヤ識別子（inter_layer_pred_layer_idc[i]）がILRP情報に含まれる。 The ILRP information includes an inter-layer prediction enabled flag (inter_layer_pred_enabled_flag). Furthermore, when the inter-layer prediction valid flag is 1 (inter-layer prediction is valid) and the number of reference layers that can be referred from the target picture (NumDirectRefLayers [nuh_layer_id]) is greater than 1, the number of inter-layer reference pictures is set. The representing syntax (num_inter_layr_ref_pics_minus1) is included in the ILRP information. The number of inter-layer reference pictures (NumActiveRefLayerPics) in the target picture is set to a value of “num_inter_layer_ref_pics_minus1 + 1”. In addition, a layer identifier (inter_layer_pred_layer_idc [i]) indicating a layer to which each inter-layer reference picture belongs is included in the ILRP information.

なお、ILRP情報に含まれる上記の各シンタックスは、自明である場合は省略されてもよい。例えば、１つのピクチャから参照可能なレイヤ間参照ピクチャ数が１枚に制限されている場合、レイヤ間参照ピクチャ数に係るシンタックスは不要である。 Note that each syntax included in the ILRP information may be omitted if it is obvious. For example, when the number of inter-layer reference pictures that can be referenced from one picture is limited to one, the syntax related to the number of inter-layer reference pictures is not necessary.

（RPS導出処理）
次にRPS情報に基づくRPS導出処理の詳細を説明する。 (RPS derivation process)
Next, details of the RPS deriving process based on the RPS information will be described.

RPS導出処理は複数のRPSのサブセットを導出処理に分けられる。RPSのサブセットは以下のように定義される。 The RPS derivation process can divide a plurality of RPS subsets into derivation processes. A subset of RPS is defined as follows:

RPSは参照可能ピクチャの種類に応じて次の２つのサブセットに分けられる。
・現ピクチャ参照可能リストListCurr：対象ピクチャにおける参照可能ピクチャのリスト
・後続ピクチャ参照可能リストListFoll：対象ピクチャでは参照されないが、対象ピクチャに復号順で後続のピクチャで参照可能なピクチャのリスト
なお、現ピクチャ参照可能リストに含まれるピクチャの数を、現ピクチャ参照可能ピクチャ数NumCurrListと呼ぶ。 The RPS is divided into the following two subsets depending on the type of the referenceable picture.
-Current picture referenceable list ListCurr: List of referenceable pictures in the target picture-Subsequent picture referenceable list ListFoll: List of pictures that are not referenced in the target picture but can be referred to in subsequent pictures in the decoding order of the target picture The number of pictures included in the picture referable list is referred to as the current picture referable picture number NumCurrList.

現ピクチャ参照可能リストは、さらに４つの部分リストから構成される。
・現ピクチャ長期参照可能リストListLtCurr：SPS長期RP情報またはSH長期RP情報により指定される現ピクチャ参照可能ピクチャ
・現ピクチャ短期前方参照可能リストListStCurrBefore：SPS短期RPS情報またはＳＨ短期RPS情報により指定される現ピクチャ参照可能ピクチャであって、表示順が対象ピクチャより早いもの
・現ピクチャ短期後方参照可能リストListStCurrAfter：SPS短期RPS情報またはＳＨ短期RPS情報により指定される現ピクチャ参照可能ピクチャであって、表示順が対象ピクチャより早いもの
・現ピクチャレイヤ間参照可能リストListIlCurr：ILRP情報により指定される現ピクチャ参照可能ピクチャ
後続ピクチャ参照可能リストは、さらに２つの部分リストから構成される。
・後続ピクチャ長期参照可能リストListLtFoll：SPS長期RP情報またはＳＨ長期RP情報により指定される後続ピクチャ参照可能ピクチャ。
・後続ピクチャ短期参照可能リストListStFoll：SPS短期RPS情報またはＳＨ短期RPS情報により指定される現ピクチャ参照可能ピクチャ。 The current picture referable list further includes four partial lists.
Current picture long-term referable list ListLtCurr: Current picture referenceable picture specified by SPS long-term RP information or SH long-term RP information Current picture short-term forward-referenceable list ListStCurrBefore: Specified by SPS short-term RPS information or SH short-term RPS information Current picture referenceable picture whose display order is earlier than the current picture. Current picture short-term backward referenceable list ListStCurrAfter: current picture referenceable picture specified by SPS short-term RPS information or SH short-term RPS information, and displayed Current picture layer referenceable list ListIlCurr: Current picture referenceable picture specified by ILRP information The subsequent picture referenceable list is further composed of two partial lists.
Subsequent picture long-term referable list ListLtFoll: Subsequent picture referenceable picture specified by SPS long-term RP information or SH long-term RP information.
Subsequent picture short-term referable list ListStFoll: current picture referable picture specified by SPS short-term RPS information or SH short-term RPS information.

RPSを構成する、現ピクチャ短期前方参照可能リストListStCurrBefore、現ピクチャ短期後方参照可能リストListStCurrAfter、現ピクチャ長期参照可能リストListLtCurr、現ピクチャレイヤ間参照可能リストListIlCurr、後続ピクチャ短期参照可能リストListStFoll、および、後続ピクチャ長期参照可能リストListLtFollを次の手順で生成する。加えて、現ピクチャ参照可能ピクチャ数NumPocTotalCurrを導出する。なお、前記各参照可能リストは、以下の処理の開始前に空に設定されている。
（Ｓ１０１）SPS短期RPS情報、および、SH短期RPS情報に基づいて、対象ピクチャの復号に用いる単一の短期参照ピクチャセットを特定する。具体的には、SH短期RPS情報に含まれるshort_term_ref_pic_set_spsの値が０である場合、SH短期RPS情報に含まれるスライスヘッダで明示的に伝送された短期RPSを選択する。それ以外（short_term_ref_pic_set_spsの値が１の場合、ＳＨ短期RPS情報に含まれるshort_term_ref_pic_set_idxが示す短期RPSを、SPS短期RPS情報に含まれる複数の短期RPSの中から選択する。
（Ｓ１０２）選択された短期RPSに含まれる参照ピクチャ各々のPOCを導出する。参照ピクチャのPOCは、参照ピクチャが前方短期参照ピクチャの場合、対象ピクチャのPOCから「delta_poc_s0_minus1[i]+1」の値を減算して導出する。一方、参照ピクチャが後方短期参照ピクチャの場合、対象ピクチャのPOCに「delta_poc_s1_minus1[i]+1」の値を加算して導出する。
（Ｓ１０３）短期RPSに含まれる前方参照ピクチャを伝送された順に確認し、関連付けられているused_by_curr_pic_s0_flag[i]の値が1である場合、当該前方参照ピクチャを現ピクチャ短期前方参照可能リストListStCurrBeforeに追加する。それ以外（used_by_curr_pic_s0_flag[i]の値が0）の場合、当該前方参照ピクチャを後続ピクチャ短期参照可能リストListStFollに追加する。
（Ｓ１０４）短期RPSに含まれる後方参照ピクチャを伝送された順に確認し、関連付けられているused_by_curr_pic_s1_flag[i]の値が1である場合、当該後方参照ピクチャを現ピクチャ短期後方参照可能リストListStCurrAfterに追加する。それ以外（used_by_curr_pic_s1_flag[i]の値が０の場合、当該前方参照ピクチャを後続ピクチャ短期参照可能リストListStFollに追加する。
（Ｓ１０５） SPS長期RP情報、および、SH長期RP情報に基づいて、対象ピクチャの復号に用いる長期参照ピクチャを特定する。具体的には、num_long_term_spsの数の参照ピクチャをSPS長期RP情報に含まれる参照ピクチャの中から選択して、順に長期RPSに追加する。選択される参照ピクチャは、lt_idx_sps[i]の示す参照ピクチャである。続いて、num_long_term_picsの数の参照ピクチャをSH長期RP情報に含まれる参照ピクチャを順に長期RPSに追加する。
（Ｓ１０６）長期RPSに含まれる参照ピクチャ各々のPOCを導出する。長期参照ピクチャのPOCは、関連付けて復号されたpoc_lst_lt[i]、または、lt_ref_pic_poc_lsb_sps[i]の値から直接導出される。
（Ｓ１０７）長期RPSに含まれる参照ピクチャを順に確認し、関連付けられているused_by_curr_pic_lt_flag[i]、または、used_by_curr_pic_lt_sps_flag[i]の値が1である場合、当該長期参照ピクチャを現ピクチャ長期参照可能リストListLtCurrに追加する。それ以外（used_by_curr_pic_lt_flag[i]、または、used_by_curr_pic_lt_sps_flag[i]の値が0）の場合、当該長期参照ピクチャを後続ピクチャ長期参照可能リストListLtFollに追加する。
（Ｓ１０８）ILRP情報により指定される参照ピクチャを現ピクチャレイヤ間参照可能リストListIlCurrに追加する。追加は、レイヤ間参照予測が有効（inter_layer_pred_enabled_flagが１）の場合に次の順に実行される。まず、ILRP情報に含まれるレイヤ間参照ピクチャ数個の参照レイヤを、参照レイヤ識別子（inter_layer_pred_layer_idc[i]）に基づき選択してアクティブ参照レイヤに設定する。次に、アクティブ参照レイヤ上の対象ピクチャと同じPOCと関連付けられたピクチャを現ピクチャレイヤ間参照可能リストListIlCurrに追加する。
（Ｓ１０９）変数NumPocTotalCurrの値を、現ピクチャから参照可能な参照ピクチャの和に設定する。すなわち、変数NumPocTotalCurrの値を、現ピクチャ短期前方参照可能リストListStCurrBefore、現ピクチャ短期後方参照可能リストListStCurrAfter、現ピクチャ長期参照可能リストListLtCurr、および、現ピクチャレイヤ間参照可能リストListIlCurrの４つのリストの各要素数の和に設定する。 The current picture short-term forward referenceable list ListStCurrBefore, current picture short-term backward referenceable list ListStCurrAfter, current picture long-term referenceable list ListLtCurr, current picture inter-layer referenceable list ListIlCurr, subsequent picture short-term referenceable list ListStFoll, and RPS The subsequent picture long-term referable list ListLtFoll is generated by the following procedure. In addition, the current picture referenceable picture number NumPocTotalCurr is derived. Each of the referable lists is set to be empty before starting the following process.
(S101) Based on the SPS short-term RPS information and the SH short-term RPS information, a single short-term reference picture set used for decoding the current picture is specified. Specifically, when the value of short_term_ref_pic_set_sps included in the SH short-term RPS information is 0, the short-term RPS explicitly transmitted in the slice header included in the SH short-term RPS information is selected. Other than that (when the value of short_term_ref_pic_set_sps is 1, the short-term RPS indicated by short_term_ref_pic_set_idx included in the SH short-term RPS information is selected from a plurality of short-term RPSs included in the SPS short-term RPS information.
(S102) The POC of each reference picture included in the selected short-term RPS is derived. When the reference picture is a forward short-term reference picture, the reference picture POC is derived by subtracting the value of “delta_poc_s0_minus1 [i] +1” from the POC of the target picture. On the other hand, when the reference picture is a backward short-term reference picture, it is derived by adding the value of “delta_poc_s1_minus1 [i] +1” to the POC of the target picture.
(S103) Confirm the forward reference pictures included in the short-term RPS in the order of transmission, and if the associated used_by_curr_pic_s0_flag [i] value is 1, add the forward reference picture to the current picture short-term forward-referenceable list ListStCurrBefore To do. Otherwise (used_by_curr_pic_s0_flag [i] value is 0), the forward reference picture is added to the subsequent picture short-term referable list ListStFoll.
(S104) The backward reference pictures included in the short-term RPS are confirmed in the order of transmission, and if the associated used_by_curr_pic_s1_flag [i] value is 1, the backward reference picture is added to the current picture short-term backward-referenceable list ListStCurrAfter To do. Other than that (when the value of used_by_curr_pic_s1_flag [i] is 0, the forward reference picture is added to the subsequent picture short-term referable list ListStFoll.
(S105) Based on the SPS long-term RP information and the SH long-term RP information, a long-term reference picture used for decoding the current picture is specified. Specifically, num_long_term_sps reference pictures are selected from the reference pictures included in the SPS long-term RP information, and are sequentially added to the long-term RPS. The selected reference picture is the reference picture indicated by lt_idx_sps [i]. Subsequently, reference pictures included in the SH long-term RP information are sequentially added to the long-term RPS as many reference pictures as num_long_term_pics.
(S106) The POC of each reference picture included in the long-term RPS is derived. The POC of the long-term reference picture is directly derived from the value of poc_lst_lt [i] or lt_ref_pic_poc_lsb_sps [i] decoded in association with each other.
(S107) Check the reference pictures included in the long-term RPS in order, and if the value of associated used_by_curr_pic_lt_flag [i] or used_by_curr_pic_lt_sps_flag [i] is 1, the long-term reference picture is listed as the current picture long-term referenceable list ListLtCurr Add to In other cases (used_by_curr_pic_lt_flag [i] or used_by_curr_pic_lt_sps_flag [i] has a value of 0), the long-term reference picture is added to the subsequent picture long-term referable list ListLtFoll.
(S108) The reference picture specified by the ILRP information is added to the current picture layer referenceable list ListIlCurr. The addition is executed in the following order when inter-layer reference prediction is enabled (inter_layer_pred_enabled_flag is 1). First, reference layers of several inter-layer reference pictures included in ILRP information are selected based on a reference layer identifier (inter_layer_pred_layer_idc [i]) and set as an active reference layer. Next, a picture associated with the same POC as the target picture on the active reference layer is added to the inter-picture-layer referenceable list ListIlCurr.
(S109) The value of the variable NumPocTotalCurr is set to the sum of reference pictures that can be referenced from the current picture. That is, the value of the variable NumPocTotalCurr is set to each of the four lists of the current picture short-term forward reference list ListStCurrBefore, the current picture short-term backward reference list ListStCurrAfter, the current picture long-term reference list ListLtCurr, and the current picture inter-layer referenceable list ListIlCurr. Set to the sum of the number of elements.

＜参照ピクチャ制御部１４３＞
参照ピクチャ制御部１４３は、入力されるRPSに基づいて、DPB１４１を更新する。概略的には、参照ピクチャ制御部１４３は、入力されたRPSにおいて対象ピクチャ（現ピクチャ）のインター予測で参照可能と示されているピクチャの参照マークを「参照使用」（「短期参照使用」または「長期参照使用」）に設定する。加えて、DPBに記録されている対象レイヤの復号ピクチャであって、前記の処理で「参照使用」とマークされないピクチャを「参照不使用」に設定する。なお、DPB上のレイヤ間参照ピクチャの参照マークは参照ピクチャ制御部１４３では変更しない。言い換えると、ベース復号ピクチャに由来するDPB上のピクチャの参照マークの変更は参照ピクチャ制御部１４３では行わず、後述のベース参照ピクチャ制御部１４４で行う。 <Reference picture control unit 143>
The reference picture control unit 143 updates the DPB 141 based on the input RPS. Schematically, the reference picture control unit 143 sets a reference mark of a picture that is indicated as being referenceable by the inter prediction of the target picture (current picture) in the input RPS as “reference use” (“short-term reference use” or Set to “Long term reference use”). In addition, a decoded picture of the target layer recorded in the DPB, which is not marked as “reference use” in the above process, is set to “reference nonuse”. The reference mark of the inter-layer reference picture on the DPB is not changed by the reference picture control unit 143. In other words, the reference mark change of the picture on the DPB derived from the base decoded picture is not performed by the reference picture control unit 143 but is performed by the base reference picture control unit 144 described later.

参照ピクチャ制御部１４３での参照マーク設定処理は以下の手順で実行される。
（Ｓ２０１）DPB上の全てのピクチャの参照マークを「参照不使用」に設定する。
（Ｓ２０２）次の（ａ）または（ｂ）の条件を満たすピクチャ各々について、対応するDPB上のピクチャ、すなわち同一のPOCが割り当てられているピクチャ、を特定して参照マークを「短期参照使用」に設定する。 The reference mark setting process in the reference picture control unit 143 is executed in the following procedure.
(S201) The reference marks of all the pictures on the DPB are set to “not use reference”.
(S202) For each picture that satisfies the following condition (a) or (b), a corresponding picture on the DPB, that is, a picture to which the same POC is assigned, is specified and the short-term reference is used as a reference mark. Set to.

（ａ）RPSの現ピクチャ短期前方参照可能リストListStCurrBeforeに含まれているピクチャ。 (A) Pictures included in the RPS current picture short-term forward referenceable list ListStCurrBefore.

（ｂ）RPSの現ピクチャ長期後方参照可能リストListStCurrAfterに含まれている各ピクチャ。
（Ｓ２０３）次の（ｃ）の条件を満たすピクチャ各々について、対応するDPB上のピクチャを特定して参照マークを「長期参照使用」に設定する。 (B) Each picture included in the list StListCurrAfter that can be referred to long-term backward reference of the current picture of RPS.
(S203) For each picture that satisfies the following condition (c), the corresponding picture on the DPB is identified and the reference mark is set to “long-term reference use”.

（ｃ）RPSの現ピクチャ長期参照可能リストListLtCurrに含まれているピクチャ。 (C) Pictures included in the RPS current picture long-term referable list ListLtCurr.

なお、上記の参照マーク設定手順は一例であり、DPB上のピクチャの参照マークが同じになる限りは別の手順で参照マークを設定してもよい。例えば、上記手順のＳ２０２とＳ２０３の処理を先に実行し、両手順で参照マークが変更されなかったピクチャであって、かつ、対象レイヤの復号ピクチャであるDPB上の各ピクチャのマークを「参照不使用」に設定してもよい。 The reference mark setting procedure described above is an example, and the reference mark may be set by another procedure as long as the reference marks of the pictures on the DPB are the same. For example, the processing of S202 and S203 of the above procedure is executed first, and the mark of each picture on the DPB that is a picture whose reference mark has not been changed in both procedures and is a decoded picture of the target layer is “referenced”. It may be set to “unused”.

＜ベース参照ピクチャ制御部１４４＞
ベース参照ピクチャ制御部１４４は、入力されるベース復号ピクチャとRPSに基づいて、DPB１４１を更新する。概略的には、ベース参照ピクチャ制御部１４４は、入力されたRPSにおいて対象ピクチャ（現ピクチャ）のレイヤ間インター予測で参照可能と示されているピクチャに対応するベース復号ピクチャをレイヤ間参照ピクチャとしてDPBに記録する。加えて、DPBに記録されたレイヤ間参照ピクチャの参照マークを「参照使用」（「短期参照使用」または「長期参照使用」）に設定する。加えて、DPB上で当該レイヤ間参照ピクチャの出力マークを「出力不要」に設定する。 <Base Reference Picture Control Unit 144>
The base reference picture control unit 144 updates the DPB 141 based on the input base decoded picture and RPS. Schematically, the base reference picture control unit 144 uses, as an inter-layer reference picture, a base decoded picture corresponding to a picture that can be referred to in inter-layer inter prediction of the target picture (current picture) in the input RPS. Record in DPB. In addition, the reference mark of the inter-layer reference picture recorded in the DPB is set to “reference use” (“short-term reference use” or “long-term reference use”). In addition, the output mark of the inter-layer reference picture is set to “output unnecessary” on the DPB.

より具体的には、ベース参照ピクチャ制御部１４４における参照マーク設定処理は、次の手順で実行される。 More specifically, the reference mark setting process in the base reference picture control unit 144 is executed in the following procedure.

（Ｓ３０１）次の（ｄ）の条件を満たすピクチャ各々について、DPB上のピクチャバッファを確保し、入力されるベース復号ピクチャを当該ピクチャバッファに記録する。 (S301) For each picture that satisfies the following condition (d), a picture buffer on the DPB is secured, and the input base decoded picture is recorded in the picture buffer.

（ｄ）RPSの現ピクチャレイヤ間参照可能リストListIlCurrに含まれているピクチャ。 (D) Pictures included in the RPS current picture layer referenceable list ListIlCurr.

なお、ベース復号ピクチャを記録する際に、必要に応じてスケーリングやフィルタリングを適用した後に前記ピクチャバッファに記録してもよい。特に、参照レイヤと対象レイヤの出力ピクチャの解像度が異なる場合（空間スケーラビリティの場合）、参照レイヤの復号ピクチャであるベース復号ピクチャを、対象レイヤの出力ピクチャの解像度に合わせてスケーリングする必要がある。 When recording the base decoded picture, it may be recorded in the picture buffer after applying scaling and filtering as necessary. In particular, when the resolutions of the output pictures of the reference layer and the target layer are different (in the case of spatial scalability), it is necessary to scale the base decoded picture that is the decoded picture of the reference layer according to the resolution of the output picture of the target layer.

（Ｓ３０２）Ｓ３０１でDPBに記録した各ピクチャの参照マークを「参照使用」に設定する。 (S302) The reference mark of each picture recorded in the DPB in S301 is set to “use reference”.

（Ｓ３０３）Ｓ３０１でDPBに記録した各ピクチャの出力マークを「出力不要」に設定する。 (S303) The output mark of each picture recorded in the DPB in S301 is set to “no output required”.

＜RPL導出部１４５＞
RPL導出部１４５は、入力RPS、および、入力シンタックス値に含まれるRPL情報に基づいて対象ピクチャの対象スライスのインター予測またはレイヤ間画像予測で利用する参照ピクチャリストを導出して出力する。 <RPL deriving unit 145>
The RPL deriving unit 145 derives and outputs a reference picture list used for inter prediction or inter-layer image prediction of the target slice of the target picture based on the input RPS and the RPL information included in the input syntax value.

（RPL情報）
RPL情報は、参照ピクチャリストRPLを構築するためにSPSまたはスライスヘッダより復号されるシンタックス値である。RPL情報は、SPSリスト修正情報、および、SHリスト修正情報から構成される。 (RPL information)
The RPL information is a syntax value decoded from the SPS or the slice header in order to construct the reference picture list RPL. The RPL information includes SPS list correction information and SH list correction information.

SPSリスト修正情報はSPSに含まれる情報であり、参照ピクチャリスト修正の制約に係る情報である。SPSリスト修正情報について、図７を再び参照して説明する。図７の（Ｃ）の部分がSPSリスト修正情報に相当する。SPSリスト修正情報には、ピクチャに含まれる前スライスで参照ピクチャリストが共通か否かを示すフラグ（restricted_ref_pic_lists_flag）、および、スライスヘッダ内にリスト並べ替えに関する情報が存在するか否かを示すフラグ（lists_modification_present_flag）が含まれる。 The SPS list modification information is information included in the SPS, and is information related to restrictions on modification of the reference picture list. The SPS list correction information will be described with reference to FIG. 7 again. 7C corresponds to the SPS list correction information. In the SPS list correction information, a flag (restricted_ref_pic_lists_flag) indicating whether or not the reference picture list is common in the previous slice included in the picture, and a flag (whether or not information related to list rearrangement exists in the slice header ( lists_modification_present_flag).

ＳＨリスト修正情報はスライスヘッダに含まれる情報であり、対象ピクチャに適用される参照ピクチャリストの長さ（参照リスト長）の更新情報、および、参照ピクチャリストの並べ替え情報（参照リスト並べ替え情報）が含まれる。SHリスト修正情報について、図１１を参照して説明する。図１１はスライスヘッダ復号時に利用されるスライスヘッダシンタックス表の一部を例示している。図１１の（Ｃ）の部分がＳＨリスト修正情報に相当する。 The SH list correction information is information included in the slice header, and the update information of the length of the reference picture list (reference list length) applied to the target picture, and the reordering information of the reference picture list (reference list reordering information) ) Is included. The SH list correction information will be described with reference to FIG. FIG. 11 illustrates a part of a slice header syntax table used when decoding a slice header. A portion (C) in FIG. 11 corresponds to SH list correction information.

参照リスト長更新情報として、リスト長の更新有無を示すフラグ（num_ref_idx_active_override_flag）が含まれる。加えて、Ｌ０参照リストの変更後の参照リスト長を表す情報（num_ref_idx_l0_active_minus1）、および、Ｌ１参照リストの変更後の参照リスト長を表す情報（num_ref_idx_l1_active_minus1）が含まれる。 The reference list length update information includes a flag (num_ref_idx_active_override_flag) indicating whether or not the list length is updated. In addition, information (num_ref_idx_l0_active_minus1) indicating the reference list length after the change of the L0 reference list and information (num_ref_idx_l1_active_minus1) indicating the reference list length after the change of the L1 reference list are included.

参照リスト並べ替え情報としてスライスヘッダに含まれる情報について、図１２を参照して説明する。図１２はスライスヘッダ復号時に利用される参照リスト並べ替え情報のシンタックス表を例示している。 Information included in the slice header as reference list rearrangement information will be described with reference to FIG. FIG. 12 exemplifies a syntax table of reference list rearrangement information used at the time of decoding the slice header.

参照リスト並べ替え情報には、Ｌ０参照リスト並べ替え有無フラグ（ref_pic_list_modification_flag_l0）が含まれる。前記フラグの値が１（Ｌ０参照リストの並べ替えが有る場合）、かつ、NumPocTotalCurrが２より大きい場合、Ｌ０参照リスト並べ替え順序（list_entry_l0[i]）が参照リスト並べ替え情報に含まれる。ここで、NumPocTotalCurrは、現ピクチャにおいて利用可能な参照ピクチャの数を表す変数である。したがって、Ｌ０参照リストの並べ替えが有る場合であって、かつ、現ピクチャにおいて利用可能な参照ピクチャ数が２より大きい場合にのみ、Ｌ０参照リスト並べ替え順序がスライスヘッダに含まれる。 The reference list rearrangement information includes an L0 reference list rearrangement presence / absence flag (ref_pic_list_modification_flag_l0). When the value of the flag is 1 (when the L0 reference list is rearranged) and NumPocTotalCurr is larger than 2, the L0 reference list rearrangement order (list_entry_l0 [i]) is included in the reference list rearrangement information. Here, NumPocTotalCurr is a variable representing the number of reference pictures that can be used in the current picture. Therefore, the L0 reference list rearrangement order is included in the slice header only when the L0 reference list is rearranged and the number of reference pictures available in the current picture is larger than two.

同様に、参照ピクチャがＢスライスである場合、つまり、対象ピクチャにおいてＬ１参照リストが利用可能である場合、Ｌ１参照リスト並べ替え有無フラグ（ref_pic_list_modification_flag_l1）が参照リスト並べ替え情報に含まれる。前記フラグの値が１、かつ、NumPocTotalCurrが２より大きい場合、Ｌ１参照リスト並べ替え順序（list_entry_l1[i]）が参照リスト並べ替え情報に含まれる。言い換えると、Ｌ１参照リストの並べ替えが有る場合であって、かつ、現ピクチャにおいて利用可能な参照ピクチャ数が２より大きい場合にのみ、Ｌ１参照リスト並べ替え順序がスライスヘッダに含まれる。 Similarly, when the reference picture is a B slice, that is, when the L1 reference list is available in the current picture, the L1 reference list rearrangement presence / absence flag (ref_pic_list_modification_flag_l1) is included in the reference list rearrangement information. When the value of the flag is 1 and NumPocTotalCurr is greater than 2, the L1 reference list rearrangement order (list_entry_l1 [i]) is included in the reference list rearrangement information. In other words, the L1 reference list rearrangement order is included in the slice header only when the L1 reference list is rearranged and the number of reference pictures available in the current picture is larger than two.

（RPL導出処理）
参照ピクチャリスト構築処理の詳細を説明する。参照ピクチャリスト導出部は、参照ピクチャセットRPSと、RPL修正情報に基づいて参照ピクチャリストRPLを生成する。 (RPL derivation process)
Details of the reference picture list construction process will be described. The reference picture list deriving unit generates a reference picture list RPL based on the reference picture set RPS and the RPL modification information.

参照ピクチャリストはＬ０参照リストとＬ１参照リストの２つのリストから構成される。始めに、Ｌ０参照リストの構築手順を説明する。Ｌ０参照リストは、以下のＳ３０１〜Ｓ３０７に示す手順で構築される。
（Ｓ３０１）暫定Ｌ０参照リストを生成して、空のリストに初期化する。
（Ｓ３０２）暫定Ｌ０参照リストに対し、現ピクチャ短期前方参照可能リストに含まれる参照ピクチャを順に追加する。
（Ｓ３０３）暫定Ｌ０参照リストに対し、現ピクチャ短期後方参照可能リストに含まれる参照ピクチャを順に追加する。
（Ｓ３０４）暫定Ｌ０参照リストに対し、現ピクチャ長期参照可能リストに含まれる参照ピクチャを順に追加する。
（Ｓ３０５）参照ピクチャリストが修正される場合（RPL修正情報に含まれるlists_modification_present_flagの値が１の場合）、以下のＳ３０６ａ〜Ｓ３０６ｂの処理を実行する。そうでない場合（lists_modification_present_flagの値が０の場合）、Ｓ３０７の処理を実行する。
（Ｓ３０６ａ）Ｌ０参照ピクチャの修正が有効な場合（RPL修正情報に含まれるref_pic_list_modification_flag_l0の値が１の場合）であって、かつ、現ピクチャ参照可能ピクチャ数NumCurrListが２に等しい場合、Ｓ３０６ｂを実行する。そうでない場合、Ｓ３０６ｃを実行する。
（Ｓ３０６ｂ）RPL修正情報に含まれるリスト並べ替え順序list_entry_l0[i]の値を下記の式により設定し、その後、Ｓ３０６ｃを実行する。 The reference picture list is composed of two lists, an L0 reference list and an L1 reference list. First, the construction procedure of the L0 reference list will be described. The L0 reference list is constructed according to the procedure shown in S301 to S307 below.
(S301) A temporary L0 reference list is generated and initialized to an empty list.
(S302) The reference pictures included in the current picture short-term forward referenceable list are sequentially added to the provisional L0 reference list.
(S303) Reference pictures included in the current picture short-term backward referenceable list are sequentially added to the provisional L0 reference list.
(S304) Reference pictures included in the current picture long-term referable list are sequentially added to the provisional L0 reference list.
(S305) When the reference picture list is modified (when the value of lists_modification_present_flag included in the RPL modification information is 1), the following processes of S306a to S306b are executed. Otherwise (when the value of lists_modification_present_flag is 0), the process of S307 is executed.
(S306a) When modification of the L0 reference picture is valid (when the value of ref_pic_list_modification_flag_l0 included in the RPL modification information is 1) and the current picture referenceable picture number NumCurrList is equal to 2, S306b is executed. . Otherwise, S306c is executed.
(S306b) The value of the list rearrangement order list_entry_l0 [i] included in the RPL correction information is set by the following equation, and then S306c is executed.

list_entry_l0[0] = 1
list_entry_l0[1] = 0
（Ｓ３０６ｃ）参照リスト並べ替え順序list_entry_l0[i]の値に基づいて、暫定Ｌ０参照リストの要素を並べ換えて、Ｌ０参照リストとする。参照ピクチャインデックスrIdxに対応するＬ０参照リストの要素RefPicList0[rIdx]は、次式により導出される。ここで、RefListTemp0[i]は、暫定Ｌ０参照リストのi番目の要素を表す。 list_entry_l0 [0] = 1
list_entry_l0 [1] = 0
(S306c) Based on the value of the reference list rearrangement order list_entry_l0 [i], the elements of the provisional L0 reference list are rearranged to form the L0 reference list. The element RefPicList0 [rIdx] of the L0 reference list corresponding to the reference picture index rIdx is derived by the following equation. Here, RefListTemp0 [i] represents the i-th element of the provisional L0 reference list.

RefPicList0[ rIdx ] = RefPicListTemp0[ list_entry_l0[ rIdx ] ]
上記の式によれば、参照リスト並べ替え順序list_entry_l0[i]において、参照ピクチャインデックスrIdxの示す位置に記録されている値を参照し、暫定Ｌ０参照リストにおいて前記値の位置に記録されている参照ピクチャを、Ｌ０参照リストのrIdxの位置の参照ピクチャとして格納する。
（Ｓ３０７）暫定Ｌ０参照リストをＬ０参照リストとする。 RefPicList0 [rIdx] = RefPicListTemp0 [list_entry_l0 [rIdx]]
According to the above formula, in the reference list rearrangement order list_entry_l0 [i], the value recorded at the position indicated by the reference picture index rIdx is referred to, and the reference recorded at the position of the value in the provisional L0 reference list The picture is stored as a reference picture at the position of rIdx in the L0 reference list.
(S307) The provisional L0 reference list is set as the L0 reference list.

次にＬ１参照リストを構築する。なお、Ｌ１参照リストも、上記Ｌ０参照リストと同様の手順で構築できる。上記Ｌ０参照リストの構築手順（Ｓ３０１〜Ｓ３０７）において、Ｌ０参照ピクチャ、Ｌ０参照リスト、暫定Ｌ０参照リスト、list_entry_l0をそれぞれ、Ｌ１参照ピクチャ、Ｌ１参照リスト、暫定Ｌ１参照リスト、list_entry_l1と置き換えればよい。 Next, an L1 reference list is constructed. Note that the L1 reference list can also be constructed in the same procedure as the L0 reference list. In the L0 reference list construction procedure (S301 to S307), the L0 reference picture, the L0 reference list, the provisional L0 reference list, and list_entry_l0 may be replaced with the L1 reference picture, the L1 reference list, the provisional L1 reference list, and list_entry_l1, respectively.

＜出力制御部１４６＞
出力制御部１４６は、概略的には、所定のタイミングでDPB141のピクチャを外部に出力するとともに出力マークを更新する。具体的には、出力制御部１４６によるピクチャ出力処理は次の手順で実行される。 <Output control unit 146>
In general, the output control unit 146 outputs the picture of the DPB 141 to the outside at a predetermined timing and updates the output mark. Specifically, the picture output process by the output control unit 146 is executed in the following procedure.

（Ｓ４０１）DPB上のピクチャであって、出力マークが「出力要」であるピクチャのうちPOCが最小であるピクチャを出力する。 (S401) Among the pictures on the DPB, the picture having the smallest POC among the pictures whose output mark is “output required” is output.

（Ｓ４０２）Ｓ４０１で出力されたピクチャの出力マークを「出力不要」に設定する。 (S402) The output mark of the picture output in S401 is set to “no output required”.

（Ｓ４０３）DPB上のピクチャの中から、参照マークが「参照不使用」であり、かつ、出力マークが「出力不要」であるピクチャを選択して、当該ピクチャをDPBから削除する。 (S403) From the pictures on the DPB, select a picture whose reference mark is “reference not used” and whose output mark is “output unnecessary”, and delete the picture from the DPB.

なお、上記の処理は一般的なDPB上の不要ピクチャ削除処理（一般的なバンピング処理）と呼称できる。一般的なバンピング処理では、DPB上の各ピクチャが対象レイヤの復号ピクチャであるかレイヤ間参照ピクチャ（参照レイヤの復号ピクチャに由来するピクチャ）であるかの判定を行うことなくDPB上のピクチャの出力およびDPB上の不要ピクチャの削除を実行できる。 The above process can be referred to as a general unnecessary picture deletion process (general bumping process) on the DPB. In general bumping processing, each picture on the DPB is a decoded picture of the target layer or an inter-layer reference picture (a picture derived from the decoded picture of the reference layer) without determining whether the picture on the DPB is a decoded picture of the target layer. Unnecessary pictures on the output and DPB can be deleted.

（ピクチャ管理処理手順）
次に、復号ピクチャ管理部１４におけるピクチャ管理処理を図１を参照して説明する。図１は、復号ピクチャ管理部１４における対象ピクチャ復号時のピクチャ管理処理の概略を示したフロー図である。対象ピクチャ復号時のピクチャ管理処理は、以下のＳ５０１〜Ｓ５１０の手順で実行される。 (Picture management processing procedure)
Next, picture management processing in the decoded picture management unit 14 will be described with reference to FIG. FIG. 1 is a flowchart showing an outline of a picture management process at the time of decoding a target picture in the decoded picture management unit 14. The picture management process at the time of decoding the target picture is executed according to the following steps S501 to S510.

（Ｓ５０１）RPS導出部１４２は、入力されたシンタックス値から対象ピクチャのRPSを導出する。導出されたRPSは参照ピクチャ制御部１４３、ベース参照ピクチャ制御部１４４、および、RPL導出部１４５に出力する。Ｓ５０２に進む。 (S501) The RPS deriving unit 142 derives the RPS of the current picture from the input syntax value. The derived RPS is output to the reference picture control unit 143, the base reference picture control unit 144, and the RPL deriving unit 145. The process proceeds to S502.

（Ｓ５０２）参照ピクチャ制御部１４３は、入力RPSに基づきDPB上の対象レイヤの復号ピクチャの参照マークを更新する。Ｓ５０３に進む。 (S502) The reference picture control unit 143 updates the reference mark of the decoded picture of the target layer on the DPB based on the input RPS. The process proceeds to S503.

（Ｓ５０３）ベース参照ピクチャ制御部１４４は、入力のRPSにレイヤ間参照ピクチャが含まれるか判定する。含まれる場合（Ｓ５０３でYESの場合）、Ｓ５０４に進む。一方、含まれない場合（Ｓ５０３でNOの場合）、Ｓ５０６に進む。 (S503) The base reference picture control unit 144 determines whether the input RPS includes an inter-layer reference picture. If included (YES in S503), the process proceeds to S504. On the other hand, if not included (NO in S503), the process proceeds to S506.

（Ｓ５０４）ベース参照ピクチャ制御部１４４は、入力のベース復号ピクチャをレイヤ間参照ピクチャとしてDPB１４１上のピクチャバッファに記録し、参照マークを設定する。Ｓ５０５に進む。 (S504) The base reference picture control unit 144 records the input base decoded picture in the picture buffer on the DPB 141 as an inter-layer reference picture, and sets a reference mark. The process proceeds to S505.

（Ｓ５０５）ベース参照ピクチャ制御部１４４は、Ｓ５０４で記録したレイヤ間参照ピクチャの出力マークを「出力不要」に設定する。Ｓ５０６に進む。 (S505) The base reference picture control unit 144 sets the output mark of the inter-layer reference picture recorded in S504 to “output unnecessary”. The process proceeds to S506.

（Ｓ５０６）出力制御部１４６は、DPB１４１上のピクチャを必要に応じて外部に出力して当該ピクチャの出力マークを更新する。Ｓ５０７に進む。 (S506) The output control unit 146 outputs the picture on the DPB 141 to the outside as necessary, and updates the output mark of the picture. The process proceeds to S507.

（Ｓ５０７）RPL導出部１４５は、入力のRPSとシンタックス値に基づき、対象ピクチャ復号時に用いられる参照ピクチャリストを生成して外部に出力する。Ｓ５０８に進む。なお、このタイミングで、外部では当該参照ピクチャリストおよびDPB１４１上の参照ピクチャを利用して対象ピクチャの復号処理を実行して、対象ピクチャの復号ピクチャを生成する。 (S507) Based on the input RPS and syntax value, the RPL deriving unit 145 generates a reference picture list used at the time of decoding the target picture and outputs the reference picture list to the outside. The process proceeds to S508. At this timing, the target picture is decoded using the reference picture list and the reference picture on the DPB 141 to generate a decoded picture of the target picture.

（Ｓ５０８）入力される対象ピクチャの復号ピクチャをDPB１４１上のピクチャバッファに記録する。Ｓ５０９に進む。 (S508) The decoded picture of the input target picture is recorded in the picture buffer on the DPB 141. The process proceeds to S509.

（Ｓ５０９）レイヤ間参照ピクチャの参照マークを「参照不使用」に設定する。Ｓ５１０に進む。 (S509) The reference mark of the inter-layer reference picture is set to “not use reference”. The process proceeds to S510.

（Ｓ５１０）対象ピクチャの参照マークを「短期参照使用」に設定する。処理を終了する。 (S510) The reference mark of the target picture is set to “use short-term reference”. The process ends.

（ピクチャ管理処理の具体例）
次に、図１を用いて説明したピクチャ管理処理の具体例を図１３を合わせて用いて説明する。図１３は、対象レイヤ上で復号順の連続する２枚のピクチャ（順にピクチャＭ、ピクチャＮ）でのピクチャ管理処理の具体例を説明する図である。図１３(a)は、ピクチャ管理処理のフローを表す。また、図１３(b)は、図１３(a)に示した各ステップにおけるDPB上のピクチャＭに関するレイヤ間参照ピクチャの参照マークと出力マークの状態を表す。 (Specific example of picture management processing)
Next, a specific example of the picture management process described with reference to FIG. 1 will be described with reference to FIG. FIG. 13 is a diagram for describing a specific example of picture management processing for two pictures (in order, picture M and picture N) in decoding order on the target layer. FIG. 13A shows a flow of picture management processing. FIG. 13B shows the state of the reference mark and output mark of the inter-layer reference picture regarding the picture M on the DPB in each step shown in FIG.

以下、図１３(a)に示したフローに従って説明する。なお、説明において、DPB上のピクチャＭに関するレイヤ間参照ピクチャの参照マークがａ、出力マークがｂである状態のことを、ピクチャＭに係るレイヤ間参照ピクチャの状態は「ａ−ｂ」である、と表記する。またピクチャＭのPOCをｍ、ピクチャＮのPOCをｎとする。さらに、各ステップの符号は図１の対応するステップを示唆する。例えば「ステップＳ５０４Ｍ」は、「ピクチャＭに対するステップＳ５０４」に相当する。 Hereinafter, the description will be made according to the flow shown in FIG. In the description, the reference mark of the inter-layer reference picture related to the picture M on the DPB is “a” and the output mark is “b”, and the state of the inter-layer reference picture related to the picture M is “ab”. . The POC of picture M is m, and the POC of picture N is n. Further, the symbols for each step indicate the corresponding step in FIG. For example, “Step S504M” corresponds to “Step S504 for picture M”.

（Ｓ５００Ｍ）ピクチャＭを対象ピクチャに設定する。ピクチャＭに係るレイヤ間参照ピクチャはこの段階では対象レイヤのDPB上には存在せず、状態は「unk−unk」である。 (S500M) The picture M is set as the target picture. The inter-layer reference picture related to the picture M does not exist on the DPB of the target layer at this stage, and the state is “unk-unk”.

（Ｓ５０４Ｍ）POCがmのベース復号ピクチャをレイヤ間参照ピクチャ（ピクチャＭに係るレイヤ間参照ピクチャ）としてDPBに記録する。加えて、参照マークを「長期参照使用」（“LR”）に設定する。ピクチャＭに係るレイヤ間参照ピクチャの状態は「LR-unk」となる。 (S504M) The base decoded picture whose POC is m is recorded in the DPB as an inter-layer reference picture (inter-layer reference picture related to the picture M). In addition, the reference mark is set to “Long Term Reference Use” (“LR”). The state of the inter-layer reference picture related to the picture M is “LR-unk”.

（Ｓ５０５Ｍ）ピクチャＭに係るレイヤ間参照ピクチャの出力マークを「出力不要」（“NNfO”）に設定する。ピクチャＭに係るレイヤ間参照ピクチャの状態は「LR-NNfO」となる。 (S505M) The output mark of the inter-layer reference picture related to the picture M is set to “output unnecessary” (“NNfO”). The state of the inter-layer reference picture related to the picture M is “LR-NNfO”.

（Ｓ５０６Ｍ）DPBのピクチャを出力して出力マークを更新する。ここで、ピクチャＭに係るレイヤ間参照ピクチャの参照マークは「長期参照使用」であるため、当該ピクチャはDPBから除去されず、出力マークも更新されない。したがって、ピクチャＭに係るレイヤ間参照ピクチャの状態は「LR-NNfO」のままとなる。 (S506M) The DPB picture is output and the output mark is updated. Here, since the reference mark of the inter-layer reference picture related to the picture M is “long-term reference use”, the picture is not removed from the DPB, and the output mark is not updated. Therefore, the state of the inter-layer reference picture related to the picture M remains “LR-NNfO”.

（Ｓ５０８Ｍ）ピクチャＭの復号ピクチャをDPBに追加する。ピクチャＭに係るレイヤ間参照ピクチャの状態は「LR-NNfO」のままとなる。 (S508M) The decoded picture of picture M is added to the DPB. The state of the inter-layer reference picture related to the picture M remains “LR-NNfO”.

（Ｓ５０９Ｍ）レイヤ間参照ピクチャの参照マークを「参照不使用」（“NR”）に更新する。ピクチャＭに係るレイヤ間参照ピクチャの状態は「NR-NNfO」となる。 (S509M) The reference mark of the inter-layer reference picture is updated to “reference not used” (“NR”). The state of the inter-layer reference picture related to the picture M is “NR-NNfO”.

（Ｓ５００Ｎ）ピクチャＮを対象ピクチャに設定する。ピクチャＭに係るレイヤ間参照ピクチャの状態は「NR-NNfO」のままとなる。 (S500N) The picture N is set as the target picture. The state of the inter-layer reference picture related to the picture M remains “NR-NNfO”.

（Ｓ５０６Ｎ）DPBのピクチャを出力して出力マークを更新する。ここで、DPB上のピクチャＭに係るレイヤ間参照ピクチャは「参照不使用」かつ「出力不要」なので、当該レイヤ間参照ピクチャはDPB上から除去される。 (S506N) The DPB picture is output and the output mark is updated. Here, since the inter-layer reference picture related to the picture M on the DPB is “reference not used” and “output unnecessary”, the inter-layer reference picture is removed from the DPB.

（ピクチャ管理処理まとめ）
以上、図１と図１３を用いて説明した階層動画像復号装置１の備える復号ピクチャ管理部１４でのピクチャ管理処理は以下のＡ１〜Ａ４に示す特徴を有している。 (Picture management process summary)
As described above, the picture management processing in the decoded picture management unit 14 included in the hierarchical video decoding device 1 described with reference to FIGS. 1 and 13 has the following characteristics A1 to A4.

Ａ１．レイヤ間参照ピクチャをDPBに追加する処理を含む。 A1. Includes a process of adding an inter-layer reference picture to the DPB.

Ａ２．レイヤ間参照ピクチャをDPBに追加する際に（Ａ１の処理の直後に）、対象ピクチャの当該レイヤ間参照ピクチャの参照マークを「長期参照使用」に設定する処理を含む。 A2. When the inter-layer reference picture is added to the DPB (immediately after the processing of A1), the processing includes setting the reference mark of the inter-layer reference picture of the target picture to “long-term reference use”.

Ａ３．レイヤ間参照ピクチャをDPBに追加する際に（Ａ１の処理の直後に）、当該レイヤ間参照ピクチャの出力マークを「出力不要」に設定する処理を含む。 A3. When the inter-layer reference picture is added to the DPB (immediately after the processing of A1), the processing includes setting the output mark of the inter-layer reference picture to “output unnecessary”.

Ａ４．対象ピクチャの復号ピクチャ生成処理の完了後、レイヤ間参照ピクチャの参照マークを「参照不使用」に更新する処理を含む。 A4. After completion of the decoded picture generation process of the target picture, the process includes a process of updating the reference mark of the inter-layer reference picture to “unused reference”.

上記特徴から、上記復号ピクチャ管理部１４によるピクチャ管理処理によれば、対象ピクチャの復号においてレイヤ間参照ピクチャとして用いられるピクチャをDPBに追加した上で、当該対象ピクチャの復号ピクチャ生成に利用できる。加えて、前記復号ピクチャ生成が完了した時点で、後続のピクチャ復号に不要となる前記レイヤ間参照ピクチャの参照マークと出力マークの状態を、一般的なバンピング処理によりDPBから破棄できる状態に設定できる。 From the above characteristics, according to the picture management processing by the decoded picture management unit 14, a picture used as an inter-layer reference picture in decoding of the target picture can be added to the DPB and used for generating a decoded picture of the target picture. In addition, when the decoded picture generation is completed, the state of the reference mark and output mark of the inter-layer reference picture that are not required for subsequent picture decoding can be set to a state that can be discarded from the DPB by a general bumping process. .

したがって、上記復号ピクチャ管理部１４によるピクチャ管理処理によれば、ピクチャ復号に用いたレイヤ間参照ピクチャを、不要になった時点でDPBから除去できるるため、少ないメモリ平均使用量で入力符号化データを復号できる。加えて、DPB上のピクチャがレイヤ間参照ピクチャか、インター予測による参照ピクチャかを区別することなくDPB上の不要ピクチャを破棄できるため、当該処理に入力符号化データの復号にレイヤ間参照ピクチャを用いない場合と同一のアルゴリズムを用いることで回路規模を少なく抑えられる。 Therefore, according to the picture management process performed by the decoded picture management unit 14, the inter-layer reference picture used for picture decoding can be removed from the DPB when it is no longer needed. Can be decrypted. In addition, since it is possible to discard unnecessary pictures on the DPB without distinguishing whether the pictures on the DPB are inter-layer reference pictures or inter-reference reference pictures, an inter-layer reference picture is used for decoding the input encoded data in this process. By using the same algorithm as when not used, the circuit scale can be reduced.

換言すると、復号ピクチャ管理部１４は、ベース復号部１５の内部に含まれる復号ピクチャ管理部が実行するバンピング処理と同一のバンピング処理により、復号ピクチャバッファ（DPB１４１）から破棄するピクチャを決定できるため、回路規模を少なく抑えられる。 In other words, the decoded picture management unit 14 can determine a picture to be discarded from the decoded picture buffer (DPB 141) by the same bumping process as that performed by the decoded picture management unit included in the base decoding unit 15. The circuit scale can be reduced.

なお、上記特徴のＡ２で参照マークを「長期参照使用」に設定すると記載したが、それに限らず、対象ピクチャの復号に使われる可能性のある旨を示す別の参照マーク（例えば「短期参照使用」や「レイヤ間参照使用」）が設定されていてもよい。対象ピクチャの復号に使われる可能性のある旨を示す参照マークを「参照使用」と表す場合、Ａ２の特徴を次のＡ２’の特徴で置き換えても良い。 In addition, although it has been described that the reference mark is set to “long-term reference use” in A2 of the above feature, the reference mark is not limited to this, but another reference mark (for example, “short-term reference use” indicating that it may be used for decoding of the target picture. Or “inter-layer reference use”) may be set. When a reference mark indicating that there is a possibility of being used for decoding the target picture is expressed as “reference use”, the feature of A2 may be replaced with the feature of A2 ′ below.

Ａ２’．レイヤ間参照ピクチャをDPBに追加する際に（Ａ１の処理の直後に）、対象ピクチャの当該レイヤ間参照ピクチャの参照マークを「参照使用」に設定する処理を含む。 A2 '. When the inter-layer reference picture is added to the DPB (immediately after the process of A1), the process includes a process of setting the reference mark of the inter-layer reference picture of the target picture to “use reference”.

また、上記のＡ１〜Ａ４や図１で記載した処理順はあくまで一例であり、それに限らない。 Further, the processing order described in the above A1 to A4 and FIG. 1 is merely an example, and is not limited thereto.

具体的には、Ａ１、Ａ２、Ａ３の処理は対象ピクチャ導出処理（ピクチャ復号部１３によるピクチャ復号処理）の開始前のタイミングで実行すればよい。 Specifically, the processing of A1, A2, and A3 may be executed at a timing before the start of the target picture derivation processing (picture decoding processing by the picture decoding unit 13).

換言すると、ピクチャ管理部１４は、上位レイヤ内の対象ピクチャの復号ピクチャ導出処理に先立って、下位レイヤの復号ピクチャに基づき生成されるレイヤ間参照ピクチャを上記復号ピクチャバッファに追加（Ａ１の処理を実行）するとともに、当該レイヤ間参照ピクチャの参照マークを参照使用に設定（Ａ２の処理を実行）し、出力マークを出力不要に設定（Ａ３の処理を実行）してもよい。 In other words, the picture management unit 14 adds an inter-layer reference picture generated based on the decoded picture of the lower layer to the decoded picture buffer prior to the decoded picture derivation process of the target picture in the upper layer (the process of A1 is performed). The reference mark of the inter-layer reference picture may be set to be used for reference (execution of the process A2), and the output mark may be set so as not to be output (the process of A3 is executed).

また、Ａ４の処理は、対象ピクチャ導出処理（ピクチャ復号部１３によるピクチャ復号処理）の完了後であって、復号順で後続の同一レイヤ内のピクチャの復号処理開始前に実行すればよい。 The process A4 may be executed after the target picture derivation process (the picture decoding process by the picture decoding unit 13) is completed and before the decoding process of the subsequent picture in the same layer is started in the decoding order.

換言すると、ピクチャ管理部１４は、対象ピクチャの復号ピクチャ導出処理の完了後、上記対象ピクチャが復号順で対象ピクチャの次のピクチャの復号開始前のタイミングで、上記レイヤ間参照ピクチャの参照マークを参照不使用に設定（Ａ４の処理を実行）すしてもよい。 In other words, the picture management unit 14 sets the reference mark of the inter-layer reference picture at a timing after the decoding of the decoded picture of the target picture is completed and before the target picture starts decoding the next picture of the target picture in decoding order. The reference may not be used (the process of A4 is executed).

（ベース復号部）
図１４を用いて、ベース復号部１５の詳細構成を説明する。図１４は、ベース復号部１５の構成について例示した機能ブロック図である。 (Base decoding unit)
The detailed structure of the base decoding part 15 is demonstrated using FIG. FIG. 14 is a functional block diagram illustrating the configuration of the base decoding unit 15.

図１４に示すように、ベース復号部１５は、可変長復号部１５１、ベースピクチャ復号部１５３、ベース復号ピクチャ管理部１５４を備える。 As illustrated in FIG. 14, the base decoding unit 15 includes a variable length decoding unit 151, a base picture decoding unit 153, and a base decoded picture management unit 154.

可変長復号部１５１は、参照レイヤ符号化データＤＡＴＡ＃Ｒに含まれるバイナリから各種符号化パラメータ（代表的には変換係数、予測情報、RPS情報、RPL情報）に係る各種シンタックス値を復号する。可変長復号部１５１が復号する予測情報および変換係数のシンタックスは、図４の可変長復号部１２と同様であるため詳細説明は省略する。 The variable length decoding unit 151 decodes various syntax values related to various coding parameters (typically transform coefficients, prediction information, RPS information, and RPL information) from the binary included in the reference layer encoded data DATA # R. . The syntax of the prediction information and the transform coefficient decoded by the variable length decoding unit 151 is the same as that of the variable length decoding unit 12 in FIG.

可変長復号部１５１は、復号したシンタックス値をベースピクチャ復号部１５３およびベース復号ピクチャ管理部１５４に供給する。 The variable length decoding unit 151 supplies the decoded syntax value to the base picture decoding unit 153 and the base decoded picture management unit 154.

ベースピクチャ復号部１５３は、参照レイヤ上の対象ピクチャに関するスライスヘッダ、予測情報、および、変換係数に係るシンタックス値に基づいて、参照レイヤ上の対象ピクチャの復号ピクチャ（ベース復号ピクチャ）を復号して出力される。なお、ベースピクチャ復号部１５３におけるベース復号ピクチャ復号処理は、図４のピクチャ復号部１３におけるピクチャ復号処理と同様であるため詳細説明は省略する。 The base picture decoding unit 153 decodes the decoded picture (base decoded picture) of the target picture on the reference layer based on the slice header, the prediction information, and the syntax value related to the transform coefficient regarding the target picture on the reference layer. Is output. The base decoded picture decoding process in the base picture decoding unit 153 is the same as the picture decoding process in the picture decoding unit 13 in FIG.

ベース復号ピクチャ管理部１５４は、入力されるベース復号ピクチャを内部のDPBに記録するとともに、入力されるシンタックス値に基づいて参照ピクチャリスト生成や出力するベース復号ピクチャの決定を行う。生成された参照ピクチャリストはベースピクチャ復号部１５３に出力され、ベース復号ピクチャは外部に出力される。なお、ベース復号ピクチャ管理部１５４における処理は、図４の復号ピクチャ管理部１４における処理と同様であり詳細説明は省略する。 The base decoded picture management unit 154 records the input base decoded picture in the internal DPB, and generates a reference picture list and determines a base decoded picture to be output based on the input syntax value. The generated reference picture list is output to the base picture decoding unit 153, and the base decoded picture is output to the outside. Note that the processing in the base decoded picture management unit 154 is the same as the processing in the decoded picture management unit 14 in FIG.

（動画像復号装置１の効果）
以上説明した本実施形態に係る階層動画像復号装置１はピクチャ管理部１４を備えおり、前記ピクチャ管理部は、レイヤ間参照ピクチャをDPBに追加し、続いて、当該レイヤ間参照ピクチャの参照マークを「参照使用」に設定するとともに、当該レイヤ間参照ピクチャの出力マークを「出力不要」に設定するベース参照ピクチャ制御部１４４を備えている。したがって、階層動画像復号装置１は、ピクチャ復号に用いたレイヤ間参照ピクチャを、レイヤ間参照ピクチャとインター予測の参照ピクチャを区別しないバンピング処理を用いて、不要になった時点でDPBから除去できるため、少ないメモリ平均使用量で入力符号化データを復号できる。 (Effect of moving image decoding apparatus 1)
The hierarchical moving picture decoding apparatus 1 according to the present embodiment described above includes the picture management unit 14, and the picture management unit adds the inter-layer reference picture to the DPB, and subsequently the reference mark of the inter-layer reference picture. Is set to “reference use”, and a base reference picture control unit 144 is provided for setting the output mark of the inter-layer reference picture to “output unnecessary”. Therefore, the hierarchical video decoding device 1 can remove the inter-layer reference picture used for picture decoding from the DPB when it becomes unnecessary by using a bumping process that does not distinguish between the inter-layer reference picture and the inter-prediction reference picture. Therefore, input encoded data can be decoded with a small average memory usage.

［変形例１：共通DPBの場合の参照マーク］
上記の階層動画像復号装置１のピクチャ管理処理では、レイヤ毎に独立に、各レイヤの復号ピクチャと参照ピクチャ管理のためのDPBを持つことを仮定して説明したが、それに限らない。例えば、復号ピクチャ管理部１４は対象レイヤである拡張レイヤの復号ピクチャを管理するDPB１４１を備え、ベース復号部１５内のベース復号ピクチャ管理部１５４はベース復号ピクチャを管理するDPBを備えているものと仮定していたが、両者が共通のDPBを用いるように階層動画像復号装置を構成することもできる。以下、複数レイヤの復号ピクチャを共通のDPBで管理する仕組みを、共通DPBモデルと称する。 [Modification 1: Reference mark for common DPB]
In the picture management processing of the hierarchical moving picture decoding apparatus 1 described above, it is assumed that each layer has a decoded picture of each layer and a DPB for managing the reference picture. However, the present invention is not limited to this. For example, the decoded picture management unit 14 includes a DPB 141 that manages the decoded picture of the enhancement layer that is the target layer, and the base decoded picture management unit 154 in the base decoding unit 15 includes a DPB that manages the base decoded picture. Although it has been assumed, the hierarchical moving picture decoding apparatus can be configured so that both use a common DPB. Hereinafter, a mechanism for managing decoded pictures of a plurality of layers with a common DPB is referred to as a common DPB model.

共通DPBモデルを用いる場合、階層動画像復号装置１で説明した構成において、少なくとも以下の点を変更する必要が生じる。 When the common DPB model is used, it is necessary to change at least the following points in the configuration described in the hierarchical video decoding device 1.

１．バンピング処理方法
２．DPB上のピクチャの参照マークおよび出力マーク処理の更新方法
以下では、それぞれの変更について説明する。 1. Bumping processing method Updating method of reference mark and output mark processing of picture on DPB Each change will be described below.

（１．バンピング処理方法の変更）
出力制御部１４６におけるバンピング処理を次の手順で実行するよう変更する。 (1. Change of bumping method)
It changes so that the bumping process in the output control part 146 may be performed in the following procedure.

（Ｓ４０１ｒ）DPB上のピクチャであって、出力マークが「出力要」である対象レイヤのピクチャのうちPOCが最小のピクチャを出力する。 (S401r) Of the pictures on the DPB, the picture with the smallest POC is output among the pictures of the target layer whose output mark is “output required”.

（Ｓ４０２ｒ）Ｓ４０１ｒで出力されたピクチャの出力マークを「出力不要」に設定する。 (S402r) The output mark of the picture output in S401r is set to “no output required”.

（Ｓ４０３ｒ）DPB上のピクチャの中から、参照マークが「参照不使用」であり、かつ、出力マークが「出力不要」であるピクチャを選択して、当該ピクチャをDPBから削除する。 (S403r) From the pictures on the DPB, select a picture whose reference mark is “reference not used” and whose output mark is “output unnecessary”, and delete the picture from the DPB.

（２．DPB上のピクチャの参照マークおよび出力マーク処理の更新方法の変更）
図１５に示す処理により図１の処理の一部を置き換える。図１５は、共通DPMモデル利用時のピクチャ管理処理の一部を表すフロー図である。以下で説明する図１５に示す処理Ｓ５０９ｒ１からＳ５１０ｒ１は、図１のＳ５０９とＳ５１０の処理を置き換える。 (2. Change of update method of reference mark and output mark processing of picture on DPB)
A part of the process of FIG. 1 is replaced by the process shown in FIG. FIG. 15 is a flowchart showing a part of the picture management process when using the common DPM model. Processes S509r1 to S510r1 shown in FIG. 15 described below replace the processes of S509 and S510 of FIG.

（Ｓ５０９ｒ１）廃棄可能フラグ（discardable_flag）の値に応じて対象ピクチャの参照マークを更新する。廃棄可能フラグは、対象ピクチャが後続のピクチャ復号時に参照ピクチャまたはレイヤ間参照ピクチャとして利用される可能性の有無を示すフラグである。廃棄可能フラグの値が１の場合、参照マークを「参照不使用」に更新する。廃棄可能フラグの値が０の場合、参照マークを「参照使用」に更新する。 (S509r1) The reference mark of the target picture is updated according to the value of the discardable flag (discardable_flag). The discardable flag is a flag indicating whether or not there is a possibility that the target picture is used as a reference picture or an inter-layer reference picture during subsequent picture decoding. If the value of the discardable flag is 1, the reference mark is updated to “not use reference”. If the value of the discardable flag is 0, the reference mark is updated to “use reference”.

（Ｓ５１０ｒ１）DPB上に記録されているピクチャのうち、下記条件Ｄ１、Ｄ２、およびＤ３を全て満たすピクチャの参照マークを「参照不使用」に設定する。 (S510r1) Of the pictures recorded on the DPB, the reference marks of the pictures that satisfy all of the following conditions D1, D2, and D3 are set to “not use reference”.

Ｄ１：対象ピクチャのPOCと同一POCを有するピクチャ。 D1: A picture having the same POC as the POC of the target picture.

Ｄ２：対象ピクチャより復号順の遅い対象ピクチャのPOCと同一POCを有するピクチャの復号時にレイヤ間参照ピクチャとして利用されないピクチャ。 D2: A picture that is not used as an inter-layer reference picture when decoding a picture having the same POC as the POC of the target picture that is later in decoding order than the target picture.

Ｄ３：同一レイヤ内の予測（例えばインター予測）により利用されないピクチャ。 D3: A picture that is not used by prediction (for example, inter prediction) in the same layer.

上記変更を加えたピクチャ管理処理により、共通DPBモデルを用いる場合であっても、レイヤ間画像予測やインター予測で参照されなくなった復号ピクチャの参照マークを「参照不使用」に設定できる。したがって、DPB上のピクチャの出力マークが「出力不要」と変更された時点でピクチャをDPBから削除できるため、メモリの平均使用量を低減できる。 With the picture management process with the above changes, even when a common DPB model is used, the reference mark of a decoded picture that is no longer referred to in inter-layer image prediction or inter prediction can be set to “not use reference”. Therefore, since the picture can be deleted from the DPB at the time when the output mark of the picture on the DPB is changed to “output unnecessary”, the average memory usage can be reduced.

なお、Ｓ５１０ｒ１を省略する構成を用いてもよい。その場合、対象ピクチャより復号順の遅い後続ピクチャにおけるレイヤ間画像予測やインター予測で参照されないピクチャに対しては廃棄可能フラグの値を１とする制約を設けることが好ましい。廃棄可能フラグ選択の自由度が低下する一方で、Ｓ５１０ｒ１の処理を省略できるため処理量が削減できるという利点がある。 Note that a configuration in which S510r1 is omitted may be used. In that case, it is preferable to provide a constraint that the value of the discardable flag is 1 for a picture that is not referenced in inter-layer image prediction or inter prediction in a subsequent picture that is later in decoding order than the target picture. While the degree of freedom in selecting the discardable flag is reduced, there is an advantage that the processing amount can be reduced because the processing of S510r1 can be omitted.

［変形例１−１：共通DPBの場合の参照マーク、サブレイヤの考慮］
変形例１に記載した共通DPBモデルに適用するピクチャ管理処理において、ピクチャの属するサブレイヤ（時間レイヤ）の情報を用いた判定を含めても良い。 [Modification 1-1: Consideration of Reference Mark and Sublayer in Common DPB]
In the picture management process applied to the common DPB model described in the first modification, determination using information on a sublayer (temporal layer) to which a picture belongs may be included.

サブレイヤは時間スケーラビリティを表現するために用いられる。一般的には各サブレイヤに属するピクチャに対して時間識別子（temporal_id）が割り当てられている。時間識別子は、低次サブレイヤほど、言い換えると低いフレームレートに相当するサブレイヤほど、小さい値が割り当てられている。特定のサブレイヤに相当するフレームレートの出力画像を得たい場合には、当該サブレイヤに割り当てられている時間識別子以下の値を持つ時間識別子が割り当てられた符号化データを階層符号化データから抽出して復号すればよい。 The sublayer is used to express temporal scalability. Generally, a time identifier (temporal_id) is assigned to a picture belonging to each sublayer. The time identifier is assigned a smaller value in lower order sublayers, in other words, in sublayers corresponding to lower frame rates. When an output image having a frame rate corresponding to a specific sublayer is to be obtained, encoded data to which a time identifier having a value less than or equal to the time identifier assigned to the sublayer is extracted from hierarchically encoded data. What is necessary is just to decode.

例えば、時間識別子の値がａであることをTid=aと表記した上で、Tid=0が15fps（frame per second）に、Tid=1が30fps、Tid=2が60fpsに相当するとする。この場合、15fpsの出力画像を得るためにはTid=0の符号化データを抽出して復号すればよい。また、60fpsの出力画像を得るためにはTid=0、Tid=1、Tid=2の符号化データを抽出して復号すればよい。なお、階層符号化データに含まれる最大の時間識別子は最高次時間識別子（Highest Temporal Id）とも呼ばれる。 For example, suppose that the value of the time identifier is a as Tid = a, Tid = 0 corresponds to 15 fps (frame per second), Tid = 1 corresponds to 30 fps, and Tid = 2 corresponds to 60 fps. In this case, in order to obtain an output image of 15 fps, encoded data of Tid = 0 may be extracted and decoded. In order to obtain an output image of 60 fps, encoded data of Tid = 0, Tid = 1, and Tid = 2 may be extracted and decoded. The maximum time identifier included in the hierarchically encoded data is also called the highest time identifier (Highest Temporal Id).

上記の特性を実現するため、Tid=kのピクチャは、k以下の時間識別子を持つピクチャのみをインター予測で参照可能という特徴を持つ。そのため、最高次時間識別子を持つピクチャは、インター予測で参照されない場合には、レイヤ間画像予測による参照ピクチャ（レイヤ間参照ピクチャ）としてのみ利用され得る。 In order to realize the above characteristics, a picture with Tid = k has a feature that only a picture having a time identifier of k or less can be referred to by inter prediction. Therefore, when the picture having the highest time identifier is not referred to in inter prediction, it can be used only as a reference picture (inter-layer reference picture) by inter-layer image prediction.

本変形例では、図１６に示す処理により図１の処理の一部を置き換える。図１６は、共通DPMモデル利用時のピクチャ管理処理の一部を表すフロー図である。以下で説明する図１６に示す処理Ｓ５０９ｒ１およびＳ５１０ｒ２〜Ｓ５１５ｒ２の処理は、図１のＳ５０９とＳ５１０の処理を置き換える。なお、Ｓ５０９の処理は変形例１で記載した同一符号の処理と同じであり説明を省略する。 In this modification, a part of the process of FIG. 1 is replaced by the process shown in FIG. FIG. 16 is a flowchart showing a part of picture management processing when the common DPM model is used. Processes S509r1 and S510r2 to S515r2 shown in FIG. 16 described below replace the processes of S509 and S510 in FIG. Note that the processing of S509 is the same as the processing of the same reference numerals described in the first modification, and description thereof is omitted.

（Ｓ５１０ｒ２）対象ピクチャが復号順で後続のピクチャにおけるレイヤ間画像予測で参照されるかを判定する。参照される場合（Ｓ５１０ｒ２でYES）、Ｓ５１４ｒ２に進む。参照されない場合（Ｓ５１０ｒ２でNO）、Ｓ５１１ｒ２に進む。 (S510r2) It is determined whether the current picture is referred to in inter-layer image prediction in the subsequent picture in decoding order. When referred to (YES in S510r2), the process proceeds to S514r2. If not referenced (NO in S510r2), the process proceeds to S511r2.

（Ｓ５１１ｒ２）最高次時間識別子（HighestTid）を持つピクチャ間でインター予測が許可されているかを判定する。許可されている場合（Ｓ５１１ｒ２でYES）、Ｓ５１３ｒ２に進む。許可されていない場合（Ｓ５１１ｒ２でNO）、Ｓ５１２ｒ２に進む。 (S511r2) It is determined whether inter prediction is permitted between pictures having the highest time identifier (HighestTid). If permitted (YES in S511r2), the process proceeds to S513r2. If not permitted (NO in S511r2), the process proceeds to S512r2.

（Ｓ５１２ｒ２）対象ピクチャの時間識別子が最高次時間識別子（HighestTid）かを判定する。最高時時間識別子である場合（Ｓ５１２ｒ２でYES）、Ｓ５１５ｒ２に進む。最高次識別子でない場合（Ｓ５１２ｒ２でNO）、Ｓ５１３ｒ２に進む。 (S512r2) It is determined whether the time identifier of the target picture is the highest time identifier (HighestTid). When it is the highest time identifier (YES in S512r2), the process proceeds to S515r2. If it is not the highest order identifier (NO in S512r2), the process proceeds to S513r2.

（Ｓ５１３ｒ２）対象ピクチャがインター予測で参照されるかを判定する。インター予測で参照される場合（Ｓ５１３ｒ２でYES）、Ｓ５１５ｒ２に進む。インター予測で参照されない場合（Ｓ５１３ｒ２でNO）、Ｓ５１４ｒ２に進む。 (S513r2) It is determined whether the current picture is referred to in inter prediction. When referred to by inter prediction (YES in S513r2), the process proceeds to S515r2. When not referred by inter prediction (it is NO at S513r2), it progresses to S514r2.

（Ｓ５１４ｒ２）対象ピクチャの参照マークを「参照使用」に設定して処理を終了する。 (S514r2) The reference mark of the target picture is set to “use reference”, and the process ends.

（Ｓ５１５ｒ２）対象ピクチャの参照マークを「参照不使用」に設定して処理を終了する。 (S515r2) The reference mark of the target picture is set to “not use reference”, and the process ends.

図１６を用いて説明した上記処理は、概略的には次のように表現することができる。すなわち、復号したピクチャの参照マークを設定するために、まず後続のレイヤ間画像予測での参照可能性有無を調べる。そして、レイヤ間画像予測で使用されない場合に、インター予測での参照可能性有無に先立って、時間識別子を用いた判定を行う。時間識別子を用いた判定が所定の条件を満たす場合、インター予測での参照可能性有無の判定を省略して、対象ピクチャの参照マークを「参照不使用」に設定する。ここで、所定の条件は、対象ピクチャが最高次サブレイヤに属し（最高次時間識別子を有し）、かつ、最高次サブレイヤに属するピクチャ間でインター予測が許可されているか否かである。 The above-described processing described with reference to FIG. 16 can be roughly expressed as follows. That is, in order to set a reference mark for a decoded picture, first, the possibility of reference in the subsequent inter-layer image prediction is checked. Then, when not used in inter-layer image prediction, a determination using a temporal identifier is performed prior to the presence or absence of reference possibility in inter prediction. When the determination using the temporal identifier satisfies a predetermined condition, the determination of the possibility of reference in inter prediction is omitted, and the reference mark of the target picture is set to “reference not used”. Here, the predetermined condition is whether or not the target picture belongs to the highest order sublayer (has the highest order time identifier) and inter prediction is permitted between pictures belonging to the highest order sublayer.

インター予測での参照可能性の判定はRPSの仕組みを用いる場合には容易ではない。なぜなら、対象ピクチャがインター予測に使用されるか否かは、復号順で次ピクチャのRPSを復号しDPBの参照マークを更新するまで分からないためである。 Judgment of reference possibility in inter prediction is not easy when using RPS mechanism. This is because it is not known whether the target picture is used for inter prediction until the RPS of the next picture is decoded in the decoding order and the reference mark of the DPB is updated.

上記ピクチャ管理処理によれば、最高次サブレイヤのピクチャ間のインター予測が制限されている場合に、その事実を利用してインター予測の利用可能性の判定を省略できる。したがって、対象ピクチャをより早い段階で「参照不使用」とマークすることができるため、出力済となった対象ピクチャをより早くDPBから削除できるためDPBのメモリ平均使用量を削減できる。 According to the picture management process described above, when inter prediction between pictures in the highest order sublayer is limited, determination of inter prediction availability can be omitted by using the fact. Accordingly, since the target picture can be marked as “reference not used” at an earlier stage, the output target picture can be deleted from the DPB earlier, and the average memory usage of the DPB can be reduced.

特に最高次サブレイヤに属するピクチャ間のインター予測適用有無の判定を行っているのは、最高次サブレイヤに属するピクチャは一般的に画質が低く、より低次のサブレイヤに属するピクチャを参照する場合が多いためである。最高次より低次のサブレイヤに属するピクチャ間でもＳ５１１ｒ２と同様の判定によりインター予測の適用有無を判定しても構わない。最高次サブレイヤに属するピクチャの数は、低次サブレイヤに属するピクチャに較べて多いことと、最高次サブレイヤに属するピクチャ間でのインター予測の有効性が低いことから、いずれかのサブレイヤに対してピクチャ間のインター予測適用有無の判定を行うのであれば、最高次サブレイヤに対して行うのが好ましい。 In particular, it is judged whether or not inter prediction is applied between pictures belonging to the highest order sublayer. Pictures belonging to the highest order sublayer generally have low image quality, and often refer to pictures belonging to lower order sublayers. Because. Whether or not to apply inter prediction may be determined between pictures belonging to sublayers lower than the highest order by the same determination as in S511r2. Since the number of pictures belonging to the highest order sublayer is larger than that of pictures belonging to the lower order sublayer and the effectiveness of inter prediction between pictures belonging to the highest order sublayer is low, pictures for any of the sublayers If it is determined whether or not inter prediction is applied, it is preferably performed on the highest order sublayer.

なお、同一の時間識別子を持つピクチャ間でインター予測の適用が禁止されている場合には、Ｓ５１１ｒ２やＳ５１２ｒ２における判定は冗長であるため、省略することが好ましい。 When application of inter prediction is prohibited between pictures having the same temporal identifier, the determination in S511r2 and S512r2 is redundant and is preferably omitted.

上記の変形例１−１の構成は、次のようにも表現できる。すなわち、上位レイヤの復号ピクチャを復元する画像復号装置において、上位レイヤの復号ピクチャバッファ（DPB）を管理するピクチャ管理部は、上位レイヤ内の対象ピクチャの復号ピクチャ導出処理に先立って、対象ピクチャの属するサブレイヤにおけるレイヤ間予測適用有無を判定し、当該判定結果がレイヤ間予測適用可能である場合に、下位レイヤの復号ピクチャに基づき生成されるレイヤ間参照ピクチャを上記復号ピクチャバッファに追加するとともに、当該レイヤ間参照ピクチャの参照マークを参照使用に設定し、出力マークを出力不要に設定する。 The configuration of the modification 1-1 can be expressed as follows. That is, in the image decoding apparatus that restores the decoded picture of the upper layer, the picture management unit that manages the decoded picture buffer (DPB) of the upper layer prior to the decoded picture derivation process of the target picture in the upper layer, When the inter-layer prediction application in the sublayer to which it belongs is determined and the determination result is applicable to inter-layer prediction, the inter-layer reference picture generated based on the decoded picture of the lower layer is added to the decoded picture buffer, The reference mark of the inter-layer reference picture is set to be used for reference, and the output mark is set to be unnecessary for output.

〔階層動画像符号化装置〕
以下では、本実施形態に係る階層動画像符号化装置２の構成について、図１７を参照して説明する。 [Hierarchical video encoding device]
Below, the structure of the hierarchy moving image encoder 2 which concerns on this embodiment is demonstrated with reference to FIG.

（階層動画像符号化装置の構成）
図１７を用いて、階層動画像符号化装置２の概略構成を説明する。図１７は、階層動画像符号化装置２の概略的構成を示した機能ブロック図である。階層動画像符号化装置２は、対象レイヤの入力画像ＰＩＮ＃Ｔを、参照レイヤ符号化データＤＡＴＡ＃Ｒを参照しながら符号化して、対象レイヤの階層符号化データＤＡＴＡを生成する。なお、参照レイヤ符号化データＤＡＴＡ＃Ｒは、参照レイヤに対応する階層動画像符号化装置において符号化済みであるとする。 (Configuration of Hierarchical Video Encoding Device)
A schematic configuration of the hierarchical video encoding device 2 will be described with reference to FIG. FIG. 17 is a functional block diagram showing a schematic configuration of the hierarchical video encoding device 2. The hierarchical video encoding device 2 encodes the input image PIN # T of the target layer with reference to the reference layer encoded data DATA # R to generate hierarchical encoded data DATA of the target layer. It is assumed that the reference layer encoded data DATA # R has been encoded in the hierarchical video encoding apparatus corresponding to the reference layer.

図１７に示すように階層動画像符号化装置２は、ピクチャ符号化部２１、可変長符号化部２２、ＮＡＬ多重化部２３、復号ピクチャ管理部１４、および、ベース復号部１５を備える。 As shown in FIG. 17, the hierarchical video encoding device 2 includes a picture encoding unit 21, a variable length encoding unit 22, a NAL multiplexing unit 23, a decoded picture management unit 14, and a base decoding unit 15.

ピクチャ符号化部２１は、入力画像ＰＩＮ＃Ｔを符号化して対応するシンタックス値を生成して出力する。また、同時に出力したシンタックス値を用いて復号処理を行うことで復号ピクチャを生成して出力する。ピクチャ符号部２１は、シンタックス値または復号ピクチャの生成に、入力される参照ピクチャリストを利用したインター予測またはレイヤ間画像予測を用いる。 The picture encoding unit 21 encodes the input image PIN # T to generate and output a corresponding syntax value. Also, a decoded picture is generated and output by performing decoding processing using the simultaneously output syntax value. The picture encoding unit 21 uses inter prediction or inter-layer image prediction using an input reference picture list to generate a syntax value or a decoded picture.

可変長符号化部は、入力されるシンタックス値を可変長符号化して対象レイヤ符号化データＤＡＴＡ＃Ｔとして出力する。 The variable length coding unit performs variable length coding on the input syntax value, and outputs the result as target layer coded data DATA # T.

ＮＡＬ多重化部２３は、入力される対象レイヤ符号化データＤＡＴＡ＃Ｔと、参照レイヤ符号化データＤＡＴＡ＃ＲとをＮＡＬユニットに格納することでＮＡＬ多重化した階層動画像符号化データＤＡＴＡを生成し、外部に出力する。 The NAL multiplexing unit 23 generates NAL-multiplexed hierarchical moving image encoded data DATA by storing the input target layer encoded data DATA # T and reference layer encoded data DATA # R in the NAL unit. And output to the outside.

復号ピクチャ管理部１４は、既に説明した階層動画像復号装置１の備える復号ピクチャ管理部１４と同一の構成要素である。ただし、階層動画像符号化装置２の備える復号ピクチャ管理部１４では、DPBに記録されたピクチャを出力ピクチャとして出力する必要はないため、当該出力は省略できる。なお、階層動画像復号装置１の復号ピクチャ管理部１４の説明において「復号」として説明した記載は「符号化」と置き換えることで、階層動画像符号化装置２の復号ピクチャ管理部１４にも適用できる。 The decoded picture management unit 14 is the same component as the decoded picture management unit 14 included in the hierarchical moving image decoding apparatus 1 described above. However, since the decoded picture management unit 14 included in the hierarchical video encoding device 2 does not need to output the picture recorded in the DPB as an output picture, the output can be omitted. Note that the description described as “decoding” in the description of the decoded picture management unit 14 of the hierarchical video decoding device 1 is also applied to the decoded picture management unit 14 of the hierarchical video encoding device 2 by replacing “encoding”. it can.

ベース復号部１５は、既に説明した階層動画像復号装置１の備えるベース復号部１５と同一の構成要素であり、詳細説明は省略する。 The base decoding unit 15 is the same component as the base decoding unit 15 included in the hierarchical video decoding device 1 described above, and detailed description thereof is omitted.

（動画像符号化装置２の効果）
以上説明した本実施形態に係る階層動画像符号化装置２はピクチャ管理部１４を備えおり、前記ピクチャ管理部は、レイヤ間参照ピクチャをDPBに追加し、続いて、当該レイヤ間参照ピクチャの参照マークを「参照使用」に設定するとともに、当該レイヤ間参照ピクチャの出力マークを「出力不要」に設定するベース参照ピクチャ制御部１４４を備えている。したがって、階層動画像符号化装置２は、ピクチャ符号化に用いたレイヤ間参照ピクチャを、レイヤ間参照ピクチャとインター予測の参照ピクチャを区別しないバンピング処理を用いて、不要になった時点でDPBから除去できるため、少ないメモリ平均使用量で入力画像を符号化できる。 (Effect of moving picture coding apparatus 2)
The hierarchical video encoding apparatus 2 according to the present embodiment described above includes the picture management unit 14, which adds the inter-layer reference picture to the DPB, and subsequently refers to the inter-layer reference picture. The base reference picture control unit 144 sets the mark to “use reference” and sets the output mark of the inter-layer reference picture to “output unnecessary”. Therefore, the hierarchical video encoding device 2 uses the bumping process that does not distinguish the inter-layer reference picture and the inter-prediction reference picture from the DPB when the inter-layer reference picture used for the picture encoding becomes unnecessary. Since it can be removed, the input image can be encoded with a small average memory usage.

（他の階層動画像符号化／復号システムへの適用例）
上述した階層動画像符号化装置２及び階層動画像復号装置１は、動画像の送信、受信、記録、再生を行う各種装置に搭載して利用できる。なお、動画像は、カメラ等により撮像された自然動画像であってもよいし、コンピュータ等により生成された人工動画像（ＣＧおよびＧＵＩを含む）であってもよい。 (Application example to other hierarchical video encoding / decoding systems)
The above-described hierarchical video encoding device 2 and hierarchical video decoding device 1 can be used by being mounted on various devices that perform transmission, reception, recording, and reproduction of moving images. The moving image may be a natural moving image captured by a camera or the like, or may be an artificial moving image (including CG and GUI) generated by a computer or the like.

図１８に基づいて、上述した階層動画像符号化装置２および階層動画像復号装置１を、動画像の送信および受信に利用できることを説明する。図１８の（ａ）は、階層動画像符号化装置２を搭載した送信装置ＰＲＯＤ＿Ａの構成を示したブロック図である。 Based on FIG. 18, it will be described that the above-described hierarchical video encoding device 2 and hierarchical video decoding device 1 can be used for transmission and reception of video. FIG. 18A is a block diagram illustrating a configuration of a transmission device PROD_A in which the hierarchical video encoding device 2 is mounted.

図１８の（ａ）に示すように、送信装置ＰＲＯＤ＿Ａは、動画像を符号化することによって符号化データを得る符号化部ＰＲＯＤ＿Ａ１と、符号化部ＰＲＯＤ＿Ａ１が得た符号化データで搬送波を変調することによって変調信号を得る変調部ＰＲＯＤ＿Ａ２と、変調部ＰＲＯＤ＿Ａ２が得た変調信号を送信する送信部ＰＲＯＤ＿Ａ３とを備えている。上述した階層動画像符号化装置２は、この符号化部ＰＲＯＤ＿Ａ１として利用される。 As illustrated in (a) of FIG. 18, the transmission device PROD_A modulates a carrier wave with an encoding unit PROD_A1 that obtains encoded data by encoding a moving image and the encoded data obtained by the encoding unit PROD_A1. Thus, a modulation unit PROD_A2 that obtains a modulation signal and a transmission unit PROD_A3 that transmits the modulation signal obtained by the modulation unit PROD_A2 are provided. The hierarchical moving image encoding apparatus 2 described above is used as the encoding unit PROD_A1.

送信装置ＰＲＯＤ＿Ａは、符号化部ＰＲＯＤ＿Ａ１に入力する動画像の供給源として、動画像を撮像するカメラＰＲＯＤ＿Ａ４、動画像を記録した記録媒体ＰＲＯＤ＿Ａ５、動画像を外部から入力するための入力端子ＰＲＯＤ＿Ａ６、及び、画像を生成または加工する画像処理部Ａ７を更に備えていてもよい。図１８の（ａ）においては、これら全てを送信装置ＰＲＯＤ＿Ａが備えた構成を例示しているが、一部を省略しても構わない。 The transmission device PROD_A is a camera PROD_A4 that captures a moving image, a recording medium PROD_A5 that records the moving image, an input terminal PROD_A6 that inputs the moving image from the outside, as a supply source of the moving image input to the encoding unit PROD_A1. An image processing unit A7 that generates or processes an image may be further provided. FIG. 18A illustrates a configuration in which the transmission apparatus PROD_A includes all of these, but a part of the configuration may be omitted.

なお、記録媒体ＰＲＯＤ＿Ａ５は、符号化されていない動画像を記録したものであってもよいし、伝送用の符号化方式とは異なる記録用の符号化方式で符号化された動画像を記録したものであってもよい。後者の場合、記録媒体ＰＲＯＤ＿Ａ５と符号化部ＰＲＯＤ＿Ａ１との間に、記録媒体ＰＲＯＤ＿Ａ５から読み出した符号化データを記録用の符号化方式に従って復号する復号部（不図示）を介在させるとよい。 The recording medium PROD_A5 may be a recording of a non-encoded moving image, or a recording of a moving image encoded by a recording encoding scheme different from the transmission encoding scheme. It may be a thing. In the latter case, a decoding unit (not shown) for decoding the encoded data read from the recording medium PROD_A5 according to the recording encoding method may be interposed between the recording medium PROD_A5 and the encoding unit PROD_A1.

図１８の（ｂ）は、階層動画像復号装置１を搭載した受信装置ＰＲＯＤ＿Ｂの構成を示したブロック図である。図１８の（ｂ）に示すように、受信装置ＰＲＯＤ＿Ｂは、変調信号を受信する受信部ＰＲＯＤ＿Ｂ１と、受信部ＰＲＯＤ＿Ｂ１が受信した変調信号を復調することによって符号化データを得る復調部ＰＲＯＤ＿Ｂ２と、復調部ＰＲＯＤ＿Ｂ２が得た符号化データを復号することによって動画像を得る復号部ＰＲＯＤ＿Ｂ３とを備えている。上述した階層動画像復号装置１は、この復号部ＰＲＯＤ＿Ｂ３として利用される。 FIG. 18B is a block diagram illustrating a configuration of a receiving device PROD_B in which the hierarchical video decoding device 1 is mounted. As illustrated in FIG. 18B, the reception device PROD_B includes a reception unit PROD_B1 that receives a modulation signal, a demodulation unit PROD_B2 that obtains encoded data by demodulating the modulation signal received by the reception unit PROD_B1, and a demodulation A decoding unit PROD_B3 that obtains a moving image by decoding the encoded data obtained by the unit PROD_B2. The above-described hierarchical video decoding device 1 is used as the decoding unit PROD_B3.

受信装置ＰＲＯＤ＿Ｂは、復号部ＰＲＯＤ＿Ｂ３が出力する動画像の供給先として、動画像を表示するディスプレイＰＲＯＤ＿Ｂ４、動画像を記録するための記録媒体ＰＲＯＤ＿Ｂ５、及び、動画像を外部に出力するための出力端子ＰＲＯＤ＿Ｂ６を更に備えていてもよい。図１８の（ｂ）においては、これら全てを受信装置ＰＲＯＤ＿Ｂが備えた構成を例示しているが、一部を省略しても構わない。 The receiving device PROD_B has a display PROD_B4 for displaying a moving image, a recording medium PROD_B5 for recording the moving image, and an output terminal for outputting the moving image to the outside as a supply destination of the moving image output by the decoding unit PROD_B3. PROD_B6 may be further provided. FIG. 18B illustrates a configuration in which the reception apparatus PROD_B includes all of these, but a part may be omitted.

なお、記録媒体ＰＲＯＤ＿Ｂ５は、符号化されていない動画像を記録するためのものであってもよいし、伝送用の符号化方式とは異なる記録用の符号化方式で符号化されたものであってもよい。後者の場合、復号部ＰＲＯＤ＿Ｂ３と記録媒体ＰＲＯＤ＿Ｂ５との間に、復号部ＰＲＯＤ＿Ｂ３から取得した動画像を記録用の符号化方式に従って符号化する符号化部（不図示）を介在させるとよい。 The recording medium PROD_B5 may be used for recording a non-encoded moving image, or may be encoded using a recording encoding method different from the transmission encoding method. May be. In the latter case, an encoding unit (not shown) for encoding the moving image acquired from the decoding unit PROD_B3 according to the recording encoding method may be interposed between the decoding unit PROD_B3 and the recording medium PROD_B5.

なお、変調信号を伝送する伝送媒体は、無線であってもよいし、有線であってもよい。また、変調信号を伝送する伝送態様は、放送（ここでは、送信先が予め特定されていない送信態様を指す）であってもよいし、通信（ここでは、送信先が予め特定されている送信態様を指す）であってもよい。すなわち、変調信号の伝送は、無線放送、有線放送、無線通信、及び有線通信の何れによって実現してもよい。 Note that the transmission medium for transmitting the modulation signal may be wireless or wired. Further, the transmission mode for transmitting the modulated signal may be broadcasting (here, a transmission mode in which the transmission destination is not specified in advance) or communication (here, transmission in which the transmission destination is specified in advance). Refers to the embodiment). That is, the transmission of the modulation signal may be realized by any of wireless broadcasting, wired broadcasting, wireless communication, and wired communication.

例えば、地上デジタル放送の放送局（放送設備など）／受信局（テレビジョン受像機など）は、変調信号を無線放送で送受信する送信装置ＰＲＯＤ＿Ａ／受信装置ＰＲＯＤ＿Ｂの一例である。また、ケーブルテレビ放送の放送局（放送設備など）／受信局（テレビジョン受像機など）は、変調信号を有線放送で送受信する送信装置ＰＲＯＤ＿Ａ／受信装置ＰＲＯＤ＿Ｂの一例である。 For example, a terrestrial digital broadcast broadcasting station (such as broadcasting equipment) / receiving station (such as a television receiver) is an example of a transmitting device PROD_A / receiving device PROD_B that transmits and receives a modulated signal by wireless broadcasting. Further, a broadcasting station (such as broadcasting equipment) / receiving station (such as a television receiver) of cable television broadcasting is an example of a transmitting device PROD_A / receiving device PROD_B that transmits and receives a modulated signal by cable broadcasting.

また、インターネットを用いたＶＯＤ（Video On Demand）サービスや動画共有サービスなどのサーバ（ワークステーションなど）／クライアント（テレビジョン受像機、パーソナルコンピュータ、スマートフォンなど）は、変調信号を通信で送受信する送信装置ＰＲＯＤ＿Ａ／受信装置ＰＲＯＤ＿Ｂの一例である（通常、ＬＡＮにおいては伝送媒体として無線又は有線の何れかが用いられ、ＷＡＮにおいては伝送媒体として有線が用いられる）。ここで、パーソナルコンピュータには、デスクトップ型ＰＣ、ラップトップ型ＰＣ、及びタブレット型ＰＣが含まれる。また、スマートフォンには、多機能携帯電話端末も含まれる。 Also, a server (workstation or the like) / client (television receiver, personal computer, smartphone, etc.) such as a VOD (Video On Demand) service or a video sharing service using the Internet transmits and receives a modulated signal by communication. This is an example of PROD_A / reception device PROD_B (usually, either a wireless or wired transmission medium is used in a LAN, and a wired transmission medium is used in a WAN). Here, the personal computer includes a desktop PC, a laptop PC, and a tablet PC. The smartphone also includes a multi-function mobile phone terminal.

なお、動画共有サービスのクライアントは、サーバからダウンロードした符号化データを復号してディスプレイに表示する機能に加え、カメラで撮像した動画像を符号化してサーバにアップロードする機能を有している。すなわち、動画共有サービスのクライアントは、送信装置ＰＲＯＤ＿Ａ及び受信装置ＰＲＯＤ＿Ｂの双方として機能する。 Note that the client of the video sharing service has a function of encoding a moving image captured by a camera and uploading it to the server in addition to a function of decoding the encoded data downloaded from the server and displaying it on the display. That is, the client of the video sharing service functions as both the transmission device PROD_A and the reception device PROD_B.

図１９に基づいて、上述した階層動画像符号化装置２および階層動画像復号装置１を、動画像の記録および再生に利用できることを説明する。図１９の（ａ）は、上述した階層動画像符号化装置２を搭載した記録装置ＰＲＯＤ＿Ｃの構成を示したブロック図である。 Based on FIG. 19, it will be described that the above-described hierarchical moving image encoding device 2 and hierarchical moving image decoding device 1 can be used for recording and reproduction of moving images. FIG. 19A is a block diagram illustrating a configuration of a recording apparatus PROD_C in which the above-described hierarchical video encoding apparatus 2 is mounted.

図１９の（ａ）に示すように、記録装置ＰＲＯＤ＿Ｃは、動画像を符号化することによって符号化データを得る符号化部ＰＲＯＤ＿Ｃ１と、符号化部ＰＲＯＤ＿Ｃ１が得た符号化データを記録媒体ＰＲＯＤ＿Ｍに書き込む書込部ＰＲＯＤ＿Ｃ２と、を備えている。上述した階層動画像符号化装置２は、この符号化部ＰＲＯＤ＿Ｃ１として利用される。 As shown in FIG. 19A, the recording apparatus PROD_C has an encoding unit PROD_C1 that obtains encoded data by encoding a moving image, and the encoded data obtained by the encoding unit PROD_C1 on the recording medium PROD_M. A writing unit PROD_C2 for writing. The hierarchical moving image encoding device 2 described above is used as the encoding unit PROD_C1.

なお、記録媒体ＰＲＯＤ＿Ｍは、（１）ＨＤＤ（Hard Disk Drive）やＳＳＤ(Solid State Drive)等のように、記録装置ＰＲＯＤ＿Ｃに内蔵されるタイプのものであってもよいし、（２）ＳＤメモリカードやＵＳＢ（Universal Serial Bus）フラッシュメモリ等のように、記録装置ＰＲＯＤ＿Ｃに接続されるタイプのものであってもよいし、（３）ＤＶＤ（Digital Versatile Disc）やＢＤ（Blu-ray Disc:登録商標）等のように、記録装置ＰＲＯＤ＿Ｃに内蔵されたドライブ装置（不図示）に装填されるものであってもよい。 The recording medium PROD_M may be of a type built in the recording device PROD_C, such as (1) HDD (Hard Disk Drive) or SSD (Solid State Drive), or (2) SD memory. It may be of the type connected to the recording device PROD_C, such as a card or USB (Universal Serial Bus) flash memory, or (3) DVD (Digital Versatile Disc) or BD (Blu-ray Disc: registration) For example, it may be loaded into a drive device (not shown) built in the recording device PROD_C.

また、記録装置ＰＲＯＤ＿Ｃは、符号化部ＰＲＯＤ＿Ｃ１に入力する動画像の供給源として、動画像を撮像するカメラＰＲＯＤ＿Ｃ３、動画像を外部から入力するための入力端子ＰＲＯＤ＿Ｃ４、動画像を受信するための受信部ＰＲＯＤ＿Ｃ５、及び、画像を生成または加工する画像処理部Ｃ６を更に備えていてもよい。図１９の（ａ）においては、これら全てを記録装置ＰＲＯＤ＿Ｃが備えた構成を例示しているが、一部を省略しても構わない。 The recording device PROD_C is a camera PROD_C3 that captures moving images as a supply source of moving images to be input to the encoding unit PROD_C1, an input terminal PROD_C4 for inputting moving images from the outside, and reception for receiving moving images. The unit PROD_C5 and an image processing unit C6 that generates or processes an image may be further provided. FIG. 19A illustrates a configuration in which the recording apparatus PROD_C includes all of these, but a part of the configuration may be omitted.

なお、受信部ＰＲＯＤ＿Ｃ５は、符号化されていない動画像を受信するものであってもよいし、記録用の符号化方式とは異なる伝送用の符号化方式で符号化された符号化データを受信するものであってもよい。後者の場合、受信部ＰＲＯＤ＿Ｃ５と符号化部ＰＲＯＤ＿Ｃ１との間に、伝送用の符号化方式で符号化された符号化データを復号する伝送用復号部（不図示）を介在させるとよい。 The receiving unit PROD_C5 may receive a non-encoded moving image, or may receive encoded data encoded by a transmission encoding scheme different from the recording encoding scheme. You may do. In the latter case, a transmission decoding unit (not shown) that decodes encoded data encoded by the transmission encoding method may be interposed between the reception unit PROD_C5 and the encoding unit PROD_C1.

このような記録装置ＰＲＯＤ＿Ｃとしては、例えば、ＤＶＤレコーダ、ＢＤレコーダ、ＨＤＤ（Hard Disk Drive）レコーダなどが挙げられる（この場合、入力端子ＰＲＯＤ＿Ｃ４又は受信部ＰＲＯＤ＿Ｃ５が動画像の主な供給源となる）。また、カムコーダ（この場合、カメラＰＲＯＤ＿Ｃ３が動画像の主な供給源となる）、パーソナルコンピュータ（この場合、受信部ＰＲＯＤ＿Ｃ５又は画像処理部Ｃ６が動画像の主な供給源となる）、スマートフォン（この場合、カメラＰＲＯＤ＿Ｃ３又は受信部ＰＲＯＤ＿Ｃ５が動画像の主な供給源となる）なども、このような記録装置ＰＲＯＤ＿Ｃの一例である。 Examples of such a recording device PROD_C include a DVD recorder, a BD recorder, and an HDD (Hard Disk Drive) recorder (in this case, the input terminal PROD_C4 or the receiving unit PROD_C5 is a main supply source of moving images). . In addition, a camcorder (in this case, the camera PROD_C3 is a main source of moving images), a personal computer (in this case, the receiving unit PROD_C5 or the image processing unit C6 is a main source of moving images), a smartphone (in this case In this case, the camera PROD_C3 or the receiving unit PROD_C5 is a main supply source of moving images) is also an example of such a recording device PROD_C.

図１９の（ｂ）は、上述した階層動画像復号装置１を搭載した再生装置ＰＲＯＤ＿Ｄの構成を示したブロックである。図１９の（ｂ）に示すように、再生装置ＰＲＯＤ＿Ｄは、記録媒体ＰＲＯＤ＿Ｍに書き込まれた符号化データを読み出す読出部ＰＲＯＤ＿Ｄ１と、読出部ＰＲＯＤ＿Ｄ１が読み出した符号化データを復号することによって動画像を得る復号部ＰＲＯＤ＿Ｄ２と、を備えている。上述した階層動画像復号装置１は、この復号部ＰＲＯＤ＿Ｄ２として利用される。 FIG. 19B is a block diagram illustrating a configuration of a playback device PROD_D in which the above-described hierarchical video decoding device 1 is mounted. As shown in (b) of FIG. 19, the playback device PROD_D reads a moving image by decoding a read unit PROD_D1 that reads encoded data written on the recording medium PROD_M and a coded data read by the read unit PROD_D1. And a decoding unit PROD_D2 to be obtained. The hierarchical moving image decoding apparatus 1 described above is used as the decoding unit PROD_D2.

なお、記録媒体ＰＲＯＤ＿Ｍは、（１）ＨＤＤやＳＳＤなどのように、再生装置ＰＲＯＤ＿Ｄに内蔵されるタイプのものであってもよいし、（２）ＳＤメモリカードやＵＳＢフラッシュメモリなどのように、再生装置ＰＲＯＤ＿Ｄに接続されるタイプのものであってもよいし、（３）ＤＶＤやＢＤなどのように、再生装置ＰＲＯＤ＿Ｄに内蔵されたドライブ装置（不図示）に装填されるものであってもよい。 Note that the recording medium PROD_M may be of the type built into the playback device PROD_D, such as (1) HDD or SSD, or (2) such as an SD memory card or USB flash memory, It may be of a type connected to the playback device PROD_D, or (3) may be loaded into a drive device (not shown) built in the playback device PROD_D, such as DVD or BD. Good.

また、再生装置ＰＲＯＤ＿Ｄは、復号部ＰＲＯＤ＿Ｄ２が出力する動画像の供給先として、動画像を表示するディスプレイＰＲＯＤ＿Ｄ３、動画像を外部に出力するための出力端子ＰＲＯＤ＿Ｄ４、及び、動画像を送信する送信部ＰＲＯＤ＿Ｄ５を更に備えていてもよい。図１９の（ｂ）においては、これら全てを再生装置ＰＲＯＤ＿Ｄが備えた構成を例示しているが、一部を省略しても構わない。 In addition, the playback device PROD_D has a display PROD_D3 that displays a moving image, an output terminal PROD_D4 that outputs the moving image to the outside, and a transmission unit that transmits the moving image as a supply destination of the moving image output by the decoding unit PROD_D2. PROD_D5 may be further provided. FIG. 19B illustrates a configuration in which the playback apparatus PROD_D includes all of these, but some of the configurations may be omitted.

なお、送信部ＰＲＯＤ＿Ｄ５は、符号化されていない動画像を送信するものであってもよいし、記録用の符号化方式とは異なる伝送用の符号化方式で符号化された符号化データを送信するものであってもよい。後者の場合、復号部ＰＲＯＤ＿Ｄ２と送信部ＰＲＯＤ＿Ｄ５との間に、動画像を伝送用の符号化方式で符号化する符号化部（不図示）を介在させるとよい。 The transmission unit PROD_D5 may transmit an unencoded moving image, or transmits encoded data encoded by a transmission encoding method different from the recording encoding method. You may do. In the latter case, it is preferable to interpose an encoding unit (not shown) that encodes a moving image with an encoding method for transmission between the decoding unit PROD_D2 and the transmission unit PROD_D5.

このような再生装置ＰＲＯＤ＿Ｄとしては、例えば、ＤＶＤプレイヤ、ＢＤプレイヤ、ＨＤＤプレイヤなどが挙げられる（この場合、テレビジョン受像機等が接続される出力端子ＰＲＯＤ＿Ｄ４が動画像の主な供給先となる）。また、テレビジョン受像機（この場合、ディスプレイＰＲＯＤ＿Ｄ３が動画像の主な供給先となる）、デジタルサイネージ（電子看板や電子掲示板等とも称され、ディスプレイＰＲＯＤ＿Ｄ３又は送信部ＰＲＯＤ＿Ｄ５が動画像の主な供給先となる）、デスクトップ型ＰＣ（この場合、出力端子ＰＲＯＤ＿Ｄ４又は送信部ＰＲＯＤ＿Ｄ５が動画像の主な供給先となる）、ラップトップ型又はタブレット型ＰＣ（この場合、ディスプレイＰＲＯＤ＿Ｄ３又は送信部ＰＲＯＤ＿Ｄ５が動画像の主な供給先となる）、スマートフォン（この場合、ディスプレイＰＲＯＤ＿Ｄ３又は送信部ＰＲＯＤ＿Ｄ５が動画像の主な供給先となる）なども、このような再生装置ＰＲＯＤ＿Ｄの一例である。 Examples of such a playback device PROD_D include a DVD player, a BD player, and an HDD player (in this case, an output terminal PROD_D4 to which a television receiver or the like is connected is a main supply destination of moving images). . In addition, a television receiver (in this case, the display PROD_D3 is a main supply destination of moving images), a digital signage (also referred to as an electronic signboard or an electronic bulletin board), and the display PROD_D3 or the transmission unit PROD_D5 is the main supply of moving images. Desktop PC (in this case, the output terminal PROD_D4 or the transmission unit PROD_D5 is the main video image supply destination), laptop or tablet PC (in this case, the display PROD_D3 or the transmission unit PROD_D5 is a moving image) A smartphone (which is a main image supply destination), a smartphone (in this case, the display PROD_D3 or the transmission unit PROD_D5 is a main moving image supply destination), and the like are also examples of such a playback device PROD_D.

（ハードウェア的実現およびソフトウェア的実現について）
最後に、階層動画像復号装置１、階層動画像符号化装置２の各ブロックは、集積回路（ＩＣチップ）上に形成された論理回路によってハードウェア的に実現してもよいし、ＣＰＵ（Central Processing Unit）を用いてソフトウェア的に実現してもよい。 (About hardware implementation and software implementation)
Finally, each block of the hierarchical video decoding device 1 and the hierarchical video encoding device 2 may be realized in hardware by a logic circuit formed on an integrated circuit (IC chip), or may be a CPU (Central It may be realized by software using a Processing Unit).

後者の場合、上記各装置は、各機能を実現する制御プログラムの命令を実行するＣＰＵ、上記プログラムを格納したＲＯＭ（Read Only Memory）、上記プログラムを展開するＲＡＭ（Random Access Memory）、上記プログラムおよび各種データを格納するメモリ等の記憶装置（記録媒体）などを備えている。そして、本発明の目的は、上述した機能を実現するソフトウェアである上記各装置の制御プログラムのプログラムコード（実行形式プログラム、中間コードプログラム、ソースプログラム）をコンピュータで読み取り可能に記録した記録媒体を、上記各装置に供給し、そのコンピュータ（またはＣＰＵやＭＰＵ（Micro Processing Unit））が記録媒体に記録されているプログラムコードを読み出し実行することによっても、達成可能である。 In the latter case, each of the devices includes a CPU that executes instructions of a control program that realizes each function, a ROM (Read Only Memory) that stores the program, a RAM (Random Access Memory) that expands the program, the program, and A storage device (recording medium) such as a memory for storing various data is provided. An object of the present invention is to provide a recording medium in which a program code (execution format program, intermediate code program, source program) of a control program for each of the above devices, which is software that realizes the above-described functions, is recorded in a computer-readable manner. This can also be achieved by supplying to each of the above devices and reading and executing the program code recorded on the recording medium by the computer (or CPU or MPU (Micro Processing Unit)).

上記記録媒体としては、例えば、磁気テープやカセットテープ等のテープ類、フロッピー（登録商標）ディスク／ハードディスク等の磁気ディスクやＣＤ−ＲＯＭ（Compact Disc Read-Only Memory）／ＭＯ（Magneto-Optical）／ＭＤ（Mini Disc）／ＤＶＤ（Digital Versatile Disk）／ＣＤ−Ｒ（CD Recordable）等の光ディスクを含むディスク類、ＩＣカード（メモリカードを含む）／光カード等のカード類、マスクＲＯＭ／ＥＰＲＯＭ（Erasable Programmable Read-only Memory）／ＥＥＰＲＯＭ（登録商標）（Electrically Erasable and Programmable Read-only Memory）／フラッシュＲＯＭ等の半導体メモリ類、あるいはＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等の論理回路類などを用いることができる。 Examples of the recording medium include tapes such as magnetic tapes and cassette tapes, magnetic disks such as floppy (registered trademark) disks / hard disks, CD-ROM (Compact Disc Read-Only Memory) / MO (Magneto-Optical) / Disks including optical disks such as MD (Mini Disc) / DVD (Digital Versatile Disk) / CD-R (CD Recordable), cards such as IC cards (including memory cards) / optical cards, mask ROM / EPROM (Erasable Programmable Read-only Memory (EEPROM) (Electrically Erasable and Programmable Read-only Memory) / Semiconductor memories such as flash ROM, or logic circuits such as PLD (Programmable Logic Device) and FPGA (Field Programmable Gate Array) Etc. can be used.

また、上記各装置を通信ネットワークと接続可能に構成し、上記プログラムコードを通信ネットワークを介して供給してもよい。この通信ネットワークは、プログラムコードを伝送可能であればよく、特に限定されない。例えば、インターネット、イントラネット、エキストラネット、ＬＡＮ（Local Area Network）、ＩＳＤＮ（Integrated Services Digital Network）、ＶＡＮ（Value-Added Network）、ＣＡＴＶ（Community Antenna Television）通信網、仮想専用網（Virtual Private Network）、電話回線網、移動体通信網、衛星通信網等が利用可能である。また、この通信ネットワークを構成する伝送媒体も、プログラムコードを伝送可能な媒体であればよく、特定の構成または種類のものに限定されない。例えば、ＩＥＥＥ（Institute of Electrical and Electronic Engineers）１３９４、ＵＳＢ、電力線搬送、ケーブルＴＶ回線、電話線、ＡＤＳＬ（Asymmetric Digital Subscriber Line）回線等の有線でも、ＩｒＤＡ（Infrared Data Association）やリモコンのような赤外線、Ｂｌｕｅｔｏｏｔｈ（登録商標）、ＩＥＥＥ８０２．１１無線、ＨＤＲ（High Data Rate）、ＮＦＣ（Near Field Communication）、ＤＬＮＡ（Digital Living Network Alliance）、携帯電話網、衛星回線、地上波デジタル網等の無線でも利用可能である。なお、本発明は、上記プログラムコードが電子的な伝送で具現化された、搬送波に埋め込まれたコンピュータデータ信号の形態でも実現され得る。 Further, each of the above devices may be configured to be connectable to a communication network, and the program code may be supplied via the communication network. The communication network is not particularly limited as long as it can transmit the program code. For example, the Internet, an intranet, an extranet, a LAN (Local Area Network), an ISDN (Integrated Services Digital Network), a VAN (Value-Added Network), a CATV (Community Antenna Television) communication network, a virtual private network (Virtual Private Network), A telephone line network, a mobile communication network, a satellite communication network, etc. can be used. The transmission medium constituting the communication network may be any medium that can transmit the program code, and is not limited to a specific configuration or type. For example, infra-red such as IrDA (Infrared Data Association) or remote control even with wired such as IEEE (Institute of Electrical and Electronic Engineers) 1394, USB, power line carrier, cable TV line, telephone line, ADSL (Asymmetric Digital Subscriber Line) line, etc. , Bluetooth (registered trademark), IEEE 802.11 wireless, HDR (High Data Rate), NFC (Near Field Communication), DLNA (Digital Living Network Alliance), mobile phone network, satellite line, terrestrial digital network, etc. Is possible. The present invention can also be realized in the form of a computer data signal embedded in a carrier wave in which the program code is embodied by electronic transmission.

本発明は上述した各実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、異なる実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 The present invention is not limited to the above-described embodiments, and various modifications are possible within the scope shown in the claims, and embodiments obtained by appropriately combining technical means disclosed in different embodiments. Is also included in the technical scope of the present invention.

本発明は、画像データが階層的に符号化された符号化データを復号する階層画像復号装置、および、画像データが階層的に符号化された符号化データを生成する階層画像符号化装置に好適に適用できる。 The present invention is suitable for a hierarchical image decoding device that decodes encoded data in which image data is hierarchically encoded, and a hierarchical image encoding device that generates encoded data in which image data is hierarchically encoded. Applicable to.

１階層動画像復号装置（画像復号装置）
１１ＮＡＬ逆多重化部
１２可変長復号部（可変長復号手段）
１３ピクチャ復号部（ピクチャ復号手段）
１４復号ピクチャ管理部（復号ピクチャ管理手段）
１４１ＤＰＢ
１４２ＲＰＳ導出部
１４３参照ピクチャ制御部
１４４ベース参照ピクチャ制御部（ベース参照ピクチャ制御手段）
１４５ＲＰＬ導出部
１４６出力制御部（出力制御手段）
１５ベース復号部
１５１可変長復号部（可変長復号手段）
１５３ベースピクチャ復号部
１５４ベース復号ピクチャ管理部（ベース復号ピクチャ管理手段）
２階層動画像符号化装置（画像符号化装置）
２１ピクチャ符号化部
２２可変長符号化部
２３ＮＡＬ多重化部 1. Hierarchical video decoding device (image decoding device)
11 NAL demultiplexing unit 12 Variable length decoding unit (variable length decoding means)
13 Picture decoding unit (picture decoding means)
14 Decoded picture management unit (decoded picture management means)
141 DPB
142 RPS derivation unit 143 Reference picture control unit 144 Base reference picture control unit (base reference picture control means)
145 RPL deriving unit 146 Output control unit (output control means)
15 Base decoding unit 151 Variable length decoding unit (variable length decoding means)
153 Base picture decoding unit 154 Base decoded picture management unit (base decoded picture management means)
2. Hierarchical video encoding device (image encoding device)
21 picture encoding unit 22 variable length encoding unit 23 NAL multiplexing unit

Claims

An image decoding apparatus that decodes higher layer encoded data included in hierarchically encoded data and restores a decoded picture of an upper layer,
A picture management unit for managing a decoded picture buffer of an upper layer;
Prior to the decoded picture derivation process of the target picture in the upper layer, the picture management unit adds an inter-layer reference picture generated based on the decoded picture of the lower layer to the decoded picture buffer, and An image decoding apparatus, wherein the reference mark is set to be used for reference, and the output mark of the inter-layer reference picture is set to be unnecessary.

The picture management unit sets the reference mark of the inter-layer reference picture to non-reference at a timing after the completion of the decoded picture derivation process and before the target picture moves to the next target picture in decoding order. The image decoding apparatus according to claim 1, wherein:

The picture management unit refers to an output mark and a reference mark of each picture in the decoded picture buffer, identifies a picture that does not need to be decoded and output after the target picture, and identifies the identified picture as the decoded picture The image decoding apparatus according to claim 1, wherein a bumping process for discarding from the buffer is executed.

A base decoding unit that generates a base decoded picture that is a decoded picture of a reference layer picture;
The base decoding unit includes a base decoded picture management unit that manages a base decoded picture buffer that is a decoded picture buffer of the reference layer,
4. The image decoding apparatus according to claim 3, wherein the base picture management unit determines a picture to be discarded from the decoded picture buffer by the same bumping process as the bumping process executed by the picture management unit.

An image decoding apparatus that decodes higher layer encoded data included in hierarchically encoded data and restores a decoded picture of an upper layer,
A picture management unit for managing a decoded picture buffer common to a plurality of layers;
The picture management unit determines whether to apply inter prediction between pictures in the same layer in a sublayer to which the target picture belongs, and when the determination result indicates that inter prediction application between pictures in the same layer is not applicable, the target picture An image decoding apparatus characterized in that the reference mark is set not to be used.

6. The image decoding apparatus according to claim 5, wherein the determination of whether or not to apply inter prediction between pictures in the same layer is applied only to pictures belonging to the highest order sublayer.

An image decoding apparatus that decodes higher layer encoded data included in hierarchically encoded data and restores a decoded picture of an upper layer,
A picture management unit for managing a decoded picture buffer common to a plurality of layers;
A variable length decoding unit that decodes a syntax value from hierarchically encoded data,
The variable length decoding unit decodes the discardable flag as a syntax value,
The discardable flag takes a value of 1 when the target picture is not referenced in inter prediction or inter-layer image prediction of the subsequent picture in decoding order, and the inter-prediction or inter-layer image of the subsequent picture in the decoding order. Takes a value of 0 when referenced in the prediction,
The picture decoding apparatus, wherein the picture management unit updates a reference mark of the target picture according to a value of the discardable flag.

An image encoding device that generates encoded data of an upper layer from an input image,
A picture management unit for managing a decoded picture buffer of an upper layer;
Prior to the decoded picture derivation process of the target picture in the upper layer, the picture management unit adds an inter-layer reference picture generated based on the decoded picture of the lower layer to the decoded picture buffer, and An image encoding apparatus, wherein a reference mark is set to be used for reference, and an output mark of the inter-layer reference picture is set so as not to be output.