JP2015015626A

JP2015015626A - Image decoder and image encoder

Info

Publication number: JP2015015626A
Application number: JP2013141570A
Authority: JP
Inventors: 内海　端; Tadashi Uchiumi; 端内海; 知宏猪飼; Tomohiro Igai
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2013-07-05
Filing date: 2013-07-05
Publication date: 2015-01-22

Abstract

PROBLEM TO BE SOLVED: To reduce redundancy of codes, associated with inter-layer prediction, in decoding processing and encoding processing on a multi-layer image.SOLUTION: An encoder and a decoder use a reference picture set including texture dependence with respect to a dependence type between layers associated with inter-layer prediction and a specifying method for reference picture, and use an alternative reference picture only for a case of the motion dependence. Further, an extension parameter associated with inter-layer prediction is defined in an extension slice header different from a slice header, and when the extension slice header is omitted, the decoder is configured to set a predetermined default value for the extension parameter and to use the default value for decoding of the inter-layer prediction.

Description

本発明は、画像復号装置および画像符号化装置に関する。 The present invention relates to an image decoding device and an image encoding device.

動画像の符号化における基本的な手法の一つとして、時間方向に連続する画像（ピクチャ）間で予測を行うフレーム間予測符号化がある。時間的に近接するピクチャは互いに相関が高くなり、ピクチャ間の差分をとるとその差分値はゼロ近傍に集中するため、差分値を利用することで大幅に符号量を削減することができる。 One of the basic methods for encoding moving images is inter-frame prediction encoding in which prediction is performed between images (pictures) continuous in the time direction. Pictures that are temporally close to each other have a high correlation, and when the difference between the pictures is taken, the difference value is concentrated in the vicinity of zero. Therefore, the code amount can be significantly reduced by using the difference value.

複数のレイヤから構成される動画像の符号化方法は、一般に、スケーラブル符号化又は階層符号化と呼ばれる。ここでいうレイヤとは、時間方向に連続する複数のピクチャで構成される動画像データのセットを意味する。スケーラブル符号化では、前述のフレーム間予測に加えてレイヤ間で予測を行うことで、高い符号化効率を実現する。レイヤ間予測を用いない基準となるレイヤはベースレイヤ、それ以外のレイヤは拡張レイヤと呼ばれる。レイヤが視点画像から構成される場合のスケーラブル符号化を、ビュースケーラブル符号化と呼ぶ。このとき、ベースレイヤはベースビュー、拡張レイヤは非ベースビュー（依存ビュー）とも呼ばれる。 A method for encoding a moving image composed of a plurality of layers is generally referred to as scalable encoding or hierarchical encoding. The layer here means a set of moving image data composed of a plurality of pictures continuous in the time direction. In scalable coding, high coding efficiency is realized by performing prediction between layers in addition to the above-described interframe prediction. A reference layer that does not use inter-layer prediction is called a base layer, and other layers are called enhancement layers. Scalable encoding in the case where a layer is composed of viewpoint images is referred to as view scalable encoding. At this time, the base layer is also called a base view, and the enhancement layer is also called a non-base view (dependent view).

また、スケーラブル符号化には、ビュースケーラブル符号化の他、空間的スケーラブル符号化（ベースレイヤとして解像度の低いピクチャ、拡張レイヤとして解像度の高いピクチャを処理）、ＳＮＲスケーラブル符号化（ベースレイヤとして画質の低いピクチャ、拡張レイヤとして画質の高いピクチャを処理）等がある。スケーラブル符号化では、各レイヤ内の既に符号化／復号された時間的に前後のピクチャを参照ピクチャとして用いることに加え、対象レイヤとは別のレイヤのピクチャを参照レイヤとして用いることができる。レイヤ内での時間的な予測は動き予測と呼ばれ、ピクチャ間のズレを示すベクトルが動きベクトルである。レイヤ間の予測はレイヤ間予測と呼ばれ、ピクチャ間のズレを示すベクトルは変位ベクトルと呼ばれる。なお、ビュースケーラブル符号化など、レイヤが視点画像から構成される場合、変位ベクトルは、視点画像間の視差を表すベクトルであり視差ベクトルとも呼ばれる。 In addition to view scalable coding, scalable coding includes spatial scalable coding (processing low-resolution pictures as a base layer and high-resolution pictures as an extension layer), SNR scalable coding (image quality as a base layer). Low picture, high quality picture as an enhancement layer). In scalable coding, in addition to using temporally preceding and succeeding pictures already encoded / decoded in each layer as reference pictures, pictures in layers other than the target layer can be used as reference layers. Temporal prediction within a layer is called motion prediction, and a vector indicating a shift between pictures is a motion vector. Prediction between layers is called inter-layer prediction, and a vector indicating a shift between pictures is called a displacement vector. In addition, when a layer is comprised from viewpoint images, such as view scalable encoding, a displacement vector is a vector showing the parallax between viewpoint images, and is also called a parallax vector.

動画像符号化方式の国際標準では、スケーラブル符号化の例として、SVC（Scalable Video Coding）やMVC（Multi-view Video Coding）がある。これらは、単一レイヤを対象とするH.264/AVC（Advanced Video Coding）を拡張して、複数レイヤ対応にした規格である。その後、次世代の符号化方式として規格化されたH.265/HEVC（High Efficiency Video Coding；非特許文献１）に関して、その拡張規格として複数レイヤ用のスケーラブル符号化方式の審議が始まっている。 In the international standard of the moving picture coding system, examples of scalable coding include SVC (Scalable Video Coding) and MVC (Multi-view Video Coding). These are standards that support multiple layers by extending H.264 / AVC (Advanced Video Coding) for single layers. Thereafter, with regard to H.265 / HEVC (High Efficiency Video Coding; Non-Patent Document 1) standardized as a next-generation encoding method, a discussion on a scalable encoding method for a plurality of layers has started as an extended standard.

このような符号化方式の拡張においては、既に標準化されたベース部分の符号化方法をできるだけ変更せずにそのまま用いるような後方互換性が重視される。H.265/HEVCをベースとするスケーラブル符号化規格においても、単一レイヤ用の規格で定められた符号化データの構造およびローレベルの処理手順をできるだけ保持したまま、拡張が行われることが望ましい。単一レイヤ用の規格では、対象ピクチャとは異なるピクチャ（ここでは時間的に異なるピクチャ）との間で予測を行うテンポラル動き予測と呼ばれる方法がある。複数レイヤを対象とするスケーラブル符号化では、これに加えて、異なるレイヤのピクチャとの間で予測を行うサンプル予測と呼ばれる方法が利用可能になり、これらを適応的に切り替えまたは組み合わせて利用することで符号化効率の向上を目指している（非特許文献２、非特許文献３）。 In such an extension of the encoding method, importance is attached to backward compatibility such that the already standardized encoding method of the base portion is used as it is without being changed as much as possible. In scalable coding standards based on H.265 / HEVC, it is desirable that the coding data structure and low-level processing procedures defined in the single-layer standard be maintained as much as possible. . In the standard for a single layer, there is a method called temporal motion prediction that performs prediction between a picture different from a target picture (here, a picture that is temporally different). In scalable coding for multiple layers, in addition to this, a method called sample prediction that performs prediction between pictures in different layers can be used, and these can be adaptively switched or combined for use. Is aimed at improving the coding efficiency (Non-patent Document 2, Non-patent Document 3).

“Recommendation ITU-T H.265 High efficiency video coding”, TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU, 04/2013“Recommendation ITU-T H.265 High efficiency video coding”, TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU, 04/2013 “SHVC Working Draft 2”, JCTVC-M1008, JCT-VC, Incheon, KR, 18-26 Apr. 2013“SHVC Working Draft 2”, JCTVC-M1008, JCT-VC, Incheon, KR, 18-26 Apr. 2013 “MV-HEVC Draft Text 4”, JCT3V-D1004, JCT-3V, Incheon, KR, 20-26 Apr. 2013“MV-HEVC Draft Text 4”, JCT3V-D1004, JCT-3V, Incheon, KR, 20-26 Apr. 2013

複数レイヤを対象とするスケーラブル符号化では、レイヤ間の予測符号化の使用に関するフラグやパラメータを伝送する符号が必要になる。このレイヤ間予測に関するフラグやパラメータは、非特許文献２や非特許文献３においてスライスセグメントヘッダと呼ばれるデータ構造の中に含めるように定義されている。しかしながら、スライスセグメントヘッダは単一レイヤ用の規格すなわちH.265/HEVCで既に定義されているデータ構造であるため、複数レイヤに関わる符号を含むように再定義することは、単一レイヤ用に設計・実装された復号装置やソフトウェアに影響を与えることになり、既存のスライスヘッダ処理機能を変更する必要が生じ、実装コストを増大させるという課題がある。 In scalable coding for a plurality of layers, a code for transmitting a flag and parameters related to the use of predictive coding between layers is required. The flags and parameters related to the inter-layer prediction are defined so as to be included in a data structure called a slice segment header in Non-Patent Document 2 and Non-Patent Document 3. However, since the slice segment header is a data structure already defined in the standard for single layer, that is, H.265 / HEVC, redefinition to include codes related to multiple layers is This affects the designed / implemented decoding device and software, and it is necessary to change the existing slice header processing function, which increases the implementation cost.

また、レイヤ間予測における依存関係は、ＶＰＳのレイヤ依存タイプによって、サンプル予測依存であるか、動き予測依存であるか、その両方（サンプル予測依存かつ動き予測依存）であるか、が指定される。しかしながら、非特許文献２および非特許文献３では、レイヤ依存タイプに関わらず、参照ピクチャの選択に用いられる参照ピクチャリスト（ＲＰＳ）に追加できるため、ＲＰＳに追加された参照ピクチャが、サンプル予測が可能であるか、が保障されないという課題がある。 In addition, depending on the layer dependency type of VPS, whether the dependency relationship between layers is sample prediction dependency, motion prediction dependency, or both (sample prediction dependency and motion prediction dependency) is specified. . However, in Non-Patent Document 2 and Non-Patent Document 3, since it can be added to the reference picture list (RPS) used for reference picture selection regardless of the layer-dependent type, the reference picture added to the RPS is sample-predicted. There is a problem that it is not possible or not guaranteed.

また、テンポラル動き予測の参照ピクチャcolPicを指定する方法として、参照ピクチャリスト（ＲＰＳ）から生成される参照ピクチャリスト(RPL)のピクチャと、参照ピクチャインデックスrefIdxで指定する第１の方法と、代替参照ピクチャとして、レイヤIDで指定する第２の方法の２つがある。しかしながら、非特許文献２および非特許文献３では、レイヤ依存タイプが動き予測依存であるか、その両方（サンプル予測依存かつ動き予測依存）に関わらず、第１の方法、第２の方法のいずれもが使用可能であるため、テンポラル動き予測の場合に参照ピクチャcolPicを指定する符号の伝送が冗長になるという課題がある。 Further, as a method of designating a reference picture colPic for temporal motion prediction, a first method of designating a picture in a reference picture list (RPL) generated from a reference picture list (RPS) and a reference picture index refIdx, and an alternative reference As a picture, there are two methods of specifying by a layer ID. However, in Non-Patent Document 2 and Non-Patent Document 3, regardless of whether the layer dependence type is motion prediction dependence or both (sample prediction dependence and motion prediction dependence), either the first method or the second method is used. In the case of temporal motion prediction, there is a problem that transmission of a code specifying the reference picture colPic becomes redundant.

本発明は上記の点に鑑みてなされたものであり、複数レイヤを対象とするスケーラブル符号化において、単一レイヤ用の符号化方式を拡張する場合の実装コスト上の課題、およびレイヤ間予測における参照画像の指定に関する符号の冗長性を解消する画像復号装置、画像復号方法、画像復号プログラム、画像符号化装置、画像符号化方法、及び画像符号化プログラムを提供する。 The present invention has been made in view of the above points, and in scalable coding for a plurality of layers, there is a problem in implementation cost when an encoding method for a single layer is extended, and in inter-layer prediction. Provided are an image decoding device, an image decoding method, an image decoding program, an image encoding device, an image encoding method, and an image encoding program that eliminate code redundancy related to designation of a reference image.

前述の課題を解決するために、本発明による画像復号装置は、
（１）階層符号化された符号化データを復号する画像復号装置であって、
レイヤ依存タイプを復号し、前記レイヤ依存タイプに対応するレイヤがサンプル依存を示すレイヤであるかを判定する手段を備える依存タイプ復号手段と、
前記サンプル依存を示す依存レイヤから、サンプル依存レイヤを抽出し、参照ピクチャセットを導出する参照ピクチャセット復号部と、
前記レイヤ間参照ピクチャセットを用いて導出される参照ピクチャリストで指定される参照ピクチャを用いて予測画像を生成するインター予測画像生成部を備えることを特徴とするものである。 In order to solve the above-described problem, an image decoding apparatus according to the present invention includes:
(1) An image decoding device that decodes hierarchically encoded data,
Dependency type decoding means comprising means for decoding a layer dependency type and determining whether a layer corresponding to the layer dependency type is a layer that exhibits sample dependency;
A reference picture set decoding unit for extracting a sample dependent layer from the dependency layer indicating the sample dependence and deriving a reference picture set;
An inter-prediction image generation unit that generates a prediction image using a reference picture specified by a reference picture list derived using the inter-layer reference picture set is provided.

（２）また、上記の画像復号装置において、
上記依存タイプ復号手段は、パラメータセットの符号化データから、レイヤ依存タイプを復号し、前記レイヤ依存タイプに応じて、依存レイヤがサンプル依存レイヤか否かを示すフラグであるSamplePredEnableFlagを導出し、
上記サンプル依存レイヤ復号手段は、スライスセグメントレイヤの符号化データからレイヤ間依存識別子（inter_layer_pred_layer_idc）を復号し、SamplePredEnableFlagが１である依存レイヤから、サンプル依存レイヤActiveSamplePredRefLayerIdを導出することを特徴とする。 (2) In the above image decoding apparatus,
The dependence type decoding means decodes the layer dependence type from the encoded data of the parameter set, derives SamplePredEnableFlag, which is a flag indicating whether the dependence layer is a sample dependence layer, according to the layer dependence type,
The sample dependent layer decoding means decodes the inter-layer dependency identifier (inter_layer_pred_layer_idc) from the encoded data of the slice segment layer, and derives the sample dependent layer ActiveSamplePredRefLayerId from the dependent layer whose SamplePredEnableFlag is 1.

（３）また、上記画像復号装置は、さらにレイヤ間予測のみフラグinter_layer_sample_onlyフラグを復号するレイヤ間予測のみフラグ復号手段を備え、
上記代替参照ピクチャ復号部は、前記レイヤ依存タイプに応じて、前記サンプル依存レイヤの数NumActiveSamplePredRefLayersを導出し、
上記参照ピクチャセット復号部は、前記レイヤ間予測のみフラグinter_layer_sample_onlyフラグが、レイヤ間予測のみを示す場合には、同一レイヤのピクチャを参照ピクチャリストに入れず、
前記レイヤ間予測のみフラグ復号手段は、動きのみ依存レイヤ数NumActiveSamplePredRefLayersが１以上である場合に、レイヤ間予測のみフラグinter_layer_sample_onlyフラグを復号することを特徴とする。
（４）また、本発明による別の構成の画像復号装置は、
階層符号化された符号化データを復号する画像復号装置であって、
レイヤ依存タイプを復号し、前記レイヤ依存タイプに対応するレイヤが動き依存かつサンプル依存ではないことを示す動き依存のみレイヤであるかを判定する手段を備える依存タイプ復号手段と、
前記動き依存のみレイヤから、代替参照ピクチャを復号する代替参照ピクチャ復号部と、
前記代替参照ピクチャを用いて、テンポラル動きベクトルを導出するインター予測パラメータ導出部を備えることを特徴とする。 (3) The image decoding apparatus further includes an inter-layer prediction only flag decoding means for decoding an inter-layer prediction only flag inter_layer_sample_only flag,
The alternative reference picture decoding unit derives the number NumActiveSamplePredRefLayers of the sample dependent layers according to the layer dependent type,
When the inter-layer prediction only flag inter_layer_sample_only flag indicates only inter-layer prediction, the reference picture set decoding unit does not include pictures in the same layer in the reference picture list,
The inter-layer prediction only flag decoding means decodes the inter-layer prediction only flag inter_layer_sample_only flag when the number of motion-only dependent layers NumActiveSamplePredRefLayers is 1 or more.
(4) Further, an image decoding device having another configuration according to the present invention includes:
An image decoding device that decodes hierarchically encoded data,
Dependency type decoding means comprising means for decoding a layer dependent type and determining whether the layer corresponding to the layer dependent type is a motion dependent only layer indicating that it is motion dependent and not sample dependent;
An alternative reference picture decoding unit for decoding an alternative reference picture from the motion-dependent only layer;
An inter prediction parameter deriving unit for deriving a temporal motion vector using the alternative reference picture is provided.

（５）また、上記の画像復号装置において、
上記代替参照ピクチャ復号部は、前記レイヤ依存タイプに応じて、動き依存かつサンプル依存ではないことを示す依存レイヤである動き依存のみレイヤの数NumActiveMotionPredRefLayersを導出し、前記動きのみ依存レイヤ数NumActiveMotionPredRefLayersが１以上である場合に、スライスセグメントレイヤの符号化データから代替参照ピクチャを用いるかどうかを示すフラグalt_collocated_indication_flagを復号することを特徴とする。 (5) In the above image decoding apparatus,
The alternative reference picture decoding unit derives the number of motion dependent only layers NumActiveMotionPredRefLayers, which is a dependent layer indicating that it is motion dependent and not sample dependent, according to the layer dependent type, and the number of motion only dependent layers NumActiveMotionPredRefLayers is 1. In the case described above, the flag alt_collocated_indication_flag indicating whether or not to use the alternative reference picture is decoded from the encoded data of the slice segment layer.

（６）また、上記の画像復号装置において、
上記代替参照ピクチャ復号部は、前記レイヤ依存タイプに応じて、動き依存かつサンプル依存ではないことを示す依存レイヤである動き依存のみレイヤActivnMotionPredRefLayerIdを導出し、前記動き依存のみレイヤを示すフラグであるMotionPredEnableFlagを参照し、前記動き依存のみレイヤから、代替参照ピクチャを特定する識別子collocated_ref_layer_idxを復号することを特徴とする。 (6) In the above image decoding apparatus,
The alternative reference picture decoding unit derives a motion-dependent only layer ActivnMotionPredRefLayerId, which is a dependent layer indicating that it is motion-dependent and not sample-dependent, according to the layer-dependent type, and a MotionPredEnableFlag that is a flag indicating only the motion-dependent layer The identifier collocated_ref_layer_idx that identifies the alternative reference picture is decoded from the motion-dependent only layer.

（７）本発明による画像符号化装置は、
複数レイヤの画像データを階層符号化する画像符号化装置であって、
非ベースレイヤのピクチャの符号化方法に関するパラメータとして、少なくともレイヤ間のサンプル予測に関する実依存レイヤ情報とレイヤ間の動き予測に関する実依存レイヤ情報を符号化するヘッダ情報符号化手段を備え、
該ヘッダ情報符号化手段は、対象レイヤに関するサンプル予測参照レイヤ数（NumSamplePredRefLayers）に基づいて、レイヤ間予測参照ピクチャ数（num_inter_layer_ref_pics_minus1）およびレイヤ間依存識別子（inter_layer_pred_layer_idc）の符号の有無、符号長および値の範囲を定めて符号化すると共に、符号化された前記レイヤ間予測参照ピクチャ数および前記レイヤ間依存識別子を、レイヤ間のサンプル予測に関する実依存レイヤ情報として非ベースレイヤのピクチャの符号化に用いることを特徴とする。 (7) An image encoding device according to the present invention includes:
An image encoding apparatus that hierarchically encodes multiple layers of image data,
A header information encoding means for encoding at least actual dependent layer information related to sample prediction between layers and actual dependent layer information related to motion prediction between layers as a parameter related to a coding method of a non-base layer picture;
The header information encoding means, based on the number of sample prediction reference layers (NumSamplePredRefLayers) for the target layer, the presence / absence of codes of the inter-layer prediction reference pictures (num_inter_layer_ref_pics_minus1) and the inter-layer dependency identifier (inter_layer_pred_layer_idc), the code length, and the value Encode the range and use the encoded inter-layer prediction reference picture number and the inter-layer dependency identifier for encoding non-base layer pictures as actual dependent layer information related to inter-layer sample prediction. It is characterized by.

（８）本発明によるさらに別の構成の画像復号装置は、
階層符号化された符号化データを復号する画像復号装置であって、
非ベースレイヤのピクチャの符号化方法に関するパラメータとして、少なくともレイヤnuh_layer_idに関する依存レイヤ数（NumDirectRefLayers[nuh_layer_id]）を復号するパラメータセット復号手段と、
ベースレイヤと非ベースレイヤで共通の符号化パラメータを復号するスライスセグメントヘッダ基本復号手段と、
非ベースレイヤに関する符号化パラメータを復号するスライスセグメントヘッダ拡張復号手段とを備え、
前記スライスセグメントヘッダ拡張復号手段は、非ベースレイヤのピクチャのレイヤ間予測の有無（inter_layer_pred_enable_flag）、レイヤ間予測参照ピクチャ数（num_inter_layer_ref_pics_minus1）、レイヤ間依存識別子（inter_layer_pred_layer_idc[i]）を示す各パラメータを復号し、符号化データ中に前記各シンタックスが含まれない場合には、inter_layer_pred_enable_flagをレイヤ間予測を行うことを１に、num_inter_layer_ref_pics_minus1をNumDirectRefLayers[ nuh_layer_id ] - 1に、inter_layer_pred_layer_idc[i]をパラメータセットで復号した依存レイヤをそのまま用いることを示すiに設定することを特徴とする。 (8) An image decoding apparatus having a further configuration according to the present invention is as follows:
An image decoding device that decodes hierarchically encoded data,
Parameter set decoding means for decoding at least the number of dependent layers (NumDirectRefLayers [nuh_layer_id]) related to the layer nuh_layer_id as a parameter related to the encoding method of the non-base layer picture;
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extended decoding means decodes each parameter indicating presence / absence of inter-layer prediction of a non-base layer picture (inter_layer_pred_enable_flag), number of inter-layer prediction reference pictures (num_inter_layer_ref_pics_minus1), and inter-layer dependency identifier (inter_layer_pred_layer_idc [i]) If each syntax is not included in the encoded data, inter_layer_pred_enable_flag is set to 1 for inter-layer prediction, num_inter_layer_ref_pics_minus1 is decoded to NumDirectRefLayers [nuh_layer_id]-1, and inter_layer_pred_layer_idc [i] is decoded using a parameter set The dependency layer is set to i indicating that the dependency layer is used as it is.

（９）本発明によるさらに別の構成の画像復号装置は、
階層符号化された符号化データを復号する画像復号装置であって、
ベースレイヤと非ベースレイヤで共通の符号化パラメータを復号するスライスセグメントヘッダ基本復号手段と、
非ベースレイヤに関する符号化パラメータを復号するスライスセグメントヘッダ拡張復号手段とを備え、
前記スライスセグメントヘッダ拡張復号手段は、
非ベースレイヤのピクチャにおけるテンポラル動きベクトル予測時の参照ピクチャを示す動き依存レイヤ代替情報（alt_collocated_indication_flag, collocated_ref_layer_idx）を復号する動き依存レイヤ代替情報復号手段と、
復号された前記動き依存レイヤ代替情報で指定されたピクチャを用いて、テンポラル動きベクトルを導出する動きベクトル導出部を備え、
前記動き依存レイヤ代替情報復号手段は、符号化データ中に拡張ヘッダ情報が含まれる場合は、前記動き依存レイヤ代替情報を符号化データから復号し、符号化データ中に拡張ヘッダ情報が含まれない場合には、動き依存レイヤ代替情報として、代替参照レイヤを用いないことを示すalt_collocated_indication_flag=0を復号することを特徴とする。 (9) An image decoding device having a further configuration according to the present invention is as follows:
An image decoding device that decodes hierarchically encoded data,
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means includes:
Motion-dependent layer replacement information decoding means for decoding motion-dependent layer replacement information (alt_collocated_indication_flag, collocated_ref_layer_idx) indicating a reference picture at the time of temporal motion vector prediction in a non-base layer picture;
A motion vector deriving unit for deriving a temporal motion vector using a picture specified by the decoded motion dependent layer alternative information;
The motion dependent layer alternative information decoding means decodes the motion dependent layer alternative information from the encoded data when the encoded data includes extended header information, and the encoded data does not include the extended header information In this case, alt_collocated_indication_flag = 0 indicating that an alternative reference layer is not used is decoded as motion-dependent layer alternative information.

（１０）本発明によるさらに別の構成の画像復号装置は、
階層符号化された符号化データを復号する画像復号装置であって、
非ベースレイヤのピクチャの符号化方法に関するパラメータとして、少なくとも動き依存レイヤ数（NumMotionPredRefLayers[i]）を復号するパラメータセット復号手段と、
ベースレイヤと非ベースレイヤで共通の符号化パラメータを復号するスライスセグメントヘッダ基本復号手段と、
非ベースレイヤに関する符号化パラメータを復号するスライスセグメントヘッダ拡張復号手段とを備え、
前記スライスセグメントヘッダ拡張復号手段は、
非ベースレイヤのピクチャにおけるテンポラル動きベクトル予測時の参照ピクチャを示す動き依存レイヤ代替情報（alt_collocated_indication_flag, collocated_ref_layer_idx）を復号する動き依存レイヤ代替情報復号手段と、
復号された前記動き依存レイヤ代替情報で指定されたピクチャを用いて、テンポラル動きベクトルを導出する動きベクトル導出部を備え、
前記動き依存レイヤ代替情報復号手段は、符号化データ中に拡張ヘッダ情報が含まれる場合は、前記動き依存レイヤ代替情報を符号化データから復号し、符号化データ中に拡張ヘッダ情報が含まれない場合には、前記パラメータセット復号手段が導出した動き依存レイヤ数（NumMotionPredRefLayers[i]）が０である場合は、動き依存レイヤ代替情報として、代替参照レイヤを用いないことを示すalt_collocated_indication_flag=0を復号し、動き依存レイヤ数（NumMotionPredRefLayers[i]）が０以外である場合は、代替参照レイヤを用いることを示すalt_collocated_indication_flag=1を復号することを特徴とする。 (10) An image decoding device having still another configuration according to the present invention includes:
An image decoding device that decodes hierarchically encoded data,
Parameter set decoding means for decoding at least the number of motion-dependent layers (NumMotionPredRefLayers [i]) as a parameter related to the non-base layer picture encoding method;
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means includes:
Motion-dependent layer replacement information decoding means for decoding motion-dependent layer replacement information (alt_collocated_indication_flag, collocated_ref_layer_idx) indicating a reference picture at the time of temporal motion vector prediction in a non-base layer picture;
A motion vector deriving unit for deriving a temporal motion vector using a picture specified by the decoded motion dependent layer alternative information;
The motion dependent layer alternative information decoding means decodes the motion dependent layer alternative information from the encoded data when the encoded data includes extended header information, and the encoded data does not include the extended header information In this case, when the number of motion-dependent layers (NumMotionPredRefLayers [i]) derived by the parameter set decoding unit is 0, alt_collocated_indication_flag = 0 indicating that an alternative reference layer is not used is decoded as motion-dependent layer alternative information If the number of motion-dependent layers (NumMotionPredRefLayers [i]) is other than 0, alt_collocated_indication_flag = 1 indicating that an alternative reference layer is used is decoded.

（１１）本発明によるさらに別の構成の画像復号装置は、
階層符号化された符号化データを復号する画像復号装置であって、
ベースレイヤと非ベースレイヤで共通の符号化パラメータを復号するスライスセグメントヘッダ基本復号手段と、
非ベースレイヤに関する符号化パラメータを復号するスライスセグメントヘッダ拡張復号手段とを備え、
前記スライスセグメントヘッダ拡張復号手段は、
非ベースレイヤのピクチャにおけるテンポラル動きベクトル予測時の参照ピクチャを示す動き依存レイヤ代替情報（alt_collocated_indication_flag, collocated_ref_layer_idx）を復号する動き依存レイヤ代替情報復号手段と、
参照ピクチャ情報（ＲＰＳ、ＲＰＬ）を復号する参照ピクチャ情報復号手段と、
復号された前記動き依存レイヤ代替情報で指定されたピクチャを用いて、テンポラル動きベクトルを導出する動きベクトル導出部を備え、
前記動き依存レイヤ代替情報復号手段は、符号化データ中に拡張ヘッダ情報が含まれる場合は、前記動き依存レイヤ代替情報を符号化データから復号し、符号化データ中に拡張ヘッダ情報が含まれない場合には、前記参照ピクチャ情報復号手段が導出した参照ピクチャ情報に同じ対象レイヤのピクチャが含まれる場合は、動き依存レイヤ代替情報として、代替参照レイヤを用いないことを示すalt_collocated_indication_flag=0を復号し、前記参照ピクチャ情報復号手段が導出した参照ピクチャ情報に同じ対象レイヤのピクチャが含まれない場合は、代替参照レイヤを用いることを示すalt_collocated_indication_flag=1を復号することを特徴とする。 (11) An image decoding device having a further configuration according to the present invention is as follows:
An image decoding device that decodes hierarchically encoded data,
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means includes:
Motion-dependent layer replacement information decoding means for decoding motion-dependent layer replacement information (alt_collocated_indication_flag, collocated_ref_layer_idx) indicating a reference picture at the time of temporal motion vector prediction in a non-base layer picture;
Reference picture information decoding means for decoding reference picture information (RPS, RPL);
A motion vector deriving unit for deriving a temporal motion vector using a picture specified by the decoded motion dependent layer alternative information;
The motion dependent layer alternative information decoding means decodes the motion dependent layer alternative information from the encoded data when the encoded data includes extended header information, and the encoded data does not include the extended header information In this case, when the reference picture information derived by the reference picture information decoding means includes a picture of the same target layer, alt_collocated_indication_flag = 0 indicating that the alternative reference layer is not used is decoded as motion dependent layer alternative information. When the reference picture information derived by the reference picture information decoding means does not include a picture of the same target layer, alt_collocated_indication_flag = 1 indicating that an alternative reference layer is used is decoded.

（１２）本発明によるさらに別の構成の画像復号装置は、
階層符号化された符号化データを復号する画像復号装置であって、
ベースレイヤと非ベースレイヤで共通の符号化パラメータを復号するスライスセグメントヘッダ基本復号手段と、
非ベースレイヤに関する符号化パラメータを復号するスライスセグメントヘッダ拡張復号手段とを備え、
スライスセグメントヘッダ拡張復号手段は、符号化データ中に拡張ヘッダ情報が含まれる場合は、非ベースレイヤのピクチャの復号方法に関するパラメータとして、少なくともテンポラル動きベクトル予測時の参照ピクチャを示すパラメータ（alt_collocated_indication_flag, collocated_ref_layer_idx）を復号し、
符号化データ中に拡張ヘッダ情報が含まれない場合は、基本ヘッダ情報に含まれる符号化パラメータであって、ベースレイヤのテンポラル動きベクトル予測における参照ピクチャを示すパラメータ（collocated_from_0_flag, collocated_ref_idx）に基づいて、非ベースレイヤのピクチャにおける前記各パラメータを決定することを特徴とする。 (12) An image decoding device having still another configuration according to the present invention includes:
An image decoding device that decodes hierarchically encoded data,
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
When the extension header information is included in the encoded data, the slice segment header extension decoding means includes at least parameters indicating a reference picture at the time of temporal motion vector prediction (alt_collocated_indication_flag, collocated_ref_layer_idx )
When extended header information is not included in the encoded data, it is an encoding parameter included in the basic header information, and is based on parameters (collocated_from_0_flag, collocated_ref_idx) indicating a reference picture in temporal motion vector prediction of the base layer, Each of the parameters in the non-base layer picture is determined.

（１３）本発明による別の構成の画像符号化装置は、
複数レイヤの画像データを階層符号化する画像符号化装置であって、
ベースレイヤと非ベースレイヤで共通の符号化パラメータを符号化する基本ヘッダ情報符号化手段と、
非ベースレイヤに関する符号化パラメータを符号化する拡張ヘッダ情報符号化手段とを備え、
拡張ヘッダ情報符号化手段は、非ベースレイヤに関する符号化パラメータのうち、レイヤ間予測に関する複数のパラメータ群の値が所定の組合せである場合に、拡張ヘッダ情報の符号化を省略することを特徴とする。 (13) Another configuration of the image coding apparatus according to the present invention is as follows:
An image encoding apparatus that hierarchically encodes multiple layers of image data,
Basic header information encoding means for encoding encoding parameters common to the base layer and the non-base layer;
An extension header information encoding means for encoding encoding parameters for the non-base layer,
The extension header information encoding means omits encoding of the extension header information when the values of a plurality of parameter groups related to inter-layer prediction among the encoding parameters related to the non-base layer are a predetermined combination. To do.

本発明によれば、複数レイヤを対象とするスケーラブル符号化／復号において、単一レイヤ用の符号化／復号方式との後方互換性を確保し、特にレイヤ間予測に関するフラグやパラメータの伝送において、冗長度を低減し伝送符号量を削減することを可能にする。 According to the present invention, in scalable encoding / decoding for a plurality of layers, backward compatibility with an encoding / decoding scheme for a single layer is ensured, particularly in transmission of flags and parameters related to inter-layer prediction. It is possible to reduce redundancy and reduce the amount of transmission code.

本発明に係る画像伝送システムの構成を示す概略図である。It is the schematic which shows the structure of the image transmission system which concerns on this invention. 本発明に係る符号化ストリームのデータの階層構造を示す図である。It is a figure which shows the hierarchical structure of the data of the encoding stream which concerns on this invention. 参照ピクチャリストの一例を示す概念図である。It is a conceptual diagram which shows an example of a reference picture list. 参照ピクチャの例を示す概念図である。It is a conceptual diagram which shows the example of a reference picture. 本発明の一実施形態に係る画像復号装置の構成を示す概略図である。It is the schematic which shows the structure of the image decoding apparatus which concerns on one Embodiment of this invention. 本発明の別の実施形態に係る画像復号装置の構成を示す概略図である。It is the schematic which shows the structure of the image decoding apparatus which concerns on another embodiment of this invention. 本発明のさらに別の実施形態に係る画像復号装置の構成を示す概略図である。It is the schematic which shows the structure of the image decoding apparatus which concerns on another embodiment of this invention. 本実施形態に係る画像データ復号部の構成を示す概略図である。It is the schematic which shows the structure of the image data decoding part which concerns on this embodiment. 本発明の実施形態に係る画像符号化装置の構成を示す概略図である。It is the schematic which shows the structure of the image coding apparatus which concerns on embodiment of this invention. 本実施形態に係る画像データ符号化部の構成を示す概略図である。It is the schematic which shows the structure of the image data encoding part which concerns on this embodiment. 本発明に係る拡張ビデオパラメータセットのシンタックステーブルの一例である。It is an example of the syntax table of the extended video parameter set which concerns on this invention. 本発明に係るスライスセグメントヘッダのシンタックステーブルの一例である。It is an example of the syntax table of the slice segment header which concerns on this invention. 本発明の実施形態に係る実依存レイヤ復号部による実依存レイヤ情報の導出手順の一例である。It is an example of the derivation | leading-out procedure of the real dependence layer information by the real dependence layer decoding part which concerns on embodiment of this invention. 本発明に係るスライスセグメントヘッダ拡張のシンタックステーブルの一例である。It is an example of the syntax table of the slice segment header extension according to the present invention. 比較例におけるレイヤ依存タイプと、サンプル依存フラグ、動き依存フラグの関係を説明する図である。It is a figure explaining the relationship between the layer dependence type in a comparative example, a sample dependence flag, and a motion dependence flag. 本発明の実施形態に係るレイヤ依存タイプと、サンプル依存フラグ、動き依存フラグの関係を説明する図である。It is a figure explaining the relationship between the layer dependence type which concerns on embodiment of this invention, a sample dependence flag, and a motion dependence flag. 比較例におけるレイヤ依存タイプと、サンプル依存フラグ、動き依存フラグ、参照ピクチャセットの関係を説明する図である。It is a figure explaining the relationship between the layer dependence type in a comparative example, a sample dependence flag, a motion dependence flag, and a reference picture set. 比較例における実参照レイヤピクチャ数、参照レイヤ識別子、レイヤ間参照ピクチャセット、総参照ピクチャ数の導出手順の一例である。It is an example of a procedure for deriving the actual reference layer picture number, the reference layer identifier, the inter-layer reference picture set, and the total reference picture number in the comparative example. 比較例におけるレイヤ間依存の関係を説明する概念図である。It is a conceptual diagram explaining the relationship of dependency between layers in a comparative example. 本発明の第１の実施形態に係るレイヤ依存タイプと、サンプル依存フラグ、動き依存フラグ、参照ピクチャセットの関係を説明する図である。It is a figure explaining the relationship between the layer dependence type which concerns on the 1st Embodiment of this invention, a sample dependence flag, a motion dependence flag, and a reference picture set. 本発明の第１の実施形態に係る実参照レイヤピクチャ数、参照レイヤ識別子、レイヤ間参照ピクチャセット、総参照ピクチャ数の導出手順の一例である。It is an example of a procedure for deriving the actual reference layer picture number, the reference layer identifier, the inter-layer reference picture set, and the total reference picture number according to the first embodiment of the present invention. 本発明の第１の実施形態に係るレイヤ間依存の関係を説明する概念図である。It is a conceptual diagram explaining the relationship of dependency between layers which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係る参照ピクチャリストの導出手順の一例である。It is an example of the derivation | leading-out procedure of the reference picture list which concerns on the 1st Embodiment of this invention. 比較例におけるレイヤ依存タイプと、代替コロケートピクチャの関係を説明する図である。It is a figure explaining the relationship between the layer dependence type in a comparative example, and an alternative collocated picture. 比較例におけるレイヤ間依存の関係を説明する概念図である。It is a conceptual diagram explaining the relationship of dependency between layers in a comparative example. 本発明の第１の実施形態に係るレイヤ依存タイプと、代替コロケートピクチャの関係を説明する図である。It is a figure explaining the relationship between the layer dependence type which concerns on the 1st Embodiment of this invention, and an alternative collocated picture. 本発明の第１の実施形態に係るレイヤ間依存の関係を説明する概念図である。It is a conceptual diagram explaining the relationship of dependency between layers which concerns on the 1st Embodiment of this invention.

（第１の実施形態）
以下、図面を参照しながら本発明の実施形態について説明する。 (First embodiment)
Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図１は、本実施形態に係る画像伝送システム１の構成を示す概略図である。 FIG. 1 is a schematic diagram illustrating a configuration of an image transmission system 1 according to the present embodiment.

画像伝送システム１は、複数のレイヤ画像を符号化した符号を伝送し、伝送された符号を復号した画像を表示するシステムである。画像伝送システム１は、画像符号化装置１１、ネットワーク２１、画像復号装置３１及び画像表示装置４１を含んで構成される。 The image transmission system 1 is a system that transmits a code obtained by encoding a plurality of layer images and displays an image obtained by decoding the transmitted code. The image transmission system 1 includes an image encoding device 11, a network 21, an image decoding device 31, and an image display device 41.

画像符号化装置１１には、複数のレイヤ画像（テクスチャ画像ともいう）を示す信号Tが入力される。レイヤ画像とは、ある解像度及びある視点で視認もしくは撮影される画像である。複数のレイヤ画像を用いて３次元画像を符号化するビュースケーラブル符号化を行う場合、複数のレイヤ画像のそれぞれは、視点画像と呼ばれる。ここで、視点は撮影装置の位置又は観測点に相当する。例えば、複数の視点画像は、被写体に向かって左右の撮影装置のそれぞれが撮影した画像である。画像符号化装置１１は、この信号のそれぞれを符号化して符号化ストリームTe（符号化データ）を生成する。符号化ストリームTeの詳細については、後述する。視点画像とは、ある視点において観測される２次元画像（平面画像）である。視点画像は、例えば２次元平面内に配置された画素毎の輝度値、又は色信号値で示される。以下では、１枚の視点画像又は、その視点画像を示す信号をピクチャ（picture）と呼ぶ。また、複数のレイヤ画像を用いて空間スケーラブル符号化を行う場合、その複数のレイヤ画像は、解像度の低いベースレイヤ画像と、解像度の高い拡張レイヤ画像からなる。複数のレイヤ画像を用いてＳＮＲスケーラブル符号化を行う場合、その複数のレイヤ画像は、画質の低いベースレイヤ画像と、画質の高い拡張レイヤ画像からなる。なお、ビュースケーラブル符号化、空間スケーラブル符号化、ＳＮＲスケーラブル符号化を任意に組み合わせて行っても良い。本実施形態では、複数のレイヤ画像として、少なくともベースレイヤ画像と、ベースレイヤ画像以外の画像（拡張レイヤ画像）を含む画像の符号化および復号を扱う。複数のレイヤのうち、画像もしくは符号化パラメータにおいて参照関係（依存関係）にある２つのレイヤについて、参照される側の画像を、第１レイヤ画像、参照する側の画像を第２レイヤ画像と呼ぶ。例えば、ベースレイヤを参照して符号化される（ベースレイヤ以外の）拡張画像がある場合、ベースレイヤ画像を第１レイヤ画像、拡張画像を第２レイヤ画像として扱う。なお、拡張レイヤ画像の例としては、ベースビュー以外の視点画像、デプス画像、高解像度画像などがある。 A signal T indicating a plurality of layer images (also referred to as texture images) is input to the image encoding device 11. A layer image is an image that is viewed or photographed at a certain resolution and a certain viewpoint. When performing view scalable coding in which a three-dimensional image is coded using a plurality of layer images, each of the plurality of layer images is referred to as a viewpoint image. Here, the viewpoint corresponds to the position or observation point of the photographing apparatus. For example, the plurality of viewpoint images are images taken by the left and right photographing devices toward the subject. The image encoding device 11 encodes each of these signals to generate an encoded stream Te (encoded data). Details of the encoded stream Te will be described later. A viewpoint image is a two-dimensional image (planar image) observed at a certain viewpoint. The viewpoint image is indicated by, for example, a luminance value or a color signal value for each pixel arranged in a two-dimensional plane. Hereinafter, one viewpoint image or a signal indicating the viewpoint image is referred to as a picture. In addition, when performing spatial scalable coding using a plurality of layer images, the plurality of layer images include a base layer image having a low resolution and an enhancement layer image having a high resolution. When SNR scalable encoding is performed using a plurality of layer images, the plurality of layer images are composed of a base layer image with low image quality and an extended layer image with high image quality. Note that view scalable coding, spatial scalable coding, and SNR scalable coding may be arbitrarily combined. In the present embodiment, encoding and decoding of an image including at least a base layer image and an image other than the base layer image (enhancement layer image) is handled as the plurality of layer images. Of the multiple layers, for two layers that have a reference relationship (dependency relationship) in the image or encoding parameter, the image on the reference side is referred to as a first layer image, and the image on the reference side is referred to as a second layer image. . For example, when there is an extended image (other than the base layer) encoded with reference to the base layer, the base layer image is handled as a first layer image and the extended image is handled as a second layer image. Examples of enhancement layer images include viewpoint images other than the base view, depth images, and high resolution images.

ネットワーク２１は、画像符号化装置１１が生成した符号化ストリームTeを画像復号装置３１に伝送する。ネットワーク２１は、インターネット（internet）、広域ネットワーク（WAN: Wide Area Network）、小規模ネットワーク（LAN: Local Area Network）又はこれらの組み合わせである。ネットワーク２１は、必ずしも双方向の通信網に限らず、地上波ディジタル放送、衛星放送等の放送波を伝送する一方向又は双方向の通信網であっても良い。また、ネットワーク２１は、ＤＶＤ（Digital Versatile Disc）、ＢＤ（Blue-ray Disc）等の符号化ストリームTeを記録した記憶媒体で代替されても良い。 The network 21 transmits the encoded stream Te generated by the image encoding device 11 to the image decoding device 31. The network 21 is the Internet, a wide area network (WAN), a small network (LAN) or a combination thereof. The network 21 is not necessarily limited to a bidirectional communication network, and may be a unidirectional or bidirectional communication network that transmits broadcast waves such as terrestrial digital broadcasting and satellite broadcasting. The network 21 may be replaced with a storage medium that records an encoded stream Te such as a DVD (Digital Versatile Disc) or a BD (Blue-ray Disc).

画像復号装置３１は、ネットワーク２１が伝送した符号化ストリームTeのそれぞれを復号し、それぞれ復号した複数の復号レイヤ画像Td（復号視点画像Td）を生成する。 The image decoding device 31 decodes each of the encoded streams Te transmitted by the network 21, and generates a plurality of decoded layer images Td (decoded viewpoint images Td) respectively decoded.

画像表示装置４１は、画像復号装置３１が生成した複数の復号レイヤ画像Tdの全部又は一部を表示する。例えば、ビュースケーラブル符号化においては、全部の場合、３次元画像（立体画像）や自由視点画像が表示され、一部の場合、２次元画像が表示される。画像表示装置４１は、例えば、液晶ディスプレイ、有機ＥＬ（Electro-luminescence）ディスプレイ等の表示デバイスを備える。また、空間スケーラブル符号化、ＳＮＲスケーラブル符号化では、画像復号装置３１、画像表示装置４１が高い処理能力を有する場合には、画質の高い拡張レイヤ画像を表示し、より低い処理能力しか有しない場合には、拡張レイヤほど高い処理能力、表示能力を必要としないベースレイヤ画像を表示する。 The image display device 41 displays all or part of the plurality of decoded layer images Td generated by the image decoding device 31. For example, in view scalable coding, a 3D image (stereoscopic image) and a free viewpoint image are displayed in all cases, and a 2D image is displayed in some cases. The image display device 41 includes, for example, a display device such as a liquid crystal display or an organic EL (Electro-luminescence) display. In addition, in the spatial scalable coding and SNR scalable coding, when the image decoding device 31 and the image display device 41 have a high processing capability, a high-quality enhancement layer image is displayed and only a lower processing capability is provided. Displays a base layer image that does not require higher processing capability and display capability as an extension layer.

＜符号化ストリームTeの構造＞
本実施形態に係る画像符号化装置１１および画像復号装置３１の詳細な説明に先立って、画像符号化装置１１によって生成され、画像復号装置３１によって復号される符号化ストリームTeのデータ構造について説明する。 <Structure of encoded stream Te>
Prior to detailed description of the image encoding device 11 and the image decoding device 31 according to the present embodiment, a data structure of an encoded stream Te generated by the image encoding device 11 and decoded by the image decoding device 31 will be described. .

図２は、符号化ストリームTeにおけるデータの階層構造を示す図である。符号化ストリームTeは、例示的に、シーケンス、およびシーケンスを構成する複数のピクチャを含む。図２の（ａ）〜（ｆ）は、それぞれ、シーケンスSEQを既定するシーケンスレイヤ、ピクチャPICTを規定するピクチャレイヤ、スライスSを規定するスライスレイヤ、スライスデータを規定するスライスデータレイヤ、スライスデータに含まれる符号化ツリーユニットを規定する符号化ツリーレイヤ、符号化ツリーに含まれる符号化単位（Coding Unit; CU）を規定する符号化ユニットレイヤを示す図である。これら各レイヤの符号化データは、データの種類別に、NAL（Network Abstraction Layer）ユニットと呼ばれるデータ構造を構成する。NALユニットのヘッダ部分に含まれるNALユニットタイプ（nal_unit_type）によって、対象のNALユニットを構成するデータの種類を示す。また、NALユニットヘッダにはレイヤ識別子（nuh_layer_id）が含まれる。レイヤ識別子は、スケーラブル符号化の対象となる複数のレイヤ画像（複数視点の画像など）を区別するための情報である。 FIG. 2 is a diagram illustrating a hierarchical structure of data in the encoded stream Te. The encoded stream Te illustratively includes a sequence and a plurality of pictures constituting the sequence. (A) to (f) of FIG. 2 respectively show a sequence layer that defines a sequence SEQ, a picture layer that defines a picture PICT, a slice layer that defines a slice S, a slice data layer that defines slice data, and slice data. It is a figure which shows the encoding tree layer which prescribes | regulates the encoding tree layer and coding unit (Coding Unit; CU) contained in a coding tree which prescribe | regulate the coding tree unit contained. The encoded data of each layer constitutes a data structure called a NAL (Network Abstraction Layer) unit for each data type. The NAL unit type (nal_unit_type) included in the header part of the NAL unit indicates the type of data constituting the target NAL unit. The NAL unit header includes a layer identifier (nuh_layer_id). The layer identifier is information for distinguishing a plurality of layer images (multi-viewpoint images and the like) that are targets of scalable encoding.

（シーケンスレイヤ）
シーケンスレイヤでは、処理対象のシーケンスSEQ（以下、対象シーケンスとも称する）を復号するために画像復号装置３１が参照するデータの集合が規定されている。シーケンスSEQは、図２の（ａ）に示すように、ビデオパラメータセットVPS（Video Parameter Set）、シーケンスパラメータセットSPS（Sequence Parameter Set）、ピクチャパラメータセットPPS（Picture Parameter Set）、ピクチャPICT、及び、付加拡張情報SEI（Supplemental Enhancement Information）を含んでいる。ここで＃の後に示される値はレイヤＩＤを示す。図２では、＃０と＃１すなわちレイヤ０とレイヤ１の符号化データが存在する例を示すが、レイヤの種類およびレイヤの数はこれに限らない。 (Sequence layer)
In the sequence layer, a set of data referred to by the image decoding device 31 in order to decode a sequence SEQ to be processed (hereinafter also referred to as a target sequence) is defined. As shown in FIG. 2A, the sequence SEQ includes a video parameter set VPS (Video Parameter Set), a sequence parameter set SPS (Sequence Parameter Set), a picture parameter set PPS (Picture Parameter Set), a picture PICT, It includes supplemental enhancement information (SEI). Here, the value indicated after # indicates the layer ID. FIG. 2 shows an example in which encoded data of # 0 and # 1, that is, layer 0 and layer 1, exists, but the type of layer and the number of layers are not limited to this.

ビデオパラメータセットVPSでは、複数のレイヤから構成されている動画像において、複数の動画像に共通する符号化パラメータの集合および動画像に含まれる複数のレイヤおよび個々のレイヤに関連する符号化パラメータの集合が規定されている。 In the video parameter set VPS, in a moving image composed of a plurality of layers, a set of encoding parameters common to the plurality of moving images, a plurality of layers included in the moving image, and encoding parameters related to the individual layers. A set is defined.

シーケンスパラメータセットSPSでは、対象シーケンスを復号するために画像復号装置３１が参照する符号化パラメータの集合が規定されている。例えば、対象シーケンス内のピクチャの幅や高さが規定される。 In the sequence parameter set SPS, a set of encoding parameters referred to by the image decoding device 31 in order to decode the target sequence is defined. For example, the width and height of the picture in the target sequence are defined.

ピクチャパラメータセットPPSでは、対象シーケンス内の各ピクチャを復号するために画像復号装置３１が参照する符号化パラメータの集合が規定されている。例えば、ピクチャの復号に用いられる量子化幅の基準値や、重み付き予測の適用を示すフラグが含まれる。なお、PPSは同一レイヤ内で複数存在してもよい。その場合、対象シーケンス内の各ピクチャから複数のPPSの何れを適用するかを選択する。 In the picture parameter set PPS, a set of encoding parameters referred to by the image decoding device 31 in order to decode each picture in the target sequence is defined. For example, a reference value of a quantization width used for picture decoding and a flag indicating application of weighted prediction are included. A plurality of PPSs may exist in the same layer. In this case, it is selected which of a plurality of PPSs is applied from each picture in the target sequence.

（ピクチャレイヤ）
ピクチャレイヤでは、処理対象のピクチャPICT（以下、対象ピクチャとも称する）を復号するために画像復号装置３１が参照するデータの集合が規定されている。ピクチャPICTは、図２の（ｂ）に示すように、スライスS0〜SNS-1を含んでいる（NSはピクチャPICTに含まれるスライスの総数）。 (Picture layer)
In the picture layer, a set of data referred to by the image decoding device 31 for decoding a picture PICT to be processed (hereinafter also referred to as a target picture) is defined. As shown in FIG. 2B, the picture PICT includes slices S0 to SNS-1 (NS is the total number of slices included in the picture PICT).

なお、以下、スライスS0〜SNS-1のそれぞれを区別する必要が無い場合、符号の添え字を省略して記述することがある。また、以下に説明する符号化ストリームTeに含まれるデータであって、添え字を付している他のデータについても同様である。 Hereinafter, when it is not necessary to distinguish each of the slices S0 to SNS-1, the subscripts may be omitted. The same applies to data included in an encoded stream Te described below and to which other subscripts are attached.

（スライスレイヤ）
スライスレイヤでは、処理対象のスライスS（対象スライスとも称する）を復号するために画像復号装置３１が参照するデータの集合が規定されている。スライスSは、図２の（ｃ）に示すように、スライスヘッダSH、および、スライスデータSDATAを含んでいる。 (Slice layer)
In the slice layer, a set of data referred to by the image decoding device 31 for decoding the slice S to be processed (also referred to as a target slice) is defined. As shown in FIG. 2C, the slice S includes a slice header SH and slice data SDATA.

スライスヘッダSHには、対象スライスの復号方法を決定するために画像復号装置３１が参照する符号化パラメータ群が含まれる。スライスタイプを指定するスライスタイプ指定情報（slice_type）は、スライスヘッダSHに含まれる符号化パラメータの一例である。 The slice header SH includes an encoding parameter group that is referred to by the image decoding device 31 in order to determine a decoding method of the target slice. Slice type designation information (slice_type) for designating a slice type is an example of an encoding parameter included in the slice header SH.

スライスタイプ指定情報により指定可能なスライスタイプとしては、（１）符号化の際にイントラ予測のみを用いるＩスライス、（２）符号化の際に単方向予測、または、イントラ予測を用いるＰスライス、（３）符号化の際に単方向予測、双方向予測、または、イントラ予測を用いるＢスライスなどが挙げられる。 As slice types that can be specified by the slice type specification information, (1) I slice using only intra prediction at the time of encoding, (2) P slice using unidirectional prediction or intra prediction at the time of encoding, (3) B-slice using unidirectional prediction, bidirectional prediction, or intra prediction at the time of encoding may be used.

また、スライスヘッダSHには、スライスタイプがＰスライスまたはＢスライスである時に、後述するテンポラル動き情報を指定するためのコロケート情報collocated_from_l0_flag，collocated_ref_idxが含まれる。 The slice header SH includes collocated information collocated_from_l0_flag and collocated_ref_idx for designating temporal motion information described later when the slice type is P slice or B slice.

なお、スライスヘッダSHは、上記シーケンスレイヤに含まれる、ピクチャパラメータセットPPSへの参照を示すＰＰＳ識別子（pic_parameter_set_id）を含んでいても良い。 Note that the slice header SH may include a PPS identifier (pic_parameter_set_id) indicating a reference to the picture parameter set PPS included in the sequence layer.

（スライスデータレイヤ）
スライスデータレイヤでは、処理対象のスライスデータSDATAを復号するために画像復号装置３１が参照するデータの集合が規定されている。スライスデータSDATAは、図２の（ｄ）に示すように、符号化ツリーユニット（CTU: Coding Tree Unit）を含んでいる。符号化ツリーユニットCTUは、スライスを構成する固定サイズ（例えば画素数６４×６４）の画像領域である。なお、符号化ツリーユニットCTUに対応する画像ブロックを、符号化ツリーブロック（CTB: Coding Tree Block）と称する。 (Slice data layer)
In the slice data layer, a set of data referred to by the image decoding device 31 for decoding the slice data SDATA to be processed is defined. The slice data SDATA includes a coding tree unit (CTU) as shown in (d) of FIG. The coding tree unit CTU is an image area having a fixed size (for example, 64 × 64 pixels) that constitutes a slice. An image block corresponding to the coding tree unit CTU is referred to as a coding tree block (CTB).

なお、スライスレイヤは、スライスをさらに分割したスライスセグメントレイヤ単位で扱ってもよく、その場合、前述のスライスヘッダSHはスライスセグメントヘッダと呼び、同様に、スライスデータSDATAはスライスセグメントデータと呼ぶ。以降の説明では、一部、簡単化のためスライスヘッダ、スライスデータと記述するが、それぞれスライスセグメントヘッダ、スライスセグメントデータに置き換え可能である。また、スライスセグメントヘッダ、スライスセグメントデータもそれぞれスライスヘッダ、スライスデータに置き換え可能である。 The slice layer may be handled in units of slice segment layers obtained by further dividing the slice. In this case, the aforementioned slice header SH is called a slice segment header, and similarly, the slice data SDATA is called slice segment data. In the following description, for the sake of simplicity, it will be described as a slice header and slice data, but can be replaced with a slice segment header and slice segment data, respectively. Also, the slice segment header and slice segment data can be replaced with the slice header and slice data, respectively.

（符号化ツリーレイヤ）
符号化ツリーレイヤは、図２の（ｅ）に示すように、処理対象の符号化ツリーユニットを復号するために画像復号装置３１が参照するデータの集合が規定されている。符号化ツリーユニットは、再帰的な４分木分割により分割される。再帰的な４分木分割により得られる木構造のことを符号化ツリー（coding tree）と称する。符号化ツリーユニットCTUは、分割フラグ（split_flag）を含み、split_flagが１の場合には、符号化ツリーユニットCTUはさらに４つのCTUに分割される。split_flagが０の場合には、符号化ツリーユニットCTUは４つの符号化ユニット（CU: Coding Unit）に分割される。符号化ユニットCUは符号化ツリーレイヤの末端ノードであり、このレイヤではこれ以上分割されない。符号化ユニットCUは、符号化／復号処理の基本的な単位となる。 (Encoding tree layer)
As shown in FIG. 2E, the coding tree layer defines a set of data that the image decoding device 31 refers to in order to decode the coding tree unit to be processed. The coding tree unit is divided by recursive quadtree division. A tree structure obtained by recursive quadtree partitioning is called a coding tree. The coding tree unit CTU includes a split flag (split_flag). When the split_flag is 1, the coding tree unit CTU is further divided into four CTUs. When split_flag is 0, the coding tree unit CTU is divided into four coding units (CU: Coding Unit). The coding unit CU is a terminal node of the coding tree layer and is not further divided in this layer. The encoding unit CU is a basic unit of encoding / decoding processing.

符号化ユニットCUのサイズは、符号化ツリーユニットCTUのサイズが６４×６４画素の場合には、６４×６４画素、３２×３２画素、１６×１６画素、および、８×８画素の何れかをとり得る。なお、符号化ユニットCUに対応する画像ブロックを、符号化ブロック（CB: Coding Block）と称する。 When the size of the encoding tree unit CTU is 64 × 64 pixels, the size of the encoding unit CU is 64 × 64 pixels, 32 × 32 pixels, 16 × 16 pixels, or 8 × 8 pixels. It can take. Note that an image block corresponding to the coding unit CU is referred to as a coding block (CB).

（符号化ユニットレイヤ）
符号化ユニットレイヤは、図２の（ｆ）に示すように、処理対象の符号化ユニットを復号するために画像復号装置３１が参照するデータの集合が規定されている。具体的には、符号化ユニットは、ＣＵヘッダCUH、１つ以上の予測ユニット（PU: Prediction Unit）で構成される予測ツリー、１つ以上の変換ユニット（TU: Transform Unit）で構成される変換ツリーを含んで構成される。 (Encoding unit layer)
As shown in (f) of FIG. 2, the encoding unit layer defines a set of data referred to by the image decoding device 31 in order to decode the processing target encoding unit. Specifically, the encoding unit is a prediction tree composed of a CU header CUH, one or more prediction units (PU: Prediction Unit), and a transform composed of one or more transform units (TU: Transform Unit). Consists of a tree.

ＣＵヘッダCUHでは、符号化ユニットが、イントラ予測を用いるユニットであるか、インター予測を用いるユニットであるかなどが規定される。 The CU header CUH defines whether the encoding unit is a unit that uses intra prediction or a unit that uses inter prediction.

予測ツリーでは、符号化ユニットCUが１または複数の予測ユニットPUに分割され、各予測ユニットの位置とサイズとが規定される。予測ツリーにおける分割の種類は、大まかにいえば、イントラ予測の場合と、インター予測の場合との２つがある。イントラ予測とは、同一ピクチャ内の予測であり、インター予測とは、互いに異なるピクチャ間（例えば、表示時刻間、レイヤ画像間）で行われる予測処理を指す。 In the prediction tree, the encoding unit CU is divided into one or a plurality of prediction units PU, and the position and size of each prediction unit are defined. Broadly speaking, there are two types of division in the prediction tree: intra prediction and inter prediction. Intra prediction is prediction within the same picture, and inter prediction refers to prediction processing performed between different pictures (for example, between display times and between layer images).

イントラ予測の場合、分割方法は、２Ｎ×２Ｎ（符号化ユニットと同一サイズ）と、Ｎ×Ｎとがある。 In the case of intra prediction, there are 2N × 2N (the same size as the encoding unit) and N × N division methods.

インター予測の場合、分割方法は、ＣＵヘッダCUHに含まれる分割モード（part_mode）により規定され、２Ｎ×２Ｎ（符号化ユニットと同一サイズ）、２Ｎ×Ｎ、２Ｎ×ｎＵ、２Ｎ×ｎＤ、Ｎ×２Ｎ、ｎＬ×２Ｎ、ｎＲ×２Ｎ、および、Ｎ×Ｎなどがある。なお、２Ｎ×ｎＵは、２Ｎ×２Ｎの符号化ユニットを上から順に２Ｎ×０．５Ｎと２Ｎ×１．５Ｎの２領域に分割することを示す。２Ｎ×ｎＤは、２Ｎ×２Ｎの符号化ユニットを上から順に２Ｎ×１．５Ｎと２Ｎ×０．５Ｎの２領域に分割することを示す。ｎＬ×２Ｎは、２Ｎ×２Ｎの符号化ユニットを左から順に０．５Ｎ×２Ｎと１．５Ｎ×２Ｎの２領域に分割することを示す。ｎＲ×２Ｎは、２Ｎ×２Ｎの符号化ユニットを左から順に１．５Ｎ×２Ｎと０．５Ｎ×２Ｎの２領域に分割することを示す。分割数は１、２、４のいずれかであるため、ＣＵに含まれるＰＵは１個から４個である。これらのＰＵを順にＰＵ０、ＰＵ１、ＰＵ２、ＰＵ３と表現する。 In the case of inter prediction, the division method is defined by the division mode (part_mode) included in the CU header CUH, 2N × 2N (the same size as the encoding unit), 2N × N, 2N × nU, 2N × nD, N × 2N, nL × 2N, nR × 2N, and N × N. Note that 2N × nU indicates that a 2N × 2N encoding unit is divided into two regions of 2N × 0.5N and 2N × 1.5N in order from the top. 2N × nD indicates that a 2N × 2N encoding unit is divided into two regions of 2N × 1.5N and 2N × 0.5N in order from the top. nL × 2N indicates that a 2N × 2N encoding unit is divided into two regions of 0.5N × 2N and 1.5N × 2N in order from the left. nR × 2N indicates that a 2N × 2N coding unit is divided into two regions of 1.5N × 2N and 0.5N × 2N in order from the left. Since the number of divisions is one of 1, 2, and 4, PUs included in the CU are 1 to 4. These PUs are expressed as PU0, PU1, PU2, and PU3 in order.

また、変換ツリーにおいては、符号化ユニットCUが１または複数の変換ユニットTUに分割され、各変換ユニットの位置とサイズとが規定される。変換ツリーにおける分割には、符号化ユニットと同一のサイズの領域を変換ユニットとして割り付けるものと、上述した符号化ツリーブロックの分割と同様、再帰的な４分木分割によるものがある。 In the conversion tree, the encoding unit CU is divided into one or a plurality of conversion units TU, and the position and size of each conversion unit are defined. There are two types of division in the conversion tree: one in which an area having the same size as the encoding unit is allocated as a conversion unit, and the other in division by recursive quadtree division, similar to the above-described division in the encoding tree block.

なお、予測ユニットPU、変換ユニットTUに対応する画像ブロックを、それぞれ予測ブロック（PB: Prediction Block）、変換ブロック（TB: Transform Block）と称する。 Note that the image blocks corresponding to the prediction unit PU and the transform unit TU are referred to as a prediction block (PB: Prediction Block) and a transform block (TB: Transform Block), respectively.

（予測パラメータ）
予測ユニットの予測画像は、予測ユニットに付随する予測パラメータによって導出される。予測パラメータには、イントラ予測の予測パラメータもしくはインター予測の予測パラメータがある。以下、インター予測の予測パラメータ（インター予測パラメータ）について説明する。インター予測パラメータは、予測リスト利用フラグpredFlagL0、predFlagL1と、参照ピクチャインデックスrefIdxL0、refIdxL1と、ベクトルmvL0、mvL1から構成される。予測リスト利用フラグpredFlagL0、predFlagL1は、各々Ｌ０リストRefPicList0、Ｌ１リストRefPicList1と呼ばれる参照ピクチャリストが用いられるか否かを示すフラグであり、値が１の場合に、各フラグに対応する参照ピクチャリストが用いられる。なお、本明細書中「ＸＸであるか否かを示すフラグ」と記す場合、値が１である時はＸＸである場合、値が０である時はＸＸではない場合とし、論理否定、論理積などでは１を真、０を偽と扱う（以下同様）。但し、実際の装置や方法では真値、偽値として他の値を用いてもよい。２つの参照ピクチャリストが用いられる場合、つまり、predFlagL0=1，predFlagL1=1の場合、その予測方法を双予測といい、１つの参照ピクチャリストを用いる場合、すなわち（predFlagL0，predFlagL1）＝（1,0）もしくは（predFlagL0，predFlagL1）＝（0,1）の場合、その予測方法を単予測という。なお、予測リスト利用フラグの情報は、後述のインター予測フラグinter_pred_idcで表現することもできる。 (Prediction parameter)
The prediction image of the prediction unit is derived by a prediction parameter associated with the prediction unit. The prediction parameters include a prediction parameter for intra prediction or a prediction parameter for inter prediction. Hereinafter, prediction parameters for inter prediction (inter prediction parameters) will be described. The inter prediction parameter includes prediction list use flags predFlagL0 and predFlagL1, reference picture indexes refIdxL0 and refIdxL1, and vectors mvL0 and mvL1. Prediction list use flags predFlagL0 and predFlagL1 are flags indicating whether or not reference picture lists called L0 list RefPicList0 and L1 list RefPicList1 are used. When the value is 1, the reference picture list corresponding to each flag is used. Used. In this specification, when “flag indicating whether or not XX” is described, it is assumed that the value is XX when the value is 1, and that it is not XX when the value is 0. In products and the like, 1 is treated as true and 0 is treated as false (the same applies hereinafter). However, other values may be used as a true value and a false value in an actual apparatus or method. When two reference picture lists are used, that is, when predFlagL0 = 1 and predFlagL1 = 1, the prediction method is called bi-prediction, and when one reference picture list is used, that is, (predFlagL0, predFlagL1) = (1, 0) or (predFlagL0, predFlagL1) = (0,1), the prediction method is called single prediction. Note that the prediction list use flag information can also be expressed by an inter prediction flag inter_pred_idc described later.

符号化データに含まれるインター予測パラメータを導出するためのシンタックス要素には、例えば、分割モードpart_mode、マージフラグmerge_flag、マージインデックスmerge_idx、インター予測フラグinter_pred_idc、参照ピクチャインデックスrefIdxLX、予測ベクトルインデックスmvp_LX_idx、差分ベクトルmvdLXがある。 Syntax elements for deriving inter prediction parameters included in the encoded data include, for example, a partition mode part_mode, a merge flag merge_flag, a merge index merge_idx, an inter prediction flag inter_pred_idc, a reference picture index refIdxLX, a prediction vector index mvp_LX_idx, and a difference There is a vector mvdLX.

（参照ピクチャリスト）
次に、参照ピクチャリストRefの一例について説明する。参照ピクチャリストとは、後述する参照ピクチャメモリ３０６（図８）に記憶される参照ピクチャからなる情報セットである。図３は、参照ピクチャリストの一例を示す概念図である。参照ピクチャリスト６０１において、左右に一列に配列された５個の矩形は、それぞれ参照ピクチャを示す。左端から右へ順に示されている符号Ｐ０、Ｐ１、Ｑ２、Ｐ３、Ｐ４は、それぞれの参照ピクチャを示す符号である。Ｐ１等のＰとは、あるレイヤＰを示し、そしてＱ２のＱとは、レイヤＰとは異なるレイヤＱを示す。Ｐ及びＱの添字は、ピクチャ順序番号POC（Picture Order Count）を示す。参照ピクチャリストは、スライスレイヤ単位で更新することのできる情報セットである。参照ピクチャ内のどの参照ピクチャを参照するかは、別途符号化ユニットレイヤ内の参照ピクチャインデックスrefIdxLXで示す。 (Reference picture list)
Next, an example of the reference picture list Ref will be described. The reference picture list is an information set including reference pictures stored in a reference picture memory 306 (FIG. 8) described later. FIG. 3 is a conceptual diagram illustrating an example of a reference picture list. In the reference picture list 601, five rectangles arranged in a line on the left and right indicate reference pictures, respectively. Reference symbols P0, P1, Q2, P3, and P4 shown in order from the left end to the right are symbols indicating respective reference pictures. P such as P1 indicates a certain layer P, and Q of Q2 indicates a layer Q different from the layer P. The subscripts P and Q indicate a picture order number POC (Picture Order Count). The reference picture list is an information set that can be updated in units of slice layers. Which reference picture in the reference picture is referred to is separately indicated by a reference picture index refIdxLX in the coding unit layer.

（参照ピクチャの例）
次に、ベクトルを導出する際に用いる参照ピクチャの例について説明する。図４は、参照ピクチャの例を示す概念図である。図４において、横軸は表示時刻を示し、縦軸はレイヤ（例えば視点の異なる動画像）を示す。図示した複数の長方形は、それぞれピクチャを示す。図示したピクチャのうち、下行のピクチャＰ２は復号対象のピクチャ（対象ピクチャ）である。対象ピクチャから上向きの矢印で示される参照ピクチャＱ２は対象ピクチャと同じ表示時刻であって視点が異なるピクチャである。対象ピクチャを基準とする変位予測においては、参照ピクチャＱ０が用いられる。対象ピクチャから左向きの矢印で示される参照ピクチャＰ０，Ｐ１は、対象ピクチャと同じ視点であって、過去のピクチャである。対象ピクチャから右向きの矢印で示される参照ピクチャＰ３，Ｐ４は、対象ピクチャと同じ視点であって、未来のピクチャである。対象ピクチャを基準とする動き予測においては、参照ピクチャＰ０，Ｐ１，Ｐ３，Ｐ４が用いられる。 (Reference picture example)
Next, an example of a reference picture used for deriving a vector will be described. FIG. 4 is a conceptual diagram illustrating an example of a reference picture. In FIG. 4, the horizontal axis indicates display time, and the vertical axis indicates layers (for example, moving images with different viewpoints). Each of the illustrated rectangles represents a picture. Of the illustrated pictures, the picture P2 in the lower row is a decoding target picture (target picture). A reference picture Q2 indicated by an upward arrow from the target picture is a picture that has the same display time as the target picture and a different viewpoint. In the displacement prediction based on the target picture, the reference picture Q0 is used. Reference pictures P0 and P1 indicated by left-pointing arrows from the target picture are past pictures at the same viewpoint as the target picture. Reference pictures P3 and P4 indicated by right-pointing arrows from the target picture are the same viewpoint as the target picture and are future pictures. In motion prediction based on the target picture, reference pictures P0, P1, P3, and P4 are used.

（インター予測フラグと予測リスト利用フラグ）
インター予測フラグと、予測リスト利用フラグpredFlagL0、predFlagL1の関係は以下のように相互に変換可能である。そのため、インター予測パラメータとしては、予測リスト利用フラグを用いても良いし、インター予測フラグを用いてもよい。また、以下、予測リスト利用フラグを用いた判定は、インター予測フラグに置き替えても可能である。逆に、インター予測フラグを用いた判定は、予測リスト利用フラグに置き替えても可能である。 (Inter prediction flag and prediction list usage flag)
The relationship between the inter prediction flag and the prediction list use flags predFlagL0 and predFlagL1 can be mutually converted as follows. Therefore, as an inter prediction parameter, a prediction list use flag may be used, or an inter prediction flag may be used. In addition, hereinafter, the determination using the prediction list use flag may be replaced with the inter prediction flag. Conversely, the determination using the inter prediction flag can be performed by replacing the prediction list use flag.

インター予測フラグ＝（predFlagL1 << 1）+ predFlagL0
predFlagL0 ＝インター予測フラグ & 1
predFlagL1 ＝インター予測フラグ >> 1
ここで、>>は右シフト、<<は左シフトである。 Inter prediction flag = (predFlagL1 << 1) + predFlagL0
predFlagL0 = Inter prediction flag & 1
predFlagL1 = Inter prediction flag >> 1
Here, >> is a right shift, and << is a left shift.

インター予測フラグinter_pred_idcは、参照ピクチャの種類および数を示すデータであり、Pred_L0、Pred_L1、Pred_Biの何れかの値をとる。Pred_L0、Pred_L1は、各々参照ピクチャリストRefPicList0、参照ピクチャリストRefPicList1に記憶された参照ピクチャが用いられることを示し、共に１枚の参照ピクチャを用いること（単予測）を示す。参照ピクチャリストRefPicList0、参照ピクチャリストRefPicList1を用いた予測を各々Ｌ０予測、Ｌ１予測と呼ぶ。Pred_Biは２枚の参照ピクチャを用いること（双予測）を示し、参照ピクチャリストRefPicList0と参照ピクチャリストRefPicList1で指定された参照ピクチャ２つを用いることを示す。予測ベクトルインデックスmvp_LX_idxは予測ベクトルを示すインデックスであり、参照ピクチャインデックスrefIdxLXは、参照ピクチャリストに記憶された参照ピクチャを示すインデックスである。なお、ＬＸは、Ｌ０予測とＬ１予測を区別しない場合に用いられる記述方法であり、ＬＸをＬ０、Ｌ１に置き換えることで参照ピクチャリストRefPicList0で指定される参照ピクチャに対するパラメータと、参照ピクチャリストRefPicList1で指定される参照ピクチャに対するパラメータを区別する。例えば、refIdxL0はＬ０予測に用いる参照ピクチャインデックス、refIdxL1はＬ１予測に用いる参照ピクチャインデックス、refIdx（またはrefIdxLX）は、refIdxL0とrefIdxL1を区別しない場合に用いられる表記である。 The inter prediction flag inter_pred_idc is data indicating the type and number of reference pictures, and takes any value of Pred_L0, Pred_L1, and Pred_Bi. Pred_L0 and Pred_L1 indicate that reference pictures stored in the reference picture list RefPicList0 and the reference picture list RefPicList1, respectively, are used, and both indicate that one reference picture is used (single prediction). Prediction using the reference picture list RefPicList0 and the reference picture list RefPicList1 is referred to as L0 prediction and L1 prediction, respectively. Pred_Bi indicates that two reference pictures are used (bi-prediction), and indicates that two reference pictures specified by the reference picture list RefPicList0 and the reference picture list RefPicList1 are used. The prediction vector index mvp_LX_idx is an index indicating a prediction vector, and the reference picture index refIdxLX is an index indicating a reference picture stored in the reference picture list. Note that LX is a description method used when L0 prediction and L1 prediction are not distinguished. By replacing LX with L0 and L1, parameters for the reference picture specified in the reference picture list RefPicList0 and the reference picture list RefPicList1 are used. Differentiate the parameters for the specified reference picture. For example, refIdxL0 is a reference picture index used for L0 prediction, refIdxL1 is a reference picture index used for L1 prediction, and refIdx (or refIdxLX) is a notation used when refIdxL0 and refIdxL1 are not distinguished.

（動きベクトルと変位ベクトル）
ベクトルmvLXには、動きベクトルと変位ベクトル（disparity vector、視差ベクトル）がある。動きベクトルとは、あるレイヤのある表示時刻でのピクチャにおけるブロックの位置と、同一レイヤ内の異なる表示時刻（例えば、隣接する離散時刻）でのピクチャにおける対応するブロックの位置との間の位置のずれを示すベクトルである。変位ベクトルとは、あるレイヤのある表示時刻でのピクチャにおけるブロックの位置と、同一の表示時刻における異なるレイヤのピクチャにおける対応するブロックの位置との間の位置のずれを示すベクトルである。異なるレイヤのピクチャとしては、異なる視点のピクチャである場合、もしくは、異なる解像度のピクチャである場合などがある。特に、異なる視点のピクチャに対応する変位ベクトルを視差ベクトルと呼ぶ。以降の説明では、動きベクトルと変位ベクトルを区別しない場合には、単にベクトルmvLXと呼ぶ。ベクトルmvLXに関する予測ベクトル、差分ベクトルを、それぞれ予測ベクトルmvpLX、差分ベクトルmvdLXと呼ぶ。ベクトルmvLXおよび差分ベクトルmvdLXが、動きベクトルであるか変位ベクトルであるかは、ベクトルに付随する参照ピクチャインデックスrefIdxLXを用いて区別される。 (Motion vector and displacement vector)
The vector mvLX includes a motion vector and a displacement vector (disparity vector). A motion vector is a position between a block position in a picture at a certain display time of a layer and a corresponding block position in a picture at a different display time (for example, adjacent discrete time) in the same layer. It is a vector indicating a deviation. The displacement vector is a vector indicating a positional shift between the position of a block in a picture at a certain display time of a certain layer and the position of a corresponding block in a picture of a different layer at the same display time. The pictures in different layers may be pictures from different viewpoints or pictures with different resolutions. In particular, a displacement vector corresponding to pictures of different viewpoints is called a disparity vector. In the following description, when a motion vector and a displacement vector are not distinguished, they are simply referred to as a vector mvLX. A prediction vector and a difference vector related to the vector mvLX are referred to as a prediction vector mvpLX and a difference vector mvdLX, respectively. Whether the vector mvLX and the difference vector mvdLX are motion vectors or displacement vectors is distinguished using a reference picture index refIdxLX attached to the vector.

（VPS拡張）
次に、レイヤ間予測におけるレイヤ間の参照関係を示すパラメータについて説明する。レイヤ間予測におけるレイヤ間の参照関係は、VPSを拡張したデータ集合であるVPS拡張（vps_extension）で指定する。図１１に、VPS拡張のデータ構造の例を示す。この中で、レイヤ依存フラグdirect_dependency_flag[i][j]およびレイヤ依存タイプdirect_dependency_type[i][j]で、レイヤ間の参照関係を示す。具体的には、レイヤ依存フラグdirect_dependency_flag[i][j]が1である場合、対象レイヤiは別のレイヤjを参照し、レイヤ依存フラグが0であれば対象レイヤiはレイヤjを参照しない。レイヤ依存フラグが1である場合にはさらに、レイヤ依存タイプdirect_dependency_type[i][j]で、対象レイヤiに対する参照レイヤjのレイヤ依存タイプを示す。レイヤ依存タイプは、サンプル予測のみ、動き予測のみ、またはその両方を指定することができる。direct_dependency_type[i][j]の値とレイヤ依存タイプの値との関係を以下に示す。 (VPS expansion)
Next, parameters indicating a reference relationship between layers in inter-layer prediction will be described. The reference relationship between layers in inter-layer prediction is specified by a VPS extension (vps_extension) that is a data set obtained by extending a VPS. FIG. 11 shows an example of the data structure of VPS extension. Among these, the layer dependence flag direct_dependency_flag [i] [j] and the layer dependence type direct_dependency_type [i] [j] indicate the reference relationship between layers. Specifically, when the layer dependence flag direct_dependency_flag [i] [j] is 1, the target layer i refers to another layer j, and when the layer dependence flag is 0, the target layer i does not refer to the layer j. . When the layer dependency flag is 1, the layer dependency type direct_dependency_type [i] [j] indicates the layer dependency type of the reference layer j with respect to the target layer i. The layer dependent type can specify sample prediction only, motion prediction only, or both. The relationship between the value of direct_dependency_type [i] [j] and the value of the layer dependency type is shown below.

direct_dependency_type[i][j] = 0 … サンプル予測および動き予測
direct_dependency_type[i][j] = 1 … 動き予測のみ
direct_dependency_type[i][j] = 2 … サンプル予測のみ
以上の符号から、後述するパラメータセット復号部（図５〜図７）において、レイヤ間予測に関するパラメータが導出される。 direct_dependency_type [i] [j] = 0… sample prediction and motion prediction
direct_dependency_type [i] [j] = 1… motion prediction only
direct_dependency_type [i] [j] = 2 ... Sample prediction only From the above codes, parameters related to inter-layer prediction are derived in a parameter set decoding unit (FIGS. 5 to 7) described later.

（RPS情報）
RPSに係るシンタックス値（以下、RPS情報）は以下を含む。
１．SPS短期RPS情報：SPSに含まれる短期参照ピクチャセット情報
２．SPS長期RP情報：SPSに含まれる長期参照ピクチャ情報
３．SH短期RPS情報：スライスヘッダに含まれる短期参照ピクチャセット情報
４．SH長期RP情報：スライスヘッダに含まれる長期参照ピクチャ情報
５．ILRP情報：レイヤ間参照ピクチャ情報
（１．SPS短期RPS情報）
SPS短期RPS情報は、SPSを参照する各ピクチャから利用され得る複数の短期RPSの情報を含む。なお、短期RPSは、対象ピクチャに対する相対的な位置（例えば対象ピクチャとのPOC差分）により指定される参照ピクチャ（短期参照ピクチャ）となり得るピクチャの集合である。 (RPS information)
Syntax values related to RPS (hereinafter referred to as RPS information) include the following.
1. 1. SPS short-term RPS information: short-term reference picture set information included in the SPS 2. SPS long-term RP information: long-term reference picture information included in SPS SH short-term RPS information: short-term reference picture set information included in the slice header SH long-term RP information: long-term reference picture information included in the slice header ILRP information: Inter-layer reference picture information (1. SPS short-term RPS information)
The SPS short-term RPS information includes information on a plurality of short-term RPSs that can be used from each picture referring to the SPS. Note that the short-term RPS is a set of pictures that can be a reference picture (short-term reference picture) specified by a relative position with respect to the target picture (for example, a POC difference from the target picture).

短期RPS情報には、対象ピクチャより表示順が早い短期参照ピクチャ数（num_negative_pics）、および、対象ピクチャより表示順が遅い短期参照ピクチャ数（num_positive_pics）が含まれる。なお、以下では、対象ピクチャより表示順が早い短期参照ピクチャを前方短期参照ピクチャ、対象ピクチャより表示順が遅い短期参照ピクチャを後方短期参照ピクチャと呼ぶ。 The short-term RPS information includes the number of short-term reference pictures (num_negative_pics) whose display order is earlier than that of the target picture and the number of short-term reference pictures (num_positive_pics) whose display order is later than that of the target picture. In the following, a short-term reference picture whose display order is earlier than the target picture is referred to as a front short-term reference picture, and a short-term reference picture whose display order is later than the target picture is referred to as a rear short-term reference picture.

また、短期RPS情報には、各前方短期参照ピクチャに対して、対象ピクチャに対するPOC差分の絶対値（delta_poc_s0_minus1[i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_s0_flag[i]）が含まれる。加えて、各後方短期参照ピクチャに対して、対象ピクチャに対するPOC差分の絶対値（delta_poc_s1_minus1[i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_s1_flag[i]）が含まれる。 The short-term RPS information includes, for each forward short-term reference picture, the absolute value of the POC difference with respect to the target picture (delta_poc_s0_minus1 [i]), and the presence / absence of use as a reference picture of the target picture (used_by_curr_pic_s0_flag [ i]). In addition, for each backward short-term reference picture, the absolute value of the POC difference with respect to the target picture (delta_poc_s1_minus1 [i]) and the possibility of being used as the reference picture of the target picture (used_by_curr_pic_s1_flag [i]) are included It is.

（２．SPS長期RP情報）
SPS長期RP情報は、SPSを参照する各ピクチャから利用され得る複数の長期参照ピクチャの情報を含む。なお、長期参照ピクチャとは、シーケンス内の絶対的な位置（例えばPOC）により指定されるピクチャである。 (2. SPS long-term RP information)
The SPS long-term RP information includes information on a plurality of long-term reference pictures that can be used from each picture that references the SPS. A long-term reference picture is a picture specified by an absolute position (for example, POC) in a sequence.

SPS長期RP情報には、SPSで伝送される長期参照ピクチャの有無を示す情報（long_term_ref_pics_present_flag）、SPSに含まれる長期参照ピクチャ数（num_long_term_ref_pics_sps）、および、各長期参照ピクチャの情報が含まれる。長期参照ピクチャの情報には、参照ピクチャのPOC（lt_ref_pic_poc_lsb_sps[i]）、および、対象ピクチャの参照ピクチャとして使用される可能性の有無（used_by_curr_pic_lt_sps_flag[i]）が含まれる。 The SPS long-term RP information includes information (long_term_ref_pics_present_flag) indicating the presence or absence of a long-term reference picture transmitted in the SPS, the number of long-term reference pictures included in the SPS (num_long_term_ref_pics_sps), and information on each long-term reference picture. The long-term reference picture information includes the POC (lt_ref_pic_poc_lsb_sps [i]) of the reference picture and the possibility of being used as the reference picture of the target picture (used_by_curr_pic_lt_sps_flag [i]).

なお、上記参照ピクチャのPOCは、参照ピクチャに関連付けられたPOCの値自体であってもよいし、POCのLSB（Least Significant Bit）、すなわち、POCを既定の２のべき乗の数で割った余りの値を用いてもよい。 The POC of the reference picture may be the POC value itself associated with the reference picture, or the LSB (Least Significant Bit) of the POC, that is, the remainder obtained by dividing the POC by a predetermined power of 2 The value of may be used.

（３．SH短期RPS情報）
SH短期RPS情報は、スライスヘッダを参照するピクチャから利用され得る単一の短期RPSの情報を含む。 (3. SH short-term RPS information)
The SH short-term RPS information includes information of a single short-term RPS that can be used from a picture that references a slice header.

（４．SH長期RP情報）
SH長期RP情報は、スライスヘッダを参照するピクチャから利用され得る長期参照ピクチャの情報を含む。 (4. SH long-term RP information)
The SH long-term RP information includes information on a long-term reference picture that can be used from a picture that references a slice header.

（５．ILRP情報）
ILRP情報は、レイヤ間予測で参照され得るレイヤ間参照ピクチャの情報である。ILRP情報は、レイヤ間予測有効フラグ（inter_layer_pred_enabled_flag）と、レイヤ間参照ピクチャ数（NumActiveRefLayerPics）、各レイヤ間参照ピクチャの属するレイヤを示すレイヤ識別子（inter_layer_pred_layer_idc[i]）を含んで構成される。 (5. ILRP information)
ILRP information is information on an inter-layer reference picture that can be referred to in inter-layer prediction. The ILRP information includes an inter-layer prediction valid flag (inter_layer_pred_enabled_flag), the number of inter-layer reference pictures (NumActiveRefLayerPics), and a layer identifier (inter_layer_pred_layer_idc [i]) indicating a layer to which each inter-layer reference picture belongs.

（比較例の構成）
本発明の一実施形態に係る画像復号装置３１の構成を説明する前に、比較例について説明する。図１５は、比較例にかかるレイヤ依存タイプと、サンプル依存フラグSamplePredEnableFlag、動き依存フラグSamplePredEnableFlagの関係を説明する図である。図１５（ａ）は、比較例では、レイヤ依存タイプが０以外（ここでは２の場合）に、サンプル依存フラグSamplePredEnableFlagと動き依存フラグSamplePredEnableFlagの両者が１であることを示し、図１５（ｂ）は、比較例でのサンプル依存フラグSamplePredEnableFlagと動き依存フラグSamplePredEnableFlagの導出処理を示す。 (Configuration of comparative example)
Before describing the configuration of the image decoding device 31 according to an embodiment of the present invention, a comparative example will be described. FIG. 15 is a diagram for explaining the relationship between the layer dependence type, the sample dependence flag SamplePredEnableFlag, and the motion dependence flag SamplePredEnableFlag according to the comparative example. FIG. 15A shows that, in the comparative example, the layer dependence type is other than 0 (here, 2), and both the sample dependence flag SamplePredEnableFlag and the motion dependence flag SamplePredEnableFlag are 1. FIG. These show the derivation | leading-out processing of the sample dependence flag SamplePredEnableFlag and the motion dependence flag SamplePredEnableFlag in a comparative example.

図１７は、比較例における参照ピクチャセットRPSに入れることができるピクチャのレイヤ依存タイプの状態を示すものである。図１７に示すように、比較例では、レイヤ依存タイプの状態（サンプル依存フラグ SamplePredEnableFlagおよび動き依存フラグMotionPredEnableFlag）に関わらず、参照ピクチャセットRPSに入れることができる。 FIG. 17 shows the state of the layer dependent type of pictures that can be included in the reference picture set RPS in the comparative example. As shown in FIG. 17, in the comparative example, the reference picture set RPS can be included regardless of the layer-dependent type state (sample-dependent flag SamplePredEnableFlag and motion-dependent flag MotionPredEnableFlag).

図１８は、比較例における参照ピクチャセットRPSの導出処理を示す図である。図１８（ａ）は、比較例における実依存レイヤ数NumActiveRefLayerPicsを導出する処理を示す。図１８（ｂ）は、比較例における参照ピクチャレイヤRefPicLayerIdを導出する処理を示す。図１８（ｃ）は、比較例におけるRPSの導出処理を示す。図１８（ｃ）に示すように、RefPicSetInterLayer[ i ]が長期予測参照"used for long-term reference"にマークされることにより、RPSに参照ピクチャが入れられる。図１８（ｃ）の処理では、図１８（ｂ）の処理で導出された参照ピクチャレイヤRefPicLayerId[ i ]に含まれるnuh_layer_idのレイヤが、参照ピクチャに入れられるが、参照ピクチャレイヤRefPicLayerId[ i ]は、レイヤ依存タイプに依存しない。従って、比較例では、図１７に示すように、レイヤ依存タイプに関わらず、参照ピクチャセットRPSに入れることができる。 FIG. 18 is a diagram illustrating a derivation process of the reference picture set RPS in the comparative example. FIG. 18A shows processing for deriving the actual dependent layer number NumActiveRefLayerPics in the comparative example. FIG. 18B shows processing for deriving the reference picture layer RefPicLayerId in the comparative example. FIG. 18C shows RPS derivation processing in the comparative example. As shown in FIG. 18 (c), RefPicSetInterLayer [i] is marked with a long-term prediction reference “used for long-term reference”, whereby a reference picture is put into the RPS. In the process of FIG. 18C, the layer of nuh_layer_id included in the reference picture layer RefPicLayerId [i] derived by the process of FIG. 18B is put in the reference picture, but the reference picture layer RefPicLayerId [i] Does not depend on the layer dependency type. Therefore, in the comparative example, as shown in FIG. 17, the reference picture set RPS can be included regardless of the layer-dependent type.

図１９は、比較例におけるＲＰＳの導出処理を説明する図である。実依存レイヤ数NumActiveRefLayerPicsは３、参照ピクチャレイヤRefPicLayerId[][]は、nuh_layer_id =0,2,3のレイヤが設定されていることが分かる。 FIG. 19 is a diagram illustrating RPS derivation processing in the comparative example. It can be seen that the actual dependent layer number NumActiveRefLayerPics is set to 3, and the reference picture layer RefPicLayerId [] [] is set to nuh_layer_id = 0,2,3.

図２４は、比較例における代替コロケートピクチャcolPicに用いることができるピクチャのレイヤ依存タイプの状態を示すものである。図２４（ａ）に示すように、比較例では、レイヤ依存タイプの状態として、サンプル依存フラグ SamplePredEnableFlagが１である場合にも、動き依存フラグMotionPredEnableFlagが１であれば、代替コロケートピクチャcolPicに用いることができる。図２４（ｂ）は、比較例における代替コロケートピクチャcolPicの導出処理に用いられる動き依存レイヤ（実動き依存レイヤ識別子ActiveMotionPredRefLayerId[]）と動き依存レイヤ数（実動き依存レイヤ数NumActiveMotionPredRefLayers））の導出処理を示す。比較例では、動き依存レイヤは、サンプル依存フラグ SamplePredEnableFlagによらずに設定される。 FIG. 24 shows a state of a layer dependent type of a picture that can be used for the alternative collocated picture colPic in the comparative example. As shown in FIG. 24A, in the comparative example, even when the sample dependence flag SamplePredEnableFlag is 1 as the layer dependence type state, if the motion dependence flag MotionPredEnableFlag is 1, it is used for the alternative collocated picture colPic. Can do. FIG. 24B shows a derivation process of a motion-dependent layer (actual motion-dependent layer identifier ActiveMotionPredRefLayerId []) and the number of motion-dependent layers (actual motion-dependent layer number NumActiveMotionPredRefLayers) used in the derivation process of the alternative collocated picture colPic in the comparative example. Indicates. In the comparative example, the motion dependent layer is set regardless of the sample dependent flag SamplePredEnableFlag.

図１５、図１７〜図１９、図２４〜図２５で示す比較例は、各々、本発明の実施形態の図１６、図２０〜図２２、図２６〜図２７に対応する。 Comparative examples shown in FIGS. 15, 17 to 19, and 24 to 25 correspond to FIGS. 16, 20 to 22, and 26 to 27, respectively, of the embodiment of the present invention.

（画像復号装置の構成）
次に、本発明の一実施形態に係る画像復号装置３１の構成について説明する。図５は、本実施形態に係る画像復号装置３１の構成を示す概略図である。画像復号装置３１は、画像データ復号部３００、ヘッダ復号部４００を含んで構成される。 (Configuration of image decoding device)
Next, the configuration of the image decoding device 31 according to an embodiment of the present invention will be described. FIG. 5 is a schematic diagram illustrating a configuration of the image decoding device 31 according to the present embodiment. The image decoding device 31 includes an image data decoding unit 300 and a header decoding unit 400.

画像データ復号部３００は、後述するヘッダ復号部４００から入力されるスライスレイヤ符号化パラメータに基づいて、入力される符号化ストリームTeから画像データを復号する。復号された画像データは、復号レイヤ画像Td（復号視点画像Td）として画像復号装置の外部へ出力される。画像データ復号部３００の詳細については後述する。 The image data decoding unit 300 decodes image data from the input encoded stream Te based on a slice layer encoding parameter input from the header decoding unit 400 described later. The decoded image data is output to the outside of the image decoding apparatus as a decoded layer image Td (decoded viewpoint image Td). Details of the image data decoding unit 300 will be described later.

ヘッダ復号部４００は、パラメータセット復号部４０１と、スライスセグメントヘッダ復号部４０２を含んで構成される。パラメータセット復号部４０１は、符号化ストリームTeに含まれるビデオパラメータセットVPS、シーケンスパラメータセットSPS、ピクチャパラメータセットPPS等の、スライスレイヤよりも上位のレイヤの符号化データを、所定のシンタックス規則に従って復号し、各レイヤでを復号する際に必要なパラメータを決定する。例えば、前述のVPS拡張を復号し、レイヤ依存フラグdirect_dependency_flag[i][j]およびレイヤ依存タイプdirect_dependency_type[i][j]などを導出する。 The header decoding unit 400 includes a parameter set decoding unit 401 and a slice segment header decoding unit 402. The parameter set decoding unit 401 converts encoded data of layers higher than the slice layer, such as a video parameter set VPS, a sequence parameter set SPS, and a picture parameter set PPS, included in the encoded stream Te, according to a predetermined syntax rule. Decode and determine parameters required for decoding at each layer. For example, the VPS extension described above is decoded to derive the layer dependency flag direct_dependency_flag [i] [j], the layer dependency type direct_dependency_type [i] [j], and the like.

パラメータセット復号部４０１は、内部で依存タイプ復号部を含む。 The parameter set decoding unit 401 includes a dependency type decoding unit internally.

図１６は、本発明の実施形態に係るレイヤ依存タイプと、サンプル依存フラグSamplePredEnableFlag、動き依存フラグSamplePredEnableFlagの関係を説明する図である。図１６（ａ）は、レイヤ依存タイプが０の場合に、サンプル依存フラグSamplePredEnableFlagと動き依存フラグSamplePredEnableFlagの両者が１であることを示す。 FIG. 16 is a diagram for explaining the relationship between the layer dependence type, the sample dependence flag SamplePredEnableFlag, and the motion dependence flag SamplePredEnableFlag according to the embodiment of the present invention. FIG. 16A shows that when the layer dependency type is 0, both the sample dependency flag SamplePredEnableFlag and the motion dependency flag SamplePredEnableFlag are 1.

図１６（ｂ）は、依存タイプ復号部におけるパラメータセット復号部４０１はサンプル依存フラグSamplePredEnableFlagと動き依存フラグSamplePredEnableFlagの導出処理を示す。 FIG. 16B shows the parameter set decoding unit 401 in the dependency type decoding unit derivation processing of the sample dependency flag SamplePredEnableFlag and the motion dependency flag SamplePredEnableFlag.

図１６（ｂ）に示すように、パラメータセット復号部４０１は、各対象レイヤi、依存レイヤjに対して、以下の式により、SamplePredEnableFlag[iNuhLid][j]、およびMotionPredEnableFlag[iNuhLid][j]を導出する。 As shown in FIG. 16B, the parameter set decoding unit 401 performs SamplePredEnableFlag [iNuhLid] [j] and MotionPredEnableFlag [iNuhLid] [j] for each target layer i and dependent layer j according to the following equations. Is derived.

SamplePredEnabledFlag[ iNuhLId ][ j ] =
( ( 3 - (direct_dependency_type[ i ][ j ] & 3)) & 1 )
MotionPredEnabledFlag[ iNuhLId ][ j ] =
( ( ( 3 - (direct_dependency_type[ i ][ j ] & 3)) & 2 ) >> 1 )
上記式において、記号&はビット積（論理積）、記号>>は右ビットシフトを表す。 SamplePredEnabledFlag [iNuhLId] [j] =
((3-(direct_dependency_type [i] [j] & 3)) & 1)
MotionPredEnabledFlag [iNuhLId] [j] =
((((3-(direct_dependency_type [i] [j] & 3)) & 2) >> 1)
In the above equation, the symbol & represents a bit product (logical product), and the symbol >> represents a right bit shift.

これにより、上述したdirect_dependency_type[i][j]の値とレイヤ依存タイプの値との関係に従って、SamplePredEnabledFlagとMotionPredEnabledFlagを導出することができる。 Thereby, SamplePredEnabledFlag and MotionPredEnabledFlag can be derived according to the relationship between the value of direct_dependency_type [i] [j] and the value of the layer dependence type.

上記の処理を含むパラメータセット復号部４０１によれば、direct_dependency_type[i][j]が０の場合に、サンプル予測の依存性有り、つまり対応するSamplePredEnabledFlag[iNuhLId][j]が１、かつ、動き予測の依存性有り、つまり対応するMotionPredEnabledFlag[iNuhLId][j]が１となるように、サンプル依存フラグSamplePredEnableFlagと動き依存フラグSamplePredEnableFlagを導出する。これにより、最も頻繁に用いられるサンプル予測および動き予測の依存性があるケースに対して、最も短いビット長で表現される０が割り当てられるため、direct_dependency_type[i][j]の符号量を低減する効果を奏する。 According to the parameter set decoding unit 401 including the above processing, when direct_dependency_type [i] [j] is 0, there is sample prediction dependency, that is, the corresponding SamplePredEnabledFlag [iNuhLId] [j] is 1 and the motion The sample dependency flag SamplePredEnableFlag and the motion dependency flag SamplePredEnableFlag are derived so that there is prediction dependency, that is, the corresponding MotionPredEnabledFlag [iNuhLId] [j] is 1. As a result, the code amount of direct_dependency_type [i] [j] is reduced because 0 represented by the shortest bit length is assigned to the case where there is dependency on the most frequently used sample prediction and motion prediction. There is an effect.

なお、direct_dependency_type[i][j]と3との論理積(&)をとってから、SamplePredEnabledFlagとMotionPredEnabledFlagを導出することによって、direct_dependency_type[i][j]の下位２ビットの値に応じてSamplePredEnabledFlagとMotionPredEnabledFlagを設定することができる。これにより、下位２ビットでは、サンプル依存、動き依存の依存性を表現し、下位２ビットより上のビットにおいて、サンプル依存、動き依存以外の依存性の有無を表現することができる。 Note that the SamplePredEnabledFlag and the MotionPredEnabledFlag are derived after taking the logical product (&) of the direct_dependency_type [i] [j] and 3, and the SamplePredEnabledFlag and the DirectPredependency_type [i] [j] MotionPredEnabledFlag can be set. As a result, the low-order 2 bits can express sample dependency and motion dependency, and the bits above the low-order 2 bits can express the presence or absence of dependencies other than sample dependency and motion dependency.

上記説明において、nuh_layer_id、iNuhLidはいずれも対象レイヤiに対応するレイヤ識別子であり、vps_extension()内で明示される場合はその値（layer_id_in_nuh[i]）を用い、そうでない場合は前述したNALユニットヘッダ内で指定されるレイヤ識別子nuh_layer_idをそのまま用いる。これら導出したパラメータは、スライスセグメントヘッダ復号部４０２へ出力する。
（スライスセグメントヘッダ復号部）
スライスセグメントヘッダ復号部４０２は、入力される符号化ストリームTeに含まれるスライスセグメントヘッダを、パラメータセット復号部４０１から入力される上位レイヤのパラメータに基づいて復号し、対象スライスセグメントの動画像データの復号に必要なスライスレイヤ符号化パラメータを導出する。スライスセグメントヘッダ復号部４０２は、実依存レイヤ復号部４０３、参照ピクチャセット導出部４０４、コロケートピクチャ復号部４０５をさらに含んで構成される。 In the above description, nuh_layer_id and iNuhLid are both layer identifiers corresponding to the target layer i. If specified in vps_extension (), use the value (layer_id_in_nuh [i]), otherwise use the NAL unit described above. The layer identifier nuh_layer_id specified in the header is used as it is. These derived parameters are output to the slice segment header decoding unit 402.
(Slice segment header decoding unit)
The slice segment header decoding unit 402 decodes the slice segment header included in the input encoded stream Te based on the parameters of the upper layer input from the parameter set decoding unit 401, and the moving image data of the target slice segment Deriving slice layer coding parameters necessary for decoding. The slice segment header decoding unit 402 further includes an actual dependency layer decoding unit 403, a reference picture set deriving unit 404, and a collocated picture decoding unit 405.

図２０は、本発明の実施形態に係る参照ピクチャセットRPSに入れることができるピクチャのレイヤ依存タイプの状態を示すものである。図２０に示すように、本発明の実施形態では、レイヤ依存タイプによる導出されるサンプル依存フラグ SamplePredEnableFlagが１のピクチャのみ、参照ピクチャセットRPSに入れることができる。 FIG. 20 shows the state of the layer dependent type of pictures that can be included in the reference picture set RPS according to the embodiment of the present invention. As shown in FIG. 20, in the embodiment of the present invention, only a picture having a sample dependency flag SamplePredEnableFlag derived from the layer dependency type can be included in the reference picture set RPS.

図２１は、本発明の実施形態に係る参照ピクチャセットRPSの導出処理を示す図である。図２１（ａ）は、本発明の実施形態の実依存レイヤ数NumActiveRefLayerPicsを導出する処理を示す。これは、図１６（ａ）の比較例と同じである。図２１（ｂ）は、本発明の実施形態のサンプル依存レイヤ（実サンプル依存レイヤ識別子ActiveSamplePredRefLayerId[]）および、サンプル依存レイヤ数（実サンプル依存レイヤ数NumActiveSamplePredRefLayers）の導出処理を示す。図２１（ｃ）は、本発明の実施形態のRPSの導出処理を示す。図２１（ｃ）に示すように、RefPicSetInterLayer[ i ]が長期予測参照"used for long-term reference"にマークされることにより、RPSに参照ピクチャが入れられる。図２１（ｃ）の処理では、図２１（ｂ）の処理で導出されたサンプル依存レイヤ（実サンプル依存レイヤ識別子ActiveSamplePredRefLayerId[]）に含まれるnuh_layer_idのレイヤが、参照ピクチャに入れられる。サンプル依存レイヤは、サンプル依存フラグ SamplePredEnableFlagが１のピクチャであることから、レイヤ依存タイプの状態に応じて、RPSに入れられるかが定まる。 FIG. 21 is a diagram showing a derivation process of the reference picture set RPS according to the embodiment of the present invention. FIG. 21A shows processing for deriving the actual dependent layer number NumActiveRefLayerPics according to the embodiment of the present invention. This is the same as the comparative example of FIG. FIG. 21B shows a derivation process of the sample dependent layer (actual sample dependent layer identifier ActiveSamplePredRefLayerId []) and the number of sample dependent layers (actual sample dependent layer number NumActiveSamplePredRefLayers) according to the embodiment of the present invention. FIG. 21C illustrates the RPS derivation process according to the embodiment of this invention. As shown in FIG. 21 (c), RefPicSetInterLayer [i] is marked with a long-term prediction reference “used for long-term reference”, whereby a reference picture is put into the RPS. In the process of FIG. 21C, the layer of nuh_layer_id included in the sample dependent layer (actual sample dependent layer identifier ActiveSamplePredRefLayerId []) derived in the process of FIG. Since the sample-dependent layer is a picture whose sample-dependent flag SamplePredEnableFlag is 1, it is determined whether the sample-dependent layer can be put into the RPS according to the state of the layer-dependent type.

図２２は、本発明の実施形態に係るＲＰＳの導出処理を説明する図である。サンプル依存レイヤ数（実サンプル依存レイヤ数NumActiveSamplePredRefLayers）は２、サンプル依存レイヤ（実サンプル依存レイヤ識別子ActiveSamplePredRefLayerId[]）は、nuh_layer_id =0,3のレイヤが設定されていることが分かる。比較例の図１９と異なり、サンプル依存ではなく、動き依存のみのレイヤであるnuh_layer_id =2のレイヤは、RPSに入れられない。 FIG. 22 is a diagram illustrating RPS derivation processing according to the embodiment of the present invention. It can be seen that the sample dependent layer number (actual sample dependent layer number NumActiveSamplePredRefLayers) is 2, and the sample dependent layer (actual sample dependent layer identifier ActiveSamplePredRefLayerId []) is nuh_layer_id = 0,3. Unlike FIG. 19 of the comparative example, a layer with nuh_layer_id = 2 that is not a sample-dependent layer but a motion-dependent layer is not included in the RPS.

図２３は、本発明の実施形態に係るレイヤ間予測限定フラグの導出処理を説明する図である。図２３（ａ）に示す用に、レイヤ間予測限定フラグinter_layer_sample_pred_only_flagが１である場合には、インター予測ＲＰＬ有効フラグInterRefEnabledInRPLFlagを０に設定する。図２３（ｂ）に示す用に、インター予測ＲＰＬ有効フラグInterRefEnabledInRPLFlagが１である場合にのみ、暫定参照ピクチャリストRefPicListTempX[]に、短期前方参照ピクチャセットRefPicSetStCurrBefore、短期後方参照ピクチャセットRefPicSetStCurrAfter[ i ]、および長期参照ピクチャセットRefPicSetLtCurr[ i ]の参照ピクチャリストが格納される。レイヤ間参照ピクチャリストRefPicSetInterLayerの参照ピクチャはインター予測ＲＰＬ有効フラグInterRefEnabledInRPLFlagによらずに格納される。暫定参照ピクチャリストRefPicListTempX[]から、参照ピクチャリストが構築される。 FIG. 23 is a diagram for explaining the inter-layer prediction limitation flag derivation process according to the embodiment of the present invention. As shown in FIG. 23A, when the inter-layer prediction only flag inter_layer_sample_pred_only_flag is 1, the inter prediction RPL valid flag InterRefEnabledInRPLFlag is set to 0. As shown in FIG. 23B, only when the inter prediction RPL valid flag InterRefEnabledInRPLFlag is 1, the temporary reference picture list RefPicListTempX [] includes a short-term forward reference picture set RefPicSetStCurrBefore, a short-term backward reference picture set RefPicSetStCurrAfter [i], In addition, a reference picture list of the long-term reference picture set RefPicSetLtCurr [i] is stored. The reference picture of the inter-layer reference picture list RefPicSetInterLayer is stored regardless of the inter prediction RPL enabled flag InterRefEnabledInRPLFlag. A reference picture list is constructed from the temporary reference picture list RefPicListTempX [].

図２６は、本発明の実施形態に係る代替コロケートピクチャに用いることができるピクチャのレイヤ依存タイプの状態を示すものである。図２６（ａ）に示すように、本発明の実施形態では、レイヤ依存タイプの状態として、サンプル依存フラグ SamplePredEnableFlagが１である場合には、動き依存フラグMotionPredEnableFlagが１であっても、代替コロケートピクチャに用いない。これは、サンプル依存フラグSamplePredEnableFlagが１の依存レイヤは、参照ピクチャセットRPSに格納される、参照ピクチャインデックスrefIdxを用いて、colPicを特定することができるためである。本実施形態では、RPSに格納される参照ピクチャは、参照ピクチャインデックスの識別子collocated_ref_idxを用いて特定し、RPSに格納されない参照ピクチャは、レイヤＩＤの識別子collocated_ref_layer_idxを用いて特定するため、colPicを特定するための情報に冗長性が生じない。図２６（ｂ）は、本発明の実施形態に係るコロケートピクチャcolPicの導出処理に用いられる動き依存限定レイヤ（実動き依存レイヤ識別子ActiveMotionPredRefLayerId[]）と動き依存限定レイヤ数（実動き依存レイヤ数NumActiveMotionPredRefLayers））の導出処理を示す。本実施形態では、動き依存限定レイヤは、図２４で示す比較例の動き依存レイヤと異なり、サンプル依存フラグ SamplePredEnableFlagが１の場合には設定されない。なお、図１６のレイヤ依存タイプの導出、図２０〜図２２の参照ピクチャセット、参照ピクチャリスト導出、図２３のレイヤ間予測限定、図２６の代替コロケートピクチャ導出の各構成は、各々独立に用いても良いし、同時に用いても良い。 FIG. 26 shows a state of a layer dependent type of a picture that can be used for an alternative collocated picture according to an embodiment of the present invention. As shown in FIG. 26A, in the embodiment of the present invention, when the sample dependent flag SamplePredEnableFlag is 1 as the layer dependent type state, even if the motion dependent flag MotionPredEnableFlag is 1, the alternative collocated picture Not used for. This is because a dependent layer whose sample dependency flag SamplePredEnableFlag is 1 can specify colPic using the reference picture index refIdx stored in the reference picture set RPS. In the present embodiment, the reference picture stored in the RPS is specified using the reference picture index identifier collocated_ref_idx, and the reference picture not stored in the RPS is specified using the layer ID identifier collocated_ref_layer_idx, so the colPic is specified. Therefore, there is no redundancy in the information. FIG. 26B shows a motion-dependent limited layer (actual motion-dependent layer identifier ActiveMotionPredRefLayerId []) and a motion-dependent limited layer number (actual motion-dependent layer number NumActiveMotionPredRefLayers) used in the derivation process of the collocated picture colPic according to the embodiment of the present invention. )) Derivation processing is shown. In the present embodiment, the motion dependence limited layer is not set when the sample dependence flag SamplePredEnableFlag is 1, unlike the motion dependence layer of the comparative example shown in FIG. The derivation of the layer-dependent type in FIG. 16, the reference picture set in FIGS. 20 to 22, the reference picture list derivation, the inter-layer prediction limitation in FIG. 23, and the alternative collocated picture derivation in FIG. 26 are used independently. You may use it simultaneously.

図１２に、スライスセグメントヘッダ復号部４０２が復号する、スライスセグメントヘッダのデータ構造の一例を示す。スライスセグメントヘッダ復号部４０２は、非ベースレイヤ（レイヤ識別子nuh_layer_id>0 であるレイヤ）に関して、依存レイヤが存在する、すなわち依存レイヤ数NumDirectRefLayers[nuh_layer_id]が0より大きい場合、対象スライスセグメントにおける実依存レイヤに関するシンタックスを次のように復号する。まず、レイヤ間予測の有無を示すレイヤ間依存フラグinter_layer_pred_enabled_flagを復号する。レイヤ間予測が有効すなわちinter_layer_pred_enabled_flag=1の場合、さらにnum_inter_layer_ref_pics_minus1によって、レイヤ間の参照ピクチャ数を復号する。ただし、前述の依存レイヤ数NumDirectRefLayers[nuh_layer_id]が1の場合や、VPS拡張内で指定されるmax_one_active_ref_layer_flagが1の場合、つまり実際の依存レイヤ数が1に限定される場合は、num_inter_layer_ref_pics_minus1は復号しない。 FIG. 12 shows an example of the data structure of the slice segment header decoded by the slice segment header decoding unit 402. The slice segment header decoding unit 402, when a dependent layer exists for a non-base layer (a layer having a layer identifier nuh_layer_id> 0), that is, when the number of dependent layers NumDirectRefLayers [nuh_layer_id] is greater than 0, the actual dependent layer in the target slice segment The syntax for is decoded as follows: First, the inter-layer dependency flag inter_layer_pred_enabled_flag indicating the presence / absence of inter-layer prediction is decoded. When inter-layer prediction is enabled, that is, when inter_layer_pred_enabled_flag = 1, the number of reference pictures between layers is further decoded by num_inter_layer_ref_pics_minus1. However, when the number of dependent layers NumDirectRefLayers [nuh_layer_id] is 1 or when max_one_active_ref_layer_flag specified in the VPS extension is 1, that is, when the actual number of dependent layers is limited to 1, num_inter_layer_ref_pics_minus1 is not decoded.

スライスセグメントヘッダ復号部４０２の備える実依存レイヤ復号部４０３は、図２１（ａ）に示す処理により、、対象スライスセグメント（対象ピクチャ）に関する実依存レイヤピクチャ数を示すパラメータNumActiveRefLayerPicsを導出する。具体的には、対象レイヤがベースレイヤ（nuh_layer_id==0）であるか、参照レイヤが存在しない（NumDirectRefLayers[nuh_layer_id]==0）場合、あるいはレイヤ間予測が有効でない（inter_layer_pred_enabled_flag==0）場合は、NumActiveRefLayerPics=0に設定する。また、参照レイヤ数が1に限定される場合（max_one_active_ref_layer=1）もしくは、依存レイヤ数NumDirectRefLayers[nuh_layer_id]が１の場合、NumActiveRefLayerPics=1に設定する。それ以外の場合には、符号化データからnum_inter_layer_ref_pics_minus1が復号し、NumActiveRefLayerPicsにnum_inter_layer_ref_pics_minus+1の値を設定する。実依存レイヤ復号部４０３は、さらに、導出された実依存レイヤピクチャ数に対応するレイヤ間予測の依存レイヤを示す、レイヤ間依存識別子inter_layer_pred_layer_idc[i]を復号する。ただし、対象レイヤがベースレイヤ（nuh_layer_id==0）である場合や、参照レイヤが存在しない（NumDirectRefLayers[nuh_layer_id]==0）場合、あるいはレイヤ間予測が有効でない（inter_layer_pred_enabled_flag==0）場合は、実依存レイヤピクチャ数が0になり、その場合はinter_layer_pred_layer_idc[i]は復号しない。 The actual dependency layer decoding unit 403 included in the slice segment header decoding unit 402 derives a parameter NumActiveRefLayerPics indicating the number of actual dependency layer pictures related to the target slice segment (target picture) by the process shown in FIG. Specifically, if the target layer is a base layer (nuh_layer_id == 0), there is no reference layer (NumDirectRefLayers [nuh_layer_id] == 0), or inter-layer prediction is not enabled (inter_layer_pred_enabled_flag == 0) Sets NumActiveRefLayerPics = 0. Also, when the number of reference layers is limited to 1 (max_one_active_ref_layer = 1) or when the number of dependent layers NumDirectRefLayers [nuh_layer_id] is 1, NumActiveRefLayerPics = 1 is set. In other cases, num_inter_layer_ref_pics_minus1 is decoded from the encoded data, and the value of num_inter_layer_ref_pics_minus + 1 is set in NumActiveRefLayerPics. The actual dependency layer decoding unit 403 further decodes an inter-layer dependency identifier inter_layer_pred_layer_idc [i] indicating a dependency layer of inter-layer prediction corresponding to the derived actual dependency layer picture number. However, if the target layer is a base layer (nuh_layer_id == 0), there are no reference layers (NumDirectRefLayers [nuh_layer_id] == 0), or inter-layer prediction is not enabled (inter_layer_pred_enabled_flag == 0) In this case, inter_layer_pred_layer_idc [i] is not decoded.

実依存レイヤ復号部４０３は、対象スライスセグメントにおけるレイヤ間予測に関する実依存レイヤ情報として、実依存レイヤピクチャごとのレイヤ識別情報である参照ピクチャレイヤ識別子RefPicLayerId[]、実参照レイヤピクチャの中で実際にサンプル予測および動き予測で参照するレイヤ数を示す、実サンプル依存レイヤ数NumActiveSamplePredRefLayersおよび実動き依存レイヤ数NumActiveMotionPredRefLayers、そして、サンプル予測と動き予測それぞれの実依存レイヤ数に対応する依存レイヤの識別情報である、実サンプル依存レイヤ識別子ActiveSamplePredRefLayerId[]および実動き依存レイヤ識別子ActiveMotionPredRefLayerId[]を導出する。これら各パラメータの導出手順を図２１（ｂ）および図１３に示す。 The actual dependency layer decoding unit 403 uses the reference picture layer identifier RefPicLayerId [], which is layer identification information for each actual dependency layer picture, as actual dependency layer information regarding inter-layer prediction in the target slice segment. NumActiveSamplePredRefLayers and actual motion dependent layer number NumActiveMotionPredRefLayers indicating the number of layers to be referenced in sample prediction and motion prediction, and identification information of the dependent layer corresponding to the actual dependent layer number of each of sample prediction and motion prediction The real sample dependent layer identifier ActiveSamplePredRefLayerId [] and the actual motion dependent layer identifier ActiveMotionPredRefLayerId [] are derived. The procedure for deriving these parameters is shown in FIG. 21 (b) and FIG.

実依存レイヤ復号部４０３は、図２１（ｂ）に示すように、０から実依存レイヤ数NumActiveRefLayerPicsの値をとる各実依存レイヤiに対し、以下の処理を行う。まず、実依存レイヤiから、依存レイヤ識別子inter_layer_pred_layer_idc[ i ]を特定する。 As shown in FIG. 21B, the actual dependency layer decoding unit 403 performs the following processing on each actual dependency layer i that takes a value of 0 from the actual dependency layer number NumActiveRefLayerPics. First, the dependent layer identifier inter_layer_pred_layer_idc [i] is specified from the actual dependent layer i.

具体的には、実依存レイヤ復号部４０３は、各実依存レイヤiに対応する、パラメータセット復号部４０１により導出された、サンプル依存フラグSamplePredEnabledFlag[][]を参照し、対象レイヤnuh_layer_idに対する参照レイヤinter_layer_pred_layer_idc[ i ]がサンプル依存を示す場合、つまり、SamplePredEnabledFlag[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]が１の場合に、そのサンプル依存のレイヤ（ここではレイヤＩＤ，RefLayerId[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]）を、以下の式により、実サンプル予測レイヤに加える。 Specifically, the actual dependency layer decoding unit 403 refers to the sample dependency flag SamplePredEnabledFlag [] [] derived by the parameter set decoding unit 401 corresponding to each actual dependency layer i, and refers to the reference layer for the target layer nuh_layer_id If inter_layer_pred_layer_idc [i] indicates sample dependency, that is, SamplePredEnabledFlag [nuh_layer_id] [inter_layer_pred_layer_idc [i]] is 1, the sample dependent layer (here, layer ID, RefLayerId [nuh_layer_id] [inter_layer_pred_layer_idc [i] ) Is added to the actual sample prediction layer according to the following equation:

ActiveSamplePredRefLayerId[ k++ ] = RefLayerId[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]
実サンプル依存レイヤ数NumActiveSamplePredRefLayersのカウントkを１だけインクリメントする。 ActiveSamplePredRefLayerId [k ++] = RefLayerId [nuh_layer_id] [inter_layer_pred_layer_idc [i]]
The count k of the actual sample dependent layer number NumActiveSamplePredRefLayers is incremented by one.

全ての実依存レイヤiの処理が終了した時点のカウントｋとして、実サンプル依存レイヤ数NumActiveSamplePredRefLayersが求められる。 The actual sample dependent layer number NumActiveSamplePredRefLayers is obtained as the count k at the time when all the actual dependent layers i have been processed.

上記処理により、レイヤ依存タイプが、サンプル依存を示す依存レイヤのみを含む実サンプル依存レイヤActiveSamplePredRefLayerId[]が得られる。 As a result of the above processing, an actual sample dependent layer ActiveSamplePredRefLayerId [] is obtained that includes only a dependency layer whose layer dependency type indicates sample dependency.

ActiveSamplePredRefLayerId[]を、サンプル予測に関する実依存レイヤ情報と呼ぶ。 ActiveSamplePredRefLayerId [] is called actual dependency layer information related to sample prediction.

実依存レイヤ復号部４０３は、図２６（ｂ）に示すように、０から実依存レイヤ数NumActiveRefLayerPicsの値をとる各実依存レイヤiに対し、以下の処理を行う。まず、実依存レイヤiから、依存レイヤ識別子inter_layer_pred_layer_idc[ i ]を特定する。
パラメータセット復号部４０１により導出された、サンプル依存フラグSamplePredEnabledFlag[][]を参照し、対象レイヤnuh_layer_idに対する参照レイヤinter_layer_pred_layer_idc[ i ]が動き限定依存を示す場合、つまり、SamplePredEnabledFlag[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]が０でMotionPredEnabledFlag[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]が１の場合に、その動き限定依存のレイヤ（ここではレイヤＩＤ，RefLayerId[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]）を、以下の式により、実動き予測レイヤに加える。 As shown in FIG. 26B, the actual dependency layer decoding unit 403 performs the following processing for each actual dependency layer i that takes a value of 0 from the actual dependency layer number NumActiveRefLayerPics. First, the dependent layer identifier inter_layer_pred_layer_idc [i] is specified from the actual dependent layer i.
When the sample dependency flag SamplePredEnabledFlag [] [] derived by the parameter set decoding unit 401 is referred to and the reference layer inter_layer_pred_layer_idc [i] with respect to the target layer nuh_layer_id indicates motion limited dependency, that is, SamplePredEnabledFlag [nuh_layer_id] [inter_layer_pred_layer_idc [i ]] Is 0 and MotionPredEnabledFlag [nuh_layer_id] [inter_layer_pred_layer_idc [i]] is 1, the motion-dependent layer (here, layer ID, RefLayerId [nuh_layer_id] [inter_layer_pred_layer_idc [i]]) is To the actual motion prediction layer.

ActiveMotionPredRefLayerId[ k++ ] = RefLayerId[ nuh_layer_id ][ inter_layer_pred_layer_idc[ i ] ]
実動き依存レイヤ数NumActiveMotionPredRefLayersのカウントjを１だけインクリメントする。 ActiveMotionPredRefLayerId [k ++] = RefLayerId [nuh_layer_id] [inter_layer_pred_layer_idc [i]]
The count j of the actual motion dependent layer number NumActiveMotionPredRefLayers is incremented by one.

全ての実依存レイヤiの処理が終了した時点のカウントjとして、実動き依存レイヤ数NumActiveMotionPredRefLayersが求められる。 The actual motion dependent layer number NumActiveMotionPredRefLayers is obtained as the count j when all the actual dependent layers i have been processed.

上記処理により、レイヤ依存タイプが、動き限定依存を示す依存レイヤのみを含む実動き依存レイヤActiveMotionPredRefLayerId[]が得られる。 With the above processing, an actual motion dependent layer ActiveMotionPredRefLayerId [] including only a dependent layer whose layer dependence type indicates motion limited dependence is obtained.

上記の処理を含む実依存レイヤ復号部４０３によれば、依存レイヤに対応するサンプル依存フラグSamplePredEnabledFlag[][]が０の場合にのみ、実動き依存レイヤ識別子ActiveMotionPredRefLayerIdを設定する。このような構成により、後述するように、実動き依存レイヤ識別子ActiveMotionPredRefLayerIdに設定されたレイヤのみが、代替コロケートピクチャcolPicの導出処理に用いられ、サンプル依存が有効なレイヤに関しては、代替コロケートピクチャの導出処理を省略できるという効果を奏する。 The actual dependency layer decoding unit 403 including the above processing sets the actual motion dependency layer identifier ActiveMotionPredRefLayerId only when the sample dependency flag SamplePredEnabledFlag [] [] corresponding to the dependency layer is 0. With this configuration, as will be described later, only the layer set in the actual motion dependent layer identifier ActiveMotionPredRefLayerId is used for the derivation process of the alternative collocated picture colPic. There is an effect that the processing can be omitted.

実依存レイヤ復号部４０３は、対象ピクチャに関してサンプル依存レイヤがある場合、すなわち実サンプル依存レイヤ数NumActiveSamplePredRefLayersが0より大きい場合は、さらに、レイヤ間予測をサンプル予測に限定するか否かを示す、レイヤ間サンプル予測限定フラグinter_layer_sample_pred_only_flagを復号する。レイヤ間サンプル予測限定フラグが1の場合は、対象ピクチャの復号時に同一画像レイヤ内でのフレーム間予測が用いられず、同フラグが0であれば、同一画像レイヤ内でのフレーム間予測が用いられる。 If there is a sample dependent layer for the target picture, that is, if the actual sample dependent layer number NumActiveSamplePredRefLayers is greater than 0, the actual dependent layer decoding unit 403 further indicates whether or not to limit inter-layer prediction to sample prediction. The inter-sample prediction only flag inter_layer_sample_pred_only_flag is decoded. When the inter-layer sample prediction only flag is 1, inter-frame prediction within the same image layer is not used when the target picture is decoded, and when the flag is 0, inter-frame prediction within the same image layer is used. It is done.

参照ピクチャセット導出部４０４は、導出された実依存レイヤ情報に基づいて、対象ピクチャの復号処理に用いる参照ピクチャセットRPSを導出する。 The reference picture set deriving unit 404 derives a reference picture set RPS used for the decoding process of the current picture based on the derived actual dependency layer information.

参照ピクチャセット導出部４０４の行うRPS導出処理の詳細を説明する。 Details of the RPS deriving process performed by the reference picture set deriving unit 404 will be described.

RPS導出処理は複数のRPSのサブセットを導出処理に分けられる。RPSのサブセットは以下のように定義される。 The RPS derivation process can divide a plurality of RPS subsets into derivation processes. A subset of RPS is defined as follows:

RPSは参照可能ピクチャの種類に応じて次の２つのサブセットに分けられる。
・現ピクチャ参照ピクチャセットListCurr：対象ピクチャにおける参照可能ピクチャのリスト
・後続ピクチャ参照ピクチャセットListFoll：対象ピクチャでは参照されないが、対象ピクチャに復号順で後続のピクチャで参照可能なピクチャのリスト
なお、現ピクチャ参照ピクチャセットに含まれるピクチャの数を、現ピクチャ参照可能ピクチャ数NumCurrListと呼ぶ。 The RPS is divided into the following two subsets depending on the type of the referenceable picture.
-Current picture reference picture set ListCurr: List of pictures that can be referred to in the target picture-Subsequent picture reference picture set ListFoll: List of pictures that are not referenced in the target picture but can be referred to in subsequent pictures in decoding order in the target picture The number of pictures included in the picture reference picture set is called the current picture referenceable picture number NumCurrList.

現ピクチャ参照ピクチャセットは、さらに４つの部分リストから構成される。
・長期参照ピクチャセットRefPicSetLtCurr：SPS長期RP情報またはSH長期RP情報により指定される現ピクチャ参照可能ピクチャ
・短期前方参照ピクチャセットListStCurrBefore：SPS短期RPS情報またはＳＨ短期RPS情報により指定される現ピクチャ参照可能ピクチャであって、表示順が対象ピクチャより早いもの
・短期後方参照ピクチャセットListStCurrAfter：SPS短期RPS情報またはＳＨ短期RPS情報により指定される現ピクチャ参照可能ピクチャであって、表示順が対象ピクチャより早いもの
・レイヤ間参照ピクチャセットRefPicSetInterLayer：ILRP情報により指定される現ピクチャ参照可能ピクチャ
後続ピクチャ参照ピクチャセットは、さらに２つの部分リストから構成される。
・後続ピクチャ長期参照ピクチャセットListLtFoll：SPS長期RP情報またはＳＨ長期RP情報により指定される後続ピクチャ参照可能ピクチャ。
・後続ピクチャ短期参照ピクチャセットListStFoll：SPS短期RPS情報またはＳＨ短期RPS情報により指定される現ピクチャ参照可能ピクチャ。 The current picture reference picture set is further composed of four partial lists.
・ Long-term reference picture set RefPicSetLtCurr: Picture that can be referred to the current picture specified by SPS long-term RP information or SH long-term RP information ・ Short-term forward reference picture set ListStCurrBefore: Current picture reference specified by SPS short-term RPS information or SH short-term RPS information A picture whose display order is earlier than the target picture. Short-term backward reference picture set ListStCurrAfter: current picture referenceable picture specified by SPS short-term RPS information or SH short-term RPS information, and the display order is earlier than the target picture. Inter-layer reference picture set RefPicSetInterLayer: Current picture referenceable picture specified by ILRP information The subsequent picture reference picture set is further composed of two partial lists.
Subsequent picture long-term reference picture set ListLtFoll: A picture that can be referred to a subsequent picture specified by SPS long-term RP information or SH long-term RP information.
Subsequent picture short-term reference picture set ListStFoll: Picture that can be referred to the current picture specified by SPS short-term RPS information or SH short-term RPS information.

RPSを構成する、短期前方参照ピクチャセットListStCurrBefore、短期後方参照ピクチャセットListStCurrAfter、長期参照ピクチャセットRefPicSetLtCurr、レイヤ間参照ピクチャセットRefPicSetInterLayer、後続ピクチャ短期参照ピクチャセットRefPicSetFoll、および、後続ピクチャ長期参照ピクチャセットListLtFollを次の手順で生成する。加えて、現ピクチャ参照可能ピクチャ数NumPocTotalCurrを導出する。なお、前記各参照ピクチャセットは、以下の処理の開始前に空に設定されている。
（Ｓ１０１）SPS短期RPS情報、および、SH短期RPS情報に基づいて、対象ピクチャの復号に用いる単一の短期参照ピクチャセットを特定する。具体的には、SH短期RPS情報に含まれるshort_term_ref_pic_set_spsの値が０である場合、SH短期RPS情報に含まれるスライスセグメントヘッダで明示的に伝送された短期RPSを選択する。それ以外（short_term_ref_pic_set_spsの値が１の場合、ＳＨ短期RPS情報に含まれるshort_term_ref_pic_set_idxが示す短期RPSを、SPS短期RPS情報に含まれる複数の短期RPSの中から選択する。
（Ｓ１０２）選択された短期RPSに含まれる参照ピクチャ各々のPOCを導出する。参照ピクチャのPOCは、参照ピクチャが前方短期参照ピクチャの場合、対象ピクチャのPOCから「delta_poc_s0_minus1[i]+1」の値を減算して導出する。一方、参照ピクチャが後方短期参照ピクチャの場合、対象ピクチャのPOCに「delta_poc_s1_minus1[i]+1」の値を加算して導出する。
（Ｓ１０３）短期RPSに含まれる前方参照ピクチャを伝送された順に確認し、関連付けられているused_by_curr_pic_s0_flag[i]の値が1である場合、当該前方参照ピクチャを短期前方参照ピクチャセットRefPicSetCurrBeforeに追加する。それ以外（used_by_curr_pic_s0_flag[i]の値が0）の場合、当該前方参照ピクチャを後続ピクチャ短期参照ピクチャセットRefPicSetFollに追加する。
（Ｓ１０４）短期RPSに含まれる後方参照ピクチャを伝送された順に確認し、関連付けられているused_by_curr_pic_s1_flag[i]の値が1である場合、当該後方参照ピクチャを短期後方参照ピクチャセットRefPicSetCurrAfterに追加する。それ以外（used_by_curr_pic_s1_flag[i]の値が０の場合、当該前方参照ピクチャを後続ピクチャ短期参照ピクチャセットRefPicSetFollに追加する。
（Ｓ１０５） SPS長期RP情報、および、SH長期RP情報に基づいて、対象ピクチャの復号に用いる長期参照ピクチャを特定する。具体的には、num_long_term_spsの数の参照ピクチャをSPS長期RP情報に含まれる参照ピクチャの中から選択して、順に長期RPSに追加する。選択される参照ピクチャは、lt_idx_sps[i]の示す参照ピクチャである。続いて、num_long_term_picsの数の参照ピクチャをSH長期RP情報に含まれる参照ピクチャを順に長期RPSに追加する。
（Ｓ１０６）長期RPSに含まれる参照ピクチャ各々のPOCを導出する。長期参照ピクチャのPOCは、関連付けて復号されたpoc_lst_lt[i]、または、lt_ref_pic_poc_lsb_sps[i]の値から直接導出される。
（Ｓ１０７）長期RPSに含まれる参照ピクチャを順に確認し、関連付けられているused_by_curr_pic_lt_flag[i]、または、used_by_curr_pic_lt_sps_flag[i]の値が1である場合、当該長期参照ピクチャを長期参照ピクチャセットRefPicSetLtCurrに追加する。それ以外（used_by_curr_pic_lt_flag[i]、または、used_by_curr_pic_lt_sps_flag[i]の値が0）の場合、当該長期参照ピクチャを後続ピクチャ長期参照ピクチャセットListLtFollに追加する。
（Ｓ１０８）実依存レイヤ復号部４０３において復号された実依存レイヤ情報（ILRP情報）に従って、参照ピクチャ（レイヤ間参照ピクチャ）をレイヤ間参照ピクチャセットRefPicSetInterLayerに追加する。具体的には、図２１（ｃ）に示すように、対象ピクチャと同じ時間のピクチャpicXが復号ピクチャバッファ内にあり、そのpicXのレイヤnuh_layer_idがActiveSamplePredRefPicLayerId[i]と等しい時、レイヤ間参照ピクチャセットRefPicSetInterLayer[i]にpicXを設定し、picXが、長期予測参照（used for long-term reference）の示すマークをつける。上記条件が成立しない場合は、RefPicSetInterLayer[i]に参照ピクチャがないことを示すマークをつける。上記処理において長期参照ピクチャのマークがついた参照ピクチャを、レイヤ間参照ピクチャとして、レイヤ間参照ピクチャセットRefPicSetInterLayerに追加する。
（Ｓ１０９）変数NumPocTotalCurrの値を、現ピクチャから参照可能な参照ピクチャの和に設定する。すなわち、変数NumPocTotalCurrの値を、短期前方参照ピクチャセットRefPicSetCurrBefore、短期後方参照ピクチャセットRefPicSetCurrAfter、長期参照ピクチャセットRefPicSetLtCurr、および、レイヤ間参照ピクチャセットRefPicSetInterLayerの４つのリストの各要素数の和に設定する。レイヤ間参照ピクチャセットRefPicSetInterLayerの要素数の加算は、図２１（ｄ）に示すように、サンプル依存レイヤ数（実サンプル依存レイヤ数NumActiveSamplePredRefLayers）を加算することで行われる。参照ピクチャセット導出部４０４は、参照ピクチャリスト(RPL)を構築する。以下、参照ピクチャリスト構築処理の詳細を説明する。参照ピクチャリスト導出部は、参照ピクチャセットRPSと、RPL修正情報に基づいて参照ピクチャリストRPLを生成する。 Short-term forward reference picture set ListStCurrBefore, short-term backward reference picture set ListStCurrAfter, long-term reference picture set RefPicSetLtCurr, inter-layer reference picture set RefPicSetInterLayer, subsequent picture short-term reference picture set RefPicSetFoll, and subsequent picture long-term reference picture set ListLtFoll Generate by the following procedure. In addition, the current picture referenceable picture number NumPocTotalCurr is derived. Each reference picture set is set to be empty before starting the following processing.
(S101) Based on the SPS short-term RPS information and the SH short-term RPS information, a single short-term reference picture set used for decoding the current picture is specified. Specifically, when the value of short_term_ref_pic_set_sps included in the SH short-term RPS information is 0, the short-term RPS explicitly transmitted in the slice segment header included in the SH short-term RPS information is selected. Other than that (when the value of short_term_ref_pic_set_sps is 1, the short-term RPS indicated by short_term_ref_pic_set_idx included in the SH short-term RPS information is selected from a plurality of short-term RPSs included in the SPS short-term RPS information.
(S102) The POC of each reference picture included in the selected short-term RPS is derived. When the reference picture is a forward short-term reference picture, the reference picture POC is derived by subtracting the value of “delta_poc_s0_minus1 [i] +1” from the POC of the target picture. On the other hand, when the reference picture is a backward short-term reference picture, it is derived by adding the value of “delta_poc_s1_minus1 [i] +1” to the POC of the target picture.
(S103) The forward reference pictures included in the short-term RPS are confirmed in the order of transmission, and when the value of used_by_curr_pic_s0_flag [i] associated with the forward reference picture is 1, the forward reference picture is added to the short-term forward reference picture set RefPicSetCurrBefore. Otherwise (used_by_curr_pic_s0_flag [i] value is 0), the forward reference picture is added to the subsequent picture short-term reference picture set RefPicSetFoll.
(S104) The backward reference pictures included in the short-term RPS are confirmed in the order of transmission. If the value of used_by_curr_pic_s1_flag [i] associated with the backward reference picture is 1, the backward reference picture is added to the short-term backward reference picture set RefPicSetCurrAfter. Other than that (when the value of used_by_curr_pic_s1_flag [i] is 0, the forward reference picture is added to the subsequent picture short-term reference picture set RefPicSetFoll.
(S105) Based on the SPS long-term RP information and the SH long-term RP information, a long-term reference picture used for decoding the current picture is specified. Specifically, num_long_term_sps reference pictures are selected from the reference pictures included in the SPS long-term RP information, and are sequentially added to the long-term RPS. The selected reference picture is the reference picture indicated by lt_idx_sps [i]. Subsequently, reference pictures included in the SH long-term RP information are sequentially added to the long-term RPS as many reference pictures as num_long_term_pics.
(S106) The POC of each reference picture included in the long-term RPS is derived. The POC of the long-term reference picture is directly derived from the value of poc_lst_lt [i] or lt_ref_pic_poc_lsb_sps [i] decoded in association with each other.
(S107) Check the reference pictures included in the long-term RPS in order, and if the value of associated used_by_curr_pic_lt_flag [i] or used_by_curr_pic_lt_sps_flag [i] is 1, add the long-term reference picture to the long-term reference picture set RefPicSetLtCurr To do. In other cases (used_by_curr_pic_lt_flag [i] or used_by_curr_pic_lt_sps_flag [i] has a value of 0), the long-term reference picture is added to the subsequent picture long-term reference picture set ListLtFoll.
(S108) The reference picture (inter-layer reference picture) is added to the inter-layer reference picture set RefPicSetInterLayer according to the real dependence layer information (ILRP information) decoded by the real dependence layer decoding unit 403. Specifically, as shown in FIG. 21C, when a picture picX of the same time as the target picture is in the decoded picture buffer and the layer nuh_layer_id of the picX is equal to ActiveSamplePredRefPicLayerId [i], an inter-layer reference picture set PicX is set to RefPicSetInterLayer [i], and picX puts a mark indicating a used for long-term reference. If the above condition is not satisfied, a mark indicating that there is no reference picture is attached to RefPicSetInterLayer [i]. The reference picture marked with the long-term reference picture in the above process is added to the inter-layer reference picture set RefPicSetInterLayer as an inter-layer reference picture.
(S109) The value of the variable NumPocTotalCurr is set to the sum of reference pictures that can be referenced from the current picture. That is, the value of the variable NumPocTotalCurr is set to the sum of the numbers of elements in the four lists of the short-term forward reference picture set RefPicSetCurrBefore, the short-term backward reference picture set RefPicSetCurrAfter, the long-term reference picture set RefPicSetLtCurr, and the inter-layer reference picture set RefPicSetInterLayer. The addition of the number of elements of the inter-layer reference picture set RefPicSetInterLayer is performed by adding the number of sample dependent layers (number of actual sample dependent layers NumActiveSamplePredRefLayers) as shown in FIG. The reference picture set deriving unit 404 constructs a reference picture list (RPL). Details of the reference picture list construction process will be described below. The reference picture list deriving unit generates a reference picture list RPL based on the reference picture set RPS and the RPL modification information.

参照ピクチャリストはＬ０参照リストとＬ１参照リストの２つのリストから構成される。始めに、ＬＸ参照リスト（Ｘは０もしくは１）の構築手順を説明する。図２３（ｂ）に示すように、Ｌ０参照リストは、以下のＳ３０１〜Ｓ３０７に示す手順で構築される。
（Ｓ３０１）暫定ＬＸ参照リストRefPicListTempXを生成して、空のリストに初期化する。
（Ｓ３０２）インター予測ＲＰＬ有効フラグInterRefEnabledInRPLFlagが１である場合に、暫定ＬＸ参照リストRefPicListTempXに対し、短期前方参照ピクチャセットRefPicSetStCurrBeforeに含まれる参照ピクチャを順に追加する。
（Ｓ３０３）インター予測ＲＰＬ有効フラグInterRefEnabledInRPLFlagが１である場合に、暫定ＬＸ参照リストに対し、短期後方参照ピクチャセットRefPicSetStCurrAfterに含まれる参照ピクチャを順に追加する。
（Ｓ３０４）インター予測ＲＰＬ有効フラグInterRefEnabledInRPLFlagが１である場合に、暫定ＬＸ参照リストに対し、長期参照ピクチャセットRefPicSetLtCurrに含まれる参照ピクチャを順に追加する。
（Ｓ３０５）暫定ＬＸ参照リストに対し、ピクチャレイヤ間参照ピクチャセットRefPicSetInterLayerに含まれる参照ピクチャを順に追加する。
（Ｓ３０６）符号化データから復号されるlists_modification_present_flagが１の場合には、暫定ＬＸ参照リストの修正を行う。
（Ｓ３０７）暫定ＬＸ参照リストをＬＸ参照リストとする。 The reference picture list is composed of two lists, an L0 reference list and an L1 reference list. First, the construction procedure of the LX reference list (X is 0 or 1) will be described. As shown in FIG. 23 (b), the L0 reference list is constructed according to the procedure shown in S301 to S307 below.
(S301) A provisional LX reference list RefPicListTempX is generated and initialized to an empty list.
(S302) When the inter prediction RPL valid flag InterRefEnabledInRPLFlag is 1, reference pictures included in the short-term forward reference picture set RefPicSetStCurrBefore are sequentially added to the provisional LX reference list RefPicListTempX.
(S303) When the inter prediction RPL valid flag InterRefEnabledInRPLFlag is 1, reference pictures included in the short-term backward reference picture set RefPicSetStCurrAfter are sequentially added to the provisional LX reference list.
(S304) When the inter prediction RPL valid flag InterRefEnabledInRPLFlag is 1, reference pictures included in the long-term reference picture set RefPicSetLtCurr are sequentially added to the provisional LX reference list.
(S305) Reference pictures included in the inter-picture layer reference picture set RefPicSetInterLayer are sequentially added to the provisional LX reference list.
(S306) When the lists_modification_present_flag decoded from the encoded data is 1, the provisional LX reference list is modified.
(S307) The provisional LX reference list is set as an LX reference list.

次に、Ｌ１参照リストが同様の処理によって構築される。但し、Ｌ１参照リストの構築では、Ｓ３０３とＳ３０２の処理順が逆になる。つまり、短期前方参照ピクチャセットRefPicSetStCurrAfterが短期前方参照ピクチャセットRefPicSetStCurrBeforeより先に格納される。 Next, the L1 reference list is constructed by a similar process. However, in the construction of the L1 reference list, the processing order of S303 and S302 is reversed. That is, the short-term forward reference picture set RefPicSetStCurrAfter is stored before the short-term forward reference picture set RefPicSetStCurrBefore.

以上の構成の参照ピクチャセット導出部４０４によれば、ＲＰＳに格納される参照ピクチャは、全てサンプル予測が可能であることが保証されるため、レイヤ間サンプル予測処理の簡略化を図ることができるという効果を奏する。 According to the reference picture set deriving unit 404 having the above configuration, it is guaranteed that all the reference pictures stored in the RPS can be sample-predicted, so that the inter-layer sample prediction process can be simplified. There is an effect.

コロケートピクチャ復号部４０５は、対象スライスセグメントにおいてテンポラル動き予測が有効な場合（slice_temporal_mvp_enabled_flag==1）、テンポラル動き予測の参照ピクチャcolPicを指定するためのパラメータを、スライスセグメントヘッダから復号する。テンポラル動き予測の参照ピクチャは、対象レイヤがベースレイヤであるか、実動き依存レイヤ数NumActiveMotionPredRefLayersが0であるか、もしくは、代替コロケート指示フラグalt_collocated_indication_flagが0である場合には、参照ピクチャインデックスを用いて指定され、alt_collocated_indication_flagが１である場合には、コロケート参照レイヤ指示子collocated_ref_layer_idxを用いて指定される。 When temporal motion prediction is valid in the target slice segment (slice_temporal_mvp_enabled_flag == 1), the collocated picture decoding unit 405 decodes a parameter for specifying the reference picture colPic for temporal motion prediction from the slice segment header. If the target layer is the base layer, the actual motion-dependent layer number NumActiveMotionPredRefLayers is 0, or the alternative collocated indication flag alt_collocated_indication_flag is 0, the reference picture index of the temporal motion prediction is used. If it is specified and alt_collocated_indication_flag is 1, it is specified using a collocated reference layer indicator collocated_ref_layer_idx.

コロケートピクチャ復号部４０５は、図１２に示すように、テンポラル動き予測が有効であって、対象レイヤが非ベースレイヤ（nuh_layer_id＞０）かつ実動き依存レイヤが存在する（NumActiveMotionPredRefLayers>0）時、代替コロケート指示フラグalt_collocated_indication_flagを復号する。代替コロケート指示フラグを復号しない場合、コロケートピクチャ復号部４０５は、alt_collocated_indication_flagに0を設定する。代替コロケート指示フラグalt_collocated_indication_flagが1の時、コロケートピクチャ復号部はさらに、コロケート参照レイヤ指示子collocated_ref_layer_idxを復号する。ただし、実動き依存レイヤ数が１の場合（NumActiveMotionPredRefLayers==1）は、コロケートピクチャ復号部は、コロケート参照レイヤ指示子を復号せず、collocated_ref_layer_idxに0を設定する。 As shown in FIG. 12, the collocated picture decoding unit 405 substitutes when temporal motion prediction is effective, the target layer is a non-base layer (nuh_layer_id> 0), and an actual motion dependent layer exists (NumActiveMotionPredRefLayers> 0). The collocated indication flag alt_collocated_indication_flag is decoded. When the alternative collocation instruction flag is not decoded, the collocated picture decoding unit 405 sets 0 to alt_collocated_indication_flag. When the alternative collocation indication flag alt_collocated_indication_flag is 1, the collocated picture decoding unit further decodes the collocated reference layer indicator collocated_ref_layer_idx. However, when the number of actual motion-dependent layers is 1 (NumActiveMotionPredRefLayers == 1), the collocated picture decoding unit does not decode the collocated reference layer indicator and sets collocated_ref_layer_idx to 0.

コロケートピクチャ復号部４０５は、代替コロケート指示フラグが有効でない場合(alt_collocated_indication_flagが０)には、collocated_from_l0_flagで選択される参照ピクチャリストのインデックスcollocated_ref_idxで示されるピクチャを、テンポラル動き予測の参照ピクチャcolPicに設定する。具体的には、コロケートピクチャ復号部４０５は、collocated_from_l0_flagが０の場合には、RefPicList1[ collocated_ref_idx ]で示されるピクチャを参照ピクチャcolPicに設定し、collocated_from_l0_flagが１の場合には、RefPicList0[ collocated_ref_idx ]で示されるピクチャを参照ピクチャcolPicに設定する。 When the alternative collocation indication flag is not valid (alt_collocated_indication_flag is 0), the collocated picture decoding unit 405 sets the picture indicated by the index collocated_ref_idx of the reference picture list selected by collocated_from_l0_flag as the reference picture colPic for temporal motion prediction . Specifically, when collocated_from_l0_flag is 0, the collocated picture decoding unit 405 sets the picture indicated by RefPicList1 [collocated_ref_idx] to the reference picture colPic, and when collocated_from_l0_flag is 1, it is indicated by RefPicList0 [collocated_ref_idx] Set the picture to be the reference picture colPic.

コロケートピクチャ復号部４０５は、代替コロケート指示フラグが有効である場合(alt_collocated_indication_flagが１)には、コロケート参照レイヤ指示子collocated_ref_layer_idxに対応する実動き依存レイヤ識別子ActiveMotionPredRefLayerId[collocated_ref_layer_idx]で示されるレイヤにおける同時刻のピクチャを、テンポラル動き予測の参照ピクチャcolPicに設定する。 When the alternative collocation indication flag is valid (alt_collocated_indication_flag is 1), the collocated picture decoding unit 405 at the same time in the layer indicated by the actual motion dependent layer identifier ActiveMotionPredRefLayerId [collocated_ref_layer_idx] corresponding to the collocated reference layer indicator collocated_ref_layer_idx The picture is set as a reference picture colPic for temporal motion prediction.

（画像データ復号部の構成）
次に、画像データ復号部３００の構成について説明する。 (Configuration of image data decoding unit)
Next, the configuration of the image data decoding unit 300 will be described.

図８は、本実施形態に係る画像データ復号部３００の構成を示す概略図である。画像データ復号部３００は、エントロピー復号部３０１、予測パラメータ復号部３０２、参照ピクチャメモリ（参照画像記憶部、フレームメモリ）３０６、予測パラメータメモリ（予測パラメータ記憶部、フレームメモリ）３０７、予測画像生成部３０８、逆量子化・逆ＤＣＴ部３１１、加算部３１２及びレイヤ間動きマッピング部３１３を含んで構成される。 FIG. 8 is a schematic diagram illustrating a configuration of the image data decoding unit 300 according to the present embodiment. The image data decoding unit 300 includes an entropy decoding unit 301, a prediction parameter decoding unit 302, a reference picture memory (reference image storage unit, frame memory) 306, a prediction parameter memory (prediction parameter storage unit, frame memory) 307, and a prediction image generation unit. 308, an inverse quantization / inverse DCT unit 311, an adder 312, and an inter-layer motion mapping unit 313.

また、予測パラメータ復号部３０２は、インター予測パラメータ復号部３０３及びイントラ予測パラメータ復号部３０４を含んで構成される。予測画像生成部３０８は、インター予測画像生成部３０９及びイントラ予測画像生成部３１０を含んで構成される。 The prediction parameter decoding unit 302 includes an inter prediction parameter decoding unit 303 and an intra prediction parameter decoding unit 304. The predicted image generation unit 308 includes an inter predicted image generation unit 309 and an intra predicted image generation unit 310.

エントロピー復号部３０１は、入力された符号化データに対してエントロピー復号を行って、個々の符号（シンタックス要素）を分離し復号する。エントロピー復号部は、ヘッダ復号部４００で復号された情報を利用して、一部の符号の有無を判定する。その際に利用する情報としては、例えば、対象スライスの符号化タイプを示すslice_typeがある。分離された符号には、予測画像を生成するための予測情報および、差分画像を生成するための残差情報などがある。 The entropy decoding unit 301 performs entropy decoding on the input encoded data, and separates and decodes individual codes (syntax elements). The entropy decoding unit uses the information decoded by the header decoding unit 400 to determine the presence or absence of some codes. As information used at that time, for example, there is slice_type indicating the coding type of the target slice. The separated codes include prediction information for generating a prediction image and residual information for generating a difference image.

エントロピー復号部３０１は、分離した符号の一部を予測パラメータ復号部３０２に出力する。分離した符号の一部とは、例えば、予測モードpredMode、分割モードpart_mode、マージフラグmerge_flag、マージインデックスmerge_idx、インター予測フラグinter_pred_idc、参照ピクチャインデックスrefIdxLX、予測ベクトルインデックスmvp_LX_idx、差分ベクトルmvdLXなどである。どの符号を復号するか否かの制御は、予測パラメータ復号部３０２の指示に基づいて行われる。エントロピー復号部３０１は、量子化係数を逆量子化・逆ＤＣＴ部３１１に出力する。この量子化係数は、符号化処理において、残差信号に対してＤＣＴ（Discrete Cosine Transform、離散コサイン変換）を行い量子化して得られる係数である。 The entropy decoding unit 301 outputs a part of the separated code to the prediction parameter decoding unit 302. Some of the separated codes are, for example, a prediction mode predMode, a partition mode part_mode, a merge flag merge_flag, a merge index merge_idx, an inter prediction flag inter_pred_idc, a reference picture index refIdxLX, a prediction vector index mvp_LX_idx, a difference vector mvdLX, and the like. Control of which code to decode is performed based on an instruction from the prediction parameter decoding unit 302. The entropy decoding unit 301 outputs the quantization coefficient to the inverse quantization / inverse DCT unit 311. This quantization coefficient is a coefficient obtained by performing quantization and performing DCT (Discrete Cosine Transform) on the residual signal in the encoding process.

予測パラメータ復号部３０２は、ヘッダ復号部４００から入力されるスライスレイヤ符号化パラメータおよびエントロピー復号部３０１から入力された符号に基づいて、インター予測パラメータまたはイントラ予測パラメータを復号する。そして、復号された予測パラメータを予測画像生成部３０８に出力し、また予測パラメータメモリ３０７に記憶する。 The prediction parameter decoding unit 302 decodes the inter prediction parameter or the intra prediction parameter based on the slice layer coding parameter input from the header decoding unit 400 and the code input from the entropy decoding unit 301. The decoded prediction parameter is output to the prediction image generation unit 308 and stored in the prediction parameter memory 307.

インター予測パラメータ復号部３０３は、予測パラメータメモリ３０７に記憶された予測パラメータを参照してインター予測パラメータを出力する。インター予測パラメータには、例えば、予測リスト利用フラグpredFlagLX、参照ピクチャインデックスrefIdxLX、動きベクトルmvLX等がある。例えば、
インター予測パラメータ復号部３０３は、内部にテンポラル動きベクトル導出部を備える。テンポラル動きベクトル導出部は、テンポラル動き予測が有効な場合、コロケートピクチャ復号部４０５が設定した参照ピクチャcolPicで指定されるピクチャ上の空間近傍ブロックの動きパラメータを予測パラメータメモリから読み出し、読み出した動きパラメータに基づいて対象ブロックの動きベクトルmvLXを導出する。 The inter prediction parameter decoding unit 303 refers to the prediction parameters stored in the prediction parameter memory 307 and outputs inter prediction parameters. The inter prediction parameters include, for example, a prediction list use flag predFlagLX, a reference picture index refIdxLX, a motion vector mvLX, and the like. For example,
The inter prediction parameter decoding unit 303 includes a temporal motion vector deriving unit. When the temporal motion prediction is valid, the temporal motion vector deriving unit reads the motion parameter of the spatial neighborhood block on the picture specified by the reference picture colPic set by the collocated picture decoding unit 405 from the prediction parameter memory, and reads the read motion parameter Based on the above, the motion vector mvLX of the target block is derived.

具体的には、テンポラル動きベクトル導出部は、参照ピクチャcolPicからコロケートブロックを特定し、コロケートブロックからテンポラル動きベクトルmvLXColとテンポラル動きベクトルmvLXColの利用可能性を示す利用可能性フラグavailableFlagLXColを導出する。 Specifically, the temporal motion vector deriving unit identifies a collocated block from the reference picture colPic, and derives an availability flag availableFlagLXCol indicating the availability of the temporal motion vector mvLXCol and the temporal motion vector mvLXCol from the collocated block.

次に、テンポラル動きベクトル導出部は、参照ピクチャcolPic中の対象ブロックと同一座標のブロックの、右下に隣接するブロックcolBrの座標を求める。対象ブロックの座標を(xPb, yPb)、対象ブロックの幅をnPbW、高さをnPbHとした場合、ブロックcolBrの座標(xColBr, yColBr)は、以下の式で求める。 Next, the temporal motion vector deriving unit obtains the coordinates of the block colBr adjacent to the lower right of the block having the same coordinates as the target block in the reference picture colPic. When the coordinates of the target block are (xPb, yPb), the width of the target block is nPbW, and the height is nPbH, the coordinates (xColBr, yColBr) of the block colBr are obtained by the following equations.

xColBr = xPb + nPbW
yColBr = yPb + nPbH
次に、テンポラル動きベクトル導出部は、ブロックcolBrの座標が対象ブロックの座標を含む符号化ツリーユニット行内に存在し、なお且つ、ブロックBrの座標がピクチャ内を示しているか否かを判定する。ブロックcolBrの座標が対象ブロックの座標を含む符号化ツリーユニット行内に存在するか否かは、符号化ツリーユニットのサイズの２の対数をCtbLog2SizeYとした場合、以下の式（Ａ−１）で判定する。 xColBr = xPb + nPbW
yColBr = yPb + nPbH
Next, the temporal motion vector deriving unit determines whether or not the coordinates of the block colBr are present in the coding tree unit row including the coordinates of the target block, and the coordinates of the block Br indicate the inside of the picture. Whether or not the coordinates of the block colBr are present in the coding tree unit row including the coordinates of the target block is determined by the following equation (A-1) when the logarithm of 2 of the size of the coding tree unit is CtbLog2SizeY. To do.

(yPb >> CtbLog2SizeY) == (yColBr >> CtbLog2SizeY) （Ａ−１）
また、ブロックcolBrがピクチャ内を示しているか否かは、ピクチャの横幅をpic_width_in_luma_samples、高さをpic_height_in_luma_samplesとした場合、以下の式（Ａ−２）で判定する。 (yPb >> CtbLog2SizeY) == (yColBr >> CtbLog2SizeY) (A-1)
Whether or not the block colBr indicates the inside of a picture is determined by the following equation (A-2) when the horizontal width of the picture is pic_width_in_luma_samples and the height is pic_height_in_luma_samples.

xColPb < pic_width_in_luma_samples && yColPb < pic_height_in_luma_samples （Ａ−２）
テンポラル動きベクトル導出部は、式（Ａ−１）、（Ａ−２）の結果が共に真の場合には、コロケートブロックの座標(xColPb, yColPb)を以下の式で設定する。 xColPb <pic_width_in_luma_samples && yColPb <pic_height_in_luma_samples (A-2)
The temporal motion vector derivation unit sets the coordinates (xColPb, yColPb) of the collocated block by the following formulas when the results of formulas (A-1) and (A-2) are both true.

xColPb = (xColBr >> 4) << 4
yColPb = (yColBr >> 4) << 4
テンポラル動きベクトル導出部は、コロケートブロックが有する動きベクトルmvLXからテンポラル動きベクトルmvLXColを導出し、利用可能性フラグavailableFlagLXCol.に１を設定する。ただし、コロケートブロックの予測モードがイントラモードである場合は、テンポラル動きベクトルmvLXCol[0]、mvLXCol[1]に０を設定し、利用可能性フラグavailableFlagLXColに０を設定する。 xColPb = (xColBr >> 4) << 4
yColPb = (yColBr >> 4) << 4
The temporal motion vector deriving unit derives a temporal motion vector mvLXCol from the motion vector mvLX included in the collocated block, and sets 1 to the availability flag availableFlagLXCol. However, when the prediction mode of the collocated block is the intra mode, 0 is set to the temporal motion vectors mvLXCol [0] and mvLXCol [1], and 0 is set to the availability flag availableFlagLXCol.

一方、テンポラル動きベクトル導出部は、式（Ａ−１）、（Ａ−２）の結果のどちらかが偽の場合には、テンポラル動きベクトルmvLXCol[0]、mvLXCol[1]に０を設定し、利用可能性フラグavailableFlagLXColに０を設定する。 On the other hand, the temporal motion vector deriving unit sets 0 to the temporal motion vectors mvLXCol [0] and mvLXCol [1] when either of the results of the expressions (A-1) and (A-2) is false. The availability flag availableFlagLXCol is set to 0.

テンポラル動きベクトル導出部は、利用可能性フラグavailableFlagLXColが０の場合には、参照ピクチャcolPic中の対象ブロックと同一座標のブロックの中心のブロックを、新たなコロケートブロックとして設定する。対象ブロックの座標を(xPb, yPb)、対象ブロックの幅をnPbW、高さをnPbH、とした場合、コロケートブロックの座標(xColPb, yColPb)は、以下の式で求める。 When the availability flag availableFlagLXCol is 0, the temporal motion vector derivation unit sets the block at the center of the block having the same coordinates as the target block in the reference picture colPic as a new collocated block. When the coordinates of the target block are (xPb, yPb), the width of the target block is nPbW, and the height is nPbH, the coordinates of the collocated block (xColPb, yColPb) are obtained by the following equations.

xColPb = ((xPb + (nPbW >> 1)) >> 4) << 4
yColPb = ((yPb + (nPbH >> 1)) >> 4) << 4
次に、テンポラル動きベクトル導出部は、コロケートブロックが有する動きベクトルmvLXからテンポラル動きベクトルmvLXColを導出し、利用可能性フラグavailableFlagLXColに１を設定する。ただし、コロケートブロックの予測モードがイントラモードである場合は、テンポラル動きベクトルmvLXCol[0]、mvLXCol[1]に０を設定し、利用可能性フラグavailableFlagLXColに０を設定する。 xColPb = ((xPb + (nPbW >> 1)) >> 4) << 4
yColPb = ((yPb + (nPbH >> 1)) >> 4) << 4
Next, the temporal motion vector derivation unit derives a temporal motion vector mvLXCol from the motion vector mvLX included in the collocated block, and sets 1 to the availability flag availableFlagLXCol. However, when the prediction mode of the collocated block is the intra mode, 0 is set to the temporal motion vectors mvLXCol [0] and mvLXCol [1], and 0 is set to the availability flag availableFlagLXCol.

導出されたテンポラル動きベクトルmvLXColは、ブロックがマージモードとなる場合には、ブロックの動きベクトルとして用いられる。また、テンポラル動きベクトルmvLXColは、予測動きベクトルとしても用いられ、エントロピー復号部３０１から得られた差分ベクトルとの和をとることで、動きベクトルmvLXが導出される。 The derived temporal motion vector mvLXCol is used as a motion vector of the block when the block is in the merge mode. The temporal motion vector mvLXCol is also used as a predicted motion vector, and a motion vector mvLX is derived by taking the sum of the difference vector obtained from the entropy decoding unit 301.

イントラ予測パラメータ復号部３０４は、エントロピー復号部３０１から入力された符号に基づいて、予測パラメータメモリ３０７に記憶された予測パラメータを参照してイントラ予測パラメータを復号する。イントラ予測パラメータとは、ピクチャブロックを１つのピクチャ内で予測する処理で用いるパラメータ、例えば、イントラ予測モードIntraPredModeである。イントラ予測パラメータ復号部３０４は、復号したイントラ予測パラメータを予測画像生成部３０８に出力し、また予測パラメータメモリ３０７に記憶する。 Based on the code input from the entropy decoding unit 301, the intra prediction parameter decoding unit 304 refers to the prediction parameter stored in the prediction parameter memory 307 and decodes the intra prediction parameter. The intra prediction parameter is a parameter used in a process of predicting a picture block within one picture, for example, an intra prediction mode IntraPredMode. The intra prediction parameter decoding unit 304 outputs the decoded intra prediction parameter to the prediction image generation unit 308 and stores it in the prediction parameter memory 307.

参照ピクチャメモリ３０６は、後述する加算部３１２が生成した参照ピクチャのブロック（参照ピクチャブロック）を、復号対象のピクチャ及びブロック毎に予め定めた位置に記憶する。 The reference picture memory 306 stores a reference picture block (reference picture block) generated by an adder 312 described later at a predetermined position for each picture and block to be decoded.

予測パラメータメモリ３０７は、予測パラメータを、復号対象のピクチャ及びブロック毎に予め定めた位置に記憶する。具体的には、予測パラメータメモリ３０７は、インター予測パラメータ復号部３０３が復号したインター予測パラメータ、イントラ予測パラメータ復号部３０４が復号したイントラ予測パラメータ及びエントロピー復号部３０１が分離した予測モードpredModeを記憶する。記憶されるインター予測パラメータには、例えば、予測リスト利用フラグpredFlagLX（インター予測フラグinter_pred_idc）、参照ピクチャインデックスrefIdxLX、ベクトルmvLXがある。 The prediction parameter memory 307 stores the prediction parameter at a predetermined position for each picture and block to be decoded. Specifically, the prediction parameter memory 307 stores the inter prediction parameter decoded by the inter prediction parameter decoding unit 303, the intra prediction parameter decoded by the intra prediction parameter decoding unit 304, and the prediction mode predMode separated by the entropy decoding unit 301. . The stored inter prediction parameters include, for example, a prediction list use flag predFlagLX (inter prediction flag inter_pred_idc), a reference picture index refIdxLX, and a vector mvLX.

予測画像生成部３０８には、エントロピー復号部３０１から入力された予測モードpredModeが入力され、また予測パラメータ復号部３０２から予測パラメータが入力される。また、予測画像生成部３０８は、参照ピクチャメモリ３０６から参照ピクチャを読み出す。予測画像生成部３０８は、予測モードpredModeが示す予測モードに応じて、入力された予測パラメータと読み出した参照ピクチャを用いて予測ピクチャブロックＰ（予測画像）を生成する。 The prediction image generation unit 308 receives the prediction mode predMode input from the entropy decoding unit 301, and also receives prediction parameters from the prediction parameter decoding unit 302. Further, the predicted image generation unit 308 reads a reference picture from the reference picture memory 306. The predicted image generation unit 308 generates a predicted picture block P (predicted image) using the input prediction parameter and the read reference picture according to the prediction mode indicated by the prediction mode predMode.

ここで、予測モードpredModeがインター予測モードを示す場合、インター予測画像生成部３０９は、インター予測パラメータ復号部３０３から入力されたインター予測パラメータと読み出した参照ピクチャを用いてインター予測により予測ピクチャブロックＰを生成する。予測ピクチャブロックＰはＰＵに対応する。ＰＵは、上述したように予測処理を行う単位となる複数の画素からなるピクチャの一部分、つまり１度に予測処理が行われる復号対象ブロックに相当する。 Here, when the prediction mode predMode indicates the inter prediction mode, the inter prediction image generation unit 309 uses the inter prediction parameter input from the inter prediction parameter decoding unit 303 and the read reference picture to perform the prediction picture block P by inter prediction. Is generated. The predicted picture block P corresponds to the PU. The PU corresponds to a part of a picture composed of a plurality of pixels as a unit for performing the prediction process as described above, that is, a decoding target block on which the prediction process is performed at once.

インター予測画像生成部３０９は、参照ピクチャセット(RPS)から導出される参照ピクチャリスト(RPL)で指定される参照ピクチャを用いて予測画像を生成する。 The inter prediction image generation unit 309 generates a prediction image using a reference picture specified by a reference picture list (RPL) derived from a reference picture set (RPS).

すなわち、予測リスト利用フラグpredFlagLXが１である参照ピクチャリスト（RefPicListL0、もしくはRefPicListL1）に対し、参照ピクチャインデックスrefIdxLXで示される参照ピクチャ(RefPicListLX[refIdxLX])から、復号対象ブロックを基準としてベクトルmvLXが示す位置にある参照ピクチャブロックを参照ピクチャメモリ３０６から読み出す。インター予測画像生成部３０９は、読み出した参照ピクチャブロックについて予測を行って予測ピクチャブロックＰを生成する。インター予測画像生成部３０９は、生成した予測ピクチャブロックＰを加算部３１２に出力する。 That is, for a reference picture list (RefPicListL0 or RefPicListL1) whose prediction list usage flag predFlagLX is 1, a vector mvLX indicates from a reference picture (RefPicListLX [refIdxLX]) indicated by a reference picture index refIdxLX with reference to a decoding target block The reference picture block at the position is read from the reference picture memory 306. The inter prediction image generation unit 309 performs prediction on the read reference picture block to generate a prediction picture block P. The inter prediction image generation unit 309 outputs the generated prediction picture block P to the addition unit 312.

予測モードpredModeがイントラ予測モードを示す場合、イントラ予測画像生成部３１０は、イントラ予測パラメータ復号部３０４から入力されたイントラ予測パラメータと読み出した参照ピクチャを用いてイントラ予測を行う。具体的には、イントラ予測画像生成部３１０は、復号対象のピクチャであって、既に復号されたブロックのうち復号対象ブロックから予め定めた範囲にある参照ピクチャブロックを参照ピクチャメモリ３０６から読み出す。予め定めた範囲とは、復号対象ブロックがいわゆるラスタースキャンの順序で順次移動する場合、例えば、左、左上、上、右上の隣接ブロックのうちのいずれかであり、イントラ予測モードによって異なる。ラスタースキャンの順序とは、各ピクチャにおいて、上端から下端まで各行について、順次左端から右端まで移動させる順序である。 When the prediction mode predMode indicates the intra prediction mode, the intra predicted image generation unit 310 performs intra prediction using the intra prediction parameter input from the intra prediction parameter decoding unit 304 and the read reference picture. Specifically, the intra predicted image generation unit 310 reads, from the reference picture memory 306, a reference picture block that is a decoding target picture and is in a predetermined range from the decoding target block among blocks that have already been decoded. The predetermined range is, for example, any of the left, upper left, upper, and upper right adjacent blocks when the decoding target block sequentially moves in a so-called raster scan order, and varies depending on the intra prediction mode. The raster scan order is an order in which each row is sequentially moved from the left end to the right end in each picture from the upper end to the lower end.

イントラ予測画像生成部３１０は、読み出した参照ピクチャブロックについてイントラ予測モードIntraPredModeが示す予測モードで予測を行って予測ピクチャブロックを生成する。イントラ予測画像生成部３１０は、生成した予測ピクチャブロックＰを加算部３１２に出力する。 The intra predicted image generation unit 310 generates a predicted picture block by performing prediction in the prediction mode indicated by the intra prediction mode IntraPredMode for the read reference picture block. The intra predicted image generation unit 310 outputs the generated predicted picture block P to the addition unit 312.

逆量子化・逆ＤＣＴ部３１１は、エントロピー復号部３０１から入力された量子化係数を逆量子化してＤＣＴ係数を求める。逆量子化・逆ＤＣＴ部３１１は、求めたＤＣＴ係数について逆ＤＣＴ（Inverse Discrete Cosine Transform、逆離散コサイン変換）を行い、復号残差信号を算出する。逆量子化・逆ＤＣＴ部３１１は、算出した復号残差信号を加算部３１２に出力する。 The inverse quantization / inverse DCT unit 311 performs inverse quantization on the quantization coefficient input from the entropy decoding unit 301 to obtain a DCT coefficient. The inverse quantization / inverse DCT unit 311 performs inverse DCT (Inverse Discrete Cosine Transform) on the obtained DCT coefficient to calculate a decoded residual signal. The inverse quantization / inverse DCT unit 311 outputs the calculated decoded residual signal to the adder 312.

加算部３１２は、インター予測画像生成部３０９及びイントラ予測画像生成部３１０から入力された予測ピクチャブロックＰと逆量子化・逆ＤＣＴ部３１１から入力された復号残差信号の信号値を画素毎に加算して、参照ピクチャブロックを生成する。加算部３１２は、生成した参照ピクチャブロックを参照ピクチャメモリ３０６に記憶し、生成した参照ピクチャブロックをピクチャ毎に統合した復号レイヤ画像Ｔｄを外部に出力する。 The adder 312 outputs the prediction picture block P input from the inter prediction image generation unit 309 and the intra prediction image generation unit 310 and the signal value of the decoded residual signal input from the inverse quantization / inverse DCT unit 311 for each pixel. Addition to generate a reference picture block. The adder 312 stores the generated reference picture block in the reference picture memory 306, and outputs a decoded layer image Td in which the generated reference picture block is integrated for each picture to the outside.

上記構成の画像復号装置３１の１構成では、図２０〜図２２を用いて説明したように、レイヤ依存タイプを復号し、前記レイヤ依存タイプに対応するレイヤがサンプル依存を示すレイヤであるかを判定する手段を備える依存タイプ復号部と、前記サンプル依存を示す依存レイヤから、サンプル依存レイヤを抽出する依存レイヤ導出部（実依存レイヤ復号部４０３）と、前記サンプル依存レイヤから参照ピクチャセットを導出する参照ピクチャセット導出部４０４と、前記レイヤ間参照ピクチャセットを用いて導出される参照ピクチャリストで指定される参照ピクチャを用いて予測画像を生成するインター予測画像生成部３０９を備えることを特徴とする。 In one configuration of the image decoding device 31 having the above configuration, as described with reference to FIGS. 20 to 22, the layer dependence type is decoded, and whether the layer corresponding to the layer dependence type is a layer showing sample dependence is determined. A dependency type decoding unit having a means for determining, a dependency layer deriving unit (actual dependency layer decoding unit 403) for extracting a sample dependency layer from the dependency layer indicating the sample dependency, and deriving a reference picture set from the sample dependency layer A reference picture set deriving unit 404 and an inter predicted image generating unit 309 that generates a predicted image using a reference picture specified by a reference picture list derived using the inter-layer reference picture set. To do.

上記構成の画像復号装置３１は、より具体的には、依存タイプ復号部において、パラメータセットの符号化データから、レイヤ依存タイプを復号し、前記レイヤ依存タイプに応じて、依存レイヤがサンプル依存レイヤか否かを示すフラグであるSamplePredEnableFlagを導出し、依存レイヤ導出部（実依存レイヤ復号部４０３）において、スライスセグメントレイヤの符号化データからレイヤ間予測参照ピクチャのレイヤ間依存識別子（inter_layer_pred_layer_idc）を復号し、SamplePredEnableFlagが１である依存レイヤから、サンプル依存レイヤActiveSamplePredRefLayerIdを導出する。 More specifically, in the image decoding device 31 configured as described above, the dependency type decoding unit decodes the layer dependent type from the encoded data of the parameter set, and the dependent layer is a sample dependent layer according to the layer dependent type. SamplePredEnableFlag, which is a flag indicating whether or not, is derived, and in the dependent layer deriving unit (actually dependent layer decoding unit 403), the inter layer dependency identifier (inter_layer_pred_layer_idc) of the inter layer prediction reference picture is decoded from the encoded data of the slice segment layer Then, from the dependency layer whose SamplePredEnableFlag is 1, the sample dependency layer ActiveSamplePredRefLayerId is derived.

上記構成の画像復号装置３１の１構成では、図２６を用いて説明したように、レイヤ間予測限定フラグinter_layer_sample_onlyフラグを復号するレイヤ間予測限定フラグ復号部（実依存レイヤ復号部４０３）を備え、上記サンプル依存レイヤ復号手段（実依存レイヤ復号部４０３）は、前記レイヤ依存タイプに応じて、前記サンプル依存レイヤの数NumActiveSamplePredRefLayersを導出し、上記参照ピクチャセット導出部４０４は、前記レイヤ間予測限定フラグinter_layer_sample_onlyフラグが、レイヤ間予測のみを示す場合には、同一レイヤのピクチャを参照ピクチャリストに入れず、前記レイヤ間予測限定フラグ復号手段は、動きのみ依存レイヤ数NumActiveSamplePredRefLayersが１以上である場合に、レイヤ間予測限定フラグinter_layer_sample_onlyフラグを復号することを特徴とする。 In one configuration of the image decoding device 31 having the above configuration, as described with reference to FIG. 26, an inter-layer prediction limited flag decoding unit (actually dependent layer decoding unit 403) that decodes an inter-layer prediction limited flag inter_layer_sample_only flag is provided. The sample dependent layer decoding means (real dependent layer decoding unit 403) derives the number of sample dependent layers NumActiveSamplePredRefLayers according to the layer dependent type, and the reference picture set deriving unit 404 includes the inter-layer prediction limited flag. When the inter_layer_sample_only flag indicates only inter-layer prediction, a picture of the same layer is not included in the reference picture list, and the inter-layer prediction limited flag decoding unit is configured such that when the number of motion-dependent dependent layers NumActiveSamplePredRefLayers is 1 or more, Special feature for decoding inter-layer prediction only flag inter_layer_sample_only flag To.

上記構成の画像復号装置３１の１構成では、図２６、図１２を用いて説明したように、レイヤ依存タイプを復号し、前記レイヤ依存タイプに対応するレイヤが動き依存かつサンプル依存ではないことを示す動き依存限定レイヤであるかを判定する手段を備える依存タイプ復号部と、前記動き依存限定レイヤから、代替参照ピクチャを復号する代替参照ピクチャ復号部（コロケートピクチャ復号部４０５）と、前記代替参照ピクチャを用いて、テンポラル動きベクトルを導出するインター予測パラメータ復号部３０３を備えることを特徴とする。 In one configuration of the image decoding device 31 configured as described above, as described with reference to FIGS. 26 and 12, the layer-dependent type is decoded, and the layer corresponding to the layer-dependent type is determined not to be motion-dependent and sample-dependent. A dependency type decoding unit comprising means for determining whether the motion dependence limited layer is indicated, an alternative reference picture decoding unit (collocated picture decoding unit 405) for decoding an alternative reference picture from the motion dependency limited layer, and the alternative reference An inter prediction parameter decoding unit 303 that derives a temporal motion vector using a picture is provided.

上記構成の画像復号装置３１の１構成では、図２５を用いて説明したように、上記代替参照ピクチャ復号部（コロケートピクチャ復号部４０５）は、前記レイヤ依存タイプに応じて、動き依存かつサンプル依存ではないことを示す依存レイヤである実動き依存レイヤ数NumActiveMotionPredRefLayersを導出し、前記実動き依存レイヤ数NumActiveMotionPredRefLayersが１以上である場合に、スライスセグメントレイヤの符号化データから代替参照ピクチャを用いるかどうかを示すフラグalt_collocated_indication_flagを復号することを特徴とする。 In one configuration of the image decoding device 31 having the above configuration, as described with reference to FIG. 25, the alternative reference picture decoding unit (collocated picture decoding unit 405) is motion-dependent and sample-dependent depending on the layer-dependent type. Deriving the actual motion dependent layer number NumActiveMotionPredRefLayers, which is a dependent layer indicating that, if the actual motion dependent layer number NumActiveMotionPredRefLayers is 1 or more, whether to use an alternative reference picture from the encoded data of the slice segment layer It is characterized by decoding the indicated flag alt_collocated_indication_flag.

（第２の実施形態）
次に、本発明に関わる画像復号装置の第２の実施形態について説明する。図６は、本実施形態に関わる画像復号装置３１ａの構成を示す概略図である。画像復号装置３１は、画像データ復号部３００、ヘッダ復号部４００ａを含んで構成される。 (Second Embodiment)
Next, a second embodiment of the image decoding apparatus according to the present invention will be described. FIG. 6 is a schematic diagram illustrating the configuration of the image decoding device 31a according to the present embodiment. The image decoding device 31 includes an image data decoding unit 300 and a header decoding unit 400a.

画像データ復号部３００は、前述の第一の実施形態と同様であるので説明を省略する。同様に、既に説明したものと同一の符号を付した機能ブロックについては、本実施形態でも同様の働きをするため、以降説明を省略する。 Since the image data decoding unit 300 is the same as that of the first embodiment described above, description thereof is omitted. Similarly, functional blocks having the same reference numerals as those already described function in the same manner in this embodiment, and thus description thereof will be omitted.

ヘッダ復号部４００ａは、パラメータセット復号部４０１と、スライスセグメントヘッダ基本復号部４０２１、スライスセグメントヘッダ拡張復号部４０２２ａを含んで構成される。スライスセグメントヘッダ基本復号部４０２１は、パラメータセット復号部４０１で復号・決定されたフラグやパラメータに基づいて、スライスヘッダSHのうち、ベースレイヤと非ベースレイヤで共通の符号化パラメータを復号する。共通の符号化パラメータとは、例えば、対象スライスの符号化タイプを示すslice_type、テンポラル動き予測が有効かどうかを示すslice_temporal_mvp_enabled_flag等である。スライスセグメントヘッダ拡張復号部４０２２ａは、非ベースレイヤに関する符号化パラメータを復号する。非ベースレイヤに関する符号化パラメータは、例えば、レイヤ間予測が有効かどうかを示すフラグinter_layer_pred_enabled_flagや、レイヤ間の動き予測の参照情報を示すalt_collocated_indication_flag、collocated_ref_layer_idx等である。 The header decoding unit 400a includes a parameter set decoding unit 401, a slice segment header basic decoding unit 4021, and a slice segment header extended decoding unit 4022a. Based on the flags and parameters decoded and determined by the parameter set decoding unit 401, the slice segment header basic decoding unit 4021 decodes the encoding parameters common to the base layer and the non-base layer in the slice header SH. Common encoding parameters are, for example, slice_type indicating the encoding type of the target slice, slice_temporal_mvp_enabled_flag indicating whether temporal motion prediction is valid, or the like. The slice segment header extension decoding unit 4022a decodes the encoding parameters related to the non-base layer. The encoding parameters related to the non-base layer are, for example, a flag inter_layer_pred_enabled_flag indicating whether inter-layer prediction is enabled, alt_collocated_indication_flag indicating col- lation prediction information, collocated_ref_layer_idx, or the like.

なお、ヘッダ復号部に入力される符号化ストリーム中に、スライスセグメントヘッダ拡張自体が含まれない場合もある。スライスセグメントヘッダ拡張が含まれるかどうかは、パラメータセット復号部で判定する。パラメータセット復号部は、例えば、ピクチャパラメータセットPPSの中に含まれる、スライスセグメントヘッダ拡張が有効であるか否かを示すフラグslice_segment_header_extension_present_flagを復号し、フラグが有効である場合、スライスセグメントヘッダ拡張が符号化ストリーム中に含まれると判定する。その場合、スライスセグメントヘッダ拡張復号部が、スライスセグメントヘッダ拡張を復号する。 The encoded stream input to the header decoding unit may not include the slice segment header extension itself. Whether or not the slice segment header extension is included is determined by the parameter set decoding unit. For example, the parameter set decoding unit decodes a flag slice_segment_header_extension_present_flag included in the picture parameter set PPS that indicates whether the slice segment header extension is valid. If the flag is valid, the slice segment header extension is encoded. It is determined that it is included in the stream. In that case, the slice segment header extension decoding unit decodes the slice segment header extension.

図１４に、スライスセグメントヘッダ拡張のデータ構造の一例を示す。 FIG. 14 shows an example of the data structure of the slice segment header extension.

スライスセグメントヘッダ拡張復号部４０２２ａは、入力されたスライスセグメントヘッダ拡張から、レイヤ間予測の有無（inter_layer_pred_enable_flag）、レイヤ間予測参照ピクチャ数（num_inter_layer_ref_pics_minus1）、レイヤ間依存識別子（inter_layer_pred_layer_idc[i]）を復号する。復号する際の手順は、前述の実施例で説明したスライスセグメントヘッダ復号部４０２と同様である。inter_layer_pred_enable_flaを符号化データから復号しない場合、スライスセグメントヘッダ拡張復号部は、以下の式のようにレイヤ間予測を行うことを示す値に設定する。 The slice segment header extension decoding unit 4022a decodes the presence / absence of inter-layer prediction (inter_layer_pred_enable_flag), the number of inter-layer prediction reference pictures (num_inter_layer_ref_pics_minus1), and the inter-layer dependency identifier (inter_layer_pred_layer_idc [i]) from the input slice segment header extension . The decoding procedure is the same as that of the slice segment header decoding unit 402 described in the above embodiment. When inter_layer_pred_enable_fla is not decoded from the encoded data, the slice segment header extension decoding unit sets a value indicating that inter-layer prediction is performed as in the following equation.

inter_layer_pred_enable_flag＝１
また、スライスセグメントヘッダ拡張復号部は、レイヤ間予測参照ピクチャ数を示すnum_inter_layer_ref_pics_minus1を符号化データから復号しない場合、レイヤ間予測参照ピクチャ数に、VPS拡張から復号された依存レイヤ数であるNumDirectRefLayers[nuh_layer_id]を設定する。具体的には、num_inter_layer_ref_pics_minus1は、レイヤ間予測参照ピクチャ数から１だけ引いた値であるから、以下の式のように設定する。 inter_layer_pred_enable_flag = 1
Further, the slice segment header extension decoding unit, when not decoding num_inter_layer_ref_pics_minus1 indicating the number of inter-layer prediction reference pictures from the encoded data, the NumDirectRefLayers [nuh_layer_id, which is the number of dependent layers decoded from the VPS extension, to the number of inter-layer prediction reference pictures ] Is set. Specifically, since num_inter_layer_ref_pics_minus1 is a value obtained by subtracting 1 from the number of inter-layer prediction reference pictures, it is set as the following equation.

num_inter_layer_ref_pics_minus1＝NumDirectRefLayers[nuh_layer_id]-1
また、スライスセグメントヘッダ拡張復号部は、レイヤ間予測参照ピクチャのレイヤ間依存識別子を、VPS拡張から復号された依存レイヤをそのまま用いることを示すように設定する。具体的には、以下の式のように設定する。 num_inter_layer_ref_pics_minus1 = NumDirectRefLayers [nuh_layer_id] -1
Further, the slice segment header extension decoding unit sets the inter-layer dependency identifier of the inter-layer prediction reference picture so as to indicate that the dependency layer decoded from the VPS extension is used as it is. Specifically, the following equation is set.

inter_layer_pred_layer_idc[i]＝i
上記の構成によれば、スライスセグメントヘッダ拡張自体が省略されて入力されない場合には、inter_layer_pred_enable_flag、num_inter_layer_ref_pics_minus1、inter_layer_pred_layer_idcは符号化データから復号されないため、各々、上記で説明した値が設定される。 inter_layer_pred_layer_idc [i] = i
According to the above configuration, when the slice segment header extension itself is omitted and not input, inter_layer_pred_enable_flag, num_inter_layer_ref_pics_minus1, and inter_layer_pred_layer_idc are not decoded from the encoded data, and thus the values described above are set.

（コロケートピクチャ復号部４０５）
スライスセグメントヘッダ拡張復号部４０２２ａは、コロケートピクチャ復号部４０５ａを含むように構成してもよい。コロケートピクチャ復号部４０５ａは、入力されたスライスセグメントヘッダ拡張から、代替コロケート指示フラグ（alt_collocated_indication_flag）、コロケート参照レイヤ指示子（collocated_ref_layer_idx）を復号する。復号する際の手順は、前述の実施例で説明したコロケートピクチャ復号部４０５と同様である。代替コロケート指示フラグalt_collocated_indication_flagが符号化データから復号されない場合、コロケートピクチャ復号部４０５ａは、代替コロケート指示フラグに対応する情報として、テンポラル動きベクトル予測時の参照ピクチャを、代替手段であるコロケート参照レイヤ指示子では指定しないことを示すように、以下の式のように設定する。 (Collocate picture decoding unit 405)
The slice segment header extended decoding unit 4022a may be configured to include a collocated picture decoding unit 405a. The collocated picture decoding unit 405a decodes the alternative collocation indication flag (alt_collocated_indication_flag) and the collocated reference layer indicator (collocated_ref_layer_idx) from the input slice segment header extension. The decoding procedure is the same as that of the collocated picture decoding unit 405 described in the above embodiment. When the alternative collocation indication flag alt_collocated_indication_flag is not decoded from the encoded data, the collocated picture decoding unit 405a uses, as information corresponding to the alternative collocation indication flag, a reference picture at the time of temporal motion vector prediction as a collocated reference layer indicator that is an alternative means. In order to indicate that it is not specified in, set as the following formula.

alt_collocated_indication_flag=0
（コロケートピクチャ復号部４０５ａの変形例）
コロケートピクチャ復号部４０５ａの変形例では、代替コロケート指示フラグalt_collocated_indication_flagが符号化データから復号されない場合、対象レイヤに関して動き依存レイヤが存在しない（NumMotionPredRefLayers[i]==0）場合には、代替コロケート指示フラグに対応する情報として、代替手段であるコロケート参照レイヤ指示子では指定しないことを示す、alt_collocated_indication_flag=0を設定し、動き依存レイヤが存在する（NumMotionPredRefLayers[i]>0）場合には、代替コロケート指示フラグに対応する情報として、代替手段であるコロケート参照レイヤ指示子で指定することを示す、alt_collocated_indication_flag=1を設定する。 alt_collocated_indication_flag = 0
(Modification of Colocate Picture Decoding Unit 405a)
In the modified example of the collocated picture decoding unit 405a, when the alternative collocation indication flag alt_collocated_indication_flag is not decoded from the encoded data, there is no motion dependent layer for the target layer (NumMotionPredRefLayers [i] == 0), the alternative collocation indication flag If the alt_collocated_indication_flag = 0 is set and the motion dependent layer exists (NumMotionPredRefLayers [i]> 0) indicating that it is not specified by the collocated reference layer indicator that is an alternative means as the information corresponding to As information corresponding to the flag, alt_collocated_indication_flag = 1 is set, which indicates that the information is specified by a collocated reference layer indicator that is an alternative means.

（画像復号装置の変形例）
以下、画像復号装置の変形例として、画像復号装置３１ｂを説明する。図７は、画像復号装置３１ｂの構成を示す概略図である。画像復号装置３１ｂは、ヘッダ復号部４００ｂと画像データ復号部３００を備える。 (Modification of image decoding device)
Hereinafter, an image decoding device 31b will be described as a modification of the image decoding device. FIG. 7 is a schematic diagram illustrating a configuration of the image decoding device 31b. The image decoding device 31b includes a header decoding unit 400b and an image data decoding unit 300.

ヘッダ復号部４００ｂは、パラメータセット復号部４０１と、スライスセグメントヘッダ復号部４０２ｂを備える。スライスセグメントヘッダ復号部４０２ｂは、実依存レイヤ復号部４０３、参照ピクチャセット導出部４０４、コロケートピクチャ復号部４０５ｂを含むように構成される。 The header decoding unit 400b includes a parameter set decoding unit 401 and a slice segment header decoding unit 402b. The slice segment header decoding unit 402b is configured to include a real dependency layer decoding unit 403, a reference picture set deriving unit 404, and a collocated picture decoding unit 405b.

コロケートピクチャ復号部４０５ｂは、スライスセグメントヘッダ拡張から、代替コロケート指示フラグ（alt_collocated_indication_flag）、コロケート参照レイヤ指示子（collocated_ref_layer_idx）を復号する。 The collocated picture decoding unit 405b decodes the alternative collocation indication flag (alt_collocated_indication_flag) and the collocated reference layer indicator (collocated_ref_layer_idx) from the slice segment header extension.

コロケートピクチャ復号部４０５ｂは、alt_collocated_indication_flagが符号化データから復号されない場合には、参照ピクチャセット導出部４０４が導出する参照ピクチャ情報に応じて、代替コロケート指示フラグ（alt_collocated_indication_flag）を設定する。 When the alt_collocated_indication_flag is not decoded from the encoded data, the collocated picture decoding unit 405b sets an alternative collocation indication flag (alt_collocated_indication_flag) according to the reference picture information derived by the reference picture set deriving unit 404.

具体的には、コロケートピクチャ復号部４０５ｂは、参照ピクチャセット導出部４０４が導出した参照ピクチャ情報に、同じ対象レイヤのピクチャが含まれる場合は、代替参照レイヤを用いないように設定し（alt_collocated_indication_flag=0）、前記参照ピクチャ情報に、同じ対象レイヤのピクチャが含まれない場合は、代替参照レイヤを用いるように設定する（alt_collocated_indication_flag=1）。また、alt_collocated_indication_flag=1に設定した場合、コロケートピクチャ復号部４０５ｂは、コロケート参照レイヤ指示子に対応する情報として、ベースレイヤの実動き依存レイヤ識別子を設定する（collocated_ref_layer_idx= ActiveMotionPredRefLayerId[0]）。コロケートピクチャ復号部４０５ｂは、以上の情報に基づいて、テンポラル動き予測の参照ピクチャcolPicを導出する。 Specifically, when the reference picture information derived by the reference picture set deriving unit 404 includes a picture of the same target layer, the collocated picture decoding unit 405b sets the alternative reference layer not to be used (alt_collocated_indication_flag = 0) When the picture of the same target layer is not included in the reference picture information, an alternative reference layer is set to be used (alt_collocated_indication_flag = 1). When alt_collocated_indication_flag = 1 is set, the collocated picture decoding unit 405b sets the actual motion dependent layer identifier of the base layer as information corresponding to the collocated reference layer indicator (collocated_ref_layer_idx = ActiveMotionPredRefLayerId [0]). The collocated picture decoding unit 405b derives a temporal motion prediction reference picture colPic based on the above information.

上記のように、非ベースレイヤ用のレイヤ間予測に関する初期パラメータを規定することにより、レイヤ間予測が有効な場合でも、スライスセグメントヘッダ拡張を省略して伝送符号量を削減し、符号化効率を向上させることができる。また、スライスレイヤの情報に関して、スライスセグメントヘッダを基本部分と拡張部分に分離することにより、単一レイヤのみの規格に従う画像復号装置やソフトウェアへ与える影響を抑え、単一レイヤ対応の画像復号装置やソフトウェアをベースにした複数レイヤ対応（複数視点画像など）を容易にするという効果が得られる。 As described above, by defining the initial parameters related to inter-layer prediction for non-base layers, even when inter-layer prediction is enabled, the slice segment header extension is omitted to reduce the transmission code amount and to improve the coding efficiency. Can be improved. In addition, regarding slice layer information, by separating the slice segment header into a basic part and an extended part, the influence on the image decoding apparatus and software conforming to the standard of only a single layer is suppressed, and the image decoding apparatus compatible with a single layer, The effect of facilitating the correspondence to a plurality of layers (such as a plurality of viewpoint images) based on software can be obtained.

（画像符号化装置の構成）
次に、本発明の一実施形態に係る画像符号化装置１１の構成について説明する。図９は、本実施形態に係る画像符号化装置１１の構成を示す概略図である。画像符号化装置１１は、画像データ符号化部１０００とヘッダ符号化部１４００を含んで構成される。 (Configuration of image encoding device)
Next, the configuration of the image encoding device 11 according to an embodiment of the present invention will be described. FIG. 9 is a schematic diagram illustrating a configuration of the image encoding device 11 according to the present embodiment. The image encoding device 11 includes an image data encoding unit 1000 and a header encoding unit 1400.

画像データ符号化部１０００は、装置外部から入力される複数のレイヤ画像Tを符号化して符号化画像データVDを生成する。また、符号化処理の過程で決定された符号化モードなどのパラメータをヘッダ符号化部１４００へ出力する。 The image data encoding unit 1000 encodes a plurality of layer images T input from the outside of the apparatus to generate encoded image data VD. Also, parameters such as the encoding mode determined in the course of the encoding process are output to the header encoding unit 1400.

ヘッダ符号化部１４００は、パラメータセット符号化部１４０１と、スライスセグメントヘッダ符号化部１４０２を含んで構成される。パラメータセット符号化部１４０１は、複数のレイヤ画像Tおよび、画像データ符号化部１０００から入力されるパラメータに基づいて、スライスレイヤよりも上位レイヤのパラメータセットを生成する。上位レイヤのパラメータは、例えば、ビデオパラメータセットVPS、シーケンスパラメータセットSPS、ピクチャパラメータセットPPSである。スライスセグメントヘッダ符号化部１４０２は、同様に、スライスレイヤのパラメータを生成する。スライスレイヤのパラメータは、例えば、スライスセグメントヘッダ、あるいは、スライスセグメントヘッダおよびスライスセグメントヘッダ拡張である。ヘッダ符号化部１４００は、これら生成したパラメータセット等を含むヘッダ情報HIを出力する。ヘッダ符号化部１４００の詳細については後述する。 The header encoding unit 1400 includes a parameter set encoding unit 1401 and a slice segment header encoding unit 1402. The parameter set encoding unit 1401 generates a parameter set of a higher layer than the slice layer based on the plurality of layer images T and the parameters input from the image data encoding unit 1000. Upper layer parameters are, for example, a video parameter set VPS, a sequence parameter set SPS, and a picture parameter set PPS. Similarly, the slice segment header encoding unit 1402 generates a slice layer parameter. The parameter of the slice layer is, for example, a slice segment header, or a slice segment header and a slice segment header extension. The header encoding unit 1400 outputs header information HI including the generated parameter set and the like. Details of the header encoding unit 1400 will be described later.

画像符号化装置１１は、生成されたヘッダ情報HIと符号化画像データVDを所定の規則に従って連結させた符号化ストリームTeを生成し、装置外部へ出力する。 The image encoding device 11 generates an encoded stream Te obtained by connecting the generated header information HI and the encoded image data VD according to a predetermined rule, and outputs the encoded stream Te to the outside of the device.

（画像符号化部の構成）
次に、画像データ符号化部１０００の構成について説明する。 (Configuration of image encoding unit)
Next, the configuration of the image data encoding unit 1000 will be described.

図１０は、本実施形態に係る画像データ復号部３００の構成を示す概略図である。画像データ符号化部１０００は、予測画像生成部１０１、減算部１０２、ＤＣＴ・量子化部１０３、エントロピー符号化部１０４、逆量子化・逆ＤＣＴ部１０５、加算部１０６、予測パラメータメモリ（予測パラメータ記憶部、フレームメモリ）１０８、参照ピクチャメモリ（参照画像記憶部、フレームメモリ）１０９、符号化パラメータ決定部１１０、予測パラメータ符号化部１１１、レイヤ間動きマッピング部１１２５を含んで構成される。予測パラメータ符号化部１１１は、インター予測パラメータ符号化部１１２及びイントラ予測パラメータ符号化部１１３を含んで構成される。 FIG. 10 is a schematic diagram illustrating a configuration of the image data decoding unit 300 according to the present embodiment. The image data encoding unit 1000 includes a prediction image generation unit 101, a subtraction unit 102, a DCT / quantization unit 103, an entropy encoding unit 104, an inverse quantization / inverse DCT unit 105, an addition unit 106, a prediction parameter memory (prediction parameter). A storage unit, a frame memory) 108, a reference picture memory (reference image storage unit, frame memory) 109, an encoding parameter determination unit 110, a prediction parameter encoding unit 111, and an inter-layer motion mapping unit 1125 are configured. The prediction parameter encoding unit 111 includes an inter prediction parameter encoding unit 112 and an intra prediction parameter encoding unit 113.

予測画像生成部１０１は、外部から入力されたレイヤ画像Ｔの視点毎の各ピクチャについて、そのピクチャを分割した領域であるブロック毎に予測ピクチャブロックＰを生成する。ここで、予測画像生成部１０１は、予測パラメータ符号化部１１１から入力された予測パラメータに基づいて参照ピクチャメモリ１０９から参照ピクチャブロックを読み出す。予測パラメータ符号化部１１１から入力された予測パラメータとは、例えば、動きベクトル又は変位ベクトルである。予測画像生成部１０１は、符号化対象ブロックを起点として予測された動きベクトル又は変位ベクトルが示す位置にあるブロックの参照ピクチャブロックを読み出す。予測画像生成部１０１は、読み出した参照ピクチャブロックについて複数の予測方式のうちの１つの予測方式を用いて予測ピクチャブロックＰを生成する。予測画像生成部１０１は、生成した予測ピクチャブロックＰを減算部１０２に出力する。なお、予測画像生成部１０１は、既に説明した予測画像生成部３０８と同じ動作であるため予測ピクチャブロックＰの生成の詳細は省略する。 The predicted image generation unit 101 generates a predicted picture block P for each block which is an area obtained by dividing the picture for each viewpoint of the layer image T input from the outside. Here, the predicted image generation unit 101 reads the reference picture block from the reference picture memory 109 based on the prediction parameter input from the prediction parameter encoding unit 111. The prediction parameter input from the prediction parameter encoding unit 111 is, for example, a motion vector or a displacement vector. The predicted image generation unit 101 reads the reference picture block of the block at the position indicated by the motion vector or the displacement vector predicted from the encoding target block. The prediction image generation unit 101 generates a prediction picture block P using one prediction method among a plurality of prediction methods for the read reference picture block. The predicted image generation unit 101 outputs the generated predicted picture block P to the subtraction unit 102. Note that since the predicted image generation unit 101 performs the same operation as the predicted image generation unit 308 already described, details of generation of the predicted picture block P are omitted.

減算部１０２は、予測画像生成部１０１から入力された予測ピクチャブロックＰの信号値を、外部から入力されたレイヤ画像Ｔの対応するブロックの信号値から画素毎に減算して、残差信号を生成する。減算部１０２は、生成した残差信号をＤＣＴ・量子化部１０３に出力する。 The subtraction unit 102 subtracts the signal value of the prediction picture block P input from the prediction image generation unit 101 for each pixel from the signal value of the corresponding block of the layer image T input from the outside, and generates a residual signal. Generate. The subtraction unit 102 outputs the generated residual signal to the DCT / quantization unit 103.

ＤＣＴ・量子化部１０３は、減算部１０２から入力された残差信号についてＤＣＴを行い、ＤＣＴ係数を算出する。ＤＣＴ・量子化部１０３は、算出したＤＣＴ係数を量子化して量子化係数を求める。ＤＣＴ・量子化部１０３は、求めた量子化係数をエントロピー符号化部１０４及び逆量子化・逆ＤＣＴ部１０５に出力する。 The DCT / quantization unit 103 performs DCT on the residual signal input from the subtraction unit 102 and calculates a DCT coefficient. The DCT / quantization unit 103 quantizes the calculated DCT coefficient to obtain a quantization coefficient. The DCT / quantization unit 103 outputs the obtained quantization coefficient to the entropy encoding unit 104 and the inverse quantization / inverse DCT unit 105.

エントロピー符号化部１０４には、ＤＣＴ・量子化部１０３から量子化係数が入力され、予測パラメータ符号化部１１１から符号化パラメータが入力される。入力される符号化パラメータには、例えば、参照ピクチャインデックスrefIdxLX、ベクトルインデックスmvp_LX_idx、差分ベクトルmvdLX、予測モードpredMode、及びマージインデックスmerge_idx等の符号がある。 The entropy coding unit 104 receives a quantization coefficient from the DCT / quantization unit 103 and receives a coding parameter from the prediction parameter coding unit 111. Input encoding parameters include codes such as a reference picture index refIdxLX, a vector index mvp_LX_idx, a difference vector mvdLX, a prediction mode predMode, and a merge index merge_idx.

エントロピー符号化部１０４は、入力された量子化係数と符号化パラメータをエントロピー符号化して符号化画像データVDを生成し、出力する。 The entropy encoding unit 104 generates and outputs encoded image data VD by entropy encoding the input quantization coefficient and encoding parameter.

逆量子化・逆ＤＣＴ部１０５は、ＤＣＴ・量子化部１０３から入力された量子化係数を逆量子化してＤＣＴ係数を求める。逆量子化・逆ＤＣＴ部１０５は、求めたＤＣＴ係数について逆ＤＣＴを行い、復号残差信号を算出する。逆量子化・逆ＤＣＴ部１０５は、算出した復号残差信号を加算部１０６に出力する。 The inverse quantization / inverse DCT unit 105 inversely quantizes the quantization coefficient input from the DCT / quantization unit 103 to obtain a DCT coefficient. The inverse quantization / inverse DCT unit 105 performs inverse DCT on the obtained DCT coefficient to calculate a decoded residual signal. The inverse quantization / inverse DCT unit 105 outputs the calculated decoded residual signal to the addition unit 106.

加算部１０６は、予測画像生成部１０１から入力された予測ピクチャブロックＰの信号値と逆量子化・逆ＤＣＴ部１０５から入力された復号残差信号の信号値を画素毎に加算して、参照ピクチャブロックを生成する。加算部１０６は、生成した参照ピクチャブロックを参照ピクチャメモリ１０９に記憶する。 The addition unit 106 adds the signal value of the predicted picture block P input from the predicted image generation unit 101 and the signal value of the decoded residual signal input from the inverse quantization / inverse DCT unit 105 for each pixel, and refers to them. Generate a picture block. The adding unit 106 stores the generated reference picture block in the reference picture memory 109.

予測パラメータメモリ１０８は、予測パラメータ符号化部１１１が生成した予測パラメータを、符号化対象のピクチャ及びブロック毎に予め定めた位置に記憶する。 The prediction parameter memory 108 stores the prediction parameter generated by the prediction parameter encoding unit 111 at a predetermined position for each picture and block to be encoded.

参照ピクチャメモリ１０９は、加算部１０６が生成した参照ピクチャブロックを、符号化対象のピクチャ及びブロック毎に予め定めた位置に記憶する。 The reference picture memory 109 stores the reference picture block generated by the addition unit 106 at a predetermined position for each picture and block to be encoded.

符号化パラメータ決定部１１０は、符号化パラメータの複数のセットのうち、１つのセットを選択する。符号化パラメータとは、上述した予測パラメータやこの予測パラメータに関連して生成される符号化の対象となるパラメータである。また、決定したパラメータのうち、スライスレイヤ以上のレイヤに共通するパラメータを、ヘッダ符号化部１４００へ出力する。スライスレイヤ以上のレイヤに共通するパラメータについては後述する。 The encoding parameter determination unit 110 selects one set from among a plurality of sets of encoding parameters. The encoding parameter is a parameter to be encoded that is generated in association with the above-described prediction parameter or the prediction parameter. In addition, among the determined parameters, a parameter common to layers equal to or higher than the slice layer is output to header encoding section 1400. Parameters common to layers above the slice layer will be described later.

符号化パラメータ決定部１１０は、上記符号化パラメータの複数のセットの各々について情報量の大きさと符号化誤差を示すコスト値を算出する。コスト値は、例えば、符号量と二乗誤差に係数λを乗じた値との和である。符号量は、量子化誤差と符号化パラメータをエントロピー符号化して得られる符号化ストリームTeの情報量である。二乗誤差は、減算部１０２において算出された残差信号の残差値の二乗値についての画素間の総和である。係数λは、予め設定されたゼロよりも大きい実数である。符号化パラメータ決定部１１０は、算出したコスト値が最小となる符号化パラメータのセットを選択する。これにより、エントロピー符号化部１０４は、選択した符号化パラメータのセットを符号化ストリームTeとして外部に出力し、選択されなかった符号化パラメータのセットを出力しない。 The encoding parameter determination unit 110 calculates a cost value indicating the amount of information and the encoding error for each of the plurality of sets of the encoding parameters. The cost value is, for example, the sum of a code amount and a square error multiplied by a coefficient λ. The code amount is the information amount of the encoded stream Te obtained by entropy encoding the quantization error and the encoding parameter. The square error is the sum between pixels regarding the square value of the residual value of the residual signal calculated by the subtracting unit 102. The coefficient λ is a real number larger than a preset zero. The encoding parameter determination unit 110 selects a set of encoding parameters that minimizes the calculated cost value. As a result, the entropy encoding unit 104 outputs the selected set of encoding parameters to the outside as the encoded stream Te, and does not output the set of unselected encoding parameters.

選択される符号化パラメータのセットは、選択する予測方式に依存する。選択の対象となる予測方式は、符号化対象のピクチャがベースビューピクチャである場合には、イントラ予測、動き予測及びマージ予測である。動き予測とは、上述のインター予測のうち、表示時刻間の予測である。マージ予測とは、既に符号化されたブロックであって、符号化対象ブロックから予め定めた範囲内にあるブロックと同一の参照ピクチャブロック及び予測パラメータを用いる予測である。符号化対象のピクチャが非ベースビューピクチャである場合には、選択の対象となる予測方式は、イントラ予測、動き予測、マージ予測、及び変位予測である。変位予測（視差予測）とは、上述のインター予測のうち、別レイヤ画像（別視点画像）間の予測である。 The set of coding parameters that are selected depends on the prediction scheme that is selected. The prediction method to be selected is intra prediction, motion prediction, and merge prediction when the picture to be encoded is a base view picture. Motion prediction is prediction between display times among the above-mentioned inter predictions. The merge prediction is a prediction that uses the same reference picture block and prediction parameter as a block that has already been encoded and is within a predetermined range from the encoding target block. When the encoding target picture is a non-base view picture, the prediction methods to be selected are intra prediction, motion prediction, merge prediction, and displacement prediction. The displacement prediction (disparity prediction) is prediction between different layer images (different viewpoint images) in the above-described inter prediction.

符号化パラメータ決定部１１０は、選択した予測方式に対応する予測モードpredModeを予測パラメータ符号化部１１１に出力する。 The encoding parameter determination unit 110 outputs the prediction mode predMode corresponding to the selected prediction method to the prediction parameter encoding unit 111.

予測方式として動き予測を選択した場合、符号化パラメータ決定部１１０は、動きベクトルmvLXも併せて出力する。動きベクトルmvLXは、符号化対象ブロックの位置から予測ピクチャブロックＰを生成する際の参照ピクチャブロックの位置までのベクトルを示す。動きベクトルmvLXを示す情報には、参照ピクチャを示す情報（例えば、参照ピクチャインデックスrefIdxLX、ピクチャ順序番号POC）を含み、予測パラメータを表すものであっても良い。 When motion prediction is selected as the prediction method, the encoding parameter determination unit 110 also outputs a motion vector mvLX. The motion vector mvLX indicates a vector from the position of the encoding target block to the position of the reference picture block when the predicted picture block P is generated. The information indicating the motion vector mvLX may include information indicating a reference picture (for example, a reference picture index refIdxLX, a picture order number POC), and may represent a prediction parameter.

予測方式として変位予測を選択した場合、符号化パラメータ決定部１１０は、変位ベクトルdvLXも併せて出力する。変位ベクトルdvLXは、符号化対象ブロックの位置から予測ピクチャブロックＰを生成する際の参照ピクチャブロックの位置までのベクトルを示す。変位ベクトルdvLXを示す情報には、参照ピクチャを示す情報（例えば、参照ピクチャインデックスrefIdxLX、ビュー識別子view_id）を含み、予測パラメータを表すものであっても良い。 When displacement prediction is selected as the prediction method, the encoding parameter determination unit 110 also outputs a displacement vector dvLX. The displacement vector dvLX indicates a vector from the position of the encoding target block to the position of the reference picture block when the predicted picture block P is generated. The information indicating the displacement vector dvLX may include information indicating a reference picture (for example, a reference picture index refIdxLX, a view identifier view_id) and may represent a prediction parameter.

予測方式としてマージ予測を選択した場合、符号化パラメータ決定部１１０は、マージインデックスmerge_idxも併せて出力する。 When merge prediction is selected as the prediction method, the encoding parameter determination unit 110 also outputs a merge index merge_idx.

予測パラメータ符号化部１１１は、符号化パラメータ決定部１１０から入力されたパラメータに基づいて予測ピクチャを生成する際に用いる予測パラメータを導出し、導出した予測パラメータを符号化して符号化パラメータのセットを生成する。予測パラメータ符号化部１１１は、生成した符号化パラメータのセットをエントロピー符号化部１０４に出力する。 The prediction parameter encoding unit 111 derives a prediction parameter to be used when generating a prediction picture based on the parameter input from the encoding parameter determination unit 110, encodes the derived prediction parameter, and sets a set of encoding parameters. Generate. The prediction parameter encoding unit 111 outputs the generated set of encoding parameters to the entropy encoding unit 104.

予測パラメータ符号化部１１１は、生成した符号化パラメータのセットのうち符号化パラメータ決定部１１０が選択したものに対応する予測パラメータを予測パラメータメモリ１０８に記憶する。 The prediction parameter encoding unit 111 stores, in the prediction parameter memory 108, a prediction parameter corresponding to the one selected by the encoding parameter determination unit 110 from the generated set of encoding parameters.

予測パラメータ符号化部１１１は、符号化パラメータ決定部１１０から入力された予測モードpredModeがインター予測モードを示す場合、インター予測パラメータ符号化部１１２を動作させる。予測パラメータ符号化部１１１は、予測モードpredModeがイントラ予測モードを示す場合、イントラ予測パラメータ符号化部１１３を動作させる。 The prediction parameter encoding unit 111 operates the inter prediction parameter encoding unit 112 when the prediction mode predMode input from the encoding parameter determination unit 110 indicates the inter prediction mode. The prediction parameter encoding unit 111 operates the intra prediction parameter encoding unit 113 when the prediction mode predMode indicates the intra prediction mode.

インター予測パラメータ符号化部１１２は、符号化パラメータ決定部１１０から入力された予測パラメータに基づいてインター予測パラメータを導出する。インター予測パラメータ符号化部１１２は、インター予測パラメータを導出する構成として、インター予測パラメータ復号部３０３がインター予測パラメータを導出する構成と同一の構成を含む。 The inter prediction parameter encoding unit 112 derives an inter prediction parameter based on the prediction parameter input from the encoding parameter determination unit 110. The inter prediction parameter encoding unit 112 includes the same configuration as the configuration in which the inter prediction parameter decoding unit 303 derives the inter prediction parameter as a configuration for deriving the inter prediction parameter.

イントラ予測パラメータ符号化部１１３は、符号化パラメータ決定部１１０から入力された予測モードpredModeが示すイントラ予測モードIntraPredModeをイントラ予測パラメータのセットとして定める。 The intra prediction parameter encoding unit 113 determines the intra prediction mode IntraPredMode indicated by the prediction mode predMode input from the encoding parameter determination unit 110 as a set of intra prediction parameters.

ここで、符号化パラメータ決定部１１０におけるレイヤ間予測に関するパラメータ決定と、決定されたパラメータに基づいた、ヘッダ符号化部１４００におけるヘッダ情報の符号化処理の詳細について説明する。 Here, details of parameter determination relating to inter-layer prediction in the encoding parameter determination unit 110 and header information encoding processing in the header encoding unit 1400 based on the determined parameters will be described.

符号化パラメータ決定部１１０は、各レイヤ（複数視点画像など）の符号化処理過程で選択した予測モードや参照ピクチャを示す情報を記憶しておく。その中で、レイヤ間の依存性に関する情報、すなわちサンプル予測やテンポラル動き予測に関するレイヤ間の依存関係が、全レイヤに関して決定されたら、全レイヤに関するレイヤ間依存情報をヘッダ符号化部１４００へ出力する。レイヤ間依存情報は、レイヤiがレイヤjを参照するかどうかを示す情報と、各参照がサンプル予測によるものか動き予測によるものかを示す情報を含む。ヘッダ符号化部１４００は、符号化パラメータ決定部１１０からのレイヤ間依存情報をパラメータセット符号化部１４０１へ入力する。 The encoding parameter determination unit 110 stores information indicating the prediction mode and the reference picture selected in the encoding process of each layer (multi-viewpoint image or the like). If the inter-layer dependency information, that is, the inter-layer dependency relationship regarding sample prediction or temporal motion prediction is determined for all layers, the inter-layer dependency information regarding all layers is output to the header encoding unit 1400. . The inter-layer dependency information includes information indicating whether the layer i refers to the layer j and information indicating whether each reference is based on sample prediction or motion prediction. The header encoding unit 1400 inputs the inter-layer dependency information from the encoding parameter determination unit 110 to the parameter set encoding unit 1401.

パラメータセット符号化部１４０１は、入力されたレイヤ間依存情報に基づいて、レイヤ依存フラグdirect_dependency_flag[i][j]およびレイヤ依存タイプdirect_dependency_type[i][j]を符号化し、これらを含むデータ構造であるVPS拡張（図１１）を生成するとともに、符号化した情報をスライスセグメントヘッダ符号化部１４０２へ出力する。 The parameter set encoding unit 1401 encodes the layer dependency flag direct_dependency_flag [i] [j] and the layer dependency type direct_dependency_type [i] [j] based on the input inter-layer dependency information, and has a data structure including them. A certain VPS extension (FIG. 11) is generated and the encoded information is output to the slice segment header encoding unit 1402.

また、符号化パラメータ決定部１１０は、スライスセグメント単位の符号化処理過程で選択した予測モードや参照ピクチャ情報を含む、スライスレイヤ符号化パラメータを、ヘッダ符号化部１４００へ出力する。ヘッダ符号化部１４００は、符号化パラメータ決定部１１０からのスライスレイヤ符号化パラメータをスライスセグメントヘッダ符号化部１４０２へ入力する。 Also, the coding parameter determination unit 110 outputs slice layer coding parameters including the prediction mode and reference picture information selected in the coding process in units of slice segments to the header coding unit 1400. The header encoding unit 1400 inputs the slice layer encoding parameter from the encoding parameter determining unit 110 to the slice segment header encoding unit 1402.

スライスセグメントヘッダ符号化部１４０２は、入力されたスライスレイヤ符号化パラメータと、パラメータセット符号化部１４０１から入力されたレイヤ間依存に関する符号化パラメータに基づいて、レイヤ間予測の有無を示すレイヤ間依存フラグinter_layer_pred_enabled_flagを符号化する。レイヤ間予測が有効である場合には、対象レイヤに関する依存レイヤ数（NumDirectRefLayers[]）と実依存レイヤピクチャ数（NumActiveRefLayerPics）に基づいて、レイヤ間依存ピクチャ数（num_inter_layer_ref_pics_minus1）およびレイヤ間依存識別子（inter_layer_pred_layer_idc[]）を符号化し、これらを含むスライスセグメントヘッダ（図１２）を生成する。具体的にはnum_inter_layer_ref_pics_minus1 には“NumActiveRefLayerPics-1”の値を設定し、inter_layer_pred_layer_idc[i]には、対象レイヤiの参照レイヤjを示すレイヤ識別子を設定する。その際、実参照レイヤピクチャが存在しない場合（NumActiveRefLayerPics==0）は、レイヤ識別子inter_layer_pred_layer_idc[]は符号化せず、参照レイヤ数が１つのみの場合は、レイヤ間依存ピクチャ数num_inter_layer_ref_pics_minus1も符号化しない。また、これらの符号を符号化する際のビット長は、num_inter_layer_ref_pics_minus1に関してはNumDirectRefLayers[]を表現できる最小のビット数になるように、inter_layer_pred_layer_idc[]に関しては、NumActiveRefLayerPicsを表現できる最小のビット数になるように、それぞれ符号化する。 The slice segment header encoding unit 1402 is based on the input slice layer encoding parameter and the inter-layer dependency indicating whether or not there is inter-layer prediction based on the encoding parameter related to inter-layer dependency input from the parameter set encoding unit 1401. The flag inter_layer_pred_enabled_flag is encoded. When inter-layer prediction is enabled, the number of inter-layer dependent pictures (num_inter_layer_ref_pics_minus1) and inter-layer dependent identifiers (inter_layer_pred_layer_idc) based on the number of dependent layers (NumDirectRefLayers []) and the actual number of dependent layer pictures (NumActiveRefLayerPics) for the target layer []) Are encoded, and a slice segment header (FIG. 12) including them is generated. Specifically, the value of “NumActiveRefLayerPics-1” is set in num_inter_layer_ref_pics_minus1, and the layer identifier indicating the reference layer j of the target layer i is set in inter_layer_pred_layer_idc [i]. At that time, if there is no actual reference layer picture (NumActiveRefLayerPics == 0), the layer identifier inter_layer_pred_layer_idc [] is not encoded. If there is only one reference layer, the number of inter-layer dependent pictures num_inter_layer_ref_pics_minus1 is also encoded. do not do. In addition, the bit length when encoding these codes is the minimum number of bits that can represent NumDirectRefLayers [] for num_inter_layer_ref_pics_minus1, and the minimum number of bits that can represent NumActiveRefLayerPics for inter_layer_pred_layer_idc []. Are encoded respectively.

なお、スライスセグメントヘッダ符号化部１４０２は、レイヤ間予測の参照ピクチャ数（num_inter_layer_ref_pics_minus1）とレイヤ間依存識別子（inter_layer_pred_layer_idc[]）を符号化する際に、依存レイヤ数（NumDirectRefLayers[]）の代わりに、サンプル依存レイヤ数（NumSamplePredRefLayers）に基づいて、これら符号の有無、符号長および値の範囲を定めて符号化してもよい。 Note that the slice segment header encoding unit 1402 encodes the inter-layer prediction reference picture number (num_inter_layer_ref_pics_minus1) and the inter-layer dependency identifier (inter_layer_pred_layer_idc []) instead of the dependent layer number (NumDirectRefLayers []). Based on the number of sample-dependent layers (NumSamplePredRefLayers), the presence or absence of these codes, the code length, and the value range may be determined and encoded.

また、画像符号化装置１１の別の構成として、スライスセグメントヘッダ符号化部１４０２は、スライスセグメントヘッダを、ベースレイヤと非ベースレイヤで共通の部分と、図１４で示したような非ベースレイヤに関するスライスセグメントヘッダ拡張の部分に分離して符号化する構成であってもよい。この構成においては、スライスセグメントヘッダ符号化部１４０２は、非ベースレイヤの対象スライスセグメントに関して、レイヤ間予測が有効であって（inter_layer_pred_enabled_flag=1）、レイヤ間依存ピクチャ数が依存レイヤ数と等しく（num_inter_layer_ref_pics_minus1 = NumDirectRefLayers[nuh_layer_id]?1）、対象レイヤiの参照レイヤを示す識別子がVPS拡張で指定した値と等しい場合（inter_layer_pred_layer_idc[i]=1）には、スライスセグメントヘッダ拡張を符号化しない。 Further, as another configuration of the image encoding device 11, the slice segment header encoding unit 1402 relates the slice segment header to the common part between the base layer and the non-base layer, and the non-base layer as illustrated in FIG. A configuration may be used in which the slice segment header extension is encoded separately. In this configuration, the slice segment header encoding unit 1402 has inter-layer prediction enabled for the target slice segment of the non-base layer (inter_layer_pred_enabled_flag = 1), and the number of inter-layer dependent pictures is equal to the number of dependent layers (num_inter_layer_ref_pics_minus1 = NumDirectRefLayers [nuh_layer_id]? 1) When the identifier indicating the reference layer of the target layer i is equal to the value specified by the VPS extension (inter_layer_pred_layer_idc [i] = 1), the slice segment header extension is not encoded.

以上のような構成を備えることによって、本実施形態に係る画像符号化装置は、複数レイヤの画像データを階層符号化する際に、非ベースレイヤ用のレイヤ間予測に関する複数のパラメータ群の値が所定の組合せである場合に、拡張ヘッダ情報の符号化を省略するため、階層符号化のために必要になるパラメータの符号量を削減し、符号化効率を向上させることができる。また、スライスレイヤの情報に関して、スライスセグメントヘッダを基本部分と拡張部分に分離することにより、単一レイヤのみの規格に従う画像符号化装置やソフトウェアへ与える影響を抑え、単一レイヤ対応の画像符号化装置やソフトウェアをベースにした複数レイヤ対応（複数視点画像など）を容易にするという効果が得られる。 With the configuration as described above, the image encoding apparatus according to the present embodiment has a plurality of parameter group values related to inter-layer prediction for non-base layers when hierarchically encoding image data of a plurality of layers. When the predetermined combination is used, encoding of the extension header information is omitted, so that the amount of parameters required for hierarchical encoding can be reduced and the encoding efficiency can be improved. In addition, regarding slice layer information, by separating the slice segment header into a basic part and an extended part, the influence on the image coding apparatus and software conforming to the standard of only a single layer is suppressed, and image coding corresponding to a single layer is performed. The effect of facilitating multi-layer correspondence (multi-viewpoint image, etc.) based on the device or software can be obtained.

なお、上述した実施形態における画像符号化装置１１、画像復号装置３１の一部、例えば、ヘッダ復号部４００、パラメータセット復号部４０１、スライスセグメントヘッダ復号部４０２、実依存レイヤ復号部４０３、参照ピクチャセット導出部４０４、コロケートピクチャ復号部４０５、スライスセグメントヘッダ基本復号部４０２１、スライスセグメントヘッダ拡張復号部４０２２、ヘッダ符号化部１４００、パラメータセット符号化部１４０１、スライスセグメントヘッダ符号化部１４０２、エントロピー復号部３０１、予測パラメータ復号部３０２、予測画像生成部１０１、ＤＣＴ・量子化部１０３、エントロピー符号化部１０４、逆量子化・逆ＤＣＴ部１０５、符号化パラメータ決定部１１０、予測パラメータ符号化部１１１、エントロピー復号部３０１、予測パラメータ復号部３０２、予測画像生成部３０８、逆量子化・逆ＤＣＴ部３１１をコンピュータで実現するようにしても良い。その場合、この制御機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することによって実現しても良い。なお、ここでいう「コンピュータシステム」とは、画像符号化装置１１や画像復号装置３１に内蔵されたコンピュータシステムであって、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んで良い。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 In addition, a part of the image encoding device 11 and the image decoding device 31 in the above-described embodiment, for example, the header decoding unit 400, the parameter set decoding unit 401, the slice segment header decoding unit 402, the actual dependence layer decoding unit 403, the reference picture Set derivation unit 404, collocated picture decoding unit 405, slice segment header basic decoding unit 4021, slice segment header extended decoding unit 4022, header encoding unit 1400, parameter set encoding unit 1401, slice segment header encoding unit 1402, entropy decoding Unit 301, prediction parameter decoding unit 302, prediction image generation unit 101, DCT / quantization unit 103, entropy encoding unit 104, inverse quantization / inverse DCT unit 105, encoding parameter determination unit 110, prediction parameter encoding unit 111 , D Toropi decoding unit 301, the prediction parameter decoding section 302, predicted image generation unit 308 may be the inverse quantization and inverse DCT unit 311 to be realized by a computer. In that case, the program for realizing the control function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read by a computer system and executed. Here, the “computer system” is a computer system built in the image encoding device 11 or the image decoding device 31 and includes an OS and hardware such as peripheral devices. The “computer-readable recording medium” refers to a storage device such as a portable medium such as a flexible disk, a magneto-optical disk, a CD-ROM, a DVD-ROM, or a hard disk built in a computer system. Furthermore, the “computer-readable recording medium” is a medium that dynamically holds a program for a short time, such as a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line, In this case, it may include a volatile memory inside a computer system serving as a server or a client in such a case that holds a program for a certain period of time. The program may be a program for realizing a part of the functions described above, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system.

また、上述した実施形態における画像符号化装置１１、画像復号装置３１の一部、または全部を、ＬＳＩ（Large Scale Integration）等の集積回路として実現しても良い。画像符号化装置１１、画像復号装置３１の各機能ブロックは個別にプロセッサ化しても良いし、一部、または全部を集積してプロセッサ化しても良い。また、集積回路化の手法はＬＳＩに限らず専用回路、または汎用プロセッサで実現しても良い。また、半導体技術の進歩によりＬＳＩに代替する集積回路化の技術が出現した場合、当該技術による集積回路を用いても良い。 Moreover, you may implement | achieve part or all of the image coding apparatus 11 in the embodiment mentioned above and the image decoding apparatus 31 as integrated circuits, such as LSI (Large Scale Integration). Each functional block of the image encoding device 11 and the image decoding device 31 may be individually made into a processor, or a part or all of them may be integrated into a processor. Further, the method of circuit integration is not limited to LSI, and may be realized by a dedicated circuit or a general-purpose processor. Further, in the case where an integrated circuit technology that replaces LSI appears due to progress in semiconductor technology, an integrated circuit based on the technology may be used.

以上、図面を参照してこの発明の複数の実施形態について詳しく説明してきたが、具体的な構成は上述のものに限られることはなく、この発明の要旨を逸脱しない範囲内において様々な設計変更等をすることが可能である。 As described above, the embodiments of the present invention have been described in detail with reference to the drawings. However, the specific configuration is not limited to that described above, and various design changes can be made without departing from the scope of the present invention. Etc. are possible.

本発明は、画像データが符号化された符号化データを復号する画像復号装置、および、画像データが符号化された符号化データを生成する画像符号化装置に好適に適用することができる。また、画像符号化装置によって生成され、画像復号装置によって参照される符号化データのデータ構造に好適に適用することができる。 The present invention can be suitably applied to an image decoding apparatus that decodes encoded data obtained by encoding image data and an image encoding apparatus that generates encoded data obtained by encoding image data. Further, the present invention can be suitably applied to the data structure of encoded data generated by an image encoding device and referenced by the image decoding device.

１…画像伝送システム
１１…画像符号化装置
３００…画像データ復号部
４００…ヘッダ復号部
４０１…パラメータセット復号部
４０２…スライスセグメントヘッダ復号部
４０２１…スライスセグメントヘッダ基本復号部
４０２２…スライスセグメントヘッダ拡張復号部
４０３…実依存レイヤ復号部
４０４…参照ピクチャセット導出部
４０５…コロケートピクチャ復号部
１０１…予測画像生成部
１０２…減算部
１０３…ＤＣＴ・量子化部
１０４…エントロピー符号化部
１０５…逆量子化・逆ＤＣＴ部
１０６…加算部
１０８…予測パラメータメモリ（フレームメモリ）
１０９…参照ピクチャメモリ（フレームメモリ）
１１０…符号化パラメータ決定部
１１１…予測パラメータ符号化部
１１２…インター予測パラメータ符号化部
１１３…イントラ予測パラメータ符号化部
１０００…画像データ符号化部
１４００…ヘッダ符号化部
１４０１…パラメータセット符号化部
１４０２…スライスセグメントヘッダ符号化部
２１…ネットワーク
３１…画像復号装置
３０１…エントロピー復号部
３０２…予測パラメータ復号部
３０３…インター予測パラメータ復号部
３０４…イントラ予測パラメータ復号部
３０６…参照ピクチャメモリ（フレームメモリ）
３０７…予測パラメータメモリ（フレームメモリ）
３０８…予測画像生成部
３０９…インター予測画像生成部
３１０…イントラ予測画像生成部
３１１…逆量子化・逆ＤＣＴ部
３１２…加算部
４１…画像表示装置 DESCRIPTION OF SYMBOLS 1 ... Image transmission system 11 ... Image coding apparatus 300 ... Image data decoding part 400 ... Header decoding part 401 ... Parameter set decoding part 402 ... Slice segment header decoding part 4021 ... Slice segment header basic decoding part 4022 ... Slice segment header extended decoding Unit 403 ... real dependency layer decoding unit 404 ... reference picture set deriving unit 405 ... collocated picture decoding unit 101 ... predicted image generation unit 102 ... subtraction unit 103 ... DCT / quantization unit 104 ... entropy encoding unit 105 ... inverse quantization / Inverse DCT unit 106 ... addition unit 108 ... prediction parameter memory (frame memory)
109 ... Reference picture memory (frame memory)
DESCRIPTION OF SYMBOLS 110 ... Coding parameter determination part 111 ... Prediction parameter encoding part 112 ... Inter prediction parameter encoding part 113 ... Intra prediction parameter encoding part 1000 ... Image data encoding part 1400 ... Header encoding part 1401 ... Parameter set encoding part 1402 ... Slice segment header encoding unit 21 ... Network 31 ... Image decoding device 301 ... Entropy decoding unit 302 ... Prediction parameter decoding unit 303 ... Inter prediction parameter decoding unit 304 ... Intra prediction parameter decoding unit 306 ... Reference picture memory (frame memory)
307 ... Prediction parameter memory (frame memory)
308 ... Prediction image generation unit 309 ... Inter prediction image generation unit 310 ... Intra prediction image generation unit 311 ... Inverse quantization / inverse DCT unit 312 ... Addition unit 41 ... Image display device

Claims

An image decoding device that decodes hierarchically encoded data,
Dependency type decoding means comprising means for decoding a layer dependency type and determining whether a layer corresponding to the layer dependency type is a layer that exhibits sample dependency;
A dependency layer deriving unit for extracting a sample dependency layer from the dependency layer indicating the sample dependency, a reference picture set decoding unit for deriving a reference picture set using the sample dependency layer,
An image decoding apparatus comprising: an inter prediction image generation unit that generates a prediction image using a reference picture specified by a reference picture list derived using the inter-layer reference picture set.

The dependence type decoding means decodes the layer dependence type from the encoded data of the parameter set, derives SamplePredEnableFlag, which is a flag indicating whether the dependence layer is a sample dependence layer, according to the layer dependence type,
The dependency layer deriving unit decodes the inter-layer dependency identifier (inter_layer_pred_layer_idc) of the inter-layer prediction reference picture from the encoded data of the slice segment layer, and derives the sample dependency layer ActiveSamplePredRefLayerId from the dependency layer whose SamplePredEnableFlag is 1 A featured image decoding apparatus.

The image decoding apparatus further includes inter-layer prediction limited flag decoding means for decoding an inter-layer prediction limited flag inter_layer_sample_only flag,
The dependency layer derivation unit derives the number NumActiveSamplePredRefLayers of the sample dependency layers according to the layer dependency type,
When the inter-layer prediction only flag inter_layer_sample_only flag indicates only inter-layer prediction, the reference picture set decoding unit does not include pictures in the same layer in the reference picture list,
The inter-layer prediction limited flag decoding means decodes an inter-layer prediction limited flag inter_layer_sample_only flag when the number of motion-dependent layers NumActiveSamplePredRefLayers is 1 or more.

An image decoding device that decodes hierarchically encoded data,
Dependency type decoding means comprising means for decoding a layer dependent type and determining whether the layer corresponding to the layer dependent type is a motion dependent limited layer indicating that it is motion dependent and not sample dependent;
An alternative reference picture decoding unit for decoding an alternative reference picture from the motion-dependent limited layer;
An image decoding apparatus comprising: an inter prediction parameter decoding unit that derives a temporal motion vector using the alternative reference picture.

The alternative reference picture decoding unit derives the number of motion-dependent limited layers NumActiveMotionPredRefLayers, which is a dependency layer indicating that it is motion-dependent and not sample-dependent, according to the layer-dependent type, and the number of motion-dependent dependent layers NumActiveMotionPredRefLayers is 1. 4. The image decoding apparatus according to claim 3, wherein a flag alt_collocated_indication_flag indicating whether or not to use an alternative reference picture is decoded from encoded data of a slice segment layer when the above is true.

The alternative reference picture decoding unit derives a motion dependent limited layer ActivnMotionPredRefLayerId, which is a dependent layer indicating that it is motion-dependent and not sample-dependent, according to the layer-dependent type,
The image decoding device according to claim 3, wherein an identifier collocated_ref_layer_idx that identifies an alternative reference picture is decoded from the motion-dependent limited layer with reference to MotionPredEnableFlag that is a flag indicating the motion-dependent limited layer.

An image encoding apparatus that hierarchically encodes multiple layers of image data,
As a parameter relating to the coding method of the non-base layer picture, it includes header coding means for coding at least actual dependent layer information related to sample prediction between layers and actual dependent layer information related to motion prediction between layers,
Based on the number of sample-dependent layers (NumSamplePredRefLayers) for the target layer, the header encoding means determines the presence / absence of codes of the inter-layer prediction reference pictures (num_inter_layer_ref_pics_minus1) and inter-layer dependency identifiers (inter_layer_pred_layer_idc), code lengths, and value ranges. And encoding the inter-layer prediction reference picture number and the inter-layer dependency identifier, as real dependent layer information related to inter-layer sample prediction, for encoding a non-base layer picture. An image encoding device.

An image decoding device that decodes hierarchically encoded data,
Parameter set decoding means for decoding at least the number of dependent layers (NumDirectRefLayers [nuh_layer_id]) related to the layer nuh_layer_id as a parameter related to the encoding method of the non-base layer picture;
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extended decoding means decodes each parameter indicating presence / absence of inter-layer prediction of a non-base layer picture (inter_layer_pred_enable_flag), number of inter-layer prediction reference pictures (num_inter_layer_ref_pics_minus1), and inter-layer dependency identifier (inter_layer_pred_layer_idc [i]) If the encoded data does not include the above syntax, inter_layer_pred_enable_flag is set to 1 indicating that inter-layer prediction is performed, num_inter_layer_ref_pics_minus1 is set to NumDirectRefLayers [nuh_layer_id]-1, and inter_layer_pred_layer_idc [i] is set as a parameter set. An image decoding device, wherein i is set to indicate that the decoded dependency layer is used as it is.

An image decoding device that decodes hierarchically encoded data,
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means includes:
Alternative reference picture decoding means for decoding alternative reference picture information (alt_collocated_indication_flag, collocated_ref_layer_idx) indicating a reference picture at the time of temporal motion vector prediction in a non-base layer picture;
A motion vector deriving unit for deriving a temporal motion vector using a picture specified by the decoded alternative reference picture information;
The alternative reference picture decoding means decodes the alternative reference picture information from the encoded data when the encoded data includes a slice segment header extension, and the encoded data does not include the slice segment header extension An image decoding apparatus characterized by decoding alt_collocated_indication_flag = 0 indicating that an alternative reference layer is not used as alternative reference picture information.

An image decoding device that decodes hierarchically encoded data,
Parameter set decoding means for decoding at least the number of motion-dependent layers (NumMotionPredRefLayers [i]) as a parameter related to the non-base layer picture encoding method;
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means includes:
Alternative reference picture decoding means for decoding alternative reference picture information (alt_collocated_indication_flag, collocated_ref_layer_idx) indicating a reference picture at the time of temporal motion vector prediction in a non-base layer picture;
A motion vector deriving unit for deriving a temporal motion vector using a picture specified by the decoded alternative reference picture information;
The alternative reference picture decoding means decodes the alternative reference picture information from the encoded data when the encoded data includes a slice segment header extension, and the encoded data does not include the slice segment header extension When the number of motion dependent layers (NumMotionPredRefLayers [i]) derived by the parameter set decoding unit is 0, alt_collocated_indication_flag = 0 indicating that the alternative reference layer is not used is decoded as the alternative reference picture information. An image decoding device characterized by decoding alt_collocated_indication_flag = 1 indicating that an alternative reference layer is used when the number of motion-dependent layers (NumMotionPredRefLayers [i]) is other than 0.

An image decoding device that decodes hierarchically encoded data,
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means includes:
Alternative reference picture decoding means for decoding alternative reference picture information (alt_collocated_indication_flag, collocated_ref_layer_idx) indicating a reference picture at the time of temporal motion vector prediction in a non-base layer picture;
Reference picture set deriving means for decoding reference picture information (RPS, RPL);
A motion vector deriving unit for deriving a temporal motion vector using a picture specified by the decoded alternative reference picture information;
The alternative reference picture decoding means decodes the alternative reference picture information from the encoded data when the encoded data includes a slice segment header extension, and the encoded data does not include the slice segment header extension If the reference picture set derived by the reference picture set deriving means includes a picture of the same target layer, alt_collocated_indication_flag = 0 indicating that the alternative reference layer is not used is decoded as alternative reference picture information, An image decoding apparatus characterized by decoding alt_collocated_indication_flag = 1 indicating that an alternative reference layer is used when the reference picture information derived by the alternative reference picture decoding means does not include a picture of the same target layer.

An image decoding device that decodes hierarchically encoded data,
Slice segment header basic decoding means for decoding encoding parameters common to the base layer and the non-base layer;
Slice segment header extension decoding means for decoding the encoding parameters for the non-base layer,
The slice segment header extension decoding means, when the slice data header extension is included in the encoded data, as a parameter relating to the decoding method of the non-base layer picture, at least a parameter indicating the reference picture at the time of temporal motion vector prediction (alt_collocated_indication_flag, collocated_ref_layer_idx)
If the encoded data does not include the slice segment header extension, it is based on the encoding parameters included in the slice segment header and indicating the reference picture in temporal motion vector prediction of the base layer (collocated_from_0_flag, collocated_ref_idx) An image decoding apparatus for determining each parameter in a non-base layer picture.

An image encoding apparatus that hierarchically encodes multiple layers of image data,
A slice segment header basic encoding means for encoding a common encoding parameter in the base layer and the non-base layer;
Slice segment header extension encoding means for encoding the encoding parameters for the non-base layer,
The slice segment header extension encoding means omits encoding of the slice segment header extension when the values of a plurality of parameter groups related to inter-layer prediction among the encoding parameters related to the non-base layer are a predetermined combination. An image encoding device.