JP2016507910A

JP2016507910A - Video decoder using signaling

Info

Publication number: JP2016507910A
Application number: JP2015533777A
Authority: JP
Inventors: Sachin G. Deshpande
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2013-01-15
Filing date: 2014-01-14
Publication date: 2016-03-10
Anticipated expiration: 2034-01-14
Also published as: US10027978B2; EP2946558A1; CN104919803B; US9674524B2; EP2946558B1; US10230979B2; US20160345024A1; WO2014112354A1; US20190208224A1; CN104919803A; HK1214708A1; US20170208342A1; JP6209772B2; US20140198857A1; EP2946558A4

Abstract

A method of decoding a video bitstream, comprising: receiving reference picture set parameters from the video bitstream; decoding a current picture using inter prediction based on the reference picture set; and storing, in a decoded picture buffer, the decoded picture to be referenced for future inter prediction, wherein the reference picture set is decoded using at least (a) one or more reference picture identifiers, each based on a selected number of least significant bits (LSBs) of the picture order count (POC) of a reference picture, and (b) a flag specifying whether subsequent data determining the most significant bits (MSBs) of the POC of the reference picture is present, and wherein the subsequent data is present for a picture satisfying one of the following conditions: (i) the decoded picture buffer contains more than one reference picture whose picture order count modulo MaxPicOrderCntLsb equals PocLsbLt; and (ii) the picture follows, in decoding order, a picture satisfying condition (i), up to and including the first picture in decoding order that has a TemporalId value of 0 and is a trailing picture or a reference picture of the sub-stream.

Description

Cross-Reference to Related Applications
None.

The present invention relates to video encoding and/or decoding.

Digital video is typically represented as a series of images or frames, each of which contains an array of pixels. Each pixel includes information such as luminance and/or color information. In many cases, each pixel is represented as a set of three colors, each of which can be defined by an 8-bit color value.

Video coding techniques such as H.264/MPEG-4 AVC (H.264/AVC) typically provide higher coding efficiency at the expense of increased complexity. Increasing image quality requirements and increasing image resolution requirements for video coding techniques also increase coding complexity. A video decoder suited to parallel decoding can improve the speed of the decoding process and reduce memory requirements, and a video encoder suited to parallel encoding can improve the speed of the encoding process and reduce memory requirements.

H.264/MPEG-4 AVC [Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, "H.264: Advanced video coding for generic audiovisual services," ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG4-Part 10), November 2007] and, similarly, JCT-VC ["Draft Test Model Under Consideration," JCTVC-A205, JCT-VC Meeting, Dresden, April 2010 (JCT-VC)], both of which are incorporated herein by reference in their entirety, are video codec (encoder/decoder) specifications that decode pictures based on reference pictures in a video sequence for compression efficiency.

The foregoing and other objects, features, and advantages of the invention will be more readily understood upon consideration of the following detailed description of the invention, taken in conjunction with the accompanying drawings.

One embodiment of the present invention discloses a method of decoding a video bitstream, comprising: receiving reference picture set parameters from the video bitstream; decoding a current picture using inter prediction based on the reference picture set; and storing, in a decoded picture buffer, the decoded picture to be referenced for future inter prediction, wherein the reference picture set is decoded using at least (a) one or more reference picture identifiers, each based on a selected number of least significant bits (LSBs) of the picture order count (POC) of a reference picture, and (b) a flag specifying whether subsequent data determining the MSBs of the POC of the reference picture is present, and wherein the subsequent data is present for a picture satisfying one of the following conditions: (i) the decoded picture buffer contains more than one reference picture whose picture order count modulo MaxPicOrderCntLsb equals PocLsbLt; and (ii) the picture follows, in decoding order, a picture satisfying condition (i), up to and including the first picture in decoding order that has a TemporalId value of 0 and is a trailing picture or a reference picture of the sub-stream.
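Condition (i) can be illustrated with a minimal Python sketch. It is not the claimed method itself: the function name `msb_data_needed` and the representation of the decoded picture buffer as a plain list of POC values are assumptions made for illustration, and condition (ii) is not modeled.

```python
def msb_data_needed(dpb_pocs, poc_lsb_lt, max_pic_order_cnt_lsb):
    """Check condition (i) only: more than one picture in the DPB has a
    POC whose remainder modulo MaxPicOrderCntLsb equals PocLsbLt.

    dpb_pocs is a hypothetical list of the POC values held in the DPB.
    """
    matches = [poc for poc in dpb_pocs
               if poc % max_pic_order_cnt_lsb == poc_lsb_lt]
    # With two or more matches the LSBs alone are ambiguous, so extra
    # MSB information would be required to pick the intended picture.
    return len(matches) > 1
```

For example, with MaxPicOrderCntLsb equal to 16, pictures with POC 0 and POC 16 share the LSB value 0, so the LSBs alone cannot tell them apart.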

A diagram showing an H.264/AVC video encoder.
A diagram showing an H.264/AVC video decoder.
A diagram showing an exemplary slice structure.
A diagram showing another exemplary slice structure.
A diagram showing reconstruction of an entropy slice.
A diagram showing reconstruction of a part of the entropy slice of FIG. 5.
A diagram showing reconstruction of an entropy slice using an abbreviated LSB count value.
A diagram showing reconstruction of an entropy slice using a long-term picture value.
A diagram showing reconstruction of an entropy slice by selecting the first preceding frame using a long-term picture value.
A diagram showing reconstruction of an entropy slice by using duplicate long-term picture frames having the same least significant bit count value.
A diagram showing a technique for selecting a reference frame.
A diagram showing a technique for selecting a reference frame.
A diagram showing another technique for selecting a reference frame.
A diagram showing another technique for selecting a reference frame.
A diagram showing another technique for selecting a reference frame.
A diagram showing another technique for selecting a reference frame.
A diagram showing pictures for which delta_poc_msb_present_flag[i] is set to 1.

Although the embodiments described herein may accommodate any video coder/decoder (codec) that uses encoding/decoding, the exemplary embodiments are described in relation to an H.264/AVC encoder and an H.264/AVC decoder merely for purposes of illustration. Many video coding techniques are based on a block-based hybrid video coding approach, in which the source coding technique is a hybrid of inter-picture prediction (also considered inter-frame prediction), intra-picture prediction (also considered intra-frame prediction), and transform coding of a prediction residual. Inter-frame prediction can exploit temporal redundancy, while intra-frame prediction and transform coding of the prediction residual can exploit spatial redundancy.

FIG. 1 is a block diagram illustrating an exemplary encoder 104 of an electronic device 102. It should be noted that one or more of the elements shown as included in the electronic device 102 may be implemented in hardware and/or software. For example, the electronic device 102 includes an encoder 104, which may be implemented in hardware and/or software.

The electronic device 102 may include a supplier 134. The supplier 134 may provide picture or image data (e.g., video) as a source 106 to the encoder 104. Non-limiting examples of the supplier 134 include an image sensor, a memory, a communication interface, a network interface, a wireless receiver, a port, video frame content, previously encoded video content, and unencoded video content.

The source 106 may be provided to an intra-frame prediction module and reconstruction buffer 140. The source 106 may also be provided to a motion estimation and motion compensation module 166 and to a subtraction module 146.

The intra-frame prediction module and reconstruction buffer 140 may generate intra mode information 158 and an intra signal 142 based on the source 106 and reconstructed data 180. The motion estimation and motion compensation module 166 may generate inter mode information 168 and an inter signal 144 based on the source 106 and a signal 198 from a reference picture buffer 196.

The signal 198 from the reference picture buffer 196 may include data from one or more reference pictures stored in the reference picture buffer 196. The reference picture buffer 196 may also include an RPS index initialization module 108. The initialization module 108 may process reference pictures corresponding to the buffering and list construction of an RPS.

The encoder 104 may select between the intra signal 142 and the inter signal 144 in accordance with a mode. The intra signal 142 may be used to exploit spatial characteristics within a picture in an intra coding mode. The inter signal 144 may be used to exploit temporal characteristics between pictures in an inter coding mode. In the intra coding mode, the intra signal 142 may be provided to the subtraction module 146 and the intra mode information 158 may be provided to an entropy coding module 160. In the inter coding mode, the inter signal 144 may be provided to the subtraction module 146 and the inter mode information 168 may be provided to the entropy coding module 160.

Either the intra signal 142 or the inter signal 144 (depending on the mode) is subtracted from the source 106 at the subtraction module 146 to produce a prediction residual 148. The prediction residual 148 is provided to a transform module 150. The transform module 150 may compress the prediction residual 148 to produce a transformed signal 152 that is provided to a quantization module 154. The quantization module 154 quantizes the transformed signal 152 to produce transformed and quantized coefficients (TQCs) 156.
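The subtraction step can be sketched in Python as follows. This is an illustrative sketch only: the function name `prediction_residual` and the use of nested lists for blocks are assumptions, and the transform and quantization stages are deliberately left out.

```python
def prediction_residual(source_block, predicted_block):
    """Subtract the selected prediction (intra or inter) from the source,
    sample by sample, as the subtraction module is described as doing.

    Both arguments are hypothetical 2D lists of equal dimensions.
    """
    return [[s - p for s, p in zip(src_row, pred_row)]
            for src_row, pred_row in zip(source_block, predicted_block)]
```

A good prediction leaves a residual that is mostly zeros or small values, which is what makes the later transform and quantization stages effective.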

The TQCs 156 are provided to the entropy coding module 160 and to an inverse quantization module 170. The inverse quantization module 170 performs inverse quantization on the TQCs 156 to produce an inverse quantized signal 172 that is provided to an inverse transform module 174. The inverse transform module 174 decompresses the inverse quantized signal 172 to produce a decompressed signal 176 that is provided to a reconstruction module 178.

The reconstruction module 178 may produce reconstructed data 180 based on the decompressed signal 176. For example, the reconstruction module 178 may reconstruct (modified) pictures. The reconstructed data 180 may be provided to a deblocking filter 182 and to the intra-frame prediction module and reconstruction buffer 140. The deblocking filter 182 may produce a filtered signal 184 based on the reconstructed data 180.

The filtered signal 184 may be provided to a sample adaptive offset (SAO) module 186. The SAO module 186 may produce SAO information 188 that is provided to the entropy coding module 160 and an SAO signal 190 that is provided to an adaptive loop filter (ALF) 192. The ALF 192 produces an ALF signal 194 that is provided to the reference picture buffer 196. The ALF signal 194 may include data from one or more pictures that may be used as reference pictures.

The entropy coding module 160 may code the TQCs 156 to produce a bitstream 114. The entropy coding module 160 may code the TQCs 156 using context-adaptive variable length coding (CAVLC) or context-adaptive binary arithmetic coding (CABAC). In particular, the entropy coding module 160 may code the TQCs 156 based on one or more of the intra mode information 158, the inter mode information 168, and the SAO information 188. The bitstream 114 may include coded picture data. Encoders often encode a frame as a sequence of blocks, generally referred to as macroblocks.

Quantization, as involved in video compression such as HEVC, is a lossy compression technique achieved by compressing a range of values to a single value. The quantization parameter (QP) is a predefined scaling parameter used to perform the quantization based on both the quality of the reconstructed video and the compression ratio. The block type is defined in HEVC to represent the characteristics of a given block based on the block size and its color information. The QP, resolution information, and block type may be determined before entropy coding. For example, the electronic device 102 (e.g., the encoder 104) may determine the QP, resolution information, and block type, which may be provided to the entropy coding module 160.
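The idea of compressing a range of values to a single value can be sketched with a uniform scalar quantizer. This is an assumption-laden illustration rather than the HEVC quantizer: the `step` parameter stands in for whatever step size a codec derives from the QP, and the names `quantize` and `dequantize` are hypothetical.

```python
def quantize(value, step):
    """Map a coefficient to a quantization level by dividing by the step
    size and truncating toward zero (an illustrative rounding choice)."""
    return int(value / step)


def dequantize(level, step):
    """Recover an approximation of the original coefficient."""
    return level * step
```

With a step of 8, every value from 16 to 23 maps to level 2 and is reconstructed as 16, which shows both the compression and the loss.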

The entropy coding module 160 may determine a block size based on a block of TQCs 156. For example, the block size may be the number of TQCs 156 along one dimension of the block of TQCs. In other words, the number of TQCs 156 in the block of TQCs may be equal to the block size squared. For example, the block size may be determined as the square root of the number of TQCs 156 in the block of TQCs. Resolution may be defined as a pixel width by a pixel height. Resolution information may include the number of pixels for the width of a picture, for the height of a picture, or both. Block size may be defined as the number of TQCs 156 along one dimension of a 2D block of TQCs.
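The square-root relationship can be written as a short Python sketch. The function name `block_size_from_count` is hypothetical, and the sketch assumes square blocks, as the text implies.

```python
import math


def block_size_from_count(num_tqc):
    """Return the block size, i.e. the number of TQCs along one dimension
    of a square block that holds num_tqc coefficients in total."""
    size = math.isqrt(num_tqc)
    if size * size != num_tqc:
        raise ValueError("coefficient count is not a perfect square")
    return size
```

For instance, a block holding 64 coefficients has a block size of 8.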

In some configurations, the bitstream 114 may be transmitted to another electronic device. For example, the bitstream 114 may be provided to a communication interface, a network interface, a wireless transmitter, a port, or the like. For example, the bitstream 114 may be transmitted to another electronic device via a LAN, the Internet, a cellular phone base station, or the like. The bitstream 114 may additionally or alternatively be stored in memory on the electronic device 102 or on another electronic device.

FIG. 2 is a block diagram illustrating an exemplary decoder 212 on an electronic device 202. The decoder 212 may be included in the electronic device 202. For example, the decoder 212 may be an HEVC decoder. The decoder 212 and/or one or more of the elements illustrated as included in the decoder 212 may be implemented in hardware and/or software. The decoder 212 may receive a bitstream 214 (e.g., one or more encoded pictures included in the bitstream 214) for decoding. In some configurations, the received bitstream 214 may include received overhead information, such as a received slice header, a received PPS, and received buffer description information. The encoded pictures included in the bitstream 214 may include one or more encoded reference pictures and/or one or more other encoded pictures.

Received symbols (in the one or more encoded pictures included in the bitstream 214) may be entropy decoded by an entropy decoding module 268, thereby producing a motion information signal 270 and quantized, scaled and/or transformed coefficients 272.

The motion information signal 270 may be combined with a portion of a reference frame signal 298 from a frame memory 278 at a motion compensation module 274, which may produce an inter-frame prediction signal 282. The quantized, descaled and/or transformed coefficients 272 may be inverse quantized, scaled, and inverse transformed by an inverse module 262, thereby producing a decoded residual signal 284. The decoded residual signal 284 may be added to a prediction signal 292 to produce a combined signal 286. The prediction signal 292 may be a signal selected from either the inter-frame prediction signal 282 or an intra-frame prediction signal 290 produced by an intra-frame prediction module 288. In some configurations, this signal selection may be based on (e.g., controlled by) the bitstream 214.
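The selection and addition steps can be sketched in Python. This is a simplified illustration: the function name `reconstruct_block`, the boolean `use_inter` flag, and the clipping of results to the 8-bit range 0 to 255 are assumptions not stated in the text.

```python
def reconstruct_block(residual, inter_pred, intra_pred, use_inter):
    """Select the prediction signal and add the decoded residual to it.

    use_inter is a hypothetical flag standing in for the mode selection
    that the bitstream is described as controlling.
    """
    prediction = inter_pred if use_inter else intra_pred
    # Clipping to 8-bit samples is an assumption made for this sketch.
    return [[max(0, min(255, r + p)) for r, p in zip(res_row, pred_row)]
            for res_row, pred_row in zip(residual, prediction)]
```

The result corresponds to the combined signal before any deblocking filtering is applied.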

The intra-frame prediction signal 290 may be predicted from previously decoded information from the combined signal 286 (e.g., in the current frame). The combined signal 286 may further be filtered by a deblocking filter 294. The resulting filtered signal 296 may be written to the frame memory 278. The resulting filtered signal 296 may include a decoded picture.

The frame memory 278 may include a DPB as described herein. The DPB may include one or more decoded pictures that may be maintained as short-term or long-term reference frames. The frame memory 278 may also include overhead information corresponding to the decoded pictures. For example, the frame memory 278 may include slice headers, PPS information, cycle parameters, buffer description information, and the like. One or more of these pieces of information may be signaled from an encoder (e.g., the encoder 104). The frame memory 278 may provide a decoded picture 718.

An input picture comprising a plurality of macroblocks may be partitioned into one or several slices. The values of the samples in the area of the picture that a slice represents can be properly decoded without the use of data from other slices, provided that the reference pictures used at the encoder and the decoder are the same and that deblocking filtering does not use information across slice boundaries. Therefore, entropy decoding and macroblock reconstruction for a slice do not depend on other slices. In particular, the entropy coding state may be reset at the start of each slice. The data in other slices may be marked as unavailable when defining neighborhood availability for both entropy decoding and reconstruction. Slices may be entropy decoded and reconstructed in parallel. Intra prediction and motion vector prediction are preferably not allowed to cross a slice boundary. In contrast, deblocking filtering may use information across slice boundaries.

FIG. 3 illustrates an exemplary video picture 90 comprising eleven macroblocks in the horizontal direction and nine macroblocks in the vertical direction (nine exemplary macroblocks are labeled 91 to 99). FIG. 3 illustrates three exemplary slices: a first slice 89 denoted "SLICE #0," a second slice 88 denoted "SLICE #1," and a third slice 87 denoted "SLICE #2." An H.264/AVC decoder may decode and reconstruct the three slices 87, 88, 89 in parallel. Each slice may be transmitted sequentially in scan-line order. At the beginning of the decoding/reconstruction process for each slice, the entropy decoding 268 is initialized or reset, and macroblocks in other slices are marked as unavailable for both entropy decoding and macroblock reconstruction. Thus, for a macroblock in "SLICE #1," for example the macroblock labeled 93, macroblocks in "SLICE #0" (for example, the macroblocks labeled 91 and 92) may not be used for entropy decoding or reconstruction. In contrast, for a macroblock in "SLICE #1," for example the macroblock labeled 95, other macroblocks in "SLICE #1" (for example, the macroblocks labeled 93 and 94) may be used for entropy decoding or reconstruction. Therefore, entropy decoding and macroblock reconstruction proceed serially within a slice. Unless slices are defined using flexible macroblock ordering (FMO), macroblocks within a slice are processed in raster-scan order.
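The availability rule described above can be sketched as follows. The layout is hypothetical: the text does not state where the slice boundaries of FIG. 3 fall, so this sketch simply assumes three slices of three macroblock rows each in an 11 by 9 picture, with macroblocks addressed in raster-scan order. The name `neighbor_available` is also an assumption.

```python
def neighbor_available(slice_of_mb, current_mb, neighbor_mb):
    """A neighbor is usable only if it precedes the current macroblock in
    raster-scan order and belongs to the same slice."""
    if not 0 <= neighbor_mb < current_mb:
        return False
    return slice_of_mb[neighbor_mb] == slice_of_mb[current_mb]


# Hypothetical layout: 99 macroblocks (11 x 9), three rows per slice.
slice_of_mb = [0] * 33 + [1] * 33 + [2] * 33
```

Under this layout, macroblock 34 may use macroblock 33 (same slice), while macroblock 33 may not use macroblock 22 directly above it, because that macroblock lies in the previous slice.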

Flexible macroblock ordering defines a slice group to modify how a picture is partitioned into slices. The macroblocks in a slice group are defined by a macroblock-to-slice-group map, which is signaled by the content of the picture parameter set and additional information in the slice headers. The macroblock-to-slice-group map consists of a slice group identification number for each macroblock in the picture. The slice group identification number specifies to which slice group the associated macroblock belongs. Each slice group may be partitioned into one or more slices, where a slice is a sequence of macroblocks within the same slice group that is processed in raster-scan order within the set of macroblocks of a particular slice group. Entropy decoding and macroblock reconstruction proceed serially within a slice group.
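A macroblock-to-slice-group map can be sketched as a flat list with one slice group identification number per macroblock. The helper names and the rectangle-based construction below are assumptions chosen for illustration (foreground rectangles are assumed not to overlap); the text does not prescribe how the map is built.

```python
def build_slice_group_map(width, height, foreground_rects):
    """Build a map with one slice group id per macroblock in raster order.

    foreground_rects is a hypothetical list of (left, top, right, bottom)
    rectangles in macroblock units; rectangle i becomes slice group i and
    every remaining macroblock falls into one background group.
    """
    background_id = len(foreground_rects)
    mb_map = [background_id] * (width * height)
    for group_id, (left, top, right, bottom) in enumerate(foreground_rects):
        for y in range(top, bottom + 1):
            for x in range(left, right + 1):
                mb_map[y * width + x] = group_id
    return mb_map


def macroblocks_in_group(mb_map, group_id):
    """List the macroblock addresses of one slice group in raster order."""
    return [addr for addr, group in enumerate(mb_map) if group == group_id]
```

Walking the output of `macroblocks_in_group` gives the order in which the macroblocks of that slice group would be processed.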

FIG. 4 illustrates an exemplary macroblock allocation into three slice groups: a first slice group 86 denoted "SLICE GROUP #0," a second slice group 85 denoted "SLICE GROUP #1," and a third slice group 84 denoted "SLICE GROUP #2." These slice groups 84, 85, 86 may be associated with two foreground regions and one background region, respectively, in the picture 90.

A picture may be partitioned into one or more slices, where a slice is self-contained in that the values of the samples in the area of the picture that the slice represents can be correctly reconstructed without the use of data from other slices, provided that the reference pictures used at the encoder and the decoder are identical. All reconstructed macroblocks within a slice are available in the neighborhood definition for reconstruction.

A slice may be partitioned into more than one entropy slice, where an entropy slice is self-contained in that the area of the picture that the entropy slice represents can be correctly entropy decoded without the use of data from other entropy slices. The entropy decoding 268 may be reset at the start of decoding of each entropy slice. The data in other entropy slices may be marked as unavailable when defining neighborhood availability for entropy decoding.

A device configured to decode pictures obtains or otherwise receives a bitstream that includes a series of pictures, including a current picture. The device further obtains reference picture set (RPS) parameters that can be used to identify other frames usable for decoding the current picture or pictures that follow the current picture in the order in which pictures are signaled in the bitstream.

An RPS provides an identification of a set of reference pictures associated with the current frame. An RPS may identify reference pictures that precede the current picture in decoding order and can be used for inter prediction of the current picture, and/or reference pictures that follow the current picture in decoding order and can be used for inter prediction of the current picture. For example, if the system receives frames 1, 3, and 5, frame 5 uses frame 3 for reference, and the encoder uses frame 1 for the prediction of frame 7, then the RPS for frame 5 may signal that both frame 3 and frame 1 are to be kept in the frame memory 278, even though frame 1 is not used for reference by frame 5. In one embodiment, the RPS for frame 5 may be [-2 -4]. In addition, the frame memory 278 may be referred to as a decoded picture buffer, or equivalently as the DPB.
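The frame 5 example can be reproduced with a short Python sketch. It assumes that each RPS entry is an offset relative to the current frame number, which is consistent with the values given in the example; the function name `rps_to_frames` is hypothetical.

```python
def rps_to_frames(current_frame, rps_deltas):
    """Turn relative RPS entries into the frame numbers they refer to,
    assuming each entry is an offset from the current frame."""
    return [current_frame + delta for delta in rps_deltas]
```

Calling `rps_to_frames(5, [-2, -4])` yields frames 3 and 1, the two frames that the example says must be kept in the frame memory.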

An RPS describes one or more reference pictures that should be maintained, at least for a limited duration, in the decoded picture buffer (DPB) for subsequent use. This identification of the RPS may be included in the slice header of each picture, together with a picture, and/or together with a group of pictures. In addition, a list of RPSs may be sent in a picture parameter set (PPS). The slice header may then refer to one of the RPSs sent in the PPS to signal its use in that slice. For example, the RPS for a group of pictures may be signaled in a picture parameter set (PPS). Any picture in the DPB that is not part of the RPS for the current frame may be marked as "not used for reference."
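The marking rule in the last sentence can be sketched in Python. The DPB is modeled here as a plain list of picture identifiers, and the function name `mark_dpb` and the label strings are assumptions for illustration.

```python
def mark_dpb(dpb_pictures, rps_pictures):
    """Label each picture in the DPB according to whether it appears in
    the RPS of the current frame."""
    keep = set(rps_pictures)
    return {pic: ("used for reference" if pic in keep
                  else "not used for reference")
            for pic in dpb_pictures}
```

For a DPB holding pictures 1, 3, and 4 and an RPS naming pictures 3 and 1, only picture 4 ends up marked as "not used for reference."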

The DPB may be used to store pictures reconstructed (e.g., decoded) at the decoder. These stored pictures may then be used, for example, in inter prediction. A picture in the DPB may also be associated with a picture order count (POC). The POC may be a variable that is associated with each coded picture and whose value increases with increasing picture position in output order. In other words, the decoder may use the POC to deliver pictures for display in the correct order. The POC may also be used to identify reference pictures during reference picture list construction and to identify decoded reference pictures. Furthermore, the POC may be used to identify pictures lost during transmission from the encoder to the decoder.

Referring to FIG. 5, an example of a set of frames 300 provided from an encoder to a decoder is shown. Each frame may have an associated POC 310. As illustrated, the POC may increment from a negative number to a larger positive number. In some embodiments, the POC may only increment from zero to a larger positive number. The POC is typically incremented by one for each frame, but one or more POC values may be skipped or otherwise omitted. For example, the POCs for a set of frames in the encoder may be 0, 1, 2, 3, 4, 5, and so on. As another example, the POCs for the same or another set of frames in the encoder may be 0, 1, 2, 4, 5, and so on, with POC 3 skipped or otherwise omitted.

As the POC becomes sufficiently large, a large number of bits would be needed to identify each frame using the POC. The encoder may reduce the number of bits used to identify a particular POC by using a selected number of least significant bits (LSBs) of the POC, such as 4 bits, to identify each frame. Because the reference frames used to decode the current frame are often temporally close to the current frame, this identification technique is suitable, and it reduces the computational complexity of the system and the overall bit rate of the video. The number of LSBs used to identify a picture may be signaled to the decoder in the bitstream.

As illustrated, when the selected number of POC LSBs is 4, the LSB index repeats every 16 values (2^4). Thus, frame 0 has an LSB value of 0, frame 1 has an LSB value of 1, ..., frame 14 has an LSB value of 14, and frame 15 has an LSB value of 15. However, frame 16 again has an LSB value of 0, frame 17 again has an LSB value of 1, and frame 20 has an LSB value of 4. The LSB identifier (generally also referred to as the LSB of the POC, or equivalently the POC LSB) may have the characteristic LSB = POC % 16, where % is the remainder after dividing by 16 (2 raised to the number of least significant bits, here 4). Similarly, if the selected number of LSBs used to identify the POC is N bits, the LSB identifier may have the characteristic LSB = POC % (2^N), where 2^N denotes 2 raised to the power of N. Rather than including the full POC in the bitstream to identify frames, the encoder preferably provides the LSB index in the bitstream to the decoder.
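As a minimal sketch of the relation LSB = POC % (2^N), the following computes the LSB identifier for the frames named above. The function name `poc_lsb` and its default of N = 4 are assumptions made for illustration.

```python
def poc_lsb(poc: int, num_lsb_bits: int = 4) -> int:
    """Return the POC LSB identifier, i.e. POC modulo 2**num_lsb_bits."""
    return poc % (1 << num_lsb_bits)

# With 4 LSBs the identifier wraps around every 16 frames.
for poc in (0, 1, 14, 15, 16, 17, 20):
    print(poc, poc_lsb(poc))
```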

The reference frames used for inter prediction of the current frame, or of frames subsequent to the current frame, may be identified by the RPS using either relative (e.g., delta) referencing (e.g., using deltaPOC and currentPOC) or absolute referencing (e.g., using the POC). For example, the frame identified by POC 5 (310) and signaled to the decoder in the bitstream as LSB 5 (320) may have an associated RPS 330 of [-5, -2, -1]. The meaning of the RPS values is described below.

Referring to FIG. 6, which illustrates a portion of FIG. 5, the RPS of [-5, -2, -1] refers to the frame five frames previous 320, the frame two frames previous 321, and the frame one frame previous 322. As illustrated in FIG. 6, this RPS further refers to POC values of 0, 3, and 4, respectively. The RPS typically refers to the difference between the POC value of the current frame and the POC value of a previous frame. For example, an RPS of [-5, -2, -1] for a current frame having a POC value of 5 refers to frames having POC values of 5 - 5 = 0, 5 - 2 = 3, and 5 - 1 = 4. Note that, although not shown in the examples herein, the RPS may also include future frames. Such future frames would be indicated in the RPS using positive delta POC values.
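The arithmetic above can be sketched as follows. The function name `resolve_relative_rps` is hypothetical; the sketch simply adds each delta to the POC of the current frame.

```python
def resolve_relative_rps(current_poc, deltas):
    """Map relative RPS entries (delta POC values) to absolute POC values."""
    return [current_poc + delta for delta in deltas]

# RPS [-5, -2, -1] for the current frame with POC 5.
referenced = resolve_relative_rps(5, [-5, -2, -1])
print(referenced)
```

A positive delta would resolve to a future frame in the same way.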

When the POC values are not sequential, such as when one or more POC values are skipped or otherwise omitted in part of the bitstream, the difference between the POC value of the current frame and the POC value of a previous frame may differ from the number of frames output between the previous frame and the current frame, as illustrated in FIG. 7. The RPS may be signaled in the bitstream in any suitable manner, such as provided together with a frame or together with a set of frames.

Referring to FIG. 8, another technique for signaling the reference frames is to use an absolute reference, generally referred to as a long-term picture, in the RPS associated with a frame. The decoding process, such as the motion vector prediction technique, may differ depending on whether a reference frame is signaled using an absolute reference or a relative reference. An absolute reference (referred to as LT for convenience) refers to a particular LSB count value associated with a reference frame, such as a previous or subsequent frame. For example, an absolute reference of LT = 3 (LT3) refers to a reference frame having a POC LSB value of 3. Accordingly, an RPS of [LT3, -5] refers to a reference frame having a POC LSB value of 3 and a reference frame whose POC is equal to the POC of the current frame minus 5. In FIG. 8, this RPS corresponds to the reference frame 444 with POC equal to 3 and the reference frame 320 with POC equal to 0. Typically, LT3 refers to the first frame previous to the current frame having a POC LSB value equal to 3. In one embodiment, LT3 refers to the first frame previous to the current frame in output order having a POC LSB value of 3. In a second embodiment, LT3 refers to the first frame previous to the current frame in transmission order having a POC LSB value of 3. Such a system is suitable for many bitstreams, but it is not sufficiently robust to select a frame with an LSB count value of 3 other than the immediately previous frame with an LSB count value of 3.

Referring to FIG. 9, for example, if the encoder is encoding frame 31 (POC = 31) and the system signals the use of a long-term picture with POC LSB = 0 (LT0), this refers to frame 16 (POC = 16), because frame 16 is the first previous frame with LSB = 0 (A). However, the encoder may want to signal the long-term picture frame 0, which likewise has an LSB count value of 0, and this cannot be accomplished with a scheme that always refers to the first previous matching frame. One technique to overcome this limitation is to increase the number of least significant bits used to signal the long-term frame LSB, or POC LSB. Such an increase is possible, but it adds substantial additional bits to the bitstream.
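The limitation can be illustrated with a minimal sketch of the first embodiment, in which an LT reference resolves to the nearest previous frame in output order with the matching POC LSB. The function name and the assumption that frames 0 through 30 are all available are hypothetical.

```python
def nearest_preceding_with_lsb(current_poc, lsb, decoded_pocs, num_lsb_bits=4):
    """Return the highest POC below current_poc whose POC LSB equals lsb."""
    modulus = 1 << num_lsb_bits
    candidates = [p for p in decoded_pocs
                  if p < current_poc and p % modulus == lsb]
    return max(candidates) if candidates else None

# Encoding frame 31 and signaling LT0, assuming frames 0..30 are available.
resolved = nearest_preceding_with_lsb(31, 0, range(0, 31))
print(resolved)
```

Frames 0 and 16 both have a POC LSB of 0, but this rule can only ever return frame 16, so frame 0 is unreachable without further signaling.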

A more preferred technique, which adds fewer additional bits to the bitstream, is to signal a long-term picture that is different from the first previous frame with the corresponding POC LSB value. For example, the system may indicate the RPS of the current frame having an absolute reference as [LT0|2], where 0 refers to the POC LSB count value and 2 indicates which of the previous frames with a POC LSB count value equal to 0 to use, here the second previous occurrence of a POC LSB value of 0 (e.g., frame 0 in FIG. 9). If the second reference is not included, the system may default to the immediately previous frame with POC LSB = 0 [LT0] (e.g., frame 16 in FIG. 9).
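One way to read the [LT0|2] notation is sketched below: the second value selects the k-th previous occurrence of the POC LSB value, counting backward from the current frame, and defaults to 1 when omitted. The function name, the `occurrence` parameter, and the assumption that frames 0 through 30 are all available are hypothetical.

```python
def resolve_lt_occurrence(current_poc, lsb, decoded_pocs,
                          occurrence=1, num_lsb_bits=4):
    """Return the occurrence-th previous frame whose POC LSB equals lsb."""
    modulus = 1 << num_lsb_bits
    # Matching frames, ordered from nearest to farthest from the current frame.
    matches = sorted(
        (p for p in decoded_pocs if p < current_poc and p % modulus == lsb),
        reverse=True,
    )
    return matches[occurrence - 1] if 1 <= occurrence <= len(matches) else None

decoded = range(0, 31)
print(resolve_lt_occurrence(31, 0, decoded))                # [LT0]
print(resolve_lt_occurrence(31, 0, decoded, occurrence=2))  # [LT0|2]
```

With the frames of FIG. 9, the default resolves to frame 16 and the second occurrence resolves to frame 0.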

In many cases, the need to use an absolute reference to signal a frame other than the first immediately previous frame with the corresponding POC LSB value will arise relatively infrequently. To further reduce the overall bit rate for indicating which frame to use, while still permitting an absolute reference to signal a frame different from the first immediately previous frame with the corresponding POC LSB value, the system may use a duplication technique. For example, the RPS may be structured as [LT0, LT0|3]. The duplication of LT0 within the same RPS signals the decoder to use a different frame having a POC LSB value of 0, here the third previous occurrence of a POC LSB value of 0. In general, apart from the possibility that a particular POC LSB value is not included in a particular cycle of POC LSB values, the desired frame corresponds to the indicated previous occurrence of the POC LSB value.

Referring to FIG. 10, the duplication technique may be described as follows. The RPS includes a signal of a long-term picture having a POC LSB value (400) (e.g., [LT3]). The same RPS includes another signal of a long-term picture having the same POC LSB value (410) (e.g., [LT3, LT3]). The same RPS includes a further signal, associated with the second long-term picture having the same LSB count value (410), that indicates the location of the desired frame (420) (e.g., [LT3, LT3|2]).

The location of the desired frame may be signaled in any suitable manner. For example, referring to FIG. 11, the location may be a number of previous cycles of the POC LSB values relative to the current frame, such as the third previous cycle. Referring to FIG. 12, the location may be based on an absolute number of frames offset from the current frame. Referring to FIG. 13, the location may be based on a number of previous cycles of the POC LSB values relative to the first immediately previous frame having the desired POC LSB value. Referring to FIG. 14, the location may be based on an absolute number of frames offset relative to the first immediately previous frame having the desired POC LSB value.

One implementation of such a technique may use the following syntax:

first_slice_segment_in_pic_flag equal to 1 specifies that the slice segment is the first slice segment of the picture in decoding order. first_slice_segment_in_pic_flag equal to 0 specifies that the slice segment is not the first slice segment of the picture in decoding order.

no_output_of_prior_pics_flag specifies how previously decoded pictures in the decoded picture buffer are treated after decoding of an IDR or BLA picture. When the current picture is a CRA picture, or when the current picture is an IDR or BLA picture that is the first picture in the bitstream, the value of no_output_of_prior_pics_flag has no effect on the decoding process. When the current picture is an IDR or BLA picture that is not the first picture in the bitstream, and the value of pic_width_in_luma_samples, pic_height_in_luma_samples, or sps_max_dec_pic_buffering[HighestTid] derived from the active sequence parameter set differs from the value of pic_width_in_luma_samples, pic_height_in_luma_samples, or sps_max_dec_pic_buffering[HighestTid] derived from the sequence parameter set active for the preceding picture, no_output_of_prior_pics_flag equal to 1 may (but should not) be inferred by the decoder, regardless of the actual value of no_output_of_prior_pics_flag. Here, HighestTid refers to the highest temporal identifier value.

slice_pic_parameter_set_id specifies the value of pps_pic_parameter_set for the picture parameter set in use. The value of slice_pic_parameter_set_id shall be in the range of 0 to 63, inclusive.

dependent_slice_segment_flag equal to 1 specifies that the value of each slice segment header syntax element that is not present is inferred to be equal to the value of the corresponding slice segment header syntax element in the slice header. When not present, the value of dependent_slice_segment_flag is inferred to be equal to 0.

The variable SliceAddrRS is derived as follows. If dependent_slice_segment_flag is equal to 0, SliceAddrRS is set equal to slice_segment_address. Otherwise, SliceAddrRS is set equal to the SliceAddrRS of the preceding slice segment containing the coding tree block whose coding tree block address is ctbAddrTStoRS[ctbAddrRStoTS[slice_segment_address] - 1].

slice_segment_address specifies the address of the first coding tree block in the slice segment, in the coding tree block raster scan of the picture. The length of the slice_segment_address syntax element is Ceil(Log2(PicSizeInCtbsY)) bits. The value of slice_segment_address shall be in the range of 1 to PicSizeInCtbsY - 1, inclusive, and shall not be equal to the value of slice_segment_address of any other coded slice segment NAL unit of the same coded picture. When slice_segment_address is not present, it is inferred to be equal to 0.
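As an illustration of the length Ceil(Log2(PicSizeInCtbsY)), the sketch below computes the bit count with integer arithmetic. The function name and the example picture size (1920x1080 luma samples with 64x64 coding tree blocks) are assumptions made for illustration only.

```python
def slice_segment_address_bits(pic_size_in_ctbs_y: int) -> int:
    """Return Ceil(Log2(pic_size_in_ctbs_y)) using exact integer arithmetic."""
    if pic_size_in_ctbs_y < 1:
        raise ValueError("picture must contain at least one coding tree block")
    # For n >= 1, (n - 1).bit_length() equals ceil(log2(n)).
    return (pic_size_in_ctbs_y - 1).bit_length()

# Hypothetical 1920x1080 picture with 64x64 coding tree blocks:
# ceil(1920 / 64) = 30 columns and ceil(1080 / 64) = 17 rows.
pic_size_in_ctbs_y = 30 * 17
print(pic_size_in_ctbs_y, slice_segment_address_bits(pic_size_in_ctbs_y))
```

Integer arithmetic avoids floating-point rounding near exact powers of two.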

The variable CtbAddrInRS, specifying a coding tree block address in the coding tree block raster scan of the picture, is set equal to slice_segment_address. The variable CtbAddrInTS, specifying a coding tree block address in the tile scan, is set equal to CtbAddrRStoTS[CtbAddrInRS]. The variable CuQpDelta, specifying the difference between the luma quantization parameter for the coding unit containing cu_qp_delta_abs and its prediction, is set equal to 0.

slice_reserved_undetermined_flag[i] has semantics and values that are reserved for future specification by ITU-T | ISO/IEC. Decoders shall ignore the presence and value of slice_reserved_undetermined_flag[i].

slice_type specifies the coding type of the slice according to the following table.

When nal_unit_type has a value in the range of 16 to 23, inclusive (RAP picture), slice_type shall be equal to 2.

When sps_max_dec_pic_buffering[TemporalId] is equal to 0, slice_type shall be equal to 2.

pic_output_flag affects the decoded picture output and removal processes. When pic_output_flag is not present, it is inferred to be equal to 1.

colour_plane_id specifies the colour plane associated with the current slice RBSP when separate_colour_plane_flag is equal to 1. The value of colour_plane_id shall be in the range of 0 to 2, inclusive. colour_plane_id equal to 0, 1, and 2 corresponds to the Y, Cb, and Cr planes, respectively.

pic_order_cnt_lsb specifies the picture order count modulo MaxPicOrderCntLsb for the current picture. The length of the pic_order_cnt_lsb syntax element is log2_max_pic_order_cnt_lsb_minus4 + 4 bits. The value of pic_order_cnt_lsb shall be in the range of 0 to MaxPicOrderCntLsb - 1, inclusive. When pic_order_cnt_lsb is not present, it is in most cases inferred to be equal to 0.

short_term_ref_pic_set_sps_flag equal to 1 specifies that the short-term reference picture set of the current picture is created using syntax elements in the active sequence parameter set. short_term_ref_pic_set_sps_flag equal to 0 specifies that the short-term reference picture set of the current picture is created using syntax elements in the short_term_ref_pic_set() syntax structure in the slice header.

short_term_ref_pic_set_idx specifies the index, into the list of short-term reference picture sets specified in the active sequence parameter set, of the set used to create the reference picture set of the current picture. The syntax element short_term_ref_pic_set_idx is represented by Ceil(Log2(num_short_term_ref_pic_sets)) bits. When not present, the value of short_term_ref_pic_set_idx is inferred to be equal to 0. The value of short_term_ref_pic_set_idx shall be in the range of 0 to num_short_term_ref_pic_sets - 1, inclusive.

The variable StRpsIdx is derived as follows. If short_term_ref_pic_set_sps_flag is equal to 1, StRpsIdx is set equal to short_term_ref_pic_set_idx. Otherwise, StRpsIdx is set equal to num_short_term_ref_pic_sets.
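The StRpsIdx derivation can be sketched directly from the two cases above. The function name `derive_st_rps_idx` is hypothetical; the parameter names mirror the syntax elements.

```python
def derive_st_rps_idx(short_term_ref_pic_set_sps_flag,
                      short_term_ref_pic_set_idx,
                      num_short_term_ref_pic_sets):
    """Select the index of the short-term RPS used by the current picture."""
    if short_term_ref_pic_set_sps_flag == 1:
        # Use one of the sets listed in the active sequence parameter set.
        return short_term_ref_pic_set_idx
    # Otherwise the set comes from the slice header and takes the index
    # just past the last set of the sequence parameter set.
    return num_short_term_ref_pic_sets

print(derive_st_rps_idx(1, 2, 4))
print(derive_st_rps_idx(0, 0, 4))
```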

num_long_term_sps specifies the number of candidate long-term reference pictures specified in the active sequence parameter set that are included in the long-term reference picture set of the current picture. The value of num_long_term_sps shall be in the range of 0 to Min(num_long_term_ref_pics_sps, sps_max_dec_pic_buffering[sps_max_sub_layers_minus1] - NumNegativePics[StRpsIdx] - NumPositivePics[StRpsIdx]), inclusive. When not present, the value of num_long_term_sps is inferred to be equal to 0.

num_long_term_pics specifies the number of long-term reference pictures specified in the slice header that are included in the long-term reference picture set of the current picture. The value of num_long_term_pics shall be in the range of 0 to sps_max_dec_pic_buffering[sps_max_sub_layers_minus1] - NumNegativePics[StRpsIdx] - NumPositivePics[StRpsIdx] - num_long_term_sps, inclusive. When not present, the value of num_long_term_pics is inferred to be equal to 0.

lt_idx_sps[i] specifies an index, into the list of candidate long-term reference pictures specified in the active sequence parameter set, that identifies a picture included in the long-term reference picture set of the current picture. The number of bits used to represent lt_idx_sps[i] is equal to Ceil(Log2(num_long_term_ref_pics_sps)). When not present, the value of lt_idx_sps[i] is inferred to be equal to 0. The value of lt_idx_sps[i] shall be in the range of 0 to num_long_term_ref_pics_sps - 1, inclusive.

poc_lsb_lt[i] specifies the value of the picture order count modulo MaxPicOrderCntLsb of the i-th long-term reference picture included in the long-term reference picture set of the current picture. The length of the poc_lsb_lt[i] syntax element is log2_max_pic_order_cnt_lsb_minus4 + 4 bits.

used_by_curr_pic_lt_flag[i] equal to 0 specifies that the i-th long-term reference picture included in the long-term reference picture set of the current picture is not used for reference by the current picture.

The variables PocLsbLt[i] and UsedByCurrPicLt[i] are derived as follows. If i is less than num_long_term_sps, PocLsbLt[i] is set equal to lt_ref_pic_poc_lsb_sps[lt_idx_sps[i]] and UsedByCurrPicLt[i] is set equal to used_by_curr_pic_lt_sps_flag[lt_idx_sps[i]]. Otherwise, PocLsbLt[i] is set equal to poc_lsb_lt[i] and UsedByCurrPicLt[i] is set equal to used_by_curr_pic_lt_flag[i].
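The following sketch illustrates the two-branch derivation of PocLsbLt[i] and UsedByCurrPicLt[i]. The function name `derive_lt_entry` and all sample values are hypothetical; the lists are assumed to be indexed exactly as the syntax elements are.

```python
def derive_lt_entry(i, num_long_term_sps, lt_idx_sps,
                    lt_ref_pic_poc_lsb_sps, used_by_curr_pic_lt_sps_flag,
                    poc_lsb_lt, used_by_curr_pic_lt_flag):
    """Return (PocLsbLt[i], UsedByCurrPicLt[i]) for the i-th long-term entry."""
    if i < num_long_term_sps:
        # Entry selected from the candidates in the sequence parameter set.
        idx = lt_idx_sps[i]
        return lt_ref_pic_poc_lsb_sps[idx], used_by_curr_pic_lt_sps_flag[idx]
    # Entry signaled directly in the slice header.
    return poc_lsb_lt[i], used_by_curr_pic_lt_flag[i]

# Hypothetical values: entry 0 comes from the SPS, entry 1 from the slice header.
args = dict(
    num_long_term_sps=1,
    lt_idx_sps=[1],
    lt_ref_pic_poc_lsb_sps=[0, 8],
    used_by_curr_pic_lt_sps_flag=[1, 0],
    poc_lsb_lt=[None, 3],
    used_by_curr_pic_lt_flag=[None, 1],
)
print(derive_lt_entry(0, **args))
print(derive_lt_entry(1, **args))
```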

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. delta_poc_msb_present_flag[i] shall be equal to 1 when there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i].
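A minimal sketch of the constraint follows: the flag is required when more than one reference picture in the DPB shares the signaled POC LSB. The function name `msb_flag_required` and the sample DPB contents are hypothetical.

```python
def msb_flag_required(dpb_ref_pocs, poc_lsb_lt, max_pic_order_cnt_lsb):
    """Return True when the POC LSB alone is ambiguous within the DPB."""
    matches = sum(1 for poc in dpb_ref_pocs
                  if poc % max_pic_order_cnt_lsb == poc_lsb_lt)
    return matches > 1

# With MaxPicOrderCntLsb = 16, POC 0 and POC 16 share the LSB value 0.
print(msb_flag_required([0, 16, 20], poc_lsb_lt=0, max_pic_order_cnt_lsb=16))
print(msb_flag_required([16, 20], poc_lsb_lt=0, max_pic_order_cnt_lsb=16))
```

In the first case the MSB information is needed to tell the two pictures apart; in the second the LSB alone identifies the picture.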

delta_poc_msb_cycle_lt[i] is used to determine the value of the most significant bits of the picture order count value of the i-th long-term reference picture included in the long-term reference picture set of the current picture. When delta_poc_msb_cycle_lt[i] is not present, it is inferred to be equal to 0.

The variable DeltaPocMSBCycleLt[i] is derived as follows:

For the long-term reference picture set, the decoding process may be performed as follows:

This process is invoked once per picture, after decoding of a slice header but before the decoding of any coding unit and before the decoding process for reference picture list construction of the slice. The process may result in one or more reference pictures in the DPB being marked as "unused for reference" or "used for long-term reference".

Note that the reference picture set is an absolute description of the reference pictures used in the decoding process of the current and future coded pictures. The reference picture set signaling is explicit in the sense that all reference pictures included in the reference picture set are listed explicitly.

A picture may be marked as "unused for reference", "used for short-term reference", or "used for long-term reference", but only as one of these three. Assigning one of these markings to a picture implicitly removes any other of these markings, where applicable. When a picture is said to be marked as "used for reference", this refers collectively to a picture marked as "used for short-term reference" or "used for long-term reference" (but not both).
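The mutual exclusivity of the three markings can be modeled, as one possible sketch, by storing a single enumerated state per picture. The class names `Marking` and `DecodedPicture` are hypothetical and not part of the described syntax.

```python
from enum import Enum

class Marking(Enum):
    UNUSED = "unused for reference"
    SHORT_TERM = "used for short-term reference"
    LONG_TERM = "used for long-term reference"

class DecodedPicture:
    def __init__(self, poc, marking=Marking.SHORT_TERM):
        self.poc = poc
        # A single field, so exactly one marking holds at any time.
        self.marking = marking

    def used_for_reference(self):
        """True for either the short-term or the long-term marking."""
        return self.marking in (Marking.SHORT_TERM, Marking.LONG_TERM)

pic = DecodedPicture(poc=16)
pic.marking = Marking.LONG_TERM  # implicitly replaces the short-term marking
print(pic.marking.value, pic.used_for_reference())
```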

When the current picture is the first picture in the bitstream, the DPB is initialized to be an empty set of pictures.

When the current picture is an IDR picture or a BLA picture, all reference pictures currently in the DPB (if any) are marked as "unused for reference".

Short-term reference pictures are identified by their PicOrderCntVal values. Long-term reference pictures are identified either by their PicOrderCntVal values or by their pic_order_cnt_lsb values.

Five lists of picture order count values are constructed to derive the reference picture set: PocStCurrBefore, PocStCurrAfter, PocStFoll, PocLtCurr, and PocLtFoll, with NumPocStCurrBefore, NumPocStCurrAfter, NumPocStFoll, NumPocLtCurr, and NumPocLtFoll elements, respectively.

現ピクチャがＩＤＲピクチャである場合、ＰｏｃＳｔＣｕｒｒＢｅｆｏｒｅ、ＰｏｃＳｔＣｕｒｒＡｆｔｅｒ、ＰｏｃＳｔＦｏｌｌ、ＰｏｃＬｔＣｕｒｒ、およびＰｏｃＬｔＦｏｌｌは全て空に設定され、ＮｕｍＰｏｃＳｔＣｕｒｒＢｅｆｏｒｅ、ＮｕｍＰｏｃＳｔＣｕｒｒＡｆｔｅｒ、ＮｕｍＰｏｃＳｔＦｏｌｌ、ＮｕｍＰｏｃＬｔＣｕｒｒ、およびＮｕｍＰｏｃＬｔＦｏｌｌは全て０に設定される。 If the current picture is an IDR picture, PocStCurrBefore, PocStCurrAfter, PocStFoll, PocLtCurr, PocLtCur, and PocLtFall are all set to empty, NumPocStCurrBefore, NumPocStCurBefore, NumPocStCureBefore.

Otherwise, the following applies to the derivation of the five lists of picture order count values and the numbers of entries.

Here, PicOrderCntVal is the picture order count of the current picture.

In an alternative embodiment, the following semantics can be defined for delta_poc_msb_present_flag[i].

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1. Once delta_poc_msb_present_flag[i] is set equal to 1 for the current picture based on the above condition, delta_poc_msb_present_flag[i] shall be set to 1 for all subsequent pictures in decoding order, up to and including the first picture in decoding order that has a TemporalId value of 0 and a NAL unit type that is one of TRAIL_R, TSA_R, or STSA_R.
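As a non-limiting sketch of the semantics just described, the ambiguity condition and the range of pictures over which the flag remains set may be expressed as follows. The function names, the tuple layout, and the constant `RESET_TYPES` are assumptions made for illustration only; an actual encoder or conformance checker may organize this differently.

```python
# NAL unit types that end the forced range in this embodiment (from the text above).
RESET_TYPES = {"TRAIL_R", "TSA_R", "STSA_R"}


def msb_flag_required(dpb_ref_pocs, poc_lsb_lt, max_pic_order_cnt_lsb):
    """True when more than one reference picture in the DPB has a picture
    order count modulo MaxPicOrderCntLsb equal to PocLsbLt[i]."""
    matches = [poc for poc in dpb_ref_pocs
               if poc % max_pic_order_cnt_lsb == poc_lsb_lt]
    return len(matches) > 1


def forced_flag_range(pictures, trigger_index):
    """pictures: list of (temporal_id, nal_unit_type_name) in decoding order.
    Returns the indices for which the flag is set to 1, starting at the
    picture where the ambiguity was detected and ending at, and including,
    the first later picture with TemporalId 0 and a type in RESET_TYPES."""
    forced = [trigger_index]
    for idx in range(trigger_index + 1, len(pictures)):
        temporal_id, nal_type = pictures[idx]
        forced.append(idx)
        if temporal_id == 0 and nal_type in RESET_TYPES:
            break
    return forced
```

For instance, with MaxPicOrderCntLsb equal to 16, reference pictures with picture order counts 3 and 19 share the same LSB value 3, so the LSB alone cannot identify the long-term picture and the MSB information has to be signaled.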

In yet another embodiment, the following semantics can be defined for delta_poc_msb_present_flag[i].

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1. Once delta_poc_msb_present_flag[i] is set equal to 1 for the current picture based on the above condition, delta_poc_msb_present_flag[i] shall be set to 1 for all subsequent pictures in decoding order, up to and including the first picture in decoding order that has a TemporalId value of 0 and a NAL unit type of TRAIL_R.

In a further alternative embodiment, the following semantics can be defined for delta_poc_msb_present_flag[i].

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1. Once delta_poc_msb_present_flag[i] is set equal to 1 for the current picture based on the above condition, delta_poc_msb_present_flag[i] shall be set to 1 for all subsequent pictures in decoding order, up to and including the first picture in decoding order that has a TemporalId value of 0 and a NAL unit type that is one of TRAIL_R, TSA_R, STSA_R, RADL_R, or RASL_R.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1. Once delta_poc_msb_present_flag[i] is set equal to 1 for the current picture based on the above condition, delta_poc_msb_present_flag[i] shall be set to 1 for all subsequent pictures in decoding order, up to and including the first picture in decoding order that has a TemporalId value of 0.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1. Once delta_poc_msb_present_flag[i] is set equal to 1 for the current picture based on the above condition, delta_poc_msb_present_flag[i] shall be set to 1 for all subsequent pictures in decoding order that have a TemporalId value greater than 0 or a NAL unit type that is one of TRAIL_N, TSA_N, STSA_N, RADL_N, RASL_N, RSV_VCL_N10, RSV_VCL_N12, or RSV_VCL_N14.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1. Once delta_poc_msb_present_flag[i] is set equal to 1 for the current picture based on the above condition, delta_poc_msb_present_flag[i] shall be set to 1 for all subsequent pictures in decoding order, up to and including the first picture in decoding order that has a temporal identifier value equal to 0 and that is used as a reference picture or is not individually discardable.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1 for the current picture and for all pictures following the current picture in decoding order, up to and including the first picture following the current picture in decoding order that has a TemporalId value of 0 and a NAL unit type of TRAIL_R.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1 for the current picture and for all pictures following the current picture in decoding order, up to and including the first picture following the current picture in decoding order that has a TemporalId value of 0 and a NAL unit type that is one of TRAIL_R, TSA_R, or STSA_R.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1 for the current picture and for all pictures following the current picture in decoding order, up to and including the first picture following the current picture in decoding order that has a TemporalId value of 0 and a NAL unit type that is one of TRAIL_R, TSA_R, STSA_R, RADL_R, or RASL_R.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1 for the current picture and for all pictures following the current picture in decoding order that have a TemporalId value greater than 0 or a NAL unit type that is one of TRAIL_N, TSA_N, STSA_N, RADL_N, RASL_N, RSV_VCL_N10, RSV_VCL_N12, or RSV_VCL_N14.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1 for the current picture and for all pictures following the current picture in decoding order, up to and including the first picture following the current picture in decoding order that has a TemporalId value of 0.

delta_poc_msb_present_flag[i] equal to 1 specifies that delta_poc_msb_cycle_lt[i] is present. delta_poc_msb_present_flag[i] equal to 0 specifies that delta_poc_msb_cycle_lt[i] is not present. When there is more than one reference picture in the decoded picture buffer whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], delta_poc_msb_present_flag[i] shall be equal to 1 for the current picture and for all pictures following the current picture in decoding order, up to and including the first picture following the current picture in decoding order that has a TemporalId value of 0 and that is used as a reference picture or is not individually discardable.

Here, nal_unit_type is defined by the following semantics.

nal_unit_type specifies the type of RBSP data structure contained in the NAL unit, as specified in Table y below.

Here, the following definitions apply.

Access unit: A set of NAL units that are associated with each other according to a specified classification rule, are consecutive in decoding order, and contain exactly one coded picture.

In addition to the NAL units containing the coded slice segments of the coded picture, an access unit may also contain other NAL units that do not contain slice segments of the coded picture. The decoding of an access unit always results in a decoded picture.

Associated non-VCL NAL unit: A non-VCL NAL unit for which a particular VCL NAL unit is the associated VCL NAL unit of that non-VCL NAL unit.

Associated RAP picture: The previous RAP picture in decoding order (if present).

Associated VCL NAL unit: For a non-VCL NAL unit with nal_unit_type equal to EOS_NUT, EOB_NUT, FD_NUT, or SUFFIX_SEI_NUT, or in the range of RSV_NVCL45 to RSV_NVCL47, or in the range of UNSPEC48 to UNSPEC63, the preceding VCL NAL unit in decoding order; for a non-VCL NAL unit with nal_unit_type equal to any other value, the next VCL NAL unit in decoding order.
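The association rule in the preceding definition may, purely as an illustrative sketch, be implemented as a search backward or forward from the non-VCL NAL unit. The helper names and the representation of a NAL unit as a (type name, is-VCL) pair are assumptions of this sketch and not part of the specification.

```python
import re

# Non-VCL types named in the definition as attaching to the preceding VCL NAL unit.
_SUFFIX_LIKE = {"EOS_NUT", "EOB_NUT", "FD_NUT", "SUFFIX_SEI_NUT"}


def uses_preceding_vcl(type_name):
    """True when the non-VCL NAL unit is associated with the preceding VCL NAL unit."""
    if type_name in _SUFFIX_LIKE:
        return True
    m = re.fullmatch(r"(RSV_NVCL|UNSPEC)(\d+)", type_name)
    if m is None:
        return False
    prefix, number = m.group(1), int(m.group(2))
    if prefix == "RSV_NVCL":
        return 45 <= number <= 47
    return 48 <= number <= 63


def associated_vcl_index(units, i):
    """units: list of (type_name, is_vcl) in decoding order; i: index of a
    non-VCL NAL unit. Returns the index of its associated VCL NAL unit,
    or None when no such unit exists in the list."""
    type_name, is_vcl = units[i]
    if is_vcl:
        raise ValueError("index i must refer to a non-VCL NAL unit")
    if uses_preceding_vcl(type_name):
        candidates = range(i - 1, -1, -1)      # search backward
    else:
        candidates = range(i + 1, len(units))  # search forward
    for j in candidates:
        if units[j][1]:
            return j
    return None
```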

Broken link access (BLA) access unit: An access unit in which the coded picture is a BLA picture.

Broken link access (BLA) picture: A RAP picture in which each slice segment has nal_unit_type equal to BLA_W_LP, BLA_W_RADL, or BLA_N_LP.

A BLA picture contains only I slices, and may be the first picture in the bitstream in decoding order or may appear later in the bitstream. Each BLA picture begins a new coded video sequence and has the same effect on the decoding process as an IDR picture. However, a BLA picture contains syntax elements that specify a non-empty reference picture set. When a BLA picture has nal_unit_type equal to BLA_W_LP, it may have associated RASL pictures, which are not output by the decoder and may not be decodable, as they may contain references to pictures that are not present in the bitstream. When a BLA picture has nal_unit_type equal to BLA_W_LP, it may also have associated RADL pictures, which are specified to be decoded. When a BLA picture has nal_unit_type equal to BLA_W_RADL, it does not have associated RASL pictures but may have associated RADL pictures, which are specified to be decoded. When a BLA picture has nal_unit_type equal to BLA_N_LP, it does not have any associated leading pictures.

Clean random access (CRA) access unit: An access unit in which the coded picture is a CRA picture.

Clean random access (CRA) picture: A RAP picture in which each slice segment has nal_unit_type equal to CRA_NUT.

A CRA picture contains only I slices, and may be the first picture in the bitstream in decoding order or may appear later in the bitstream. A CRA picture may have associated RADL or RASL pictures. When a CRA picture is the first picture in the bitstream in decoding order, the CRA picture is the first picture of a coded video sequence in decoding order, and any associated RASL pictures are not output by the decoder and may not be decodable, as they may contain references to pictures that are not present in the bitstream.

Filler data NAL unit: A NAL unit with nal_unit_type equal to FD_NUT.

Instantaneous decoding refresh (IDR) access unit: An access unit in which the coded picture is an IDR picture.

Instantaneous decoding refresh (IDR) picture: A RAP picture in which each slice segment has nal_unit_type equal to IDR_W_RADL or IDR_N_LP.

An IDR picture contains only I slices, and may be the first picture in the bitstream in decoding order or may appear later in the bitstream. Each IDR picture is the first picture of a coded video sequence in decoding order. When an IDR picture has nal_unit_type equal to IDR_W_RADL, it may have associated RADL pictures. When an IDR picture has nal_unit_type equal to IDR_N_LP, it does not have any associated leading pictures. An IDR picture does not have associated RASL pictures.
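The leading-picture rules stated in the BLA, CRA, and IDR definitions above may be summarized, as an illustrative sketch only, by a lookup table keyed on the nal_unit_type name of the RAP picture. The names `ALLOWED_LEADING` and `may_have_leading` are hypothetical.

```python
# Kinds of leading picture each RAP picture type may have, per the definitions above.
ALLOWED_LEADING = {
    "BLA_W_LP": {"RASL", "RADL"},
    "BLA_W_RADL": {"RADL"},
    "BLA_N_LP": set(),
    "CRA_NUT": {"RASL", "RADL"},
    "IDR_W_RADL": {"RADL"},
    "IDR_N_LP": set(),
}


def may_have_leading(rap_type, leading_kind):
    """True when a RAP picture of rap_type may have an associated leading
    picture of leading_kind ("RASL" or "RADL")."""
    return leading_kind in ALLOWED_LEADING[rap_type]
```

A bitstream checker could, for example, call `may_have_leading("IDR_W_RADL", "RASL")`, which evaluates to False, to reject a RASL picture associated with an IDR picture.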

Long-term reference picture: A picture that is marked as "used for long-term reference".

Long-term reference picture set: The two reference picture set lists that may contain long-term reference pictures.

Network abstraction layer (NAL) unit: A syntax structure containing an indication of the type of data to follow and bytes containing that data in the form of an RBSP interspersed as necessary with emulation prevention bytes.

Network abstraction layer (NAL) unit stream: A sequence of NAL units.

Non-reference picture: A picture that is marked as "not used for reference".

A non-reference picture contains samples that cannot be used for inter prediction in the decoding process of subsequent pictures in decoding order.

Picture parameter set (PPS): A syntax structure containing syntax elements that apply to zero or more entire coded pictures, as determined by a syntax element found in each slice segment header.

Prefix SEI NAL unit: An SEI NAL unit that has nal_unit_type equal to PREFIX_SEI_NUT.

Random access decodable leading (RADL) access unit: An access unit in which the coded picture is a RADL picture.

Random access decodable leading (RADL) picture: A coded picture in which each slice segment has nal_unit_type equal to RADL_NUT.

All RADL pictures are leading pictures. RADL pictures are not used as reference pictures for the decoding process of trailing pictures of the same associated RAP picture. When present, all RADL pictures precede, in decoding order, all trailing pictures of the same associated RAP picture.

Random access point (RAP) access unit: An access unit in which the coded picture is a RAP picture.

Random access point (RAP) picture: A coded picture in which each slice segment has nal_unit_type in the range of 7 to 12, inclusive.

A RAP picture contains only I slices, and may be a BLA picture, a CRA picture, or an IDR picture. The first picture in the bitstream must be a RAP picture. Provided the necessary parameter sets are available when they need to be activated, the RAP picture and all subsequent non-RASL pictures in decoding order can be correctly decoded without performing the decoding process of any pictures that precede the RAP picture in decoding order. Pictures that contain only I slices but are not RAP pictures may also be present in the bitstream.

Random access skipped leading (RASL) access unit: An access unit in which the coded picture is a RASL picture.

Random access skipped leading (RASL) picture: A coded picture in which each slice segment has nal_unit_type equal to RASL_NUT.

All RASL pictures are leading pictures of an associated BLA or CRA picture. When the associated RAP picture is a BLA picture or is the first coded picture in the bitstream, the RASL picture is not output and may not be correctly decodable, as it may contain references to pictures that are not present in the bitstream. RASL pictures are not used as reference pictures for the decoding process of non-RASL pictures. When present, all RASL pictures precede, in decoding order, all trailing pictures of the same associated RAP picture.

Raw byte sequence payload (RBSP): A syntax structure containing an integer number of bytes that is encapsulated in a NAL unit. An RBSP is either empty or has the form of a string of data bits containing syntax elements, followed by an RBSP stop bit and then by zero or more subsequent bits equal to 0.

Raw byte sequence payload (RBSP) stop bit: A bit equal to 1 present within an RBSP after a string of data bits. The location of the end of the string of data bits within an RBSP can be identified by searching from the end of the RBSP for the RBSP stop bit, which is the last non-zero bit in the RBSP.
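The backward search for the RBSP stop bit described above may be sketched as follows. The function name `rbsp_data_bit_length` is hypothetical, and the sketch assumes that emulation prevention bytes have already been removed from the input.

```python
def rbsp_data_bit_length(rbsp: bytes) -> int:
    """Return the number of data bits that precede the RBSP stop bit.

    The search runs from the end of the RBSP toward its start and stops at
    the last non-zero bit, which is the stop bit. An empty RBSP has no data
    bits. A non-empty RBSP without any bit equal to 1 is rejected.
    """
    for byte_index in range(len(rbsp) - 1, -1, -1):
        byte = rbsp[byte_index]
        if byte:
            # Number of zero bits after the lowest set bit in this byte.
            trailing_zeros = (byte & -byte).bit_length() - 1
            # Bits before the stop bit: whole preceding bytes plus the bits
            # of this byte that come before the stop bit (MSB first).
            return byte_index * 8 + (7 - trailing_zeros)
    if rbsp:
        raise ValueError("non-empty RBSP contains no stop bit")
    return 0
```

For the single byte 0xA8 (binary 1010 1000), the stop bit is the fifth bit, so the string of data bits consists of the four bits 1010.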

Reference picture: A picture that is a short-term reference picture or a long-term reference picture.

A reference picture contains samples that may be used for inter prediction in the decoding process of subsequent pictures in decoding order.

Sequence parameter set (SPS): A syntax structure containing syntax elements that apply to zero or more entire coded video sequences, as determined by the content of a syntax element found in the picture parameter set referred to by a syntax element found in each slice segment header.

Short-term reference picture: A picture that is marked as "used for short-term reference".

Step-wise temporal sub-layer access (STSA) access unit: An access unit in which the coded picture is an STSA picture.

Step-wise temporal sub-layer access (STSA) picture: A coded picture in which each slice segment has nal_unit_type equal to STSA_R or STSA_N.

An STSA picture does not use pictures with the same TemporalId as the STSA picture for inter prediction reference. Pictures that follow an STSA picture in decoding order and have the same TemporalId as the STSA picture do not use, for inter prediction reference, pictures that precede the STSA picture in decoding order and have the same TemporalId as the STSA picture. An STSA picture enables up-switching, at the STSA picture, from the immediately lower sub-layer to the sub-layer containing the STSA picture. An STSA picture shall have TemporalId greater than 0.

Suffix SEI NAL unit: An SEI NAL unit that has nal_unit_type equal to SUFFIX_SEI_NUT.

Supplemental enhancement information (SEI) NAL unit: A NAL unit that has nal_unit_type equal to PREFIX_SEI_NUT or SUFFIX_SEI_NUT.

Temporal sub-layer access (TSA) access unit: An access unit in which the coded picture is a TSA picture.

Temporal sub-layer access (TSA) picture: A coded picture in which each slice segment has nal_unit_type equal to TSA_R or TSA_N.

A TSA picture and the pictures following the TSA picture in decoding order do not use pictures with TemporalId greater than or equal to that of the TSA picture for inter prediction reference. A TSA picture enables up-switching, at the TSA picture, from the immediately lower sub-layer to the sub-layer containing the TSA picture or to any higher sub-layer. A TSA picture shall have TemporalId greater than 0.

Trailing picture: A picture that follows the associated RAP picture in output order.

Video coding layer (VCL) NAL unit: A collective term for coded slice segment NAL units and the subset of NAL units that have reserved values of nal_unit_type that are classified as VCL NAL units.

Video parameter set (VPS): A syntax structure containing syntax elements that apply to zero or more entire coded video sequences, as determined by the content of a syntax element found in the sequence parameter set referred to by a syntax element found in the picture parameter set referred to by a syntax element found in each slice segment header.

Here, the temporal identifier value is defined as follows.

nuh_temporal_id_plus1 minus 1 specifies the temporal identifier of the NAL unit. The value of nuh_temporal_id_plus1 shall not be equal to 0.

The variable TemporalId is specified as TemporalId = nuh_temporal_id_plus1 − 1.

When nal_unit_type is in the range of 16 to 23, inclusive (coded slice segments of a RAP picture), TemporalId shall be equal to 0; otherwise, when nal_unit_type is equal to TSA_R, TSA_N, STSA_R, or STSA_N, TemporalId shall not be equal to 0.
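As a non-limiting sketch, the derivation of TemporalId and the two constraints stated above may be checked as follows. The function name `temporal_id` is hypothetical. Because Table y is not reproduced here, the sketch takes the TSA and STSA types by name rather than by number, and the numeric nal_unit_type values used for non-RAP types in any call are placeholders only.

```python
TSA_STSA_TYPES = {"TSA_R", "TSA_N", "STSA_R", "STSA_N"}


def temporal_id(nuh_temporal_id_plus1, nal_unit_type, type_name):
    """Derive TemporalId = nuh_temporal_id_plus1 - 1 and check the constraints.

    nal_unit_type is the numeric type, used only for the 16..23 RAP range
    named in the text; type_name is the symbolic name, used for the
    TSA/STSA check. Raises ValueError when a constraint is violated.
    """
    if nuh_temporal_id_plus1 == 0:
        raise ValueError("nuh_temporal_id_plus1 shall not be equal to 0")
    tid = nuh_temporal_id_plus1 - 1
    if 16 <= nal_unit_type <= 23:
        if tid != 0:
            raise ValueError("TemporalId shall be 0 for RAP slice segments")
    elif type_name in TSA_STSA_TYPES and tid == 0:
        raise ValueError("TemporalId shall not be 0 for TSA/STSA pictures")
    return tid
```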

The value of TemporalId shall be the same for all VCL NAL units of an access unit. The TemporalId value of an access unit is the TemporalId value of the VCL NAL units of that access unit.
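
The derivation of TemporalId and the constraints on it can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the normative decoding process. The function names `temporal_id` and `access_unit_temporal_id` are hypothetical, and the numeric nal_unit_type values used for TSA_N, TSA_R, STSA_N and STSA_R (2 to 5) are assumptions taken from the published HEVC specification, since this description does not state them.

```python
# Assumed nal_unit_type values (from the published HEVC specification,
# not stated in this description).
TSA_N, TSA_R, STSA_N, STSA_R = 2, 3, 4, 5
RAP_TYPES = range(16, 24)  # 16 to 23 inclusive: RAP coded slice segments


def temporal_id(nal_unit_type: int, nuh_temporal_id_plus1: int) -> int:
    """Derive TemporalId for one NAL unit and check the stated constraints."""
    if nuh_temporal_id_plus1 == 0:
        raise ValueError("nuh_temporal_id_plus1 shall not be equal to 0")
    tid = nuh_temporal_id_plus1 - 1
    if nal_unit_type in RAP_TYPES:
        if tid != 0:
            raise ValueError("RAP pictures shall have TemporalId equal to 0")
    elif nal_unit_type in (TSA_N, TSA_R, STSA_N, STSA_R):
        if tid == 0:
            raise ValueError("TSA/STSA pictures shall not have TemporalId equal to 0")
    return tid


def access_unit_temporal_id(vcl_nal_units) -> int:
    """TemporalId of an access unit.

    vcl_nal_units is a list of (nal_unit_type, nuh_temporal_id_plus1) pairs;
    every VCL NAL unit of the access unit must yield the same TemporalId.
    """
    tids = {temporal_id(t, p) for t, p in vcl_nal_units}
    if len(tids) != 1:
        raise ValueError("all VCL NAL units of an access unit must share one TemporalId")
    return tids.pop()
```

Raising an exception on a violated constraint is a design choice of this sketch; a real decoder might instead flag the bitstream as non-conforming and continue.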

For example, referring to FIG. 15, for picture B the decoded picture buffer contains more than one reference picture whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i], so delta_poc_msb_present_flag[i] is equal to 1. In accordance with the embodiments described herein, delta_poc_msb_present_flag[i] is also set equal to 1 for pictures C and D. Because picture D belongs to TemporalId = 0, pictures that follow D in decoding order need not set delta_poc_msb_present_flag[i] to 1, unless for such a subsequent picture the decoded picture buffer again contains more than one reference picture whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt[i].
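
The rule illustrated by pictures B, C and D can be sketched as follows. This is a simplified model under stated assumptions, not the embodiment's actual implementation. The `Pic` record, its field names, the function names, and all POC values in the example are hypothetical and are not taken from FIG. 15. The sketch computes where the flag is required to be 1; an encoder remains free to set it to 1 elsewhere.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Pic:
    name: str
    temporal_id: int
    dpb_ref_pocs: List[int]  # POCs of reference pictures in the DPB for this picture
    poc_lsb_lt: int          # PocLsbLt[i] of the long-term entry being signalled
    trailing_or_ref: bool = True  # hypothetical: trailing or reference picture


def lsb_is_ambiguous(pic: Pic, max_poc_lsb: int) -> bool:
    """Condition (i): more than one DPB reference picture shares the LSBs."""
    matches = sum(1 for poc in pic.dpb_ref_pocs
                  if poc % max_poc_lsb == pic.poc_lsb_lt)
    return matches > 1


def msb_present_flags(pics: List[Pic], max_poc_lsb: int) -> List[int]:
    """Required delta_poc_msb_present_flag per picture, in decoding order."""
    flags = []
    window_open = False  # True while condition (ii) is in effect
    for pic in pics:
        cond_i = lsb_is_ambiguous(pic, max_poc_lsb)
        flags.append(1 if (cond_i or window_open) else 0)
        if cond_i:
            window_open = True  # following pictures fall under condition (ii)
        elif window_open and pic.temporal_id == 0 and pic.trailing_or_ref:
            window_open = False  # this picture is the last one covered
    return flags


# Illustrative sequence with MaxPicOrderCntLsb = 16 (values are made up).
example = [
    Pic("A", 0, [0], 0),
    Pic("B", 1, [0, 16], 0),   # POC 0 and POC 16 share LSB 0: condition (i)
    Pic("C", 1, [16, 18], 0),
    Pic("D", 0, [16, 20], 0),  # TemporalId 0: last picture under condition (ii)
    Pic("E", 1, [16, 24], 0),
]
flags = msb_present_flags(example, 16)
```

In this example the flag is required for B, C and D but not for A or E, mirroring the behaviour described for FIG. 15.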

The terms and expressions used in the foregoing specification are used as terms of description and not of limitation. Their use is not intended to exclude equivalents of the features shown and described, or portions thereof, and it will be understood that the scope of the invention is defined and limited only by the claims that follow.

Claims

A method for decoding a video bitstream, comprising:
receiving reference picture set parameters from the video bitstream;
decoding a current picture using inter prediction based on the reference picture set; and
storing the decoded picture, to be referenced for future inter prediction, in a decoded picture buffer,
wherein the reference picture set is decoded by using at least (a) one or more reference picture identifiers, each based on a selected number of least significant bits (LSB) of the picture order count (POC) of a reference picture, and (b) a flag that specifies whether subsequent data determining the MSB of the POC of the reference picture is present, and
wherein the subsequent data is present for a picture that satisfies one of the following conditions: (i) the decoded picture buffer has more than one reference picture whose picture order count modulo MaxPicOrderCntLsb is equal to PocLsbLt; and (ii) the picture follows, in decoding order, a picture satisfying condition (i), up to and including the first picture in decoding order that has a TemporalId value of 0 and is a trailing picture or a reference picture of a sub-stream.