JP6396566B2

JP6396566B2 - Electronic device, encoding method and program

Info

Publication number: JP6396566B2
Application number: JP2017203573A
Authority: JP
Inventors: 昭行谷沢; 中條　健; 健中條
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2017-10-20
Filing date: 2017-10-20
Publication date: 2018-09-26
Anticipated expiration: 2031-10-17
Also published as: JP2018042266A

Description

本発明の実施形態は、電子機器、符号化方法及びプログラムに関する。 Embodiments described herein relate generally to an electronic device, an encoding method, and a program.

近年、符号化効率を大幅に向上させた画像符号化方法が、ＩＴＵ−Ｔ（International Telecommunication Union Telecommunication Standardization Sector）とＩＳＯ（International Organization for Standardization）／ＩＥＣ（International Electrotechnical Commission）との共同で、ＩＴＵ−ＴＲＥＣ．Ｈ．２６４及びＩＳＯ／ＩＥＣ１４４９６−１０（以下、「Ｈ．２６４」という）として勧告されている。 In recent years, an image coding method with greatly improved coding efficiency has been developed in collaboration with ITU-T (International Telecommunication Union Telecommunication Standardization Sector) and ISO (International Organization for Standardization) / IEC (International Electrotechnical Commission). T REC. H. H.264 and ISO / IEC 14496-10 (hereinafter referred to as “H.264”).

Ｈ．２６４には、符号化済みの画像を参照画像に用いて分数精度の動き補償予測を行うことにより、時間方向の冗長性を削除し、高い符号化効率を実現するインター予測符号化方式が開示されている。 H. H.264 discloses an inter-prediction coding scheme that eliminates temporal redundancy and realizes high coding efficiency by performing fractional accuracy motion compensated prediction using a coded image as a reference image. ing.

また、ＩＳＯ／ＩＥＣＭＰＥＧ（Moving Picture Experts Group）−１，２，４におけるインター予測符号化方式よりも、フェードやディゾルブ効果を含む動画像を高効率に符号化する方式も提案されている。この方式では、時間方向における明度変化を予測する枠組みとして、輝度と２つの色差とを有する入力動画像に対して分数精度の動き補償予測を行う。そして、参照画像と、輝度及び２つの色差毎の重み係数と、輝度及び２つの色差毎のオフセットと、を含む組み合わせ示すインデックスを用いて、予測画像に重み係数を乗じ、オフセットを加算する。 Also, a scheme for encoding moving images including fade and dissolve effects with higher efficiency than the inter prediction encoding schemes in ISO / IEC MPEG (Moving Picture Experts Group) -1, 2, 4 has been proposed. In this method, as a framework for predicting a change in brightness in the time direction, fractional accuracy motion compensation prediction is performed on an input moving image having luminance and two color differences. Then, the prediction image is multiplied by the weighting factor using an index indicating a combination including the reference image, the weighting factor for each luminance and two color differences, and the offset for each luminance and two color differences, and the offset is added.

特開２００４−７３７７号公報JP 2004-7377 A

しかしながら、上述したような従来技術では、インデックスを直値のまま符号化するため、符号化効率が低下してしまう。本発明が解決しようとする課題は、符号化効率を向上できる電子機器、復号方法及びプログラムを提供することである。 However, in the conventional technique as described above, since the index is encoded with the direct value, the encoding efficiency is lowered. The problem to be solved by the present invention is to provide an electronic device, a decoding method, and a program that can improve encoding efficiency.

実施形態の電子機器は、符号化データが入力される電子機器であって、処理部を備える。処理部は、輝度の重み係数の第１固定小数点精度を符号化し、第１固定小数点精度と、色差の重み係数の第２固定小数点精度との差異と同じ値である、第１差分値を符号化し、第１固定小数点精度で定められるビット数だけ“１”を左シフトすることによって得られる値と同じ値である第１基準値と、輝度の重み係数と、の差異と同じ値である、輝度の重み係数の第２差分値を符号化し、第２固定小数点精度で定められるビット数だけ“１”を左シフトすることによって得られる値である第２基準値と、色差の重み係数と、の差異と同じ値である、色差の重み係数の第３差分値を符号化し、画素値の最大値の中央値から、中央値に対して色差の重み係数を乗算しかつ第２固定小数点精度により定められるビット数だけ右シフトすることによって得られる値を減算することにより得られる値と同じ値である第３基準値と、色差のオフセットと、の差異と同じ値である、色差のオフセットの第４差分値を符号化する。 The electronic device according to the embodiment is an electronic device to which encoded data is input, and includes a processing unit. The processing unit encodes the first fixed-point precision of the luminance weighting coefficient, and encodes the first difference value that is the same value as the difference between the first fixed-point precision and the second fixed-point precision of the color difference weighting coefficient. And the same value as the difference between the first reference value, which is the same value as the value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision, and the luminance weighting factor, A second reference value that is a value obtained by encoding the second difference value of the luminance weighting coefficient and left-shifting “1” by the number of bits determined by the second fixed-point precision; a color difference weighting coefficient; The third difference value of the color difference weighting coefficient, which is the same value as the difference between the two, is encoded, the median value is multiplied by the color difference weighting coefficient from the median of the maximum pixel values, and the second fixed point precision is obtained. By shifting right by the specified number of bits A third reference value is the same value as the value obtained by subtracting the resulting value is the same value as the offset and, for differences in color difference, encoding a fourth differential value of the offset of the color difference.

第１実施形態の符号化装置の例を示すブロック図。The block diagram which shows the example of the encoding apparatus of 1st Embodiment. 第１実施形態における画素ブロックの予測符号化順序例を示す説明図。Explanatory drawing which shows the example of the prediction encoding order of the pixel block in 1st Embodiment. 第１実施形態におけるコーディングツリーブロックのブロックサイズ例を示す図。The figure which shows the block size example of the coding tree block in 1st Embodiment. 第１実施形態のコーディングツリーブロックの具体例を示す図。The figure which shows the specific example of the coding tree block of 1st Embodiment. 第１実施形態のコーディングツリーブロックの具体例を示す図。The figure which shows the specific example of the coding tree block of 1st Embodiment. 第１実施形態のコーディングツリーブロックの具体例を示す図。The figure which shows the specific example of the coding tree block of 1st Embodiment. 第１実施形態の予測画像生成部の例を示すブロック図。The block diagram which shows the example of the estimated image generation part of 1st Embodiment. 第１実施形態の双方向予測における動き補償予測の動きベクトルの関係の例を示す図。The figure which shows the example of the relationship of the motion vector of the motion compensation prediction in the bidirectional | two-way prediction of 1st Embodiment. 第１実施形態の複数フレーム動き補償部の例を示すブロック図。The block diagram which shows the example of the multi-frame motion compensation part of 1st Embodiment. 第１実施形態における重み係数の固定小数点精度の例の説明図。Explanatory drawing of the example of the fixed point precision of the weighting coefficient in 1st Embodiment. 第１実施形態のＷＰパラメータ情報例を示す図。The figure which shows the WP parameter information example of 1st Embodiment. 第１実施形態のＷＰパラメータ情報例を示す図。The figure which shows the WP parameter information example of 1st Embodiment. 第１実施形態のシンタクスの例を示す図。The figure which shows the example of the syntax of 1st Embodiment. 第１実施形態のピクチャパラメータセットシンタクスの例を示す図。The figure which shows the example of the picture parameter set syntax of 1st Embodiment. 第１実施形態のスライスヘッダーシンタクスの例を示す図。The figure which shows the example of the slice header syntax of 1st Embodiment. 第１実施形態のプレッドウェイトテーブルシンタクスの例を示す図。The figure which shows the example of the tread weight table syntax of 1st Embodiment. 第１実施形態の予測方法を明示的に示したシンタクス構成の例を示す図。The figure which shows the example of the syntax structure which showed the prediction method of 1st Embodiment explicitly. 第１実施形態の固定小数点精度の予測処理例を示すフローチャート。The flowchart which shows the example of a prediction process of the fixed point precision of 1st Embodiment. 第１実施形態の固定小数点精度の復元処理例を示すフローチャート。6 is a flowchart illustrating an example of fixed point precision restoration processing according to the first embodiment. 第１実施形態の重み係数の予測処理例を示すフローチャート。The flowchart which shows the example of the prediction process of the weighting coefficient of 1st Embodiment. 第１実施形態の重み係数の復元処理例を示すフローチャート。6 is a flowchart showing an example of weight coefficient restoration processing according to the first embodiment. 第１実施形態の重み係数の予測処理の他の例を示すフローチャート。7 is a flowchart illustrating another example of weight coefficient prediction processing according to the first embodiment. 第１実施形態の重み係数の復元処理の他の例を示すフローチャート。7 is a flowchart illustrating another example of the weighting factor restoration process according to the first embodiment. 第１実施形態の色差信号の予測処理例を示すフローチャート。5 is a flowchart illustrating an example of color difference signal prediction processing according to the first embodiment. 第１実施形態の色差信号の復元処理例を示すフローチャート。5 is a flowchart illustrating an example of color difference signal restoration processing according to the first embodiment. 第１実施形態の重み係数の予測処理例の他の例を示すフローチャート。9 is a flowchart illustrating another example of weighting factor prediction processing according to the first embodiment. 第１実施形態の重み係数の復元処理例の他の例を示すフローチャート。12 is a flowchart illustrating another example of the weighting factor restoration process according to the first embodiment. 第２実施形態の復号装置の構成例を示すブロック図。The block diagram which shows the structural example of the decoding apparatus of 2nd Embodiment.

以下、添付図面を参照しながら、実施形態を詳細に説明する。以下の各実施形態の符号化装置及び復号装置は、ＬＳＩ（Large-Scale Integration）チップ、ＤＳＰ（Digital Signal Processor）、又はＦＰＧＡ（Field Programmable Gate Array）などのハードウェアにより実現できる。また、以下の各実施形態の符号化装置及び復号装置は、コンピュータにプログラムを実行させること、即ち、ソフトウェアにより実現させることもできる。なお、以降の説明において、「画像」という用語は、「映像」、「画素」、「画像信号」、「絵」、又は「画像データ」などの用語に適宜読み替えることができる。 Hereinafter, embodiments will be described in detail with reference to the accompanying drawings. The encoding device and decoding device of each of the following embodiments can be realized by hardware such as an LSI (Large-Scale Integration) chip, a DSP (Digital Signal Processor), or an FPGA (Field Programmable Gate Array). In addition, the encoding device and the decoding device of each of the following embodiments can be realized by software by causing a computer to execute a program. In the following description, the term “image” can be appropriately replaced with a term such as “video”, “pixel”, “image signal”, “picture”, or “image data”.

（第１実施形態）
第１実施形態では、動画像を符号化する符号化装置について説明する。 (First embodiment)
In the first embodiment, an encoding apparatus that encodes a moving image will be described.

図１は、第１実施形態の符号化装置１００の構成の一例を示すブロック図である。 FIG. 1 is a block diagram illustrating an example of a configuration of the encoding device 100 according to the first embodiment.

符号化装置１００は、入力画像を構成する各フレーム又は各フィールドを複数の画素ブロックに分割し、符号化制御部１１１から入力される符号化パラメータを用いて、分割した画素ブロックに対して予測符号化を行い、予測画像を生成する。そして符号化装置１００は、複数の画素ブロックに分割した入力画像と予測画像とを減算して予測誤差を生成し、生成した予測誤差を直交変換及び量子化し、更にエントロピー符号化を行って符号化データを生成し、出力する。 The encoding apparatus 100 divides each frame or each field constituting the input image into a plurality of pixel blocks, and uses the encoding parameters input from the encoding control unit 111 to predict the encoded code for the divided pixel blocks. To generate a predicted image. The encoding apparatus 100 subtracts the input image divided into a plurality of pixel blocks and the prediction image to generate a prediction error, orthogonally transforms and quantizes the generated prediction error, and further performs entropy encoding to perform encoding. Generate and output data.

符号化装置１００は、画素ブロックのブロックサイズ及び予測画像の生成方法の少なくともいずれかが異なる複数の予測モードを選択的に適用して予測符号化を行う。予測画像の生成方法は、大別すると、符号化対象フレーム内で予測を行うイントラ予測と、時間的に異なる１以上の参照フレームを用いて動き補償予測を行うインター予測との２種類である。なお、イントラ予測は、画面内予測又はフレーム内予測などとも称され、インター予測は、画面間予測、フレーム間予測、又は動き補償予測などとも称される。 The encoding apparatus 100 performs predictive encoding by selectively applying a plurality of prediction modes in which at least one of the block size of the pixel block and the prediction image generation method is different. The generation method of the prediction image is roughly classified into two types: intra prediction that performs prediction within the encoding target frame and inter prediction that performs motion compensation prediction using one or more reference frames that are temporally different. Note that intra prediction is also referred to as intra prediction or intra frame prediction, and inter prediction is also referred to as inter prediction, inter frame prediction, motion compensation prediction, or the like.

図２は、第１実施形態における画素ブロックの予測符号化順序の一例を示す説明図である。図２に示す例では、符号化装置１００は、画素ブロックの左上から右下に向かって予測符号化を行っており、符号化処理対象のフレームｆにおいて、符号化対象画素ブロックｃよりも左側及び上側に符号化済み画素ブロックｐが位置している。以下では、説明の簡単化のため、符号化装置１００は、図２に示す順序で予測符号化を行うものとするが、予測符号化の順序はこれに限定されるものではない。 FIG. 2 is an explanatory diagram illustrating an example of a predictive coding order of pixel blocks in the first embodiment. In the example illustrated in FIG. 2, the encoding device 100 performs predictive encoding from the upper left to the lower right of the pixel block, and the left side of the encoding target pixel block c in the encoding target frame f and The encoded pixel block p is located on the upper side. Hereinafter, for simplification of description, the encoding apparatus 100 performs predictive encoding in the order shown in FIG. 2, but the order of predictive encoding is not limited to this.

画素ブロックは、画像を処理する単位を示し、例えば、Ｍ×Ｎサイズのブロック（Ｍ及びＮは自然数）、コーディングツリーブロック、マクロブロック、サブブロック、又は１画素などが該当する。以降の説明では、基本的に、画素ブロックをコーディングツリーブロックの意味で使用するが、他の意味で使用する場合もある。例えば、プレディクションユニットの説明では、画素ブロックを、プレディクションユニットの画素ブロックの意味で使用する。また、ブロックはユニットなどの名称で呼ばれることもある。例えばコーディングブロックをコーディングユニットと呼ぶ。 The pixel block represents a unit for processing an image, and corresponds to, for example, an M × N size block (M and N are natural numbers), a coding tree block, a macro block, a sub block, or one pixel. In the following description, the pixel block is basically used in the meaning of the coding tree block, but may be used in other meanings. For example, in the description of the prediction unit, a pixel block is used to mean a pixel block of the prediction unit. A block may be called by a name such as a unit. For example, a coding block is called a coding unit.

図３Ａは、第１実施形態におけるコーディングツリーブロックのブロックサイズの一例を示す図である。コーディングツリーブロックは、典型的には、図３Ａに示すような６４×６４の画素ブロックである。但し、これに限定されるものではなく、３２×３２の画素ブロック、１６×１６の画素ブロック、８×８の画素ブロック、又は４×４の画素ブロックなどであってもよい。また、コーディングツリーブロックは、正方形でなくてもよく、例えば、Ｍ×Ｎサイズ（Ｍ≠Ｎ）の画素ブロックであってもよい。 FIG. 3A is a diagram illustrating an example of a block size of a coding tree block according to the first embodiment. The coding tree block is typically a 64 × 64 pixel block as shown in FIG. 3A. However, the present invention is not limited to this, and it may be a 32 × 32 pixel block, a 16 × 16 pixel block, an 8 × 8 pixel block, a 4 × 4 pixel block, or the like. Further, the coding tree block may not be a square, and may be, for example, an M × N size (M ≠ N) pixel block.

図３Ｂ〜図３Ｄは、第１実施形態のコーディングツリーブロックの具体例を示す図である。図３Ｂは、ブロックサイズが６４×６４（Ｎ＝３２）のコーディングツリーブロックを示している。Ｎは、基準となるコーディングツリーブロックのサイズを表しており、分割された場合のサイズがＮ、分割されない場合のサイズが２Ｎと定義されている。図３Ｃは、図３Ｂのコーディングツリーブロックを四分木分割したコーディングツリーブロックを示している。コーディングツリーブロックは、図３Ｃに示すように、四分木構造を持つ。コーディングツリーブロックが分割された場合、分割後の４つの画素ブロックに対して、図３Ｃに示すように、Ｚスキャン順で番号が付される。 3B to 3D are diagrams illustrating specific examples of the coding tree block according to the first embodiment. FIG. 3B shows a coding tree block having a block size of 64 × 64 (N = 32). N represents the size of the reference coding tree block. The size when divided is defined as N, and the size when not divided is defined as 2N. FIG. 3C shows a coding tree block obtained by dividing the coding tree block of FIG. 3B into quadtrees. The coding tree block has a quadtree structure as shown in FIG. 3C. When the coding tree block is divided, the four pixel blocks after the division are numbered in the Z-scan order as shown in FIG. 3C.

なお、コーディングツリーブロックは、１つの四分木の番号内で更に四分木分割することができる。これにより、コーディングツリーブロックを階層的に分割することができる。この場合、分割の深さは、Ｄｅｐｔｈで定義される。図３Ｄは、図３Ｂのコーディングツリーブロックを四分木分割したコーディングツリーブロックの１つを示し、ブロックサイズが３２×３２（Ｎ＝１６）となっている。図３Ｂに示すコーディングツリーブロックのＤｅｐｔｈは０であり、図３Ｄに示すコーディングツリーブロックのＤｅｐｔｈは、１である。なお、最もユニットが大きいコーディングツリーブロックは、ラージコーディングツリーブロックと呼ばれ、この単位で入力画像信号がラスタースキャン順に符号化される。 The coding tree block can be further divided into quadtrees within one quadtree number. Thereby, a coding tree block can be divided | segmented hierarchically. In this case, the depth of division is defined by Depth. FIG. 3D shows one of the coding tree blocks obtained by dividing the coding tree block of FIG. 3B into quadtrees, and the block size is 32 × 32 (N = 16). The Depth of the coding tree block shown in FIG. 3B is 0, and the Depth of the coding tree block shown in FIG. The coding tree block having the largest unit is called a large coding tree block, and the input image signal is encoded in this unit in the raster scan order.

以降の説明では、入力画像の符号化対象ブロック又はコーディングツリーブロックを予測対象ブロック又は予測画素ブロックと称することもある。なお、符号化単位は画素ブロックに限らず、フレーム、フィールド、スライス、ライン、及び画素の少なくともいずれかを用いることもできる。 In the following description, the encoding target block or coding tree block of the input image may be referred to as a prediction target block or a prediction pixel block. The encoding unit is not limited to a pixel block, and at least one of a frame, a field, a slice, a line, and a pixel can be used.

符号化装置１００は、図１に示すように、減算部１０１と、直交変換部１０２と、量子化部１０３と、逆量子化部１０４と、逆直交変換部１０５と、加算部１０６と、予測画像生成部１０７と、インデックス設定部１０８と、動き評価部１０９と、符号化部１１０とを、備える。なお、図１に示す符号化制御部１１１は、符号化装置１００を制御するものであり、例えば、ＣＰＵ（Central Processing Unit）などにより実現できる。 As illustrated in FIG. 1, the encoding device 100 includes a subtraction unit 101, an orthogonal transformation unit 102, a quantization unit 103, an inverse quantization unit 104, an inverse orthogonal transformation unit 105, an addition unit 106, and a prediction The image generation unit 107, the index setting unit 108, the motion evaluation unit 109, and the encoding unit 110 are provided. Note that the encoding control unit 111 illustrated in FIG. 1 controls the encoding apparatus 100 and can be realized by, for example, a CPU (Central Processing Unit).

減算部１０１は、画素ブロックに分割された入力画像から対応する予測画像を減算して予測誤差を得る。減算部１０１は、予測誤差を出力し、直交変換部１０２に入力する。 The subtraction unit 101 subtracts the corresponding prediction image from the input image divided into pixel blocks to obtain a prediction error. The subtraction unit 101 outputs a prediction error and inputs it to the orthogonal transform unit 102.

直交変換部１０２は、減算部１０１から入力された予測誤差に対して、例えば、離散コサイン変換（ＤＣＴ）又は離散サイン変換（ＤＳＴ）のような直交変換を行い、変換係数を得る。直交変換部１０２は、変換係数を出力し、量子化部１０３に入力する。 The orthogonal transform unit 102 performs orthogonal transform such as discrete cosine transform (DCT) or discrete sine transform (DST) on the prediction error input from the subtraction unit 101 to obtain transform coefficients. The orthogonal transform unit 102 outputs transform coefficients and inputs them to the quantization unit 103.

量子化部１０３は、直交変換部１０２から入力された変換係数に対して量子化処理を行い、量子化変換係数を得る。具体的には、量子化部１０３は、符号化制御部１１１によって指定される量子化パラメータや量子化マトリクスなどの量子化情報に従って量子化を行う。より詳細には、量子化部１０３は、変換係数を量子化情報によって導出される量子化ステップサイズで除算し、量子化変換係数を得る。量子化パラメータは、量子化の細かさを示す。量子化マトリクスは、量子化の細かさを変換係数の成分毎に重み付けするために使用される。量子化部１０３は、量子化変換係数を出力し、逆量子化部１０４及び符号化部１１０に入力する。 The quantization unit 103 performs a quantization process on the transform coefficient input from the orthogonal transform unit 102 to obtain a quantized transform coefficient. Specifically, the quantization unit 103 performs quantization according to quantization information such as a quantization parameter or a quantization matrix specified by the encoding control unit 111. More specifically, the quantization unit 103 divides the transform coefficient by the quantization step size derived from the quantization information to obtain a quantized transform coefficient. The quantization parameter indicates the fineness of quantization. The quantization matrix is used for weighting the fineness of quantization for each component of the transform coefficient. The quantization unit 103 outputs the quantized transform coefficient and inputs it to the inverse quantization unit 104 and the encoding unit 110.

逆量子化部１０４は、量子化部１０３から入力された量子化変換係数に対して逆量子化処理を行い、復元変換係数を得る。具体的には、逆量子化部１０４は、量子化部１０３において使用された量子化情報に従って逆量子化を行う。より詳細には、逆量子化部１０４は、量子化情報によって導出された量子化ステップサイズを量子化変換係数に乗算し、復元変換係数を得る。なお、量子化部１０３において使用された量子化情報は、符号化制御部１１１の図示せぬ内部メモリからロードされて利用される。逆量子化部１０４は、復元変換係数を出力し、逆直交変換部１０５に入力する。 The inverse quantization unit 104 performs inverse quantization processing on the quantized transform coefficient input from the quantization unit 103 to obtain a restored transform coefficient. Specifically, the inverse quantization unit 104 performs inverse quantization according to the quantization information used in the quantization unit 103. More specifically, the inverse quantization unit 104 multiplies the quantization transform coefficient by the quantization step size derived from the quantization information to obtain a restored transform coefficient. Note that the quantization information used in the quantization unit 103 is loaded from an internal memory (not shown) of the encoding control unit 111 and used. The inverse quantization unit 104 outputs the restored transform coefficient and inputs it to the inverse orthogonal transform unit 105.

逆直交変換部１０５は、逆量子化部１０４から入力された復元変換係数に対して、例えば、逆離散コサイン変換（ＩＤＣＴ）又は逆離散サイン変換（ＩＤＳＴ）などのような逆直交変換を行い、復元予測誤差を得る。なお、逆直交変換部１０５が行う逆直交変換は、直交変換部１０２において行われた直交変換に対応する。逆直交変換部１０５は、復元予測誤差を出力し、加算部１０６に入力する。 The inverse orthogonal transform unit 105 performs inverse orthogonal transform such as inverse discrete cosine transform (IDCT) or inverse discrete sine transform (IDST) on the restored transform coefficient input from the inverse quantization unit 104, Get the restoration prediction error. Note that the inverse orthogonal transform performed by the inverse orthogonal transform unit 105 corresponds to the orthogonal transform performed by the orthogonal transform unit 102. The inverse orthogonal transform unit 105 outputs the reconstruction prediction error and inputs it to the addition unit 106.

加算部１０６は、逆直交変換部１０５から入力された復元予測誤差と対応する予測画像とを加算し、局所復号画像を生成する。加算部１０６は、局所復号画像を出力し、予測画像生成部１０７に入力する。 The adding unit 106 adds the restored prediction error input from the inverse orthogonal transform unit 105 and the corresponding predicted image to generate a locally decoded image. The adding unit 106 outputs the local decoded image and inputs it to the predicted image generation unit 107.

予測画像生成部１０７は、加算部１０６から入力された局所復号画像を参照画像としてメモリ（図１では図示省略）に蓄積し、メモリに蓄積した参照画像を出力し、動き評価部１０９に入力する。また予測画像生成部１０７は、動き評価部１０９から入力される動き情報及びＷＰパラメータ情報に基づいて重み付き動き補償予測を行い、予測画像を生成する。予測画像生成部１０７は、予測画像を出力し、減算部１０１及び加算部１０６に入力する。 The predicted image generation unit 107 stores the locally decoded image input from the addition unit 106 as a reference image in a memory (not shown in FIG. 1), outputs the reference image stored in the memory, and inputs the reference image to the motion evaluation unit 109. . Further, the predicted image generation unit 107 performs weighted motion compensation prediction based on the motion information and WP parameter information input from the motion evaluation unit 109, and generates a predicted image. The predicted image generation unit 107 outputs a predicted image and inputs it to the subtracting unit 101 and the adding unit 106.

図４は、第１実施形態の予測画像生成部１０７の構成の一例を示すブロック図である。予測画像生成部１０７は、図４に示すように、複数フレーム動き補償部２０１と、メモリ２０２と、単方向動き補償部２０３と、予測パラメータ制御部２０４と、参照画像セレクタ２０５と、フレームメモリ２０６と、参照画像制御部２０７と、を備える。 FIG. 4 is a block diagram illustrating an example of the configuration of the predicted image generation unit 107 of the first embodiment. As shown in FIG. 4, the predicted image generation unit 107 includes a multi-frame motion compensation unit 201, a memory 202, a unidirectional motion compensation unit 203, a prediction parameter control unit 204, a reference image selector 205, and a frame memory 206. And a reference image control unit 207.

フレームメモリ２０６は、参照画像制御部２０７の制御の下、加算部１０６から入力された局所復号画像を参照画像として格納する。フレームメモリ２０６は、参照画像を一時保持するための複数のメモリセットＦＭ１〜ＦＭＮ（Ｎ≧２）を有する。 The frame memory 206 stores the locally decoded image input from the addition unit 106 as a reference image under the control of the reference image control unit 207. The frame memory 206 has a plurality of memory sets FM1 to FMN (N ≧ 2) for temporarily storing reference images.

予測パラメータ制御部２０４は、動き評価部１０９から入力される動き情報に基づいて、参照画像番号と予測パラメータとの複数の組み合わせをテーブルとして用意している。ここで、動き情報とは、動き補償予測で用いられる動きのズレ量を示す動きベクトルや参照画像番号、単方向／双方向予測などの予測モードに関する情報などを指す。予測パラメータは、動きベクトル及び予測モードに関する情報を指す。そして予測パラメータ制御部２０４は、入力画像に基づいて、予測画像の生成に用いる参照画像番号と予測パラメータとの組み合わせを選択して出力し、参照画像番号を参照画像セレクタ２０５に入力し、予測パラメータを単方向動き補償部２０３に入力する。 The prediction parameter control unit 204 prepares a plurality of combinations of reference image numbers and prediction parameters as a table based on the motion information input from the motion evaluation unit 109. Here, the motion information indicates a motion vector indicating a shift amount of motion used in motion compensation prediction, a reference image number, information on a prediction mode such as unidirectional / bidirectional prediction, and the like. The prediction parameter refers to information regarding a motion vector and a prediction mode. Then, the prediction parameter control unit 204 selects and outputs a combination of the reference image number and the prediction parameter used for generating the prediction image based on the input image, inputs the reference image number to the reference image selector 205, and outputs the prediction parameter. Is input to the unidirectional motion compensation unit 203.

参照画像セレクタ２０５は、フレームメモリ２０６が有するフレームメモリＦＭ１〜ＦＭＮのいずれの出力端を接続するかを、予測パラメータ制御部２０４から入力された参照画像番号に従って切り替えるスイッチである。参照画像セレクタ２０５は、例えば、参照画像番号が０であれば、ＦＭ１の出力端を参照画像セレクタ２０５の出力端に接続し、参照画像番号がＮ−１であれば、ＦＭＮの出力端を参照画像セレクタ２０５の出力端に接続する。参照画像セレクタ２０５は、フレームメモリ２０６が有するフレームメモリＦＭ１〜ＦＭＮのうち、出力端が接続されているフレームメモリに格納されている参照画像を出力し、単方向動き補償部２０３及び動き評価部１０９へ入力する。 The reference image selector 205 is a switch for switching which output terminal of the frame memories FM1 to FMN included in the frame memory 206 is connected according to the reference image number input from the prediction parameter control unit 204. For example, if the reference image number is 0, the reference image selector 205 connects the output end of FM1 to the output end of the reference image selector 205. If the reference image number is N-1, the reference image selector 205 refers to the output end of the FMN. Connected to the output terminal of the image selector 205. The reference image selector 205 outputs the reference image stored in the frame memory to which the output terminal is connected among the frame memories FM1 to FMN included in the frame memory 206, and the unidirectional motion compensation unit 203 and the motion evaluation unit 109. To enter.

単方向予測動き補償部２０３は、予測パラメータ制御部２０４から入力された予測パラメータと参照画像セレクタ２０５から入力された参照画像に従って、動き補償予測処理を行い、単方向予測画像を生成する。 The unidirectional prediction motion compensation unit 203 performs a motion compensation prediction process according to the prediction parameter input from the prediction parameter control unit 204 and the reference image input from the reference image selector 205, and generates a unidirectional prediction image.

図５は、第１実施形態の双方向予測における動き補償予測の動きベクトルの関係の一例を示す図である。動き補償予測では、参照画像を用いて補間処理が行われ、作成された補間画像と入力画像との符号化対象位置の画素ブロックからの動きのズレ量を元に単方向予測画像が生成される。ここで、ズレ量は、動きベクトルである。図５に示すように、双方向予測スライス（Ｂ−ｓｌｉｃｅ）では、２種類の参照画像と動きベクトルのセットを用いて予測画像が生成される。補間処理としては、１／２画素精度の補間処理や、１／４画素精度の補間処理などが用いられ、参照画像に対してフィルタリング処理が行われることによって、補間画像の値が生成される。例えば、輝度信号に対して１／４画素精度までの補間処理な可能なＨ．２６４では、ズレ量は整数画素精度の４倍で表現される。 FIG. 5 is a diagram illustrating an example of a motion vector relationship of motion compensation prediction in bidirectional prediction according to the first embodiment. In motion compensated prediction, interpolation processing is performed using a reference image, and a unidirectional prediction image is generated based on the amount of motion deviation from the pixel block at the encoding target position between the generated interpolation image and the input image. . Here, the amount of deviation is a motion vector. As shown in FIG. 5, in a bi-directional prediction slice (B-slice), a prediction image is generated using a set of two types of reference images and motion vectors. As the interpolation process, an interpolation process with 1/2 pixel accuracy, an interpolation process with 1/4 pixel accuracy, or the like is used, and the value of the interpolation image is generated by performing the filtering process on the reference image. For example, H.P. capable of performing interpolation processing up to 1/4 pixel accuracy on a luminance signal. In H.264, the shift amount is expressed by four times the integer pixel accuracy.

単方向予測動き補償部２０３は、単方向予測画像を出力し、メモリ２０２に一時的に格納する。ここで、動き情報（予測パラメータ）が双方向予測を示す場合には、複数フレーム動き補償部２０１が２種類の単方向予測画像を用いて重み付き予測を行うため、単方向予測動き補償部２０３は、１つ目に対応する単方向予測画像をメモリ２０２に格納し、２つ目に対応する単法予測画像を複数フレーム動き補償部２０１に直接出力する。ここでは、１つ目に対応する単方向予測画像を第一予測画像とし、２つ目に対応する単方向予測画像を第二予測画像とする。 The unidirectional prediction motion compensation unit 203 outputs a unidirectional prediction image and temporarily stores it in the memory 202. Here, when the motion information (prediction parameter) indicates bidirectional prediction, the multi-frame motion compensation unit 201 performs weighted prediction using two types of unidirectional prediction images, and thus the unidirectional prediction motion compensation unit 203. Stores the unidirectional predicted image corresponding to the first in the memory 202, and directly outputs the unidirectional predicted image corresponding to the second to the multi-frame motion compensation unit 201. Here, the first unidirectional prediction image corresponding to the first is the first prediction image, and the second unidirectional prediction image is the second prediction image.

なお、単方向動き補償部２０３を２つ用意し、それぞれが２つの単方向予測画像を生成するようにしてもよい。この場合、動き情報（予測パラメータ）が単方向予測を示すときには、単方向動き補償部２０３が、１つ目の単方向予測画像を第一予測画像として複数フレーム動き補償部２０１に直接出力すればよい。 Two unidirectional motion compensation units 203 may be prepared, and each may generate two unidirectional prediction images. In this case, when the motion information (prediction parameter) indicates unidirectional prediction, the unidirectional motion compensation unit 203 directly outputs the first unidirectional prediction image as the first prediction image to the multi-frame motion compensation unit 201. Good.

複数フレーム動き補償部２０１は、メモリ２０２から入力される第一予測画像、単方向予測動き補償部２０３から入力される第二予測画像、及び動き評価部１０９から入力されるＷＰパラメータ情報を用いて、重み付き予測を行って予測画像を生成する。複数フレーム動き補償部２０１は、予測画像を出力し、減算部１０１及び加算部１０６に入力する。 The multi-frame motion compensation unit 201 uses the first prediction image input from the memory 202, the second prediction image input from the unidirectional prediction motion compensation unit 203, and the WP parameter information input from the motion evaluation unit 109. Then, a prediction image is generated by performing weighted prediction. The multi-frame motion compensation unit 201 outputs a prediction image and inputs the prediction image to the subtraction unit 101 and the addition unit 106.

図６は、第１実施形態の複数フレーム動き補償部２０１の構成の一例を示すブロック図である。複数フレーム動き補償部２０１は、図６に示すように、デフォルト動き補償部３０１と、重み付き動き補償部３０２と、ＷＰパラメータ制御部３０３と、ＷＰセレクタ３０４、３０５とを、備える。 FIG. 6 is a block diagram illustrating an example of the configuration of the multi-frame motion compensation unit 201 according to the first embodiment. As shown in FIG. 6, the multi-frame motion compensation unit 201 includes a default motion compensation unit 301, a weighted motion compensation unit 302, a WP parameter control unit 303, and WP selectors 304 and 305.

ＷＰパラメータ制御部３０３は、動き評価部１０９から入力されるＷＰパラメータ情報に基づいて、ＷＰ適用フラグ及び重み情報を出力し、ＷＰ適用フラグをＷＰセレクタ３０４、３０５に入力し、重み情報を重み付き動き補償部３０２に入力する。 The WP parameter control unit 303 outputs a WP application flag and weight information based on the WP parameter information input from the motion evaluation unit 109, inputs the WP application flag to the WP selectors 304 and 305, and weights the weight information. Input to the motion compensation unit 302.

ここで、ＷＰパラメータ情報は、重み係数の固定小数点精度、第一予測画像に対応する第一ＷＰ適用フラグ，第一重み係数，及び第一オフセット、並びに第二予測画像に対応する第二ＷＰ適応フラグ，第二重み係数，及び第二オフセットの情報を含む。ＷＰ適用フラグは、該当する参照画像及び信号成分毎に設定可能なパラメータであり、重み付き動き補償予測を行うかどうかを示す。重み情報は、重み係数の固定小数点精度、第一重み係数、第一オフセット、第二重み係数、及び第二オフセットの情報を含む。 Here, the WP parameter information includes the fixed-point precision of the weighting factor, the first WP application flag corresponding to the first predicted image, the first weighting factor, the first offset, and the second WP adaptation corresponding to the second predicted image. Contains information on flag, second weighting factor, and second offset. The WP application flag is a parameter that can be set for each corresponding reference image and signal component, and indicates whether to perform weighted motion compensation prediction. The weight information includes information on the fixed point precision of the weight coefficient, the first weight coefficient, the first offset, the second weight coefficient, and the second offset.

詳細には、ＷＰパラメータ制御部３０３は、動き評価部１０９からＷＰパラメータ情報が入力されると、ＷＰパラメータ情報を第一ＷＰ適用フラグ、第二ＷＰ適用フラグ、及び重み情報に分離して出力し、第一ＷＰ適用フラグをＷＰセレクタ３０４に入力し、第二ＷＰ適用フラグをＷＰセレクタ３０５に入力し、重み情報を重み付き動き補償部３０２に入力する。 Specifically, when WP parameter information is input from the motion evaluation unit 109, the WP parameter control unit 303 separates the WP parameter information into a first WP application flag, a second WP application flag, and weight information, and outputs them. The first WP application flag is input to the WP selector 304, the second WP application flag is input to the WP selector 305, and the weight information is input to the weighted motion compensation unit 302.

ＷＰセレクタ３０４、３０５は、ＷＰパラメータ制御部３０３から入力されたＷＰ適用フラグに基づいて、各々の予測画像の接続端を切り替える。ＷＰセレクタ３０４、３０５は、各々のＷＰ適用フラグが０の場合、各々の出力端をデフォルト動き補償部３０１へ接続する。そしてＷＰセレクタ３０４、３０５は、第一予測画像及び第二予測画像を出力し、デフォルト動き補償部３０１に入力する。一方、ＷＰセレクタ３０４、３０５は、各々のＷＰ適用フラグが１の場合、各々の出力端を重み付き動き補償部３０２へ接続する。そしてＷＰセレクタ３０４、３０５は、第一予測画像及び第二予測画像を出力し、重み付き動き補償部３０２に入力する。 The WP selectors 304 and 305 switch the connection end of each predicted image based on the WP application flag input from the WP parameter control unit 303. When each WP application flag is 0, the WP selectors 304 and 305 connect each output terminal to the default motion compensation unit 301. Then, the WP selectors 304 and 305 output the first predicted image and the second predicted image and input them to the default motion compensation unit 301. On the other hand, when each WP application flag is 1, the WP selectors 304 and 305 connect each output terminal to the weighted motion compensation unit 302. The WP selectors 304 and 305 output the first predicted image and the second predicted image, and input them to the weighted motion compensation unit 302.

デフォルト動き補償部３０１は、ＷＰセレクタ３０４、３０５から入力された２つの単方向予測画像（第一予測画像及び第二予測画像）を元に平均値処理を行い、予測画像を生成する。具体的には、デフォルト動き補償部３０１は、第一ＷＰ適用フラグ及び第二ＷＰ適用フラグが０の場合、数式（１）に基づいて平均値処理を行う。 The default motion compensation unit 301 performs an average value process based on the two unidirectional predicted images (first predicted image and second predicted image) input from the WP selectors 304 and 305 to generate a predicted image. Specifically, when the first WP application flag and the second WP application flag are 0, the default motion compensation unit 301 performs average value processing based on Expression (1).

Ｐ［ｘ，ｙ］＝Ｃｌｉｐ１（（ＰＬ０［ｘ，ｙ］＋ＰＬ１［ｘ，ｙ］＋ｏｆｆｓｅｔ２）＞＞（ｓｈｉｆｔ２）) …（１） P [x, y] = Clip1 ((PL0 [x, y] + PL1 [x, y] + offset2) >> (shift2)) (1)

ここで、Ｐ［ｘ，ｙ］は予測画像、ＰＬ０［ｘ，ｙ］は第一予測画像、ＰＬ１［ｘ，ｙ］は第二予測画像である。ｏｆｆｓｅｔ２及びｓｈｉｆｔ２は平均値処理における丸め処理のパラメータであり、第一予測画像及び第二予測画像の内部演算精度によって定まる。予測画像のビット精度をＬ、第一予測画像及び第二予測画像のビット精度をＭ（Ｌ≦Ｍ）とすると、ｓｈｉｆｔ２は数式（２）で定式化され、ｏｆｆｓｅｔ２は数式（３）で定式化される。 Here, P [x, y] is a predicted image, PL0 [x, y] is a first predicted image, and PL1 [x, y] is a second predicted image. offset2 and shift2 are parameters of the rounding process in the average value process, and are determined by the internal calculation accuracy of the first predicted image and the second predicted image. If the bit accuracy of the prediction image is L and the bit accuracy of the first prediction image and the second prediction image is M (L ≦ M), shift2 is formulated by Equation (2), and offset2 is formulated by Equation (3). Is done.

ｓｈｉｆｔ２＝（Ｍ−Ｌ＋１） …（２） shift2 = (ML + 1) (2)

ｏｆｆｓｅｔ２＝（１＜＜（ｓｈｉｆｔ２−１） …（３） offset2 = (1 << (shift2-1) (3)

例えば、予測画像のビット精度が８であり、第一予測画像及び第二予測画像のビット精度が１４である場合、数式（２）よりｓｈｉｆｔ２＝７、数式（３）よりｏｆｆｓｅｔ２＝（１＜＜６）となる。 For example, when the bit accuracy of the predicted image is 8 and the bit accuracy of the first predicted image and the second predicted image is 14, shift2 = 7 from Equation (2), and offset2 = (1 << from Equation (3). 6).

なお、動き情報（予測パラメータ）で示される予測モードが単方向予測である場合、デフォルト動き補償部３０１は、第一予測画像のみを用いて、数式（４）に基づいて最終的な予測画像を算出する。 When the prediction mode indicated by the motion information (prediction parameter) is unidirectional prediction, the default motion compensation unit 301 uses only the first prediction image and calculates a final prediction image based on Expression (4). calculate.

Ｐ［ｘ，ｙ］＝Ｃｌｉｐ１（（ＰＬＸ［ｘ，ｙ］＋ｏｆｆｓｅｔ１）＞＞（ｓｈｉｆｔ１）) …（４） P [x, y] = Clip1 ((PLX [x, y] + offset1) >> (shift1)) (4)

ここで、ＰＬＸ［ｘ，ｙ］は単方向予測画像（第一予測画像）を示しており、Ｘは参照リストの０又は１のいずれかを示す識別子である。例えば、参照リストが０の場合はＰＬ０［ｘ，ｙ］、参照リストが１の場合はＰＬ１［ｘ，ｙ］となる。ｏｆｆｓｅｔ１及びｓｈｉｆｔ１は丸め処理のパラメータであり、第一予測画像の内部演算精度によって定まる。予測画像のビット精度をＬ、第一予測画像のビット精度をＭとすると、ｓｈｉｆｔ１は数式（５）で定式化され、ｏｆｆｓｅｔ１は数式（６）で定式化される。 Here, PLX [x, y] indicates a unidirectional prediction image (first prediction image), and X is an identifier indicating either 0 or 1 in the reference list. For example, when the reference list is 0, PL0 [x, y] is obtained, and when the reference list is 1, PL1 [x, y] is obtained. offset1 and shift1 are parameters of the rounding process, and are determined by the internal calculation accuracy of the first predicted image. When the bit accuracy of the predicted image is L and the bit accuracy of the first predicted image is M, shift1 is formulated by Equation (5), and offset1 is formulated by Equation (6).

ｓｈｉｆｔ１＝（Ｍ−Ｌ） …（５） shift1 = (ML) (5)

ｏｆｆｓｅｔ１＝（１＜＜（ｓｈｉｆｔ１−１） …（６） offset1 = (1 << (shift1-1) (6)

例えば、予測画像のビット精度が８であり、第一予測画像のビット精度が１４である場合、数式（５）よりｓｈｉｆｔ１＝６、数式（６）よりｏｆｆｓｅｔ１＝（１＜＜５）となる。 For example, when the bit accuracy of the predicted image is 8 and the bit accuracy of the first predicted image is 14, shift1 = 6 from Equation (5) and offset1 = (1 << 5) from Equation (6).

重み付き動き補償部３０２は、ＷＰセレクタ３０４、３０５から入力された２つの単方向予測画像（第一予測画像及び第二予測画像）とＷＰパラメータ制御部３０３から入力された重み情報とを元に重み付き動き補償を行う。具体的には、重み付き動き補償部３０２は、第一ＷＰ適用フラグ及び第二ＷＰ適用フラグが１の場合、数式（７）に基づいて重み付き処理を行う。 The weighted motion compensation unit 302 is based on the two unidirectional prediction images (first prediction image and second prediction image) input from the WP selectors 304 and 305 and the weight information input from the WP parameter control unit 303. Performs weighted motion compensation. Specifically, when the first WP application flag and the second WP application flag are 1, the weighted motion compensation unit 302 performs the weighting process based on Expression (7).

Ｐ［ｘ，ｙ］＝Ｃｌｉｐ１（（（ＰＬ０［ｘ，ｙ］＊ｗ０Ｃ＋ＰＬ１［ｘ，ｙ］＊ｗ１Ｃ＋（１＜＜ｌｏｇＷＤＣ））＞＞（ｌｏｇＷＤＣ＋１））＋（（ｏ０Ｃ＋ｏ１Ｃ＋１）＞＞１）） …（７） P [x, y] = Clip1 (((PL0 [x, y] * w0C + PL1 [x, y] * w1C + (1 << logWDC)) >> (logWDC + 1)) + ((o0C + o1C + 1) >> 1)) (7)

ここで、ｗ０Ｃは第一予測画像に対応する重み係数、ｗ１Ｃは第二予測画像に対応する重み係数、ｏ０Ｃは第一予測画像に対応するオフセット、ｏ１Ｃは第二予測画像に対応するオフセットを表す。以後、それぞれを第一重み係数、第二重み係数、第一オフセット、第二オフセットと呼ぶ。ｌｏｇＷＤＣはそれぞれの重み係数の固定小数点精度を示すパラメータである。変数Ｃは、信号成分を意味する。例えば、ＹＵＶ空間信号の場合、輝度信号をＣ＝Ｙとし、Ｃｒ色差信号をＣ＝Ｃｒ、Ｃｂ色差成分をＣ＝Ｃｂと表す。 Here, w0C represents a weighting factor corresponding to the first predicted image, w1C represents a weighting factor corresponding to the second predicted image, o0C represents an offset corresponding to the first predicted image, and o1C represents an offset corresponding to the second predicted image. . Hereinafter, these are referred to as a first weighting factor, a second weighting factor, a first offset, and a second offset, respectively. logWDC is a parameter indicating the fixed-point precision of each weighting factor. The variable C means a signal component. For example, in the case of a YUV spatial signal, the luminance signal is C = Y, the Cr color difference signal is C = Cr, and the Cb color difference component is C = Cb.

なお、重み付き動き補償部３０２は、第一予測画像及び第二予測画像と予測画像との演算精度が異なる場合、固定小数点精度であるｌｏｇＷＤＣを数式（８）のように制御することで丸め処理を実現する。 Note that the weighted motion compensation unit 302 performs rounding processing by controlling logWDC, which is fixed-point precision, as in Expression (8) when the calculation accuracy of the first predicted image, the second predicted image, and the predicted image is different. To realize.

ｌｏｇＷＤ’Ｃ＝ｌｏｇＷＤＣ＋ｏｆｆｓｅｔ１ …（８） logWD'C = logWDC + offset1 (8)

丸め処理は、数式（７）のｌｏｇＷＤＣを、数式（８）のｌｏｇＷＤ’Ｃに置き換えることで実現できる。例えば、予測画像のビット精度が８であり、第一予測画像及び第二予測画像のビット精度が１４である場合、ｌｏｇＷＤＣを再設定することにより、数式（１）のｓｈｉｆｔ２と同様の演算精度における一括丸め処理を実現することが可能となる。 The rounding process can be realized by replacing logWDC in Expression (7) with logWD′C in Expression (8). For example, when the bit accuracy of the predicted image is 8 and the bit accuracy of the first predicted image and the second predicted image is 14, by resetting logWDC, the calculation accuracy similar to shift2 in Equation (1) can be obtained. A batch rounding process can be realized.

なお、動き情報（予測パラメータ）で示される予測モードが単方向予測である場合、重み付き動き補償部３０２は、第一予測画像のみを用いて、数式（９）に基づいて最終的な予測画像を算出する。 When the prediction mode indicated by the motion information (prediction parameter) is unidirectional prediction, the weighted motion compensation unit 302 uses only the first predicted image and uses the final predicted image based on Equation (9). Is calculated.

Ｐ［ｘ，ｙ］＝Ｃｌｉｐ１（（ＰＬＸ［ｘ，ｙ］＊ｗＸＣ＋（１＜＜ｌｏｇＷＤＣ−１））＞＞（ｌｏｇＷＤＣ）） …（９） P [x, y] = Clip1 ((PLX [x, y] * wXC + (1 << logWDC-1)) >> (logWDC)) (9)

ここで、ＰＬＸ［ｘ，ｙ］は単方向予測画像（第一予測画像）を示し、ｗＸＣは単方向予測に対応する重み係数を示しており、Ｘは参照リストの０又は１のいずれかを示す識別子である。例えば、参照リストが０の場合はＰＬ０［ｘ，ｙ］、ｗ０Ｃ、参照リストが１の場合はＰＬ１［ｘ，ｙ］、ｗ１Ｃとなる。 Here, PLX [x, y] indicates a unidirectional prediction image (first prediction image), wXC indicates a weighting factor corresponding to unidirectional prediction, and X indicates either 0 or 1 in the reference list. It is an identifier to indicate. For example, when the reference list is 0, PL0 [x, y] and w0C are obtained, and when the reference list is 1, PL1 [x, y] and w1C are obtained.

なお、重み付き動き補償部３０２は、第一予測画像及び第二予測画像と予測画像との演算精度が異なる場合、固定小数点精度であるｌｏｇＷＤＣを双方向予測時と同様に数式（８）のように制御することで丸め処理を実現する。 Note that the weighted motion compensation unit 302, when the calculation accuracy of the first prediction image, the second prediction image, and the prediction image is different, uses the log WDC, which is a fixed-point accuracy, as in Expression (8) as in the bidirectional prediction. Rounding processing is realized by controlling to.

丸め処理は、数式（７）のｌｏｇＷＤＣを、数式（８）のｌｏｇＷＤ’Ｃに置き換えることで実現できる。例えば、予測画像のビット精度が８であり、第一予測画像のビット精度が１４である場合、ｌｏｇＷＤＣを再設定することにより、数式（４）のｓｈｉｆｔ１と同様の演算精度における一括丸め処理を実現することが可能となる。 The rounding process can be realized by replacing logWDC in Expression (7) with logWD′C in Expression (8). For example, when the bit accuracy of the predicted image is 8 and the bit accuracy of the first predicted image is 14, the round rounding process with the same calculation accuracy as that of shift1 of Expression (4) is realized by resetting logWDC It becomes possible to do.

図７は、第１実施形態における重み係数の固定小数点精度の一例の説明図であり、時間方向の明度変化がある動画像と階調値との変化の一例を示す図である。図７に示す例では、符号化対象フレームをＦｒａｍｅ（ｔ）とし、時間的に１つ前のフレームをＦｒａｍｅ（ｔ−１）、時間的に１つ後のフレームをＦｒａｍｅ（ｔ＋１）としている。図７に示すように、白から黒に変化するフェード画像では、画像の明度（階調値）が時間とともに減少していく。重み係数は、図７における変化の度合いを意味しており、数式（７）及び数式（９）から明らかなように、明度変化がない場合に１．０の値を取る。固定小数点精度は、重み係数の小数点に対応する刻み幅を制御するパラメータであり、明度変化がない場合の重み係数は、１＜＜ｌｏｇＷＤＣとなる。 FIG. 7 is an explanatory diagram of an example of the fixed-point precision of the weighting coefficient in the first embodiment, and is a diagram illustrating an example of a change between a moving image having a brightness change in the time direction and a gradation value. In the example shown in FIG. 7, the encoding target frame is Frame (t), the previous frame in time is Frame (t−1), and the next frame in time is Frame (t + 1). As shown in FIG. 7, in a fade image that changes from white to black, the brightness (gradation value) of the image decreases with time. The weighting coefficient means the degree of change in FIG. 7 and takes a value of 1.0 when there is no change in brightness, as is clear from Equation (7) and Equation (9). The fixed-point precision is a parameter for controlling the step size corresponding to the decimal point of the weighting coefficient, and the weighting coefficient when there is no change in brightness is 1 << logWDC.

なお、単方向予測の場合には、第二予測画像に対応する各種パラメータ（第二ＷＰ適応フラグ，第二重み係数，及び第二オフセットの情報）は利用されないため、予め定めた初期値に設定されていてもよい。 In the case of unidirectional prediction, various parameters (second WP adaptive flag, second weighting factor, and second offset information) corresponding to the second predicted image are not used, and are set to predetermined initial values. May be.

図１に戻り、動き評価部１０９は、入力画像と予測画像生成部１０７から入力された参照画像とに基づき複数フレーム間の動き評価を行い、動き情報及びＷＰパラメータ情報を出力し、動き情報を予測画像生成部１０７及び符号化部１１０に入力し、ＷＰパラメータ情報を予測画像生成部１０７及びインデックス設定部１０８に入力する。 Returning to FIG. 1, the motion evaluation unit 109 performs motion evaluation between a plurality of frames based on the input image and the reference image input from the predicted image generation unit 107, outputs motion information and WP parameter information, and stores the motion information. The prediction image generation unit 107 and the encoding unit 110 are input, and the WP parameter information is input to the prediction image generation unit 107 and the index setting unit 108.

動き評価部１０９は、例えば、予測対象画素ブロックの入力画像と同位置に対応する複数の参照画像を起点として差分値を計算することで誤差を算出し、この位置を分数精度でずらし、誤差最小のブロックを探すブロックマッチングなどの手法により、最適な動き情報を算出する。動き評価部１０９は、双方向予測の場合には、単方向予測で導出された動き情報を用いて、数式（１）及び数式（４）に示すようなデフォルト動き補償予測を含むブロックマッチングを行うことにより、双方向予測の動き情報を算出する。 For example, the motion evaluation unit 109 calculates an error by calculating a difference value starting from a plurality of reference images corresponding to the same position as the input image of the prediction target pixel block, shifts the position with a fractional accuracy, and minimizes the error. Optimal motion information is calculated by a technique such as block matching for searching for a block. In the case of bidirectional prediction, the motion evaluation unit 109 performs block matching including default motion compensation prediction as shown in Equation (1) and Equation (4) using motion information derived by unidirectional prediction. Thus, motion information for bidirectional prediction is calculated.

この際、動き評価部１０９は、数式（７）及び数式（９）で示されるような重み付き動き補償予測を含むブロックマッチングを行うことにより、ＷＰパラメータ情報を算出できる。なお、ＷＰパラメータ情報の算出には、入力画像の明度勾配を用いて重み係数やオフセットを算出する方法や、符号化した際の予測誤差の累積による重み係数やオフセットの算出方法などを用いてもよい。またＷＰパラメータ情報は、符号化装置毎に予め定めた固定値を用いてもよい。 At this time, the motion evaluation unit 109 can calculate WP parameter information by performing block matching including weighted motion compensated prediction as represented by Equation (7) and Equation (9). Note that the WP parameter information may be calculated using a method of calculating a weighting factor or an offset using the brightness gradient of the input image, or a method of calculating a weighting factor or an offset by accumulating prediction errors when encoded. Good. The WP parameter information may be a fixed value determined in advance for each encoding device.

ここで、図７を参照しながら、時間的に明度変化のある動画像から、重み係数、重み係数の固定小数点精度、及びオフセットを算出する方法を説明する。前述したように、図７に示すような白から黒に変化するフェード画像では、画像の明度（階調値）が時間とともに減少していく。動き評価部１０９は、この傾きを計算することにより、重み係数を算出することができる。 Here, a method for calculating a weighting factor, a fixed point precision of the weighting factor, and an offset from a moving image having a temporal change in brightness will be described with reference to FIG. As described above, in a fade image that changes from white to black as shown in FIG. 7, the brightness (tone value) of the image decreases with time. The motion evaluation unit 109 can calculate a weighting coefficient by calculating this inclination.

また、重み係数の固定小数点精度は、この傾きの精度を示す情報であり、動き評価部１０９は、参照画像の時間的な距離と画像明度の変化度から、最適な値を計算できる。例えば、図７において、Ｆｒａｍｅ（ｔ−１）〜Ｆｒａｍｅ（ｔ＋１）間の重み係数が小数点精度で０．７５である場合、１／４精度であれば、３／４が表現できるため、動き評価部１０９は、固定小数点精度を２（１＜＜２）に設定する。固定小数点精度の値は、重み係数を符号化した場合の符号量に影響を与えるため、符号量と予測精度を考慮して最適な値を選択すればよい。なお、固定小数点精度の値は、予め定めた固定値としてもよい。 The fixed-point precision of the weighting coefficient is information indicating the precision of the slope, and the motion evaluation unit 109 can calculate an optimum value from the temporal distance of the reference image and the degree of change in image brightness. For example, in FIG. 7, when the weighting coefficient between Frame (t-1) and Frame (t + 1) is 0.75 in decimal point precision, 3/4 can be expressed with 1/4 precision. The unit 109 sets the fixed point precision to 2 (1 << 2). Since the value of the fixed-point precision affects the code amount when the weighting coefficient is encoded, an optimal value may be selected in consideration of the code amount and the prediction accuracy. Note that the fixed-point precision value may be a predetermined fixed value.

また、動き評価部１０９は、傾きが一致しない場合、一次関数の切片に対応する補正値（ズレ量）を求めることでオフセット値を算出できる。例えば、図７において、Ｆｒａｍｅ（ｔ−１）〜Ｆｒａｍｅ（ｔ＋１）間の重み係数が小数点精度で０．６０であり、固定小数点精度が１（１＜＜１）である場合、重み係数は１（つまり、重み係数の小数点精度０．５０に該当）が設定される可能性が高い。この場合、重み係数の小数点精度は、最適な値である０．６０から０．１０ずれているため、動き評価部１０９は、この分の補正値を画素の最大値から計算し、オフセット値として設定する。画素の最大値が２５５である場合、動き評価部１０９は、２５（２５５×０．１）などの値を設定すればよい。 Further, when the inclinations do not match, the motion evaluation unit 109 can calculate the offset value by obtaining a correction value (deviation amount) corresponding to the intercept of the linear function. For example, in FIG. 7, when the weighting coefficient between Frame (t−1) and Frame (t + 1) is 0.60 in decimal point precision and the fixed point precision is 1 (1 << 1), the weighting coefficient is 1 (That is, it corresponds to the decimal point precision 0.50 of the weighting factor) is likely to be set. In this case, since the decimal point accuracy of the weighting coefficient is deviated from 0.60, which is the optimum value, by 0.10, the motion evaluation unit 109 calculates the correction value for this amount from the maximum value of the pixel and uses it as an offset value. Set. When the maximum value of the pixel is 255, the motion evaluation unit 109 may set a value such as 25 (255 × 0.1).

なお第１実施形態では、符号化装置１００の一機能として動き評価部１０９を例示しているが、動き評価部１０９は符号化装置１００の必須の構成ではなく、例えば、動き評価部１０９を符号化装置１００外の装置としてもよい。この場合、動き評価部１０９で算出された動き情報及びＷＰパラメータ情報を符号化装置１００にロードするようにすればよい。 In the first embodiment, the motion evaluation unit 109 is illustrated as a function of the encoding device 100. However, the motion evaluation unit 109 is not an essential component of the encoding device 100. For example, the motion evaluation unit 109 is encoded. It is good also as an apparatus outside the conversion apparatus 100. In this case, the motion information and WP parameter information calculated by the motion evaluation unit 109 may be loaded into the encoding device 100.

インデックス設定部１０８は、動き評価部１０９から入力されたＷＰパラメータ情報を受け取り、参照リスト（リスト番号）と参照画像（参照番号）とを確認して、インデックス情報を出力し、符号化部１１０に入力する。インデックス設定部１０８は、動き評価部１０９から入力されたＷＰパラメータ情報を、後述するシンタクス要素にマッピングしてインデックス情報を生成する。 The index setting unit 108 receives the WP parameter information input from the motion evaluation unit 109, checks the reference list (list number) and the reference image (reference number), outputs the index information, and outputs the index information to the encoding unit 110. input. The index setting unit 108 generates index information by mapping the WP parameter information input from the motion evaluation unit 109 to syntax elements described later.

図８Ａ及び図８Ｂは、第１実施形態のＷＰパラメータ情報の一例を示す図である。Ｐ−ｓｌｉｃｅ時のＷＰパラメータ情報の一例は、図８Ａに示すとおりであり、Ｂ−ｓｌｉｃｅ時のＷＰパラメータ情報の一例は、図８Ａ及び図８Ｂに示すとおりである。リスト番号は、予測方向を示す識別子であり、単方向予測時は０の値を取り、双方向予測時は２種類の予測を用いることができるため、０と１の２つの値を取る。参照番号は、フレームメモリ２０６に示される１〜Ｎに対応する値である。ＷＰパラメータ情報は、参照リストと参照画像毎に保持されるため、Ｂ−ｓｌｉｃｅ時で必要な情報は、参照画像がＮ個とすると２Ｎ個となる。 8A and 8B are diagrams illustrating examples of WP parameter information according to the first embodiment. An example of WP parameter information at the time of P-slice is as shown in FIG. 8A, and an example of WP parameter information at the time of B-slice is as shown in FIGS. 8A and 8B. The list number is an identifier indicating the prediction direction, and takes a value of 0 when unidirectional prediction is used, and two values of 0 and 1 can be used when bidirectional prediction is used. The reference numbers are values corresponding to 1 to N indicated in the frame memory 206. Since the WP parameter information is held for each reference list and reference image, the information necessary for the B-slice is 2N when the number of reference images is N.

図１に戻り、符号化部１１０は、量子化部１０３から入力された量子化変換係数、動き評価部１０９から入力された動き情報、インデックス設定部１０８から入力されたインデックス情報、及び符号化制御部１１１によって指定される量子化情報などの様々な符号化パラメータに対して符号化処理を行い、符号化データを生成する。符号化処理は、例えば、ハフマン符号化や算術符号化などが該当する。 Returning to FIG. 1, the encoding unit 110 receives the quantized transform coefficient input from the quantization unit 103, the motion information input from the motion evaluation unit 109, the index information input from the index setting unit 108, and the encoding control. Encoding processing is performed on various encoding parameters such as quantization information specified by the unit 111 to generate encoded data. The encoding process corresponds to, for example, Huffman encoding or arithmetic encoding.

符号化パラメータとは、予測方法などを示す予測情報、量子化変換係数に関する情報、及び量子化に関する情報などの復号に必要となるパラメータである。例えば、符号化制御部１１１が図示せぬ内部メモリを持ち、この内部メモリに符号化パラメータが保持され、画素ブロックを符号化する際に隣接する既に符号化済みの画素ブロックの符号化パラメータを用いるようにできる。例えば、Ｈ．２６４のイントラ予測では、符号化済みの隣接ブロックの予測情報から、画素ブロックの予測情報を導出することができる。 The encoding parameter is a parameter required for decoding, such as prediction information indicating a prediction method, information on quantization transform coefficients, information on quantization, and the like. For example, the encoding control unit 111 has an internal memory (not shown), the encoding parameters are held in the internal memory, and the encoding parameters of the adjacent already encoded pixel block are used when encoding the pixel block. You can For example, H.M. In the H.264 intra prediction, the prediction information of the pixel block can be derived from the prediction information of the encoded adjacent block.

符号化部１１０は、生成した符号化データを、符号化制御部１１１が管理する適切な出力タイミングに従って出力する。出力された符号化データは、例えば、図示せぬ多重化部などで様々な情報が多重化されて、図示せぬ出力バッファなどに一時的に蓄積された後に、例えば、図示せぬ蓄積系（蓄積メディア）又は伝送系（通信回線）へ出力される。 The encoding unit 110 outputs the generated encoded data according to an appropriate output timing managed by the encoding control unit 111. The output encoded data is, for example, multiplexed with various information such as a multiplexing unit (not shown) and temporarily stored in an output buffer (not shown). (Storage medium) or transmission system (communication line).

符号化部１１０は、エントロピー符号化部１１０Ａと、インデックス再構成部１１０Ｂとを、備える。 The encoding unit 110 includes an entropy encoding unit 110A and an index reconstruction unit 110B.

エントロピー符号化部１１０Ａは、入力されてきた情報に対して可変長符号化や算術符号化などの符号化処理を行う。例えば、Ｈ．２６４では、コンテキスト適応型の可変長符号化（ＣＡＶＬＣ：ＣｏｎｔｅｘｔｂａｓｅｄＡｄａｐｔｉｖｅＶａｒｉａｂｌｅＬｅｎｇｔｈＣｏｄｉｎｇ）やコンテキスト適応型の算術符号化（ＣＡＢＡＣ：ＣｏｎｔｅｘｔｂａｓｅｄＡｄａｐｔｉｖｅＢｉｎａｒｙＡｒｉｔｈｍｅｔｉｃＣｏｄｉｎｇ）などが用いられる。 The entropy encoding unit 110A performs encoding processing such as variable length encoding and arithmetic encoding on the input information. For example, H.M. In H.264, context-adaptive variable-length coding (CAVLC) and context-adaptive arithmetic coding (CABAC: Context based Adaptive Binary Coding) are used.

インデックス再構成部１１０Ｂは、インデックス設定部１０８から入力されたインデックス情報のシンタクス要素の符号長を削減するため、シンタクス要素のパラメータの特徴に応じて予測処理を行い、シンタクス要素のそのままの値（直値）と予測値の差分値とを計算し、エントロピー符号化部１１０Ａに出力する。予測処理の具体例は、後述する。 In order to reduce the code length of the syntax element of the index information input from the index setting unit 108, the index reconstruction unit 110B performs a prediction process according to the characteristics of the parameter of the syntax element, and uses the value of the syntax element as it is (directly). Value) and the difference between the predicted values and output to the entropy encoding unit 110A. A specific example of the prediction process will be described later.

図９は、第１実施形態の符号化装置１００が利用するシンタクス５００の一例を示す図である。シンタクス５００は、符号化装置１００が入力画像（動画像データ）を符号化して生成した符号化データの構造を示している。符号化データを復号化する場合、後述の復号装置は、シンタクス５００と同一のシンタクス構造を参照して動画像のシンタクス解釈を行う。 FIG. 9 is a diagram illustrating an example of the syntax 500 used by the encoding device 100 according to the first embodiment. A syntax 500 indicates a structure of encoded data generated by the encoding apparatus 100 by encoding an input image (moving image data). When decoding the encoded data, a decoding device described later refers to the same syntax structure as that of the syntax 500 and interprets the syntax of the moving image.

シンタクス５００は、ハイレベルシンタクス５０１、スライスレベルシンタクス５０２及びコーディングツリーレベルシンタクス５０３の３つのパートを含む。ハイレベルシンタクス５０１は、スライスよりも上位のレイヤのシンタクス情報を含む。スライスとは、フレーム若しくはフィールドに含まれる矩形領域又は連続領域を指す。スライスレベルシンタクス５０２は、各スライスを復号化するために必要な情報を含む。コーディングツリーレベルシンタクス５０３は、各コーディングツリー（即ち、各コーディングツリーブロック）を復号するために必要な情報を含む。これら各パートは、更に詳細なシンタクスを含む。 The syntax 500 includes three parts: a high level syntax 501, a slice level syntax 502, and a coding tree level syntax 503. The high level syntax 501 includes syntax information of a layer higher than the slice. A slice refers to a rectangular area or a continuous area included in a frame or a field. The slice level syntax 502 includes information necessary for decoding each slice. The coding tree level syntax 503 includes information necessary for decoding each coding tree (ie, each coding tree block). Each of these parts includes more detailed syntax.

ハイレベルシンタクス５０１は、シーケンスパラメータセットシンタクス５０４、ピクチャパラメータセットシンタクス５０５、及びアダプテーションパラメータセットシンタクス５０６などのシーケンス及びピクチャレベルのシンタクスを含む。 The high level syntax 501 includes sequence and picture level syntaxes such as a sequence parameter set syntax 504, a picture parameter set syntax 505, and an adaptation parameter set syntax 506.

スライスレベルシンタクス５０２は、スライスヘッダーシンタクス５０７、プレッドウェイトテーブルシンタクス５０８、及びスライスデータシンタクス５０９などを含む。プレッドウェイトテーブルシンタクス５０８は、スライスヘッダーシンタクス５０７から呼び出される。 The slice level syntax 502 includes a slice header syntax 507, a breadweight table syntax 508, a slice data syntax 509, and the like. The tread weight table syntax 508 is called from the slice header syntax 507.

コーディングツリーレベルシンタクス５０３は、コーディングツリーユニットシンタクス５１０、トランスフォームユニットシンタクス５１１、及びプレディクションユニットシンタクス５１２などを含む。コーディングツリーユニットシンタクス５１０は、四分木構造を持つことができる。具体的には、コーディングツリーユニットシンタクス５１０のシンタクス要素として、更にコーディングツリーユニットシンタクス５１０を再帰呼び出しすることができる。即ち、１つのコーディングツリーブロックを四分木で細分化することができる。また、コーディングツリーユニットシンタクス５１０内にはトランスフォームユニットシンタクス５１１が含まれている。トランスフォームユニットシンタクス５１１は、四分木の最末端の各コーディングツリーユニットシンタクス５１０において呼び出される。トランスフォームユニットシンタクス５１１は、逆直交変換及び量子化などに関わる情報が記述されている。これらのシンタクスには、重み付き動き補償予測に関する情報が記述されてもよい。 The coding tree level syntax 503 includes a coding tree unit syntax 510, a transform unit syntax 511, a prediction unit syntax 512, and the like. The coding tree unit syntax 510 may have a quadtree structure. Specifically, the coding tree unit syntax 510 can be recursively called as a syntax element of the coding tree unit syntax 510. That is, one coding tree block can be subdivided with a quadtree. The coding tree unit syntax 510 includes a transform unit syntax 511. The transform unit syntax 511 is called in each coding tree unit syntax 510 at the extreme end of the quadtree. The transform unit syntax 511 describes information related to inverse orthogonal transformation and quantization. In these syntaxes, information related to weighted motion compensation prediction may be described.

図１０は、第１実施形態のピクチャパラメータセットシンタクス５０５の一例を示す図である。weighted_pred_flagは、例えば、Ｐ−ｓｌｉｃｅに関する第１実施形態の重み付き補償予測の有効又は無効を示すシンタクス要素である。weighted_pred_flagが０である場合、Ｐ−ｓｌｉｃｅ内での第１実施形態の重み付き動き補償予測は無効となる。従って、ＷＰパラメータ情報に含まれるＷＰ適用フラグは常に０に設定され、ＷＰセレクタ３０４、３０５は、各々の出力端をデフォルト動き補償部３０１へ接続する。一方、weighted_pred_flagが１である場合、Ｐ−ｓｌｉｃｅ内での第１実施形態の重み付き動き補償予測は有効となる。 FIG. 10 is a diagram illustrating an example of the picture parameter set syntax 505 according to the first embodiment. The weighted_pred_flag is a syntax element indicating, for example, whether the weighted compensated prediction of the first embodiment related to P-slice is valid or invalid. When weighted_pred_flag is 0, the weighted motion compensation prediction of the first embodiment in the P-slice is invalid. Accordingly, the WP application flag included in the WP parameter information is always set to 0, and the WP selectors 304 and 305 connect the respective output terminals to the default motion compensation unit 301. On the other hand, when weighted_pred_flag is 1, the weighted motion compensation prediction of the first embodiment in the P-slice is valid.

なお、別の例として、weighted_pred_flagが１である場合には、より下位のレイヤ（スライスヘッダー、コーディングツリーブロック、トランスフォームユニット、及びプレディクションユニットなど）のシンタクスにおいて、スライス内部の局所領域毎に第１実施形態の重み付き動き補償予測の有効又は無効を規定するようにしてもよい。 As another example, when weighted_pred_flag is 1, in the syntax of lower layers (slice header, coding tree block, transform unit, prediction unit, etc.) The validity or invalidity of the weighted motion compensated prediction of one embodiment may be specified.

weighted_bipred_idcは、例えば、Ｂ−ｓｌｉｃｅに関する第１実施形態の重み付き補償予測の有効又は無効を示すシンタクス要素である。weighted_bipred_idcが０である場合、Ｂ−ｓｌｉｃｅ内での第１実施形態の重み付き動き補償予測は無効となる。従って、ＷＰパラメータ情報に含まれるＷＰ適用フラグは常に０に設定され、ＷＰセレクタ３０４、３０５は、各々の出力端をデフォルト動き補償部３０１へ接続する。一方、weighted_bipred_idcが１である場合、Ｂ−ｓｌｉｃｅ内での第１実施形態の重み付き動き補償予測は有効となる。 For example, weighted_bipred_idc is a syntax element indicating whether the weighted compensated prediction of the first embodiment related to B-slice is valid or invalid. When weighted_bipred_idc is 0, the weighted motion compensated prediction of the first embodiment in the B-slice is invalid. Accordingly, the WP application flag included in the WP parameter information is always set to 0, and the WP selectors 304 and 305 connect the respective output terminals to the default motion compensation unit 301. On the other hand, when weighted_bipred_idc is 1, the weighted motion compensated prediction of the first embodiment in the B-slice is valid.

なお、別の例として、weighted_bipred_idcが１である場合には、より下位のレイヤ（スライスヘッダー、コーディングツリーブロック、及びトランスフォームユニットなど）のシンタクスにおいて、スライス内部の局所領域毎に第１実施形態の重み付き動き補償予測の有効又は無効を規定するようにしてもよい。 As another example, when weighted_bipred_idc is 1, in the syntax of lower layers (slice header, coding tree block, transform unit, etc.), the syntax of the first embodiment is determined for each local region inside the slice. The validity or invalidity of the weighted motion compensation prediction may be defined.

図１１は、第１実施形態のスライスヘッダーシンタクス５０７の一例を示す図である。slice_typeはスライスのスライスタイプ（Ｉ−ｓｌｉｃｅ，Ｐ−ｓｌｉｃｅ，Ｂ−ｓｌｉｃｅなど）を示している。pic_parameter_set_idは、いずれのピクチャパラメータセットシンタクス５０５を参照するかを示す識別子である。num_ref_idx_active_override_flagは、有効な参照画像の数を更新するかどうかを示すフラグであり、本フラグが１の場合、参照リストの参照画像数を定義するnum_ref_idx_l0_active_minus1及びnum_ref_idx_l1_active_minus1が利用できる。pred_weight_table()は、重み付き動き補償予測に利用するプレッドウェイトテーブルシンタクスを示す関数であり、前述のweighted_pred_flagが１かつＰ−ｓｌｉｃｅの場合、及びweighted_bipred_idcが１かつＢ−ｓｌｉｃｅの場合に、本関数が呼び出される。 FIG. 11 is a diagram illustrating an example of the slice header syntax 507 according to the first embodiment. slice_type indicates the slice type (I-slice, P-slice, B-slice, etc.) of the slice. pic_parameter_set_id is an identifier indicating which picture parameter set syntax 505 is to be referred to. num_ref_idx_active_override_flag is a flag indicating whether or not the number of valid reference images is updated. When this flag is 1, num_ref_idx_l0_active_minus1 and num_ref_idx_l1_active_minus1 that define the number of reference images in the reference list can be used. pred_weight_table () is a function indicating the predicate weight table syntax used for weighted motion compensation prediction. When the above weighted_pred_flag is 1 and P-slice, and when weighted_bipred_idc is 1 and B-slice, this function is Called.

図１２は、第１実施形態のプレッドウェイトテーブルシンタクス５０８の一例を示す図である。luma_log2_weight_denomは、スライスにおける輝度信号の重み係数の固定小数点精度を表しており、数式（７）又は数式（９）のｌｏｇＷＤＣに対応する値である。chroma_log2_weight_denomは、スライスにおける色差信号の重み係数の固定小数点精度を表しており、数式（７）又は数式（９）のｌｏｇＷＤＣに対応する値である。chroma_format_idcは、色空間を表す識別子であり、MONO_IDXはモノクロ映像を示す値である。num_ref_common_active_minus1は、スライスにおける共通リストに含まれる参照画像の数から１を引いた値を示している。 FIG. 12 is a diagram illustrating an example of the tread weight table syntax 508 according to the first embodiment. luma_log2_weight_denom represents the fixed-point precision of the weighting factor of the luminance signal in the slice, and is a value corresponding to the log WDC in Equation (7) or Equation (9). chroma_log2_weight_denom represents the fixed-point precision of the weight coefficient of the color difference signal in the slice, and is a value corresponding to the log WDC in Equation (7) or Equation (9). chroma_format_idc is an identifier representing a color space, and MONO_IDX is a value indicating a monochrome video. num_ref_common_active_minus1 indicates a value obtained by subtracting 1 from the number of reference images included in the common list in the slice.

luma_weight_l0_flag及びluma_weight_l1_flagは、リスト０及びリスト１のそれぞれに対応する輝度信号におけるＷＰ適応フラグを示している。本フラグが１の場合、スライス内全域で第１実施形態の輝度信号の重み付き動き補償予測が有効となる。chroma_weight_l0_flag及びchroma_weight_l1_flagは、リスト０及びリスト１のそれぞれに対応する色差信号におけるＷＰ適応フラグを示している。本フラグが１の場合、スライス内全域で第１実施形態の色差信号の重み付き動き補償予測が有効となる。luma_weight_l0[i]及びluma_weight_l1[i]は、リスト０及びリスト１のそれぞれで管理されたi番目に対応する輝度信号の重み係数である。luma_offset_l0[i]及びluma_offset_l1[i]は、リスト０及びリスト１のそれぞれで管理されたi番目に対応する輝度信号のオフセットである。これらは、それぞれ、数式（７）又は数式（９）のｗ０Ｃ、ｗ１Ｃ、ｏ０Ｃ、ｏ１Ｃに対応する値である。但し、Ｃ＝Ｙとする。 luma_weight_l0_flag and luma_weight_l1_flag indicate WP adaptation flags in the luminance signals corresponding to List 0 and List 1, respectively. When this flag is 1, the weighted motion compensated prediction of the luminance signal of the first embodiment is effective over the entire slice. chroma_weight_l0_flag and chroma_weight_l1_flag indicate WP adaptation flags in the color difference signals corresponding to List 0 and List 1, respectively. When this flag is 1, the weighted motion compensation prediction of the chrominance signal of the first embodiment is effective over the entire slice. luma_weight_l0 [i] and luma_weight_l1 [i] are the weight coefficients of the i-th corresponding luminance signal managed in list 0 and list 1, respectively. luma_offset_l0 [i] and luma_offset_l1 [i] are offsets of the i-th corresponding luminance signal managed in list 0 and list 1, respectively. These are values corresponding to w0C, w1C, o0C, and o1C, respectively, of Equation (7) or Equation (9). However, C = Y.

chroma_weight_l0[i][j]及びchroma_weight_l1[i][j]は、リスト０及びリスト１のそれぞれで管理されたi番目に対応する色差信号の重み係数である。chroma_offset_l0[i][j]及びchroma_offset_l1[i][j]は、リスト０及びリスト１のそれぞれで管理されたi番目に対応する色差信号のオフセットである。これらは、それぞれ、数式（７）又は数式（９）のｗ０Ｃ、ｗ１Ｃ、ｏ０Ｃ、ｏ１Ｃに対応する値である。但し、Ｃ＝Ｃｒ又はＣｂとする。ｊは色差成分のコンポーネントを示しており、例えばＹＵＶ４：２：０信号の場合、j=0がＣｒ成分、j=1がＣｂ成分であることを示す。 chroma_weight_l0 [i] [j] and chroma_weight_l1 [i] [j] are weight coefficients of the i-th corresponding color difference signal managed in list 0 and list 1, respectively. chroma_offset_l0 [i] [j] and chroma_offset_l1 [i] [j] are offsets of the i-th color difference signal managed in list 0 and list 1, respectively. These are values corresponding to w0C, w1C, o0C, and o1C, respectively, of Equation (7) or Equation (9). However, C = Cr or Cb. j indicates a component of a color difference component. For example, in the case of a YUV 4: 2: 0 signal, j = 0 indicates a Cr component and j = 1 indicates a Cb component.

ここで、シンタクス構成における重み付き予測に関連するそれぞれのシンタクス要素の予測方法の詳細について説明する。シンタクス要素の予測は、インデックス再構成部１１０Ｂにより行われる。図１３は、第１実施形態の予測方法を明示的に示したシンタクス構成一例を示す図である。図１３に示す例では、予測を導入したシンタクス要素をdeltaの接頭語を付けて示しているが、これらのシンタクス構成は基本的に図１２で示したシンタクス構成を同じ構成要素を持つ。 Here, the detail of the prediction method of each syntax element relevant to the weighted prediction in a syntax structure is demonstrated. The prediction of syntax elements is performed by the index reconstruction unit 110B. FIG. 13 is a diagram illustrating an example of a syntax configuration that explicitly indicates the prediction method of the first embodiment. In the example shown in FIG. 13, syntax elements into which prediction is introduced are shown with a delta prefix, but these syntax configurations basically have the same components as the syntax configuration shown in FIG. 12.

まず、重み係数の固定小数点精度を示すluma_log2_weight_denom及びchroma_log2_weight_denomの信号間の予測方法について説明する。インデックス再構成部１１０Ｂは、数式（１０）を用いて、luma_log2_weight_denom及びchroma_log2_weight_denomの信号間の予測処理を行い、数式（１１）を用いて、復元処理を行う。ここでは、図１２及び図１３に示すとおり、luma_log2_weight_denomが先に定義されているため、luma_log2_weight_denomの値からchroma_log2_weight_denomを予測する。 First, a prediction method between signals of luma_log2_weight_denom and chroma_log2_weight_denom indicating the fixed point precision of the weight coefficient will be described. The index reconstruction unit 110B performs prediction processing between the luma_log2_weight_denom and chroma_log2_weight_denom signals using Equation (10), and performs restoration processing using Equation (11). Here, as shown in FIGS. 12 and 13, since luma_log2_weight_denom is defined first, chroma_log2_weight_denom is predicted from the value of luma_log2_weight_denom.

delta_chroma_log2_weight_denom = (chroma_log2_weight_denom - luma_log2_weight_denom) …（１０） delta_chroma_log2_weight_denom = (chroma_log2_weight_denom-luma_log2_weight_denom)… (10)

chroma_log2_weight_denom = (luma_log2_weight_denom + delta_chroma_log2_weight_denom) …（１１） chroma_log2_weight_denom = (luma_log2_weight_denom + delta_chroma_log2_weight_denom)… (11)

図１４は、第１実施形態のchroma_log2_weight_denomの予測処理の一例を示すフローチャートである。 FIG. 14 is a flowchart illustrating an example of chroma_log2_weight_denom prediction processing according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に設定されているluma_log2_weight_denomを予測値として導出する（ステップＳ１０１）。 First, the index reconstruction unit 110B derives luma_log2_weight_denom set in the index information as a predicted value (step S101).

続いて、インデックス再構成部１１０Ｂは、chroma_log2_weight_denomからluma_log2_weight_denomを減算し（ステップＳ１０２）、差分値をdelta_chroma_log2_weight_denomとしてインデックス情報に設定する（ステップＳ１０３）。 Subsequently, the index reconstruction unit 110B subtracts luma_log2_weight_denom from chroma_log2_weight_denom (step S102), and sets the difference value as index information in delta_chroma_log2_weight_denom (step S103).

図１５は、第１実施形態のchroma_log2_weight_denomの復元処理の一例を示すフローチャートである。 FIG. 15 is a flowchart illustrating an example of chroma_log2_weight_denom restoration processing according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に既に設定されているluma_log2_weight_denomを予測値として導出する（ステップＳ２０１）。 First, the index reconstruction unit 110B derives luma_log2_weight_denom already set in the index information as a predicted value (step S201).

続いて、インデックス再構成部１１０Ｂは、luma_log2_weight_denomをdelta_chroma_log2_weight_denomに加算し（ステップＳ２０２）、加算値をchroma_log2_weight_denomとしてインデックス情報に設定する（ステップＳ２０３）。 Subsequently, the index reconstruction unit 110B adds luma_log2_weight_denom to delta_chroma_log2_weight_denom (step S202), and sets the addition value as index information as chroma_log2_weight_denom (step S203).

フェード効果は、一般的に色空間別に異なる時間的変化をさせるケースが少ないため、信号成分毎の固定小数点精度は、輝度成分と色差成分で強い相関がある。このため、このように色空間内で予測することにより、固定小数点精度を示す情報量を削減できる。 Since the fading effect generally has few cases of causing different temporal changes depending on the color space, the fixed-point accuracy for each signal component has a strong correlation between the luminance component and the color difference component. For this reason, it is possible to reduce the amount of information indicating the fixed-point precision by performing prediction in the color space in this way.

なお、数式（１０）では、色差成分から輝度成分を減算しているが、輝度成分から色差成分を減算してもよい。この場合、数式（１０）に応じて数式（１１）も式を変形すればよい。 In Equation (10), the luminance component is subtracted from the color difference component, but the color difference component may be subtracted from the luminance component. In this case, the equation (11) may be modified according to the equation (10).

次に、輝度及び色差信号の重み係数をそれぞれ表すluma_weight_lx[i]及びchroma_weight_lx[i][j]の予測方法について説明する。ここで、xは、0又は1を示す識別子である。luma_weight_lx[i]及びchroma_weight_lx[i][j]の値は、それぞれ、luma_log2_weight_denom及びchroma_log2_weight_denomの値に応じて増減する。例えば、luma_log2_weight_denomの値が3である場合、明度変化がないと仮定した場合のluma_weight_lx[i]は、(1<<3)となる。一方、luma_log2_weight_denomの値が5である場合、明度変化がないと仮定した場合のluma_weight_lx[i]は、(1<<5)となる。 Next, a description will be given of prediction methods for luma_weight_lx [i] and chroma_weight_lx [i] [j] representing the weighting coefficients of the luminance and chrominance signals, respectively. Here, x is an identifier indicating 0 or 1. The values of luma_weight_lx [i] and chroma_weight_lx [i] [j] increase or decrease according to the values of luma_log2_weight_denom and chroma_log2_weight_denom, respectively. For example, when the value of luma_log2_weight_denom is 3, luma_weight_lx [i] assuming that there is no change in brightness is (1 << 3). On the other hand, when the value of luma_log2_weight_denom is 5, luma_weight_lx [i] assuming that there is no change in brightness is (1 << 5).

このため、インデックス再構成部１１０Ｂは、明度変化がない場合の重み係数を基準係数(デフォルト値)として予測処理を行う。具体的には、インデックス再構成部１１０Ｂは、数式（１２）〜（１３）を用いてluma_weight_lx[i]の予測処理を行い、数式（１４）を用いて、復元処理を行う。同様に、インデックス再構成部１１０Ｂは、数式（１５）〜（１６）を用いてchroma_weight_lx[i]の予測処理を行い、数式（１７）を用いて、復元処理を行う。 For this reason, the index reconstruction unit 110B performs a prediction process using the weighting coefficient when there is no change in brightness as a reference coefficient (default value). Specifically, the index reconstruction unit 110B performs prediction processing of luma_weight_lx [i] using Equations (12) to (13), and performs restoration processing using Equation (14). Similarly, the index reconstruction unit 110B performs the prediction process of chroma_weight_lx [i] using Expressions (15) to (16), and performs the restoration process using Expression (17).

delta_luma_weight_lx[i] = (luma_weight_lx[i] - default_luma_weight_lx) …（１２） delta_luma_weight_lx [i] = (luma_weight_lx [i]-default_luma_weight_lx) (12)

default_luma_weight_lx = (1<<luma_log2_weight_denom) …（１３） default_luma_weight_lx = (1 << luma_log2_weight_denom) ... (13)

luma_weight_lx[i] = (default_luma_weight_lx + delta_luma_weight_lx[i]) …（１４） luma_weight_lx [i] = (default_luma_weight_lx + delta_luma_weight_lx [i]) (14)

delta_chroma_weight_lx[i][j] = (chroma_weight_lx[i][j] - default_chroma_weight_lx) …（１５） delta_chroma_weight_lx [i] [j] = (chroma_weight_lx [i] [j]-default_chroma_weight_lx) (15)

default_chroma_weight_lx = (1<<chroma_log2_weight_denom) …（１６） default_chroma_weight_lx = (1 << chroma_log2_weight_denom) ... (16)

chroma_weight_lx[i][j] = (default_chroma_weight_lx + delta_chroma_weight_lx[i][j]) …（１７） chroma_weight_lx [i] [j] = (default_chroma_weight_lx + delta_chroma_weight_lx [i] [j]) (17)

ここで、default_luma_weight_lx、default_chroma_weight_lxは、それぞれ、輝度成分、色差成分における明度変化がないデフォルト値である。 Here, default_luma_weight_lx and default_chroma_weight_lx are default values having no change in brightness in the luminance component and the color difference component, respectively.

図１６は、第１実施形態のluma_weight_lx[i]の予測処理の一例を示すフローチャートである。 FIG. 16 is a flowchart illustrating an example of the prediction process of luma_weight_lx [i] according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に設定されているluma_log2_weight_denomを導出し（ステップＳ３０１）、default_luma_weight_lxを予測値として算出する（ステップＳ３０２）。 First, the index reconstruction unit 110B derives luma_log2_weight_denom set in the index information (step S301), and calculates default_luma_weight_lx as a predicted value (step S302).

続いて、インデックス再構成部１１０Ｂは、luma_weight_lx[i]からdefault_luma_weight_lxを減算し（ステップＳ３０３）、差分値をdelta_luma_weight_lx[i]としてインデックス情報に設定する（ステップＳ３０４）。 Subsequently, the index reconstruction unit 110B subtracts default_luma_weight_lx from luma_weight_lx [i] (step S303), and sets the difference value as index information in the index information as delta_luma_weight_lx [i] (step S304).

なお、本処理を参照画像枚数分だけ繰り返すことでluma_weight_lx[i]に予測処理を適用できる。 Note that the prediction process can be applied to luma_weight_lx [i] by repeating this process for the number of reference images.

図１７は、第１実施形態のluma_weight_lx[i]の復元処理の一例を示すフローチャートである。 FIG. 17 is a flowchart illustrating an example of luma_weight_lx [i] restoration processing according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に既に設定されているdelta_luma_weight_lx[i]を導出し（ステップＳ４０１）、default_luma_weight_lxを予測値として算出する（ステップＳ４０２）。 First, the index reconstruction unit 110B derives delta_luma_weight_lx [i] already set in the index information (step S401), and calculates default_luma_weight_lx as a predicted value (step S402).

続いて、インデックス再構成部１１０Ｂは、delta_luma_weight_lx[i]をdefault_luma_weight_lxに加算し（ステップＳ４０３）、加算値をluma_weight_lx[i]としてインデックス情報に設定する（ステップＳ４０４）。 Subsequently, the index reconstruction unit 110B adds delta_luma_weight_lx [i] to default_luma_weight_lx (step S403), and sets the addition value as index information (luma_weight_lx [i]) (step S404).

なお、ここでは輝度成分に対するフローチャートを示したが、色差成分（chroma_weight_lx[i][j]）に関しても同様に予測処理と復元処理が実現できる。 Although the flowchart for the luminance component is shown here, the prediction process and the restoration process can be similarly implemented for the color difference component (chroma_weight_lx [i] [j]).

フェード効果を含む画像は、ある特定のフェード変化点でフェードを行い、それ以外の画像は通常の自然画像又はフェード効果のない画像であることが多い。この場合、重み係数は明度変化がない場合を取ることが多くなる。そこで、明度変化がない場合の初期値を、固定小数点精度から導出し、予測値として用いることで重み係数の符号量を削減できる。 An image including a fade effect is often faded at a specific fade change point, and the other images are often normal natural images or images without a fade effect. In this case, the weighting coefficient often takes a case where there is no change in brightness. Therefore, by deriving the initial value when there is no change in brightness from the fixed-point precision and using it as the predicted value, the code amount of the weighting coefficient can be reduced.

なお、輝度及び色差信号の重み係数（luma_weight_lx[i]及びchroma_weight_lx[i][j]）の予測値を、異なる参照番号又は異なるＰＯＣ番号で導出してもよい。この場合、符号化対象スライスから最も距離の近い参照番号をbase_idxとすると、インデックス再構成部１１０Ｂは、数式（１８）を用いてluma_weight_lx[i]の予測処理を行い、数式（１９）を用いて、復元処理を行う。同様に、インデックス再構成部１１０Ｂは、数式（２０）を用いてchroma_weight_lx[i][j]の予測処理を行い、数式（２１）を用いて、復元処理を行う。 Note that the predicted values of the weighting coefficients (luma_weight_lx [i] and chroma_weight_lx [i] [j]) of the luminance and chrominance signals may be derived with different reference numbers or different POC numbers. In this case, if the reference number closest to the encoding target slice is base_idx, the index reconstruction unit 110B performs prediction processing of luma_weight_lx [i] using Equation (18), and uses Equation (19). Perform the restoration process. Similarly, the index reconstruction unit 110B performs a prediction process of chroma_weight_lx [i] [j] using Expression (20), and performs a restoration process using Expression (21).

delta_luma_weight_lx[i] = (luma_weight_lx[i] - luma_weight_lx[base_idx]) …（１８） delta_luma_weight_lx [i] = (luma_weight_lx [i]-luma_weight_lx [base_idx]) (18)

luma_weight_lx[i] = (delta_luma_weight_lx[i] + luma_weight_lx[base_idx]) …（１９） luma_weight_lx [i] = (delta_luma_weight_lx [i] + luma_weight_lx [base_idx]) (19)

delta_chroma_weight_lx[i][j] = (chroma_weight_lx[i][j] - chroma_weight_lx[base_idx][j]) …（２０） delta_chroma_weight_lx [i] [j] = (chroma_weight_lx [i] [j]-chroma_weight_lx [base_idx] [j]) (20)

chroma_weight_lx[i][j] = (delta_chroma_weight_lx[i][j] + chroma_weight_lx[base_idx][j]) …（２１） chroma_weight_lx [i] [j] = (delta_chroma_weight_lx [i] [j] + chroma_weight_lx [base_idx] [j]) (21)

ここで、数式（１８）及び（２０）では、i≠base_idxとなる。base_idxで示される参照番号の重み係数は、数式（１８）及び（２０）で利用できないので、数式（１２）〜（１３）及び（１５）〜（１６）を利用すればよい。 Here, in Equations (18) and (20), i ≠ base_idx. Since the weighting coefficient of the reference number indicated by base_idx cannot be used in Expressions (18) and (20), Expressions (12) to (13) and (15) to (16) may be used.

図１８は、第１実施形態のluma_weight_lx[i]の予測処理の他の例を示すフローチャートである。 FIG. 18 is a flowchart illustrating another example of the luma_weight_lx [i] prediction process according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、基準となる参照番号を示すbaseidxを設定する（ステップＳ５０１）。ここでは、baseidxの値を、仮に０とする。 First, the index reconstruction unit 110B sets baseidx indicating a reference number serving as a reference (step S501). Here, the value of baseidx is assumed to be zero.

続いて、インデックス再構成部１１０Ｂは、baseidxに基づいて、インデックス情報からluma_weight_lx[baseidx]を予測値として導出する（ステップＳ５０２）。なお、baseidxで示されるインデックス情報のluma_weight_lx[baseidx]は、例えば、予測を行わず直値で符号化されている。 Subsequently, the index reconfiguration unit 110B derives luma_weight_lx [baseidx] from the index information as a predicted value based on the baseidx (step S502). Note that the index information indicated by baseidx, luma_weight_lx [baseidx], for example, is encoded with a direct value without performing prediction.

続いて、インデックス再構成部１１０Ｂは、luma_weight_lx[i]からluma_weight_lx[baseidx]を減算し（ステップＳ５０３）、差分値をdelta_luma_weight_lx[i]としてインデックス情報に設定する（ステップＳ５０４）。 Subsequently, the index reconstruction unit 110B subtracts luma_weight_lx [baseidx] from luma_weight_lx [i] (step S503), and sets the difference value as index information in the index information as delta_luma_weight_lx [i] (step S504).

なお、本処理を参照画像枚数分だけ繰り返すことでbaseidx以外のluma_weight_lx[i]に予測処理を適用できる。 Note that the prediction process can be applied to luma_weight_lx [i] other than baseidx by repeating this process for the number of reference images.

図１９は、第１実施形態のluma_weight_lx[i]の復元処理の他の例を示すフローチャートである。 FIG. 19 is a flowchart illustrating another example of the luma_weight_lx [i] restoration process according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、基準となる参照番号を示すbaseidxを設定する（ステップＳ６０１）。ここでは、baseidxの値を、仮に０とする。 First, the index reconstruction unit 110B sets baseidx indicating a reference number serving as a reference (step S601). Here, the value of baseidx is assumed to be zero.

続いて、インデックス再構成部１１０Ｂは、baseidxに基づいて、インデックス情報からluma_weight_lx[baseidx]を予測値として導出する（ステップＳ６０２）。なお、baseidxで示されるインデックス情報のluma_weight_lx[baseidx]は、例えば、予測を行わず直値で符号化又は復号されている。 Subsequently, the index reconstruction unit 110B derives luma_weight_lx [baseidx] from the index information as a predicted value based on the baseidx (step S602). Note that luma_weight_lx [baseidx] of index information indicated by baseidx is encoded or decoded with a direct value without performing prediction, for example.

続いて、インデックス再構成部１１０Ｂは、delta_luma_weight_lx[i]をluma_weight_lx[baseidx]に加算し（ステップＳ６０３）、加算値をluma_weight_lx[i]としてインデックス情報に設定する（ステップＳ６０４）。 Subsequently, the index reconstruction unit 110B adds delta_luma_weight_lx [i] to luma_weight_lx [baseidx] (step S603), and sets the added value as index information in the index information as luma_weight_lx [i] (step S604).

なお、ここでは輝度成分に対するフローチャートを示したが、色差成分（chroma_weight_lx[i][j]）に関しても同様に予測処理と復元処理が実現できる。また、ここでは、例として、luma_weight_lx[i]の予測方法及び復元方法を説明したが、luma_offset_lx[i]に関しても同様に予測、復元が可能である。 Although the flowchart for the luminance component is shown here, the prediction process and the restoration process can be similarly implemented for the color difference component (chroma_weight_lx [i] [j]). In addition, here, as an example, the prediction method and restoration method of luma_weight_lx [i] have been described, but prediction and restoration can be similarly performed for luma_offset_lx [i].

また、輝度及び色差信号の重み係数（luma_weight_lx[i]及びchroma_weight_lx[i][j]）の予測値を、符号化対象の参照スライスとの距離を用いて導出してもよい。この場合、インデックス再構成部１１０Ｂは、数式（２２）を用いてluma_weight_lx[i]の予測処理を行い、数式（２３）を用いて、復元処理を行う。同様に、インデックス再構成部１１０Ｂは、数式（２４）を用いてchroma_weight_lx[i][j]の予測処理を行い、数式（２５）を用いて、復元処理を行う。 Moreover, you may derive | lead-out the prediction value of the weight coefficient (luma_weight_lx [i] and chroma_weight_lx [i] [j]) of a brightness | luminance and a colour-difference signal using the distance with the encoding reference slice. In this case, the index reconstruction unit 110B performs prediction processing of luma_weight_lx [i] using Equation (22), and performs restoration processing using Equation (23). Similarly, the index reconstruction unit 110B performs a prediction process of chroma_weight_lx [i] [j] using Expression (24), and performs a restoration process using Expression (25).

delta_luma_weight_lx[i] = (luma_weight_lx[i] - luma_weight_lx[i-1]) …（２２） delta_luma_weight_lx [i] = (luma_weight_lx [i]-luma_weight_lx [i-1]) (22)

luma_weight_lx[i] = (delta_luma_weight_lx[i] + luma_weight_lx[i-1]) …（２３） luma_weight_lx [i] = (delta_luma_weight_lx [i] + luma_weight_lx [i-1]) (23)

delta_chroma_weight_lx[i][j] = (chroma_weight_lx[i][j] - chroma_weight_lx[i-1][j]) …（２４） delta_chroma_weight_lx [i] [j] = (chroma_weight_lx [i] [j]-chroma_weight_lx [i-1] [j]) (24)

chroma_weight_lx[i][j] = (delta_chroma_weight_lx[i][j] + chroma_weight_lx[i-1][j]) …（２５） chroma_weight_lx [i] [j] = (delta_chroma_weight_lx [i] [j] + chroma_weight_lx [i-1] [j]) (25)

ここで、数式（２２）及び（２４）では、i≠0となる。 Here, in Equations (22) and (24), i ≠ 0.

なお、本予測処理及び復号処理は、図１８及び図１９のフローチャートにおいて、baseidxにi-1番目の値（i≠０）を導入することと等価であるため、説明は省略する。なお、ここでは輝度成分に対するフローチャートを示したが、色差成分（chroma_weight_lx[i][j]）に関しても同様に予測処理と復元処理が実現できる。また、ここでは、例として、luma_weight_lx[i]の予測方法及び復元方法を説明したが、luma_offset_lx[i]に関しても同様に予測、復元が可能である。 Note that the present prediction process and decoding process are equivalent to introducing the i−1th value (i ≠ 0) into baseidx in the flowcharts of FIGS. Although the flowchart for the luminance component is shown here, the prediction process and the restoration process can be similarly implemented for the color difference component (chroma_weight_lx [i] [j]). In addition, here, as an example, the prediction method and restoration method of luma_weight_lx [i] have been described, but prediction and restoration can be similarly performed for luma_offset_lx [i].

符号化対象スライスが参照可能な参照スライスは、符号化効率の観点で符号化対象スライスから時間距離的又は空間距離的に近いスライスが設定される場合が多い。ここで、時間距離的に連続するスライスの輝度変化は相関が高いため、重み係数及びオフセットの時間距離的な相関も高い。そこで、基準となる参照スライスの重み係数及びオフセット値を用いて、時間的に異なる参照スライスの重み係数及びオフセット値を予測することで、効率良く符号量を削減できる。なお、空間的に同一の参照スライスは、同じ重み係数及びオフセット値を取る場合が多いため、同様の理由で予測を導入することにより、符号量を削減できる。 In many cases, the reference slice that can be referred to by the encoding target slice is set to a slice that is close to the encoding target slice in terms of temporal distance or spatial distance from the viewpoint of encoding efficiency. Here, since the luminance change of the slices that are continuous in time and distance has a high correlation, the time-distance correlation of the weight coefficient and the offset is also high. Thus, the amount of codes can be efficiently reduced by predicting the weight coefficient and offset value of the reference slice that are temporally different using the weight coefficient and offset value of the reference slice serving as a reference. Since spatially identical reference slices often have the same weighting factor and offset value, the amount of codes can be reduced by introducing prediction for the same reason.

次に、色差信号のオフセットを表すchroma_offset_lx[i][j]の予測方法について説明する。YUVの色空間では、色差成分は、中央値からのズレ量で色を表現する。このため、重み係数を用いて中央値を考慮した明度変化からの変化量を予測値とすることができる。具体的には、インデックス再構成部１１０Ｂは、数式（２６）〜（２７）を用いてchroma_offset_lx[i][j]の予測処理を行い、数式（２８）を用いて、復元処理を行う。 Next, a prediction method of chroma_offset_lx [i] [j] representing the offset of the color difference signal will be described. In the YUV color space, the color difference component expresses the color by the amount of deviation from the median. For this reason, the amount of change from the brightness change in consideration of the median value using the weight coefficient can be used as the predicted value. Specifically, the index reconstruction unit 110B performs a prediction process of chroma_offset_lx [i] [j] using Expressions (26) to (27), and performs a restoration process using Expression (28).

delta_chroma_offset_lx[i][j] = (chroma_offset_lx[i][j] + ( ( MED * chroma_weight_lx[i][j])>> chroma_log2_weight_denom) - MED ) …（２６） delta_chroma_offset_lx [i] [j] = (chroma_offset_lx [i] [j] + ((MED * chroma_weight_lx [i] [j]) >> chroma_log2_weight_denom)-MED)… (26)

MED = (MaxChromaValue>>1) …（２７） MED = (MaxChromaValue >> 1) (27)

ここで、MaxChromaValueは色差信号が取れる最大明度を示している。例えば、8ビットの信号である場合、MaxChromaValueは255であり、MEDは128となる。 Here, MaxChromaValue indicates the maximum brightness with which a color difference signal can be obtained. For example, in the case of an 8-bit signal, MaxChromaValue is 255 and MED is 128.

chroma_offset_lx[i][j] = (delta_chroma_offset_lx[i][j] - ( ( MED * chroma_weight_lx[i][j])>> chroma_log2_weight_denom) + MED ) …（２８） chroma_offset_lx [i] [j] = (delta_chroma_offset_lx [i] [j]-((MED * chroma_weight_lx [i] [j]) >> chroma_log2_weight_denom) + MED)… (28)

図２０は、第１実施形態のchroma_offset_lx[i][j]の予測処理の一例を示すフローチャートである。 FIG. 20 is a flowchart illustrating an example of chroma_offset_lx [i] [j] prediction processing according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に設定されているchroma_log2_weight_denomを導出する（ステップＳ７０１）。 First, the index reconstruction unit 110B derives chroma_log2_weight_denom set in the index information (step S701).

続いて、インデックス再構成部１１０Ｂは、インデックス情報に設定されているchroma_offset_lx[i][j]を導出する（ステップＳ７０２）。 Subsequently, the index reconstruction unit 110B derives chroma_offset_lx [i] [j] set in the index information (step S702).

続いて、インデックス再構成部１１０Ｂは、色差信号の最大値（最大信号）の中間値を導出する（ステップＳ７０３）。 Subsequently, the index reconstruction unit 110B derives an intermediate value of the maximum value (maximum signal) of the color difference signal (step S703).

続いて、インデックス再構成部１１０Ｂは、delta_chroma_offsett_lx[i][j]を導出し、インデックス情報に設定する（ステップＳ７０４）。 Subsequently, the index reconstruction unit 110B derives delta_chroma_offsett_lx [i] [j] and sets it as index information (step S704).

図２１は、第１実施形態のchroma_offset_lx[i][j]の復元処理の一例を示すフローチャートである。 FIG. 21 is a flowchart illustrating an example of the restoration process of chroma_offset_lx [i] [j] according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に既に設定されているchroma_log2_weight_denomを導出する（ステップＳ８０１）。 First, the index reconstruction unit 110B derives chroma_log2_weight_denom already set in the index information (step S801).

続いて、インデックス再構成部１１０Ｂは、インデックス情報に設定されているchroma_offset_lx[i][j]を導出する（ステップＳ８０２）。 Subsequently, the index reconstruction unit 110B derives chroma_offset_lx [i] [j] set in the index information (step S802).

続いて、インデックス再構成部１１０Ｂは、色差信号の最大値（最大信号）の中間値を導出する（ステップＳ８０３）。 Subsequently, the index reconstruction unit 110B derives an intermediate value of the maximum value (maximum signal) of the color difference signal (step S803).

続いて、インデックス再構成部１１０Ｂは、chroma_offsett_lx[i][j]を導出し、インデックス情報に設定する（ステップＳ８０４）。 Subsequently, the index reconstruction unit 110B derives chroma_offsett_lx [i] [j] and sets it as index information (step S804).

色差信号の信号特性を利用して、中央値からのズレ量を考慮した予測値を導入することにより、オフセット値をそのまま符号化する場合と比較して、色差信号のオフセット値の符号量を削減できる。 By using a signal characteristic of the color difference signal and introducing a prediction value that takes into account the amount of deviation from the median value, the code amount of the offset value of the color difference signal is reduced compared to when the offset value is encoded as it is. it can.

次に、Ｈ．２６４などで定義されている重み付き予測の暗黙的重み付き予測のＷＰパラメータ導出方法を用いた重み係数及び固定小数点精度の予測値導出手法を説明する。Ｈ．２６４の暗黙的重み付き予測では、参照スライス間の時間的距離（POC番号の時間比率）に応じて、重み係数を導出している（オフセットは０となる）。参照スライス間の時間的距離は、POC番号に基づいて、符号化対象スライスと参照スライス間の距離を導出し、その距離比に基づいて重み係数が定まる。この際、固定小数点精度は固定値5に設定される。 Next, H.I. A weight coefficient and a prediction value derivation method with fixed-point accuracy using the WP parameter derivation method of implicit weighted prediction of weighted prediction defined in H.264 and the like will be described. H. In the H.264 implicit weighted prediction, a weighting factor is derived according to the temporal distance between reference slices (time ratio of POC number) (offset is 0). As the temporal distance between reference slices, the distance between the encoding target slice and the reference slice is derived based on the POC number, and the weight coefficient is determined based on the distance ratio. At this time, the fixed-point precision is set to a fixed value of 5.

例えばＨ．２６４では、数式（２９）に示す疑似コードに従って重み係数が導出される。 For example, H.C. In H.264, a weighting factor is derived according to the pseudo code shown in Equation (29).

td = Clip3(-128, 127, POCA - POCB)
tb = Clip3(-128, 127, POCT - POCA)
tx = ( td != 0 ) ? ( ( 16384 + abs( td / 2 ) ) / td ) : (0)
DistScaleFactor = Clip3( -1024, 1023, ( tb * tx + 32 ) >> 6 )
implicit_luma_weight_l0[i] = 64 - (DistScaleFactor >> 2)
implicit_luma_weight_l1[i] = DistScaleFactor >> 2 …（２９） td = Clip3 (-128, 127, POCA-POCB)
tb = Clip3 (-128, 127, POCT-POCA)
tx = (td! = 0)? ((16384 + abs (td / 2)) / td): (0)
DistScaleFactor = Clip3 (-1024, 1023, (tb * tx + 32) >> 6)
implicit_luma_weight_l0 [i] = 64-(DistScaleFactor >> 2)
implicit_luma_weight_l1 [i] = DistScaleFactor >> 2… (29)

ここで、POCAは、リスト１に対応する参照画像AのPOC番号、POCBは、リスト０に対応する参照画像BのPOC番号、POCTは、予測対象画像のPOC番号を示している。Clip3(L,M,N)は、最後の引数Nが、最初の２つが示す最小値Lと最大値Mの範囲を超えないようにクリップ処理を行う関数である。abs()関数は、引数の絶対値を返す関数である。td及びtbは、時間比を表しており、tdは、リスト１に対応する参照画像のPOC番号とリスト０に対応する参照画像のPOC番号の差、tbは、予測対象画像のPOC番号とリスト０に対応する参照画像のPOC番号の差を示す。これらの値によって重み係数の距離におけるスケーリング変数DistScaleFactorが導出される。DistScaleFactorに基づいて、リスト０及びリスト１に対応する重み係数（implicit_luma_weight_l0[i]、implicit_luma_weight_l1[i]）が導出される。なお、色差信号についても同様に設定される。インデックス再構成部１１０Ｂは、ここで導出される固定小数点精度implicit_log2_weight_denomを用いて、数式（３０）で固定小数点精度を予測する。 Here, POCA is the POC number of the reference image A corresponding to the list 1, POCB is the POC number of the reference image B corresponding to the list 0, and POCT is the POC number of the prediction target image. Clip3 (L, M, N) is a function that performs clip processing so that the last argument N does not exceed the range between the minimum value L and the maximum value M indicated by the first two. The abs () function is a function that returns the absolute value of an argument. td and tb represent time ratios, td is the difference between the POC number of the reference image corresponding to list 1 and the POC number of the reference image corresponding to list 0, and tb is the POC number and list of the prediction target image A difference in the POC number of the reference image corresponding to 0 is shown. These values derive the scaling variable DistScaleFactor at the weighting factor distance. Based on DistScaleFactor, weight coefficients (implicit_luma_weight_l0 [i], implicit_luma_weight_l1 [i]) corresponding to list 0 and list 1 are derived. The color difference signal is set similarly. The index reconstructing unit 110B predicts the fixed-point precision using Equation (30) using the fixed-point precision implicit_log2_weight_denom derived here.

delta_luma_log2_weight_denom = (luma_log2_weight_denom - implicit_log2_weight_denom) …（３０） delta_luma_log2_weight_denom = (luma_log2_weight_denom-implicit_log2_weight_denom)… (30)

なお、色差信号の固定小数点精度も数式（３０）で予測できる。この値は、数式（３１）で復元される。 Note that the fixed-point precision of the color difference signal can also be predicted by Expression (30). This value is restored by Equation (31).

luma_log2_weight_denom = (delta_luma_log2_weight_denom + implicit_log2_weight_denom) …（３１） luma_log2_weight_denom = (delta_luma_log2_weight_denom + implicit_log2_weight_denom)… (31)

なお、色差信号の固定小数点精度も数式（３１）と同様の方法で復元できる。 Note that the fixed-point precision of the color difference signal can also be restored by the same method as in equation (31).

次に、重み係数の予測式を説明する。暗黙的重み係数をimplicit_luma_weight_lx[i]とすると、インデックス再構成部１１０Ｂは、数式（３２）で重み係数luma_weight_lx[i]を予測し、数式（３３）で復元する。 Next, the prediction formula of the weight coefficient will be described. Assuming that the implicit weight coefficient is implicit_luma_weight_lx [i], the index reconstruction unit 110B predicts the weight coefficient luma_weight_lx [i] using Expression (32) and restores it using Expression (33).

if(luma_log2_weight_denom >= implicit_log2_weight_denom ) ｛
norm_denom = (luma_log2_weight_denom - implicit_log2_weight_denom)
delta_luma_weight_lx[i] = (luma_weight_lx[i] - (implicit_luma_weight_lx[i] << norm_denom) )
｝
else｛
norm_denom = (implicit_log2_weight_denom - luma_log2_weight_denom)
delta_luma_weight_lx[i] = (luma_weight_lx[i] - (implicit_luma_weight_lx[i] >> norm_denom) )
｝ …（３２） if (luma_log2_weight_denom> = implicit_log2_weight_denom) {
norm_denom = (luma_log2_weight_denom-implicit_log2_weight_denom)
delta_luma_weight_lx [i] = (luma_weight_lx [i]-(implicit_luma_weight_lx [i] << norm_denom))
}
else {
norm_denom = (implicit_log2_weight_denom-luma_log2_weight_denom)
delta_luma_weight_lx [i] = (luma_weight_lx [i]-(implicit_luma_weight_lx [i] >> norm_denom))
} (32)

ここでは、インデックス再構成部１１０Ｂは、暗黙的重み付き予測の固定小数点精度から大きいか小さいかにより、重み係数を補正して予測に利用する。 Here, the index reconstruction unit 110B corrects the weighting coefficient and uses it for prediction depending on whether the fixed-point precision of the implicit weighted prediction is larger or smaller.

if(luma_log2_weight_denom >= implicit_log2_weight_denom ) ｛
norm_denom = (luma_log2_weight_denom - implicit_log2_weight_denom)
luma_weight_lx[i] = (delta_luma_weight_lx[i] + (implicit_luma_weight_lx[i] << norm_denom) )
｝
else｛
norm_denom = (implicit_log2_weight_denom - luma_log2_weight_denom)
luma_weight_lx[i] = (delta_luma_weight_lx[i] + (implicit_luma_weight_lx[i] >> norm_denom) )
｝ …（３３） if (luma_log2_weight_denom> = implicit_log2_weight_denom) {
norm_denom = (luma_log2_weight_denom-implicit_log2_weight_denom)
luma_weight_lx [i] = (delta_luma_weight_lx [i] + (implicit_luma_weight_lx [i] << norm_denom))
}
else {
norm_denom = (implicit_log2_weight_denom-luma_log2_weight_denom)
luma_weight_lx [i] = (delta_luma_weight_lx [i] + (implicit_luma_weight_lx [i] >> norm_denom))
} (33)

なお、数式（３２）では輝度成分の重み係数の例を示したが、色差成分についても同様な方法を用いることにより予測値を導出できる。 In addition, although the example of the weighting coefficient of the luminance component is shown in Expression (32), the predicted value can be derived for the color difference component by using the same method.

図２２は、第１実施形態のluma_weight_lx[i]の予測処理の他の例を示すフローチャートである。 FIG. 22 is a flowchart illustrating another example of the luma_weight_lx [i] prediction process according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に設定されているluma_log2_weight_denomを導出する（ステップＳ９０１）。 First, the index reconstruction unit 110B derives luma_log2_weight_denom set in the index information (step S901).

続いて、インデックス再構成部１１０Ｂは、Ｈ．２６４の暗黙的重み付き予測の導出方法に従って、implicit_log2_weight_denom及びimplicit_luma_weight_lx[i]を導出する（ステップＳ９０２、Ｓ９０３）。 Subsequently, the index reconstruction unit 110B performs the H.264 operation. In accordance with the H.264 implicit weighted prediction derivation method, implicit_log2_weight_denom and implicit_luma_weight_lx [i] are derived (steps S902 and S903).

続いて、インデックス再構成部１１０Ｂは、luma_log2_weight_denomがimplicit_log2_weight_denom以上かどうかを判断する（ステップＳ９０４）。 Subsequently, the index reconstruction unit 110B determines whether luma_log2_weight_denom is equal to or greater than implicit_log2_weight_denom (step S904).

luma_log2_weight_denomがimplicit_log2_weight_denom以上の場合（ステップＳ９０４でＹｅｓ）、インデックス再構成部１１０Ｂは、luma_log2_weight_denomからimplicit_log2_weight_denomを減算し（ステップＳ９０５）、implicit_luma_weight_lx[i]を減算した値分左シフトし、予測値を導出する（ステップＳ９０６）。 When luma_log2_weight_denom is greater than or equal to implicit_log2_weight_denom (Yes in step S904), the index reconstruction unit 110B subtracts implicit_log2_weight_denom from luma_log2_weight_denom (step S905), and shifts the prediction value to the left by subtracting the value obtained by subtracting implicit_luma_weight_lx [i]. Step S906).

一方、luma_log2_weight_denomがimplicit_log2_weight_denom以上でない場合（ステップＳ９０４でＮｏ）、インデックス再構成部１１０Ｂは、implicit_log2_weight_denomからluma_log2_weight_denomを減算し（ステップＳ９０７）、implicit_luma_weight_lx[i]を減算した値分右シフトし、予測値を導出する（ステップＳ９０８）。 On the other hand, if luma_log2_weight_denom is not equal to or greater than implicit_log2_weight_denom (No in step S904), the index reconfiguration unit 110B subtracts luma_log2_weight_denom from implicit_log2_weight_denom (step S907), and shifts the prediction value to the right by the value obtained by subtracting implicit_luma_weight_lx [i]. (Step S908).

続いて、インデックス再構成部１１０Ｂは、luma_weight_lx[i]から導出した予測値を減算し（ステップＳ９０９）、減算した値（差分値）をインデックス情報に設定する（ステップＳ９１０）。 Subsequently, the index reconstruction unit 110B subtracts the predicted value derived from luma_weight_lx [i] (step S909), and sets the subtracted value (difference value) in the index information (step S910).

図２３は、第１実施形態のluma_weight_lx[i]の復元処理の他の例を示すフローチャートである。 FIG. 23 is a flowchart illustrating another example of the luma_weight_lx [i] restoration process according to the first embodiment.

まず、インデックス再構成部１１０Ｂは、インデックス情報に既に設定されているluma_log2_weight_denomを導出する（ステップＳ１００１）。 First, the index reconstruction unit 110B derives luma_log2_weight_denom that has already been set in the index information (step S1001).

続いて、インデックス再構成部１１０Ｂは、Ｈ．２６４の暗黙的重み付き予測の導出方法に従って、implicit_log2_weight_denom及びimplicit_luma_weight_lx[i]を導出する（ステップＳ１００２、Ｓ１００３）。 Subsequently, the index reconstruction unit 110B performs the H.264 operation. In accordance with the H.264 implicit weighted prediction derivation method, implicit_log2_weight_denom and implicit_luma_weight_lx [i] are derived (steps S1002 and S1003).

続いて、インデックス再構成部１１０Ｂは、luma_log2_weight_denomがimplicit_log2_weight_denom以上かどうかを判断する（ステップＳ１００４）。 Subsequently, the index reconstruction unit 110B determines whether luma_log2_weight_denom is equal to or greater than implicit_log2_weight_denom (step S1004).

luma_log2_weight_denomがimplicit_log2_weight_denom以上の場合（ステップＳ１００４でＹｅｓ）、インデックス再構成部１１０Ｂは、luma_log2_weight_denomからimplicit_log2_weight_denomを減算し（ステップＳ１００５）、implicit_luma_weight_lx[i]を減算した値分左シフトし、予測値を導出する（ステップＳ１００６）。 When luma_log2_weight_denom is greater than or equal to implicit_log2_weight_denom (Yes in step S1004), the index reconstruction unit 110B subtracts implicit_log2_weight_denom from luma_log2_weight_denom (step S1005), and shifts the predicted value to the left by subtracting the value obtained by subtracting implicit_luma_weight_lx [i]. Step S1006).

一方、luma_log2_weight_denomがimplicit_log2_weight_denom以上でない場合（ステップＳ１００４でＮｏ）、インデックス再構成部１１０Ｂは、implicit_log2_weight_denomからluma_log2_weight_denomを減算し（ステップＳ１００７）、implicit_luma_weight_lx[i]を減算した値分右シフトし、予測値を導出する（ステップＳ１００８）。 On the other hand, if luma_log2_weight_denom is not equal to or greater than implicit_log2_weight_denom (No in step S1004), the index reconfiguration unit 110B subtracts luma_log2_weight_denom from implicit_log2_weight_denom (step S1007), and shifts the prediction value to the right by the value obtained by subtracting it. (Step S1008).

続いて、インデックス再構成部１１０Ｂは、delta_luma_weight_lx[i]に導出した予測値を加算し（ステップＳ１００９）、加算した値をインデックス情報に設定する（ステップＳ１０１０）。 Subsequently, the index reconstruction unit 110B adds the predicted value derived to delta_luma_weight_lx [i] (step S1009), and sets the added value as index information (step S1010).

なお、上記で説明した複数の予測手法は、単独で使用するだけでなく組み合わせて利用することもできる。例えば、数式（１０）、数式（１２）〜（１３）、数式（１５）〜（１６）、及び数式（２６）〜（２７）を組み合わせるなどとすることにより、インデックス情報におけるシンタクス要素の符号量を効率良く削減することが可能となる。 The plurality of prediction methods described above can be used not only independently but also in combination. For example, the code amount of the syntax element in the index information is obtained by combining the mathematical formula (10), the mathematical formulas (12) to (13), the mathematical formulas (15) to (16), and the mathematical formulas (26) to (27). Can be efficiently reduced.

以上のように第１実施形態では、インデックス設定部１０８は、ＷＰパラメータ情報を対応するシンタクス構成にマッピングしたインデックス情報を出力し、インデックス再構成部１１０Ｂは、シンタクス要素の冗長な表現を、当該スライス内で符号化される情報に基づいて予測する。従って、第１実施形態によれば、シンタクス要素をそのまま（直値で）符号化する場合と比較して、符号量を削減することができる。 As described above, in the first embodiment, the index setting unit 108 outputs the index information obtained by mapping the WP parameter information to the corresponding syntax configuration, and the index reconstruction unit 110B converts the redundant representation of the syntax element into the slice. Prediction based on the information encoded within. Therefore, according to the first embodiment, it is possible to reduce the code amount as compared with the case where the syntax element is encoded as it is (directly).

ここで、符号化対象スライスで利用されるシンタクス要素の定義順（符号化順序）に基づいて、既に符号化済みのシンタクス要素から画面内相関として予測値を導出することや、明度変化がないことを仮定したデフォルト値から、予測値を導出することで、シンタクス要素の特徴を活かした予測が可能となり、結果としてシンタクス要素の符号化に必要なオーバーヘッドを削減できる効果を奏する。 Here, based on the definition order (encoding order) of syntax elements used in the encoding target slice, a prediction value is derived as an intra-screen correlation from already encoded syntax elements, and there is no change in brightness. By deriving a predicted value from a default value that assumes the above, it is possible to make a prediction that makes use of the characteristics of syntax elements, and as a result, there is an effect of reducing the overhead required for encoding syntax elements.

なお、第１実施形態の図１０〜図１３に例示するシンタクステーブルの行間には、本実施形態において規定していないシンタクス要素が挿入されてもよいし、その他の条件分岐に関する記述が含まれていてもよい。また、シンタクステーブルを複数のテーブルに分割したり、複数のシンタクステーブルを統合したりしてもよい。また、例示した各シンタクス要素の用語は、任意に変更可能である。 Note that syntax elements not defined in the present embodiment may be inserted between the rows of the syntax tables illustrated in FIGS. 10 to 13 of the first embodiment, and descriptions regarding other conditional branches are included. May be. Further, the syntax table may be divided into a plurality of tables, or a plurality of syntax tables may be integrated. Moreover, the term of each illustrated syntax element can be changed arbitrarily.

以上説明したように、第１実施形態の符号化装置１００は、符号化する情報のパラメータの相関を利用して空間冗長性を削除することにより、符号化効率が低下する問題を解消する。符号化装置１００は、重み付き動き補償予測で用いるシンタクス要素をそのまま（直値で）符号化していた従来の構成と比較して、符号量を削減することができる。 As described above, the encoding apparatus 100 according to the first embodiment eliminates the problem of a decrease in encoding efficiency by deleting spatial redundancy using the correlation of parameters of information to be encoded. The encoding apparatus 100 can reduce the amount of codes compared to a conventional configuration in which syntax elements used in weighted motion compensation prediction are encoded as they are (direct values).

（第２実施形態）
第２実施形態では、第１実施形態の符号化装置で符号化された符号化データを復号する復号装置について説明する。 (Second Embodiment)
In the second embodiment, a decoding device that decodes encoded data encoded by the encoding device of the first embodiment will be described.

図２４は、第２実施形態の復号装置８００の構成の一例を示すブロック図である。 FIG. 24 is a block diagram illustrating an example of the configuration of the decoding device 800 according to the second embodiment.

復号装置８００は、図示せぬ入力バッファなどに蓄積された符号化データを復号画像に復号し、出力画像として図示せぬ出力バッファに出力する。符号化データは、例えば、図１の符号化装置１００などから出力され、図示せぬ蓄積系、伝送系、又はバッファなどを経て、復号装置８００に入力される。 The decoding device 800 decodes encoded data stored in an input buffer (not shown) into a decoded image and outputs the decoded image to an output buffer (not shown) as an output image. The encoded data is output from, for example, the encoding device 100 of FIG. 1 and the like, and is input to the decoding device 800 via a storage system, a transmission system, or a buffer (not shown).

復号装置８００は、図２４に示すように、復号部８０１と、逆量子化部８０２と、逆直交変換部８０３と、加算部８０４と、予測画像生成部８０５と、インデックス設定部８０６とを、備える。逆量子化部８０２、逆直交変換部８０３、加算部８０４、予測画像生成部８０５は、それぞれ、図１の逆量子化部１０４、逆直交変換部１０５、加算部１０６、予測画像生成部１０７、と実質的に同一又は類似の要素である。なお、図２４に示す復号制御部８０７は、復号装置８００を制御するものであり、例えば、ＣＰＵなどにより実現できる。 As shown in FIG. 24, the decoding apparatus 800 includes a decoding unit 801, an inverse quantization unit 802, an inverse orthogonal transform unit 803, an addition unit 804, a predicted image generation unit 805, and an index setting unit 806. Prepare. The inverse quantization unit 802, the inverse orthogonal transform unit 803, the addition unit 804, and the predicted image generation unit 805 are respectively the inverse quantization unit 104, the inverse orthogonal transform unit 105, the addition unit 106, the predicted image generation unit 107, FIG. Are substantially the same or similar elements. Note that the decoding control unit 807 shown in FIG. 24 controls the decoding device 800 and can be realized by, for example, a CPU.

復号部８０１は、符号化データの復号のために、１フレーム又は１フィールド毎にシンタクスに基づいて解読を行う。復号部８０１は、エントロピー復号部８０１Ａと、インデックス再構成部８０１Ｂとを、備える。 The decoding unit 801 performs decoding based on the syntax for each frame or field in order to decode the encoded data. The decoding unit 801 includes an entropy decoding unit 801A and an index reconstruction unit 801B.

エントロピー復号部８０１Ａは、各シンタクスの符号列を順次エントロピー復号し、予測モード、動きベクトル、及び参照番号などを含む動き情報、重み付き動き補償予測のためのインデックス情報、並びに量子化変換係数などの符号化対象ブロックの符号化パラメータを再生する。ここで、符号化パラメータとは、上記以外にも変換係数に関する情報、量子化に関する情報、などの復号に必要となるすべてのパラメータである。 The entropy decoding unit 801A sequentially entropy-decodes the code string of each syntax, such as motion information including a prediction mode, a motion vector, and a reference number, index information for weighted motion compensated prediction, a quantization transform coefficient, and the like. The encoding parameter of the encoding target block is reproduced. Here, the encoding parameters are all parameters necessary for decoding such as information on transform coefficients and information on quantization in addition to the above.

具体的には、エントロピー復号部８０１Ａは、入力されてきた符号化データに対して可変長復号処理や算術復号処理などの復号処理を行う機能を有する。例えば、Ｈ．２６４ではコンテキスト適応型の可変長符号化（ＣＡＶＬＣ：ＣｏｎｔｅｘｔｂａｓｅｄＡｄａｐｔｉｖｅＶａｒｉａｂｌｅＬｅｎｇｔｈＣｏｄｉｎｇ）やコンテキスト適応型の算術符号化（ＣＡＢＡＣ：ＣｏｎｔｅｘｔｂａｓｅｄＡｄａｐｔｉｖｅＢｉｎａｒｙＡｒｉｔｈｍｅｔｉｃＣｏｄｉｎｇ）などが用いられている。これらの処理は解読処理とも呼ばれる。 Specifically, the entropy decoding unit 801A has a function of performing decoding processing such as variable length decoding processing and arithmetic decoding processing on the input encoded data. For example, H.M. In H.264, context-adaptive variable-length coding (CAVLC) and context-adaptive arithmetic coding (CABAC: Context-based Adaptive Arbitrary Coding) are used. These processes are also called decryption processes.

インデックス再構成部８０１Ｂは、解読されたインデックス情報を復元し、インデックス情報を再構成する。具体的には、インデックス再構成部８０１Ｂは、解読されたインデックス情報のシンタクス要素の符号長を削減するために、シンタクス要素のパラメータの特徴に応じて予測処理を行い、シンタクス要素を復元し、インデックス情報を再構成する。予測処理の具体例は、後述する。 The index reconstruction unit 801B restores the decrypted index information and reconstructs the index information. Specifically, the index reconfiguration unit 801B performs prediction processing according to the characteristics of the syntax element parameters to reduce the code length of the decoded index information syntax element, restores the syntax element, and Reconstruct information. A specific example of the prediction process will be described later.

復号部８０１は、動き情報、インデックス情報、及び量子化変換係数を出力し、量子化変換係数を逆量子化部８０２に入力し、インデックス情報をインデックス設定部８０６に入力し、動き情報を予測画像生成部８０５に入力する。 The decoding unit 801 outputs the motion information, the index information, and the quantized transform coefficient, inputs the quantized transform coefficient to the inverse quantization unit 802, inputs the index information to the index setting unit 806, and converts the motion information into the predicted image. Input to the generation unit 805.

逆量子化部８０２は、復号部８０１から入力された量子化変換係数に対して逆量子化処理を行い、復元変換係数を得る。具体的には、逆量子化部８０２は、復号部８０１において使用された量子化情報に従って逆量子化を行う。より詳細には、逆量子化部８０２は、量子化情報によって導出された量子化ステップサイズを量子化変換係数に乗算し、復元変換係数を得る。逆量子化部８０２は、復元変換係数を出力し、逆直交変換部８０３に入力する。 The inverse quantization unit 802 performs an inverse quantization process on the quantized transform coefficient input from the decoding unit 801 to obtain a restored transform coefficient. Specifically, the inverse quantization unit 802 performs inverse quantization according to the quantization information used in the decoding unit 801. More specifically, the inverse quantization unit 802 multiplies the quantized transform coefficient by the quantization step size derived from the quantization information to obtain a restored transform coefficient. The inverse quantization unit 802 outputs the restored transform coefficient and inputs it to the inverse orthogonal transform unit 803.

逆直交変換部８０３は、逆量子化部８０２から入力された復元変換係数に対して、符号化側において行われた直交変換に対応する逆直交変換を行い、復元予測誤差を得る。逆直交変換部８０３は、復元予測誤差を出力し、加算部８０４に入力する。 The inverse orthogonal transform unit 803 performs inverse orthogonal transform corresponding to the orthogonal transform performed on the encoding side on the reconstructed transform coefficient input from the inverse quantization unit 802 to obtain a reconstructed prediction error. The inverse orthogonal transform unit 803 outputs the restoration prediction error and inputs it to the addition unit 804.

加算部８０４は、逆直交変換部８０３から入力された復元予測誤差と、対応する予測画像とを加算し、復号画像を生成する。加算部８０４は、復号画像を出力し、予測画像生成部８０５に入力する。また加算部８０４は、復号画像を出力画像として外部に出力する。出力画像は、その後図示せぬ外部の出力バッファなどに一時的に蓄積され、例えば、復号制御部８０７によって管理される出力タイミングに従って、図示せぬディスプレイやモニタなどの表示装置系又は映像デバイス系へ出力される。 The adding unit 804 adds the restored prediction error input from the inverse orthogonal transform unit 803 and the corresponding predicted image to generate a decoded image. The adding unit 804 outputs the decoded image and inputs the decoded image to the predicted image generation unit 805. The adding unit 804 outputs the decoded image to the outside as an output image. The output image is then temporarily stored in an external output buffer (not shown) or the like. For example, in accordance with the output timing managed by the decoding control unit 807, the output image is sent to a display device system or a video device system (not shown). Is output.

インデックス設定部８０６は、復号部８０１から入力されたインデックス情報を受け取り、ＷＰパラメータ情報に変換して出力し、予測画像生成部８０５に入力する。具体的には、インデックス設定部８０６は、エントロピー復号部８０１Ａで復号処理され、インデックス再構成部８０１Ｂで再構成されたインデックス情報を受け取る。そしてインデックス設定部８０６は、参照画像のリストと参照番号を確認して、ＷＰパラメータ情報に変換し、変換したＷＰパラメータ情報を予測画像生成部８０５へ出力する。ＷＰパラメータ情報については、図８Ａ及び図８Ｂを参照して既に説明しているため、説明を省略する。 The index setting unit 806 receives the index information input from the decoding unit 801, converts it into WP parameter information, outputs it, and inputs it to the predicted image generation unit 805. Specifically, the index setting unit 806 receives the index information decoded by the entropy decoding unit 801A and reconfigured by the index reconfiguration unit 801B. Then, the index setting unit 806 confirms the list of reference images and the reference number, converts them into WP parameter information, and outputs the converted WP parameter information to the predicted image generation unit 805. Since the WP parameter information has already been described with reference to FIGS. 8A and 8B, description thereof will be omitted.

予測画像生成部８０５は、復号部８０１から入力された動き情報、インデックス設定部８０６から入力されたＷＰパラメータ情報、及び加算部８０４から入力された復号画像を用いて、予測画像８１５を生成する。 The predicted image generation unit 805 generates a predicted image 815 using the motion information input from the decoding unit 801, the WP parameter information input from the index setting unit 806, and the decoded image input from the addition unit 804.

ここで、図４を参照しながら、予測画像生成部８０５の詳細について説明する。予測画像生成部８０５は、予測画像生成部１０７同様、複数フレーム動き補償部２０１と、メモリ２０２と、単方向動き補償部２０３と、予測パラメータ制御部２０４と、参照画像セレクタ２０５と、フレームメモリ２０６と、参照画像制御部２０７と、を備える。 Here, the details of the predicted image generation unit 805 will be described with reference to FIG. Like the predicted image generation unit 107, the predicted image generation unit 805 includes a multi-frame motion compensation unit 201, a memory 202, a unidirectional motion compensation unit 203, a prediction parameter control unit 204, a reference image selector 205, and a frame memory 206. And a reference image control unit 207.

フレームメモリ２０６は、参照画像制御部２０７の制御の下、加算部１０６から入力された復号画像を参照画像として格納する。フレームメモリ２０６は、参照画像を一時保持するための複数のメモリセットＦＭ１〜ＦＭＮ（Ｎ≧２）を有する。 The frame memory 206 stores the decoded image input from the addition unit 106 as a reference image under the control of the reference image control unit 207. The frame memory 206 has a plurality of memory sets FM1 to FMN (N ≧ 2) for temporarily storing reference images.

予測パラメータ制御部２０４は、復号部８０１から入力される動き情報に基づいて、参照画像番号と予測パラメータとの複数の組み合わせをテーブルとして用意している。ここで、動き情報とは、動き補償予測で用いられる動きのズレ量を示す動きベクトルや参照画像番号、単方向／双方向予測などの予測モードに関する情報などを指す。予測パラメータは、動きベクトル及び予測モードに関する情報を指す。そして予測パラメータ制御部２０４は、動き情報に基づいて、予測画像の生成に用いる参照画像番号と予測パラメータとの組み合わせを選択して出力し、参照画像番号を参照画像セレクタ２０５に入力し、予測パラメータを単方向動き補償部２０３に入力する。 The prediction parameter control unit 204 prepares a plurality of combinations of reference image numbers and prediction parameters as a table based on the motion information input from the decoding unit 801. Here, the motion information indicates a motion vector indicating a shift amount of motion used in motion compensation prediction, a reference image number, information on a prediction mode such as unidirectional / bidirectional prediction, and the like. The prediction parameter refers to information regarding a motion vector and a prediction mode. Then, the prediction parameter control unit 204 selects and outputs a combination of a reference image number and a prediction parameter used for generating a prediction image based on the motion information, inputs the reference image number to the reference image selector 205, and outputs the prediction parameter. Is input to the unidirectional motion compensation unit 203.

参照画像セレクタ２０５は、フレームメモリ２０６が有するフレームメモリＦＭ１〜ＦＭＮのいずれの出力端を接続するかを、予測パラメータ制御部２０４から入力された参照画像番号に従って切り替えるスイッチである。参照画像セレクタ２０５は、例えば、参照画像番号が０であれば、ＦＭ１の出力端を参照画像セレクタ２０５の出力端に接続し、参照画像番号がＮ−１であれば、ＦＭＮの出力端を参照画像セレクタ２０５の出力端に接続する。参照画像セレクタ２０５は、フレームメモリ２０６が有するフレームメモリＦＭ１〜ＦＭＮのうち、出力端が接続されているフレームメモリに格納されている参照画像を出力し、単方向動き補償部２０３へ入力する。なお、復号装置８００では、予測画像生成部８０５以外で参照画像は利用されないため、予測画像生成部８０５の外部へ参照画像を出力しなくてもよい。 The reference image selector 205 is a switch for switching which output terminal of the frame memories FM1 to FMN included in the frame memory 206 is connected according to the reference image number input from the prediction parameter control unit 204. For example, if the reference image number is 0, the reference image selector 205 connects the output end of FM1 to the output end of the reference image selector 205. If the reference image number is N-1, the reference image selector 205 refers to the output end of the FMN. Connected to the output terminal of the image selector 205. The reference image selector 205 outputs the reference image stored in the frame memory to which the output terminal is connected among the frame memories FM1 to FMN included in the frame memory 206, and inputs the reference image to the unidirectional motion compensation unit 203. Note that in the decoding device 800, since the reference image is not used except by the predicted image generation unit 805, the reference image does not have to be output to the outside of the predicted image generation unit 805.

単方向予測動き補償部２０３は、予測パラメータ制御部２０４から入力された予測パラメータと参照画像セレクタ２０５から入力された参照画像に従って、動き補償予測処理を行い、単方向予測画像を生成する。動き補償予測については、図５を参照して既に説明しているため、説明を省略する。 The unidirectional prediction motion compensation unit 203 performs a motion compensation prediction process according to the prediction parameter input from the prediction parameter control unit 204 and the reference image input from the reference image selector 205, and generates a unidirectional prediction image. Since motion compensation prediction has already been described with reference to FIG. 5, description thereof will be omitted.

複数フレーム動き補償部２０１は、メモリ２０２から入力される第一予測画像、単方向予測動き補償部２０３から入力される第二予測画像、及び動き評価部１０９から入力されるＷＰパラメータ情報を用いて、重み付き予測を行って予測画像を生成する。複数フレーム動き補償部２０１は、予測画像を出力し、加算部８０４に入力する。 The multi-frame motion compensation unit 201 uses the first prediction image input from the memory 202, the second prediction image input from the unidirectional prediction motion compensation unit 203, and the WP parameter information input from the motion evaluation unit 109. Then, a prediction image is generated by performing weighted prediction. The multi-frame motion compensation unit 201 outputs a prediction image and inputs the prediction image to the addition unit 804.

ここで、図６を参照しながら、複数フレーム動き補償部２０１の詳細について説明する。複数フレーム動き補償部２０１は、予測画像生成部１０７同様、デフォルト動き補償部３０１と、重み付き動き補償部３０２と、ＷＰパラメータ制御部３０３と、ＷＰセレクタ３０４、３０５とを、備える。 Here, the details of the multi-frame motion compensation unit 201 will be described with reference to FIG. Similar to the predicted image generation unit 107, the multi-frame motion compensation unit 201 includes a default motion compensation unit 301, a weighted motion compensation unit 302, a WP parameter control unit 303, and WP selectors 304 and 305.

ＷＰパラメータ制御部３０３は、インデックス設定部８０６から入力されるＷＰパラメータ情報に基づいて、ＷＰ適用フラグ及び重み情報を出力し、ＷＰ適用フラグをＷＰセレクタ３０４、３０５に入力し、重み情報を重み付き動き補償部３０２に入力する。 The WP parameter control unit 303 outputs a WP application flag and weight information based on the WP parameter information input from the index setting unit 806, inputs the WP application flag to the WP selectors 304 and 305, and weights the weight information. Input to the motion compensation unit 302.

ここで、ＷＰパラメータ情報は、重み係数の固定小数点精度、第一予測画像に対応する第一ＷＰ適用フラグ，第一重み係数，及び第一オフセット、並びに第二予測画像に対応する第二ＷＰ適応フラグ，第二重み係数，及び第二オフセットの情報を含む。ＷＰ適用フラグは、該当する参照画像及び信号成分毎に設定可能なパラメータであり、重み付き動き補償予測を行うかどうかを示す。重み情報は、重み係数の固定小数点精度、第一重み係数、第一オフセット、第二重み係数、及び第二オフセットの情報を含む。なおＷＰパラメータ情報は、第１実施形態と同様の情報を表す。 Here, the WP parameter information includes the fixed-point precision of the weighting factor, the first WP application flag corresponding to the first predicted image, the first weighting factor, the first offset, and the second WP adaptation corresponding to the second predicted image. Contains information on flag, second weighting factor, and second offset. The WP application flag is a parameter that can be set for each corresponding reference image and signal component, and indicates whether to perform weighted motion compensation prediction. The weight information includes information on the fixed point precision of the weight coefficient, the first weight coefficient, the first offset, the second weight coefficient, and the second offset. The WP parameter information represents the same information as in the first embodiment.

詳細には、ＷＰパラメータ制御部３０３は、インデックス設定部８０６からＷＰパラメータ情報が入力されると、ＷＰパラメータ情報を第一ＷＰ適用フラグ、第二ＷＰ適用フラグ、及び重み情報に分離して出力し、第一ＷＰ適用フラグをＷＰセレクタ３０４に入力し、第二ＷＰ適用フラグをＷＰセレクタ３０５に入力し、重み情報を重み付き動き補償部３０２に入力する。 Specifically, when the WP parameter information is input from the index setting unit 806, the WP parameter control unit 303 separates the WP parameter information into a first WP application flag, a second WP application flag, and weight information, and outputs them. The first WP application flag is input to the WP selector 304, the second WP application flag is input to the WP selector 305, and the weight information is input to the weighted motion compensation unit 302.

また、動き情報（予測パラメータ）で示される予測モードが単方向予測である場合、重み付き動き補償部３０２は、第一予測画像のみを用いて、数式（９）に基づいて最終的な予測画像を算出する。 Also, when the prediction mode indicated by the motion information (prediction parameter) is unidirectional prediction, the weighted motion compensation unit 302 uses only the first prediction image and uses the final prediction image based on Equation (9). Is calculated.

重み係数の固定小数点精度については、図７を参照して既に説明しているため、説明を省略する。なお、単方向予測の場合には、第二予測画像に対応する各種パラメータ（第二ＷＰ適応フラグ，第二重み係数，及び第二オフセットの情報）は利用されないため、予め定めた初期値に設定されていてもよい。 The fixed-point precision of the weighting factor has already been described with reference to FIG. In the case of unidirectional prediction, various parameters (second WP adaptive flag, second weighting factor, and second offset information) corresponding to the second predicted image are not used, and are set to predetermined initial values. May be.

復号部８０１は、図９に示すシンタクス５００を利用する。シンタクス５００は、復号部８０１の復号対象の符号化データの構造を示している。シンタクス５００については、図９を参照して既に説明しているため、説明を省略する。また、ピクチャパラメータセットシンタクス５０５については、符号化が復号である点を除き、図１０を参照して既に説明しているため、説明を省略する。また、スライスヘッダーシンタクス５０７についても、符号化が復号である点を除き、図１１を参照して既に説明しているため、説明を省略する。また、プレッドウェイトテーブルシンタクス５０８についても、符号化が復号である点を除き、図１２を参照して既に説明しているため、説明を省略する。 Decoding section 801 uses syntax 500 shown in FIG. A syntax 500 indicates a structure of encoded data to be decoded by the decoding unit 801. The syntax 500 has already been described with reference to FIG. Also, the picture parameter set syntax 505 has already been described with reference to FIG. 10 except that encoding is decoding, and thus description thereof will be omitted. The slice header syntax 507 has already been described with reference to FIG. 11 except that the encoding is decoding, and thus the description thereof is omitted. Also, the treadweight table syntax 508 has already been described with reference to FIG. 12 except that the encoding is decoding, and thus the description thereof will be omitted.

ここで、シンタクス構成における重み付き予測に関連するそれぞれのシンタクス要素の予測方法の詳細について説明する。シンタクス要素の予測は、インデックス再構成部８０１Ｂにより行われる。第２実施形態の予測方法を明示的に示したシンタクス構成は、第２実施形態と同様であり、図１３に示すとおりである。 Here, the detail of the prediction method of each syntax element relevant to the weighted prediction in a syntax structure is demonstrated. The prediction of syntax elements is performed by the index reconstruction unit 801B. The syntax configuration that explicitly shows the prediction method of the second embodiment is the same as that of the second embodiment, as shown in FIG.

重み係数の固定小数点精度を示すluma_log2_weight_denom及びchroma_log2_weight_denomの信号間の予測方法については、数式（１１）を用いて、復元処理が行われる。復元処理の詳細は、図１５に示すとおりである。 For the prediction method between the signals of luma_log2_weight_denom and chroma_log2_weight_denom indicating the fixed point precision of the weighting coefficient, restoration processing is performed using Equation (11). The details of the restoration process are as shown in FIG.

輝度及び色差信号の重み係数を表すluma_weight_lx[i]及びchroma_weight_lx[i][j]の予測方法については、数式（１４）及び（１７）を用いて、復元処理が行われる。復元処理の詳細は、図１７に示すとおりである。 For the prediction method of luma_weight_lx [i] and chroma_weight_lx [i] [j] representing the weighting coefficients of the luminance and chrominance signals, restoration processing is performed using Equations (14) and (17). The details of the restoration process are as shown in FIG.

輝度及び色差信号の重み係数（luma_weight_lx[i]及びchroma_weight_lx[i][j]）の予測値を、異なる参照番号又は異なるＰＯＣ番号で導出する予測方法については、数式（１９）及び（２１）を用いて、復元処理が行われる。復元処理の詳細は、図１９に示すとおりである。 For prediction methods for deriving predicted values of luminance and color difference signal weight coefficients (luma_weight_lx [i] and chroma_weight_lx [i] [j]) with different reference numbers or different POC numbers, Equations (19) and (21) are used. The restoration process is performed using the data. The details of the restoration process are as shown in FIG.

輝度及び色差信号の重み係数（luma_weight_lx[i]及びchroma_weight_lx[i][j]）の予測値を、符号化対象の参照スライスとの距離を用いて導出する予測方法については、数式（２３）及び（２５）を用いて、復元処理が行われる。復元処理の詳細は、図１９のフローチャートにおいて、baseidxにi-1番目の値（i≠０）を導入することと等価である。 For a prediction method for deriving predicted values of luminance and color difference signal weighting coefficients (luma_weight_lx [i] and chroma_weight_lx [i] [j]) using a distance from a reference slice to be encoded, Equation (23) and The restoration process is performed using (25). The details of the restoration process are equivalent to introducing the i−1th value (i ≠ 0) into baseidx in the flowchart of FIG.

Ｈ．２６４などで定義されている重み付き予測の暗黙的重み付き予測のＷＰパラメータ導出方法を用いた重み係数及び固定小数点精度の予測値導出手法については、数式（３１）及び（３３）を用いて、復元処理が行われる。復元処理の詳細は、図２３に示すとおりである。 H. For the weighting coefficient and fixed-point precision prediction value derivation method using the WP parameter derivation method of implicit weighted prediction of weighted prediction defined in H.264 etc., using Equations (31) and (33), A restoration process is performed. Details of the restoration process are as shown in FIG.

なお、上記説明した複数の予測手法は、単独で使用するだけでなく組み合わせて利用することもできる。例えば、数式（１１）、数式（１４）、数式（１７）、及び数式（２８）を組み合わせるなどとすることにより、インデックス情報におけるシンタクス要素の符号量を効率良く削減することが可能となる。 The plurality of prediction methods described above can be used not only independently but also in combination. For example, by combining Formula (11), Formula (14), Formula (17), and Formula (28), it is possible to efficiently reduce the code amount of syntax elements in the index information.

以上のように第２実施形態では、復号装置８００は、符号化する情報のパラメータの相関を利用して空間冗長性を削除することにより、符号化効率が低下する問題を解消する。復号装置８００は、重み付き動き補償予測で用いるシンタクス要素をそのまま（直値で）符号化していた従来の構成と比較して、符号量を削減することができる。 As described above, in the second embodiment, the decoding apparatus 800 eliminates the problem of lowering the coding efficiency by deleting the spatial redundancy using the correlation of the parameters of the information to be coded. Decoding apparatus 800 can reduce the amount of code compared to a conventional configuration in which syntax elements used in weighted motion compensation prediction are encoded as they are (direct values).

（変形例）
上記第１〜第２実施形態では、フレームを１６×１６画素サイズなどの矩形ブロックに分割し、画面左上のブロックから右下に向かって順に符号化／復号を行う例について説明している（図２Ａを参照）。しかしながら、符号化順序及び復号順序はこの例に限定されない。例えば、右下から左上に向かって順に符号化及び復号が行われてもよいし、画面中央から画面端に向かって渦巻を描くように符号化及び復号が行われてもよい。更に、右上から左下に向かって順に符号化及び復号が行われてもよいし、画面端から画面中央に向かって渦巻きを描くように符号化及び復号が行われてもよい。この場合、符号化順序によって参照できる隣接画素ブロックの位置が変わるので、適宜利用可能な位置に変更すればよい。 (Modification)
In the first to second embodiments, an example is described in which a frame is divided into rectangular blocks of 16 × 16 pixel size and the like, and encoding / decoding is sequentially performed from the upper left block to the lower right side of the screen (FIG. See 2A). However, the encoding order and the decoding order are not limited to this example. For example, encoding and decoding may be performed in order from the lower right to the upper left, or encoding and decoding may be performed so as to draw a spiral from the center of the screen toward the screen end. Furthermore, encoding and decoding may be performed in order from the upper right to the lower left, or encoding and decoding may be performed so as to draw a spiral from the screen end toward the center of the screen. In this case, since the position of the adjacent pixel block that can be referred to changes depending on the encoding order, the position may be changed to a usable position as appropriate.

上記第１〜第２実施形態では、４×４画素ブロック、８×８画素ブロック、１６×１６画素ブロックなどの予測対象ブロックサイズを例示して説明を行ったが、予測対象ブロックは均一なブロック形状でなくてもよい。例えば、予測対象ブロックサイズは、１６×８画素ブロック、８×１６画素ブロック、８×４画素ブロック、４×８画素ブロックなどであってもよい。また、１つのコーディングツリーブロック内で全てのブロックサイズを統一させる必要はなく、複数の異なるブロックサイズを混在させてもよい。１つのコーディングツリーブロック内で複数の異なるブロックサイズを混在させる場合、分割数の増加に伴って分割情報を符号化又は復号するための符号量も増加する。そこで、分割情報の符号量と局部復号画像または復号画像の品質との間のバランスを考慮して、ブロックサイズを選択することが望ましい。 In the first to second embodiments described above, the prediction target block size such as a 4 × 4 pixel block, an 8 × 8 pixel block, a 16 × 16 pixel block, and the like has been described as an example, but the prediction target block is a uniform block. It does not have to be a shape. For example, the prediction target block size may be a 16 × 8 pixel block, an 8 × 16 pixel block, an 8 × 4 pixel block, a 4 × 8 pixel block, or the like. Moreover, it is not necessary to unify all the block sizes within one coding tree block, and a plurality of different block sizes may be mixed. When a plurality of different block sizes are mixed in one coding tree block, the code amount for encoding or decoding the division information increases as the number of divisions increases. Therefore, it is desirable to select the block size in consideration of the balance between the code amount of the division information and the quality of the locally decoded image or the decoded image.

上記第１〜第２実施形態では、簡単化のために、輝度信号と色差信号における予測処理とを区別せず、色信号成分に関して包括的な説明を記述した。しかしながら、予測処理が輝度信号と色差信号との間で異なる場合には、同一または異なる予測方法が用いられてよい。輝度信号と色差信号との間で異なる予測方法が用いられるならば、色差信号に対して選択した予測方法を輝度信号と同様の方法で符号化又は復号できる。 In the first to second embodiments, for the sake of simplification, a comprehensive description of the color signal component is described without distinguishing between the prediction process for the luminance signal and the color difference signal. However, when the prediction process is different between the luminance signal and the color difference signal, the same or different prediction methods may be used. If different prediction methods are used between the luminance signal and the color difference signal, the prediction method selected for the color difference signal can be encoded or decoded in the same manner as the luminance signal.

上記第１〜第２実施形態では、簡単化のために、輝度信号と色差信号における重み付き動き補償予測処理とを区別せず、色信号成分に関して包括的な説明を記述した。しかしながら、重み付き動き補償予測処理が輝度信号と色差信号との間で異なる場合には、同一または異なる重み付き動き補償予測処理が用いられてよい。輝度信号と色差信号との間で異なる重み付き動き補償予測処理が用いられるならば、色差信号に対して選択した重み付き動き補償予測処理を輝度信号と同様の方法で符号化又は復号できる。 In the first to second embodiments, for the sake of simplification, the luminance signal and the weighted motion compensation prediction process for the color difference signal are not distinguished from each other, and the comprehensive explanation is described regarding the color signal component. However, when the weighted motion compensation prediction process is different between the luminance signal and the color difference signal, the same or different weighted motion compensation prediction process may be used. If a weighted motion compensation prediction process that is different between the luminance signal and the color difference signal is used, the weighted motion compensation prediction process selected for the color difference signal can be encoded or decoded in the same manner as the luminance signal.

上記第１〜第２実施形態では、シンタクス構成に示す表の行間には、本実施形態で規定していないシンタクス要素が挿入されることも可能であるし、それ以外の条件分岐に関する記述が含まれていても構わない。或いは、シンタクステーブルを複数のテーブルに分割、統合することも可能である。また、必ずしも同一の用語を用いる必要は無く、利用する形態によって任意に変更しても構わない。 In the first and second embodiments, syntax elements not defined in this embodiment can be inserted between the rows of the table shown in the syntax configuration, and other conditional branch descriptions are included. It does not matter. Alternatively, the syntax table can be divided and integrated into a plurality of tables. Moreover, it is not always necessary to use the same term, and it may be arbitrarily changed depending on the form to be used.

以上説明したように、各実施形態は、重み付き動き補償予測を行う際にシンタクス構成の冗長な情報を符号化する問題を解消しつつ、高効率な重み付き動き補償予測処理を実現する。故に、各実施形態によれば、符号化効率が向上し、ひいては主観画質も向上する。 As described above, each embodiment realizes highly efficient weighted motion compensation prediction processing while solving the problem of encoding redundant information of syntax structure when performing weighted motion compensation prediction. Therefore, according to each embodiment, the encoding efficiency is improved, and the subjective image quality is also improved.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 Although several embodiments of the present invention have been described, these embodiments are presented by way of example and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

例えば、上記各実施形態の処理を実現するプログラムを、コンピュータで読み取り可能な記憶媒体に格納して提供することも可能である。記憶媒体としては、磁気ディスク、光ディスク（ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＤＶＤ等）、光磁気ディスク（ＭＯ等）、半導体メモリなど、プログラムを記憶でき、かつ、コンピュータが読み取り可能な記憶媒体であれば、その記憶形式は何れの形態であってもよい。 For example, it is possible to provide a program that realizes the processing of each of the above embodiments by storing it in a computer-readable storage medium. The storage medium may be a computer-readable storage medium such as a magnetic disk, optical disk (CD-ROM, CD-R, DVD, etc.), magneto-optical disk (MO, etc.), semiconductor memory, etc. For example, the storage format may be any form.

また、上記各実施形態の処理を実現するプログラムを、インターネットなどのネットワークに接続されたコンピュータ（サーバ）上に格納し、ネットワーク経由でコンピュータ（クライアント）にダウンロードさせてもよい。 Further, the program for realizing the processing of each of the above embodiments may be stored on a computer (server) connected to a network such as the Internet and downloaded to the computer (client) via the network.

１００符号化装置
１０１減算部
１０２直交変換部
１０３量子化部
１０４逆量子化部
１０５逆直交変換部
１０６加算部
１０７予測画像生成部
１０８インデックス設定部
１０９動き評価部
１１０符号化部
１１０Ａエントロピー符号化部
１１０Ｂインデックス再構成部
１１１符号化制御部
２０１複数フレーム動き補償部
２０２メモリ
２０３単方向動き補償部
２０４予測パラメータ制御部
２０５参照画像セレクタ
２０６フレームメモリ
２０７参照画像制御部
３０１デフォルト動き補償部
３０２重み付き動き補償部
３０３ＷＰパラメータ制御部
３０４、３０５ＷＰセレクタ
８００復号装置
８０１復号部
８０１Ａエントロピー復号部
８０１Ｂインデックス再構成部
８０２逆量子化部
８０３逆直交変換部
８０４加算部
８０５予測画像生成部
８０６インデックス設定部
８０７復号制御部 DESCRIPTION OF SYMBOLS 100 Coding apparatus 101 Subtraction part 102 Orthogonal transformation part 103 Quantization part 104 Inverse quantization part 105 Inverse orthogonal transformation part 106 Adder part 107 Predictive image generation part 108 Index setting part 109 Motion evaluation part 110 Coding part 110A Entropy coding part 110B Index reconstruction unit 111 Coding control unit 201 Multiple frame motion compensation unit 202 Memory 203 Unidirectional motion compensation unit 204 Prediction parameter control unit 205 Reference image selector 206 Frame memory 207 Reference image control unit 301 Default motion compensation unit 302 Weighted motion Compensation unit 303 WP parameter control unit 304, 305 WP selector 800 decoding device 801 decoding unit 801A entropy decoding unit 801B index reconstruction unit 802 inverse quantization unit 803 inverse orthogonal transform unit 804 Calculation unit 805 predicted image generator 806 index setting unit 807 the decoding control unit

Claims

An electronic device that generates encoded data,
Encoding the first fixed point precision of the luminance weighting factor;
Encoding a first difference value that is the same value as the difference between the first fixed-point precision and the second fixed-point precision of the color difference weighting factor;
The same value as the difference between the first reference value, which is the same value as the value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision, and the luminance weighting factor; A second difference value of the luminance weighting coefficient is encoded;
The color difference is the same value as the difference between the second reference value, which is a value obtained by left-shifting “1” by the number of bits determined by the second fixed point precision, and the color difference weighting factor. Encoding the third difference value of the weighting factor;
Obtained by subtracting the value obtained by multiplying the median by the color difference weighting factor and shifting the value to the right by the number of bits determined by the second fixed-point precision from the median of the maximum pixel values. An electronic apparatus comprising a processing unit that encodes a fourth difference value of the color difference offset that is the same value as a difference between a third reference value that is the same value as the obtained value and the color difference offset.

The value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision is a luminance weight coefficient used when there is no change in luminance between the reference image and the target image. The electronic device according to claim 1, wherein the electronic device has the same value as the value.

A buffer for temporarily storing at least part of the encoded data;
A transmitter for transmitting the encoded data via a communication line;
Further comprising
The electronic device according to claim 1, wherein the processing unit is a CPU (Central Processing Unit).

A buffer for temporarily storing at least part of the encoded data;
A transmitter for transmitting the encoded data via a communication line;
Further comprising
The electronic device according to claim 1, wherein the processing unit is a DSP (Digital Signal Processor) or an FPGA (Field Programmable Gate Array).

Encoding the first fixed point precision of the luminance weighting factor;
Encoding a first difference value that is the same value as the difference between the first fixed-point precision and the second fixed-point precision of the color difference weighting factor;
The same value as the difference between the first reference value, which is the same value as the value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision, and the luminance weighting factor; Encoding a second difference value of the luminance weighting factor;
The color difference is the same value as the difference between the second reference value, which is a value obtained by left-shifting “1” by the number of bits determined by the second fixed point precision, and the color difference weighting factor. Encoding a third difference value of the weighting factor;
Obtained by subtracting the value obtained by multiplying the median by the color difference weighting factor and shifting the value to the right by the number of bits determined by the second fixed-point precision from the median of the maximum pixel values. Encoding a fourth difference value of the color difference offset that is the same value as a difference between a third reference value that is the same value as the obtained value and the color difference offset;
An encoding method including:

The value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision is a luminance weight coefficient used when there is no change in luminance between the reference image and the target image. The encoding method according to claim 5, wherein the encoding value is the same value.

A buffer temporarily storing at least part of the generated encoded data;
A transmission unit transmitting the encoded data via a communication line;
Further including
The CPU (Central Processing Unit) executes a step of encoding the first difference value, the second difference value, the third difference value, and the fourth difference value. The encoding method described.

A buffer temporarily storing at least part of the generated encoded data;
A transmission unit transmitting the encoded data via a communication line;
Further including
A digital signal processor (DSP) or a field programmable gate array (FPGA) executes a step of encoding the first difference value, the second difference value, the third difference value, and the fourth difference value. The encoding method according to claim 5 or 6.

Encoding the first fixed point precision of the luminance weighting factor;
Encoding a first difference value that is the same value as the difference between the first fixed-point precision and the second fixed-point precision of the color difference weighting factor;
The same value as the difference between the first reference value, which is the same value as the value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision, and the luminance weighting factor; Encoding a second difference value of the luminance weighting factor;
The color difference is the same value as the difference between the second reference value, which is a value obtained by left-shifting “1” by the number of bits determined by the second fixed point precision, and the color difference weighting factor. Encoding a third difference value of the weighting factor;
Obtained by subtracting the value obtained by multiplying the median by the color difference weighting factor and shifting the value to the right by the number of bits determined by the second fixed-point precision from the median of the maximum pixel values. Encoding a fourth difference value of the color difference offset that is the same value as a difference between a third reference value that is the same value as the obtained value and the color difference offset;
A program that causes a computer to execute.

The value obtained by left-shifting “1” by the number of bits determined by the first fixed-point precision is a luminance weight coefficient used when there is no change in luminance between the reference image and the target image. The program according to claim 9, which is the same value as the value.

A buffer temporarily storing at least part of the generated encoded data;
A transmission unit transmitting the encoded data via a communication line;
Is further executed by the computer,
The CPU (Central Processing Unit) included in the computer is caused to execute a step of encoding the first difference value, the second difference value, the third difference value, and the fourth difference value. The program according to claim 10.

A buffer temporarily storing at least part of the generated encoded data;
A transmission unit transmitting the encoded data via a communication line;
Is further executed by the computer,
A step of encoding the first difference value, the second difference value, the third difference value, and the fourth difference value in a DSP (Digital Signal Processor) or FPGA (Field Programmable Gate Array) included in the computer. The program according to claim 9 or 10, wherein the program is executed.