JP6042478B2

JP6042478B2 - Image decoding device

Info

Publication number: JP6042478B2
Application number: JP2015075390A
Authority: JP
Inventors: 山口　潤; 潤山口; 昭行谷沢
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2015-04-01
Filing date: 2015-04-01
Publication date: 2016-12-14
Anticipated expiration: 2030-07-15
Also published as: JP2015146624A

Description

本発明の実施形態は、動画像の符号化及び復号化における直交変換及び逆直交変換に関する。 Embodiments described herein relate generally to orthogonal transformation and inverse orthogonal transformation in encoding and decoding of moving images.

近年、大幅に符号化効率を向上させた画像符号化方法がＩＴＵ−ＴとＩＳＯ／ＩＥＣとの共同で、ＩＴＵ−ＴＲＥＣ．Ｈ．２６４及びＩＳＯ／ＩＥＣ１４４９６−１０（以下、「Ｈ．２６４」という。）として勧告されている。Ｈ．２６４では、対象画素ブロックに適用される予測方法に関わらず、対象画素ブロックの予測誤差に対する直交変換及び逆直交変換として離散コサイン変換（ＤＣＴ）及び逆離散コサイン変換（ＩＤＣＴ）が夫々行われる。 In recent years, an image encoding method with greatly improved encoding efficiency has been jointly developed by ITU-T and ISO / IEC. H. H.264 and ISO / IEC 14496-10 (hereinafter referred to as “H.264”). H. In H.264, regardless of the prediction method applied to the target pixel block, discrete cosine transform (DCT) and inverse discrete cosine transform (IDCT) are performed as orthogonal transform and inverse orthogonal transform for the prediction error of the target pixel block, respectively.

Ｈ．２６４の拡張として、画面内予測（イントラ予測）において規定されている９種類の予測モードの夫々について個別の変換基底を用いて直交変換及び逆直交変換を行うことにより、符号化効率を向上させることが想定される。 H. As an extension of H.264, encoding efficiency is improved by performing orthogonal transform and inverse orthogonal transform using individual transform bases for each of nine types of prediction modes defined in intra prediction (intra prediction). Is assumed.

M. Karczewicz, “Improved intra coding”, ITU-T SG16/Q.6, VCEG Document, VCEG-AF15, April 2007.M. Karczewicz, “Improved intra coding”, ITU-T SG16 / Q.6, VCEG Document, VCEG-AF15, April 2007.

しかしながら、複数種類の予測モードの夫々について個別の変換基底を用いて直交変換及び逆直交変換を行うことは、実装上の困難を伴う。例えば、ハードウェア実装のためには、Ｈ．２６４において必要とされるＤＣＴ及びＩＤＣＴのための専用ハードウェアに加えて、上記複数種類の予測方向の夫々について個別の直交変換及び逆直交変換のための専用ハードウェアを設ける必要がある。これら専用ハードウェアの追加によって、回路規模が増大する。 However, performing orthogonal transform and inverse orthogonal transform using individual transform bases for each of a plurality of types of prediction modes involves difficulty in implementation. For example, H. In addition to the dedicated hardware for DCT and IDCT required in H.264, it is necessary to provide dedicated hardware for individual orthogonal transform and inverse orthogonal transform for each of the above-described multiple types of prediction directions. The addition of these dedicated hardware increases the circuit scale.

ソフトウェア実装に関して、ＤＣＴ行列に加えて複数種類の予測方向の夫々について個別の変換行列をメモリから適宜ロードしたり、適宜キャッシュメモリに保持したりすることが可能である。この場合には、所望の直交変換及び逆直交変換を汎用乗算器によって実現できるものの、メモリバンド幅の増加によるコスト増またはキャッシュメモリサイズの増加によるコスト増が問題となる。 Regarding software implementation, in addition to the DCT matrix, individual transformation matrices for each of a plurality of types of prediction directions can be appropriately loaded from a memory, or appropriately held in a cache memory. In this case, although desired orthogonal transformation and inverse orthogonal transformation can be realized by a general-purpose multiplier, there is a problem of cost increase due to increase in memory bandwidth or cost increase due to increase in cache memory size.

従って、実施形態は、符号化効率を向上可能な直交変換または逆直交変換を提供することを目的とする。 Therefore, an object of the embodiment is to provide orthogonal transform or inverse orthogonal transform that can improve coding efficiency.

実施形態によれば、画像復号化装置は、蓄積部と、復号化部と、セット部と、変換部と、加算部とを備える。蓄積部は、他の装置から通信回線を介して出力された符号化データを蓄積する。復号化部は、蓄積された符号化データから復号化対象の変換係数を復号化する。セット部は、予測画像生成方法に応じて予め定められた関係に基づいて、復号化対象の予測モードに対応する垂直変換行列と水平変換行列との組み合わせを設定する。変換部は、設定された垂直変換行列と水平変換行列とを用いて、変換係数に対して垂直変換及び水平変換を行って予測誤差を得る。加算部は、予測誤差に基づいて復号画像を生成する。組み合わせは、第１の変換行列同士の組み合わせと、第１の変換行列とは異なる第２の変換行列同士の組み合わせとのうちいずれか一方である。第２の変換行列同士の組み合わせは、Diagonal down right、Vertical rightおよびHorizontal down方向にそれぞれ対応する複数のイントラ予測モードに設定される。 According to the embodiment, the image decoding apparatus includes a storage unit, a decoding unit, a set unit, a conversion unit, and an addition unit. The accumulation unit accumulates encoded data output from another device via a communication line. The decoding unit decodes the transform coefficient to be decoded from the accumulated encoded data. The set unit sets a combination of a vertical transformation matrix and a horizontal transformation matrix corresponding to a prediction mode to be decoded based on a relationship predetermined according to a predicted image generation method. The conversion unit obtains a prediction error by performing vertical conversion and horizontal conversion on the conversion coefficient using the set vertical conversion matrix and horizontal conversion matrix. The adder generates a decoded image based on the prediction error. The combination is one of a combination of first conversion matrices and a combination of second conversion matrices different from the first conversion matrix. A combination of the second transformation matrices is set to a plurality of intra prediction modes respectively corresponding to the diagonal down right, vertical right, and horizontal down directions.

第１の実施形態に係る画像符号化装置を例示するブロック図。1 is a block diagram illustrating an image encoding device according to a first embodiment. 第１の実施形態に係る直交変換部を例示するブロック図。The block diagram which illustrates the orthogonal transformation part concerning a 1st embodiment. 第１の実施形態に係る逆直交変換部を例示するブロック図。FIG. 3 is a block diagram illustrating an inverse orthogonal transform unit according to the first embodiment. 第１の実施形態に係る、予測モードと垂直変換インデックス及び水平変換インデックスとの対応を例示するテーブル図。The table figure which illustrates a response | compatibility with prediction mode, a vertical conversion index, and a horizontal conversion index based on 1st Embodiment. 第１の実施形態に係る、垂直変換インデックスと１Ｄ変換行列との対応を例示するテーブル図。The table figure which illustrates a response | compatibility with the vertical conversion index and 1D conversion matrix based on 1st Embodiment. 第１の実施形態に係る、水平変換インデックスと１Ｄ変換行列との対応を例示するテーブル図。The table figure which illustrates a response | compatibility with the horizontal conversion index and 1D conversion matrix based on 1st Embodiment. 第１の実施形態に係る、変換インデックスと垂直変換インデックス及び水平変換インデックスとの対応を例示するテーブル図。The table figure which illustrates a response | compatibility with a conversion index, a vertical conversion index, and a horizontal conversion index based on 1st Embodiment. 図４Ａ及び図４Ｄを統合したテーブル図。The table figure which integrated FIG. 4A and 4D. 第１の実施形態に係る係数順制御部を例示するブロック図。The block diagram which illustrates the coefficient order control part concerning a 1st embodiment. 第１の実施形態に係る係数順制御部を例示するブロック図。The block diagram which illustrates the coefficient order control part concerning a 1st embodiment. 画素ブロックの予測符号化順の説明図。Explanatory drawing of the prediction encoding order of a pixel block. 画素ブロックサイズの一例の説明図。Explanatory drawing of an example of pixel block size. 画素ブロックサイズの別の例の説明図。Explanatory drawing of another example of pixel block size. 画素ブロックサイズの別の例の説明図。Explanatory drawing of another example of pixel block size. イントラ予測モードの説明図。Explanatory drawing of intra prediction mode. 予測対象画素と参照画素との配置関係の説明図。Explanatory drawing of the arrangement | positioning relationship between a prediction object pixel and a reference pixel. イントラ予測モード１の説明図。Explanatory drawing of the intra prediction mode 1. FIG. イントラ予測モード４の説明図。Explanatory drawing of the intra prediction mode 4. FIG. ジグザグスキャンの説明図。Explanatory drawing of a zigzag scan. ジグザグスキャンの説明図。Explanatory drawing of a zigzag scan. ジグザグスキャンを利用した２Ｄ−１Ｄ変換を示すテーブル図。The table figure which shows 2D-1D conversion using a zigzag scan. 予測モード毎の個別の２Ｄ−１Ｄ変換を例示するテーブル図。The table figure which illustrates individual 2D-1D conversion for every prediction mode. 図１の画像符号化装置が符号化対象ブロックに対して行う処理を例示するフローチャート。3 is a flowchart illustrating processing performed by the image encoding device in FIG. 1 on an encoding target block. 図１の画像符号化装置が符号化対象ブロックに対して行う処理を例示するフローチャート。3 is a flowchart illustrating processing performed by the image encoding device in FIG. 1 on an encoding target block. シンタクス構造の説明図。Explanatory drawing of a syntax structure. スライスヘッダーシンタクスの説明図。Explanatory drawing of a slice header syntax. コーディングツリーユニットシンタクスの説明図。Explanatory drawing of coding tree unit syntax. トランスフォームユニットシンタクスの説明図。Explanatory drawing of a transform unit syntax. ９種類の予測方向の夫々について個別の変換基底を用いて直交変換を行う直交変換部を例示するブロック図。The block diagram which illustrates the orthogonal transformation part which performs orthogonal transformation using each conversion base about each of nine types of prediction directions. 第２の実施形態に係る直交変換部を例示するブロック図。The block diagram which illustrates the orthogonal transformation part concerning a 2nd embodiment. 第２の実施形態に係る逆直交変換部を例示するブロック図。The block diagram which illustrates the inverse orthogonal transformation part which concerns on 2nd Embodiment. 第２の実施形態に係る、予測モードと垂直変換インデックス及び水平変換インデックスとの対応を例示するテーブル図。The table figure which illustrates a response | compatibility with prediction mode, a vertical conversion index, and a horizontal conversion index based on 2nd Embodiment. 第２の実施形態に係る、垂直変換インデックスと１Ｄ変換行列との対応を例示するテーブル図。The table figure which illustrates a response | compatibility with the vertical conversion index and 1D conversion matrix based on 2nd Embodiment. 第２の実施形態に係る、水平変換インデックスと１Ｄ変換行列との対応を例示するテーブル図。The table figure which illustrates a response | compatibility with a horizontal conversion index and 1D conversion matrix based on 2nd Embodiment. 第２の実施形態に係る、変換インデックスと垂直変換インデックス及び水平変換インデックスとの対応を例示するテーブル図。The table figure which illustrates a response | compatibility with a conversion index, a vertical conversion index, and a horizontal conversion index based on 2nd Embodiment. 図１８Ａ及び図１８Ｄを統合したテーブル図。The table figure which integrated FIG. 18A and FIG. 18D. 第３の実施形態に係る直交変換部を例示するブロック図。The block diagram which illustrates the orthogonal transformation part concerning a 3rd embodiment. 第３の実施形態に係る逆直交変換部を例示するブロック図。The block diagram which illustrates the inverse orthogonal transformation part concerning a 3rd embodiment. 第３の実施形態に係る、予測モードと垂直変換インデックス及び水平変換インデックスとの対応を例示するテーブル図。The table figure which illustrates a response | compatibility with prediction mode, a vertical conversion index, and a horizontal conversion index based on 3rd Embodiment. 第３の実施形態に係る、垂直変換インデックスと１Ｄ変換行列との対応を例示するテーブル図。The table figure which illustrates a response | compatibility with the vertical conversion index and 1D conversion matrix based on 3rd Embodiment. 第３の実施形態に係る、水平変換インデックスと１Ｄ変換行列との対応を例示するテーブル図。The table figure which illustrates a response | compatibility with the horizontal conversion index and 1D conversion matrix based on 3rd Embodiment. 第３の実施形態に係る、変換インデックスと垂直変換インデックス及び水平変換インデックスとの対応を例示するテーブル図。The table figure which illustrates a correspondence with a conversion index, a vertical conversion index, and a horizontal conversion index concerning a 3rd embodiment. 図２１Ａ及び図２１Ｄを統合したテーブル図。The table figure which integrated FIG. 21A and FIG. 21D. 第４の実施形態に係る画像復号化装置を例示するブロック図。The block diagram which illustrates the picture decoding device concerning a 4th embodiment. 第４の実施形態に係る係数順制御部を例示するブロック図。The block diagram which illustrates the coefficient order control part concerning a 4th embodiment. 第４の実施形態に係る係数順制御部を例示するブロック図。The block diagram which illustrates the coefficient order control part concerning a 4th embodiment.

以下、図面を参照して、各実施形態について説明する。尚、以降の説明において、「画像」という用語は、「画像信号」、「画像データ」などの用語として適宜読み替えることができる。
（第１の実施形態）
第１の実施形態は、画像符号化装置に関する。本実施形態に係る画像符号化装置に対応する画像復号化装置は、第４の実施形態において説明する。この画像符号化装置は、ＬＳＩ（Large-Scale Integration）チップやＤＳＰ（Digital Signal Processor）、ＦＰＧＡ（Field Programmable Gate Array）などのハードウェアにより実現可能である。また、この画像符号化装置は、コンピュータに画像符号化プログラムを実行させることによっても実現可能である。 Hereinafter, each embodiment will be described with reference to the drawings. In the following description, the term “image” can be appropriately replaced with terms such as “image signal” and “image data”.
(First embodiment)
The first embodiment relates to an image encoding device. An image decoding apparatus corresponding to the image encoding apparatus according to the present embodiment will be described in a fourth embodiment. This image encoding device can be realized by hardware such as an LSI (Large-Scale Integration) chip, a DSP (Digital Signal Processor), or an FPGA (Field Programmable Gate Array). The image encoding apparatus can also be realized by causing a computer to execute an image encoding program.

図１に示すように、本実施形態に係る画像符号化装置は、減算部１０１、直交変換部１０２、量子化部１０３、逆量子化部１０４、逆直交変換部１０５、加算部１０６、参照画像メモリ１０７、イントラ予測部１０８、インター予測部１０９、予測選択部１１０、予測選択スイッチ１１１、１Ｄ（１次元）変換行列セット部１１２、係数順制御部１１３、エントロピー符号化部１１４、出力バッファ１１５及び符号化制御部１１６を有する。 As shown in FIG. 1, the image coding apparatus according to the present embodiment includes a subtraction unit 101, an orthogonal transformation unit 102, a quantization unit 103, an inverse quantization unit 104, an inverse orthogonal transformation unit 105, an addition unit 106, and a reference image. A memory 107, an intra prediction unit 108, an inter prediction unit 109, a prediction selection unit 110, a prediction selection switch 111, a 1D (one-dimensional) transform matrix set unit 112, a coefficient order control unit 113, an entropy encoding unit 114, an output buffer 115, An encoding control unit 116 is included.

図１の画像符号化装置は、入力画像１１８を構成する各フレームまたは各フィールドを複数の画素ブロックに分割し、これら分割した画素ブロックに対して予測符号化を行って、符号化データ１３０を出力する。以降の説明では、簡単化のために、図６Ａに示されるように左上から右下に向かって画素ブロックの予測符号化が行われることを仮定する。図６Ａでは、符号化処理対象のフレームｆにおいて、符号化対象画素ブロックｃよりも左側及び上側に符号化済み画素ブロックｐが位置している。 The image coding apparatus in FIG. 1 divides each frame or each field constituting the input image 118 into a plurality of pixel blocks, performs predictive coding on the divided pixel blocks, and outputs coded data 130. To do. In the following description, for the sake of simplicity, it is assumed that pixel blocks are predictively encoded from the upper left to the lower right as shown in FIG. 6A. In FIG. 6A, the encoded pixel block p is located on the left side and the upper side of the encoding target pixel block c in the encoding processing target frame f.

ここで、画素ブロックは、例えば、コーディングツリーユニット、マクロブロック、サブブロック、１画素などを指す。尚、以降の説明では、画素ブロックをコーディングツリーユニットの意味で基本的に使用するが、説明を適宜読み替えることにより画素ブロックを別の意味で解釈することも可能である。コーディングツリーユニットは、典型的には、例えば図６Ｂに示す１６×１６画素ブロックであるが、図６Ｃに示す３２×３２画素ブロック、図６Ｄに示す６４×６４画素ブロックであってもよいし、図示しない８×８画素ブロック、４×４画素ブロックであってもよい。コーディングツリーユニットは必ずしも正方形である必要はない。以下、入力画像１１８の符号化対象ブロックもしくはコーディングツリーユニットを「予測対象ブロック」と称することもある。また、符号化単位には、コーディングツリーユニットのような画素ブロックに限らず、フレームまたはフィールド、或いはこれらの組み合わせを用いることができる。 Here, the pixel block indicates, for example, a coding tree unit, a macro block, a sub block, and one pixel. In the following description, the pixel block is basically used in the meaning of the coding tree unit, but the pixel block can be interpreted in another meaning by appropriately replacing the description. The coding tree unit is typically a 16 × 16 pixel block shown in FIG. 6B, for example, but may be a 32 × 32 pixel block shown in FIG. 6C, or a 64 × 64 pixel block shown in FIG. 6D, It may be an 8 × 8 pixel block (not shown) or a 4 × 4 pixel block. The coding tree unit need not necessarily be square. Hereinafter, the encoding target block or coding tree unit of the input image 118 may be referred to as a “prediction target block”. In addition, the encoding unit is not limited to a pixel block such as a coding tree unit, and a frame, a field, or a combination thereof can be used.

図１の画像符号化装置は、符号化制御部１１６から入力される符号化パラメータに基づいて、画素ブロックに対するイントラ予測（画面内予測、フレーム内予測などとも称される）またはインター予測（画面間予測、フレーム間予測などとも称される）を行って、予測画像１２７を生成する。この画像符号化装置は、画素ブロック（入力画像１１８）と予測画像１２７との間の予測誤差１１９を直交変換及び量子化し、エントロピー符号化を行って符号化データ１３０を生成して出力する。 The image encoding apparatus in FIG. 1 performs intra prediction (also referred to as intra-frame prediction, intra-frame prediction, etc.) or inter prediction (inter-screen prediction) on a pixel block based on the encoding parameter input from the encoding control unit 116. Prediction image 127 is generated by performing prediction (also referred to as prediction, inter-frame prediction). This image encoding apparatus orthogonally transforms and quantizes the prediction error 119 between the pixel block (input image 118) and the predicted image 127, performs entropy encoding, and generates and outputs encoded data 130.

図１の画像符号化装置は、ブロックサイズ及び予測画像１２７の生成方法の異なる複数の予測モードを選択的に適用して符号化を行う。予測画像１２７の生成方法は、大別すると、符号化対象フレーム内で予測を行うイントラ予測と、時間的に異なる１つまたは複数の参照フレームを用いて予測を行うインター予測との２種類である。本実施形態では、イントラ予測を用いて予測画像を生成する場合の直交変換及び逆直交変換について詳細に説明する。 The image encoding device in FIG. 1 performs encoding by selectively applying a plurality of prediction modes having different block sizes and generation methods of the predicted image 127. The generation method of the predicted image 127 can be broadly divided into two types: intra prediction in which prediction is performed within the encoding target frame and inter prediction in which prediction is performed using one or a plurality of reference frames that are temporally different. . In the present embodiment, an orthogonal transform and an inverse orthogonal transform when generating a predicted image using intra prediction will be described in detail.

以下、図１の画像符号化装置に含まれる各要素を説明する。
減算器１０１は、入力画像１１８の符号化対象ブロックから、対応する予測画像１２７を減算して予測誤差１１９を得る。減算器１０１は、予測誤差１１９を直交変換部１０２に入力する。 Hereinafter, each element included in the image encoding device in FIG. 1 will be described.
The subtracter 101 subtracts the corresponding predicted image 127 from the encoding target block of the input image 118 to obtain a prediction error 119. The subtractor 101 inputs the prediction error 119 to the orthogonal transform unit 102.

直交変換部１０２は、減算器１０１からの予測誤差１１９に対して直交変換を行い、変換係数１２０を得る。尚、直交変換部１０２の詳細は後述される。直交変換部１０２は、変換係数１２０を量子化部１０３に入力する。 The orthogonal transform unit 102 performs orthogonal transform on the prediction error 119 from the subtractor 101 to obtain a transform coefficient 120. Details of the orthogonal transform unit 102 will be described later. The orthogonal transform unit 102 inputs the transform coefficient 120 to the quantization unit 103.

量子化部１０３は、直交変換部１０２からの変換係数に対して量子化を行い、量子化変換係数１２１を得る。具体的には、量子化部１０３は、符号化制御部１１６によって指定される量子化パラメータ、量子化マトリクスなどの量子化情報に従って量子化を行う。量子化パラメータは、量子化の細かさを示す。量子化マトリクスは、量子化の細かさを変換係数の成分毎に重み付けするために使用される。量子化部１０３は、量子化変換係数１２１を係数順制御部１１３及び逆量子化部１０４に入力する。 The quantization unit 103 performs quantization on the transform coefficient from the orthogonal transform unit 102 to obtain a quantized transform coefficient 121. Specifically, the quantization unit 103 performs quantization according to quantization information such as a quantization parameter and a quantization matrix specified by the encoding control unit 116. The quantization parameter indicates the fineness of quantization. The quantization matrix is used for weighting the fineness of quantization for each component of the transform coefficient. The quantization unit 103 inputs the quantized transform coefficient 121 to the coefficient order control unit 113 and the inverse quantization unit 104.

係数順制御部１１３は、２次元（２Ｄ）表現である量子化変換係数１２１を、１次元（１Ｄ）表現である量子化変換係数列１１７に変換し、エントロピー符号化部１１４に入力する。尚、係数制御部１１３の詳細は後述される。 The coefficient order control unit 113 converts the quantized transform coefficient 121 that is a two-dimensional (2D) representation into a quantized transform coefficient sequence 117 that is a one-dimensional (1D) representation, and inputs the quantized transform coefficient sequence 117 to the entropy encoding unit 114. Details of the coefficient control unit 113 will be described later.

エントロピー符号化部１１４は、係数制御部１１３からの量子化変換係数列１１７、予測選択部１１０からの予測情報１２６、符号化制御部１１６によって指定される量子化情報などの様々な符号化パラメータに対してエントロピー符号化（例えば、ハフマン符号化、算術符号化など）を行い、符号化データを生成する。尚、符号化パラメータとは、予測情報１２６、変換係数に関する情報、量子化に関する情報、などの復号に必要となるパラメータである。符号化パラメータは、符号化制御部１１６の内部メモリ（図示しない）に保持され、予測対象ブロックを符号化する際に隣接する既に符号化済みの画素ブロックの符号化パラメータを用いることが可能である。例えば、Ｈ．２６４のイントラ予測では符号化済みの隣接ブロックの予測モード情報から、予測対象ブロックの予測モードの予測値を導出することが可能である。 The entropy encoding unit 114 applies various encoding parameters such as the quantized transform coefficient sequence 117 from the coefficient control unit 113, the prediction information 126 from the prediction selection unit 110, and the quantization information specified by the encoding control unit 116. Entropy coding (for example, Huffman coding, arithmetic coding, etc.) is performed on the data to generate coded data. The encoding parameter is a parameter necessary for decoding, such as prediction information 126, information on transform coefficients, information on quantization, and the like. The encoding parameter is held in an internal memory (not shown) of the encoding control unit 116, and it is possible to use the encoding parameter of an already encoded pixel block adjacent when encoding the prediction target block. . For example, H.M. In the H.264 intra prediction, the prediction value of the prediction mode of the prediction target block can be derived from the prediction mode information of the encoded adjacent block.

エントロピー符号化部１１４によって生成された符号化データは、例えば多重化を経て出力バッファ１１５に一時的に蓄積され、符号化制御部１１６が管理する適切な出力タイミングに従って符号化データ１３０として出力される。符号化データ１３０は、例えば、図示しない蓄積系（蓄積メディア）または伝送系（通信回線）へ出力される。 The encoded data generated by the entropy encoding unit 114 is temporarily accumulated in the output buffer 115 through multiplexing, for example, and output as encoded data 130 according to an appropriate output timing managed by the encoding control unit 116. . The encoded data 130 is output to a storage system (storage medium) or a transmission system (communication line) (not shown), for example.

逆量子化部１０４は、量子化部１０３からの量子化変換係数１２１に対して逆量子化を行い、復元変換係数１２２を得る。具体的には、逆量子化部１０４は、量子化部１０３において使用された量子化情報に従って逆量子化を行う。量子化部１０３において使用された量子化情報は、符号化制御部１１６の内部メモリからロードされる。逆量子化部１０４は、復元変換係数１２２を逆直交変換部１０５に入力する。 The inverse quantization unit 104 performs inverse quantization on the quantized transform coefficient 121 from the quantization unit 103 to obtain a restored transform coefficient 122. Specifically, the inverse quantization unit 104 performs inverse quantization according to the quantization information used in the quantization unit 103. The quantization information used in the quantization unit 103 is loaded from the internal memory of the encoding control unit 116. The inverse quantization unit 104 inputs the restored transform coefficient 122 to the inverse orthogonal transform unit 105.

逆直交変換部１０５は、逆量子化部１０４からの復元変換係数１２２に対して、直交変換部１０２において行われた直交変換に対応する逆直交変換を行い、復元予測誤差１２３を得る。尚、逆直交変換部１０５の詳細は後述される。逆直交変換部１０５は、復元予測誤差１２３を加算部１０６に入力する。 The inverse orthogonal transform unit 105 performs inverse orthogonal transform corresponding to the orthogonal transform performed in the orthogonal transform unit 102 on the reconstructed transform coefficient 122 from the inverse quantization unit 104 to obtain a reconstructed prediction error 123. Details of the inverse orthogonal transform unit 105 will be described later. The inverse orthogonal transform unit 105 inputs the restoration prediction error 123 to the addition unit 106.

加算部１０６は、復元予測誤差１２３と、対応する予測画像１２７とを加算し、局所復号画像１２４を生成する。局所復号画像１２４は、参照画像メモリ１０７に保存される。参照画像メモリ１０７に保存された局所復号画像１２４は、参照画像１２５としてイントラ予測部１０８及びインター予測部１０９によって必要に応じて参照される。 The adding unit 106 adds the restored prediction error 123 and the corresponding predicted image 127 to generate a local decoded image 124. The locally decoded image 124 is stored in the reference image memory 107. The locally decoded image 124 stored in the reference image memory 107 is referred to as the reference image 125 by the intra prediction unit 108 and the inter prediction unit 109 as necessary.

イントラ予測部１０８は、参照画像メモリ１０７に保存されている参照画像１２５を利用してイントラ予測を行う。例えば、Ｈ．２６４では、予測対象ブロックに隣接する符号化済みの参照画素値を利用して、垂直方向、水平方向などの予測方向に沿って画素補填（コピーまたは補間）を行うことによってイントラ予測画像を生成する。図７ＡにＨ．２６４におけるイントラ予測の予測方向を示す。また、図７ＢにＨ．２６４における参照画素と符号化対象画素との配置関係を示す。図７Ｃはモード１（水平予測）の予測画像生成方法を示しており、図７Ｄはモード４（対角右下予測；図４ＡのIntra_NxN_Diagonal_Down_Right）の予測画像生成方法を示している。 The intra prediction unit 108 performs intra prediction using the reference image 125 stored in the reference image memory 107. For example, H.M. In H.264, an intra prediction image is generated by performing pixel interpolation (copying or interpolation) along a prediction direction such as a vertical direction or a horizontal direction using an encoded reference pixel value adjacent to a prediction target block. . In FIG. The prediction direction of intra prediction in H.264 is shown. In FIG. The arrangement | positioning relationship between the reference pixel and encoding object pixel in H.264 is shown. FIG. 7C shows a predicted image generation method in mode 1 (horizontal prediction), and FIG. 7D shows a predicted image generation method in mode 4 (diagonal lower right prediction; Intra_NxN_Diagonal_Down_Right in FIG. 4A).

尚、イントラ予測部１０８は、予め定められた補間方法を用いて画素値を補間してから、予め定められた予測方向に補間画素値をコピーしてもよい。Ｈ．２６４のイントラ予測の予測方向を例示したが、予測方向を更に細かく規定することにより１７種類、３３種類などの任意の数の予測モードを使用するに拡張することも可能である。具体的には、Ｈ．２６４では２２．５度毎の予測角度が規定されているが、例えば１１．２５度毎の予測角度を規定すれば、ＤＣ予測を含めて１７種類の予測モードを使用できる。また、５．６２５度毎の予測角度を規定すれば、ＤＣ予測を含めて３３種類の予測モードを使用できる。また、予測角度を等間隔に配置するのではなく、第１の基準点から水平および垂直に移動させた第２の基準点を結ぶ直線によって予測方向の角度を表してもよい。以上のように予測モードの拡張は容易に可能であり、本実施形態は予測モードの数に関わらず適用可能である。 The intra prediction unit 108 may copy the interpolated pixel value in a predetermined prediction direction after interpolating the pixel value using a predetermined interpolation method. H. Although the prediction direction of the H.264 intra prediction is illustrated, the prediction direction can be extended to use any number of prediction modes such as 17 types and 33 types by further specifying the prediction direction. Specifically, H.C. H.264 defines a prediction angle for every 22.5 degrees. For example, if a prediction angle for every 11.25 degrees is defined, 17 types of prediction modes including DC prediction can be used. Further, if a prediction angle for every 5.625 degrees is defined, 33 types of prediction modes including DC prediction can be used. Further, instead of arranging the prediction angles at equal intervals, the angle in the prediction direction may be represented by a straight line connecting the second reference points moved horizontally and vertically from the first reference point. As described above, the prediction mode can be easily expanded, and this embodiment can be applied regardless of the number of prediction modes.

インター予測部１０９は、参照画像メモリ１０７に保存されている参照画像１２５を利用してインター予測を行う。具体的には、インター予測部１０９は、予測対象ブロックと参照画像１２５との間でブロックマッチング処理を行って動きのズレ量（動きベクトル）を導出する。インター予測部１０９は、この動きベクトルに基づいて補間処理（動き補償）を行ってインター予測画像を生成する。Ｈ．２６４では、１／４画素精度までの補間処理が可能である。導出された動きベクトルは予測情報１２６の一部としてエントロピー符号化される。 The inter prediction unit 109 performs inter prediction using the reference image 125 stored in the reference image memory 107. Specifically, the inter prediction unit 109 performs block matching processing between the prediction target block and the reference image 125 to derive a motion shift amount (motion vector). The inter prediction unit 109 performs an interpolation process (motion compensation) based on the motion vector to generate an inter prediction image. H. With H.264, interpolation processing up to 1/4 pixel accuracy is possible. The derived motion vector is entropy encoded as part of the prediction information 126.

選択スイッチ１１１は、イントラ予測部１０８の出力端またはインター予測部１０９の出力端を予測選択部１１０からの予測情報１２６に従って選択し、イントラ予測画像またはインター予測画像を予測画像１２７として減算部１０１及び加算部１０６に入力する。予測情報１２６がイントラ予測を示唆する場合には、選択スイッチ１１０はイントラ予測部１０８からのイントラ予測画像を予測画像１２７として取り込む。一方、予測情報１２６がインター予測を示唆する場合には、選択スイッチ１１０はインター予測部１０９からのインター予測画像を予測画像１２７として取り込む。 The selection switch 111 selects the output terminal of the intra prediction unit 108 or the output terminal of the inter prediction unit 109 according to the prediction information 126 from the prediction selection unit 110, and uses the subtraction unit 101 and the intra prediction image or the inter prediction image as the prediction image 127. The data is input to the adding unit 106. When the prediction information 126 suggests intra prediction, the selection switch 110 captures the intra prediction image from the intra prediction unit 108 as the prediction image 127. On the other hand, when the prediction information 126 suggests inter prediction, the selection switch 110 captures the inter prediction image from the inter prediction unit 109 as the prediction image 127.

予測選択部１１０は、符号化制御部１１６が制御する予測モードに従って、予測情報１２６を設定する機能を有する。前述のように、予測画像１２７の生成のためにイントラ予測またはインター予測が選択可能であるが、イントラ予測及びインター予測の夫々に複数のモードが更に選択可能である。符号化制御部１１６はイントラ予測及びインター予測の複数の予測モードのうち１つを最適な予測モードとして判定し、予測選択部１１０は判定された最適な予測モードに応じて予測情報１２６を設定する。 The prediction selection unit 110 has a function of setting the prediction information 126 according to the prediction mode controlled by the encoding control unit 116. As described above, intra prediction or inter prediction can be selected for generating the predicted image 127, but a plurality of modes can be further selected for each of intra prediction and inter prediction. The encoding control unit 116 determines one of a plurality of intra prediction modes and inter prediction modes as the optimal prediction mode, and the prediction selection unit 110 sets the prediction information 126 according to the determined optimal prediction mode. .

例えば、イントラ予測に関して、符号化制御部１１６から予測モード情報がイントラ予測部１０８に指定され、イントラ予測部１０８はこの予測モード情報に従って予測画像１２７を生成する。符号化制御部１１６は、予測モードの番号が小さい方から順に複数の予測モード情報を指定してもよいし、大きい方から順に複数の予測モード情報を指定してもよい。また、符号化制御部１１６は、入力画像の特性に従って予測モードを限定してもよい。符号化制御部１１６は、必ずしも全ての予測モードを指定する必要はなく符号化対象ブロックに対して少なくとも１つの予測モード情報を指定すればよい。 For example, regarding intra prediction, prediction mode information is designated by the encoding control unit 116 as the intra prediction unit 108, and the intra prediction unit 108 generates a prediction image 127 according to the prediction mode information. The encoding control unit 116 may specify a plurality of prediction mode information in order from the smallest prediction mode number, or may specify a plurality of prediction mode information in order from the largest. The encoding control unit 116 may limit the prediction mode according to the characteristics of the input image. The encoding control unit 116 need not always specify all prediction modes, but may specify at least one prediction mode information for the encoding target block.

例えば、符号化制御部１１６は、次の数式（１）に示すコスト関数を用いて最適な予測モードを判定する。 For example, the encoding control unit 116 determines an optimal prediction mode using a cost function expressed by the following formula (1).

数式（１）において、ＯＨは予測情報１２６（例えば、動きベクトル情報、予測ブロックサイズ情報）に関する符号量を示し、ＳＡＤは予測対象ブロックと予測画像１２７との間の差分絶対値和（即ち、予測誤差１１９の絶対値の累積和）を示す。また、λは量子化情報（量子化パラメータ）の値に基づいて決定されるラグランジュ未定乗数を示し、Ｋは符号化コストを示す。数式（１）を用いる場合には、符号化コストＫを最小化する予測モードが発生符号量及び予測誤差の観点から最適な予測モードとして判定される。数式（１）の変形として、ＯＨのみまたはＳＡＤのみから符号化コストを見積もってもよいし、ＳＡＤにアダマール変換を施した値またはその近似値を利用して符号化コストを見積もってもよい。 In Equation (1), OH indicates a code amount related to the prediction information 126 (for example, motion vector information, prediction block size information), and SAD is a sum of absolute differences between the prediction target block and the prediction image 127 (that is, prediction). Accumulated sum of absolute values of error 119). Further, λ represents a Lagrange undetermined multiplier determined based on the value of quantization information (quantization parameter), and K represents an encoding cost. When Expression (1) is used, the prediction mode that minimizes the coding cost K is determined as the optimum prediction mode from the viewpoint of the generated code amount and the prediction error. As a modification of Equation (1), the encoding cost may be estimated from OH alone or SAD alone, or the encoding cost may be estimated using a value obtained by subjecting SAD to Hadamard transform or an approximation thereof.

また、図示しない仮符号化ユニットを用いることにより最適な予測モードを判定することも可能である。例えば、符号化制御部１１６は、次の数式（２）に示すコスト関数を用いて最適な予測モードを判定する。 It is also possible to determine an optimal prediction mode by using a temporary encoding unit (not shown). For example, the encoding control unit 116 determines the optimal prediction mode using the cost function shown in the following mathematical formula (2).

数式（２）において、Ｄは予測対象ブロックと局所復号画像との間の二乗誤差和（即ち、符号化歪）を示し、Ｒは予測対象ブロックと予測モードの予測画像１２７との間の予測誤差について仮符号化によって見積もられた符号量を示し、Ｊは符号化コストを示す。数式（２）の符号化コストＪを導出する場合には予測モード毎に仮符号化処理及び局部復号化処理が必要なので、回路規模または演算量が増大する。反面、より正確な符号化歪と符号量とに基づいて符号化コストＪが導出されるので、最適な予測モードを高精度に判定して高い符号化効率を維持しやすい。尚、数式（２）の変形として、ＲのみまたはＤのみから符号化コストを見積もってもよいし、ＲまたはＤの近似値を利用して符号化コストを見積もってもよい。また、符号化制御部１１６は、予測対象ブロックに関して事前に得られる情報（周囲の画素ブロックの予測モード、画像解析の結果など）に基づいて、数式（１）または数式（２）を用いた判定を行う予測モードの候補の数を、予め絞り込んでおいてもよい。 In Equation (2), D represents a square error sum (ie, encoding distortion) between the prediction target block and the local decoded image, and R represents a prediction error between the prediction target block and the prediction image 127 in the prediction mode. Indicates the amount of code estimated by provisional encoding, and J indicates the encoding cost. In order to derive the encoding cost J of Equation (2), provisional encoding processing and local decoding processing are required for each prediction mode, so that the circuit scale or the amount of calculation increases. On the other hand, since the encoding cost J is derived based on more accurate encoding distortion and code amount, it is easy to determine the optimal prediction mode with high accuracy and maintain high encoding efficiency. As a modification of Equation (2), the encoding cost may be estimated from only R or D, or the encoding cost may be estimated using an approximate value of R or D. In addition, the encoding control unit 116 makes a determination using Formula (1) or Formula (2) based on information obtained in advance regarding the prediction target block (prediction mode of surrounding pixel blocks, results of image analysis, and the like). The number of prediction mode candidates for performing may be narrowed down in advance.

符号化制御部１１６は、図１の画像符号化装置の各要素を制御する。具体的には、符号化制御部１１６は、上述の動作を含む符号化処理のための種々の制御を行う。
１Ｄ変換行列セット部１１２は、予測選択部１１０からの予測情報１２６に含まれる予測モード情報に基づいて１Ｄ変換行列セット情報１２９を生成し、直交変換部１０２及び逆直交変換部１０５に入力する。尚、１Ｄ変換行列セット情報１２９の詳細は後述される。 The encoding control unit 116 controls each element of the image encoding device in FIG. Specifically, the encoding control unit 116 performs various controls for the encoding process including the above-described operation.
The 1D transform matrix set unit 112 generates 1D transform matrix set information 129 based on the prediction mode information included in the prediction information 126 from the prediction selection unit 110 and inputs the 1D transform matrix set information 129 to the orthogonal transform unit 102 and the inverse orthogonal transform unit 105. Details of the 1D conversion matrix set information 129 will be described later.

以下、図２を用いて本実施形態に係る直交変換部１０２の詳細を説明する。
直交変換部１０２は、選択スイッチ２０１、垂直変換部２０２、転置部２０３、選択スイッチ２０４及び水平変換部２０５を有する。垂直変換部２０２は、１Ｄ直交変換部Ａ２０６及び１Ｄ直交変換部Ｂ２０７を含む。水平変換部２０５は、１Ｄ直交変換部Ａ２０８及び１Ｄ直交変換部Ｂ２０９を含む。尚、垂直変換部２０２及び水平変換部２０５の順序は、一例であり、これらは逆順であっても構わない。 Hereinafter, details of the orthogonal transform unit 102 according to the present embodiment will be described with reference to FIG.
The orthogonal transform unit 102 includes a selection switch 201, a vertical conversion unit 202, a transposition unit 203, a selection switch 204, and a horizontal conversion unit 205. The vertical transform unit 202 includes a 1D orthogonal transform unit A206 and a 1D orthogonal transform unit B207. The horizontal transform unit 205 includes a 1D orthogonal transform unit A208 and a 1D orthogonal transform unit B209. Note that the order of the vertical conversion unit 202 and the horizontal conversion unit 205 is an example, and these may be reversed.

１Ｄ直交変換部Ａ２０６及び１Ｄ直交変換部Ａ２０８は、入力される行列に対して１Ｄ変換行列Ａを乗算する点で共通の機能を持ち、１Ｄ直交変換部Ｂ２０７及び１Ｄ直交変換部Ｂ２０９は、入力される行列に対して１Ｄ変換行列Ｂを乗算する点で共通の機能を持つ。従って、１Ｄ直交変換部Ａ２０６及び１Ｄ直交変換部Ａ２０８は、物理的に同一のハードウェアを時分割で使用することによっても実現可能である。また、１Ｄ直交変換部Ｂ２０７及び１Ｄ直交変換部Ｂ２０９も同様である。 The 1D orthogonal transform unit A206 and the 1D orthogonal transform unit A208 have a common function in that the input matrix is multiplied by the 1D transform matrix A, and the 1D orthogonal transform unit B207 and the 1D orthogonal transform unit B209 are input. It has a common function in that the 1D conversion matrix B is multiplied with the matrix. Therefore, the 1D orthogonal transform unit A206 and the 1D orthogonal transform unit A208 can also be realized by using physically identical hardware in a time division manner. The same applies to the 1D orthogonal transform unit B207 and the 1D orthogonal transform unit B209.

選択スイッチ２０１は、１Ｄ変換行列セット情報１２９に含まれる垂直変換インデックスに従って、予測誤差１１９を１Ｄ直交変換部Ａ２０６及び１Ｄ直交変換部Ｂ２０７のうちのいずれか一方に導く。１Ｄ直交変換部Ａ２０６は、入力された予測誤差（行列）１１９に対して１Ｄ変換行列Ａを乗算して出力する。１Ｄ直交変換部Ｂ２０７は、入力された予測誤差１１９に対して１Ｄ変換行列Ｂを乗算して出力する。具体的には、１Ｄ直交変換部Ａ２０６及び１Ｄ直交変換部Ｂ２０７（即ち、垂直変換部２０２）は、次の数式（３）に示す一次元の直交変換を行って、予測誤差１１９の垂直方向の相関を除去する。 The selection switch 201 guides the prediction error 119 to one of the 1D orthogonal transform unit A206 and the 1D orthogonal transform unit B207 according to the vertical transform index included in the 1D transform matrix set information 129. The 1D orthogonal transform unit A206 multiplies the input prediction error (matrix) 119 by the 1D transform matrix A and outputs the result. The 1D orthogonal transform unit B207 multiplies the input prediction error 119 by the 1D transform matrix B and outputs the result. Specifically, the 1D orthogonal transform unit A206 and the 1D orthogonal transform unit B207 (that is, the vertical transform unit 202) perform a one-dimensional orthogonal transform represented by the following equation (3), and the prediction error 119 in the vertical direction Remove correlation.

数式（３）において、Ｘは予測誤差１１９の行列（Ｎ×Ｎ）を示し、Ｖは１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂ（いずれもＮ×Ｎ）を包括的に示しており、Ｙは１Ｄ直交変換部Ａ２０６及び１Ｄ直交変換部Ｂ２０７の出力行列（Ｎ×Ｎ）を示す。具体的には、変換行列Ｖは、行列Ｘの垂直方向の相関を除去するために設計された変換基底を行ベクトルとし縦に並べたＮ×Ｎの変換行列である。但し、後述するように、１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂは、異なる方法で設計され、異なる性質を持つ。尚、１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂは、設計された各変換基底をスカラ倍して整数化したものを使用することも可能である。 In Equation (3), X represents a matrix (N × N) of the prediction error 119, V represents a 1D conversion matrix A and a 1D conversion matrix B (both N × N), and Y represents 1D The output matrix (NxN) of orthogonal transformation part A206 and 1D orthogonal transformation part B207 is shown. Specifically, the transformation matrix V is an N × N transformation matrix in which transformation bases designed to remove the correlation in the vertical direction of the matrix X are arranged in the vertical direction as row vectors. However, as described later, the 1D conversion matrix A and the 1D conversion matrix B are designed by different methods and have different properties. As the 1D conversion matrix A and the 1D conversion matrix B, it is also possible to use an integer obtained by multiplying each designed conversion base by a scalar.

ここで、予測誤差１１９がＭ×Ｎで表現される矩形ブロックである場合、直交変換を行うブロックサイズもまたＭ×Ｎであってもよい。 Here, when the prediction error 119 is a rectangular block expressed by M × N, the block size for performing orthogonal transform may also be M × N.

転置部２０３は、垂直変換部２０２の出力行列（Ｙ）の転置を行って、選択スイッチ２０４に与える。但し、転置部２０３は、一例であって、対応するハードウェアを必ずしも用意しなくてもよい。例えば、垂直変換部２０２による１Ｄ直交変換を実行した結果（垂直変換部２０２の出力行列の各要素）を保持しておき、水平変換部２０５による１Ｄ直交変換を実行するときに適切な順序で読み出せば、転置部２０３に対応するハードウェアを用意しなくても出力行列（Ｙ）の転置を実行できる。 The transposition unit 203 transposes the output matrix (Y) of the vertical conversion unit 202 and supplies it to the selection switch 204. However, the transposition unit 203 is an example, and corresponding hardware may not necessarily be prepared. For example, the result of executing the 1D orthogonal transform by the vertical transform unit 202 (each element of the output matrix of the vertical transform unit 202) is stored and read in an appropriate order when the 1D orthogonal transform is performed by the horizontal transform unit 205. If output, the transposition of the output matrix (Y) can be performed without preparing hardware corresponding to the transposition unit 203.

選択スイッチ２０４は、１Ｄ変換行列セット情報１２９に含まれる水平変換インデックスに従って、転置部２０３からの入力行列を１Ｄ直交変換部Ａ２０８及び１Ｄ直交変換部Ｂ２０９のうちのいずれか一方に導く。１Ｄ直交変換部Ａ２０８は、入力行列に対して１Ｄ変換行列Ａを乗算して出力する。１Ｄ直交変換部Ｂ２０９は、入力行列に対して１Ｄ変換行列Ｂを乗算して出力する。具体的には、１Ｄ直交変換部Ａ２０８及び１Ｄ直交変換部Ｂ２０９（即ち、水平変換部２０５）は、次の数式（４）に示す一次元の直交変換を行って、予測誤差の水平方向の相関を除去する。 The selection switch 204 guides the input matrix from the transposition unit 203 to one of the 1D orthogonal transform unit A 208 and the 1D orthogonal transform unit B 209 according to the horizontal transform index included in the 1D transform matrix set information 129. The 1D orthogonal transform unit A208 multiplies the input matrix by the 1D transform matrix A and outputs the result. The 1D orthogonal transform unit B209 multiplies the input matrix by the 1D transform matrix B and outputs the result. Specifically, the 1D orthogonal transform unit A 208 and the 1D orthogonal transform unit B 209 (that is, the horizontal transform unit 205) perform a one-dimensional orthogonal transform represented by the following formula (4) to correlate the prediction error in the horizontal direction. Remove.

数式（４）において、Ｈは１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂ（いずれもＮ×Ｎ）を包括的に示しており、Ｚは１Ｄ直交変換部Ａ２０８及び１Ｄ直交変換部Ｂ２０９の出力行列（Ｎ×Ｎ）を示しており、これは変換係数１２０を指す。具体的には、変換行列Ｈは、行列Ｙの水平方向の相関を除去するために設計された変換基底を行ベクトルとし縦に並べたＮ×Ｎの変換行列である。先の説明と重複するが、１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂは、異なる方法で設計され、異なる性質を持つ。また、１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂは、設計された各変換基底をスカラ倍して整数化したものを使用することも可能である。 In Equation (4), H comprehensively represents the 1D transformation matrix A and the 1D transformation matrix B (both N × N), and Z represents the output matrix (N of the 1D orthogonal transformation unit A208 and the 1D orthogonal transformation unit B209). XN), which refers to the conversion factor 120. Specifically, the transformation matrix H is an N × N transformation matrix in which transformation bases designed to remove the correlation in the horizontal direction of the matrix Y are vertically arranged as row vectors. Although overlapping with the previous description, the 1D transformation matrix A and the 1D transformation matrix B are designed in different ways and have different properties. Further, the 1D conversion matrix A and the 1D conversion matrix B may be obtained by converting each designed conversion base into an integer by scalar multiplication.

以上のように、直交変換部１０２は、予測誤差（行列）１１９に対して、１Ｄ変換行列セット部１１２から入力された１Ｄ変換行列セット情報１２９に従って直交変換を行い、変換係数（行列）１２０を生成する。尚、Ｈ．２６４を考慮すると、直交変換部１０２には、図示しないＤＣＴ部が含まれてもよいし、１Ｄ変換行列Ａと１Ｄ変換行列ＢのいずれかをＤＣＴのための行列に置き換えてもよい。例えば、１Ｄ変換行列ＢはＤＣＴのための変換行列であってもよい。更に、直交変換部１０２は、ＤＣＴに加えて、アダマール変換、後述するカルーネン・レーベ変換、離散サイン変換などの種々の直交変換を実現してもよい。 As described above, the orthogonal transform unit 102 performs orthogonal transform on the prediction error (matrix) 119 according to the 1D transform matrix set information 129 input from the 1D transform matrix set unit 112, and converts the transform coefficient (matrix) 120 into the transform coefficient (matrix) 120. Generate. H. In consideration of H.264, the orthogonal transform unit 102 may include a DCT unit (not shown), or one of the 1D transform matrix A and the 1D transform matrix B may be replaced with a matrix for DCT. For example, the 1D transformation matrix B may be a transformation matrix for DCT. Further, the orthogonal transform unit 102 may implement various orthogonal transforms such as Hadamard transform, Karhunen-Loeve transform, and discrete sine transform, which will be described later, in addition to DCT.

ここで、１Ｄ変換行列Ａと１Ｄ変換行列Ｂとの性質の差異について説明する。Ｈ．２６４などでサポートされるイントラ予測モードには、予測対象ブロックの左側及び上側の一方または両方の隣接ライン上の参照画素群を予測方向に沿ってコピーまたは補間後にコピーして予測画像を生成するものがある。すなわち、このイントラ予測モードでは、予測方向に従って参照画素群の中の少なくとも一つの参照画素が選択され、参照画素のコピーまたは参照画素からの補間により、予測画像が生成される。係るイントラ予測モードは、画像の空間的相関を利用するので、参照画素からの距離が大きくなるにつれて予測精度が低下する傾向にある。即ち、参照画素からの距離に応じて予測誤差の絶対値が増大し易い。尚、係る傾向は、予測方向によらず同様である。より具体的には、予測対象ブロックの左隣接ライン上の参照画素群のみが参照（参照画素の画素値のコピーまたは参照画素からの補間）されるイントラ予測モード（例えば、図７Ａのモード１及びモード８）に関して、予測誤差は水平方向に係る傾向を示す。予測対象ブロックに上隣接ライン上の参照画素群のみを参照するイントラ予測モード（例えば、図７Ａのモード０、モード３及びモード７）に関して、予測誤差は垂直方向に係る傾向を示す。更に、予測対象ブロックの左隣接ライン及び上隣接ライン上の参照画素群が参照される予測モード（例えば、図７Ａのモード４、モード５及びモード６）に関して、予測誤差は水平方向及び垂直方向に係る傾向を示す。概括すれば、予測画像の生成のために利用する参照画素群のラインと直交する方向に係る傾向を示すといえる。 Here, the difference in properties between the 1D conversion matrix A and the 1D conversion matrix B will be described. H. In the intra prediction mode supported by H.264, the prediction pixel is generated by copying the reference pixel group on one or both adjacent lines on the left side and the upper side of the prediction target block along the prediction direction or after interpolation. There is. That is, in this intra prediction mode, at least one reference pixel in the reference pixel group is selected according to the prediction direction, and a predicted image is generated by copying the reference pixel or interpolating from the reference pixel. Since the intra prediction mode uses the spatial correlation of images, the prediction accuracy tends to decrease as the distance from the reference pixel increases. That is, the absolute value of the prediction error is likely to increase according to the distance from the reference pixel. This tendency is the same regardless of the prediction direction. More specifically, an intra prediction mode in which only the reference pixel group on the left adjacent line of the prediction target block is referred (copying of the pixel value of the reference pixel or interpolation from the reference pixel) (for example, mode 1 in FIG. 7A and For mode 8), the prediction error shows a trend in the horizontal direction. In the intra prediction mode (for example, mode 0, mode 3, and mode 7 in FIG. 7A) in which only the reference pixel group on the upper adjacent line is referred to the prediction target block, the prediction error shows a tendency in the vertical direction. Furthermore, with respect to a prediction mode in which reference pixel groups on the left adjacent line and the upper adjacent line of the prediction target block are referred to (for example, mode 4, mode 5 and mode 6 in FIG. 7A), the prediction error is in the horizontal and vertical directions. This trend is shown. In summary, it can be said that the tendency is related to the direction orthogonal to the line of the reference pixel group used for generating the predicted image.

１Ｄ変換行列Ａは、１Ｄ変換行列Ｂに比べて、上記直交する方向（垂直方向または水平方向）について１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成される。一方、１Ｄ変換行列Ｂは、このような性質を持たない汎用的な変換行列を設計することによって生成される。例えば、汎用的な変換はＤＣＴである。１Ｄ変換行列Ａを用いて、上記直交する方向について１Ｄ直交変換を行えば、イントラ予測の予測誤差の変換効率が向上し、ひいては符号化効率が向上する。例えば、モード０（垂直方向予測）の予測誤差１１９は、垂直方向には上記傾向を示す一方、水平方向には上記傾向を示さない。故に、垂直変換部２０２において１Ｄ変換行列Ａを用いて１Ｄ直交変換を行い、水平変換部２０５において１Ｄ変換行列Ｂを用いて１Ｄ直交変換を行うことにより、効率的な直交変換を実現できる。 Compared to the 1D transformation matrix B, the 1D transformation matrix A has higher coefficient density when performing 1D orthogonal transformation in the orthogonal direction (vertical direction or horizontal direction) (that is, non-zero in the quantized transformation coefficient 121). It is generated by designing a common transformation base in advance so that the ratio of the coefficients becomes small. On the other hand, the 1D transformation matrix B is generated by designing a general-purpose transformation matrix having no such property. For example, a generic conversion is DCT. If 1D orthogonal transformation is performed in the orthogonal direction using the 1D transformation matrix A, the conversion efficiency of prediction errors in intra prediction is improved, and thus the coding efficiency is improved. For example, the prediction error 119 of mode 0 (vertical prediction) shows the above tendency in the vertical direction, but does not show the above tendency in the horizontal direction. Therefore, efficient orthogonal transformation can be realized by performing 1D orthogonal transformation using the 1D transformation matrix A in the vertical transformation unit 202 and performing 1D orthogonal transformation using the 1D transformation matrix B in the horizontal transformation unit 205.

以下、図３を用いて本実施形態に係る逆直交変換部１０５の詳細を説明する。
逆直交変換部１０５は、選択スイッチ３０１、垂直逆変換部３０２、転置部３０３、選択スイッチ３０４及び水平逆変換部３０５を有する。垂直逆変換部３０２は、１Ｄ逆直交変換部Ａ３０６及び１Ｄ逆直交変換部Ｂ３０７を含む。水平逆変換部３０５は、１Ｄ逆直交変換部Ａ３０８及び１Ｄ直交変換部Ｂ３０９を含む。尚、垂直逆変換部３０２及び水平逆変換部３０５の順序は、一例であり、これらは逆順であっても構わない。 Hereinafter, the details of the inverse orthogonal transform unit 105 according to the present embodiment will be described with reference to FIG.
The inverse orthogonal transform unit 105 includes a selection switch 301, a vertical inverse transform unit 302, a transposition unit 303, a selection switch 304, and a horizontal inverse transform unit 305. The vertical inverse transform unit 302 includes a 1D inverse orthogonal transform unit A306 and a 1D inverse orthogonal transform unit B307. The horizontal inverse transform unit 305 includes a 1D inverse orthogonal transform unit A308 and a 1D orthogonal transform unit B309. The order of the vertical inverse transform unit 302 and the horizontal inverse transform unit 305 is an example, and these may be reversed.

１Ｄ逆直交変換部Ａ３０６及び１Ｄ逆直交変換部Ａ３０８は、入力される行列に対して前述の１Ｄ変換行列Ａの転置行列を乗算する点で共通の機能を持ち、１Ｄ逆直交変換部Ｂ３０７及び１Ｄ逆直交変換部Ｂ３０９は、入力される行列に対して前述の１Ｄ変換行列Ｂの転置行列を乗算する点で共通の機能を持つ。従って、１Ｄ逆直交変換部Ａ３０６及び１Ｄ逆直交変換部Ａ３０８は、物理的に同一のハードウェアを時分割で使用することによっても実現可能である。また、１Ｄ逆直交変換部Ｂ３０７及び１Ｄ逆直交変換部Ｂ３０９も同様である。 The 1D inverse orthogonal transform unit A306 and the 1D inverse orthogonal transform unit A308 have a common function in that the input matrix is multiplied by the transposed matrix of the 1D transform matrix A described above, and the 1D inverse orthogonal transform units B307 and 1D. The inverse orthogonal transform unit B309 has a common function in that the input matrix is multiplied by the transposed matrix of the 1D transform matrix B described above. Therefore, the 1D inverse orthogonal transform unit A306 and the 1D inverse orthogonal transform unit A308 can also be realized by using physically identical hardware in a time division manner. The same applies to the 1D inverse orthogonal transform unit B307 and the 1D inverse orthogonal transform unit B309.

選択スイッチ３０１は、１Ｄ変換行列セット情報１２９に含まれる垂直変換インデックスに従って、復元変換係数１２２を１Ｄ逆直交変換部Ａ３０６及び１Ｄ逆直交変換部Ｂ３０７のうちのいずれか一方に導く。１Ｄ逆直交変換部Ａ３０６は、入力された復元変換係数１２２（行列形式）に対して１Ｄ変換行列Ａの転置行列を乗算して出力する。１Ｄ逆直交変換部Ｂ３０７は、入力された復元変換係数１２２に対して１Ｄ変換行列Ｂの転置行列を乗算して出力する。具体的には、１Ｄ逆直交変換部Ａ３０６及び１Ｄ逆直交変換部Ｂ３０７（即ち、垂直逆変換部３０２）は、次の数式（５）に示す一次元の逆直交変換を行う。 The selection switch 301 guides the restoration transform coefficient 122 to one of the 1D inverse orthogonal transform unit A306 and the 1D inverse orthogonal transform unit B307 according to the vertical transform index included in the 1D transform matrix set information 129. The 1D inverse orthogonal transform unit A306 multiplies the input transform transform coefficient 122 (matrix format) by the transposed matrix of the 1D transform matrix A and outputs the result. The 1D inverse orthogonal transform unit B307 multiplies the input restored transform coefficient 122 by the transposed matrix of the 1D transform matrix B and outputs the result. Specifically, the 1D inverse orthogonal transform unit A306 and the 1D inverse orthogonal transform unit B307 (that is, the vertical inverse transform unit 302) perform one-dimensional inverse orthogonal transform represented by the following equation (5).

数式（５）において、Ｚ'は復元変換係数１２２の行列（Ｎ×Ｎ）を示し、Ｖ^Ｔは１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂ（いずれもＮ×Ｎ）の転置行列を包括的に示しており、Ｙ'は１Ｄ逆直交変換部Ａ３０６及び１Ｄ逆直交変換部Ｂ３０７の出力行列（Ｎ×Ｎ）を示す。 In Equation (5), Z 'represents a matrix of the restored transform coefficients ^{122 (N × N), V} T is generically indicates a transposed matrix of 1D transform matrix A and 1D transformation matrix B (both N × N) Y ′ represents an output matrix (N × N) of the 1D inverse orthogonal transform unit A306 and the 1D inverse orthogonal transform unit B307.

転置部３０３は、垂直逆変換部３０２の出力行列（Ｙ'）の転置を行って、選択スイッチ３０４に与える。但し、転置部３０３は、一例であって、対応するハードウェアを必ずしも用意しなくてもよい。例えば、垂直逆変換部３０２による１Ｄ逆直交変換を実行した結果（垂直逆変換部３０２の出力行列の各要素）を保持しておき、水平逆変換部３０５による１Ｄ逆直交変換を実行するときに適切な順序で読み出せば、転置部３０３に対応するハードウェアを用意しなくても出力行列（Ｙ'）の転置を実行できる。 The transposition unit 303 transposes the output matrix (Y ′) of the vertical inverse transform unit 302 and gives the result to the selection switch 304. However, the transposition unit 303 is an example, and corresponding hardware may not necessarily be prepared. For example, when the 1D inverse orthogonal transform performed by the vertical inverse transform unit 302 is stored (each element of the output matrix of the vertical inverse transform unit 302) and the 1D inverse orthogonal transform is performed by the horizontal inverse transform unit 305. If read in an appropriate order, transposition of the output matrix (Y ′) can be executed without preparing hardware corresponding to the transposition unit 303.

選択スイッチ３０４は、１Ｄ変換行列セット情報１２９に含まれる水平変換インデックスに従って、転置部３０３からの入力行列を１Ｄ逆直交変換部Ａ３０８及び１Ｄ逆直交変換部Ｂ３０９のうちのいずれか一方に導く。１Ｄ逆直交変換部Ａ３０８は、入力行列に対して１Ｄ変換行列Ａの転置行列を乗算して出力する。１Ｄ逆直交変換部Ｂ３０９は、入力行列に対して１Ｄ変換行列Ｂの転置行列を乗算して出力する。具体的には、１Ｄ逆直交変換部Ａ３０８及び１Ｄ逆直交変換部Ｂ３０９（即ち、水平逆変換部３０５）は、次の数式（６）に示す一次元の逆直交変換を行う。 The selection switch 304 guides the input matrix from the transposition unit 303 to one of the 1D inverse orthogonal transform unit A308 and the 1D inverse orthogonal transform unit B309 according to the horizontal transform index included in the 1D transform matrix set information 129. The 1D inverse orthogonal transform unit A308 multiplies the input matrix by the transposed matrix of the 1D transform matrix A and outputs the result. The 1D inverse orthogonal transform unit B309 multiplies the input matrix by the transposed matrix of the 1D transform matrix B and outputs the result. Specifically, the 1D inverse orthogonal transform unit A308 and the 1D inverse orthogonal transform unit B309 (that is, the horizontal inverse transform unit 305) perform one-dimensional inverse orthogonal transform represented by the following equation (6).

数式（６）において、Ｈ^Ｔは１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂ（いずれもＮ×Ｎ）の転置行列を包括的に示しており、Ｘ'は１Ｄ逆直交変換部Ａ３０８及び１Ｄ逆直交変換部Ｂ３０９の出力行列（Ｎ×Ｎ）を示しており、これは復元予測誤差１２３を指す。 In Equation (6), ^{H T} is 1D transform matrix are generically indicates the transposed matrix of A and 1D transformation matrix B (both N × N), X 'is 1D inverse orthogonal transform unit A308 and 1D inverse orthogonal transform The output matrix (N × N) of the part B309 is shown, which indicates the restoration prediction error 123.

以上のように、逆直交変換部１０５は、復元変換係数（行列）１２２に対して、１Ｄ変換行列セット部１１２から入力された１Ｄ変換行列セット情報１２９に従って逆直交変換を行い、復元予測誤差（行列）１２３を生成する。尚、Ｈ．２６４を考慮すると、逆直交変換部１０５には、図示しないＩＤＣＴ部が含まれてもよいし、１Ｄ変換行列Ａと１Ｄ変換行列ＢのいずれかをＤＣＴのための行列に置き換えてもよい。例えば、１Ｄ変換行列ＢがＤＣＴのための行列であってもよい。更に、逆直交変換部１０５は、ＩＤＣＴに加えて、直交変換部１０２と調和するようにアダマール変換、後述するカルーネン・レーベ変換、離散サイン変換などの種々の直交変換に対応する逆直交変換を実現してもよい。 As described above, the inverse orthogonal transform unit 105 performs inverse orthogonal transform on the reconstructed transform coefficient (matrix) 122 in accordance with the 1D transform matrix set information 129 input from the 1D transform matrix set unit 112, and reconstructed prediction error ( Matrix) 123 is generated. H. In consideration of H.264, the inverse orthogonal transform unit 105 may include an IDCT unit (not shown), or one of the 1D transform matrix A and the 1D transform matrix B may be replaced with a matrix for DCT. For example, the 1D transformation matrix B may be a matrix for DCT. Further, in addition to the IDCT, the inverse orthogonal transform unit 105 realizes inverse orthogonal transforms corresponding to various orthogonal transforms such as Hadamard transform, Karhunen-Loeve transform, and discrete sine transform, which will be described later, in harmony with the orthogonal transform unit 102. May be.

以下、１Ｄ変換行列セット部１１２が生成する、本実施形態に係る１Ｄ変換行列セット情報１２９の詳細を説明する。
１Ｄ変換行列セット情報１２９は、垂直直交変換及び垂直逆直交変換のために使用される変換行列を選択するための垂直変換インデックスと、水平直交変換及び水平逆直交変換のために使用される変換行列を選択するための水平変換インデックスとを直接的または間接的に示す。例えば、１Ｄ変換行列セット情報１２９は、図４Ｄに示す変換インデックス（TransformIdx）で表現することができる。図４Ｄのテーブルを参照すれば、変換インデックスから垂直変換インデックス（Vertical Transform Idx）及び水平変換インデックス（Horizontal Transform Idx）を導出できる。 Hereinafter, details of the 1D conversion matrix set information 129 according to the present embodiment generated by the 1D conversion matrix set unit 112 will be described.
The 1D transformation matrix set information 129 includes a vertical transformation index for selecting a transformation matrix used for vertical orthogonal transformation and vertical inverse orthogonal transformation, and a transformation matrix used for horizontal orthogonal transformation and horizontal inverse orthogonal transformation. The horizontal transformation index for selecting is directly or indirectly indicated. For example, the 1D transformation matrix set information 129 can be expressed by a transformation index (TransformIdx) illustrated in FIG. 4D. With reference to the table of FIG. 4D, a vertical transformation index (Vertical Transform Idx) and a horizontal transformation index (Horizontal Transform Idx) can be derived from the transformation index.

図４Ｂに示すように、垂直変換インデックスが「０」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ａ（1D_Transform_Matrix_A）またはその転置行列が選択される。一方、垂直変換インデックスが「１」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｂ（1D_Transform_Matrix_B）またはその転置行列が選択される。 As shown in FIG. 4B, if the vertical transformation index is “0”, the 1D transformation matrix A (1D_Transform_Matrix_A) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation. On the other hand, if the vertical transformation index is “1”, the aforementioned 1D transformation matrix B (1D_Transform_Matrix_B) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation.

図４Ｃに示すように、水平変換インデックスが「０」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ａ（1D_Transform_Matrix_A）またはその転置行列が選択される。一方、水平変換インデックスが「１」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｂ（1D_Transform_Matrix_B）またはその転置行列が選択される。 As shown in FIG. 4C, if the horizontal transformation index is “0”, the 1D transformation matrix A (1D_Transform_Matrix_A) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation. On the other hand, if the horizontal transformation index is “1”, the aforementioned 1D transformation matrix B (1D_Transform_Matrix_B) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation.

また、各（イントラ）予測モードのインデックス（IntraNxNPredModeIndex）と、その名称（Name of IntraNxNPredMode）と、対応する垂直変換インデックス及び水平変換インデックスを図４Ａに例示する。尚、図４Ａにおいて、「NxN」は予測対象ブロックのサイズを表している（Ｎ＝４，８，１６など）。予測対象ブロックのサイズは、「MxN」（即ち、正方形以外の矩形）に拡張することもできる。
ここで、図４Ａと図４Ｄを統合した、各予測モードのインデックスとその名称と、対応する変換インデックスを図４Ｅに例示する。 FIG. 4A illustrates an index (IntraNxNPredModeIndex) of each (intra) prediction mode, its name (Name of IntraNxNPredMode), and a corresponding vertical conversion index and horizontal conversion index. In FIG. 4A, “NxN” represents the size of the prediction target block (N = 4, 8, 16, etc.). The size of the prediction target block can be expanded to “MxN” (that is, a rectangle other than a square).
Here, FIG. 4E illustrates an index of each prediction mode, a name thereof, and a corresponding conversion index obtained by integrating FIGS. 4A and 4D.

１Ｄ変換行列セット部１１２は、予測情報１２６に含まれる予測モード情報から予測モードのインデックスを検出し、対応する１Ｄ変換行列セット情報１２９を生成する。尚、図４Ａ、図４Ｂ、図４Ｃ、図４Ｄ及び図４Ｅに示す各種テーブルは一例であり、１Ｄ変換行列セット部１１２はこれらのテーブルの一部または全部を使用することなく１Ｄ変換行列セット情報１２９を生成してよい。 The 1D transformation matrix set unit 112 detects a prediction mode index from the prediction mode information included in the prediction information 126, and generates corresponding 1D transformation matrix set information 129. Note that the various tables shown in FIGS. 4A, 4B, 4C, 4D, and 4E are examples, and the 1D conversion matrix set unit 112 uses the 1D conversion matrix set information without using some or all of these tables. 129 may be generated.

例えば、ＴｒａｓｎｆｏｒｍＩｄｘが０を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ａを、水平直交変換には１Ｄ変換行列Ａを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ａの転置行列を、水平逆直交変換には１Ｄ変換行列Ａの転置行列を使用することを意味する。 For example, when the TransformIdx indicates 0, it means that the Vertical Transform index indicates 0 and the Horizontal Transform index indicates 0. That is, it means that the 1D conversion matrix A is used for the vertical orthogonal transformation, and the 1D conversion matrix A is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix A is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix A is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが１を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ａを、水平直交変換には１Ｄ変換行列Ｂを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ａの転置行列を、水平逆直交変換には１Ｄ変換行列Ｂの転置行列を使用することを意味する。 When the TransformIdx indicates 1, it means that the Vertical Transform index indicates 0 and the Horizontal Transform index indicates 1. That is, it means that the 1D conversion matrix A is used for the vertical orthogonal transformation, and the 1D transformation matrix B is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix A is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix B is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが２を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｂを、水平直交変換には１Ｄ変換行列Ａを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｂの転置行列を、水平逆直交変換には１Ｄ変換行列Ａの転置行列を使用することを意味する。 When the TransformIdx indicates 2, it means that the Vertical Transform index indicates 1, and the Horizontal Transform index indicates 0. That is, it means that the 1D transformation matrix B is used for the vertical orthogonal transformation and the 1D transformation matrix A is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix B is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix A is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが３を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１をＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｂを、水平直交変換には１Ｄ変換行列Ｂを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｂの転置行列を、水平逆直交変換には１Ｄ変換行列Ｂの転置行列を使用することを意味する。 When TransformIdx indicates 3, it means that Vertical Transform index indicates 1, and Horizontal Transform index indicates 1. That is, it means that the 1D transformation matrix B is used for the vertical orthogonal transformation, and the 1D transformation matrix B is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix B is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix B is used for horizontal inverse orthogonal transformation.

図４Ａに示すテーブルは、前述の各イントラ予測モードの傾向を考慮して１Ｄ変換行列セット情報１２９を割り当てている。即ち、予測誤差の垂直方向に上記傾向を示す予測モードには、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘに０が、水平方向に上記傾向を示すモードには、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘに０が割り当られている。一方、上記傾向を示さない方向には、夫々１が割り当られている。各予測モードの垂直方向及び水平方向を上記傾向の有無に従って２つのクラスに分類し、垂直方向及び水平方向の夫々について適応的に１Ｄ変換行列Ａまたは１Ｄ変換行列Ｂを適用することにより、各予測モードに一律にＤＣＴなどの固定的な直交変換を施す場合に比べて、高い変換効率が達成される。 In the table shown in FIG. 4A, 1D transformation matrix set information 129 is assigned in consideration of the tendency of each intra prediction mode described above. In other words, 0 is assigned to the Vertical Transform index in the prediction mode that shows the above-mentioned tendency in the vertical direction of the prediction error, and 0 is assigned to the Horizontal Transform index in the mode that shows the above-mentioned tendency in the horizontal direction. On the other hand, 1 is assigned to each direction in which the above tendency is not exhibited. By classifying the vertical direction and horizontal direction of each prediction mode into two classes according to the presence or absence of the above-mentioned trends, each prediction mode is applied by adaptively applying the 1D conversion matrix A or the 1D conversion matrix B to each of the vertical direction and the horizontal direction. High conversion efficiency is achieved as compared with a case where fixed orthogonal transform such as DCT is uniformly applied to the mode.

以下、係数順制御部１１３の詳細を説明する。
係数順制御部１１３は、２次元表現である量子化変換係数１２１の各要素を所定の順序に従って配列することにより、１次元表現である量子化変換係数列１１７に変換する。一例として、係数順制御部１１３は、予測モードに関わらず共通の２Ｄ−１Ｄ変換を行うことができる。具体的には、係数制御部１１３は、Ｈ．２６４と同様にジグザグスキャンを利用できる。ジグザグスキャンは、図８Ａに示すような順序で量子化変換係数１２１の各要素を配列して、図８Ｂに示すような量子化変換係数列１１７に変換する。図８Ａ及び図８Ｂにおいて、（ｉ，ｊ）は各要素の量子化変換係数（行列）１２１中の座標（位置情報）を示す。また、図８Ｃは、ジグザグスキャンを利用した２Ｄ−１Ｄ変換（４×４画素ブロックの場合）を示している。具体的には、図８Ｃは、ジグザグスキャンを利用して２Ｄ−１Ｄ変換された量子化変換係数列１１７の係数順（スキャン順）を示すインデックス（ｉｄｘ）と、対応する量子化変換係数１２１の要素（ｃｉｊ）とを示している。尚、図８Ｃにおいて、ｃｉｊは、量子化変換係数（行列）１２１中の座標（ｉ，ｊ）の要素を示している。 Details of the coefficient order control unit 113 will be described below.
The coefficient order control unit 113 converts each element of the quantized transform coefficient 121 that is a two-dimensional representation into a quantized transform coefficient sequence 117 that is a one-dimensional representation by arranging the elements in a predetermined order. As an example, the coefficient order control unit 113 can perform common 2D-1D conversion regardless of the prediction mode. Specifically, the coefficient control unit 113 includes the H.264 standard. Similar to H.264, zigzag scanning can be used. In the zigzag scan, the respective elements of the quantized transform coefficients 121 are arranged in the order shown in FIG. 8A and converted into a quantized transform coefficient string 117 as shown in FIG. 8B. 8A and 8B, (i, j) indicates the coordinates (position information) in the quantized transform coefficient (matrix) 121 of each element. FIG. 8C shows 2D-1D conversion using a zigzag scan (in the case of a 4 × 4 pixel block). Specifically, FIG. 8C shows an index (idx) indicating the coefficient order (scan order) of the quantized transform coefficient sequence 117 subjected to 2D-1D conversion using zigzag scanning, and the corresponding quantized transform coefficient 121. Element (cij) is shown. In FIG. 8C, cij indicates an element of coordinates (i, j) in the quantized transform coefficient (matrix) 121.

別の例として、係数順制御部１１３は、予測モード毎の個別の２Ｄ−１Ｄ変換を行うことができる。このような動作を行う係数順制御部１１３は、図５Ａに例示されている。この係数順制御部１１３は、選択スイッチ５０１と、９種類の予測モード毎の個別の２Ｄ−１Ｄ変換部５０２，・・・，５１０とを含む。選択スイッチ５０１は、予測情報１２６に含まれる予測モード情報（例えば、図４Ａの予測モードのインデックス）に従って量子化変換係数１２１を、予測モードに応じた２Ｄ−１Ｄ変換部（５０２，・・・，５１０のうちいずれか１つ）に導く。例えば、予測モードインデックスが０であれば、選択スイッチ５０１は量子化変換係数１２１を２Ｄ−１Ｄ変換部５０２に導く。図５Ａにおいて、各予測モードと２Ｄ−１Ｄ変換部とは１対１に対応しており、量子化変換係数１２１は予測モードに応じた１つの２Ｄ−１Ｄ変換部に導かれる。図９は、各２Ｄ−１Ｄ変換部５０２，・・・，５１０が行う２Ｄ−１Ｄ変換（４×４画素ブロックの場合）を例示する。尚、図９に示されるような予測モード毎の２Ｄ−１Ｄ変換の具体的な設計手法は、後述される。各予測モードに対応する２Ｄ−１Ｄ変換部によって２Ｄ−１Ｄ変換された量子化変換係数列１１７の係数順（スキャン順）を示すインデックス（ｉｄｘ）と、対応する量子化変換係数１２１の要素（ｃｉｊ）とを示している。尚、図９において、ｃｉｊは、量子化変換係数（行列）１２１中の座標（ｉ，ｊ）の要素を示している。また、図９において、各予測モードは、その名称によって表されているが、予測モードインデックスとの対応は図４Ａに示す通りである。このように、予測モード毎の個別の２Ｄ−１Ｄ変換を適用すれば、例えば予測モード毎の量子化変換係数１２１における非零係数の発生傾向に適合した順序で係数がスキャンされるので符号化効率が向上する。 As another example, the coefficient order control unit 113 can perform individual 2D-1D conversion for each prediction mode. The coefficient order control unit 113 that performs such an operation is illustrated in FIG. 5A. The coefficient order control unit 113 includes a selection switch 501 and individual 2D-1D conversion units 502,..., 510 for each of nine types of prediction modes. The selection switch 501 converts the quantized transform coefficient 121 according to the prediction mode information included in the prediction information 126 (for example, the index of the prediction mode in FIG. 4A), and the 2D-1D conversion unit (502,. Any one of 510). For example, if the prediction mode index is 0, the selection switch 501 guides the quantized transform coefficient 121 to the 2D-1D transform unit 502. In FIG. 5A, each prediction mode and the 2D-1D conversion unit have a one-to-one correspondence, and the quantized transform coefficient 121 is guided to one 2D-1D conversion unit corresponding to the prediction mode. FIG. 9 illustrates 2D-1D conversion (in the case of a 4 × 4 pixel block) performed by each 2D-1D conversion unit 502,. A specific design method for 2D-1D conversion for each prediction mode as shown in FIG. 9 will be described later. An index (idx) indicating the coefficient order (scan order) of the quantized transform coefficient sequence 117 subjected to 2D-1D conversion by the 2D-1D transform unit corresponding to each prediction mode, and an element (cij) of the corresponding quantized transform coefficient 121 ). In FIG. 9, cij represents an element of coordinates (i, j) in the quantized transform coefficient (matrix) 121. Further, in FIG. 9, each prediction mode is represented by its name, but the correspondence with the prediction mode index is as shown in FIG. 4A. As described above, when the individual 2D-1D transform for each prediction mode is applied, for example, the coefficients are scanned in an order suitable for the generation tendency of the non-zero coefficient in the quantized transform coefficient 121 for each prediction mode. Will improve.

尚、簡単化のために４×４画素ブロックに関する例を示したが、８×８画素ブロック、１６×１６画素ブロックなどに関しても同様に、予測モード毎の個別の２Ｄ−１Ｄ変換を規定できる。また、画素ブロックがＭ×Ｎで表現される矩形ブロックであるならば、２Ｄ−１Ｄ変換を行うブロックサイズとしてＭ×Ｎを用いることもできる。この場合には、矩形ブロックに関して、予測モード毎に図９に例示されるような個別の２Ｄ−１Ｄ変換を規定すればよい。 For simplification, an example related to a 4 × 4 pixel block is shown, but individual 2D-1D conversion for each prediction mode can be defined similarly for an 8 × 8 pixel block, a 16 × 16 pixel block, and the like. Further, if the pixel block is a rectangular block expressed by M × N, M × N can be used as a block size for performing 2D-1D conversion. In this case, regarding the rectangular block, individual 2D-1D conversion as exemplified in FIG. 9 may be defined for each prediction mode.

更に別の例として、係数順制御部１１３は、２Ｄ−１Ｄ変換におけるスキャン順を動的に更新してもよい。このような動作を行う係数順制御部１１３は、図５Ｂに例示される。この係数順制御部１１３は、選択スイッチ５０１と、９種類の予測モード毎の個別の２Ｄ−１Ｄ変換部５０２，・・・，５１０と、発生頻度カウント部５１１と、係数順更新部５１２とを含む。選択スイッチ５０１は、図５Ａに関して説明した通りである。９種類の予測モード毎の個別の２Ｄ−１Ｄ変換部５０２，・・・，５１０は、そのスキャン順が係数順更新部５１２によって更新される点で図５Ａとは異なる。 As yet another example, the coefficient order control unit 113 may dynamically update the scan order in the 2D-1D conversion. The coefficient order control unit 113 that performs such an operation is illustrated in FIG. 5B. The coefficient order control unit 113 includes a selection switch 501, individual 2D-1D conversion units 502,..., 510 for each of nine types of prediction modes, an occurrence frequency counting unit 511, and a coefficient order updating unit 512. Including. The selection switch 501 is as described with reference to FIG. 5A. The individual 2D-1D conversion units 502,..., 510 for each of the nine types of prediction modes differ from FIG. 5A in that the scan order is updated by the coefficient order update unit 512.

発生頻度カウント部５１１は、予測モード毎に、量子化変換係数列１１７の各要素における非零係数の発生回数のヒストグラムを作成する。発生頻度カウント部５１１は、作成したヒストグラム５１３を係数順更新部５１２に入力する。 The occurrence frequency counting unit 511 creates a histogram of the number of occurrences of non-zero coefficients in each element of the quantized transform coefficient sequence 117 for each prediction mode. The occurrence frequency counting unit 511 inputs the created histogram 513 to the coefficient order updating unit 512.

係数順更新部５１２は、予め定められたタイミングで、ヒストグラム５１３に基づいて係数順の更新を行う。上記タイミングは、例えば、コーディングツリーユニットの符号化処理が終了したタイミング、コーディングツリーユニット内の１ライン分の符号化処理が終了したタイミングなどである。 The coefficient order update unit 512 updates the coefficient order based on the histogram 513 at a predetermined timing. The timing is, for example, the timing when the coding process of the coding tree unit is finished, the timing when the coding process for one line in the coding tree unit is finished, or the like.

具体的には、係数順更新部５１２は、ヒストグラム５１３を参照して、非零係数の発生回数が閾値以上にカウントされた要素を持つ予測モードに関して係数順の更新を行う。例えば、係数順更新部５１２は、非零係数の発生が１６回以上カウントされた要素を持つ予測モードに関して更新を行う。このような発生回数に閾値を設けることによって、係数順の更新が大域的に実施されるので、局所的な最適解に収束しにくくなる。 Specifically, the coefficient order update unit 512 refers to the histogram 513, and updates the coefficient order for the prediction mode having an element in which the number of occurrences of non-zero coefficients is counted more than a threshold. For example, the coefficient order update unit 512 updates the prediction mode having an element in which the occurrence of a non-zero coefficient is counted 16 times or more. By providing a threshold value for the number of occurrences, the coefficient order is updated globally, so that it is difficult to converge to a local optimum solution.

係数順更新部５１２は、更新対象となる予測モードに関して、非零係数の発生頻度の降順に要素をソーティングする。ソーティングは、例えばバブルソート、クイックソートなどの既存のアルゴリズムによって実現できる。そして、係数順更新部５１２は、ソーティングされた要素の順序を示す係数順更新情報５１４を、更新対象となる予測モードに対応する２Ｄ−１Ｄ変換部に入力する。 The coefficient order update unit 512 sorts the elements in descending order of the occurrence frequency of the non-zero coefficient regarding the prediction mode to be updated. Sorting can be realized by existing algorithms such as bubble sort and quick sort. Then, the coefficient order update unit 512 inputs coefficient order update information 514 indicating the order of the sorted elements to the 2D-1D conversion unit corresponding to the prediction mode to be updated.

係数順更新情報５１４が入力されると、２Ｄ−１Ｄ変換部は更新後のスキャン順に従って２Ｄ−１Ｄ変換を行う。尚、スキャン順を動的に更新する場合には、各２Ｄ−１Ｄ変換部の初期スキャン順を予め定めておく必要がある。例えば、ジグザグスキャンまたは図９に例示したスキャン順が、初期スキャン順として利用できる。 When the coefficient order update information 514 is input, the 2D-1D conversion unit performs 2D-1D conversion in accordance with the updated scan order. When the scan order is dynamically updated, the initial scan order of each 2D-1D conversion unit needs to be determined in advance. For example, the zigzag scan or the scan order illustrated in FIG. 9 can be used as the initial scan order.

このように、動的にスキャン順を更新することにより、予測画像の性質、量子化情報（量子化パラメータ）などの影響に応じて、量子化変換係数１２１における非零係数の発生傾向が変化する場合にも、安定的に高い符号化効率を期待できる。具体的には、エントロピー符号化部１１４におけるランレングス符号化の発生符号量を抑制できる。 In this way, by dynamically updating the scan order, the tendency of occurrence of non-zero coefficients in the quantized transform coefficients 121 changes according to the influence of the properties of the predicted image, quantization information (quantization parameters), and the like. Even in this case, high encoding efficiency can be expected stably. Specifically, the generated code amount of run-length encoding in the entropy encoding unit 114 can be suppressed.

尚、簡単化のためにＨ．２６４を例示して予測モードが９種類の場合を説明したが、予測モードが１７種類、３３種類などに拡張された場合にも、拡張された各予測モードに対応する２Ｄ−１Ｄ変換部を追加すれば予測モード毎の個別の２Ｄ−１Ｄ変換を行うことができる。 For simplification, H.C. In the example of H.264, there are 9 types of prediction modes. However, when the prediction modes are expanded to 17 types, 33 types, etc., a 2D-1D conversion unit corresponding to each expanded prediction mode is added. Then, individual 2D-1D conversion for each prediction mode can be performed.

以下、図１０Ａ及び図１０Ｂを用いて、図１の画像符号化装置が符号化対象ブロック（コーディングツリーユニット）に対して行う処理を説明する。尚、図１０Ａ及び図１０Ｂの例では、本実施形態に係る直交変換及び逆直交変換（即ち、１Ｄ変換行列セット情報１２９に基づく適応的な直交変換及び逆直交変換）が有効であることを前提としている。しかしながら、後述するようにシンタクスによって本実施形態に係る直交変換及び逆直交変換が無効となることが規定されてもよい。 Hereinafter, processing performed by the image coding apparatus in FIG. 1 for a coding target block (coding tree unit) will be described with reference to FIGS. 10A and 10B. In the examples of FIGS. 10A and 10B, it is assumed that the orthogonal transformation and inverse orthogonal transformation (that is, adaptive orthogonal transformation and inverse orthogonal transformation based on the 1D transformation matrix set information 129) according to the present embodiment are effective. It is said. However, as described later, it may be specified that the orthogonal transform and the inverse orthogonal transform according to the present embodiment are invalidated by the syntax.

入力画像１１８が符号化対象ブロック単位で図１の画像符号化装置に入力されると、符号化対象ブロックの符号化処理が開始する（ステップＳ６０１）。イントラ予測部１０８及びインター予測部１０９は、参照画像メモリ１０７に保存されている参照画像１２５を用いて、イントラ予測画像及びインター予測画像を生成する（ステップＳ６０２）。符号化制御部１１６は前述の符号化コストなどの観点から最適な予測モードを判定し、予測情報１２６を生成する（ステップＳ６０３）。予測情報１２６は、予測選択部１１０から前述のように各要素に入力される。ステップＳ６０３において生成された予測情報１２６がイントラ予測を示唆するのであれば処理はステップＳ６０５に進み、インター予測を示唆するのであれば処理はステップＳ６０５’に進む。 When the input image 118 is input to the image encoding apparatus in FIG. 1 in units of encoding target blocks, the encoding target block encoding process starts (step S601). The intra prediction unit 108 and the inter prediction unit 109 generate an intra prediction image and an inter prediction image using the reference image 125 stored in the reference image memory 107 (step S602). The encoding control unit 116 determines the optimal prediction mode from the viewpoint of the above-described encoding cost and generates the prediction information 126 (step S603). The prediction information 126 is input from the prediction selection unit 110 to each element as described above. If the prediction information 126 generated in step S603 indicates intra prediction, the process proceeds to step S605. If the prediction information 126 indicates inter prediction, the process proceeds to step S605 '.

ステップＳ６０５では、減算部１０１が符号化対象ブロックから（イントラ）予測画像１２７を減算して予測誤差１１９を生成し、処理はステップＳ６０６に進む。一方、ステップＳ６０５’でも同様に、減算部１０１が符号化対象ブロックから（インター）予測画像１２７を減算して予測誤差１１９を生成し、処理はステップＳ６１４’に進む。 In step S605, the subtraction unit 101 generates the prediction error 119 by subtracting the (intra) prediction image 127 from the encoding target block, and the process proceeds to step S606. On the other hand, in step S605 'as well, the subtracting unit 101 subtracts the (inter) predicted image 127 from the encoding target block to generate a prediction error 119, and the process proceeds to step S614'.

ステップＳ６０６では、１Ｄ変換行列セット部１１２が、ステップＳ６０３において生成された予測情報１２６に含まれる予測モード情報を抽出する。１Ｄ変換行列セット部１１２は、抽出した予測モード情報に基づいて（例えば、図４Ａのテーブルを参照して）１Ｄ変換行列セット情報１２９を生成する（ステップＳ６０７）。１Ｄ変換行列セット部１１２は、１Ｄ変換行列セット情報１２９を直交変換部１０２及び逆直交変換部１０５に入力する。 In step S606, the 1D conversion matrix setting unit 112 extracts prediction mode information included in the prediction information 126 generated in step S603. The 1D transformation matrix set unit 112 generates 1D transformation matrix set information 129 based on the extracted prediction mode information (for example, referring to the table of FIG. 4A) (step S607). The 1D transform matrix set unit 112 inputs 1D transform matrix set information 129 to the orthogonal transform unit 102 and the inverse orthogonal transform unit 105.

直交変換部１０２内の選択スイッチ２０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０６または１Ｄ直交変換部Ｂ２０７を選択する（ステップＳ６０８、ステップＳ６０９及びステップＳ６１０）。一方、直交変換部１０２内の選択スイッチ２０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０８または１Ｄ直交変換部Ｂ２０９を選択する（ステップＳ６１１、ステップＳ６１２及びステップＳ６１３）。その後、処理はステップＳ６１４に進む。 The selection switch 201 in the orthogonal transform unit 102 selects the 1D orthogonal transform unit A206 or the 1D orthogonal transform unit B207 based on the 1D transform matrix set information 129 (steps S608, S609, and S610). On the other hand, the selection switch 204 in the orthogonal transform unit 102 selects the 1D orthogonal transform unit A208 or the 1D orthogonal transform unit B209 based on the 1D transform matrix set information 129 (steps S611, S612, and S613). Thereafter, the process proceeds to step S614.

例えば、１Ｄ変換行列セット情報１２９の一例である変換インデックス（ＴｒａｎｓｆｏｒｍＩｄｘ）が０の場合、選択スイッチ２０１は垂直変換部２０２内の１Ｄ直交変換部Ａ２０６を選択し（ステップＳ６０９）、選択スイッチ２０４は水平変換部２０５内の１Ｄ直交変換部Ａ２０８を選択する（ステップＳ６１２）。ＴｒａｎｓｆｏｒｍＩｄｘが１の場合、選択スイッチ２０１は垂直変換部２０２内の１Ｄ直交変換部Ａ２０６を選択し（ステップＳ６０９）、選択スイッチ２０４は水平変換部２０５内の１Ｄ直交変換部Ｂ２０９を選択する（ステップＳ６１３）。ＴｒａｎｓｆｏｒｍＩｄｘが２の場合、選択スイッチ２０１は垂直変換部２０２内の１Ｄ直交変換部Ｂ２０７を選択し（ステップＳ６１０）、選択スイッチ２０４は水平変換部２０５内の１Ｄ直交変換部Ａ２０８を選択する（ステップＳ６１２）。ＴｒａｎｓｆｏｒｍＩｄｘが３の場合、選択スイッチ２０１は垂直変換部２０２内の１Ｄ直交変換部Ｂ２０７を選択し（ステップＳ６１０）、選択スイッチ２０４は水平変換部２０５内の１Ｄ直交変換部Ｂ２０９を選択する（ステップＳ６１３）。 For example, when the transformation index (TransformIdx) that is an example of the 1D transformation matrix set information 129 is 0, the selection switch 201 selects the 1D orthogonal transformation unit A206 in the vertical transformation unit 202 (step S609), and the selection switch 204 is horizontal. The 1D orthogonal transform unit A208 in the transform unit 205 is selected (step S612). When TransformIdx is 1, the selection switch 201 selects the 1D orthogonal transform unit A206 in the vertical transform unit 202 (step S609), and the selection switch 204 selects the 1D orthogonal transform unit B209 in the horizontal transform unit 205 (step S613). ). When TransformIdx is 2, the selection switch 201 selects the 1D orthogonal transform unit B207 in the vertical transform unit 202 (step S610), and the selection switch 204 selects the 1D orthogonal transform unit A208 in the horizontal transform unit 205 (step S612). ). When TransformIdx is 3, the selection switch 201 selects the 1D orthogonal transform unit B207 in the vertical transform unit 202 (step S610), and the selection switch 204 selects the 1D orthogonal transform unit B209 in the horizontal transform unit 205 (step S613). ).

ステップＳ６１４では、直交変換部１０２が予測誤差１１９に対して、ステップＳ６０８，・・・，ステップＳ６１３による設定に応じた垂直変換及び水平変換を夫々行って、変換係数１２０を生成する。続いて、量子化部１０３がステップＳ６１４において生成された変換係数１２０に量子化を行って量子化変換係数１２１を生成し（ステップＳ６１５）、処理はステップＳ６１６に進む。 In step S614, the orthogonal transform unit 102 performs vertical transform and horizontal transform on the prediction error 119 according to the settings in step S608,..., Step S613, respectively, to generate the transform coefficient 120. Subsequently, the quantization unit 103 quantizes the transform coefficient 120 generated in step S614 to generate a quantized transform coefficient 121 (step S615), and the process proceeds to step S616.

一方、ステップＳ６１４’では、直交変換部１０２が予測誤差１１９に対して、例えばＤＣＴなどの固定的な直交変換を行って、変換係数１２０を生成する。続いて、量子化部１０３がステップＳ６１４’において生成された変換係数１２０に量子化を行って量子化変換係数１２１を生成し（ステップＳ６１５’）、処理はステップＳ６１７’に進む。尚、ステップＳ６１４’において行われる直交変換は、図示しないＤＣＴ部などによって実現されてもよいし、１Ｄ直交変換部Ｂ２０７及び１Ｄ直交変換部Ｂ２０９によって実現されてもよい。 On the other hand, in step S <b> 614 ′, the orthogonal transform unit 102 performs a fixed orthogonal transform such as DCT on the prediction error 119 to generate a transform coefficient 120. Subsequently, the quantization unit 103 quantizes the transform coefficient 120 generated in step S614 'to generate a quantized transform coefficient 121 (step S615'), and the process proceeds to step S617 '. Note that the orthogonal transform performed in step S614 'may be realized by a DCT unit (not shown) or the like, or may be realized by the 1D orthogonal transform unit B207 and the 1D orthogonal transform unit B209.

ステップＳ６１６では、係数順制御部１１３が、ステップＳ６０３において生成された予測情報１２６に含まれる予測モード情報に基づいてスキャン順（即ち、図５Ａ及び図５Ｂの例であれば、選択スイッチ５０１の接続先）を設定し、処理はステップＳ６１７に進む。但し、係数制御部１１３が予測モードに関わらず共通の２Ｄ−１Ｄ変換を行うのであれば、ステップＳ６１６は省略可能である。 In step S616, the coefficient order control unit 113 scans based on the prediction mode information included in the prediction information 126 generated in step S603 (that is, the connection of the selection switch 501 in the example of FIGS. 5A and 5B). First) is set, and the process proceeds to step S617. However, if the coefficient control unit 113 performs common 2D-1D conversion regardless of the prediction mode, step S616 can be omitted.

ステップＳ６１７では、係数順制御部１１３が量子化変換係数１２１に対して、ステップＳ６１６における設定に応じた２Ｄ−１Ｄ変換を行って量子化変換係数列１１７を生成する。続いて、エントロピー符号化部１１４が、この量子化変換係数列１１７を含む符号化パラメータをエントロピー符号化する（ステップＳ６１８）。符号化データ１３０は、符号化制御部１１６によって管理される適切なタイミングで出力される。一方、逆量子化部１０４は量子化変換係数１２１に逆量子化を行って復元変換係数１２２を生成し（ステップＳ６１９）、処理はステップＳ６２０に進む。 In step S617, the coefficient order control unit 113 performs 2D-1D conversion on the quantized transform coefficient 121 according to the setting in step S616 to generate a quantized transform coefficient sequence 117. Subsequently, the entropy encoding unit 114 performs entropy encoding on the encoding parameter including the quantized transform coefficient sequence 117 (step S618). The encoded data 130 is output at an appropriate timing managed by the encoding control unit 116. On the other hand, the inverse quantization unit 104 performs inverse quantization on the quantized transform coefficient 121 to generate a restored transform coefficient 122 (step S619), and the process proceeds to step S620.

ステップＳ６１７’では、係数順制御部１１３が量子化変換係数１２１に対して、例えばジグザグスキャンまたは図９のＩｎｔｒａ＿ＮｘＮ＿ＤＣに対応する２Ｄ−１Ｄ変換などの固定的な２Ｄ−１Ｄ変換を行って量子化変換係数列１１７を生成する。続いて、エントロピー符号化部１１４が、この量子化変換係数列１１７を含む符号化パラメータをエントロピー符号化する（ステップＳ６１８’）。符号化データ１３０は、符号化制御部１１６によって管理される適切なタイミングで出力される。一方、逆量子化部１０４は量子化変換係数１２１に逆量子化を行って復元変換係数１２２を生成し（ステップＳ６１９’）、処理はステップＳ６２６’に進む。 In step S617 ′, the coefficient order control unit 113 performs a fixed 2D-1D conversion such as a zigzag scan or a 2D-1D conversion corresponding to Intra_NxN_DC in FIG. A coefficient sequence 117 is generated. Subsequently, the entropy encoding unit 114 performs entropy encoding on the encoding parameter including the quantized transform coefficient sequence 117 (step S618 '). The encoded data 130 is output at an appropriate timing managed by the encoding control unit 116. On the other hand, the inverse quantization unit 104 performs inverse quantization on the quantized transform coefficient 121 to generate the restored transform coefficient 122 (step S619 '), and the process proceeds to step S626'.

逆直交変換部１０５内の選択スイッチ３０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０６または１Ｄ逆直交変換部Ｂ３０７を選択する（ステップＳ６２０、ステップＳ６２１及びステップＳ６２２）。一方、逆直交変換部１０５内の選択スイッチ３０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０８または１Ｄ逆直交変換部Ｂ３０９を選択する（ステップＳ６２３、ステップＳ６２４及びステップＳ６２５）。その後、処理はステップＳ６２６に進む。 The selection switch 301 in the inverse orthogonal transform unit 105 selects the 1D inverse orthogonal transform unit A306 or the 1D inverse orthogonal transform unit B307 based on the 1D transform matrix set information 129 (step S620, step S621, and step S622). On the other hand, the selection switch 304 in the inverse orthogonal transform unit 105 selects the 1D inverse orthogonal transform unit A308 or the 1D inverse orthogonal transform unit B309 based on the 1D transform matrix set information 129 (steps S623, S624, and S625). Thereafter, the process proceeds to step S626.

例えば、１Ｄ変換行列セット情報１２９の一例である変換インデックス（ＴｒａｎｓｆｏｒｍＩｄｘ）が０の場合、選択スイッチ３０１は垂直逆変換部３０２内の１Ｄ逆直交変換部Ａ３０６を選択し（ステップＳ６２１）、選択スイッチ３０４は水平逆変換部３０５内の１Ｄ逆直交変換部Ａ３０８を選択する（ステップＳ６２４）。ＴｒａｎｓｆｏｒｍＩｄｘが１の場合、選択スイッチ３０１は垂直逆変換部３０２内の１Ｄ逆直交変換部Ａ３０６を選択し（ステップＳ６２１）、選択スイッチ３０４は水平逆変換部３０５内の１Ｄ逆直交変換部Ｂ３０９を選択する（ステップＳ６２５）。ＴｒａｎｓｆｏｒｍＩｄｘが２の場合、選択スイッチ３０１は垂直逆変換部３０２内の１Ｄ逆直交変換部Ｂ３０７を選択し（ステップＳ６２２）、選択スイッチ３０４は水平逆変換部３０５内の１Ｄ逆直交変換部Ａ３０８を選択する（ステップＳ６２４）。ＴｒａｎｓｆｏｒｍＩｄｘが３の場合、選択スイッチ３０１は垂直逆変換部３０２内の１Ｄ逆直交変換部Ｂ３０７を選択し（ステップＳ６２２）、選択スイッチ３０４は水平逆変換部３０５内の１Ｄ逆直交変換部Ｂ３０９を選択する（ステップＳ６２５）。 For example, when the transform index (TransformIdx), which is an example of the 1D transform matrix set information 129, is 0, the selection switch 301 selects the 1D inverse orthogonal transform unit A306 in the vertical inverse transform unit 302 (step S621), and the selection switch 304 Selects the 1D inverse orthogonal transform unit A308 in the horizontal inverse transform unit 305 (step S624). When TransformIdx is 1, the selection switch 301 selects the 1D inverse orthogonal transform unit A306 in the vertical inverse transform unit 302 (step S621), and the selection switch 304 selects the 1D inverse orthogonal transform unit B309 in the horizontal inverse transform unit 305. (Step S625). When TransformIdx is 2, the selection switch 301 selects the 1D inverse orthogonal transform unit B307 in the vertical inverse transform unit 302 (step S622), and the selection switch 304 selects the 1D inverse orthogonal transform unit A308 in the horizontal inverse transform unit 305. (Step S624). When TransformIdx is 3, the selection switch 301 selects the 1D inverse orthogonal transform unit B307 in the vertical inverse transform unit 302 (step S622), and the selection switch 304 selects the 1D inverse orthogonal transform unit B309 in the horizontal inverse transform unit 305. (Step S625).

ステップＳ６２６では、逆直交変換部１０５が復元変換係数１２２に対して、ステップＳ６２０，・・・，ステップＳ６２５による設定に応じた垂直逆変換及び水平逆変換を夫々行って復元予測誤差１２３を生成し、処理はステップＳ６２７に進む。ステップＳ６２６’では、逆直交変換部１０５が復元変換係数１２２に対して、例えばＩＤＣＴなどの逆直交変換を行って復元予測誤差１２３を生成し、処理はステップＳ６２７に進む。尚、ステップＳ６２６’において行われる固定的な逆直交変換は、図示しないＩＤＣＴ部などによって実現されてもよいし、１Ｄ逆直交変換部Ｂ３０７及び１Ｄ逆直交変換部Ｂ３０９によって実現されてもよい。 In step S626, the inverse orthogonal transform unit 105 performs the vertical inverse transform and the horizontal inverse transform according to the settings in step S620,..., Step S625 on the restored transform coefficient 122 to generate the restored prediction error 123. The process proceeds to step S627. In step S626 ', the inverse orthogonal transform unit 105 performs inverse orthogonal transform such as IDCT on the reconstructed transform coefficient 122 to generate a reconstructed prediction error 123, and the process proceeds to step S627. Note that the fixed inverse orthogonal transform performed in step S626 'may be realized by an IDCT unit (not shown) or the like, or may be realized by the 1D inverse orthogonal transform unit B307 and the 1D inverse orthogonal transform unit B309.

ステップＳ６２７において、加算部１０６はステップＳ６２６またはステップＳ６２６’において生成された復元予測誤差１２３と予測画像１２７と加算して局所復号画像１２４を生成し、この局所復号画像１２４が参照画像として参照画像メモリ１０７に保存され、符号化対象ブロックの符号化処理が終了する（ステップＳ６２８）。 In step S627, the adding unit 106 adds the reconstructed prediction error 123 generated in step S626 or step S626 ′ and the predicted image 127 to generate a local decoded image 124. The local decoded image 124 is used as a reference image as a reference image memory. In step S628, the encoding process of the block to be encoded is completed.

以下、前述の１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂの設計手法について説明する。Ｈ．２６４の４×４画素ブロック及び８×８画素ブロックでは、夫々９種類の予測モードが定義されており、１６ｘ１６画素ブロックでは４種類の予測モードが定義されている。 Hereinafter, a design method of the above-described 1D conversion matrix A and 1D conversion matrix B will be described. H. In the H.264 4 × 4 pixel block and the 8 × 8 pixel block, nine types of prediction modes are defined, and in the 16 × 16 pixel block, four types of prediction modes are defined.

まず、各予測モードの予測誤差１１９を夫々生成する。各予測モードの予測誤差１１９のうち、参照画素からの距離が大きくなるにつれて予測誤差の絶対値が大きくなるという前述の傾向を垂直方向または水平方向に示すものを夫々収集する。そして、この傾向を示す方向を縦に設定して予測誤差１１９を横に並べた行列に対して特異値分解を行うことにより、係る行列の垂直方向の相関を除去する１Ｄ直交基底を設計する。この１Ｄ直交基底を行ベクトルとし縦に並べて１Ｄ変換行列Ａが生成される。 First, a prediction error 119 for each prediction mode is generated. Of the prediction errors 119 in each prediction mode, those indicating the above-mentioned tendency that the absolute value of the prediction error increases as the distance from the reference pixel increases in the vertical direction or the horizontal direction are collected. Then, by setting the direction indicating this tendency vertically and performing singular value decomposition on the matrix in which the prediction errors 119 are arranged horizontally, a 1D orthogonal basis for removing the correlation in the vertical direction of the matrix is designed. A 1D conversion matrix A is generated by vertically arranging the 1D orthogonal bases as row vectors.

一方、係る傾向を示さない方向を縦に設定して予測誤差１１９を横に並べた行列に対して、特異値分解を行うことにより、係る行列の垂直方向の相関を除去する１Ｄ直交基底を生成する。この１Ｄ直交基底を行ベクトルとし縦に並べて１Ｄ変換行列Ｂが生成される。尚、この１Ｄ変換行列Ｂは、単にＤＣＴのための行列で代用することも可能である。簡単化のために４×４画素ブロックに関する設計を例示したが、８×８画素ブロック及び１６×１６画素ブロックのための１Ｄ変換行列も同様に設計可能である。また、説明した設計手法は一例であり、前述の予測残差の性質を考慮して適宜設計を行う余地がある。 On the other hand, a singular value decomposition is performed on a matrix in which prediction directions 119 are set vertically and a prediction error 119 is arranged horizontally, thereby generating a 1D orthogonal basis for removing the vertical correlation of the matrix. To do. A 1D conversion matrix B is generated by vertically arranging the 1D orthogonal bases as row vectors. The 1D conversion matrix B can be simply replaced with a matrix for DCT. Although a design for a 4 × 4 pixel block is illustrated for simplicity, 1D transformation matrices for 8 × 8 pixel blocks and 16 × 16 pixel blocks can be designed as well. The described design method is an example, and there is room for appropriate design in consideration of the properties of the prediction residual described above.

以下、図９に例示されるような予測モード毎の２Ｄ−１Ｄ変換（スキャン順）の具体的な設計手法について説明する。予測モード毎のスキャン順は、量子化部１０３によって生成される量子化変換係数１２１に基づいて設計される。例えば、４×４画素ブロックに関する設計では、複数の訓練画像を用意して９種類の各予測モードの予測残差１１９を夫々生成する。この予測残差１１９の各々に対して数式（３）及び数式（４）に示す直交変換を行って変換係数１２０を生成し、更にこれを量子化する。量子化変換係数１２１に対して、４×４画素ブロック内の各要素について非零係数の発生回数を累積加算する。この累積加算は全ての訓練画像に対して行われ、４×４画素ブロックの１６個の要素毎に非零係数の発生頻度を示すヒストグラムが作成される。このヒストグラムに基づいて、発生頻度の高い要素から昇順にインデックス０〜１５が与えられる。このようなインデックスの割り当てが、全ての予測モードについて個別に行われる。割り当てられたインデックスの順序が、各予測モードに対応するスキャン順として使用される。 Hereinafter, a specific design method of 2D-1D conversion (scan order) for each prediction mode as exemplified in FIG. 9 will be described. The scan order for each prediction mode is designed based on the quantized transform coefficient 121 generated by the quantization unit 103. For example, in the design related to a 4 × 4 pixel block, a plurality of training images are prepared and the prediction residuals 119 of each of the nine types of prediction modes are generated. Each of the prediction residuals 119 is subjected to orthogonal transformation shown in Equation (3) and Equation (4) to generate a transform coefficient 120, which is further quantized. The number of occurrences of non-zero coefficients is cumulatively added to each quantized transform coefficient 121 for each element in the 4 × 4 pixel block. This cumulative addition is performed on all training images, and a histogram indicating the frequency of occurrence of non-zero coefficients is created for every 16 elements of the 4 × 4 pixel block. Based on this histogram, indexes 0 to 15 are given in ascending order from the element with the highest occurrence frequency. Such index assignment is performed individually for all prediction modes. The order of the assigned indexes is used as the scan order corresponding to each prediction mode.

簡単化のために４×４画素ブロックに関する設計を例示したが、８×８画素ブロック及び１６×１６画素ブロックのスキャン順も同様に設計可能である。また、予測モードが１７種類、３３種類及び任意の数に拡張しても同様の手法で設計可能である。尚、スキャン順を動的に更新する手法については、図５Ｂに関して説明した通りである。 For the sake of simplicity, the design related to the 4 × 4 pixel block is illustrated, but the scan order of the 8 × 8 pixel block and the 16 × 16 pixel block can be similarly designed. Further, even if the prediction modes are expanded to 17 types, 33 types, and an arbitrary number, the design can be performed by the same method. The method for dynamically updating the scan order is as described with reference to FIG. 5B.

以下、図１の画像符号化装置が利用するシンタクスについて説明する。
シンタクスは、画像符号化装置が動画像データを符号化する際の符号化データ（例えば、図１の符号化データ１３０）の構造を示している。この符号化データを復号化する際に、同じシンタクス構造を参照して画像復号化装置がシンタクス解釈を行う。図１の画像符号化装置が利用するシンタクス７００を図１１に例示する。 Hereinafter, the syntax used by the image encoding device in FIG. 1 will be described.
The syntax indicates the structure of encoded data (for example, encoded data 130 in FIG. 1) when the image encoding apparatus encodes moving image data. When decoding this encoded data, the image decoding apparatus interprets the syntax with reference to the same syntax structure. FIG. 11 illustrates a syntax 700 used by the image coding apparatus in FIG.

シンタクス７００は、ハイレベルシンタクス７０１、スライスレベルシンタクス７０２及びコーディングツリーレベルシンタクス７０３の３つのパートを含む。ハイレベルシンタクス７０１は、スライスよりも上位のレイヤのシンタクス情報を含む。スライスとは、フレームまたはフィールドに含まれる矩形領域もしくは連続領域を指す。スライスレベルシンタクス７０２は、各スライスを復号化するために必要な情報を含む。コーディングツリーレベルシンタクス７０３は、各コーディングツリー（即ち、各コーディングツリーユニット）を復号化するために必要な情報を含む。これら各パートは、更に詳細なシンタクスを含む。 The syntax 700 includes three parts: a high level syntax 701, a slice level syntax 702, and a coding tree level syntax 703. The high level syntax 701 includes syntax information of a layer higher than the slice. A slice refers to a rectangular area or a continuous area included in a frame or a field. The slice level syntax 702 includes information necessary for decoding each slice. The coding tree level syntax 703 includes information necessary for decoding each coding tree (ie, each coding tree unit). Each of these parts includes more detailed syntax.

ハイレベルシンタクス７０１は、シーケンスパラメータセットシンタクス７０４及びピクチャパラメータセットシンタクス７０５などの、シーケンス及びピクチャレベルのシンタクスを含む。スライスレベルシンタクス７０２は、スライスヘッダーシンタクス７０６及びスライスデータシンタクス７０７などを含む。コーディングツリーレベルシンタクス７０３は、コーディングツリーユニットシンタクス７０８及びプレディクションユニットシンタクス７０９などを含む。 High level syntax 701 includes sequence and picture level syntax, such as sequence parameter set syntax 704 and picture parameter set syntax 705. The slice level syntax 702 includes a slice header syntax 706, a slice data syntax 707, and the like. The coding tree level syntax 703 includes a coding tree unit syntax 708, a prediction unit syntax 709, and the like.

コーディングツリーユニットシンタクス７０８は、四分木構造を持つことができる。具体的には、コーディングツリーユニットシンタクス７０８のシンタクス要素として、更にコーディングツリーユニットシンタクス７０８を再帰呼び出しすることができる。即ち、１つのコーディングツリーユニットを四分木で細分化することができる。また、コーディングツリーユニットシンタクス７０８内にはトランスフォームユニットシンタクス７１０が含まれている。トランスフォームユニットシンタクス７１０は、四分木の最末端の各コーディングツリーユニットシンタクス７０８において呼び出される。トランスフォームユニットシンタクス７１０は、逆直交変換及び量子化などに関わる情報が記述されている。 The coding tree unit syntax 708 may have a quadtree structure. Specifically, the coding tree unit syntax 708 can be recursively called as a syntax element of the coding tree unit syntax 708. That is, one coding tree unit can be subdivided with a quadtree. The coding tree unit syntax 708 includes a transform unit syntax 710. The transform unit syntax 710 is invoked at each coding tree unit syntax 708 at the extreme end of the quadtree. The transform unit syntax 710 describes information related to inverse orthogonal transformation and quantization.

図１２は、本実施形態に係るスライスヘッダーシンタクス７０６を例示する。図１２に示されるｓｌｉｃｅ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇは、例えば、当該スライスに関して本実施形態に係る直交変換及び逆直交変換の有効／無効を示すシンタクス要素である。 FIG. 12 illustrates a slice header syntax 706 according to this embodiment. The slice_directive_unified_transform_flag shown in FIG. 12 is a syntax element indicating, for example, validity / invalidity of orthogonal transformation and inverse orthogonal transformation according to the present embodiment for the slice.

ｓｌｉｃｅ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが0である場合、当該スライス内での本実施形態に係る直交変換及び逆直交変換は無効である。故に、直交変換部１０２及び逆直交変換部１０５は、ＤＣＴ及びＩＤＣＴなどの固定的な直交変換及び逆直交変換を行う。この固定的な直交変換及び逆直交変換は、１Ｄ直交変換部Ｂ２０７、１Ｄ直交変換部Ｂ２０９、１Ｄ逆直交変換部３０７及び１Ｄ逆直交変換部３０９によって（即ち、１Ｄ変換行列Ｂによって）行われてもよいし、図示しないＤＣＴ部及びＩＤＣＴ部によって行われてもよい。また、係数順制御部１１３でも固定的な２Ｄ−１Ｄ変換（例えば、ジグザグスキャン）が行われる。この固定的な２Ｄ−１Ｄ変換は、２Ｄ−１Ｄ変換部（モード２）５０４によって行われてもよいし、図示しない２Ｄ−１Ｄ変換部によって行われてもよい。 When slice_directive_unified_transform_flag is 0, the orthogonal transform and inverse orthogonal transform according to the present embodiment in the slice are invalid. Therefore, the orthogonal transform unit 102 and the inverse orthogonal transform unit 105 perform fixed orthogonal transform and inverse orthogonal transform such as DCT and IDCT. This fixed orthogonal transform and inverse orthogonal transform are performed by the 1D orthogonal transform unit B207, the 1D orthogonal transform unit B209, the 1D inverse orthogonal transform unit 307, and the 1D inverse orthogonal transform unit 309 (that is, by the 1D transform matrix B). Alternatively, it may be performed by a DCT unit and an IDCT unit (not shown). The coefficient order control unit 113 also performs fixed 2D-1D conversion (for example, zigzag scanning). This fixed 2D-1D conversion may be performed by the 2D-1D conversion unit (mode 2) 504, or may be performed by a 2D-1D conversion unit (not shown).

一例として、ｓｌｉｃｅ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが１である場合には、当該スライス内全域で本実施形態に係る直交変換及び逆直交変換が有効となる。即ち、当該スライス内全域で図１０Ａ及び図１０Ｂに関して説明した符号化フローチャートに従って符号化処理が行われる。即ち、選択スイッチ２０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０６または１Ｄ直交変換部Ｂ２０７を選択する。選択スイッチ２０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０８または１Ｄ直交変換部Ｂ２０９を選択する。また、選択スイッチ３０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０６または１Ｄ逆直交変換部Ｂ３０７を選択する。選択スイッチ３０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０８または１Ｄ逆直交変換部Ｂ３０９を選択する。更に、選択スイッチ５０１は、予測情報１２６に含まれる予測モード情報に従って、２Ｄ−１Ｄ変換部５０２，・・・，５１０のいずれかを選択する。 As an example, when slice_directive_unified_transform_flag is 1, the orthogonal transformation and inverse orthogonal transformation according to the present embodiment are effective in the entire area in the slice. That is, the encoding process is performed in the entire area in the slice according to the encoding flowchart described with reference to FIGS. 10A and 10B. That is, the selection switch 201 selects the 1D orthogonal transform unit A206 or the 1D orthogonal transform unit B207 based on the 1D transform matrix set information 129. The selection switch 204 selects the 1D orthogonal transform unit A208 or the 1D orthogonal transform unit B209 based on the 1D transform matrix set information 129. The selection switch 301 selects the 1D inverse orthogonal transform unit A306 or the 1D inverse orthogonal transform unit B307 based on the 1D transform matrix set information 129. The selection switch 304 selects the 1D inverse orthogonal transform unit A308 or the 1D inverse orthogonal transform unit B309 based on the 1D transform matrix set information 129. Further, the selection switch 501 selects one of the 2D-1D conversion units 502,..., 510 according to the prediction mode information included in the prediction information 126.

また、別の例として、ｓｌｉｃｅ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが１である場合には、より下位のレイヤ（コーディングツリーユニット、トランスフォームユニットなど）のシンタクスにおいて当該スライス内部の局所領域毎に本実施形態に係る直交変換及び逆直交変換の有効／無効が規定されてもよい。 As another example, when slice_directional_unified_transform_flag is 1, the orthogonal transform and inverse according to this embodiment are performed for each local region in the slice in the syntax of a lower layer (coding tree unit, transform unit, etc.). Validity / invalidity of orthogonal transformation may be defined.

図１３は、本実施形態に係るコーディングツリーユニットシンタクス７０８を例示する。図１３に示されるｃｔｂ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇは、当該コーディングツリーユニットに関して本実施形態に係る直交変換及び逆直交変換の有効／無効を示すシンタクス要素である。また、図１３に示されるｐｒｅｄ＿ｍｏｄｅはプレディクションユニットシンタクス７０９に含まれるシンタクス要素の１つであり、当該コーディングツリーユニットもしくはマクロブロック内の符号化タイプを示している。MODE_INTRAは、符号化タイプがイントラ予測であることを示す。ｃｔｂ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇは、前述のｓｌｉｃｅ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが１であって、かつ、コーディングツリーユニットの符号化タイプがイントラ予測の時にのみ符号化される。 FIG. 13 illustrates a coding tree unit syntax 708 according to this embodiment. Ctb_directive_unified_transform_flag shown in FIG. 13 is a syntax element indicating validity / invalidity of orthogonal transform and inverse orthogonal transform according to the present embodiment with respect to the coding tree unit. Further, pred_mode shown in FIG. 13 is one of syntax elements included in the prediction unit syntax 709, and indicates the coding type in the coding tree unit or macroblock. MODE_INTRA indicates that the encoding type is intra prediction. ctb_directive_unified_transform_flag is encoded only when the above-mentioned slice_directional_unified_transform_flag is 1 and the coding type of the coding tree unit is intra prediction.

ｃｔｂ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが0である場合、当該コーディングツリーユニット内での本実施形態に係る直交変換及び逆直交変換は無効である。故に、直交変換部１０２及び逆直交変換部１０５は、ＤＣＴ及びＩＤＣＴなどの固定的な直交変換及び逆直交変換を行う。この固定的な直交変換及び逆直交変換は、１Ｄ直交変換部Ｂ２０７、１Ｄ直交変換部Ｂ２０９、１Ｄ逆直交変換部３０７及び１Ｄ逆直交変換部３０９によって（即ち、１Ｄ変換行列Ｂによって）行われてもよいし、図示しないＤＣＴ部及びＩＤＣＴ部によって行われてもよい。また、係数順制御部１１３でも固定的な２Ｄ−１Ｄ変換（例えば、ジグザグスキャン）が行われる。この固定的な２Ｄ−１Ｄ変換は、２Ｄ−１Ｄ変換部（モード２）５０４によって行われてもよいし、図示しない２Ｄ−１Ｄ変換部によって行われてもよい。 When ctb_directive_unified_transform_flag is 0, the orthogonal transform and the inverse orthogonal transform according to the present embodiment in the coding tree unit are invalid. Therefore, the orthogonal transform unit 102 and the inverse orthogonal transform unit 105 perform fixed orthogonal transform and inverse orthogonal transform such as DCT and IDCT. This fixed orthogonal transform and inverse orthogonal transform are performed by the 1D orthogonal transform unit B207, the 1D orthogonal transform unit B209, the 1D inverse orthogonal transform unit 307, and the 1D inverse orthogonal transform unit 309 (that is, by the 1D transform matrix B). Alternatively, it may be performed by a DCT unit and an IDCT unit (not shown). The coefficient order control unit 113 also performs fixed 2D-1D conversion (for example, zigzag scanning). This fixed 2D-1D conversion may be performed by the 2D-1D conversion unit (mode 2) 504, or may be performed by a 2D-1D conversion unit (not shown).

一方、ｃｔｂ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが１である場合、当該コーディングツリーユニット内で本実施形態に係る直交変換及び逆直交変換が有効となり、図１０Ａ及び図１０Ｂで説明した符号化フローチャートに従って符号化処理が行われる。即ち、選択スイッチ２０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０６または１Ｄ直交変換部Ｂ２０７を選択する。選択スイッチ２０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０８または１Ｄ直交変換部Ｂ２０９を選択する。また、選択スイッチ３０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０６または１Ｄ逆直交変換部Ｂ３０７を選択する。選択スイッチ３０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０８または１Ｄ逆直交変換部Ｂ３０９を選択する。更に、選択スイッチ５０１は、予測情報１２６に含まれる予測モード情報に従って、２Ｄ−１Ｄ変換部５０２，・・・，５１０のいずれかを選択する。 On the other hand, when ctb_directive_unified_transform_flag is 1, the orthogonal transform and the inverse orthogonal transform according to the present embodiment are valid in the coding tree unit, and the encoding process is performed according to the encoding flowchart described in FIGS. 10A and 10B. That is, the selection switch 201 selects the 1D orthogonal transform unit A206 or the 1D orthogonal transform unit B207 based on the 1D transform matrix set information 129. The selection switch 204 selects the 1D orthogonal transform unit A208 or the 1D orthogonal transform unit B209 based on the 1D transform matrix set information 129. The selection switch 301 selects the 1D inverse orthogonal transform unit A306 or the 1D inverse orthogonal transform unit B307 based on the 1D transform matrix set information 129. The selection switch 304 selects the 1D inverse orthogonal transform unit A308 or the 1D inverse orthogonal transform unit B309 based on the 1D transform matrix set information 129. Further, the selection switch 501 selects one of the 2D-1D conversion units 502,..., 510 according to the prediction mode information included in the prediction information 126.

図１３の例のように、コーディングツリーユニットシンタクス７０８において、本実施形態に係る直交変換及び逆直交変換の有効／無効を規定するフラグを符号化すると、このフラグを符号化しない場合に比べて情報量（符号量）は増大する。しかしながら、このフラグを符号化することにより、局所領域（即ち、コーディングツリーユニット）毎に最適な直交変換を行うことが可能となる。 In the coding tree unit syntax 708, as shown in the example of FIG. 13, when a flag specifying validity / invalidity of orthogonal transformation and inverse orthogonal transformation according to the present embodiment is encoded, information is encoded compared to a case where this flag is not encoded. The amount (code amount) increases. However, by encoding this flag, it is possible to perform optimal orthogonal transform for each local region (ie, coding tree unit).

図１４は、本実施形態に係るトランスフォームユニットシンタクス７１０を例示する。図１４に示されるｔｕ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇは、当該トランスフォームユニットに関して本実施形態に係る直交変換及び逆直交変換の有効／無効を示すシンタクス要素である。また、図１４に示されるｐｒｅｄ＿ｍｏｄｅはプレディクションユニットシンタクス７０９に含まれるシンタクス要素の１つであり、当該コーディングツリーユニットもしくはマクロブロック内の符号化タイプを示している。MODE_INTRAは、符号化タイプがイントラ予測であることを示す。ｔｕ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇはｓｌｉｃｅ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが１であって、かつ、コーディングツリーユニットの符号化タイプがイントラ予測の時にのみ符号化される。 FIG. 14 illustrates a transform unit syntax 710 according to this embodiment. A tu_directive_unified_transform_flag shown in FIG. 14 is a syntax element indicating validity / invalidity of the orthogonal transform and the inverse orthogonal transform according to the present embodiment with respect to the transform unit. Further, pred_mode shown in FIG. 14 is one of syntax elements included in the prediction unit syntax 709, and indicates the coding type in the coding tree unit or macroblock. MODE_INTRA indicates that the encoding type is intra prediction. Tu_directive_unified_transform_flag is encoded only when slice_directive_unified_transform_flag is 1 and the coding type of the coding tree unit is intra prediction.

ｔｕ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが0である場合、当該トランスフォームユニット内での本実施形態に係る直交変換及び逆直交変換は無効である。故に、直交変換部１０２及び逆直交変換部１０５は、ＤＣＴ及びＩＤＣＴなどの固定的な直交変換及び逆直交変換を行う。この固定的な直交変換及び逆直交変換は、１Ｄ直交変換部Ｂ２０７、１Ｄ直交変換部Ｂ２０９、１Ｄ逆直交変換部３０７及び１Ｄ逆直交変換部３０９によって（即ち、１Ｄ変換行列Ｂによって）行われてもよいし、図示しないＤＣＴ部及びＩＤＣＴ部によって行われてもよい。また、係数順制御部１１３でも固定的な２Ｄ−１Ｄ変換（例えば、ジグザグスキャン）が行われる。この固定的な２Ｄ−１Ｄ変換は、２Ｄ−１Ｄ変換部（モード２）５０４によって行われてもよいし、図示しない２Ｄ−１Ｄ変換部によって行われてもよい。 When tu_directive_unified_transform_flag is 0, the orthogonal transform and inverse orthogonal transform according to the present embodiment in the transform unit are invalid. Therefore, the orthogonal transform unit 102 and the inverse orthogonal transform unit 105 perform fixed orthogonal transform and inverse orthogonal transform such as DCT and IDCT. This fixed orthogonal transform and inverse orthogonal transform are performed by the 1D orthogonal transform unit B207, the 1D orthogonal transform unit B209, the 1D inverse orthogonal transform unit 307, and the 1D inverse orthogonal transform unit 309 (that is, by the 1D transform matrix B). Alternatively, it may be performed by a DCT unit and an IDCT unit (not shown). The coefficient order control unit 113 also performs fixed 2D-1D conversion (for example, zigzag scanning). This fixed 2D-1D conversion may be performed by the 2D-1D conversion unit (mode 2) 504, or may be performed by a 2D-1D conversion unit (not shown).

一方、ｔｕ＿ｄｉｒｅｃｔｉｏｎａｌ＿ｕｎｉｆｉｅｄ＿ｔｒａｎｓｆｏｒｍ＿ｆｌａｇが１である場合、当該トランスフォームユニット内での本実施形態に係る直交変換及び逆直交変換が有効となり、図１０Ａ及び図１０Ｂで説明した符号化フローチャートに従って符号化処理が行われる。即ち、選択スイッチ２０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０６または１Ｄ直交変換部Ｂ２０７を選択する。選択スイッチ２０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ直交変換部Ａ２０８または１Ｄ直交変換部Ｂ２０９を選択する。また、選択スイッチ３０１は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０６または１Ｄ逆直交変換部Ｂ３０７を選択する。選択スイッチ３０４は、１Ｄ変換行列セット情報１２９に基づいて１Ｄ逆直交変換部Ａ３０８または１Ｄ逆直交変換部Ｂ３０９を選択する。更に、選択スイッチ５０１は、予測情報１２６に含まれる予測モード情報に従って、２Ｄ−１Ｄ変換部５０２，・・・，５１０のいずれかを選択する。 On the other hand, when tu_directional_unified_transform_flag is 1, the orthogonal transform and inverse orthogonal transform according to the present embodiment in the transform unit are valid, and the encoding process is performed according to the encoding flowchart described in FIGS. 10A and 10B. That is, the selection switch 201 selects the 1D orthogonal transform unit A206 or the 1D orthogonal transform unit B207 based on the 1D transform matrix set information 129. The selection switch 204 selects the 1D orthogonal transform unit A208 or the 1D orthogonal transform unit B209 based on the 1D transform matrix set information 129. The selection switch 301 selects the 1D inverse orthogonal transform unit A306 or the 1D inverse orthogonal transform unit B307 based on the 1D transform matrix set information 129. The selection switch 304 selects the 1D inverse orthogonal transform unit A308 or the 1D inverse orthogonal transform unit B309 based on the 1D transform matrix set information 129. Further, the selection switch 501 selects one of the 2D-1D conversion units 502,..., 510 according to the prediction mode information included in the prediction information 126.

図１４の例のように、トランスフォームユニットシンタクス７１０において、本実施形態に係る直交変換及び逆直交変換の有効／無効を規定するフラグを符号化すると、このフラグを符号化しない場合に比べて情報量（符号量）は増大する。しかしながら、このフラグを符号化することにより、局所領域（即ち、トランスフォームユニット）毎に最適な直交変換を行うことが可能となる。 As in the example of FIG. 14, in the transform unit syntax 710, when the flag that defines the validity / invalidity of the orthogonal transform and the inverse orthogonal transform according to the present embodiment is encoded, the information is compared with the case where the flag is not encoded. The amount (code amount) increases. However, by encoding this flag, it is possible to perform optimal orthogonal transform for each local region (that is, transform unit).

尚、図１２、図１３及び図１４に例示するシンタクステーブルの行間には、本実施形態において規定していないシンタクス要素が挿入されてもよいし、その他の条件分岐に関する記述が含まれていてもよい。また、シンタクステーブルを複数のテーブルに分割したり、複数のシンタクステーブルを統合したりしてもよい。また、例示した各シンタクス要素の用語は、任意に変更可能である。 It should be noted that syntax elements not defined in this embodiment may be inserted between the rows of the syntax tables illustrated in FIGS. 12, 13, and 14, or other conditional branch descriptions may be included. Good. Further, the syntax table may be divided into a plurality of tables, or a plurality of syntax tables may be integrated. Moreover, the term of each illustrated syntax element can be changed arbitrarily.

以上説明したように、本実施形態に係る画像符号化装置は、参照画素からの距離が大きくなるにつれて予測精度が低下するというイントラ予測の傾向を利用する。この画像符号化装置は、各予測モードの垂直方向及び水平方向を上記傾向の有無に従って２つのクラスに分類し、垂直方向及び水平方向の夫々について適応的に１Ｄ変換行列Ａまたは１Ｄ変換行列Ｂを適用する。１Ｄ変換行列Ａは、参照画素群のラインに直交する方向（垂直方向または水平方向）について１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成される。一方、１Ｄ変換行列Ｂは、このような性質を持たない汎用的な変換行列を設計することによって生成される。例えば、汎用的な変換はＤＣＴである。故に、本実施形態に係る画像符号化装置によれば、各予測モードに一律にＤＣＴなどの固定的な直交変換を施す場合に比べて、高い変換効率が達成される。 As described above, the image encoding apparatus according to the present embodiment uses the tendency of intra prediction that the prediction accuracy decreases as the distance from the reference pixel increases. This image encoding apparatus classifies the vertical direction and horizontal direction of each prediction mode into two classes according to the presence or absence of the above-described tendency, and adaptively assigns the 1D conversion matrix A or 1D conversion matrix B to each of the vertical direction and horizontal direction. Apply. The 1D transform matrix A has high coefficient density when performing 1D orthogonal transform in the direction (vertical direction or horizontal direction) orthogonal to the line of the reference pixel group (that is, the ratio of non-zero coefficients in the quantized transform coefficient 121) Is generated in advance by designing a common transformation base so that On the other hand, the 1D transformation matrix B is generated by designing a general-purpose transformation matrix having no such property. For example, a generic conversion is DCT. Therefore, according to the image coding apparatus according to the present embodiment, high conversion efficiency is achieved as compared with a case where fixed orthogonal transform such as DCT is uniformly applied to each prediction mode.

また、本実施形態に係る直交変換部１０２及び逆直交変換部１０５は、ハードウェア実装及びソフトウェア実装のいずれにも好適である。
数式（３）乃至数式（６）は固定行列の乗算を表しているので、直交変換部及び逆直交変換部をハードウェア実装する場合には、乗算器よりもむしろハードワイヤードロジックによって構成されることが想定される。 Further, the orthogonal transform unit 102 and the inverse orthogonal transform unit 105 according to the present embodiment are suitable for both hardware implementation and software implementation.
Since Equations (3) to (6) represent multiplication of a fixed matrix, when the orthogonal transform unit and the inverse orthogonal transform unit are implemented by hardware, they are configured by hard-wired logic rather than a multiplier. Is assumed.

仮に、９種類のイントラ予測モードの夫々について専用の変換基底を用いて直交変換及び逆直交変換を行うとすれば、９個の２Ｄ直交変換部または図１５に示すように１８（＝９×２）個の１Ｄ直交変換部を用意する必要がある。これら９個の２Ｄ直交変換部または１８個の１Ｄ直交変換部は、夫々異なる変換行列を乗算するので、結果的に、Ｈ．２６４で必要なＤＣＴのための専用ハードウェアに加えて、追加で９個の２Ｄ直交変換部または１８個の１Ｄ直交変換部のための専用ハードウェアを設けることとなり、回路規模が増大する。 If orthogonal transform and inverse orthogonal transform are performed using a dedicated transform basis for each of the nine types of intra prediction modes, nine 2D orthogonal transform units or 18 (= 9 × 2) as shown in FIG. ) 1D orthogonal transform units must be prepared. These 9 2D orthogonal transform units or 18 1D orthogonal transform units multiply each by a different transform matrix. In addition to dedicated hardware for DCT required by H.264, additional dedicated hardware for nine 2D orthogonal transform units or 18 1D orthogonal transform units is provided, which increases the circuit scale.

一方、本実施形態に係る直交変換部及び逆直交変換部は、図２及び図３に示す通り、２個（垂直（逆）変換部及び水平（逆）変換部を時分割で共有する場合）の１Ｄ直交変換部と、行列の転置を行う回路との組み合わせによって４種類の２次元の直交変換を実行する。故に、本実施形態に係る直交変換部及び逆直交変換部によれば、ハードウェア実装における回路規模の増加を大幅に抑制できる。 On the other hand, as shown in FIGS. 2 and 3, two orthogonal transform units and inverse orthogonal transform units according to the present embodiment are used (when the vertical (inverse) transform unit and the horizontal (inverse) transform unit are shared in a time division manner). The four types of two-dimensional orthogonal transforms are executed by a combination of the 1D orthogonal transform unit and a circuit for transposing the matrix. Therefore, according to the orthogonal transformation part and the inverse orthogonal transformation part which concern on this embodiment, the increase in the circuit scale in hardware mounting can be suppressed significantly.

また、ソフトウェア実装に関して、仮に９種類のイントラ予測モードの夫々について専用の変換基底を用いて直交変換及び逆直交変換を行うとすれば、９個の２Ｄ直交変換行列または１８（＝９×２）個の１Ｄ直交変換行列をメモリに保持しておき、これら変換行列を予測モード毎に呼び出して汎用乗算機を用いて直交変換を実現することが想定される。故に、変換行列を保存するためのメモリサイズの増加によるコスト増を招いたり、変換の度に変換行列をメモリにロードすることによるメモリバンド幅の増加に繋がったりするおそれがある。 Further, regarding software implementation, if orthogonal transform and inverse orthogonal transform are performed using dedicated transform bases for each of the nine types of intra prediction modes, nine 2D orthogonal transform matrices or 18 (= 9 × 2) It is assumed that 1D orthogonal transformation matrices are stored in a memory, and these transformation matrices are called for each prediction mode to implement orthogonal transformation using a general-purpose multiplier. Therefore, there is a possibility that the cost increases due to an increase in the memory size for storing the transformation matrix, or that the memory bandwidth is increased by loading the transformation matrix into the memory for each transformation.

一方、本実施形態に係る直交変換部及び逆直交変換部は、図２及び図３に示す通り、２個の１Ｄ直交変換行列を利用した垂直変換及び水平変換を組み合わせることにより４種類の２次元の直交変換を実行する。故に、本実施形態に係る直交変換部及び逆直交変換部によれば、ソフトウェア実装におけるメモリサイズの増加を大幅に抑制できる。 On the other hand, the orthogonal transform unit and the inverse orthogonal transform unit according to the present embodiment combine four types of two-dimensional by combining vertical transform and horizontal transform using two 1D orthogonal transform matrices as shown in FIGS. The orthogonal transformation of is performed. Therefore, according to the orthogonal transformation part and the inverse orthogonal transformation part which concern on this embodiment, the increase in the memory size in software mounting can be suppressed significantly.

また、本実施形態において説明したように予測モード毎に個別のスキャン順を用意することは、符号化効率の向上に寄与する。量子化変換係数１２１は要素毎に非零係数の発生傾向が偏る性質を持つ。係る非零係数の発生傾向は、イントラ予測の予測方向毎に異なる。更に、予測方向が同一であれば、異なる入力画像１１８の画素ブロックを符号化した場合にも、非零係数の発生傾向は類似する。故に、係数順制御部１１３は、量子化係数１２１のうち非零係数の発生確率が高い要素から順に１次元の量子化変換係数列１２２に変換することによって、量子化変換係数列１２２において零係数が高確率で密集する。即ち、エントロピー符号化部１１４におけるランレングス符号化による発生符号量を削減できる。係数順制御部１１３は、図５Ａ及び図５Ｂに関して説明した通り、予測モード毎に予め学習されたスキャン順を固定的に利用してもよいし、符号化処理中に動的にスキャン順を更新して利用してもよい。予測モード毎に最適化されたスキャン順を利用すれば、例えばＨ．２６４と比較して演算量の大幅な増加を引き起こすことなく、量子化変換係数列１２２に基づく発生符号量を削減できる。 Also, as described in the present embodiment, preparing an individual scan order for each prediction mode contributes to an improvement in coding efficiency. The quantized transform coefficient 121 has a property that the generation tendency of non-zero coefficients is biased for each element. The occurrence tendency of such non-zero coefficients differs for each prediction direction of intra prediction. Furthermore, if the prediction directions are the same, the non-zero coefficient generation tendency is similar even when pixel blocks of different input images 118 are encoded. Therefore, the coefficient order control unit 113 converts the zero coefficient in the quantized transform coefficient sequence 122 by transforming the quantized transform coefficient sequence 122 into the one-dimensional quantized transform coefficient sequence 122 in order from the element having the highest non-zero coefficient occurrence probability. Are dense with high probability. That is, it is possible to reduce the amount of generated code by run length encoding in the entropy encoding unit 114. As described with reference to FIGS. 5A and 5B, the coefficient order control unit 113 may use the scan order learned in advance for each prediction mode, or dynamically update the scan order during the encoding process. You may use it. If the scan order optimized for each prediction mode is used, for example, H.264 is used. Compared with H.264, the amount of generated code based on the quantized transform coefficient sequence 122 can be reduced without causing a significant increase in the amount of computation.

（第２の実施形態）
第２の実施形態に係る画像符号化装置は、前述の第１の実施形態に係る画像符号化装置と直交変換及び逆直交変換の詳細において異なる。以降の説明では、本実施形態において第１の実施形態と同一部分には同一符号を付して示し、異なる部分を中心に説明する。本実施形態に係る画像符号化装置に対応する画像復号化装置は、第５の実施形態において説明する。 (Second Embodiment)
The image encoding device according to the second embodiment differs from the image encoding device according to the first embodiment described above in the details of orthogonal transform and inverse orthogonal transform. In the following description, the same parts as those in the first embodiment are denoted by the same reference numerals in the present embodiment, and different parts will be mainly described. An image decoding apparatus corresponding to the image encoding apparatus according to the present embodiment will be described in a fifth embodiment.

本実施形態に係る画像符号化装置は、図２に例示した直交変換部１０２の代わりに、図１６に例示する直交変換部１０２を含む。図１６の直交変換部１０２は、選択スイッチ８０１、垂直変換部８０２、転置部２０３、選択スイッチ８０４及び水平変換部８０５を有する。垂直変換部８０２は、１Ｄ直交変換部Ｃ８０６、１Ｄ直交変換部Ｄ８０７及び１Ｄ直交変換部Ｅ８０８を含む。水平変換部８０５は、１Ｄ直交変換部Ｃ８０９、１Ｄ直交変換部Ｄ８１０及び１Ｄ直交変換部Ｅ８１１を含む。尚、垂直変換部８０２及び水平変換部８０５の順序は、一例であり、これらは逆順であっても構わない。 The image encoding apparatus according to the present embodiment includes an orthogonal transform unit 102 illustrated in FIG. 16 instead of the orthogonal transform unit 102 illustrated in FIG. 16 includes a selection switch 801, a vertical conversion unit 802, a transposition unit 203, a selection switch 804, and a horizontal conversion unit 805. The vertical transform unit 802 includes a 1D orthogonal transform unit C806, a 1D orthogonal transform unit D807, and a 1D orthogonal transform unit E808. The horizontal transform unit 805 includes a 1D orthogonal transform unit C809, a 1D orthogonal transform unit D810, and a 1D orthogonal transform unit E811. Note that the order of the vertical conversion unit 802 and the horizontal conversion unit 805 is an example, and these may be reversed.

１Ｄ直交変換部Ｃ８０６及び１Ｄ直交変換部Ｃ８０９は、入力される行列に対して１Ｄ変換行列Ｃを乗算する点で共通の機能を持つ。１Ｄ直交変換部Ｄ８０７及び１Ｄ直交変換部Ｄ８１０は、入力される行列に対して１Ｄ変換行列Ｄを乗算する点で共通の機能を持つ。１Ｄ直交変換部Ｅ８０８及び１Ｄ直交変換部Ｅ８１１は、入力される行列に対して１Ｄ変換行列Ｅを乗算する点で共通の機能を持つ。 The 1D orthogonal transform unit C806 and the 1D orthogonal transform unit C809 have a common function in that the input matrix is multiplied by the 1D transform matrix C. The 1D orthogonal transform unit D807 and the 1D orthogonal transform unit D810 have a common function in that the input matrix is multiplied by the 1D transform matrix D. The 1D orthogonal transform unit E808 and the 1D orthogonal transform unit E811 have a common function in that the input matrix is multiplied by the 1D transform matrix E.

以下、本実施形態に係る１Ｄ変換行列Ｃ、１Ｄ変換行列Ｄ及び１Ｄ変換行列Ｅについて説明する。
前述のように、予測誤差１１９は参照画素からの距離が大きくなるにつれて絶対値が大きくなる傾向を持つ。係る傾向は予測方向に関わらず同様であるが、ＤＣ予測モードの予測画素１１９は垂直方向及び水平方向のいずれにも係る傾向を示すとはいえない。本実施形態では、ＤＣ予測モードに関して後述する１Ｄ変換行列Ｅを利用する。一方、ＤＣ予測モード以外の予測モードについては、前述の第１の実施形態と同様に上記傾向の有無に応じて夫々１Ｄ変換行列Ｃ及び１Ｄ変換行列Ｄを適応的に利用する。 Hereinafter, the 1D conversion matrix C, the 1D conversion matrix D, and the 1D conversion matrix E according to the present embodiment will be described.
As described above, the prediction error 119 tends to increase in absolute value as the distance from the reference pixel increases. Although the tendency is the same regardless of the prediction direction, it cannot be said that the prediction pixel 119 in the DC prediction mode shows a tendency related to either the vertical direction or the horizontal direction. In the present embodiment, a 1D conversion matrix E described later with respect to the DC prediction mode is used. On the other hand, for prediction modes other than the DC prediction mode, the 1D conversion matrix C and the 1D conversion matrix D are adaptively used according to the presence or absence of the above-described tendency, as in the first embodiment.

具体的には、１Ｄ変換行列Ｃは、前述の１Ｄ変換行列Ａと同じ設計手法によって生成することができる。また、１Ｄ変換行列Ｄは、前述の１Ｄ変換行列Ｂと類似の設計手法によって生成することができる。即ち、１Ｄ変換行列Ｄは、ＤＣ予測モードを除外したうえで、前述の１Ｄ変換行列Ｂの設計手法を実施すれば生成できる。 Specifically, the 1D conversion matrix C can be generated by the same design method as the 1D conversion matrix A described above. The 1D conversion matrix D can be generated by a design method similar to the 1D conversion matrix B described above. That is, the 1D transformation matrix D can be generated by performing the above-described design method for the 1D transformation matrix B after excluding the DC prediction mode.

１Ｄ変換行列Ｅは、ＤＣＴのための行列であってもよい。或いは、１Ｄ変換行列Ｅは、１Ｄ変換行列Ｄに比べて、ＤＣ予測モードの予測誤差１１９に対して垂直方向及び水平方向で１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成されてもよい。 The 1D transformation matrix E may be a matrix for DCT. Alternatively, the 1D transform matrix E has higher coefficient density when performing 1D orthogonal transform in the vertical and horizontal directions with respect to the prediction error 119 in the DC prediction mode, compared to the 1D transform matrix D (ie, quantization). It may be generated by designing a common transformation base in advance so that the ratio of non-zero coefficients in the transformation coefficient 121 becomes smaller.

本実施形態に係る画像符号化装置は、図３に例示した逆直交変換部１０５の代わりに、図１７に例示する逆直交変換部１０５を含む。図１７の逆直交変換部１０５は、選択スイッチ９０１、垂直逆変換部９０２、転置部３０３、選択スイッチ９０４及び水平逆変換部９０５を有する。垂直逆変換部９０２は、１Ｄ逆直交変換部Ｃ９０６、１Ｄ逆直交変換部Ｄ９０７及び１Ｄ逆直交変換部Ｅ９０８を含む。水平逆変換部９０５は、１Ｄ逆直交変換部Ｃ９０９、１Ｄ逆直交変換部Ｄ９１０及び１Ｄ逆直交変換部Ｅ９１１を含む。尚、垂直逆変換部９０２及び水平逆変換部９０５の順序は、一例であり、これらは逆順であっても構わない。 The image encoding apparatus according to the present embodiment includes an inverse orthogonal transform unit 105 illustrated in FIG. 17 instead of the inverse orthogonal transform unit 105 illustrated in FIG. The inverse orthogonal transform unit 105 in FIG. 17 includes a selection switch 901, a vertical inverse transform unit 902, a transposition unit 303, a selection switch 904, and a horizontal inverse transform unit 905. The vertical inverse transform unit 902 includes a 1D inverse orthogonal transform unit C906, a 1D inverse orthogonal transform unit D907, and a 1D inverse orthogonal transform unit E908. The horizontal inverse transform unit 905 includes a 1D inverse orthogonal transform unit C909, a 1D inverse orthogonal transform unit D910, and a 1D inverse orthogonal transform unit E911. Note that the order of the vertical inverse transform unit 902 and the horizontal inverse transform unit 905 is an example, and these may be reversed.

１Ｄ逆直交変換部Ｃ９０６及び１Ｄ逆直交変換部Ｃ９０９は、入力される行列に対して１Ｄ変換行列Ｃの転置行列を乗算する点で共通の機能を持つ。１Ｄ逆直交変換部Ｄ９０７及び１Ｄ逆直交変換部Ｄ９１０は、入力される行列に対して１Ｄ変換行列Ｄの転置行列を乗算する点で共通の機能を持つ。１Ｄ逆直交変換部Ｅ９０８及び１Ｄ逆直交変換部Ｅ９１１は、入力される行列に対して１Ｄ変換行列Ｅの転置行列を乗算する点で共通の機能を持つ。 The 1D inverse orthogonal transform unit C906 and the 1D inverse orthogonal transform unit C909 have a common function in that an input matrix is multiplied by a transposed matrix of the 1D transform matrix C. The 1D inverse orthogonal transform unit D907 and the 1D inverse orthogonal transform unit D910 have a common function in that the input matrix is multiplied by the transposed matrix of the 1D transform matrix D. The 1D inverse orthogonal transform unit E908 and the 1D inverse orthogonal transform unit E911 have a common function in that the input matrix is multiplied by the transposed matrix of the 1D transform matrix E.

以下、１Ｄ変換行列セット部１１２が生成する、本実施形態に係る１Ｄ変換行列セット情報１２９の詳細を説明する。
１Ｄ変換行列セット情報１２９は、垂直直交変換及び垂直逆直交変換のために使用される変換行列を選択するための垂直変換インデックスと、水平直交変換及び水平逆直交変換のために使用される変換行列を選択するための水平変換インデックスとを直接的または間接的に示す。例えば、１Ｄ変換行列セット情報１２９は、図１８Ｄに示す変換インデックス（TransformIdx）で表現することができる。図１８Ｄのテーブルを参照すれば、変換インデックスから垂直変換インデックス（Vertical Transform Idx）及び水平変換インデックス（Horizontal Transform Idx）を導出できる。 Hereinafter, details of the 1D conversion matrix set information 129 according to the present embodiment generated by the 1D conversion matrix set unit 112 will be described.
The 1D transformation matrix set information 129 includes a vertical transformation index for selecting a transformation matrix used for vertical orthogonal transformation and vertical inverse orthogonal transformation, and a transformation matrix used for horizontal orthogonal transformation and horizontal inverse orthogonal transformation. The horizontal transformation index for selecting is directly or indirectly indicated. For example, the 1D transformation matrix set information 129 can be expressed by a transformation index (TransformIdx) illustrated in FIG. 18D. With reference to the table in FIG. 18D, a vertical transformation index (Vertical Transform Idx) and a horizontal transformation index (Horizontal Transform Idx) can be derived from the transformation index.

図１８Ｂに示すように、垂直変換インデックスが「０」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｃ（1D_Transform_Matrix_C）またはその転置行列が選択される。一方、垂直変換インデックスが「１」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｄ（1D_Transform_Matrix_D）またはその転置行列が選択される。更に、垂直変換インデックスが「２」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｅ（1D_transform_Matrix_E）またはその転置行列が選択される。 As shown in FIG. 18B, if the vertical transformation index is “0”, the 1D transformation matrix C (1D_Transform_Matrix_C) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation. On the other hand, if the vertical transformation index is “1”, the aforementioned 1D transformation matrix D (1D_Transform_Matrix_D) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation. Further, if the vertical transformation index is “2”, the 1D transformation matrix E (1D_transform_Matrix_E) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation.

図１８Ｃに示すように、水平変換インデックスが「０」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｃ（1D_Transform_Matrix_C）またはその転置行列が選択される。一方、水平変換インデックスが「１」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｄ（1D_Transform_Matrix_D）またはその転置行列が選択される。更に、水平変換インデックスが「２」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｅ（1D_Transform_Matrix_E）またはその転置行列が選択される。 As shown in FIG. 18C, if the horizontal transformation index is “0”, the 1D transformation matrix C (1D_Transform_Matrix_C) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation. On the other hand, if the horizontal transformation index is “1”, the aforementioned 1D transformation matrix D (1D_Transform_Matrix_D) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation. Furthermore, if the horizontal transformation index is “2”, the 1D transformation matrix E (1D_Transform_Matrix_E) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation.

また、各（イントラ）予測モードのインデックス（IntraNxNPredModeIndex）と、その名称（Name of IntraNxNPredMode）と、対応する垂直変換インデックス及び水平変換インデックスを図１８Ａに例示する。尚、図１８Ａにおいて、「NxN」は予測対象ブロックのサイズを表している（Ｎ＝４，８，１６など）。予測対象ブロックのサイズは、「MxN」（即ち、正方形以外の矩形）に拡張することもできる。
ここで、図１８Ａと図１８Ｄを統合した、各予測モードのインデックスとその名称と、対応する変換インデックスを図１８Ｅに例示する。 Further, FIG. 18A illustrates an index (IntraNxNPredModeIndex) of each (intra) prediction mode, its name (Name of IntraNxNPredMode), and a corresponding vertical conversion index and horizontal conversion index. In FIG. 18A, “NxN” represents the size of the prediction target block (N = 4, 8, 16, etc.). The size of the prediction target block can be expanded to “MxN” (that is, a rectangle other than a square).
Here, FIG. 18E illustrates an index of each prediction mode, its name, and a corresponding conversion index obtained by integrating FIGS. 18A and 18D.

１Ｄ変換行列セット部１１２は、予測情報１２６に含まれる予測モード情報から予測モードのインデックスを検出し、対応する１Ｄ変換行列セット情報１２９を生成する。尚、図１８Ａ、図１８Ｂ、図１８Ｃ、図１８Ｄ及び図１８Ｅに示す各種テーブルは一例であり、１Ｄ変換行列セット部１１２はこれらのテーブルの一部または全部を使用することなく１Ｄ変換行列セット情報１２９を生成してよい。 The 1D transformation matrix set unit 112 detects a prediction mode index from the prediction mode information included in the prediction information 126, and generates corresponding 1D transformation matrix set information 129. Note that the various tables shown in FIGS. 18A, 18B, 18C, 18D, and 18E are examples, and the 1D conversion matrix set unit 112 uses the 1D conversion matrix set information without using some or all of these tables. 129 may be generated.

例えば、ＴｒａｓｎｆｏｒｍＩｄｘが０を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｃを、水平直交変換には１Ｄ変換行列Ｃを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｃの転置行列を、水平逆直交変換には１Ｄ変換行列Ｃの転置行列を使用することを意味する。 For example, when the TransformIdx indicates 0, it means that the Vertical Transform index indicates 0 and the Horizontal Transform index indicates 0. That is, it means that the 1D transformation matrix C is used for the vertical orthogonal transformation and the 1D transformation matrix C is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix C is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix C is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが１を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｃを、水平直交変換には１Ｄ変換行列Ｄを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｃの転置行列を、水平逆直交変換には１Ｄ変換行列Ｄの転置行列を使用することを意味する。 When the TransformIdx indicates 1, it means that the Vertical Transform index indicates 0 and the Horizontal Transform index indicates 1. That is, it means that the 1D transformation matrix C is used for the vertical orthogonal transformation, and the 1D transformation matrix D is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix C is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix D is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが２を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｄを、水平直交変換には１Ｄ変換行列Ｃを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｄの転置行列を、水平逆直交変換には１Ｄ変換行列Ｃを使用することを意味する。 When the TransformIdx indicates 2, it means that the Vertical Transform index indicates 1, and the Horizontal Transform index indicates 0. That is, it means that the 1D transformation matrix D is used for the vertical orthogonal transformation, and the 1D transformation matrix C is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of the 1D transformation matrix D is used for the vertical inverse orthogonal transformation, and a 1D transformation matrix C is used for the horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが３を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが２をＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが２を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｅを、水平直交変換には１Ｄ変換行列Ｅを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｅの転置行列を、水平逆直交変換には１Ｄ変換行列Ｅの転置行列を使用することを意味する。 When TransformIdx indicates 3, it means that Vertical Transform index indicates 2 and Horizontal Transform index indicates 2. That is, it means that the 1D transformation matrix E is used for the vertical orthogonal transformation and the 1D transformation matrix E is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix E is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix E is used for horizontal inverse orthogonal transformation.

ここで、予測対象ブロックがＭ×Ｎで表現される矩形ブロックである場合、直交変換を行うブロックサイズもまたＭ×Ｎであってもよい。 Here, when the prediction target block is a rectangular block represented by M × N, the block size for performing orthogonal transform may also be M × N.

図１８Ａに示すテーブルは、前述の各イントラ予測モードの傾向を考慮して１Ｄ変換行列セット情報１２９を割り当てている。即ち、ＤＣ予測モードには、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘ及びＨｏｒｉｚｏｎｔａｌＴｒａｎｓｏｆｒｍｉｎｄｅｘに共に２を割り当てている。故に、ＤＣ予測モードについて前述の１Ｄ変換行列Ｅまたはその転置行列を用いて垂直方向及び水平方向の直交変換または逆直交変換が行われ、高い変換効率が達成される。 In the table shown in FIG. 18A, 1D transformation matrix set information 129 is assigned in consideration of the tendency of each intra prediction mode described above. That is, in the DC prediction mode, 2 is assigned to both the Vertical Transform index and the Horizontal Transform index. Therefore, in the DC prediction mode, orthogonal transformation or inverse orthogonal transformation in the vertical direction and the horizontal direction is performed using the above-described 1D transformation matrix E or its transposed matrix, and high transformation efficiency is achieved.

ＤＣ予測モードを除く予測モードに関して、予測誤差の垂直方向に上記傾向を示すならばＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘに０が、水平方向に上記傾向を示すならばＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘに０が割り当られている。一方、上記傾向を示さない方向には、夫々１が割り当られている。ＤＣ予測モードを除く予測モードに関して、各予測モードの垂直方向及び水平方向を上記傾向の有無に従って２つのクラスに分類し、垂直方向及び水平方向の夫々について適応的に１Ｄ変換行列Ｃまたは１Ｄ変換行列Ｄを適用することにより、高い変換効率が達成される。 Regarding prediction modes other than the DC prediction mode, 0 is assigned to the Vertical Transform index if the tendency is shown in the vertical direction of the prediction error, and 0 is assigned to the Horizontal Transform index if the tendency is shown in the horizontal direction. On the other hand, 1 is assigned to each direction in which the above tendency is not exhibited. Regarding prediction modes other than the DC prediction mode, the vertical direction and the horizontal direction of each prediction mode are classified into two classes according to the presence or absence of the above-described tendency, and the 1D conversion matrix C or 1D conversion matrix is adaptively applied to each of the vertical direction and the horizontal direction. By applying D, high conversion efficiency is achieved.

以上説明したように、本実施形態に係る画像符号化装置は、第１の実施形態と同様に参照画素からの距離が大きくなるにつれて予測精度が低下するというイントラ予測の傾向を利用しつつ、ＤＣ予測を区別して直交変換及び逆直交変換を適用する。この画像符号化装置は、各予測モードの垂直方向及び水平方向を上記傾向の有無に従って２つのクラスに分類し、垂直方向及び水平方向の夫々について適応的に１Ｄ変換行列Ｃまたは１Ｄ変換行列Ｄを適用する。この画像符号化装置は、ＤＣ予測モードには１Ｄ変換行列Ｅを適用する。１Ｄ変換行列Ｃは、参照画素群のラインに直交する方向（垂直方向または水平方向）について１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成される。１Ｄ変換行列Ｄは、ＤＣ予測モードを除外したうえで、このような性質を持たない汎用的な変換行列を設計することによって生成される。１Ｄ変換行列Ｅは、ＤＣＴのための行列であってもよい。或いは、１Ｄ変換行列Ｅは、ＤＣ予測モードの予測誤差１１９に対して垂直方向及び水平方向で１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成される。故に、本実施形態に係る画像符号化装置によれば、各予測モードに一律にＤＣＴなどの固定的な直交変換を施す場合に比べて、高い変換効率が達成される。 As described above, the image coding apparatus according to this embodiment uses the tendency of intra prediction that the prediction accuracy decreases as the distance from the reference pixel increases as in the first embodiment, Differentiate predictions and apply orthogonal and inverse orthogonal transforms. This image encoding apparatus classifies the vertical direction and horizontal direction of each prediction mode into two classes according to the presence or absence of the above-described tendency, and adaptively sets the 1D conversion matrix C or 1D conversion matrix D for each of the vertical direction and the horizontal direction. Apply. This image encoding apparatus applies the 1D transformation matrix E to the DC prediction mode. The 1D transform matrix C has higher coefficient density when performing 1D orthogonal transform in the direction (vertical direction or horizontal direction) orthogonal to the line of the reference pixel group (that is, the ratio of non-zero coefficients in the quantized transform coefficient 121) Is generated in advance by designing a common transformation base so that The 1D transformation matrix D is generated by designing a general-purpose transformation matrix that does not have such a property after excluding the DC prediction mode. The 1D transformation matrix E may be a matrix for DCT. Alternatively, the 1D transform matrix E has a high coefficient density when performing 1D orthogonal transform in the vertical and horizontal directions with respect to the prediction error 119 in the DC prediction mode (that is, the non-zero coefficient of the quantized transform coefficient 121). It is generated by pre-designing a common transformation basis so that the ratio becomes smaller. Therefore, according to the image coding apparatus according to the present embodiment, high conversion efficiency is achieved as compared with a case where fixed orthogonal transform such as DCT is uniformly applied to each prediction mode.

（第３の実施形態）
第３の実施形態に係る画像符号化装置は、前述の第１の実施形態及び第２の実施形態に係る画像符号化装置と直交変換及び逆直交変換の詳細において異なる。以降の説明では、本実施形態において第１の実施形態または第２の実施形態と同一部分には同一符号を付して示し、異なる部分を中心に説明する。本実施形態に係る画像符号化装置に対応する画像復号化装置は、第６の実施形態において説明する。 (Third embodiment)
The image encoding device according to the third embodiment differs from the image encoding devices according to the first and second embodiments described above in details of orthogonal transform and inverse orthogonal transform. In the following description, the same reference numerals are given to the same parts as those in the first embodiment or the second embodiment in the present embodiment, and different parts will be mainly described. An image decoding apparatus corresponding to the image encoding apparatus according to the present embodiment will be described in a sixth embodiment.

本実施形態に係る画像符号化装置は、図２に例示した直交変換部１０２の代わりに、図１９に例示する直交変換部１０２を含む。図１９の直交変換部１０２は、選択スイッチ１２０１、垂直変換部１２０２、転置部２０３、選択スイッチ１２０４及び水平変換部１２０５を有する。垂直変換部１２０２は、１Ｄ直交変換部Ｆ１２０６、１Ｄ直交変換部Ｇ１２０７及び１Ｄ直交変換部Ｈ１２０８を含む。水平変換部１２０５は、１Ｄ直交変換部Ｆ１２０９、１Ｄ直交変換部Ｇ１２１０及び１Ｄ直交変換部Ｈ１２１１を含む。尚、垂直変換部１２０２及び水平変換部１２０５の順序は、一例であり、これらは逆順であっても構わない。 The image encoding apparatus according to the present embodiment includes an orthogonal transform unit 102 illustrated in FIG. 19 instead of the orthogonal transform unit 102 illustrated in FIG. 19 includes a selection switch 1201, a vertical conversion unit 1202, a transposition unit 203, a selection switch 1204, and a horizontal conversion unit 1205. The vertical transform unit 1202 includes a 1D orthogonal transform unit F1206, a 1D orthogonal transform unit G1207, and a 1D orthogonal transform unit H1208. The horizontal transformation unit 1205 includes a 1D orthogonal transformation unit F1209, a 1D orthogonal transformation unit G1210, and a 1D orthogonal transformation unit H1211. Note that the order of the vertical conversion unit 1202 and the horizontal conversion unit 1205 is an example, and these may be reversed.

１Ｄ直交変換部Ｆ１２０６及び１Ｄ直交変換部Ｆ１２０９は、入力される行列に対して１Ｄ変換行列Ｆを乗算する点で共通の機能を持つ。１Ｄ直交変換部Ｇ１２０７及び１Ｄ直交変換部Ｇ１２１０は、入力される行列に対して１Ｄ変換行列Ｇを乗算する点で共通の機能を持つ。１Ｄ直交変換部Ｈ１２０８及び１Ｄ直交変換部Ｈ１２１１は、入力される行列に対して１Ｄ変換行列Ｈを乗算する点で共通の機能を持つ。 The 1D orthogonal transform unit F1206 and the 1D orthogonal transform unit F1209 have a common function in that the input matrix is multiplied by the 1D transform matrix F. The 1D orthogonal transform unit G1207 and the 1D orthogonal transform unit G1210 have a common function in that the input matrix is multiplied by the 1D transform matrix G. The 1D orthogonal transform unit H1208 and the 1D orthogonal transform unit H1211 have a common function in that the input matrix is multiplied by the 1D transform matrix H.

以下、本実施形態に係る１Ｄ変換行列Ｆ、１Ｄ変換行列Ｇ及び１Ｄ変換行列Ｈについて説明する。
前述のように、予測誤差１１９は参照画素からの距離が大きくなるにつれて絶対値が大きくなる傾向を持つ。係る傾向は予測方向に関わらず同様であるが、イントラ予測モードには予測対象ブロックの左隣接ライン上の参照画素群のみまたは上隣接ライン上の参照画素群のみを参照（参照画素値のコピーまたは参照画素値からの補間）する予測モードもあれば、予測対象ブロックの左隣接ライン及び上隣接ライン上の参照画素群を参照する予測モードもある。１ライン上の参照画素群のみを参照する予測モードと、２ライン上の参照画素群を参照する予測モードとでは、上記傾向の現れ方に差が生じるといえる。従って、本実施形態では、１ライン上の参照画素群のみを参照する予測モードと、２ライン上の参照画素群を参照する予測モードとを区別して直交変換及び逆直交変換を行う。具体的には、２ライン上の参照画素群を参照する予測モードについては、後述する１Ｄ変換行列Ｈを利用する。一方、１ライン上の参照画素群のみを参照する予測モードについては、前述の第１の実施形態と同様に上記傾向の有無に応じて夫々１Ｄ変換行列Ｆ及び１Ｄ変換行列Ｇを適応的に利用する。 Hereinafter, the 1D conversion matrix F, the 1D conversion matrix G, and the 1D conversion matrix H according to the present embodiment will be described.
As described above, the prediction error 119 tends to increase in absolute value as the distance from the reference pixel increases. This tendency is the same regardless of the prediction direction, but in the intra prediction mode, only the reference pixel group on the left adjacent line of the prediction target block or only the reference pixel group on the upper adjacent line is referenced (a copy of the reference pixel value or There is a prediction mode in which interpolation is performed from a reference pixel value), and there is also a prediction mode in which reference pixel groups on the left adjacent line and the upper adjacent line of the prediction target block are referred to. It can be said that there is a difference in the appearance of the above-described tendency between the prediction mode that refers only to the reference pixel group on one line and the prediction mode that refers to the reference pixel group on two lines. Therefore, in this embodiment, orthogonal transformation and inverse orthogonal transformation are performed by distinguishing between a prediction mode that refers only to a reference pixel group on one line and a prediction mode that refers to a reference pixel group on two lines. Specifically, for a prediction mode that refers to a reference pixel group on two lines, a 1D conversion matrix H described later is used. On the other hand, for the prediction mode in which only the reference pixel group on one line is referred to, the 1D conversion matrix F and the 1D conversion matrix G are adaptively used according to the presence or absence of the above-described tendency, as in the first embodiment. To do.

具体的には、１Ｄ変換行列Ｆは、前述の１Ｄ変換行列Ａと類似の設計手法によって生成することができる。即ち、１Ｄ変換行列Ｆは、２ライン上の参照画素群を参照する予測モード（例えば、図７Ａのモード４、モード５及びモード６）を除外したうえで、前述の１Ｄ変換行列Ａの設計手法を実施すれば生成できる。また、１Ｄ変換行列Ｇは、前述の１Ｄ変換行列Ｂと同一の設計手法によって生成することができる。或いは、１Ｄ変換行列Ｇは、ＤＣＴのための行列であってよい。 Specifically, the 1D conversion matrix F can be generated by a design technique similar to the 1D conversion matrix A described above. That is, the 1D transformation matrix F excludes a prediction mode (for example, mode 4, mode 5 and mode 6 in FIG. 7A) that refers to a reference pixel group on two lines, and then designs the 1D transformation matrix A described above. Can be generated. Further, the 1D conversion matrix G can be generated by the same design method as the 1D conversion matrix B described above. Alternatively, the 1D transformation matrix G may be a matrix for DCT.

１Ｄ変換行列Ｈは、２ライン上の参照画素群を参照する予測モードの予測誤差１１９に対して垂直方向及び水平方向で１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成することができる。 The 1D transform matrix H has high coefficient density when performing 1D orthogonal transform in the vertical direction and the horizontal direction on the prediction error 119 of the prediction mode that refers to the reference pixel group on two lines (that is, quantization transform). The common conversion base can be generated in advance so that the ratio of the non-zero coefficient in the coefficient 121 is reduced).

本実施形態に係る画像符号化装置は、図３に例示した逆直交変換部１０５の代わりに、図２０に例示する逆直交変換部１０５を含む。図２０の逆直交変換部１０５は、選択スイッチ１３０１、垂直逆変換部１３０２、転置部３０３、選択スイッチ１３０４及び水平逆変換部１３０５を有する。垂直逆変換部１３０２は、１Ｄ逆直交変換部Ｆ１３０６、１Ｄ逆直交変換部Ｇ１３０７及び１Ｄ逆直交変換部Ｈ１３０８を含む。水平逆変換部１３０５は、１Ｄ逆直交変換部Ｆ１３０９、１Ｄ逆直交変換部Ｇ１３１０及び１Ｄ逆直交変換部Ｈ１３１１を含む。尚、垂直逆変換部１３０２及び水平逆変換部１３０５の順序は、一例であり、これらは逆順であっても構わない。 The image encoding apparatus according to the present embodiment includes an inverse orthogonal transform unit 105 illustrated in FIG. 20 instead of the inverse orthogonal transform unit 105 illustrated in FIG. The inverse orthogonal transform unit 105 in FIG. 20 includes a selection switch 1301, a vertical inverse transform unit 1302, a transposition unit 303, a selection switch 1304, and a horizontal inverse transform unit 1305. The vertical inverse transform unit 1302 includes a 1D inverse orthogonal transform unit F1306, a 1D inverse orthogonal transform unit G1307, and a 1D inverse orthogonal transform unit H1308. The horizontal inverse transform unit 1305 includes a 1D inverse orthogonal transform unit F1309, a 1D inverse orthogonal transform unit G1310, and a 1D inverse orthogonal transform unit H1311. Note that the order of the vertical inverse transform unit 1302 and the horizontal inverse transform unit 1305 is an example, and these may be reversed.

１Ｄ逆直交変換部Ｆ１３０６及び１Ｄ逆直交変換部Ｆ１３０９は、入力される行列に対して１Ｄ変換行列Ｆの転置行列を乗算する点で共通の機能を持つ。１Ｄ逆直交変換部Ｇ１３０７及び１Ｄ逆直交変換部Ｇ１３１０は、入力される行列に対して１Ｄ変換行列Ｇの転置行列を乗算する点で共通の機能を持つ。１Ｄ逆直交変換部Ｈ１３０８及び１Ｄ逆直交変換部Ｈ１３１１は、入力される行列に対して１Ｄ変換行列Ｈの転置行列を乗算する点で共通の機能を持つ。 The 1D inverse orthogonal transform unit F1306 and the 1D inverse orthogonal transform unit F1309 have a common function in that an input matrix is multiplied by a transposed matrix of the 1D transform matrix F. The 1D inverse orthogonal transform unit G1307 and the 1D inverse orthogonal transform unit G1310 have a common function in that the input matrix is multiplied by the transposed matrix of the 1D transform matrix G. The 1D inverse orthogonal transform unit H1308 and the 1D inverse orthogonal transform unit H1311 have a common function in that an input matrix is multiplied by a transposed matrix of the 1D transform matrix H.

以下、１Ｄ変換行列セット部１１２が生成する、本実施形態に係る１Ｄ変換行列セット情報１２９の詳細を説明する。
１Ｄ変換行列セット情報１２９は、垂直直交変換及び垂直逆直交変換のために使用される変換行列を選択するための垂直変換インデックスと、水平直交変換及び水平逆直交変換のために使用される変換行列を選択するための水平変換インデックスとを直接的または間接的に示す。例えば、１Ｄ変換行列セット情報１２９は、図２１Ｄに示す変換インデックス（TransformIdx）で表現することができる。図２１Ｄのテーブルを参照すれば、変換インデックスから垂直変換インデックス（Vertical Transform Idx）及び水平変換インデックス（Horizontal Transform Idx）を導出できる。 Hereinafter, details of the 1D conversion matrix set information 129 according to the present embodiment generated by the 1D conversion matrix set unit 112 will be described.
The 1D transformation matrix set information 129 includes a vertical transformation index for selecting a transformation matrix used for vertical orthogonal transformation and vertical inverse orthogonal transformation, and a transformation matrix used for horizontal orthogonal transformation and horizontal inverse orthogonal transformation. The horizontal transformation index for selecting is directly or indirectly indicated. For example, the 1D transformation matrix set information 129 can be expressed by a transformation index (TransformIdx) illustrated in FIG. 21D. With reference to the table of FIG. 21D, a vertical transformation index (Vertical Transform Idx) and a horizontal transformation index (Horizontal Transform Idx) can be derived from the transformation index.

図２１Ｂに示すように、垂直変換インデックスが「０」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｆ（1D_Transform_Matrix_F）またはその転置行列が選択される。一方、垂直変換インデックスが「１」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｇ（1D_Transform_Matrix_G）またはその転置行列が選択される。更に、垂直変換インデックスが「２」であれば、垂直直交変換または垂直逆直交変換のために前述の１Ｄ変換行列Ｈ（1D_transform_Matrix_H）またはその転置行列が選択される。 As shown in FIG. 21B, if the vertical transform index is “0”, the 1D transform matrix F (1D_Transform_Matrix_F) or its transposed matrix is selected for the vertical orthogonal transform or the vertical inverse orthogonal transform. On the other hand, if the vertical transformation index is “1”, the aforementioned 1D transformation matrix G (1D_Transform_Matrix_G) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation. Furthermore, if the vertical transformation index is “2”, the 1D transformation matrix H (1D_transform_Matrix_H) or its transposed matrix is selected for vertical orthogonal transformation or vertical inverse orthogonal transformation.

図２１Ｃに示すように、水平変換インデックスが「０」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｆ（1D_Transform_Matrix_F）またはその転置行列が選択される。一方、水平変換インデックスが「１」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｇ（1D_Transform_Matrix_G）またはその転置行列が選択される。更に、水平変換インデックスが「２」であれば、水平直交変換または水平逆直交変換のために前述の１Ｄ変換行列Ｈ（1D_Transform_Matrix_H）またはその転置行列が選択される。 As shown in FIG. 21C, if the horizontal transformation index is “0”, the 1D transformation matrix F (1D_Transform_Matrix_F) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation. On the other hand, if the horizontal transformation index is “1”, the aforementioned 1D transformation matrix G (1D_Transform_Matrix_G) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation. Furthermore, if the horizontal transformation index is “2”, the 1D transformation matrix H (1D_Transform_Matrix_H) or its transposed matrix is selected for horizontal orthogonal transformation or horizontal inverse orthogonal transformation.

また、各（イントラ）予測モードのインデックス（IntraNxNPredModeIndex）と、その名称（Name of IntraNxNPredMode）と、対応する垂直変換インデックス及び水平変換インデックスを図２１Ａに例示する。尚、図２１Ａにおいて、「NxN」は予測対象ブロックのサイズを表している（Ｎ＝４，８，１６など）。予測対象ブロックのサイズは、「MxN」（即ち、正方形以外の矩形）に拡張することもできる。
ここで、図２１Ａと図２１Ｄを統合した、各予測モードのインデックスとその名称と、対応する変換インデックスを図２１Ｅに例示する。 Further, FIG. 21A illustrates an index (IntraNxNPredModeIndex) of each (intra) prediction mode, its name (Name of IntraNxNPredMode), and a corresponding vertical conversion index and horizontal conversion index. In FIG. 21A, “NxN” represents the size of the prediction target block (N = 4, 8, 16, etc.). The size of the prediction target block can be expanded to “MxN” (that is, a rectangle other than a square).
Here, FIG. 21E illustrates an index of each prediction mode, a name thereof, and a corresponding conversion index obtained by integrating FIGS. 21A and 21D.

１Ｄ変換行列セット部１１２は、予測情報１２６に含まれる予測モード情報から予測モードのインデックスを検出し、対応する１Ｄ変換行列セット情報１２９を生成する。尚、図２１Ａ、図２１Ｂ、図２１Ｃ、図２１Ｄ及び図２１Ｅに示す各種テーブルは一例であり、１Ｄ変換行列セット部１１２はこれらのテーブルの一部または全部を使用することなく１Ｄ変換行列セット情報１２９を生成してよい。 The 1D transformation matrix set unit 112 detects a prediction mode index from the prediction mode information included in the prediction information 126, and generates corresponding 1D transformation matrix set information 129. Note that the various tables shown in FIGS. 21A, 21B, 21C, 21D, and 21E are examples, and the 1D conversion matrix set unit 112 uses the 1D conversion matrix set information without using some or all of these tables. 129 may be generated.

例えば、ＴｒａｓｎｆｏｒｍＩｄｘが０を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが２を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが２を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｈを、水平直交変換には１Ｄ変換行列Ｈを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｈの転置行列を、水平逆直交変換には１Ｄ変換行列Ｈの転置行列を使用することを意味する。 For example, when TransformIdx indicates 0, it means that Vertical Transform index indicates 2, and Horizonal Transform index indicates 2. That is, it means that the 1D transformation matrix H is used for the vertical orthogonal transformation and the 1D transformation matrix H is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix H is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix H is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが１を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｆを、水平直交変換には１Ｄ変換行列Ｇを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｆの転置行列を、水平逆直交変換には１Ｄ変換行列Ｇの転置行列を使用することを意味する。 When the TransformIdx indicates 1, it means that the Vertical Transform index indicates 0 and the Horizontal Transform index indicates 1. That is, it means that the 1D transformation matrix F is used for the vertical orthogonal transformation and the 1D transformation matrix G is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of 1D transformation matrix F is used for vertical inverse orthogonal transformation, and a transposed matrix of 1D transformation matrix G is used for horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが２を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが０を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｇを、水平直交変換には１Ｄ変換行列Ｆを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｇの転置行列を、水平逆直交変換には１Ｄ変換行列Ｆを使用することを意味する。 When the TransformIdx indicates 2, it means that the Vertical Transform index indicates 1, and the Horizontal Transform index indicates 0. That is, it means that the 1D transformation matrix G is used for the vertical orthogonal transformation, and the 1D transformation matrix F is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of the 1D transformation matrix G is used for the vertical inverse orthogonal transformation, and a 1D transformation matrix F is used for the horizontal inverse orthogonal transformation.

ＴｒａｓｎｆｏｒｍＩｄｘが３を示す場合、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１をＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘが１を示すことを意味する。つまり、垂直直交変換には１Ｄ変換行列Ｇを、水平直交変換には１Ｄ変換行列Ｇを使用することを意味する。また、垂直逆直交変換には１Ｄ変換行列Ｇの転置行列を、水平逆直交変換には１Ｄ変換行列Ｇの転置行列を使用することを意味する。 When TransformIdx indicates 3, it means that Vertical Transform index indicates 1, and Horizontal Transform index indicates 1. That is, it means that the 1D conversion matrix G is used for the vertical orthogonal transformation, and the 1D conversion matrix G is used for the horizontal orthogonal transformation. Further, it means that a transposed matrix of the 1D transformation matrix G is used for the vertical inverse orthogonal transformation, and a transposed matrix of the 1D transformation matrix G is used for the horizontal inverse orthogonal transformation.

図２１Ａに示すテーブルは、前述の各イントラ予測モードの傾向を考慮して１Ｄ変換行列セット情報１２９を割り当てている。即ち、２ライン上の参照画素群を参照する予測モードには、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘ及びＨｏｒｉｚｏｎｔａｌＴｒａｎｓｏｆｒｍｉｎｄｅｘに共に２を割り当てている。故に、２ライン上の参照画素群を参照する予測モードについて前述の１Ｄ変換行列Ｈまたはその転置行列を用いて垂直方向及び水平方向の直交変換または逆直交変換が行われ、高い変換効率が達成される。 In the table shown in FIG. 21A, 1D transformation matrix set information 129 is assigned in consideration of the tendency of each intra prediction mode described above. That is, 2 is assigned to both the vertical transform index and the horizontal transform index in the prediction mode for referencing the reference pixel group on two lines. Therefore, for the prediction mode that refers to the reference pixel group on two lines, the orthogonal transformation or inverse orthogonal transformation in the vertical direction and the horizontal direction is performed using the 1D transformation matrix H or its transpose matrix, and high transformation efficiency is achieved. The

２ライン上の参照画素群を参照する予測モードを除く予測モードに関して、予測誤差の垂直方向に上記傾向を示すならば、ＶｅｒｔｉｃａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘに０が、水平方向に上記傾向を示すならば、ＨｏｒｉｚｏｎｔａｌＴｒａｎｓｆｏｒｍｉｎｄｅｘに０が割り当てられている。一方、上記傾向を示さない方向には、夫々１が割り当てられている。２ライン上の参照画素群を参照する予測モードを除く予測モードに関して、各予測モードの垂直方向及び水平方向を上記傾向の有無に従って２つのクラスに分類し、垂直方向及び水平方向の夫々について適応的に１Ｄ変換行列Ｆまたは１Ｄ変換行列Ｇを適用することにより、高い変換効率が達成される。 For prediction modes other than the prediction mode that refers to the reference pixel group on two lines, if the tendency is shown in the vertical direction of the prediction error, 0 is in the vertical transform index, and if the tendency is shown in the horizontal direction, the horizonal transform is shown. 0 is assigned to the index. On the other hand, 1 is assigned to each direction that does not show the above tendency. Regarding prediction modes other than the prediction mode that refers to the reference pixel group on two lines, the vertical direction and horizontal direction of each prediction mode are classified into two classes according to the presence or absence of the above-mentioned tendency, and adaptive in each of the vertical direction and the horizontal direction. By applying the 1D conversion matrix F or the 1D conversion matrix G to the above, high conversion efficiency is achieved.

以上説明したように、本実施形態に係る画像符号化装置は、第１の実施形態と同様に参照画素からの距離が大きくなるにつれて予測精度が低下するというイントラ予測の傾向を利用しつつ、各予測モードを参照画素群のライン数によって区別して直交変換及び逆直交変換を適用する。この画像符号化装置は、２ライン上の参照画素群を参照する予測モードを除く予測モードに関して、垂直方向及び水平方向を上記傾向の有無に従って２つのクラスに分類し、垂直方向及び水平方向の夫々について適応的に１Ｄ変換行列Ｆまたは１Ｄ変換行列Ｇを適用する。一方、この画像符号化装置は、２ライン上の参照画素群を参照する各予測モードには１Ｄ変換行列Ｈを適用する。１Ｄ変換行列Ｆは、１ライン上の参照画素群のみを参照する各予測モードに関して、参照画素群のラインに直交する方向（垂直方向または水平方向）について１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成される。一方、１Ｄ変換行列Ｇは、このような性質を持たない汎用的な変換行列を設計することによって生成される。更に、１Ｄ変換行列Ｈは、２ライン上の参照画素群を参照する各予測モードの予測誤差１１９に対して垂直方向及び水平方向で１Ｄ直交変換を行う際の係数集密度が高くなる（即ち、量子化変換係数１２１における非零係数の割合が小さくなる）ように共通の変換基底を予め設計することによって生成される。故に、本実施形態に係る画像符号化装置によれば、各予測モードに一律にＤＣＴなどの固定的な直交変換を施す場合に比べて、高い変換効率が達成される。 As described above, the image encoding apparatus according to the present embodiment uses the tendency of intra prediction that the prediction accuracy decreases as the distance from the reference pixel increases, as in the first embodiment. The prediction mode is distinguished by the number of lines of the reference pixel group, and orthogonal transformation and inverse orthogonal transformation are applied. This image encoding apparatus classifies the vertical direction and the horizontal direction into two classes according to the presence or absence of the above-described tendency with respect to the prediction modes except the prediction mode that refers to the reference pixel group on two lines, and each of the vertical direction and the horizontal direction The 1D transformation matrix F or the 1D transformation matrix G is applied adaptively. On the other hand, this image encoding apparatus applies the 1D transformation matrix H to each prediction mode that refers to reference pixel groups on two lines. The 1D transformation matrix F has a coefficient density when performing 1D orthogonal transformation in a direction (vertical direction or horizontal direction) orthogonal to the line of the reference pixel group with respect to each prediction mode in which only the reference pixel group on one line is referred to. It is generated by designing a common transform base in advance so as to be high (that is, the ratio of non-zero coefficients in the quantized transform coefficient 121 is small). On the other hand, the 1D transformation matrix G is generated by designing a general-purpose transformation matrix having no such property. Further, the 1D transform matrix H has a high coefficient density when performing 1D orthogonal transform in the vertical direction and the horizontal direction with respect to the prediction error 119 of each prediction mode referring to the reference pixel group on two lines (that is, It is generated by designing a common transform base in advance so that the ratio of the non-zero coefficient in the quantized transform coefficient 121 becomes smaller. Therefore, according to the image coding apparatus according to the present embodiment, high conversion efficiency is achieved as compared with a case where fixed orthogonal transform such as DCT is uniformly applied to each prediction mode.

第１乃至第３の実施形態では、２種類または３種類の１Ｄ変換行列を夫々用意し、予測モードに応じて垂直変換（または垂直逆変換）及び水平変換（または水平逆変換）のための１Ｄ変換行列を選択する。しかしながら、前述の２種類または３種類の１Ｄ変換行列は、例示であり、更に多くの変換行列を用意して符号化効率を向上させることも可能である。例えば、第２の実施形態と第３の実施形態とを組み合わせて４種類の１Ｄ変換行列を用意することも可能である。但し、用意する変換行列の種類の増加に伴って更なるハードウェアなどが必要となるので、変換行列の種類の増加に伴うデメリットと符号化効率とのバランスを考慮することが望ましい。 In the first to third embodiments, two or three types of 1D conversion matrices are prepared, and 1D for vertical conversion (or vertical reverse conversion) and horizontal conversion (or horizontal reverse conversion) according to the prediction mode. Select a transformation matrix. However, the above-described two or three types of 1D transformation matrices are examples, and it is possible to prepare more transformation matrices to improve the encoding efficiency. For example, it is possible to prepare four types of 1D conversion matrices by combining the second embodiment and the third embodiment. However, since additional hardware is required as the types of transformation matrices to be prepared increase, it is desirable to consider the balance between the disadvantages associated with the increase in types of transformation matrices and the coding efficiency.

（第４の実施形態）
第４の実施形態は、画像復号化装置に関する。本実施形態に係る画像復号化装置に対応する画像符号化装置は、第１の実施形態において説明した通りである。即ち、本実施形態に係る画像復号化装置は、例えば第１の実施形態に係る画像符号化装置によって生成された符号化データを復号化する。 (Fourth embodiment)
The fourth embodiment relates to an image decoding apparatus. The image coding apparatus corresponding to the image decoding apparatus according to the present embodiment is as described in the first embodiment. That is, the image decoding apparatus according to the present embodiment decodes encoded data generated by the image encoding apparatus according to the first embodiment, for example.

図２２に示すように、本実施形態に係る画像復号化装置は、入力バッファ４０１、エントロピー復号化部４０２、係数順制御部４０３、逆量子化部４０４、逆直交変換部４０５、加算部４０６、参照画像メモリ４０７、イントラ予測部４０８、インター予測部４０９、選択スイッチ４１０、１Ｄ変換行列セット部４１１及び出力バッファ４１２を含む。 As shown in FIG. 22, the image decoding apparatus according to the present embodiment includes an input buffer 401, an entropy decoding unit 402, a coefficient order control unit 403, an inverse quantization unit 404, an inverse orthogonal transform unit 405, an addition unit 406, A reference image memory 407, an intra prediction unit 408, an inter prediction unit 409, a selection switch 410, a 1D transformation matrix setting unit 411, and an output buffer 412 are included.

図２２の画像復号化装置は、入力バッファ４０１に蓄積される符号化データ４１４を復号し、復号画像４１９を出力バッファ４１２に蓄積して出力画像４２５として出力する。符号化データ４１４は、例えば図１の画像符号化装置などから出力され、図示しない蓄積系または伝送系を経て、入力バッファ４０１に一時的に蓄積される。 The image decoding apparatus in FIG. 22 decodes the encoded data 414 stored in the input buffer 401, stores the decoded image 419 in the output buffer 412, and outputs it as an output image 425. The encoded data 414 is output from, for example, the image encoding device in FIG. 1 and the like, and is temporarily stored in the input buffer 401 via a storage system or a transmission system (not shown).

エントロピー復号化部４０２は、符号化データ４１４の復号化のために、１フレームまたは１フィールド毎にシンタクスに基づいて解読を行う。エントロピー復号化部４０２は、各シンタクスの符号列を順次エントロピー復号化し、予測モード情報４２１を含む予測情報４２４、量子化変換係数列４１５などの符号化対象ブロックの符号化パラメータを再生する。符号化パラメータとは、予測情報４２４、変換係数に関する情報、量子化に関する情報、などの復号に必要となるパラメータである。量子化変換係数列４１５は、係数順制御部４０３へ入力される。また、予測情報４２４に含まれる予測モード情報４２１も同様に、係数順制御部４０３へ入力される。予測情報４２４は、１Ｄ変換行列セット部４１１及び選択スイッチ４１０に入力される。 The entropy decoding unit 402 performs decoding based on the syntax for each frame or field for decoding the encoded data 414. The entropy decoding unit 402 sequentially entropy-decodes the code string of each syntax, and reproduces the encoding parameters of the encoding target block such as the prediction information 424 including the prediction mode information 421 and the quantized transform coefficient string 415. The encoding parameter is a parameter necessary for decoding such as prediction information 424, information on transform coefficients, information on quantization, and the like. The quantized transform coefficient sequence 415 is input to the coefficient order control unit 403. Similarly, the prediction mode information 421 included in the prediction information 424 is also input to the coefficient order control unit 403. The prediction information 424 is input to the 1D conversion matrix setting unit 411 and the selection switch 410.

係数順制御部４０３は、１次元表現である量子化変換係数列４１５を、２次元表現である量子化変換係数４１６に変換し、逆量子化部４０４に入力する。尚、係数制御部４０３の詳細は後述される。 The coefficient order control unit 403 converts the quantized transform coefficient sequence 415 that is a one-dimensional representation into a quantized transform coefficient 416 that is a two-dimensional representation, and inputs the quantized transform coefficient sequence 415 to the inverse quantization unit 404. Details of the coefficient control unit 403 will be described later.

逆量子化部４０４は、係数順制御部４０３からの量子化変換係数４１６に逆量子化を行って、復元変換係数４１７を得る。具体的には、逆量子化部４０４は、エントロピー復号化部４０２によって復号化された量子化に関する情報に従って逆量子化を行う。逆量子化部４０４は、復元変換係数４１７を逆直交変換部４０５に入力する。 The inverse quantization unit 404 performs inverse quantization on the quantized transform coefficient 416 from the coefficient order control unit 403 to obtain a restored transform coefficient 417. Specifically, the inverse quantization unit 404 performs inverse quantization according to the information related to quantization decoded by the entropy decoding unit 402. The inverse quantization unit 404 inputs the restored transform coefficient 417 to the inverse orthogonal transform unit 405.

逆直交変換部４０５は、逆量子化部４０４からの復元変換係数４１７に対して、符号化側において行われた直交変換に対応する逆直交変換を行い、復元予測誤差４１８を得る。逆直交変換部４０５は、復元予測誤差４１８を加算部４０６に入力する。 The inverse orthogonal transform unit 405 performs an inverse orthogonal transform corresponding to the orthogonal transform performed on the encoding side on the restored transform coefficient 417 from the inverse quantization unit 404 to obtain a restored prediction error 418. The inverse orthogonal transform unit 405 inputs the restoration prediction error 418 to the addition unit 406.

具体的には、本実施形態に係る逆直交変換部４０５は、図３の逆直交変換部１０５と実質的に同一または類似の要素なのでその詳細な説明を省略する。特に、本実施形態に係る逆直交変換部４０５は、図３の逆直交変換部１０５と共通の１Ｄ変換行列Ａ及び１Ｄ変換行列Ｂを利用する。尚、図３における復元変換係数１２２、１Ｄ変換行列セット情報１２９及び復元予測誤差１２３は、本実施形態における復元変換係数４１７、１Ｄ変換行列セット情報４２２及び復元予測誤差信号４１８に夫々対応している。 Specifically, since the inverse orthogonal transform unit 405 according to the present embodiment is substantially the same as or similar to the inverse orthogonal transform unit 105 of FIG. 3, detailed description thereof is omitted. In particular, the inverse orthogonal transform unit 405 according to the present embodiment uses the 1D transform matrix A and the 1D transform matrix B common to the inverse orthogonal transform unit 105 of FIG. Note that the restored transform coefficient 122, 1D transform matrix set information 129, and the restored prediction error 123 in FIG. 3 respectively correspond to the restored transform coefficient 417, 1D transform matrix set information 422, and the restored prediction error signal 418 in the present embodiment. .

加算部４０６は、復元予測誤差４１８と、対応する予測画像４２３とを加算し、復号画像４１９を生成する。復号画像４１９は、出力画像４２５のために出力バッファ４１２に一時的に蓄積されると共に、参照画像４２０のために参照画像メモリ４０７にも保存される。参照画像メモリ４０７に保存された復号画像４１９は、参照画像４２０としてイントラ予測部４０８及びインター予測部４０９によって必要に応じてフレーム単位またはフィールド単位で参照される。出力バッファ４１２に一時的に蓄積された復号画像４１９は、復号化制御部４１３によって管理される出力タイミングに従って出力される。 The adding unit 406 adds the restored prediction error 418 and the corresponding predicted image 423 to generate a decoded image 419. The decoded image 419 is temporarily stored in the output buffer 412 for the output image 425 and also stored in the reference image memory 407 for the reference image 420. The decoded image 419 stored in the reference image memory 407 is referred to by the intra prediction unit 408 and the inter prediction unit 409 as the reference image 420 in units of frames or fields as necessary. The decoded image 419 temporarily stored in the output buffer 412 is output according to the output timing managed by the decoding control unit 413.

イントラ予測部４０８、インター予測部４０９及び選択スイッチ４１０は、図１のイントラ予測部１０８、インター予測部１０９及び選択スイッチ１１１と実質的に同一または類似の要素なのでその詳細な説明を省略する。復号化制御部４１３は、図２２の画像復号化装置の各要素を制御する。具体的には、復号化制御部４１３は、上述の動作を含む復号化処理のための種々の制御を行う。 The intra prediction unit 408, the inter prediction unit 409, and the selection switch 410 are substantially the same as or similar to the intra prediction unit 108, the inter prediction unit 109, and the selection switch 111 in FIG. The decoding control unit 413 controls each element of the image decoding device in FIG. Specifically, the decoding control unit 413 performs various controls for the decoding process including the above-described operation.

１Ｄ変換行列セット部４１１は、エントロピー復号化部４０２からの予測情報４２４に含まれる予測モード情報に基づいて１Ｄ変換行列セット情報４２２を生成し、逆直交変換部４０５に入力する。 The 1D transform matrix set unit 411 generates 1D transform matrix set information 422 based on the prediction mode information included in the prediction information 424 from the entropy decoding unit 402 and inputs the 1D transform matrix set information 422 to the inverse orthogonal transform unit 405.

具体的には、本実施形態に係る１Ｄ変換行列セット部４１１は、第１の実施形態に係る１Ｄ変換行列セット部１１２と実質的に同一または類似の要素なのでその詳細な説明を省略する。即ち、本実施形態に係る１Ｄ変換行列セット部４１１は、例えば図４Ａ、図４Ｂ、図４Ｃ、図４Ｄ及び図４Ｅのテーブルを利用して、１Ｄ変換行列セット情報４２２を生成する。尚、第１の実施形態における予測情報１２６及び１Ｄ変換行列セット情報１２９は、本実施形態における予測情報４２４及び１Ｄ変換行列セット情報４２２に夫々対応している。
また、図２２の画像復号化装置は、図１１、図１２、図１３及び図１４に関して説明したシンタクスと同一または類似のシンタクスを利用するのでその詳細な説明を省略する。 Specifically, the 1D transformation matrix set unit 411 according to the present embodiment is substantially the same as or similar to the 1D transformation matrix set unit 112 according to the first embodiment, and thus detailed description thereof is omitted. That is, the 1D conversion matrix set unit 411 according to the present embodiment generates 1D conversion matrix set information 422 using, for example, the tables of FIGS. 4A, 4B, 4C, 4D, and 4E. Note that the prediction information 126 and the 1D transformation matrix set information 129 in the first embodiment correspond to the prediction information 424 and the 1D transformation matrix set information 422 in the present embodiment, respectively.
The image decoding apparatus in FIG. 22 uses the same or similar syntax as the syntax described with reference to FIGS. 11, 12, 13, and 14, and thus detailed description thereof is omitted.

以下、係数順制御部４０３の詳細を説明する。
係数順制御部４０３は、１次元表現である量子化変換係数列４１５の各要素を所定の順序（即ち、符号化側と対応する順序）に従って配列することにより、２次元表現である量子化変換係数４１６に変換する。一例として、符号化側において予測モードに関わらず共通の２Ｄ−１Ｄ変換が行われているならば、係数順制御部４０３は予測モードに関わらず共通の１Ｄ−２Ｄ変換を行うことができる。具体的には、係数制御部４０３は、Ｈ．２６４と同様に逆ジグザグスキャンを利用できる。逆ジグザグスキャンは、前述のジグザグスキャンに対応する１Ｄ−２Ｄ変換である
別の例として、符号化側において予測モード毎の個別の２Ｄ−１Ｄ変換が行われているならば、係数順制御部４０３もまた予測モード毎の個別の１Ｄ−２Ｄ変換を行うことができる。このような動作を行う係数順制御部４０３は、図２３Ａに例示されている。この係数順制御部４０３は、選択スイッチ１００１と、９種類の予測モード毎の個別の１Ｄ−２Ｄ変換部１００２，・・・，１０１０とを含む。選択スイッチ１００１は、予測情報４２４に含まれる予測モード情報（例えば、図４Ａの予測モードのインデックス）に従って量子化変換係数列４１５を、予測モードに応じた１Ｄ−２Ｄ変換部（１００２，・・・，１０１０のうちいずれか１つ）に導く。例えば、予測モードインデックスが０であれば、選択スイッチ１００１は量子化変換係数列４１５を１Ｄ−２Ｄ変換部１００２に導く。図２３Ａにおいて、各予測モードと１Ｄ−２Ｄ変換部とは１対１に対応しており、量子化変換係数列４１５は予測モードに応じた１つの１Ｄ−２Ｄ変換部に導かれ、量子化変換係数４１６に変換される。 Details of the coefficient order control unit 403 will be described below.
The coefficient order control unit 403 arranges the elements of the quantized transform coefficient sequence 415 that is a one-dimensional representation according to a predetermined order (that is, the order that corresponds to the encoding side), and thereby the quantized transform that is a two-dimensional representation. Convert to coefficient 416. As an example, if the common 2D-1D conversion is performed regardless of the prediction mode on the encoding side, the coefficient order control unit 403 can perform the common 1D-2D conversion regardless of the prediction mode. Specifically, the coefficient control unit 403 includes the H.264 standard. Similar to H.264, reverse zigzag scanning can be used. Inverse zigzag scanning is 1D-2D conversion corresponding to the above-described zigzag scanning. As another example, if individual 2D-1D conversion for each prediction mode is performed on the encoding side, the coefficient order control unit 403 Can also perform individual 1D-2D conversion for each prediction mode. The coefficient order control unit 403 that performs such an operation is illustrated in FIG. 23A. The coefficient order control unit 403 includes a selection switch 1001 and individual 1D-2D conversion units 1002, ..., 1010 for each of nine types of prediction modes. The selection switch 1001 converts the quantized transform coefficient sequence 415 into a 1D-2D transform unit (1002,... , 1010). For example, if the prediction mode index is 0, the selection switch 1001 guides the quantized transform coefficient sequence 415 to the 1D-2D transform unit 1002. In FIG. 23A, each prediction mode and the 1D-2D conversion unit have a one-to-one correspondence, and the quantized transform coefficient sequence 415 is guided to one 1D-2D transform unit corresponding to the prediction mode, and the quantized transform is performed. Converted to a coefficient 416.

更に別の例として、符号化側において２Ｄ−１Ｄ変換におけるスキャン順が動的に更新されるならば、係数順制御部４０３もまた１Ｄ−２Ｄ変換におけるスキャン順を符号化側と対応するように動的に更新してもよい。このような動作を行う係数順制御部４０３は、図２３Ｂに例示されている。この係数順制御部４０３は、選択スイッチ１００１と、９種類の予測モード毎の個別の１Ｄ−２Ｄ変換部１００２，・・・，１０１０と、発生頻度カウント部１０１１と、係数順更新部１０１２とを含む。選択スイッチ１００１は、図２３Ａに関して説明した通りである。９種類の予測モード毎の個別の１Ｄ−２Ｄ変換部１００２，・・・，１０１０は、そのスキャン順が係数順更新部１０１２によって更新される点で図２３Ａとは異なる。 As yet another example, if the scan order in the 2D-1D conversion is dynamically updated on the encoding side, the coefficient order control unit 403 also causes the scan order in the 1D-2D conversion to correspond to the encoding side. It may be updated dynamically. The coefficient order control unit 403 that performs such an operation is illustrated in FIG. 23B. The coefficient order control unit 403 includes a selection switch 1001, individual 1D-2D conversion units 1002,..., 1010 for each of nine types of prediction modes, an occurrence frequency counting unit 1011, and a coefficient order update unit 1012. Including. The selection switch 1001 is as described with reference to FIG. 23A. The individual 1D-2D conversion units 1002,..., 1010 for each of the nine types of prediction modes differ from FIG. 23A in that the scan order is updated by the coefficient order update unit 1012.

発生頻度カウント部１０１１は、予測モード毎に、量子化変換係数４１６の各要素における非零係数の発生回数のヒストグラムを作成する。発生頻度カウント部１０１１は、作成したヒストグラム１０１３を係数順更新部１０１２に入力する。 The occurrence frequency counting unit 1011 creates a histogram of the number of occurrences of non-zero coefficients in each element of the quantized transform coefficient 416 for each prediction mode. The occurrence frequency counting unit 1011 inputs the created histogram 1013 to the coefficient order updating unit 1012.

係数順更新部１０１２は、予め定められたタイミングで、ヒストグラム１０１３に基づいて係数順の更新を行う。上記タイミングは、例えば、コーディングツリーユニットの復号化処理が終了したタイミング、コーディングツリーユニット内の１ライン分の復号化処理が終了したタイミングなどである。 The coefficient order update unit 1012 updates the coefficient order based on the histogram 1013 at a predetermined timing. The timing is, for example, the timing when the decoding process of the coding tree unit is completed, the timing when the decoding process for one line in the coding tree unit is completed, or the like.

具体的には、係数順更新部１０１２は、ヒストグラム１０１３を参照して、非零係数の発生回数が閾値以上にカウントされた要素を持つ予測モードに関して係数順の更新を行う。例えば、係数順更新部１０１２は、非零係数の発生が１６回以上カウントされた要素を持つ予測モードに関して更新を行う。このような発生回数に閾値を設けることによって、係数順の更新が大域的に実施されるので、局所的な最適解に収束しにくくなる。 Specifically, the coefficient order update unit 1012 refers to the histogram 1013 and updates the coefficient order for a prediction mode having an element in which the number of occurrences of non-zero coefficients is counted above a threshold. For example, the coefficient order update unit 1012 updates the prediction mode having an element in which the occurrence of a non-zero coefficient is counted 16 times or more. By providing a threshold value for the number of occurrences, the coefficient order is updated globally, so that it is difficult to converge to a local optimum solution.

係数順更新部１０１２は、更新対象となる予測モードに関して、非零係数の発生頻度の降順に要素をソーティングする。ソーティングは、例えばバブルソート、クイックソートなどの既存のアルゴリズムによって実現できる。そして、係数順更新部１０１２は、ソーティングされた要素の順序を示す係数順更新情報１０１４を、更新対象となる予測モードに対応する１Ｄ−２Ｄ変換部に入力する。 The coefficient order update unit 1012 sorts the elements in descending order of the occurrence frequency of the non-zero coefficient with respect to the prediction mode to be updated. Sorting can be realized by existing algorithms such as bubble sort and quick sort. Then, the coefficient order update unit 1012 inputs coefficient order update information 1014 indicating the order of the sorted elements to the 1D-2D conversion unit corresponding to the prediction mode to be updated.

係数順更新情報１０１４が入力されると、１Ｄ−２Ｄ変換部は更新後のスキャン順に従って１Ｄ−２Ｄ変換を行う。尚、スキャン順を動的に更新する場合には、各１Ｄ−２Ｄ変換部の符号化側と対応する初期スキャン順を予め定めておく必要がある。 When the coefficient order update information 1014 is input, the 1D-2D conversion unit performs 1D-2D conversion according to the updated scan order. Note that when the scan order is dynamically updated, it is necessary to determine in advance the initial scan order corresponding to the encoding side of each 1D-2D conversion unit.

尚、簡単化のためにＨ．２６４を例示して予測モードが９種類の場合を説明したが、予測モードが１７種類、３３種類などに拡張された場合にも、拡張された各予測モードに対応する１Ｄ−２Ｄ変換部を追加すれば予測モード毎の個別の１Ｄ−２Ｄ変換を行うことができる。 For simplification, H.C. H.264 has been described as an example of nine prediction modes. However, when the prediction mode is expanded to 17 types, 33 types, etc., a 1D-2D conversion unit corresponding to each expanded prediction mode is added. Then, individual 1D-2D conversion for each prediction mode can be performed.

以上説明したように、本実施形態に係る画像復号化装置は、前述の第１の実施形態に係る画像符号化装置と同一または類似の逆直交変換部を持つ。故に、本実施形態に係る画像復号化装置によれば、前述の第１の実施形態に係る画像符号化装置と同一または類似の効果が得られる。 As described above, the image decoding apparatus according to the present embodiment has the same or similar inverse orthogonal transform unit as the image encoding apparatus according to the first embodiment described above. Therefore, according to the image decoding apparatus according to the present embodiment, the same or similar effects as those of the image encoding apparatus according to the first embodiment described above can be obtained.

（第５の実施形態）
第５の実施形態に係る画像復号化装置は、前述の第４の実施形態に係る画像復号化装置と逆直交変換の詳細において異なる。以降の説明では、本実施形態において第４の実施形態と同一部分には同一符号を付して示し、異なる部分を中心に説明する。本実施形態に係る画像復号化装置に対応する画像符号化装置は、第２の実施形態において説明した通りである。 (Fifth embodiment)
The image decoding apparatus according to the fifth embodiment differs from the image decoding apparatus according to the above-described fourth embodiment in the details of inverse orthogonal transform. In the following description, in this embodiment, the same parts as those in the fourth embodiment are denoted by the same reference numerals, and different parts will be mainly described. The image coding apparatus corresponding to the image decoding apparatus according to the present embodiment is as described in the second embodiment.

本実施形態に係る逆直交変換部４０５は、図１７の逆直交変換部１０５と実質的に同一または類似の要素なのでその詳細な説明を省略する。特に、本実施形態に係る逆直交変換部４０５は、図１７の逆直交変換部１０５と共通の１Ｄ変換行列Ｃ、１Ｄ変換行列Ｄ及び１Ｄ変換行列Ｅを利用する。尚、図１７における復元変換係数１２２、１Ｄ変換行列セット情報１２９及び復元予測誤差１２３は、本実施形態における復元変換係数４１７、１Ｄ変換行列セット情報４２２及び復元予測誤差信号４１８に夫々対応している。 Since the inverse orthogonal transform unit 405 according to the present embodiment is substantially the same as or similar to the inverse orthogonal transform unit 105 of FIG. 17, detailed description thereof is omitted. In particular, the inverse orthogonal transform unit 405 according to the present embodiment uses the 1D transform matrix C, the 1D transform matrix D, and the 1D transform matrix E common to the inverse orthogonal transform unit 105 in FIG. Note that the restored transform coefficient 122, 1D transform matrix set information 129, and the restored prediction error 123 in FIG. 17 correspond to the restored transform coefficient 417, 1D transform matrix set information 422, and the restored prediction error signal 418 in the present embodiment, respectively. .

本実施形態に係る１Ｄ変換行列セット部４１１は、第２の実施形態に係る１Ｄ変換行列セット部１１２と実質的に同一または類似の要素なのでその詳細な説明を省略する。即ち、本実施形態に係る１Ｄ変換行列セット部４１１は、例えば図１８Ａ、図１８Ｂ、図１８Ｃ図１８Ｄ及び図１８Ｅのテーブルを利用して、１Ｄ変換行列セット情報４２２を生成する。尚、第２の実施形態における予測情報１２６及び１Ｄ変換行列セット情報１２９は、本実施形態における予測情報４２４及び１Ｄ変換行列セット情報４２２に夫々対応している。 Since the 1D transformation matrix set unit 411 according to the present embodiment is substantially the same as or similar to the 1D transformation matrix set unit 112 according to the second embodiment, detailed description thereof is omitted. That is, the 1D conversion matrix set unit 411 according to the present embodiment generates 1D conversion matrix set information 422 using, for example, the tables of FIGS. 18A, 18B, 18C, 18D, and 18E. Note that the prediction information 126 and the 1D transformation matrix set information 129 in the second embodiment correspond to the prediction information 424 and the 1D transformation matrix set information 422 in the present embodiment, respectively.

以上説明したように、本実施形態に係る画像復号化装置は、前述の第２の実施形態に係る画像符号化装置と同一または類似の逆直交変換部を持つ。故に、本実施形態に係る画像復号化装置によれば、前述の第２の実施形態に係る画像符号化装置と同一または類似の効果が得られる。 As described above, the image decoding apparatus according to the present embodiment has the same or similar inverse orthogonal transform unit as the image encoding apparatus according to the second embodiment described above. Therefore, according to the image decoding apparatus according to the present embodiment, the same or similar effects as those of the image encoding apparatus according to the second embodiment described above can be obtained.

（第６の実施形態）
第６の実施形態に係る画像復号化装置は、前述の第４の実施形態及び第５の実施形態に係る画像復号化装置と逆直交変換の詳細において異なる。以降の説明では、本実施形態において第４の実施形態または第５の実施形態と同一部分には同一符号を付して示し、異なる部分を中心に説明する。本実施形態に係る画像復号化装置に対応する画像符号化装置は、第３の実施形態において説明した通りである。 (Sixth embodiment)
The image decoding device according to the sixth embodiment differs from the image decoding devices according to the fourth embodiment and the fifth embodiment described above in details of inverse orthogonal transform. In the following description, the same parts as those in the fourth embodiment or the fifth embodiment are denoted by the same reference numerals in the present embodiment, and different parts will be mainly described. The image encoding device corresponding to the image decoding device according to the present embodiment is as described in the third embodiment.

本実施形態に係る逆直交変換部４０５は、図２０の逆直交変換部１０５と実質的に同一または類似の要素なのでその詳細な説明を省略する。特に、本実施形態に係る逆直交変換部４０５は、図２０の逆直交変換部１０５と共通の１Ｄ変換行列Ｆ、１Ｄ変換行列Ｇ及び１Ｄ変換行列Ｈを利用する。尚、図２０における復元変換係数１２２、１Ｄ変換行列セット情報１２９及び復元予測誤差１２３は、本実施形態における復元変換係数４１７、１Ｄ変換行列セット情報４２２及び復元予測誤差信号４１８に夫々対応している。 Since the inverse orthogonal transform unit 405 according to the present embodiment is substantially the same as or similar to the inverse orthogonal transform unit 105 of FIG. 20, detailed description thereof is omitted. In particular, the inverse orthogonal transform unit 405 according to the present embodiment uses the 1D transform matrix F, the 1D transform matrix G, and the 1D transform matrix H that are common to the inverse orthogonal transform unit 105 in FIG. Note that the restored transform coefficient 122, 1D transform matrix set information 129, and the restored prediction error 123 in FIG. 20 respectively correspond to the restored transform coefficient 417, 1D transform matrix set information 422, and the restored prediction error signal 418 in the present embodiment. .

本実施形態に係る１Ｄ変換行列セット部４１１は、第３の実施形態に係る１Ｄ変換行列セット部１１２と実質的に同一または類似の要素なのでその詳細な説明を省略する。即ち、本実施形態に係る１Ｄ変換行列セット部４１１は、例えば図２１Ａ、図２１Ｂ、図２１Ｃ、図２１Ｄ及び図２１Ｅのテーブルを利用して、１Ｄ変換行列セット情報４２２を生成する。尚、第３の実施形態における予測情報１２６及び１Ｄ変換行列セット情報１２９は、本実施形態における予測情報４２４及び１Ｄ変換行列セット情報４２２に夫々対応している。 Since the 1D conversion matrix set unit 411 according to the present embodiment is substantially the same as or similar to the 1D conversion matrix set unit 112 according to the third embodiment, detailed description thereof is omitted. That is, the 1D conversion matrix set unit 411 according to the present embodiment generates 1D conversion matrix set information 422 using, for example, the tables of FIGS. 21A, 21B, 21C, 21D, and 21E. Note that the prediction information 126 and the 1D transformation matrix set information 129 in the third embodiment correspond to the prediction information 424 and the 1D transformation matrix set information 422 in the present embodiment, respectively.

以上説明したように、本実施形態に係る画像復号化装置は、前述の第３の実施形態に係る画像符号化装置と同一または類似の逆直交変換部を持つ。故に、本実施形態に係る画像復号化装置によれば、前述の第３の実施形態に係る画像符号化装置と同一または類似の効果が得られる。 As described above, the image decoding apparatus according to this embodiment has the same or similar inverse orthogonal transform unit as that of the image encoding apparatus according to the third embodiment described above. Therefore, according to the image decoding apparatus according to the present embodiment, the same or similar effects as those of the image encoding apparatus according to the third embodiment described above can be obtained.

第４乃至第６の実施形態では、２種類または３種類の１Ｄ変換行列を夫々用意し、予測モードに応じて垂直逆変換及び水平逆変換のための１Ｄ変換行列を選択する。しかしながら、前述の２種類または３種類の１Ｄ変換行列は、例示であり、更に多くの変換行列を用意して符号化効率を向上させることも可能である。例えば、第５の実施形態と第６の実施形態とを組み合わせて４種類の１Ｄ変換行列を用意することも可能である。但し、用意する変換行列の種類の増加に伴って更なるハードウェアなどが必要となるので、変換行列の種類の増加に伴うデメリットと符号化効率とのバランスを考慮することが望ましい。 In the fourth to sixth embodiments, two or three types of 1D conversion matrices are prepared, and a 1D conversion matrix for vertical inverse transformation and horizontal inverse transformation is selected according to the prediction mode. However, the above-described two or three types of 1D transformation matrices are examples, and it is possible to prepare more transformation matrices to improve the encoding efficiency. For example, it is possible to prepare four types of 1D conversion matrices by combining the fifth embodiment and the sixth embodiment. However, since additional hardware is required as the types of transformation matrices to be prepared increase, it is desirable to consider the balance between the disadvantages associated with the increase in types of transformation matrices and the coding efficiency.

以下、各実施形態の変形例を列挙して紹介する。
第１乃至第６の実施形態において、フレームを１６×１６画素サイズなどの矩形ブロックに分割し、画面左上のブロックから右下に向かって順に符号化／復号化を行う例について説明している（図６Ａを参照）。しかしながら、符号化順序及び復号化順序はこの例に限定されない。例えば、右下から左上に向かって順に符号化及び復号化が行われてもよいし、画面中央から画面端に向かって渦巻を描くように符号化及び復号化が行われてもよい。更に、右上から左下に向かって順に符号化及び復号化が行われてもよいし、画面端から画面中央に向かって渦巻きを描くように符号化及び復号化が行われてもよい。 Hereinafter, modifications of each embodiment will be listed and introduced.
In the first to sixth embodiments, an example is described in which a frame is divided into rectangular blocks of 16 × 16 pixel size and the like, and encoding / decoding is performed in order from the upper left block to the lower right side of the screen ( (See FIG. 6A). However, the encoding order and the decoding order are not limited to this example. For example, encoding and decoding may be performed sequentially from the lower right to the upper left, or encoding and decoding may be performed so as to draw a spiral from the center of the screen toward the screen end. Furthermore, encoding and decoding may be performed sequentially from the upper right to the lower left, or encoding and decoding may be performed so as to draw a spiral from the screen end toward the center of the screen.

第１乃至第６の実施形態において、４×４画素ブロック、８×８画素ブロック、１６×１６画素ブロックなどの予測対象ブロックサイズを例示して説明を行ったが、予測対象ブロックは均一なブロック形状でなくてもよい。例えば、予測対象ブロックサイズは、１６×８画素ブロック、８×１６画素ブロック、８×４画素ブロック、４×８画素ブロックなどであってもよい。また、１つのコーディングツリーユニット内で全てのブロックサイズを統一させる必要はなく、複数の異なるブロックサイズを混在させてもよい。１つのコーディングツリーユニット内で複数の異なるブロックサイズを混在させる場合、分割数の増加に伴って分割情報を符号化または復号化するための符号量も増加する。そこで、分割情報の符号量と局部復号画像または復号画像の品質との間のバランスを考慮して、ブロックサイズを選択することが望ましい。 In the first to sixth embodiments, the description has been given by exemplifying the prediction target block size such as the 4 × 4 pixel block, the 8 × 8 pixel block, and the 16 × 16 pixel block. However, the prediction target block is a uniform block. It does not have to be a shape. For example, the prediction target block size may be a 16 × 8 pixel block, an 8 × 16 pixel block, an 8 × 4 pixel block, a 4 × 8 pixel block, or the like. Also, it is not necessary to unify all the block sizes within one coding tree unit, and a plurality of different block sizes may be mixed. When a plurality of different block sizes are mixed in one coding tree unit, the amount of codes for encoding or decoding the division information increases as the number of divisions increases. Therefore, it is desirable to select the block size in consideration of the balance between the code amount of the division information and the quality of the locally decoded image or the decoded image.

第１乃至第６の実施形態において、簡単化のために、輝度信号と色差信号とを区別せず、色信号成分に関して包括的な説明を記述した。しかしながら、予測処理が輝度信号と色差信号との間で異なる場合には、同一または異なる予測方法が用いられてよい。輝度信号と色差信号との間で異なる予測方法が用いられるならば、色差信号に対して選択した予測方法を輝度信号と同様の方法で符号化または復号化できる。 In the first to sixth embodiments, for the sake of simplification, a comprehensive description of the color signal component is described without distinguishing between the luminance signal and the color difference signal. However, when the prediction process is different between the luminance signal and the color difference signal, the same or different prediction methods may be used. If different prediction methods are used between the luminance signal and the chrominance signal, the prediction method selected for the chrominance signal can be encoded or decoded in the same manner as the luminance signal.

第１乃至第６の実施形態において、簡単化のために、輝度信号と色差信号とを区別せず、色信号成分に関して包括的な説明を記述した。しかしながら、直交変換処理が輝度信号と色差信号との間で異なる場合には、同一または異なる直交変換方法が用いられてよい。輝度信号と色差信号との間で異なる直交変換方法が用いられるならば、色差信号に対して選択した直交変換方法を輝度信号と同様の方法で符号化または復号化できる。 In the first to sixth embodiments, for the sake of simplification, a comprehensive description of the color signal component is described without distinguishing between the luminance signal and the color difference signal. However, when the orthogonal transformation process is different between the luminance signal and the color difference signal, the same or different orthogonal transformation methods may be used. If different orthogonal transformation methods are used between the luminance signal and the color difference signal, the orthogonal transformation method selected for the color difference signal can be encoded or decoded in the same manner as the luminance signal.

以上説明したように、各実施形態は、ハードウェア実装及びソフトウェア実装における困難性を緩和しつつ、高効率な直交変換及び逆直交変換を実現する。故に、各実施形態によれば、符号化効率が向上し、ひいては主観画質も向上する。 As described above, each embodiment realizes highly efficient orthogonal transformation and inverse orthogonal transformation while alleviating the difficulty in hardware implementation and software implementation. Therefore, according to each embodiment, the encoding efficiency is improved, and the subjective image quality is also improved.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 Although several embodiments of the present invention have been described, these embodiments are presented by way of example and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

例えば、上記各実施形態の処理を実現するプログラムを、コンピュータで読み取り可能な記憶媒体に格納して提供することも可能である。記憶媒体としては、磁気ディスク、光ディスク（ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＤＶＤ等）、光磁気ディスク（ＭＯ等）、半導体メモリなど、プログラムを記憶でき、かつ、コンピュータが読み取り可能な記憶媒体であれば、その記憶形式は何れの形態であってもよい。 For example, it is possible to provide a program that realizes the processing of each of the above embodiments by storing it in a computer-readable storage medium. The storage medium may be a computer-readable storage medium such as a magnetic disk, optical disk (CD-ROM, CD-R, DVD, etc.), magneto-optical disk (MO, etc.), semiconductor memory, etc. For example, the storage format may be any form.

また、上記各実施形態の処理を実現するプログラムを、インターネットなどのネットワークに接続されたコンピュータ（サーバ）上に格納し、ネットワーク経由でコンピュータ（クライアント）にダウンロードさせてもよい。 Further, the program for realizing the processing of each of the above embodiments may be stored on a computer (server) connected to a network such as the Internet and downloaded to the computer (client) via the network.

１０１・・・減算部
１０２・・・直交変換部
１０３・・・量子化部
１０４・・・逆量子化部
１０５・・・逆直交変換部
１０６・・・加算部
１０７・・・参照画像メモリ
１０８・・・イントラ予測部
１０９・・・インター予測部
１１０・・・予測選択部
１１１・・・選択スイッチ
１１２・・・１Ｄ変換行列セット部
１１３・・・係数順制御部
１１４・・・エントロピー符号化部
１１５・・・出力バッファ
１１６・・・符号化制御部
１１７・・・量子化変換係数列
１１８・・・入力画像
１１９・・・予測誤差
１２０・・・変換係数
１２１・・・量子化変換係数
１２２・・・復元変換係数
１２３・・・復元予測誤差
１２４・・・局所復号画像
１２５・・・参照画像
１２６・・・予測情報
１２７・・・予測画像
１２９・・・１Ｄ変換行列セット情報
１３０・・・符号化データ
２０１，２０４，８０１，８０４，１１０１，１１０４，１２０１，１２０４・・・選択スイッチ
２０２，８０２，１１０２，１２０２・・・垂直変換部
２０６，・・・，２０９，８０６，・・・，８１１，１２０６，・・・，１２１１・・・１Ｄ直交変換部
２０３，１１０３・・・転置部
２０５，８０５，１１０５，１２０５・・・水平変換部
３０１，３０４，９０１，９０４，１３０１，１３０４・・・選択スイッチ
３０２，９０２，１３０２・・・垂直逆変換部
３０３・・・転置部
３０５，９０５，１３０５・・・水平逆変換部
３０６，・・・，３０９，９０６，・・・，９１１，１３０６，・・・，１３１１・・・１Ｄ逆直交変換部
４０１・・・入力バッファ
４０２・・・エントロピー復号化部
４０３・・・係数順制御部
４０４・・・逆量子化部
４０５・・・逆直交変換部
４０６・・・加算部
４０７・・・参照画像メモリ
４０８・・・イントラ予測部
４０９・・・インター予測部
４１０・・・選択スイッチ
４１１・・・１Ｄ変換行列セット部
４１２・・・出力バッファ
４１３・・・復号化制御部
４１４・・・符号化データ
４１５・・・量子化変換係数列
４１６・・・量子化変換係数
４１７・・・復元変換係数
４１８・・・復元予測誤差
４１９・・・復号画像
４２０・・・参照画像
４２１・・・予測モード情報
４２２・・・１Ｄ変換行列セット情報
４２３・・・予測画像
４２４・・・予測情報
４２５・・・出力画像
５０１・・・選択スイッチ
５０２，・・・，５１０・・・２Ｄ−１Ｄ変換部
５１１・・・発生頻度カウント部
５１２・・・係数順更新部
５１３・・・ヒストグラム
５１４・・・係数順更新情報
７００・・・シンタクス
７０１・・・ハイレベルシンタクス
７０２・・・スライスレベルシンタクス
７０３・・・コーディングツリーレベルシンタクス
７０４・・・シーケンスパラメータセットシンタクス
７０５・・・ピクチャパラメータセットシンタクス
７０６・・・スライスヘッダーシンタクス
７０７・・・スライスデータシンタクス
７０８・・・コーディングツリーユニットシンタクス
７０９・・・プレディクションユニットシンタクス
７１０・・・トランスフォームユニットシンタクス
１００１・・・選択スイッチ
１００２，・・・，１０１０・・・１Ｄ−２Ｄ変換部
１０１１・・・発生頻度カウント部
１０１２・・・係数順更新部
１０１３・・・ヒストグラム
１０１４・・・係数順更新情報 DESCRIPTION OF SYMBOLS 101 ... Subtraction part 102 ... Orthogonal transformation part 103 ... Quantization part 104 ... Dequantization part 105 ... Inverse orthogonal transformation part 106 ... Addition part 107 ... Reference image memory 108 ... Intra prediction unit 109 ... Inter prediction unit 110 ... Prediction selection unit 111 ... Selection switch 112 ... 1D transform matrix set unit 113 ... Coefficient order control unit 114 ... Entropy coding Unit 115 ... output buffer 116 ... encoding control unit 117 ... quantized transform coefficient sequence 118 ... input image 119 ... prediction error 120 ... transform coefficient 121 ... quantized transform coefficient 122: Restoration conversion coefficient 123: Restoration prediction error 124 ... Local decoded image 125 ... Reference image 126 ... Prediction information 127 ... Prediction image 129 ... 1D variation Matrix set information 130 ... Encoded data 201, 204, 801, 804, 1101, 1104, 1201, 1204 ... Selection switch 202, 802, 1102, 1202 ... Vertical conversion unit 206, ..., 209 , 806,..., 811, 1206,..., 1211... 1D orthogonal transform unit 203, 1103, transpose unit 205, 805, 1105, 1205, horizontal transform unit 301, 304, 901,. 904, 1301, 1304 ... selection switch 302, 902, 1302 ... vertical reverse conversion unit 303 ... transposition unit 305, 905, 1305 ... horizontal reverse conversion unit 306, ..., 309, 906 ..., 911, 1306, ..., 1311 ... 1D inverse orthogonal transform unit 401 ... input buffer 402 ... entry P-decoding unit 403 ... Coefficient order control unit 404 ... Inverse quantization unit 405 ... Inverse orthogonal transformation unit 406 ... Addition unit 407 ... Reference image memory 408 ... Intra prediction unit 409 .. Inter prediction unit 410 ... selection switch 411 ... 1D transform matrix set unit 412 ... output buffer 413 ... decoding control unit 414 ... encoded data 415 ... quantized transform coefficient sequence 416: quantized transform coefficient 417: restored transform coefficient 418 ... restored prediction error 419 ... decoded image 420 ... reference image 421 ... prediction mode information 422 ... 1D transform matrix set information 423 ... Predicted image 424 ... Predicted information 425 ... Output image 501 ... Selection switch 502, ..., 510 ... 2D-1D converter 511 ... Occurrence frequency counting unit 512... Coefficient order update unit 513... Histogram 514... Coefficient order update information 700 ... Syntax 701 ... High level syntax 702 ... Slice level syntax 703 ... Coding tree Level syntax 704 ... Sequence parameter set syntax 705 ... Picture parameter set syntax 706 ... Slice header syntax 707 ... Slice data syntax 708 ... Coding tree unit syntax 709 ... Prediction unit syntax 710 ··· Transform unit syntax 1001 ··· Selection switch 1002, ···, 1010 ··· 1D-2D converter 1011 ··· Frequency counter 1012 ··· Coefficient order update section 1013 ... histogram 1014 ... coefficient order update information

Claims

A decoding unit to decode the transform coefficient to be decoded from the sign-data,
A set unit that sets a combination of a vertical transformation matrix and a horizontal transformation matrix corresponding to the prediction mode to be decoded, based on a predetermined relationship according to a predicted image generation method;
Using the set vertical transformation matrix and the horizontal transformation matrix, an inverse transformation unit that obtains a prediction error by performing vertical inverse transformation and horizontal inverse transformation on the transformation coefficient;
An adder that generates a decoded image based on the prediction error, and
The combination is a combination between a first transformation matrix, there is a combination between different second transformation matrix from the first transformation matrix,
The combination of the second transformation matrices is set to a plurality of intra prediction modes respectively corresponding to Diagonal down right, Vertical right and Horizontal down directions.
An image decoding apparatus characterized by that.

The image decoding apparatus according to claim 1, wherein the encoded data is output from another apparatus via a communication line.