JP4995789B2

JP4995789B2 - Intra-screen predictive encoding method, intra-screen predictive decoding method, these devices, their programs, and recording media recording the programs

Info

Publication number: JP4995789B2
Application number: JP2008217488A
Authority: JP
Inventors: 誠明松村; 誠之高村; 一人上倉; 由幸八島
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2008-08-27
Filing date: 2008-08-27
Publication date: 2012-08-08
Anticipated expiration: 2028-08-27
Also published as: JP2010056701A

Description

本発明は，画像・映像符号化における画面内予測の圧縮効率向上に関する発明である。同一の画面内で予測を実行する画面内予測符号化は，異なるフレーム間で予測を実行するフレーム間予測符号化と比較して大きな圧縮効率を達成できない。そのため，より圧縮効率の高い画面内予測符号化方式が望まれている。 The present invention relates to an improvement in compression efficiency of intra prediction in image / video coding. Intra-frame predictive coding that performs prediction within the same screen cannot achieve greater compression efficiency than inter-frame predictive coding that performs prediction between different frames. Therefore, an intra-screen predictive coding method with higher compression efficiency is desired.

画像・映像符号化における画面内予測は，映像符号化国際標準Ｈ．２６４／ＭＰＥＧ−４ＡＶＣ（非特許文献１参照）から取り入れられている手法であり，この手法を用いることにより，隣接画素間の相関を利用してエッジ部分に発生しやすい高周波成分を除去し，差分の振幅を効率的に抑えることができる。画面内予測は，いくつかの画素をまとめたブロック単位で行われ，Ｈ．２６４／ＭＰＥＧ−４ＡＶＣＦＲＥｘｔ（非特許文献２参照，以降Ｈ．２６４）では，輝度信号に対して１６×１６のマクロブロック内を４×４，８×８，１６×１６の３種類のブロックサイズで分割・予測を行う。 Intra-screen prediction in image / video coding is the video coding international standard H.264. H.264 / MPEG-4 AVC (see Non-Patent Document 1), this method is used to remove high-frequency components that are likely to occur at the edge using the correlation between adjacent pixels, The amplitude of the difference can be efficiently suppressed. In-screen prediction is performed in units of blocks in which several pixels are combined. In H.264 / MPEG-4 AVC FRExt (refer to Non-Patent Document 2; hereinafter H.264), three types of blocks of 4 × 4, 8 × 8, and 16 × 16 are included in a 16 × 16 macroblock for a luminance signal. Divide and predict by size.

これらの輝度信号に対して，４×４，８×８のブロックには９種類の予測モードが，１６×１６のブロックには４種類の予測モードが適用可能であり，ラグランジュコストの計算により最も符号化効率が高い予測モードとブロックサイズが決定される。４×４，８×８ブロックでは，１種類のＤＣ成分と４５°〜２０６．５７°の予測角度間を８種類に不等角度で分割したときに生じる直線と交差する復号済みの各画素値・画素間中心値を参照することで予測値の生成を実現している。 For these luminance signals, 9 types of prediction modes can be applied to 4 × 4, 8 × 8 blocks, and 4 types of prediction modes can be applied to 16 × 16 blocks. A prediction mode and a block size with high coding efficiency are determined. In the 4 × 4, 8 × 8 block, each decoded pixel value that intersects with a straight line generated when one type of DC component and a predicted angle of 45 ° to 206.57 ° are divided into eight types at unequal angles. -Generation of a predicted value is realized by referring to the center value between pixels.

図２０に，輝度信号の符号化に使用される４×４ブロックに適用可能な９種類の予測モード０〜８とその予測角度を示す。 FIG. 20 shows nine types of prediction modes 0 to 8 applicable to a 4 × 4 block used for encoding a luminance signal and their prediction angles.

ここで，予測モード０〜２では，図２１（ａ）に定義する符号化対象ブロックの上に位置するブロックの持つ画素Ａ，Ｂ，Ｃ，Ｄの値と，図２１（ａ）に定義する符号化対象ブロックの左に位置するブロックの持つ画素Ｉ，Ｊ，Ｋ，Ｌの値を使って，符号化対象ブロックの持つ画素ａ〜ｐの値を予測する。 Here, in the prediction modes 0 to 2, the values of the pixels A, B, C, and D of the block located on the block to be encoded defined in FIG. 21A and the values defined in FIG. Using the values of the pixels I, J, K, and L of the block located to the left of the encoding target block, the values of the pixels a to p of the encoding target block are predicted.

また，予測モード３〜８では，これらの画素Ａ，Ｂ，Ｃ，Ｄ，Ｉ，Ｊ，Ｋ，Ｌの値と，図２１（ａ）に定義する符号化対象ブロックの左上に位置するブロックの持つ画素Ｘの値と，図２１（ａ）に定義する符号化対象ブロックの右上に位置するブロックの持つ画素Ｅ，Ｆ，Ｇ，Ｈの値とに基づいて，図２１（ｂ）に示すように，それらの画素Ｘ，Ａ〜Ｌの値に対応付けられる値Ｘ′，Ａ′〜Ｌ′を算出するとともに，図２１（ａ）に定義する符号化対象ブロックの左下に位置するブロックの持つ画素Ｍの値に対応付けられる値Ｍ′を算出し，さらに，画素間中心に対応付けられる値Ａ″〜Ｆ″，Ｉ″〜Ｌ″を算出して，それらの値Ｘ′，Ａ′〜Ｍ′，Ａ″〜Ｆ″，Ｉ″〜Ｌ″を使って符号化対象ブロックの持つ画素ａ〜ｐの値を予測する。 In prediction modes 3 to 8, the values of these pixels A, B, C, D, I, J, K, and L and the block located at the upper left of the encoding target block defined in FIG. Based on the value of the pixel X possessed and the values of the pixels E, F, G, and H of the block located at the upper right of the encoding target block defined in FIG. Further, values X ′ and A ′ to L ′ associated with the values of the pixels X and A to L are calculated, and the block located at the lower left of the encoding target block defined in FIG. A value M ′ associated with the value of the pixel M is calculated, and further, values A ″ to F ″ and I ″ to L ″ associated with the center between pixels are calculated, and these values X ′, A ′ to M ′, A ″ to F ″, I ″ to L ″ are used to predict the values of the pixels a to p included in the encoding target block.

すなわち，図２０に示すように，予測モード０では，画素Ａ，Ｂ，Ｃ，Ｄの値を予測角度９０°の方向（垂直方向）にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測し，予測モード１では，画素Ｉ，Ｊ，Ｋ，Ｌの値を予測角度１８０°の方向（水平方向）にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測し，予測モード２では，（Ａ＋Ｂ＋Ｃ＋Ｄ＋Ｉ＋Ｊ＋Ｋ＋Ｌ＋４）／８の値に従って符号化対象ブロックの持つ画素ａ〜ｐの値を予測する。 That is, as shown in FIG. 20, in the prediction mode 0, the pixels a to p of the encoding target block are copied by copying the values of the pixels A, B, C, and D in the direction (vertical direction) of the prediction angle 90 °. In the prediction mode 1, the values of the pixels a to p of the encoding target block are obtained by copying the values of the pixels I, J, K, and L in the direction of the prediction angle 180 ° (horizontal direction). In prediction mode 2, the values of pixels a to p of the encoding target block are predicted according to the value of (A + B + C + D + I + J + K + L + 4) / 8.

また，予測モード３では，画素値に対応付ける形で算出した参照値（Ｂ′〜Ｈ′）を予測角度４５°の方向にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測し，予測モード４では，画素値に対応付ける形で算出した参照値を予測角度１３５°の方向にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測する。 In prediction mode 3, the reference values (B ′ to H ′) calculated in association with the pixel values are copied in the direction of the prediction angle 45 ° to predict the values of the pixels a to p of the block to be encoded. In the prediction mode 4, the reference values calculated in association with the pixel values are copied in the direction of the prediction angle 135 °, thereby predicting the values of the pixels a to p of the encoding target block.

そして，予測モード５では，画素値に対応付ける形で算出した参照値と画素間中心に対応付ける形で算出した参照値とを予測角度１１６．５７°の方向にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測し，予測モード６では，画素値に対応付ける形で算出した参照値と画素間中心に対応付ける形で算出した参照値とを予測角度１５３．４３°の方向にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測し，予測モード７では，画素値に対応付ける形で算出した参照値と画素間中心に対応付ける形で算出した参照値とを予測角度６３．４３°の方向にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測し，予測モード８では，画素値に対応付ける形で算出した参照値と画素間中心に対応付ける形で算出した参照値とを予測角度２０６．５７°の方向にコピーすることで符号化対象ブロックの持つ画素ａ〜ｐの値を予測する。 In prediction mode 5, the reference value calculated in association with the pixel value and the reference value calculated in association with the center between the pixels are copied in the direction of the prediction angle 116.57 ° to have the encoding target block. The values of the pixels a to p are predicted, and in the prediction mode 6, the reference value calculated in association with the pixel value and the reference value calculated in association with the center between the pixels are copied in the direction of the prediction angle 153.43 °. Thus, the values of the pixels a to p of the block to be encoded are predicted. In the prediction mode 7, the reference value calculated in the form corresponding to the pixel value and the reference value calculated in the form corresponding to the center between the pixels are set to the prediction angle 63. The value of the pixels a to p of the encoding target block is predicted by copying in the direction of .43 °, and in the prediction mode 8, the reference value calculated in association with the pixel value is associated with the center between the pixels. The calculated reference values to predict the value of a pixel a~p with the encoding target block by copying in the direction of the prediction angle 206.57 °.

このようにして，従来技術では，画面内予測符号化を実行する場合に，４×４，８×８のブロックでは，１種類のＤＣ成分を参照することで予測信号を生成することに加えて，４５°〜２０６．５７°の予測角度範囲を８種類に不等角度で分割したときに生じる直線と交叉する復号済みの各画素値・画素間中心値を参照することで予測信号の生成を実現している。 In this way, in the prior art, when performing intra prediction encoding, in the 4 × 4 and 8 × 8 blocks, in addition to generating a prediction signal by referring to one type of DC component, , 45 ° to 206.57 ° prediction angle range can be generated by referring to each decoded pixel value / inter-pixel center value that intersects with a straight line generated when the prediction angle range is divided into eight types at unequal angles. Realized.

図２２に，従来手法の映像符号化方式における符号化処理のフローチャートを示す。以下，図２２に従って，従来技術の映像符号化処理について説明する。 FIG. 22 shows a flowchart of the encoding process in the conventional video encoding method. Hereinafter, the conventional video encoding process will be described with reference to FIG.

まず，ステップＳ８０１では，ラグランジュコスト最小値Ｌ_minを＋∞で，ラグランジュコストが最小値をとる時のビット列Ｒ_bestを０で初期化し，ステップＳ８０２において，符号化対象ブロックに隣接するブロックから予測モードの予測値（以下，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅ）を求める。次に，ステップＳ８０３にて，ラグランジュコストを最小化する予測モード番号ｎを検出するためのループを行う（従来手法の画面内予測は９パターンのため，ｎ∈［０，１，... ，８］のループ）。 First, in step S801, the Lagrangian cost minimum value L _min is initialized to + ∞, and the bit string R _best when the Lagrangian cost takes the minimum value is initialized to 0. Is predicted (hereinafter referred to as Pred_PredMode). Next, in step S803, a loop for detecting the prediction mode number n that minimizes the Lagrangian cost is performed (since the in-screen prediction of the conventional method has 9 patterns, n∈ [0, 1,. 8] loop).

このループでは，まずステップＳ８０４において，予測モードｎのときに生成されるビット列Ｒを０で初期化し，ステップＳ８０５で，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅとｎとの比較により条件分岐を行う。これらが一致した場合には，ステップＳ８０６にてＲに１ビットの予測フラグを付加し，一致しない場合には，ステップＳ８０７にてＲに１ビットの予測フラグと，ｎの予測方向を示す３ビットの補正値を加える。 In this loop, first, in step S804, the bit string R generated in the prediction mode n is initialized with 0, and in step S805, conditional branching is performed by comparing Pred_PredMode with n. If they match, a 1-bit prediction flag is added to R in step S806. If they do not match, 1-bit prediction flag is added to R and 3 bits indicating the n prediction direction in step S807. Add a correction value.

続いて，ステップＳ８０８にて符号化対象ブロックにおける各画素の予測値を生成して予測残差を求め，ステップＳ８０９で予測残差のＤＣＴ，量子化，符号化を行い，生成されたビットをＲに付加する。続いてステップＳ８１０でＲのビット数と原画像との誤差からラグランジュコストＬ_cost（非特許文献３，４参照）を算出する。 Subsequently, in step S808, a prediction value of each pixel in the encoding target block is generated to obtain a prediction residual, and in step S809, the prediction residual is subjected to DCT, quantization, and encoding, and the generated bit is converted to R Append to In step S810, a Lagrangian cost L _cost (see Non-Patent Documents 3 and 4) is calculated from the error between the number of R bits and the original image.

次に，ステップＳ８１１においてラグランジュコスト最小値Ｌ_minとＬ_costとの比較を行い，Ｌ_costがＬ_minより小さければ，ステップＳ８１２にてＬ_minをＬ_costに，Ｒ_bestをＲに更新する。以上のステップＳ８０４〜Ｓ８１２の処理を，ｎ＝０から８までの各予測モードｎについて繰り返す。 Next, the comparison of the Lagrangian cost minimum L _min and L _cost in step S811, if L _cost is smaller than L _min, the L _min to L _cost at step S812, updates the R _best to R. The processes of steps S804 to S812 are repeated for each prediction mode n from n = 0 to 8.

従来手法では，ステップＳ８０２において，図２３に示すように符号化対象ブロックの上に隣接するブロック（以下，ＢｌｏｃｋＡ）の予測モードをＰｒｅｄＭｏｄｅＡ，左に隣接するブロック（以下，ＢｌｏｃｋＢ）の予測モードをＰｒｅｄＭｏｄｅＢとおき，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅを，次の式１によって決定している（非特許文献１，ｐ．１０２，式８−４２に記載）。 In the conventional method, in step S802, as shown in FIG. 23, the prediction mode of the block adjacent to the encoding target block (hereinafter referred to as Block A) is PredModeA, and the prediction mode of the block adjacent to the left (hereinafter Block B) is PredModeB. In other words, Pred_PredMode is determined by the following expression 1 (described in Non-Patent Document 1, p. 102, Expression 8-42).

Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅ
＝ｍｉｎ（ＰｒｅｄＭｏｄｅＡ，ＰｒｅｄＭｏｄｅＢ） …（式１）
すなわち，ＢｌｏｃｋＡとＢｌｏｃｋＢの予測モードのうち，小さいほうを予測モードの予測値（Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅ）としている。 Pred_PredMode
= Min (PredModeA, PredModeB) (Formula 1)
That is, the smaller one of the prediction modes of Block A and Block B is used as the prediction value (Pred_PredMode) of the prediction mode.

また，符号化対象ブロックに隣接する符号化済みブロックにおける予測モードから符号化対象ブロックの予測モードを予測するために１ビットのフラグを使用している。符号化対象ブロックにおける予測モードがＰｒｅｄ＿ＰｒｅｄＭｏｄｅと一致する場合には，ステップＳ８０６に示すように１ビットのフラグのみで記述可能であり，異なる場合には，ステップＳ８０７に示すように１ビットのフラグに加えて残りの８種類のいずれかを示す符号を３ビットの補正値として固定長符号化を行うため，計４ビットで記述される。
ITU-T Rec. H.264｜ISO/IEC 14496-10 version 1（2003） ITU-T Rec. H.264｜ISO/IEC 14496-10 version 4（2005） H.264 Reference Software JM ver. 13.2 Encoder ，http://iphome.hhi.de/suehring/tml/，2008 H.264 Reference Software KTA ver.1.9 Encorder, http://iphome.hhi.de/suehring/tml/, 2008 Further, a 1-bit flag is used to predict the prediction mode of the encoding target block from the prediction mode in the encoded block adjacent to the encoding target block. If the prediction mode in the encoding target block matches Pred_PredMode, it can be described with only a 1-bit flag as shown in step S806, and if different, in addition to the 1-bit flag as shown in step S807. In order to perform fixed-length encoding using a code indicating any of the remaining 8 types as a 3-bit correction value, the code is described in a total of 4 bits.
ITU-T Rec. H.264 | ISO / IEC 14496-10 version 1 (2003) ITU-T Rec. H.264 | ISO / IEC 14496-10 version 4 (2005) H.264 Reference Software JM ver. 13.2 Encoder, http://iphome.hhi.de/suehring/tml/, 2008 H.264 Reference Software KTA ver.1.9 Encorder, http://iphome.hhi.de/suehring/tml/, 2008

従来手法では，式１を用いて画面内予測における予測モードの予測値を生成しているため，例えばＰｒｅｄＭｏｄｅＡ，Ｂのいずれかに図２０に記載の縦方向の予測（予測モード０：Ｉｎｔｒａ４ｘ４ＰｒｅｄＭｏｄｅ０）が存在する場合，最も優先的に縦方向の予測がＰｒｅｄ＿ＰｒｅｄＭｏｄｅとして適用される。 In the conventional method, since the prediction value of the prediction mode in the intra prediction is generated using Equation 1, for example, the prediction in the vertical direction (prediction mode 0: Intra4 × 4PredMode0) illustrated in FIG. 20 is performed in either PredMode A or B. If present, the vertical prediction is most preferentially applied as Pred_PredMode.

図２４〜図２６は，適用される予測モード０，７による予測残差の比較を説明するための図である。図２４は原画像の画素値を示し，図２５は符号化対象ブロックを予測モード０で画面内符号化したときの予測画素値と予測残差を示し，図２６は符号化対象ブロックを予測モード７で画面内符号化したときの予測画素値と予測残差を示している。 24-26 is a figure for demonstrating the comparison of the prediction residual by the prediction modes 0 and 7 applied. 24 shows the pixel value of the original image, FIG. 25 shows the prediction pixel value and the prediction residual when the encoding target block is intra-coded in prediction mode 0, and FIG. 26 shows the encoding target block in the prediction mode. 7 shows a predicted pixel value and a predicted residual when intra-screen coding is performed.

ここで，図２４における現画像の中央下の４×４ブロックを符号化する状況を考える。ＰｒｅｄＭｏｄｅＡは予測モード７，ＰｒｅｄＭｏｄｅＢは予測モード０で予測されていると仮定する。このとき従来手法では，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅには予測モード０が選択されるが，画像のエッジに着目したとき，符号化対象ブロックは予測モード７をＰｒｅｄ＿ＰｒｅｄＭｏｄｅとしたほうが，３ビットの補正値が不要になることが容易に予想できる。 Now, consider the situation where the lower 4 × 4 block of the current image in FIG. 24 is encoded. It is assumed that PredModeA is predicted in prediction mode 7 and PredModeB is predicted in prediction mode 0. At this time, in the conventional method, the prediction mode 0 is selected for Pred_PredMode. However, when focusing on the edge of the image, the encoding target block does not require a 3-bit correction value when the prediction mode 7 is Pred_PredMode. Can be easily predicted.

このように，従来手法の画面内予測における予測モードの予測値生成手法では，予測が外れるケースが数多く存在し，ＰｒｅｄＭｏｄｅの補正のために１ビットのフラグと３ビットの補正値の計４ビットの記述が必要になるため符号量が増加する可能性が高い。その結果として符号量が増加する。 As described above, in the prediction value generation method of the prediction mode in the intra-screen prediction of the conventional method, there are many cases in which the prediction is out of order. Since description is required, there is a high possibility that the code amount will increase. As a result, the code amount increases.

本発明は，以上のような予測が外れる状況をオーバーヘッドなしに抑え，画面内予測における効果的な予測モードの予測方式を提供することを目的とする。 An object of the present invention is to provide an effective prediction mode prediction method in intra-screen prediction while suppressing the above-described situation in which prediction is lost without overhead.

前述の予測モードの予測が外れるケースを抑えるための方法として，本発明は，符号化対象ブロックに隣接する符号化済みブロックにおける予測モードによって決定される予測ベクトルに着目し，符号化対象ブロック中心方向への流入ベクトルの有無を選択の鍵として用いる。 As a method for suppressing the case where the prediction mode is not correctly predicted, the present invention pays attention to the prediction vector determined by the prediction mode in the encoded block adjacent to the encoding target block. The presence or absence of an inflow vector is used as a selection key.

ここで，予測ベクトルは，画面内予測における予測モードから決定される予測方向を表すベクトルであり，図２０に４×４ブロックの場合の例を示したように，画面内予測における予測モードから決定される予測角度（図２０参照）と一対一に対応する方向を意味する。 Here, the prediction vector is a vector representing the prediction direction determined from the prediction mode in the intra prediction, and is determined from the prediction mode in the intra prediction as shown in the example of the case of 4 × 4 blocks in FIG. Means a direction corresponding to the predicted angle (see FIG. 20).

また，符号化済みブロックの予測モードから決定される予測ベクトルが符号化対象ブロック中心方向に向いている場合に，その予測ベクトルを流入ベクトルという。予測ベクトルが符号化対象ブロック中心方向に向いているか否かの判断は，予測モードから決定される予測角度を参照し，後述する実施例で説明する閾値との比較で行う。 Further, when the prediction vector determined from the prediction mode of the encoded block is directed toward the center of the block to be encoded, the prediction vector is referred to as an inflow vector. Whether or not the prediction vector is directed toward the center of the block to be encoded is determined by referring to a prediction angle determined from the prediction mode and comparing with a threshold value described in an embodiment described later.

画面内予測は主に予測残差を抑えることでエッジを除去することにより，エッジ周辺に発生する高周波成分の除去に効果を発揮し，差分の振幅を効果的に抑える手法である。そのため画面内予測によって既に符号化済みブロックにおける予測残差を抑えることに成功していると仮定すると，符号化済みブロックはその予測ベクトルに沿ってエッジが存在する可能性が高いことが推測される。本発明では，予測モードの予測値を生成するにあたって，流入ベクトル有りの符号化済みブロックが複数存在する場合に，符号化済みブロックにおける参照画素におけるエッジを検出し，最もエッジが強く現れる方向の予測モードを持つ符号化済みブロックの予測モードを，符号化対象ブロックの予測モードの予測値とする。 In-screen prediction is a technique that effectively suppresses the amplitude of the difference by removing the edge mainly by suppressing the prediction residual, thereby effectively removing the high-frequency components generated around the edge. Therefore, assuming that the prediction residual in the already encoded block has been successfully suppressed by intra prediction, it is assumed that the encoded block is likely to have an edge along its prediction vector. . In the present invention, when generating a prediction value of a prediction mode, when there are a plurality of encoded blocks with an inflow vector, an edge in a reference pixel in the encoded block is detected, and a prediction of a direction in which the edge appears most strongly is performed. The prediction mode of the encoded block having the mode is set as the prediction value of the prediction mode of the encoding target block.

例えばＰｒｅｄＭｏｄｅＡにおける予測角度θ_Aが９０°辺りに存在する場合（符号化対象ブロック中心方向への流入ベクトルが存在する場合），符号化対象ブロックの参照画素Ａ〜Ｄ（図２０，図２１参照）上にエッジが出現する可能性が高い。そのため自然画像における画素の連続性を考えると，符号化対象ブロックにおいてもエッジがθ_Aの方向に連続している可能性が高く，ＰｒｅｄＭｏｄｅＡをＰｒｅｄ＿ＰｒｅｄＭｏｄｅとして適用するほうが適切であると考えられる。仮にＰｒｅｄＭｏｄｅＢが予測モード０（Ｉｎｔｒａ４ｘ４ＰｒｅｄＭｏｄｅ０）で予測されており，符号化対象ブロック方向への流入ベクトルが存在しない場合，符号化対象ブロックの参照画素Ｉ〜Ｌ（図２０，図２１参照）上にエッジが出現する可能性は低いため，ＰｒｅｄＭｏｄｅＢをＰｒｅｄ＿ＰｒｅｄＭｏｄｅとして適用すると予測モードの予測が外れる可能性が高くなる。 For example, when the prediction angle θ _{A in} PredMode _A exists around 90 ° (when there is an inflow vector toward the center of the block to be encoded), reference pixels A to D of the block to be encoded (see FIGS. 20 and 21). There is a high possibility that an edge will appear on top. Therefore, considering the continuity of pixels in a natural image, it is highly possible that edges are continuous in the direction of θ _A even in the encoding target block, and it is considered more appropriate to apply PredModeA as Pred_PredMode. If PredModeB is predicted in the prediction mode 0 (Intra4 × 4PredMode0) and there is no inflow vector in the direction of the encoding target block, an edge on the reference pixels I to L (see FIGS. 20 and 21) of the encoding target block. Since PredModeB is applied as Pred_PredMode, there is a high possibility that prediction mode will be unpredictable.

このように符号化対象ブロック中心方向への流入ベクトルの有無を判断することでエッジが存在する可能性を判断し，適切な予測モードを予測することで効率的な予測モードの予測値生成を実現する。 In this way, it is possible to determine the presence of an edge by determining the presence or absence of an inflow vector toward the center of the encoding target block, and to realize an efficient prediction mode generation by predicting an appropriate prediction mode. To do.

以上説明したように，本発明の使用により，従来の画面内予測における予測モードの予測値生成手法では考慮されていなかった符号化対象ブロック中心方向への流入ベクトルの有無を判断することによって，エッジが連続するような動画像において，従来手法より効率的な予測モードの予測値生成が実現可能となる。 As described above, by using the present invention, by determining whether there is an inflow vector toward the center of the block to be encoded, which is not considered in the prediction value generation method of the prediction mode in the conventional intra prediction, the edge is determined. It is possible to realize prediction value generation in a prediction mode more efficiently than a conventional method in a moving image in which the images are continuous.

その結果，従来手法では画面内予測の予測モードの決定に４ビット必要とされていたブロックを１ビットで表記可能になる確率が増加するため，符号量の削減につながり画面内予測の符号化効率が改善する。特に，低ビットレートの符号化では，３ビット減という効果は，高ビットレートと比較して顕著に現れるため効果が大きい。 As a result, the probability that a block that requires 4 bits for determining the prediction mode for intra prediction in the conventional method can be represented by 1 bit increases, leading to a reduction in the amount of codes and the encoding efficiency of intra prediction. Will improve. In particular, in the low bit rate encoding, the effect of 3 bits reduction is significant because it appears more conspicuously than the high bit rate.

また，本発明は従来手法のように画面内予測の予測モードの予測が予測モード番号に依存せず，予測モード番号から決定される予測ベクトルが持つ予測角度に依存するため，Ｈ．２６４準拠の画面内予測の予測モードの予測値生成方式以外の予測モードの予測方式に対しても親和性が高い。さらに，４×４ブロックだけでなく，８×８ブロックにおける予測モードの予測値生成手法としても使用することができる。 Further, according to the present invention, since the prediction of the prediction mode of the intra-screen prediction does not depend on the prediction mode number as in the conventional method, it depends on the prediction angle of the prediction vector determined from the prediction mode number. It has high affinity for prediction modes other than the prediction value generation method for the prediction mode of the prediction mode for intra-screen prediction based on H.264. Furthermore, it can be used not only as a 4 × 4 block but also as a prediction value generation method for a prediction mode in an 8 × 8 block.

本発明の実施形態を詳細に説明するに先立ち，その概要を従来技術と対比して説明する。 Prior to describing embodiments of the present invention in detail, an outline thereof will be described in comparison with the prior art.

前述のように，従来の画面内予測における予測モードの予測値生成方式では，図２３のように，上と左に隣接するブロックにおける予測モードの予測モード番号の小さいほうを予測モードの予測値として適用している。 As described above, in the prediction value generation method of the prediction mode in the conventional intra-screen prediction, as shown in FIG. 23, the smaller prediction mode number of the prediction mode in the upper and left adjacent blocks is used as the prediction value of the prediction mode. Applicable.

Ｈ．２６４では，４×４の符号化対象ブロックに対しては，図２０に示す９パターンの予測を行っており，予測モードの予測値と一致する予測モードを符号化する場合には，フラグ（１ビット）＋予測残差のＤＣＴ符号データとして結果を表記し，予測値と異なる予測モードを符号化する場合には，フラグ（１ビット）＋予測モードの補正値（３ビット）＋予測残差のＤＣＴ符号データとして結果を表記している。したがって，予測モードの予測値が外れると符号量が増えることになる。 H. In H.264, nine patterns shown in FIG. 20 are predicted for a 4 × 4 encoding target block, and a flag (1) is used to encode a prediction mode that matches the prediction value of the prediction mode. Bit) + prediction residual DCT code data, and when encoding a prediction mode different from the prediction value, flag (1 bit) + prediction mode correction value (3 bits) + prediction residual The result is expressed as DCT code data. Therefore, the code amount increases when the prediction value of the prediction mode deviates.

そこで，本発明は予測モードの予測値生成の精度を向上させるため，符号化対象ブロックに隣接する符号化済みブロックにおける予測モードによって決定される予測ベクトルに着目し，符号化対象ブロック中心方向へのベクトル（流入ベクトル）の有無を判定し，流入ベクトル有りと判断された１つ以上の符号化対象ブロックの予測モードの中から１つを選択することで符号化対象ブロックの予測モードの予測値を生成する。 Therefore, in order to improve the accuracy of prediction value generation in the prediction mode, the present invention focuses on the prediction vector determined by the prediction mode in the encoded block adjacent to the encoding target block, and moves toward the encoding target block center direction. The presence / absence of a vector (inflow vector) is determined, and the prediction value of the prediction mode of the encoding target block is selected by selecting one of the prediction modes of one or more encoding target blocks determined to have the inflow vector. Generate.

これを実現するための処理として，特に次の２段階の新しい処理機構を持つ。
［処理１］流入ベクトル有りの符号化済みブロックを抽出し，リストアップする。
［処理２］リストアップした符号化済みブロックに優先順序を設定し，最も優先順序の高いブロックの予測モードを符号化対象ブロックの予測モードの予測値として適用する。 As a process for realizing this, it has a new processing mechanism in the following two stages.
[Process 1] Extract encoded blocks with inflow vectors and list them.
[Process 2] A priority order is set for the listed encoded blocks, and the prediction mode of the block with the highest priority order is applied as the prediction value of the prediction mode of the encoding target block.

上記［処理１］の流入ベクトル有りの符号化済みブロックの抽出では，符号化対象ブロックに隣接するブロックのうち，どのブロックを対象として流入ベクトルの有無を判断するかを，外部から与えられる，または予め設定される設定値（ＳｗＫｅｙと呼ぶ）によって，変えることができる。 In the extraction of an encoded block with an inflow vector in [Process 1] above, it is given from the outside which block to determine the presence or absence of an inflow vector among blocks adjacent to an encoding target block, or It can be changed by a preset value (referred to as SwKey).

例えば，ＳｗＫｅｙに応じて次のようなパターンがある。
・ＳｗＫｅｙ＝１の場合，図３に示すＢｌｏｃｋＡ，Ｂの符号化済みブロックを用いて予測モードの予測を行う（実施例１として記載）。
・ＳｗＫｅｙ＝２の場合，図３に示すＢｌｏｃｋＡ，Ｂ，Ｆの符号化済みブロックを用いて予測モードの予測を行う（実施例２として記載）。
・ＳｗＫｅｙ＝３の場合，図３に示すＢｌｏｃｋＡ，Ｂ，ＦおよびＥ，Ｇの一方もしくは両方の符号化済みブロックを用いて予測モードの予測を行う（実施例３として記載）。
・ＳｗＫｅｙ＝４の場合，図３に示すＢｌｏｃｋＡ〜Ｈの符号化済みブロックを用いて予測モードの予測を行う（実施例４として記載）。 For example, there are the following patterns according to SwKey.
When SwKey = 1, prediction mode prediction is performed using the Block A and B encoded blocks shown in FIG. 3 (described as the first embodiment).
When SwKey = 2, the prediction mode is predicted using the encoded blocks of Block A, B, and F shown in FIG. 3 (described as Example 2).
When SwKey = 3, prediction mode prediction is performed using one or both of the blocks A, B, F, E, and G shown in FIG. 3 (described as Example 3).
When SwKey = 4, the prediction mode is predicted using the encoded blocks of Block A to H shown in FIG. 3 (described as Example 4).

また，上記［処理２］のリストアップした符号化済みブロックを優先順序に従って選択する方法としては，例えば次の方法を用いることができる。
〔ブロック距離別優先順序決定による選択方式（実施例５として記載）〕
符号化対象ブロックとの距離が近いブロックほど優先的に選択されるように，隣接するブロックにあらかじめ優先順序を設定しておき，リストアップした符号化済みブロックの中で最も優先順序が上位のブロックを選択する。
〔参照画素のエッジ検出による優先順序決定による選択方式（実施例６として記載）〕
参照画素におけるエッジ強度を算出し，最もエッジが強い方向を選出する。 As a method for selecting the encoded block listed in [Process 2] according to the priority order, for example, the following method can be used.
[Selection method by determining priority order by block distance (described as Example 5)]
Priorities are set in advance in adjacent blocks so that the blocks closer to the encoding target block are preferentially selected, and the block with the highest priority among the listed encoded blocks Select.
[Selection Method by Priority Order Determination by Edge Detection of Reference Pixel (described as Example 6)]
The edge strength at the reference pixel is calculated, and the direction with the strongest edge is selected.

以上の処理によって，予測モードの予測値生成の精度が向上するので，従来技術で説明した１ビットのフラグで予測モードを表記できるブロックを増やすことができる。そのため，予測モードの予測値と異なる場合に付加されていた補正値（３ビット）を削減できるようになる。なお，予測値生成の精度が向上するとは，従来のＨ．２６４で使用されているラグランジュコストの計算式によって選び出される予測モード番号が予測モードの予測値と一致する確率が高くなることを意味する。このように，予測モードの予測値生成の精度が向上することにより，画質一定の条件下では符号量の削減が可能になり，符号量一定の条件下では画質が向上し，符号化効率の向上が可能になる。 The accuracy of prediction value generation in the prediction mode is improved by the above processing, so that it is possible to increase the number of blocks in which the prediction mode can be expressed with the 1-bit flag described in the related art. Therefore, the correction value (3 bits) added when the prediction value is different from the prediction value in the prediction mode can be reduced. Note that the accuracy of predicted value generation is improved. This means that the probability that the prediction mode number selected by the Lagrangian cost calculation formula used in H.264 matches the prediction value of the prediction mode is increased. Thus, by improving the accuracy of prediction value generation in the prediction mode, it is possible to reduce the amount of code under the condition of constant image quality, improve the image quality under the condition of constant code amount, and improve the coding efficiency. Is possible.

上記［処理１］の意義について補足説明する。従来手法では，画像内予測の予測モード番号に応じて予測モードの予測値を生成していたが，予測モード番号から推測可能なエッジの連続性については考慮されていない。上記［処理１］では，隣接する符号化済みブロックから符号化対象ブロック中心方向へ向かう流入ベクトルの有無を判定し，流入ベクトル有りと判断された１つ以上の符号化対象ブロックの予測モードの中から１つを選択するので，予測モード番号から推測可能なエッジの連続性に着目して予測モードの予測値を生成することが実現でき，より精度の高い予測モードの予測値の生成が可能になる。 The significance of the above [Process 1] will be supplementarily described. In the conventional method, the prediction value of the prediction mode is generated according to the prediction mode number of intra prediction, but the continuity of the edges that can be estimated from the prediction mode number is not taken into consideration. In the above [Process 1], it is determined whether or not there is an inflow vector from the adjacent encoded block toward the center of the encoding target block, and one of the prediction modes of one or more encoding target blocks determined to have an inflow vector. Since one is selected from the above, it is possible to generate a prediction value of the prediction mode by focusing on the continuity of the edges that can be estimated from the prediction mode number, and it is possible to generate a prediction value of the prediction mode with higher accuracy Become.

さらに，ＳｗＫｅｙの設定値に応じて流入ベクトルの有無を判定する符号化済みブロックを決めることにより，隣接する複数の符号化済みブロックの中から，エッジが符号化対象ブロック中心方向に入り込む可能性の高い符号化済みブロックの探索を効率よく行うことができるようになる。 Further, by determining the encoded block for determining the presence or absence of the inflow vector according to the set value of SwKey, an edge may enter the encoding target block center direction from a plurality of adjacent encoded blocks. A search for a high-encoded block can be performed efficiently.

また，上記［処理２］では，上記［処理１］にてリストアップしたブロックに優先順序を設定し，最も優先順序の高いブロックの予測モードを符号化対象ブロックの予測モードの予測値として適用する。これは，上記［処理１］だけでは，隣接する符号化済みブロックから“流入ベクトル有り”と判断されたブロックが二つ以上存在する場合に，二つ以上の予測モードの中から一つを選択して符号化対象ブロックの予測モードの予測値とする必要があるからである。この［処理２］によって，エッジが符号化対象ブロック中心方向に入り込む可能性の高い複数の符号化済みブロックの中から，一つの予測モードを符号化対象ブロックの予測モードの予測値として適用することが可能になる。 In [Process 2], the priority order is set for the blocks listed in [Process 1], and the prediction mode of the block with the highest priority order is applied as the prediction value of the prediction mode of the encoding target block. . This is because [Process 1] alone selects one of two or more prediction modes when there are two or more blocks that are judged to have “inflow vectors” from adjacent coded blocks. This is because it is necessary to use the prediction value of the prediction mode of the encoding target block. By applying [Process 2], one prediction mode is applied as a prediction value of the prediction mode of the encoding target block from among a plurality of encoded blocks that are highly likely to have edges entering the encoding target block center direction. Is possible.

以下，図面を用いながら，本発明の実施形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は，本発明を適用した画面内予測符号化装置の一実施形態を示すブロック図である。本実施形態の画面内予測における予測モードの予測値生成方式を使用する符号化装置１０は，ラグランジュコスト最小値及びブロック符号化データ初期化部１１と，符号化対象ブロックに隣接する符号化済みブロックから予測モードを予測するための予測モードの予測部１２と，予測モード決定のための予測モード判定ループ部１３とから構成される。 FIG. 1 is a block diagram showing an embodiment of an intra prediction encoding apparatus to which the present invention is applied. The encoding device 10 that uses the prediction value generation method of the prediction mode in the intra prediction according to the present embodiment includes the Lagrange cost minimum value and block encoded data initialization unit 11 and an encoded block adjacent to the encoding target block. Are composed of a prediction mode prediction unit 12 for predicting a prediction mode and a prediction mode determination loop unit 13 for determining a prediction mode.

予測モードの予測部１２の内部は，流入ベクトル有りのブロック抽出部１２１，ブロック選択及び予測値生成部１２２から構成される。 The prediction unit 12 in the prediction mode includes a block extraction unit 121 with an inflow vector and a block selection / prediction value generation unit 122.

予測モード判定ループ部１３の内部は，予測モードの符号化部１３１，画面内予測によって予測残差を生成する画面内予測部１３２，予測残差の整数精度ＤＣＴ・量子化・符号化部１３３，ラグランジュコストの最小値判定を行うラグランジュコスト判定部１３４，ループの終了判定部１３５から構成される。 The prediction mode determination loop unit 13 includes a prediction mode encoding unit 131, an intra prediction unit 132 for generating a prediction residual by intra prediction, an integer precision DCT / quantization / encoding unit 133 for the prediction residual, It comprises a Lagrangian cost determination unit 134 that performs minimum value determination of a Lagrangian cost, and a loop end determination unit 135.

図２に，本発明の一実施形態の画面内符号化方式における符号化処理のフローチャートを示す。本実施形態では，従来手法の符号化アルゴリズムにおけるＰｒｅｄ＿ＰｒｅｄＭｏｄｅを決定するステップ（図２２のステップＳ８０２）を，図２のステップＳ１０２，Ｓ１０３に置き換えることで実現する。 FIG. 2 shows a flowchart of the encoding process in the intra-screen encoding system of one embodiment of the present invention. In the present embodiment, the step of determining Pred_PredMode in the conventional encoding algorithm (step S802 in FIG. 22) is replaced with steps S102 and S103 in FIG.

まず，ステップＳ１０１にて，ステップＳ８０１と同様に，ラグランジュコスト最小値Ｌ_minの初期値を＋∞とし，ラグランジュコストが最小値をとる時のビット列Ｒ_bestの初期値を０とする。ステップＳ１０２では，処理分岐係数ＳｗＫｅｙを取得する。このＳｗＫｅｙは１〜４の範囲でユーザが任意に設定しておく係数である。 First, in step S101, as in step S801, the initial value of the Lagrangian cost minimum value L _min is set to + ∞, and the initial value of the bit string R _best when the Lagrangian cost takes the minimum value is set to 0. In step S102, a processing branch coefficient SwKey is acquired. This SwKey is a coefficient that is arbitrarily set by the user in the range of 1-4.

この例では，あらかじめ設定されたＳｗＫｅｙの値によって，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する４種類の処理方法を切り分けているが，本発明は，以下に説明する４種類の中の一つの処理方法に限定して実施することも可能であり，その場合には，ＳｗＫｅｙの判定による処理方法の選択処理は省略することができる。 In this example, four types of processing methods for determining Pred_PredMode are classified according to a preset SwKey value. However, the present invention is limited to one of the four types of processing methods described below. In this case, the processing method selection process based on the SwKey determination can be omitted.

ステップＳ１０３では，ＳｗＫｅｙの値を判定し，ＳｗＫｅｙに応じた処理アルゴリズムによる処理を実行して，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する。このＳｗＫｅｙに応じてＰｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する処理の詳細については，実施例１〜４として後述する。 In step S103, the value of SwKey is determined, processing by a processing algorithm corresponding to SwKey is executed, and Pred_PredMode is determined. Details of the process of determining Pred_PredMode in accordance with the SwKey will be described later as Embodiments 1 to 4.

次にステップＳ１０４のループ処理により，予測モード０から（ＭａｘＰｒｅｄ−１）までの各予測モードについて，ステップＳ１０５〜Ｓ１１３の処理を繰り返す。なお，ステップＳ１０４のＭａｘＰｒｅｄは予測方向最大値を表す。ステップＳ１０４以降の処理は，従来技術として説明した図２２のステップＳ８０３以降の処理と同様であるため，説明は省略する。 Next, the processing in steps S105 to S113 is repeated for each prediction mode from prediction mode 0 to (MaxPred-1) by the loop processing in step S104. Note that MaxPred in step S104 represents the maximum value in the prediction direction. The processing after step S104 is the same as the processing after step S803 in FIG.

また，本実施形態では，ステップＳ１０３にて符号化対象ブロックに隣接する最大８つの符号化済みブロックの予測モードを参照するものとする。ここで，符号化対象ブロックに隣接するブロックを図３のように設定する。 In the present embodiment, it is assumed that prediction modes of up to eight encoded blocks adjacent to the encoding target block are referred to in step S103. Here, the blocks adjacent to the encoding target block are set as shown in FIG.

〔実施例１〕
実施例１は，ＳｗＫｅｙ＝１の場合のＰｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する処理（予測値生成処理）の例であり，図３に示すＢｌｏｃｋＡ，Ｂの符号化済みブロックを用いて予測モードの予測を行う場合の例である。 [Example 1]
The first embodiment is an example of processing (prediction value generation processing) for determining Pred_PredMode when SwKey = 1. In the case where prediction mode prediction is performed using the Block A and B encoded blocks shown in FIG. It is an example.

図４に，隣接ブロックから流入ベクトルの有無を判断するために用いる角度閾値を示す。ここでθ_A，θ_Bは，それぞれＢｌｏｃｋＡ，Ｂにおける予測モードから決定される予測角度を表す。すなわち，角度閾値（４５°，１３５°，２２５°）に対して，
４５°≦θ_A＜１３５°
１３５°≦θ_B＜２２５°
を満たすかどうかにより，流入ベクトルの有無が判断される。 FIG. 4 shows an angle threshold used for determining the presence or absence of an inflow vector from an adjacent block. Here, θ _A and θ _B represent prediction angles determined from prediction modes in Block _A and Block _B , respectively. That is, for the angle threshold (45 °, 135 °, 225 °),
45 ° ≦ θ _A <135 °
135 ° ≦ θ _B <225 °
Whether or not there is an inflow vector is determined depending on whether or not

図５に，実施例１の予測値生成処理のフローチャートを示す。まず，ステップＳ２０１において，予測モードの予測値Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅをＤＣ成分で，流入ベクトルが存在するブロック番号を格納するバッファＢｌｏｃｋＢｕｆを０で，流入ベクトルが存在する数ＭａｘＥｌｅｍを０で初期化する。 FIG. 5 shows a flowchart of the predicted value generation process of the first embodiment. First, in step S201, the prediction value Pred_PredMode of the prediction mode is initialized with a DC component, the buffer BlockBuf that stores the block number in which the inflow vector exists is initialized with 0, and the number MaxElem with the inflow vector exists with 0.

次に，ステップＳ２０２において，符号化対象ブロックに隣接するＴａｒｇｅｔＢｌｏｃｋ∈［ＢｌｏｃｋＡ，Ｂ］のループを行い，ステップＳ２０３にてＴａｒｇｅｔＢｌｏｃｋが符号化済みブロックか否かの分岐を行う。この分岐において真のＴａｒｇｅｔＢｌｏｃｋのみ，ステップＳ２０４においてＴａｒｇｅｔＢｌｏｃｋのＰｒｅｄＭｏｄｅから予測角度θを算出する。 Next, in Step S202, a loop of TargetBlockε [BlockA, B] adjacent to the encoding target block is performed, and in Step S203, it is branched whether or not TargetBlock is an encoded block. For only the true TargetBlock in this branch, the predicted angle θ is calculated from the PredMode of the TargetBlock in Step S204.

続いて，ステップＳ２０５において，θが図４における閾値以内か否かの分岐処理を行い，真のＴａｒｇｅｔＢｌｏｃｋに対して，ステップＳ２０６にてＢｌｏｃｋＢｕｆにＴａｒｇｅｔＢｌｏｃｋを追加格納し，ＭａｘＥｌｅｍを１増やす。ステップＳ２０２〜Ｓ２０６のループにて流入ベクトルが存在するブロックの抽出を終了すると，ステップＳ２０７にてＢｌｏｃｋＢｕｆの中から最適な予測モードを一つ選出し（選出手法の実施例は“実施例５，６”に記載），Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅに適用する。 Subsequently, in step S205, branch processing is performed to determine whether θ is within the threshold in FIG. 4, and in step S206, TargetBlock is additionally stored in BlockBuf, and MaxElem is incremented by one for true TargetBlock. When the extraction of the block in which the inflow vector exists is completed in the loop of steps S202 to S206, one optimum prediction mode is selected from BlockBuf in step S207 (the example of the selection method is “Examples 5 and 6”). ”), And applies to Pred_PredMode.

〔実施例２〕
実施例２は，ＳｗＫｅｙ＝２の場合のＰｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する処理（予測値生成処理）の例であり，図３に示すＢｌｏｃｋＡ，Ｂ，Ｆの符号化済みブロックを用いて予測モードの予測を行う場合の例である。 [Example 2]
The second embodiment is an example of processing (prediction value generation processing) for determining Pred_PredMode when SwKey = 2, and prediction mode prediction is performed using the encoded blocks of Block A, B, and F shown in FIG. This is an example.

図６に，隣接ブロックから流入ベクトルの有無を判断するために用いる角度閾値を示す。ここで，θ_A，θ_B，θ_FはそれぞれＢｌｏｃｋＡ，Ｂ，Ｆにおける予測モードから決定される予測角度を表す。すなわち，角度閾値（４５°，１１２．５°，１５７．５°，２２５°）に対して，
４５°≦θ_A＜１１２．５°
１１２．５°≦θ_F＜１５７．５°
１５７．５°≦θ_B＜２２５°
を満たすかどうかにより，流入ベクトルの有無が判断される。 FIG. 6 shows an angle threshold used for determining the presence or absence of an inflow vector from an adjacent block. Here, θ _A , θ _B , and θ _F represent prediction angles determined from the prediction modes in Block A, B, and F, respectively. That is, for the angle threshold (45 °, 112.5 °, 157.5 °, 225 °),
45 ° ≦ θ _A <112.5 °
112.5 ° ≦ θ _F <157.5 °
157.5 ° ≦ θ _B <225 °
Whether or not there is an inflow vector is determined depending on whether or not

図７に，実施例２の予測値生成処理のフローチャートを示す。まず，ステップＳ３０１において，予測モードの予測値Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅをＤＣ成分で，流入ベクトルが存在するブロック番号を格納するバッファＢｌｏｃｋＢｕｆを０で，流入ベクトルが存在する数ＭａｘＥｌｅｍを０で初期化する。 FIG. 7 shows a flowchart of predicted value generation processing according to the second embodiment. First, in step S301, the prediction value Pred_PredMode of the prediction mode is initialized with the DC component, the buffer BlockBuf that stores the block number in which the inflow vector exists is initialized to 0, and the number MaxElem in which the inflow vector exists is initialized to 0.

次に，ステップＳ３０２において，符号化対象ブロックに隣接するＴａｒｇｅｔＢｌｏｃｋ∈［ＢｌｏｃｋＡ，Ｂ，Ｆ］のループを行い，ステップＳ３０３にてＴａｒｇｅｔＢｌｏｃｋが符号化済みブロックか否かの分岐を行う。この分岐において真のＴａｒｇｅｔＢｌｏｃｋのみ，ステップＳ３０４においてＴａｒｇｅｔＢｌｏｃｋのＰｒｅｄＭｏｄｅから予測角度θを算出する。 Next, in Step S302, a loop of TargetBlockε [BlockA, B, F] adjacent to the encoding target block is performed, and branching is performed in Step S303 as to whether or not the TargetBlock is an encoded block. For only the true TargetBlock in this branch, the predicted angle θ is calculated from the PredMode of the TargetBlock in Step S304.

続いて，ステップＳ３０５において，θが図６における閾値以内か否かの分岐処理を行い，真のＴａｒｇｅｔＢｌｏｃｋに対して，ステップＳ３０６にてＢｌｏｃｋＢｕｆにＴａｒｇｅｔＢｌｏｃｋを追加格納し，ＭａｘＥｌｅｍを１増やす。ステップＳ３０２〜Ｓ３０６のループにて流入ベクトルが存在するブロックの抽出を終了すると，ステップＳ３０７にてＢｌｏｃｋＢｕｆの中から最適な予測モードを一つ選出し（選出手法の実施例は“実施例５，６”に記載），Ｐｒｅｄ＿ｐｒｅｄＭｏｄｅに適用する。 Subsequently, in step S305, branch processing is performed to determine whether θ is within the threshold in FIG. 6, and TargetBlock is additionally stored in BlockBuf in step S306 with respect to the true TargetBlock, and MaxElem is incremented by one. When the extraction of the block in which the inflow vector exists is completed in the loop of steps S302 to S306, one optimum prediction mode is selected from BlockBuf in step S307 (the example of the selection method is “Examples 5 and 6”). ”), And applies to Pred_predMode.

〔実施例３〕
実施例３は，ＳｗＫｅｙ＝３の場合のＰｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する処理（予測値生成処理）の例であり，図３に示すＢｌｏｃｋＡ，Ｂ，ＦおよびＥ，Ｇの一方もしくはその両方の符号化済みブロックを用いて予測モードの予測を行う場合の例である。 Example 3
The third embodiment is an example of processing (predicted value generation processing) for determining Pred_PredMode when SwKey = 3, and one or both of the blocks A, B, F and E, G shown in FIG. It is an example in the case of performing prediction of prediction mode using.

図８に，ＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ，Ｇを用いて隣接ブロックから流入ベクトルの有無を判断するために用いる角度閾値を，図９に，ＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆを用いて隣接ブロックから流入ベクトルの有無を判断するために用いる角度閾値を，図１０にＢｌｏｃｋＡ，Ｂ，Ｆ，Ｇを用いて隣接ブロックから流入ベクトルの有無を判断するために用いる角度閾値を示す。ここで，図８，図９，図１０のθ_A，θ_B，θ_E，θ_F，θ_Gは，それぞれＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ，Ｇにおける予測モードから決定される予測角度を表す。 FIG. 8 shows the angle threshold used to determine the presence or absence of an inflow vector from an adjacent block using Blocks A, B, E, F, and G. FIG. 9 shows the angle threshold value from the adjacent block using Blocks A, B, E, and F. FIG. 10 shows angle thresholds used to determine the presence or absence of an inflow vector from adjacent blocks using Blocks A, B, F, and G. FIG. Here, θ _A , θ _B , θ _E , θ _F , and θ _G in FIGS. 8, 9, and 10 represent prediction angles determined from the prediction modes in Block A, B, E, F, and G, respectively.

図８の例では，角度閾値（２２．５°，６７．５°，１１２．５°，１５７．５°，２０２．５°，２４７．５°）に対して，
２２．５°≦θ_E＜６７．５°
６７．５°≦θ_A＜１１２．５°
１１２．５°≦θ_F＜１５７．５°
１５７．５°≦θ_B＜２０２．５°
２０２．５°≦θ_G＜２４７．５°
を満たすかどうかにより，流入ベクトルの有無が判断される。 In the example of FIG. 8, with respect to the angle threshold (22.5 °, 67.5 °, 112.5 °, 157.5 °, 202.5 °, 247.5 °),
22.5 ° ≦ θ _E <67.5 °
67.5 ° ≦ θ _A <112.5 °
112.5 ° ≦ θ _F <157.5 °
157.5 ° ≦ θ _B <202.5 °
202.5 ° ≦ θ _G <247.5 °
Whether or not there is an inflow vector is determined depending on whether or not

また，図９の例では，角度閾値（２２．５°，６７．５°，１１２．５°，１５７．５°，２２５°）に対して，
２２．５°≦θ_E＜６７．５°
６７．５°≦θ_A＜１１２．５°
１１２．５°≦θ_F＜１５７．５°
１５７．５°≦θ_B＜２２５°
を満たすかどうかにより，流入ベクトルの有無が判断される。 Moreover, in the example of FIG. 9, with respect to the angle threshold (22.5 °, 67.5 °, 112.5 °, 157.5 °, 225 °),
22.5 ° ≦ θ _E <67.5 °
67.5 ° ≦ θ _A <112.5 °
112.5 ° ≦ θ _F <157.5 °
157.5 ° ≦ θ _B <225 °
Whether or not there is an inflow vector is determined depending on whether or not

図１０の例では，角度閾値（４５°，１１２．５°，１５７．５°，２０２．５°，２４７．５°）に対して，
４５°≦θ_A＜１１２．５°
１１２．５°≦θ_F＜１５７．５°
１５７．５°≦θ_B＜２０２．５°
２０２．５°≦θ_G＜２４７．５°
を満たすかどうかにより，流入ベクトルの有無が判断される。 In the example of FIG. 10, for the angle threshold values (45 °, 112.5 °, 157.5 °, 202.5 °, 247.5 °),
45 ° ≦ θ _A <112.5 °
112.5 ° ≦ θ _F <157.5 °
157.5 ° ≦ θ _B <202.5 °
202.5 ° ≦ θ _G <247.5 °
Whether or not there is an inflow vector is determined depending on whether or not

図１１に，例としてＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ，Ｇを用いて予測モードの予測を行う場合における予測値生成処理のフローチャートを示す。 FIG. 11 shows a flowchart of a predicted value generation process when prediction mode prediction is performed using Block A, B, E, F, and G as an example.

まず，ステップＳ４０１において，予測モードの予測値Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅをＤＣ成分で，流入ベクトルが存在するブロック番号を格納するバッファＢｌｏｃｋＢｕｆを０で，流入ベクトルが存在する数ＭａｘＥｌｅｍを０で初期化する。 First, in step S401, the prediction value Pred_PredMode of the prediction mode is initialized with the DC component, the buffer BlockBuf that stores the block number in which the inflow vector exists, and the number MaxElem in which the inflow vector exists is set to 0.

次に，ステップＳ４０２において，符号化対象ブロックに隣接するＴａｒｇｅｔＢｌｏｃｋ∈［ＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ，Ｇ］のループを行い，ステップＳ４０３にてＴａｒｇｅｔＢｌｏｃｋが符号化済みブロックか否かの分岐を行う。この分岐において真のＴａｒｇｅｔＢｌｏｃｋのみ，ステップＳ４０４においてＴａｒｇｅｔＢｌｏｃｋのＰｒｅｄＭｏｄｅから予測角度θを算出する。 Next, in Step S402, a loop of TargetBlockε [Block A, B, E, F, G] adjacent to the encoding target block is performed, and branching is performed in Step S403 as to whether or not the TargetBlock is an encoded block. In this branch, only the true TargetBlock is calculated, and the predicted angle θ is calculated from the PredMode of the TargetBlock in Step S404.

続いて，ステップＳ４０５において，θが図８における閾値以内か否かの分岐処理を行い，真のＴａｒｇｅｔＢｌｏｃｋに対して，ステップＳ４０６にてＢｌｏｃｋＢｕｆにＴａｒｇｅｔＢｌｏｃｋを追加格納し，ＭａｘＥｌｅｍを１増やす。ステップＳ４０２〜Ｓ４０６のループにて流入ベクトルが存在するブロックの抽出を終了すると，ステップＳ４０７にてＢｌｏｃｋＢｕｆの中から最適な予測モードを一つ選出し（選出手法の実施例は“実施例５，６”に記載），Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅに適用する。 Subsequently, in step S405, branch processing is performed to determine whether θ is within the threshold value in FIG. 8, and in step S406, TargetBlock is additionally stored in BlockBuf, and MaxElem is incremented by 1 for true TargetBlock. When the extraction of the block having the inflow vector is completed in the loop of steps S402 to S406, one optimum prediction mode is selected from BlockBuf in step S407 (the example of the selection method is “Examples 5 and 6”). ”), And applies to Pred_PredMode.

〔実施例４〕
実施例４は，ＳｗＫｅｙ＝４の場合のＰｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する処理（予測値生成処理）の例であり，図３に示すＢｌｏｃｋＡ〜Ｈのブロック全てを用いて適応的に予測モードの予測を行う場合の例である。 Example 4
The fourth embodiment is an example of processing (prediction value generation processing) for determining Pred_PredMode when SwKey = 4, and when prediction mode prediction is adaptively performed using all the blocks A to H shown in FIG. It is an example.

図１２に，ＢｌｏｃｋＡ〜Ｈを用いて隣接ブロックから流入ベクトルの有無を判断するために用いる角度閾値を示す。ここで，θ_A〜θ_HはそれぞれＢｌｏｃｋＡ〜Ｈにおける予測モードから決定される予測角度を表す。 FIG. 12 shows angle thresholds used to determine the presence or absence of an inflow vector from an adjacent block using Blocks A to H. Here, θ _{A to} θ _H represent prediction angles determined from prediction modes in Blocks _{A to} _H , respectively.

図１２の例では，角度閾値（２２．５°，６７．５°，１１２．５°，１５７．５°，２０２．５°，２４７．５°，２９２．５°，３３７．５°）に対して，
２２．５°≦θ_E＜６７．５°
６７．５°≦θ_A＜１１２．５°
１１２．５°≦θ_F＜１５７．５°
１５７．５°≦θ_B＜２０２．５°
２０２．５°≦θ_G＜２４７．５°
２４７．５°≦θ_C＜２９２．５°
２９２．５°≦θ_H＜３３７．５°
３３７．５°≦θ_D＜２２．５°
を満たすかどうかにより，流入ベクトルの有無が判断される。 In the example of FIG. 12, the angle threshold (22.5 °, 67.5 °, 112.5 °, 157.5 °, 202.5 °, 247.5 °, 292.5 °, 337.5 °) is set. for,
22.5 ° ≦ θ _E <67.5 °
67.5 ° ≦ θ _A <112.5 °
112.5 ° ≦ θ _F <157.5 °
157.5 ° ≦ θ _B <202.5 °
202.5 ° ≦ θ _G <247.5 °
247.5 ° ≦ θ _C <292.5 °
292.5 ° ≦ θ _H <337.5 °
337.5 ° ≦ θ _D <22.5 °
Whether or not there is an inflow vector is determined depending on whether or not

図１３に，本実施例の予測値生成処理のフローチャートを示す。まず，ステップＳ５０１において，予測モードの予測値Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅをＤＣ成分で，流入ベクトルが存在するブロック番号を格納するバッファＢｌｏｃｋＢｕｆを０で，流入ベクトルが存在する数ＭａｘＥｌｅｍを０で初期化する。次に，ステップＳ５０２において，符号化対象ブロックに隣接するＴａｒｇｅｔＢｌｏｃｋ∈［ＢｌｏｃｋＡ，Ｂ，... ，Ｈ］のループを行い，ステップＳ５０３にてＴａｒｇｅｔＢｌｏｃｋが符号化済みブロックか否かの分岐処理を行う。 FIG. 13 shows a flowchart of predicted value generation processing of this embodiment. First, in step S501, the prediction value Pred_PredMode of the prediction mode is initialized with a DC component, the buffer BlockBuf that stores the block number in which the inflow vector exists, 0, and the number MaxElem in which the inflow vector exists is initialized with 0. Next, in Step S502, a loop of TargetBlockε [BlockA, B,..., H] adjacent to the encoding target block is performed, and in Step S503, a branch process is performed to determine whether or not the TargetBlock is an encoded block. .

続いて，ステップＳ５０４にてＴａｒｇｅｔＢｌｏｃｋのＰｒｅｄＭｏｄｅから予測角度θを算出し，ステップＳ５０５において，θが図１２における閾値以内か否かの分岐処理を行う。上記ステップＳ５０３，Ｓ５０５の両方が真のＴａｒｇｅｔＢｌｏｃｋに対して，ステップＳ５０６にてＢｌｏｃｋＢｕｆにＴａｒｇｅｔＢｌｏｃｋを追加格納し，ＭａｘＥｌｅｍを１増やす。ステップＳ５０２〜Ｓ５０６のループにて流入ベクトルが存在するブロックの抽出を終了すると，ステップＳ５０７にてＢｌｏｃｋＢｕｆの中から最適な予測モードを一つ選出し（選出手法の実施例は“実施例５，６”に記載），Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅに適用する。 Subsequently, in step S504, the predicted angle θ is calculated from the PredMode of TargetBlock, and in step S505, branch processing is performed to determine whether θ is within the threshold in FIG. In step S506, TargetBlock is additionally stored in BlockBuf, and MaxElm is incremented by 1, for both true SBlocks in steps S503 and S505. When the extraction of the block in which the inflow vector exists is completed in the loop of steps S502 to S506, one optimal prediction mode is selected from BlockBuf in step S507 (the example of the selection method is “Examples 5 and 6”). ”), And applies to Pred_PredMode.

〔実施例５（ブロック距離別優先順序決定による選択方式）〕
例として，４×４の符号化対象ブロックに隣接するＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ（図３参照）がＢｌｏｃｋＢｕｆに格納されていた場合における，ブロック距離による予測モードの予測値を生成する手法を記す。 [Example 5 (selection method based on priority order determination by block distance)]
As an example, a method for generating a prediction value of a prediction mode based on a block distance when Blocks A, B, E, and F (see FIG. 3) adjacent to a 4 × 4 encoding target block are stored in BlockBuf is described. .

符号化対象ブロックに対して最も距離が近いブロックはＢｌｏｃｋＡ，Ｂであり，ＢｌｏｃｋＥ，ＦはＢｌｏｃｋＡ，Ｂと比較して距離が離れている。したがって，ＢｌｏｃｋＡ，Ｂのほうが画素間相関は高いと判断することができる。ＢｌｏｃｋＥにおける参照画素（ＲｅｆＰｉｘｅｌ）Ｅ〜Ｈは符号化対象ブロックからの距離が離れているため，画素間相関はＢｌｏｃｋＦと比較して低いと判断することができる。ＢｌｏｃｋＡ，Ｂのブロック距離は等しいが，便宜的にＢｌｏｃｋＡのほうが優先度が高いと仮定し，ＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆの優先順序を次の式２の不等式で定める。 The blocks closest to the encoding target block are Block A and B, and Block E and F are far apart from Block A and B. Therefore, it can be determined that Block A and B have higher inter-pixel correlation. Since the reference pixels (RefPixels) E to H in Block E are far from the encoding target block, it is possible to determine that the inter-pixel correlation is lower than that in Block F. Although the block distances of Block A and B are equal, for the sake of convenience, it is assumed that Block A has a higher priority, and the priority order of Block A, B, E, and F is defined by the following inequality.

ＢｌｏｃｋＡ＞ＢｌｏｃｋＢ＞ＢｌｏｃｋＦ＞ＢｌｏｃｋＥ …（式２）
上記で定めた優先順序に従う場合，ＢｌｏｃｋＡのＰｒｅｄＭｏｄｅをＢｅｓｔＰｒｅｄＭｏｄｅとして適用する。また，ＢｌｏｃｋＢ，ＥがＢｌｏｃｋＢｕｆに格納されていたと仮定すると，ＢｌｏｃｋＢのＰｒｅｄＭｏｄｅをＢｅｓｔＰｒｅｄＭｏｄｅとして適用する。このように隣接するブロックにあらかじめ優先順序を設定しておくことで，複数の流入ベクトルが存在する符号化対象ブロックにおいてもＰｒｅｄ＿ＰｒｅｄＭｏｄｅを一つに選出することが可能となる。 Block A> Block B> Block F> Block E (Formula 2)
When the priority order defined above is followed, BlockA's PredMode is applied as BestPredMode. Further, assuming that BlockB and E are stored in BlockBuf, the PredMode of BlockB is applied as BestPredMode. By setting the priority order in advance to adjacent blocks in this way, it becomes possible to select Pred_PredMode as one even in an encoding target block having a plurality of inflow vectors.

〔実施例６（参照画素のエッジ検出による優先順序決定による選択方式）〕
エッジの強度をより重視した予測モードを予測する例として，４×４の符号化対象ブロックに隣接するＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ（図３参照）がＢｌｏｃｋＢｕｆに格納されていた場合における，エッジ検出による予測モードの予測値を生成する手法の処理の例を示すフローチャートを，図１４に示す。 [Embodiment 6 (Selection Method by Priority Order Determination by Edge Detection of Reference Pixel)]
As an example of predicting a prediction mode that emphasizes edge strength, edge detection is performed when Block A, B, E, and F (see FIG. 3) adjacent to a 4 × 4 encoding target block are stored in BlockBuf. FIG. 14 is a flowchart showing an example of processing of a method for generating a prediction value of the prediction mode by the method.

まず，ステップＳ６０１において，ＢｌｏｃｋＢｕｆ，ＭａｘＥｌｅｍ，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅを受け取り，ステップＳ６０２で最大エッジ強度Ｅ_maxを０で初期化する。次に，ステップＳ６０３にて図２０における参照画素∈［Ａ〜Ｌ，Ｘ］と同じ画素におけるエッジ強度ＥｄｇｅＰｏｗｅｒ∈［Ａ″〜Ｌ″，Ｘ″］を算出する。このときＥｄｇｅＰｏｗｅｒの算出には輪郭抽出フィルタを用いる。 First, in step S601, BlockBuf, MaxElem, receive Pred_PredMode, initialized to 0 the maximum edge intensity E _max at step S602. Next, in step S603, the edge strength EdgePower∈ [A ″ to L ″, X ″] is calculated for the same pixel as the reference pixel ∈ [A to L, X] in FIG. Use an extraction filter.

輪郭抽出フィルタの一例を式３に記す。 An example of the contour extraction filter is shown in Equation 3.

ＥｄｇｅＰｏｗｅｒ＝｜ｉ−ｘ｜＋｜ｊ−ｘ｜ …（式３）
ここで，ｘはＥｄｇｅＰｏｗｅｒを求める画素における画素値，ｉ，ｊはｘの左右・上下に隣接する画素（参照画素∈［Ａ〜Ｌ，Ｘ］）における画素値を表す。ＥｄｇｅＰｏｗｅｒを計算するとき，ｉ，ｊ，ｘにはノイズ除去を行った値を用いるのが望ましいが，画面内予測で用いる参照画素はデブロッキングフィルタ適用前の画素値が適用されるため，特に参照画素Ａ，Ｄ，Ｅ，Ｉ，Ｘでは必要以上にエッジが検出されやすい。そのため，特に低ビットレート時には，これらの画素に対しては強めのノイズ除去を適用する。 EdgePower = | i−x | + | j−x | (Formula 3)
Here, x represents a pixel value in a pixel for obtaining EdgePower, and i and j represent pixel values in adjacent pixels (reference pixels ∈ [A to L, X]) on the left, right, and top and bottom of x. When calculating EdgePower, it is desirable to use values obtained by removing noise for i, j, and x. However, since the reference pixel used in the intra prediction is the pixel value before applying the deblocking filter, it is particularly referred to. In pixels A, D, E, I, and X, an edge is easily detected more than necessary. Therefore, strong noise reduction is applied to these pixels, especially at low bit rates.

続いて，ステップＳ６０４にてＢｌｏｃｋＢｕｆに蓄積されたブロックのループを行う。ステップＳ６０５においてＢｌｏｃｋＢｕｆにＩＤ番目に格納されたＴａｒｇｅｔＢｌｏｃｋのＰｒｅｄＭｏｄｅを取得し，ステップＳ６０６にて符号化対象ブロック内エッジ強度Ｅ_blockを０で初期化する。 In step S604, a loop of the blocks accumulated in BlockBuf is performed. In step S605, the PredMode of TargetBlock stored in the IDth block in BlockBuf is acquired. In step S606, the edge strength E _block in the encoding target block is initialized to zero.

次に，ステップＳ６０７にて符号化対象ブロック内の予測画素（以下，ＰｒｅｄＰｉｘｅｌ）全てに対してループを行う。ステップＳ６０８では，ＰｒｅｄＭｏｄｅの参照画素位置に存在するＥｄｇｅＰｏｗｅｒを参照し，ＰｒｅｄＰｉｘｅｌの座標値におけるＥｄｇｅＰｏｗｅｒの影響力ＥｄｇｅＩｎｆｌｕｅｎｃｅを計算する。ＥｄｇｅＩｎｆｌｕｅｎｃｅは参照画素に対して距離の二乗に反比例すると仮定し，画素間距離を１として，次の式４を用いて算出を行う。 Next, in step S607, a loop is performed for all prediction pixels (hereinafter, PredPixel) in the encoding target block. In step S608, EdgePower existing at the reference pixel position of PredMode is referred to, and EdgePower's influence EdgeInfluence at the coordinate value of PredPixel is calculated. It is assumed that EdgeInfluence is inversely proportional to the square of the distance with respect to the reference pixel, and the distance between the pixels is set to 1 and calculation is performed using the following Expression 4.

ＥｄｇｅＩｎｆｌｕｅｎｃｅ
＝｜ＥｄｇｅＰｏｗｅｒ｜／ＲｅｆＬｅｎｇｔｈ² …（式４）
ここで，ＲｅｆＬｅｎｇｔｈは図１５に示すようにＰｒｅｄＰｉｘｅｌと参照画素間の距離を表す。 EdgeInfluence
= | EdgePower | / RefLength ² (Formula 4)
Here, RefLength represents the distance between the PredPixel and the reference pixel as shown in FIG.

続いて，ステップＳ６０９でＥ_blockにＥｄｇｅＩｎｆｌｕｅｎｃｅを加算する。ステップＳ６１０でＥ_maxとＥ_blockの比較を行い，Ｅ_blockのほうが大きければ，ステップＳ６１１にてＥ_maxをＥ_blockで更新し，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅをＰｒｅｄＭｏｄｅで更新する。 In step S609, EdgeInfluence is added to E _block . It performs a comparison of the E _max and E _block in step S610, the larger the better of E _block, to update the E _max in E _block at step S611, to update the Pred_PredMode in PredMode.

これにより，符号化対象ブロックに対して複数の流入ベクトルが存在する符号化対象ブロックにおいても，最もエッジが強い方向をＰｒｅｄ＿ＰｒｅｄＭｏｄｅに選出することが可能となる。 As a result, the direction with the strongest edge can be selected as Pred_PredMode even in the encoding target block in which a plurality of inflow vectors exist for the encoding target block.

図１６は，本発明を適用した画面内予測復号装置の一実施形態を示すブロック図である。本実施形態の画面内予測における予測モードの予測値生成方式を使用する復号装置２０は，復号対象ブロックに隣接する復号済みブロックから予測モードを予測するための予測モードの予測部２１と，符号化データから予測モードを決定する予測モード決定部２２と，符号化データから予測残差を復号する残差信号復号部２３と，復号画像生成部２４とから構成される。 FIG. 16 is a block diagram showing an embodiment of the intra prediction decoding apparatus to which the present invention is applied. The decoding apparatus 20 that uses the prediction value generation method of the prediction mode in the intra prediction according to the present embodiment includes a prediction mode prediction unit 21 for predicting a prediction mode from a decoded block adjacent to a decoding target block, and encoding. A prediction mode determination unit 22 that determines a prediction mode from data, a residual signal decoding unit 23 that decodes a prediction residual from encoded data, and a decoded image generation unit 24 are configured.

予測モードの予測部２１の内部は，流入ベクトル有りのブロック抽出部２１１，ブロック選択及び予測値生成部２１２から構成される。 The prediction unit 21 in the prediction mode includes a block extraction unit 211 having an inflow vector and a block selection / prediction value generation unit 212.

図１７に，本発明の一実施形態の映像復号方式における復号処理のフローチャートを示す。 FIG. 17 shows a flowchart of the decoding process in the video decoding system according to the embodiment of the present invention.

まず，ステップＳ７０１において予測モードの予測値が一致しているか否かを示す１ビットの予測フラグＦを取得する。次に，ステップＳ７０２では，図２の符号化時におけるステップＳ１０２と同様に，処理分岐係数ＳｗＫｅｙを取得し，続いてステップＳ７０３では，図２の符号化時におけるステップＳ１０３と同様に，ＳｗＫｅｙの値を判定し，ＳｗＫｅｙに応じた処理アルゴリズム（前述した実施例１〜４）による処理を実行して，Ｐｒｅｄ＿ＰｒｅｄＭｏｄｅを決定する。 First, in step S701, a 1-bit prediction flag F indicating whether the prediction values in the prediction mode match is acquired. Next, in step S702, the processing branch coefficient SwKey is acquired in the same manner as in step S102 at the time of encoding in FIG. 2, and then in step S703, the value of SwKey is determined in the same manner as in step S103 in the encoding in FIG. And processing by the processing algorithm (the first to fourth embodiments described above) according to SwKey is performed to determine Pred_PredMode.

続いて，ステップＳ７０４で予測フラグＦの予測が当たっているか否かの条件分岐を行う。予測が当たっていた場合，ステップＳ７０５にてＰｒｅｄＭｏｄｅをＰｒｅｄ＿ＰｒｅｄＭｏｄｅとし，外れていた場合にはステップＳ７０６にて予測モードの補正値を取得してＰｒｅｄＭｏｄｅに適用する。 Subsequently, in step S704, a conditional branch is performed to determine whether or not the prediction flag F is predicted. If the prediction is correct, PredMode is set to Pred_PredMode in step S705, and if not, the correction value of the prediction mode is acquired and applied to PredMode in step S706.

続いて，ステップＳ７０７にてＰｒｅｄＭｏｄｅから予測値を生成し，ステップＳ７０８で予測残差符号データＤを取得する。ステップＳ７０９において予測残差符号データＤから予測残差を復号し，最後にステップＳ７１０において予測値と予測残差の加算によって復号信号を生成する。 Subsequently, a prediction value is generated from PredMode in step S707, and prediction residual code data D is acquired in step S708. In step S709, the prediction residual is decoded from the prediction residual code data D. Finally, in step S710, a decoded signal is generated by adding the prediction value and the prediction residual.

〔実験例〕
図１８は，テストシーケンスにおけるＲＤ曲線（６０フレームの平均値）を示す。本発明の提案手法による実施形態を，国際的な映像圧縮性能の比較基準であるＪＭｖｅｒ．１１．０ＫＴＡ１．９（非特許文献４参照）に実装し，従来手法と本発明との性能比較を行った結果は，図１８に示すとおりであった（比較したデータはすべて輝度信号のみ）。データは，６０フレームをすべて画面内予測にて符号化を行った際の平均値である。シーケンスは“ＦＯＲＥＭＡＮ”，解像度はＣＩＦ（３５２×２８８）を用いた。 [Experimental example]
FIG. 18 shows an RD curve (average value of 60 frames) in the test sequence. The embodiment according to the proposed method of the present invention is compared with JM ver. 11.0 KTA1.9 (see Non-Patent Document 4) and the results of performance comparison between the conventional method and the present invention are as shown in FIG. 18 (all the compared data are only luminance signals). . The data is an average value when all 60 frames are encoded by intra prediction. The sequence was “FOREMAN” and the resolution was CIF (352 × 288).

また，図１９に，図１８の実験において予測モードの予測値がラグランジュコストの最も低い予測モードとして適用される予測的中率を示す。この予測的中率が高いほど，１ビットのフラグで表記される数が増加するため，符号量の削減に繋がる。 FIG. 19 shows the predictive predictability applied as the prediction mode in which the prediction value of the prediction mode has the lowest Lagrangian cost in the experiment of FIG. The higher the predictive predictive value, the more the number represented by a 1-bit flag increases, leading to a reduction in code amount.

本実験では，実施例２および実施例６として説明した手法を用い，４×４ブロックだけでなく，８×８ブロックの予測モードの予測値生成にも提案手法を実装した。この結果，上記の実験において，ＰＳＮＲ＝３５［ｄＢ］では約２．９３％の符号量削減，ＰＳＮＲ＝３２［ｄＢ］では約５．０１％の符号量削減が確認できた。 In this experiment, the method described as Example 2 and Example 6 was used, and the proposed method was implemented not only for 4 × 4 block but also for 8 × 8 block prediction mode generation. As a result, in the above experiment, a code amount reduction of about 2.93% was confirmed when PSNR = 35 [dB], and a code amount reduction of about 5.01% was confirmed when PSNR = 32 [dB].

以上の実施形態における画面内予測の処理は，コンピュータとソフトウェアプログラムとによっても実現することができ，そのプログラムをコンピュータ読み取り可能な記録媒体に記録することも，ネットワークを通して提供することも可能である。 The intra-screen prediction process in the above embodiment can be realized by a computer and a software program, and the program can be recorded on a computer-readable recording medium or provided through a network.

画面内予測符号化装置の一実施形態を示すブロック図である。It is a block diagram which shows one Embodiment of the prediction encoding apparatus in a screen. 符号化処理のフローチャートである。It is a flowchart of an encoding process. 隣接ブロックのブロック番号の設定を説明する図である。It is a figure explaining the setting of the block number of an adjacent block. ＢｌｏｃｋＡ，Ｂを対象とした流入ベクトルの有無の判断に使用する角度閾値の例を示す図である。It is a figure which shows the example of the angle threshold value used for determination of the presence or absence of the inflow vector which made BlockA and B object. 予測モードの予測値生成処理のフローチャートである。It is a flowchart of the prediction value generation process of prediction mode. ＢｌｏｃｋＡ，Ｂ，Ｆを対象とした流入ベクトルの有無の判断に使用する角度閾値の例を示す図である。It is a figure which shows the example of the angle threshold value used for judgment of the presence or absence of the inflow vector which made BlockA, B, and F object. 予測モードの予測値生成処理のフローチャートである。It is a flowchart of the prediction value generation process of prediction mode. ＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆ，Ｇを対象とした流入ベクトルの有無の判断に使用する角度閾値の例を示す図である。It is a figure which shows the example of the angle threshold value used for determination of the presence or absence of the inflow vector for BlockA, B, E, F, G. ＢｌｏｃｋＡ，Ｂ，Ｅ，Ｆを対象とした流入ベクトルの有無の判断に使用する角度閾値の例を示す図である。It is a figure which shows the example of the angle threshold value used for determination of the presence or absence of the inflow vector which made BlockA, B, E, and F object. ＢｌｏｃｋＡ，Ｂ，Ｆ，Ｇを対象とした流入ベクトルの有無の判断に使用する角度閾値の例を示す図である。It is a figure which shows the example of the angle threshold value used for determination of the presence or absence of the inflow vector which made BlockA, B, F, and G object. 予測モードの予測値生成処理のフローチャートである。It is a flowchart of the prediction value generation process of prediction mode. ＢｌｏｃｋＡ〜Ｈを対象とした流入ベクトルの有無の判断に使用する角度閾値の例を示す図である。It is a figure which shows the example of the angle threshold value used for the judgment of the presence or absence of the inflow vector which made BlockAH object. 予測モードの予測値生成処理のフローチャートである。It is a flowchart of the prediction value generation process of prediction mode. エッジ検出による予測モードの予測値生成処理のフローチャート。The flowchart of the prediction value production | generation process of the prediction mode by edge detection. 画素間距離の説明図である。It is explanatory drawing of the distance between pixels. 画面内予測復号装置の一実施形態を示すブロック図である。It is a block diagram which shows one Embodiment of the prediction decoding apparatus in a screen. 復号処理のフローチャートである。It is a flowchart of a decoding process. テストシーケンスにおけるＲＤ曲線（６０フレームの平均値）を示す図である。It is a figure which shows the RD curve (60 frame average value) in a test sequence. 予測モードの予測値がラグランジュコストの最も低い予測モードとして適用される予測的中率を示す図である。It is a figure which shows the predictive probability applied as a prediction mode with the prediction value of a prediction mode having the lowest Lagrange cost. 従来の画面内予測手法で用いる予測角度の説明図である。It is explanatory drawing of the prediction angle used with the conventional intra-screen prediction method. 従来の画面内予測手法で用いる参照画素の画素値の説明図である。It is explanatory drawing of the pixel value of the reference pixel used with the conventional prediction method in a screen. 従来手法における符号化処理のフローチャートである。It is a flowchart of the encoding process in a conventional method. 従来手法における予測モードの予測に使用する隣接ブロックを説明する図である。It is a figure explaining the adjacent block used for prediction of the prediction mode in a conventional method. 適用される予測モード０，７の比較を説明するための，原画像の画素値を示す図である。It is a figure which shows the pixel value of an original image for demonstrating the comparison of the prediction modes 0 and 7 applied. 符号化対象ブロックを予測モード０で画面内符号化したときの予測画素値と予測残差の例を示す図である。It is a figure which shows the example of a prediction pixel value when a coding object block is intra-coded in prediction mode 0, and a prediction residual. 符号化対象ブロックを予測モード７で画面内符号化したときの予測画素値と予測残差の例を示す図である。It is a figure which shows the example of a prediction pixel value when a coding object block is intra-coded in the prediction mode 7, and a prediction residual.

Explanation of symbols

１０符号化装置
１１ラグランジュコスト最小値及びブロック符号化データ初期化部
１２予測モードの予測部
１２１流入ベクトル有りのブロック抽出部
１２２ブロック選択及び予測値生成部
１３予測モード判定ループ部
１３１予測モードの符号化部
１３２画面内予測部
１３３整数精度ＤＣＴ・量子化・符号化部
１３４ラグランジュコスト判定部
１３５ループの終了判定部
２０復号装置
２１予測モードの予測部
２１１流入ベクトル有りのブロック抽出部
２１２ブロック選択及び予測値生成部
２２予測モード決定部
２３残差信号復号部
２４復号画像生成部 DESCRIPTION OF SYMBOLS 10 Encoding apparatus 11 Lagrange cost minimum value and block coding data initialization part 12 Prediction mode prediction part 121 Block extraction part with inflow vector 122 Block selection and prediction value generation part 13 Prediction mode determination loop part 131 Prediction mode code Encoding unit 132 intra prediction unit 133 integer precision DCT / quantization / coding unit 134 Lagrangian cost determination unit 135 loop end determination unit 20 decoding device 21 prediction mode prediction unit 211 block extraction unit with inflow vector 212 block selection and Prediction value generation unit 22 Prediction mode determination unit 23 Residual signal decoding unit 24 Decoded image generation unit

Claims

An intra-screen predictive encoding method for encoding an image using intra-screen prediction,
By comparing the prediction angle determined from the prediction mode in the encoded block adjacent to the encoding target block with a predetermined threshold value, it is determined whether or not there is an inflow vector in which the prediction vector indicating the prediction direction is directed toward the center of the encoding target block. Extracting the encoded block with the inflow vector,
Possess a process of generating a prediction value of a prediction mode of the encoding target block by selecting one of the prediction modes of encoded blocks of there inflow vector is the extraction,
In the process of generating the prediction value of the prediction mode,
When there are a plurality of encoded blocks with the inflow vector, an edge of a reference pixel in the encoded block is detected, and a prediction mode of the encoded block having a prediction mode in a direction in which the edge appears most strongly, An intra-screen predictive encoding method, wherein a prediction value of a prediction mode of an encoding target block is used.

In the intra prediction encoding method according to claim 1,
In the process of extracting the encoded block with the inflow vector,
In accordance with a predetermined setting value or a setting value given from the outside, an encoded block to be determined as to whether or not the inflow vector exists is selected, and the presence or absence of the inflow vector is determined only for the selected encoded block. An intra-screen predictive encoding method characterized by determining.

An intra-screen prediction decoding method for decoding an image using intra-screen prediction,
By comparing the prediction angle determined from the prediction mode in the decoded block adjacent to the decoding target block with a predetermined threshold, it is determined whether or not there is an inflow vector in which the prediction vector indicating the prediction direction is directed to the decoding target block center direction. Extracting a decoded block with a vector;
Possess a process of generating a prediction value of a prediction mode of the block to be decoded by selecting one of the prediction mode of the decoded blocks there inflow vector is the extraction,
In the process of generating the prediction value of the prediction mode,
When there are a plurality of decoded blocks having the inflow vector, an edge of a reference pixel in the decoded block is detected, and a prediction mode of a decoded block having a prediction mode in a direction in which the edge appears most strongly is determined as a decoding target block An intra-screen predictive decoding method, characterized in that the prediction value of the prediction mode is used.

In the intra prediction decoding method according to claim 3 ,
In the process of extracting the decoded block with the inflow vector,
In accordance with a predetermined setting value or a setting value given from the outside, a decoded block to be determined as to whether or not there is an inflow vector is selected, and the presence or absence of an inflow vector is determined only for the selected decoded block An intra-screen predictive decoding method characterized by

An intra-screen predictive encoding device that encodes an image using intra-screen prediction,
By comparing the prediction angle determined from the prediction mode in the encoded block adjacent to the encoding target block with a predetermined threshold value, it is determined whether or not there is an inflow vector in which the prediction vector indicating the prediction direction is directed toward the center of the encoding target block. A means for extracting a coded block with an inflow vector;
E Bei and means for generating a prediction value of a prediction mode of the encoding target block by selecting one of the prediction modes of encoded blocks of there inflow vector is the extraction,
The means for generating the prediction value of the prediction mode is:
When there are a plurality of encoded blocks with the inflow vector, an edge of a reference pixel in the encoded block is detected, and a prediction mode of the encoded block having a prediction mode in a direction in which the edge appears most strongly, An intra-screen predictive coding apparatus characterized by using a prediction value of a prediction mode of an encoding target block .

An intra-screen predictive decoding device that decodes an image using intra-screen prediction,
By comparing the prediction angle determined from the prediction mode in the decoded block adjacent to the decoding target block with a predetermined threshold, it is determined whether or not there is an inflow vector in which the prediction vector indicating the prediction direction is directed to the decoding target block center direction. Means for extracting decoded blocks with vectors;
E Bei and means for generating a prediction value of a prediction mode of the block to be decoded by selecting one of the prediction mode of the decoded blocks there inflow vector is the extraction,
The means for generating the prediction value of the prediction mode is:
When there are a plurality of decoded blocks having the inflow vector, an edge of a reference pixel in the decoded block is detected, and a prediction mode of a decoded block having a prediction mode in a direction in which the edge appears most strongly is determined as a decoding target block An intra-screen predictive decoding device characterized in that the prediction value is a prediction value of the prediction mode .

An intra-screen predictive encoding program for causing a computer to execute the intra-screen predictive encoding method according to claim 1 .

A computer-readable recording medium on which the in-screen predictive encoding program according to claim 7 is recorded.

An intra-screen predictive decoding program for causing a computer to execute the intra-screen predictive decoding method according to claim 3 or 4 .

A computer-readable recording medium on which the in-screen predictive decoding program according to claim 9 is recorded.