JP2009021673A

JP2009021673A - Coded parameter determining method, coded parameter determining device, coded parameter determining program and computer-readable recording medium recording the program

Info

Publication number: JP2009021673A
Application number: JP2007180910A
Authority: JP
Inventors: Yukihiro Bando; 幸浩坂東; Kazuya Hayase; 和也早瀬; Masayuki Takamura; 誠之高村; Kazuto Kamikura; 一人上倉; Yoshiyuki Yashima; 由幸八島
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2007-07-10
Filing date: 2007-07-10
Publication date: 2009-01-29
Anticipated expiration: 2027-07-10
Also published as: JP4709187B2

Abstract

<P>PROBLEM TO BE SOLVED: To establish a technique for preventing deterioration in image quality in a skip mode and reducing code amount of the skip mode in determining a coded parameter by minimizing a cost function using coded distortion weighted on the basis of time spatial visual characteristics. <P>SOLUTION: The coded parameter determining method includes: coding target block, when a prediction mode of minimum cost selected by calculating the cost by using a distortion quantity weighted using a sensitivity coefficient indicating time spatial visual sensitivity is a skip mode, the skip mode is not determined as it is as the optimum prediction mode of the block, but the displacement quantity to be used in coding is compared to the estimated displacement quantity calculated prior to coding; and when the difference between the two displacement quantities is large and the estimated displacement quantity is large, the prediction mode of the minimum cost selected by calculating the cost by using the unweighted distortion quantity is determined as a prediction mode. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、動画像符号化で用いられる符号化パラメータ決定方法およびその装置と、その符号化パラメータ決定方法の実現に用いられる符号化パラメータ決定プログラムおよびそのプログラムを記録したコンピュータ読み取り可能な記録媒体とに関し、特に、時空間視覚感度を考慮して動画像の符号化を行うときに、復号画像の主観品質を保ちながら効率的に符号量を削減できるようにする符号化パラメータの決定を実現する符号化パラメータ決定方法およびその装置と、その符号化パラメータ決定方法の実現に用いられる符号化パラメータ決定プログラムおよびそのプログラムを記録したコンピュータ読み取り可能な記録媒体とに関する。 The present invention relates to an encoding parameter determination method and apparatus used in moving image encoding, an encoding parameter determination program used to realize the encoding parameter determination method, and a computer-readable recording medium on which the program is recorded. In particular, when encoding a moving image in consideration of spatio-temporal visual sensitivity, a code that realizes determination of an encoding parameter that can efficiently reduce the amount of code while maintaining the subjective quality of the decoded image The present invention relates to an encoding parameter determination method and apparatus, an encoding parameter determination program used for realizing the encoding parameter determination method, and a computer-readable recording medium on which the program is recorded.

［二乗誤差規範のコスト関数を用いる符号化方式］
Ｈ．２６４では、イントラ予測および可変形状動き補償の導入に伴い、従来の標準化方式と比べて、予測モードの種類が増加している。このため、一定の主観画質を保持しつつ符号量を削減するには、適切な予測モードを選択する必要がある。Ｈ．２６４の参照ソフトウェアＪＭ（非特許文献１参照）では、以下のＲ−Ｄコストを最小化する予測モードを選択している。なお、以下の表記において、「＾Ｘ」（Ｘは文字）における記号＾は、「Ｘ」の上に付く記号を示している。 [Encoding method using cost function of square error criterion]
H. In H.264, with the introduction of intra prediction and variable shape motion compensation, the types of prediction modes are increasing compared to the conventional standardized method. For this reason, in order to reduce the amount of codes while maintaining a constant subjective image quality, it is necessary to select an appropriate prediction mode. H. In the H.264 reference software JM (see Non-Patent Document 1), the following prediction mode that minimizes the RD cost is selected. In the following notation, the symbol ^ in “^ X” (where X is a letter) indicates a symbol attached to “X”.

ここで、Ｓは原信号、ｑは量子化パラメータ、ｍは予測モードを表す番号であり、＾Ｓ_m,qは原信号Ｓに対して予測モードｍを用いて予測し、量子化パラメータｑを用いて量子化した場合の復号信号である。また、λはモード選択に用いるラグランジェの未定乗数である。さらに、Ｄ（Ｓ，＾Ｓ_m,q）は次式に示す二乗誤差和である。 Here, S is an original signal, q is a quantization parameter, m is a number representing a prediction mode, ^ S _{m, q} is predicted for the original signal S using the prediction mode m, and the quantization parameter q is It is a decoded signal when quantized by using. Further, λ is a Lagrange's undetermined multiplier used for mode selection. Further, D (S, ^ S _{m, q} ) is a sum of square errors shown in the following equation.

ここで、Ｓ^Y，Ｓ^U，Ｓ^Vはそれぞれ原信号のＹ，Ｕ，Ｖ成分であり、＾Ｓ^Y _m,q，＾Ｓ^U _m,q，＾Ｓ^V _m,qはそれぞれ復号信号のＹ，Ｕ，Ｖ成分である。 Here, S ^Y , S ^U , and S ^V are Y, U, and V components of the original signal, respectively, and ＾ S ^Y _{m, q} , ＳS ^U _{m, q} , and ＳS ^V _{m, q} are respectively the decoded signals. Y, U and V components.

Ｈ．２６４における復号信号の算出方法を以下に示す。なお、説明に用いる記号を下記の表にまとめる。 H. The calculation method of the decoded signal in H.264 is shown below. The symbols used for the explanation are summarized in the following table.

Ｈ．２６４の符号化処理では、モード番号ｍの予測を用いた場合の予測誤差信号Ｒ_m（＝Ｓ−Ｐ_m）に対して、変換行列Φを用いた直交変換が次式のように施される。 H. In the H.264 encoding process, the orthogonal transformation using the transformation matrix Φ is applied to the prediction error signal R _m (= S−P _m ) when the prediction of the mode number m is used as follows: .

ここで、Φ^tは変換行列Φに対する転置行列を表す。なお、変換行列Φは次式で表される整数要素の直交行列である。 Here, Φ ^t represents a transposed matrix with respect to the transformation matrix Φ. Note that the transformation matrix Φ is an orthogonal matrix of integer elements expressed by the following equation.

次に、行列Φが非正規行列であるため、次式に示すように、行列の正規化に相当する処理を行う。 Next, since the matrix Φ is a non-normal matrix, processing equivalent to matrix normalization is performed as shown in the following equation.

Ｃ_n＝Ｎ（Ｃ）
さらに、Ｃ_nに対して、量子化パラメータｑを用いた量子化が次式のとおり施される。なお、Ｈ．２６４の参照ソフトウェアＪＭでは、正規化は量子化の中に組み込まれている。 C _n = N (C)
Further, quantization using the quantization parameter q is performed on C _n as follows. H. In the H.264 reference software JM, normalization is built into quantization.

Ｖ＝Ｑ（Ｃ_n）
一方、Ｈ．２６４の復号処理では、Ｖに対して、次式のように逆量子化を施し、変換係数の復号値を得る。 V = Q (C _n )
On the other hand, H. In the H.264 decoding process, V is inversely quantized as in the following equation to obtain a decoded value of a transform coefficient.

次に、＾Ｃ_qに対して、次式のように逆変換を施し、予測誤差の復号信号を得る。 Next, inverse transformation is applied to { _circumflex over (C) _{} q} as shown in the following equation to obtain a prediction error decoded signal.

最後に、次式により、符号化対象画像の復号信号を得る。 Finally, a decoded signal of the encoding target image is obtained by the following equation.

［主観画質を考慮した歪み量への重み付け］
前述の通り、Ｈ．２６４の参照ソフトウェアＪＭで用いられている主観画質の尺度は二乗誤差である。しかし、この二乗誤差は必ずしも、主観的な画質劣化を反映した歪み量ではない。例えば、高周波数成分の変化は低周波成分の変化に比べて、視覚的には検知されにくい。しかし、こうした視覚特性を利用していない符号化器（例えば、ＪＭ）には、符号量の効率的な削減に関して、改良の余地が残る。 [Weighting distortion amount considering subjective image quality]
As described above, H.P. The measure of subjective image quality used in the H.264 reference software JM is a square error. However, this square error is not necessarily a distortion amount reflecting subjective image quality degradation. For example, a change in a high frequency component is less visually detected than a change in a low frequency component. However, an encoder (for example, JM) that does not use such visual characteristics still has room for improvement in terms of efficient code amount reduction.

そこで、時空間周波数成分に対して視覚感度に差があることを利用する検討がなされている。直交変換係数に対して、視覚感度に応じて空間周波数成分毎に歪み量の重み付けを行うことで、主観画質に対応した歪み量を定義する。さらに、時間方向の視覚感度も考慮して、上述の重み付けされた歪み量に対して、変移量に応じてさらに重み付けを行う。こうして時空間の視覚感度に基づき重み付けされた歪み量を、符号化パラメータ選択のコスト関数において用いる。 Therefore, studies have been made to use the difference in visual sensitivity with respect to spatio-temporal frequency components. A distortion amount corresponding to the subjective image quality is defined by weighting the distortion amount for each spatial frequency component in accordance with the visual sensitivity with respect to the orthogonal transform coefficient. Further, in consideration of the visual sensitivity in the time direction, the above-described weighted distortion amount is further weighted in accordance with the shift amount. Thus, the weighting amount weighted based on the visual sensitivity of the space time is used in the cost function for selecting the encoding parameter.

量子化誤差信号に対する視覚感度に基づく重み付けについて、以下に説明する。ここでは、次式のＲ−Ｄコストを用いることを想定している。 The weighting based on the visual sensitivity for the quantization error signal will be described below. Here, it is assumed that the RD cost of the following equation is used.

ここで、Ｃ_mはモード番号ｍを用いた場合の予測残差信号Ｒ_mに対する変換係数であり、＾Ｃ_m,qはＣ_mを量子化パラメータｑで量子化・逆量子化して得られる係数の復号値である。このＲ−Ｄコストの計算に用いる歪み量として、以下の重み付け歪み量を用いる。 Here, C _m is a transform coefficient for the prediction residual signal R _m when the mode number m is used, and ^ C _{m, q} is a coefficient obtained by quantizing and dequantizing C _m with the quantization parameter q. Is the decoded value. The following weighted distortion amount is used as the distortion amount used for calculating the RD cost.

ここで、１６／Ｎおよび８／Ｎを囲む記号は、小数部分の切り捨てを意味する。また、Ｃ^Y(i) _m［ｋ，ｌ］，Ｃ^U(i) _m［ｋ，ｌ］，Ｃ^V(i) _m［ｋ，ｌ］はＣ_mの要素であり、マクロブロック（Ｙ成分の場合には１６×１６［画素］、Ｕ，Ｖ成分の場合には８×８［画素］）内のサブブロック（Ｎ×Ｎ［画素］）のうち、ラスター走査においてｉ番目に走査されるサブブロックに含まれる変換係数である。また、＾Ｃ^Y(i) _m,q［ｋ，ｌ］、＾Ｃ^U(i) _m,q［ｋ，ｌ］、＾Ｃ^V(i) _m,q［ｋ，ｌ］は＾Ｃ_m,qの要素であり、マクロブロック（Ｙ成分の場合には１６×１６［画素］、Ｕ，Ｖ成分の場合には８×８［画素］）内のサブブロック（Ｎ×Ｎ［画素］）のうち、ラスター走査においてｉ番目に走査されるサブブロックに含まれる復号変換係数である。 Here, the symbols surrounding 16 / N and 8 / N mean truncation of the decimal part. C ^{Y (i)} _m [k, l], C ^{U (i)} _m [k, l], and C ^{V (i)} _m [k, l] are elements of C _m , and are macroblocks (Y component) Of the sub-block (N × N [pixel]) in 16 × 16 [pixel] in the case of U, V component and 8 × 8 [pixel] in the case of U and V components) is scanned i-th in the raster scan. This is a conversion coefficient included in the sub-block. ^ C ^{Y (i)} _{m, q} [k, l], ^ C ^{U (i)} _{m, q} [k, l] and ^ C ^{V (i)} _{m, q} [k, l] are ^ C _{m , q} elements and sub-blocks (N × N [pixels]) in a macroblock (16 × 16 [pixels] for Y components and 8 × 8 [pixels] for U and V components) Among them, the decoding transform coefficient included in the i-th scanned sub-block in raster scanning.

さらに、Ｗ^Y _k,l，Ｗ^U _k,l，Ｗ^V _k,lは１以下に設定される重み係数であり、空間周波数および時間周波数が高いほど、小さな値をとる。 Further, W ^Y _{k, l} , W ^U _{k, l} and W ^V _{k, l} are weighting factors set to 1 or less, and take a smaller value as the spatial frequency and time frequency are higher.

上式において、Ｗ^Y _k,l，Ｗ^U _k,l，Ｗ^V _k,lを小さな値に設定することは、量子化歪みＤ（Ｃ_m，＾Ｃ_m,q）を小さく見積もることに相当する。なお、直交変換の正規性より、すべてのｋ，１に対して、Ｗ^Y _k,l＝１，Ｗ^U _k,l＝１，Ｗ^V _k,l＝１とすれば、上述の重み付け歪み量は二乗誤差和と等価となる。
K.P.Lim and G.Sullivan and T.Wiegand, Text Description of Joint Model Reference Encoding Methods and Decoding Concealment Methods. Joint Video Team （JVT ）of ISO/IEC MPEG and ITU-T VCEG, JVT-R95, Jan., 2006. In the above equation, setting W ^Y _{k, l} , W ^U _{k, l} , W ^V _{k, l} to a small value is equivalent to estimating the quantization distortion D (C _m , ^ C _{m, q} ) to be small. To do. From the normality of orthogonal transformation, if W ^Y _{k, l} = 1, W ^U _{k, l} = 1, W ^V _{k, l} = 1 for all _{k, 1} , the above-mentioned weighted distortion amount Is equivalent to the sum of squared errors.
KPLim and G. Sullivan and T. Wiegand, Text Description of Joint Model Reference Encoding Methods and Decoding Concealment Methods. Joint Video Team (JVT) of ISO / IEC MPEG and ITU-T VCEG, JVT-R95, Jan., 2006.

Ｈ．２６４では、直上、直左、直左上のマクロブロックの動きベクトルを用いて、符号化対象のマクロブロックの動きベクトルに対する予測ベクトルを生成する。 H. In H.264, a prediction vector for a motion vector of a macroblock to be encoded is generated using the motion vectors of the macroblock immediately above, immediately left, and immediately above left.

スキップモード以外の予測モードの場合には、この予測ベクトルとの差分情報を用いて、動きベクトルを表現する。 In the case of a prediction mode other than the skip mode, a motion vector is expressed using difference information from this prediction vector.

一方、スキップモードの場合には、動きベクトルとして、この予測ベクトルを用いる。さらに、フレーム間予測残差は零値として、復号信号を生成する。 On the other hand, in the skip mode, this prediction vector is used as a motion vector. Further, the inter-frame prediction residual is set to zero and a decoded signal is generated.

このため、予測ベクトルが異なる絵柄の領域を参照することになれば、スキップモードのマクロブロックに大きな符号化歪みが重畳することになる。一方、符号量は極めて微小に抑えることができる。 For this reason, if reference is made to a region of a picture with a different prediction vector, a large coding distortion is superimposed on the skip mode macroblock. On the other hand, the amount of codes can be kept extremely small.

こうした符号化歪みと符号量とはトレードオフの関係にあるため、ＪＭと呼ばれるＨ．２６４の符号化器では、符号化歪みと符号量との加重和としてラグランジェのコスト関数を導入し、このトレードオフのバランスを勘案し、予測モードの選択を行っている。 Since such coding distortion and code amount are in a trade-off relationship, H.J. In the H.264 encoder, a Lagrangian cost function is introduced as a weighted sum of encoding distortion and code amount, and a prediction mode is selected in consideration of this trade-off balance.

しかるに、前述のコントラスト感度関数に基づく歪み量への重み付け（Ｗ^Y _k,l，Ｗ^U _k,l，Ｗ^V _k,lによる重み付け）を行った場合、スキップモードが大きな画質劣化を含んでいるにもかかわらず、スキップモードが最適な予測モードとして選択される場合がある。 However, when weighting is applied to the distortion amount based on the above-described contrast sensitivity function (weighting by W ^Y _{k, l} , W ^U _{k, l} , W ^V _{k, l} ), the skip mode includes large image quality degradation. Nevertheless, the skip mode may be selected as the optimal prediction mode.

このような画質劣化が発生するということは、符号化歪みに対する重み付けを行う際に、スキップモードに関しては、このモードの持つ特異性を考慮する必要性があることを示唆している。しかしながら、従来技術では、そのような検討がなされていない。 The occurrence of such image quality degradation suggests that it is necessary to consider the peculiarity of this mode with respect to the skip mode when weighting the coding distortion. However, such a study has not been made in the prior art.

本発明はかかる事情に鑑みてなされたものであって、時空間視覚特性に基づき重み付けされた符号化歪みを用いたコスト関数の最小化により、符号化パラメータの決定を行う際に、スキップモードの主観画質を適切に評価した符号化歪みの尺度を導入することで、スキップモードにおける画質劣化を回避し、スキップモードの符号量削減のメリットを最大限享受しうる新たな符号化パラメータ決定技術を確立することを目的とする。 The present invention has been made in view of such circumstances, and in determining the encoding parameter by minimizing the cost function using the encoding distortion weighted based on the spatio-temporal visual characteristics, the skip mode is used. Established a new coding parameter determination technology that can avoid degradation of image quality in skip mode and enjoy the maximum benefit of code amount reduction in skip mode by introducing a measure of coding distortion that appropriately evaluates subjective image quality The purpose is to do.

〔１〕第１の構成
前記の目的を達成するために、本発明の符号化パラメータ決定装置は、フレーム内予測およびフレーム間予測により得られた予測誤差信号に対して、変換符号化および量子化による情報圧縮を行う画像符号化に用いる符号化パラメータを決定するために、（イ）符号化対象のブロックについて、符号化処理に先立って、画像信号の時間的な動きを示す推定変位量を算出する算出手段と、（ロ）時空間視覚感度を示す感度係数を用いて重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモードであるブロックを判断対象として、そのブロックを符号化する際に用いる変位量と算出手段の算出した推定変位量とを比較して、その２つの変位量の乖離度が規定の閾値より大きく、かつ、推定変位量の大きさが規定の閾値より大きいのか否かを判断することで、スキップモードで符号化すると画質劣化が大きくなる可能性の高いブロックに該当するのか否かを判断する判断手段と、（ハ）判断手段が該当のブロックでないと判断した判断対象のブロックについては、スキップモードをそのブロックの最適な予測モードとして決定し、判断手段が該当のブロックであると判断した判断対象のブロックについては、重み付けがされない歪み量を用いてコストを算出することで選択されたコスト最小の予測モードをそのブロックの最適な予測モードとして決定する決定手段と、（ニ）重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモードでないブロックについて、その予測モードをそのブロックの最適な予測モードとして決定する第２の決定手段と、（ホ）判断手段が判断対象とするブロックについて、重み付けがされない歪み量を用いてコストを算出することで選択されたコスト最小の予測モードもまたスキップモードである場合には、判断手段による判断処理を行うことなく、スキップモードをそのブロックの最適な予測モードとして決定する第３の決定手段とを備えるように構成する。 [1] First Configuration To achieve the above object, the coding parameter determination apparatus of the present invention performs transform coding and quantization on a prediction error signal obtained by intraframe prediction and interframe prediction. In order to determine the encoding parameters used for image encoding that performs information compression by (i), the estimated displacement amount indicating the temporal movement of the image signal is calculated prior to the encoding process for the block to be encoded. And (b) a block in which the prediction mode with the lowest cost selected by calculating the cost using the weighted distortion amount using the sensitivity coefficient indicating the spatiotemporal visual sensitivity is the skip mode. As a comparison between the amount of displacement used when encoding the block and the estimated amount of displacement calculated by the calculation means, the difference between the two amounts of displacement is greater than a prescribed threshold, In addition, by determining whether or not the size of the estimated displacement amount is larger than a prescribed threshold, it is determined whether or not the block is likely to fall in image quality deterioration when encoded in the skip mode. (C) For a block to be determined that the determination means determines that it is not the corresponding block, the skip mode is determined as the optimum prediction mode of the block, and the determination target that the determination means determines to be the corresponding block is determined. For a block, a determination means for determining a prediction mode with the least cost selected by calculating a cost using an unweighted distortion amount as an optimal prediction mode of the block; and (d) a weighted distortion amount. Prediction of the block with the least cost prediction mode selected by calculating the cost using the skip mode A second determination unit that determines the mode as the optimum prediction mode of the block, and (e) a block selected by the determination unit is selected by calculating a cost using an unweighted distortion amount When the prediction mode with the lowest cost is also the skip mode, a third determination unit that determines the skip mode as the optimum prediction mode of the block without performing the determination process by the determination unit is provided. .

以上の各処理手段が動作することで実現される本発明の符号化パラメータ決定方法はコンピュータプログラムでも実現できるものであり、このコンピュータプログラムは、適当なコンピュータ読み取り可能な記録媒体に記録して提供されたり、ネットワークを介して提供され、本発明を実施する際にインストールされてＣＰＵなどの制御手段上で動作することにより本発明を実現することになる。 The encoding parameter determination method of the present invention realized by the operation of each of the above processing means can also be realized by a computer program, and this computer program is provided by being recorded on an appropriate computer-readable recording medium. Alternatively, the present invention is realized by being provided via a network, installed when the present invention is carried out, and operating on a control means such as a CPU.

このように構成される本発明の符号化パラメータ決定装置では、符号化対象のブロックについて、時空間視覚感度を示す感度係数を用いて重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモード（動きベクトルとして予測ベクトルを用いることを指示する予測モード）である場合には、そのブロックの最適な予測モードして、そのままスキップモードを決定するのではなくて、符号化する際に用いる変位量と符号化に先立って算出した推定変位量とを比較して、その２つの変位量の乖離が大きく、かつ、推定変位量が大きい場合には、重み付けがされない歪み量を用いてコストを算出することで選択されたコスト最小の予測モードをそのブロックの最適な予測モードとして決定するようにする。 In the coding parameter determination apparatus of the present invention configured as described above, a block is selected by calculating a cost using a distortion amount weighted using a sensitivity coefficient indicating spatiotemporal visual sensitivity for a block to be coded. If the prediction mode with the lowest cost is a skip mode (a prediction mode instructing to use a prediction vector as a motion vector), the optimal prediction mode for the block is not determined and the skip mode is not determined as it is. The displacement amount used for encoding is compared with the estimated displacement amount calculated prior to encoding, and if the difference between the two displacement amounts is large and the estimated displacement amount is large, no weighting is performed. The prediction mode with the lowest cost selected by calculating the cost using the amount of distortion is determined as the optimal prediction mode for the block. That.

〔２〕第２の構成
また、前記の目的を達成するために、本発明の符号化パラメータ決定装置は、フレーム内予測およびフレーム間予測により得られた予測誤差信号に対して、変換符号化および量子化による情報圧縮を行う画像符号化に用いる符号化パラメータを決定するために、（イ）符号化対象のブロックについて、符号化処理に先立って、画像信号の時間的な動きを示す推定変位量を算出する算出手段と、（ロ）時空間視覚感度を示す感度係数を用いて重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモードであるブロックを判断対象として、そのブロックを符号化する際に用いる変位量と算出手段の算出した推定変位量とを比較して、その２つの変位量の乖離度が規定の閾値より大きく、かつ、推定変位量の大きさが規定の閾値より大きいのか否かを判断することで、スキップモードで符号化すると画質劣化が大きくなる可能性の高いブロックに該当するのか否かを判断する判断手段と、（ハ）判断手段が該当のブロックでないと判断した判断対象のブロックについては、スキップモードをそのブロックの最適な予測モードとして決定し、判断手段が該当のブロックであると判断した判断対象のブロックについては、コスト最小のスキップモードの次にコストの小さなものとして選択された予測モードをそのブロックの最適な予測モードとして決定する決定手段と、（ニ）重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモードでないブロックについて、その予測モードをそのブロックの最適な予測モードとして決定する第２の決定手段と、（ホ）判断手段が判断対象とするブロックについて、重み付けがされない歪み量を用いてコストを算出することで選択されたコスト最小の予測モードもまたスキップモードである場合には、判断手段による判断処理を行うことなく、スキップモードをそのブロックの最適な予測モードとして決定する第３の決定手段とを備えるように構成する。 [2] Second Configuration Further, in order to achieve the above object, the coding parameter determination apparatus of the present invention performs transform coding and prediction on a prediction error signal obtained by intraframe prediction and interframe prediction. In order to determine encoding parameters used for image encoding for performing information compression by quantization, (a) an estimated displacement amount indicating temporal movement of an image signal prior to encoding processing for a block to be encoded And (b) a block in which the prediction mode with the lowest cost selected by calculating the cost using the distortion amount weighted using the sensitivity coefficient indicating the spatiotemporal visual sensitivity is the skip mode. As a judgment target, the displacement amount used when coding the block is compared with the estimated displacement amount calculated by the calculation means, and the difference between the two displacement amounts is larger than a prescribed threshold value. And whether or not the estimated displacement amount is larger than a predetermined threshold value, it is determined whether or not the block corresponds to a block that is likely to have a large image quality deterioration when encoded in the skip mode. And (c) for the block to be determined that the determining unit determines not to be the corresponding block, the skip mode is determined as the optimum prediction mode of the block, and the determination unit determines that the block is the corresponding block With respect to the target block, using a determination unit that determines the prediction mode selected as the next lowest cost after the skip mode with the lowest cost as the optimum prediction mode of the block, and (d) using the weighted distortion amount The prediction mode selected for the block with the lowest cost selected by calculating the cost is not the skip mode. And (2) a minimum cost selected by calculating the cost using the unweighted distortion amount for the block to be determined by (2) the determination unit. If the prediction mode is also the skip mode, the third determination means for determining the skip mode as the optimum prediction mode for the block without performing the determination process by the determination means is provided.

このように構成される本発明の符号化パラメータ決定装置では、符号化対象のブロックについて、時空間視覚感度を示す感度係数を用いて重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモード（動きベクトルとして予測ベクトルを用いることを指示する予測モード）である場合には、そのブロックの最適な予測モードして、そのままスキップモードを決定するのではなくて、符号化する際に用いる変位量と符号化に先立って算出した推定変位量とを比較して、その２つの変位量の乖離が大きく、かつ、推定変位量が大きい場合には、コスト最小のスキップモードの次にコストの小さなものとして選択された予測モードをそのブロックの最適な予測モードとして決定するようにする。 In the coding parameter determination apparatus of the present invention configured as described above, a block is selected by calculating a cost using a distortion amount weighted using a sensitivity coefficient indicating spatiotemporal visual sensitivity for a block to be coded. If the prediction mode with the lowest cost is a skip mode (a prediction mode instructing to use a prediction vector as a motion vector), the optimal prediction mode for the block is not determined and the skip mode is not determined as it is. The amount of displacement used for encoding is compared with the estimated amount of displacement calculated prior to encoding. When the difference between the two amounts of displacement is large and the estimated amount of displacement is large, the cost is minimized. The prediction mode selected as the next lowest cost after the skip mode is determined as the optimum prediction mode of the block.

〔３〕本発明について
このように、本発明では、あるブロックについて、重み付けされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モードがスキップモードである場合には、そのブロックの最適な予測モードして、そのままスキップモードを決定するのではなくて、符号化する際に用いる変位量と符号化に先立って算出した推定変位量とを比較して、その２つの変位量の乖離が大きく、かつ、推定変位量が大きい場合には、コスト最小で選択されたものの、スキップモードで符号化したのでは大きな画質劣化を含んでいる可能性が高いことを考慮して、スキップモード以外の予測モードをそのブロックの最適な予測モードとして決定するようにするという構成を採る。 [3] About the present invention As described above, in the present invention, when the prediction mode with the minimum cost selected by calculating the cost using the weighted distortion amount for a certain block is the skip mode, Rather than determining the skip mode as the optimum prediction mode of the block, the displacement amount used for encoding is compared with the estimated displacement amount calculated prior to encoding, and the two displacement amounts are compared. If the discrepancy between the two is large and the estimated displacement is large, it is selected with the lowest cost, but it is highly possible that coding with the skip mode is likely to include a large image quality degradation. A configuration is adopted in which a prediction mode other than the mode is determined as the optimum prediction mode of the block.

スキップモードが大きな画質劣化を含んでいるにもかかわらず、最適な予測モードとして選択される原因を、以下で考察する。 The reason why the skip mode is selected as the optimum prediction mode despite the large image quality degradation will be considered below.

従来法では、各マクロブロックの真の変移量を符号化処理に先立ち推定する。ここで得られる変移量を推定変移量と呼ぶ。この推定変移量を用いて歪み量に対する重み付けを行う。符号化処理で用いる変移量は、別途、符号化器で算出する。この変移量を符号化器変移量と呼ぶ。 In the conventional method, the true shift amount of each macroblock is estimated prior to the encoding process. The amount of change obtained here is called an estimated amount of change. The estimated amount of displacement is used to weight the amount of distortion. The shift amount used in the encoding process is separately calculated by an encoder. This shift amount is called an encoder shift amount.

従来法では、推定変移量と符号化器変移量とが大きく乖離しないことを前提としている。これに対して、スキップモードの場合、動き探索を行う訳ではないため、符号化器変移量である予測ベクトルが推定変移量と大きく乖離する可能性がある。その結果、異なる絵柄の領域を参照することになれば、大きな符号化歪みが発生する。 In the conventional method, it is assumed that the estimated shift amount and the encoder shift amount do not greatly deviate. On the other hand, in the skip mode, since the motion search is not performed, there is a possibility that the prediction vector that is the encoder shift amount greatly deviates from the estimated shift amount. As a result, if a different picture area is referred to, a large coding distortion occurs.

しかし、推定変移量が大きな場合には、こうした大きな符号化歪みが発生したとしても、重み付けにより、その符号化歪みは小さく見積もられる。その結果、コストが小さくなり、最適モードとして選択される可能性がある。 However, when the estimated shift amount is large, even if such a large coding distortion occurs, the coding distortion is estimated to be small by weighting. As a result, the cost is reduced and the optimum mode may be selected.

そこで、本発明では、推定変移量と符号化器変移量との乖離度を考慮して、歪み量に対する重み付けを行う。すなわち、推定変移量と符号化器変移量とが大きく乖離し、かつ、推定変移量が大きな場合には、前述の画質劣化の可能性が高まるため、重み付けがされない歪み量を用いてコストを算出することで選択されたコスト最小の予測モードを用いることを決定したり、重み付けがされた歪み量を用いてコストを算出することで選択されたコスト最小の予測モード（スキップモード）の次にコストの小さなものとして選択された予測モードを用いることを決定するようにするのである。 Therefore, in the present invention, the distortion amount is weighted in consideration of the degree of deviation between the estimated shift amount and the encoder shift amount. In other words, if the estimated shift amount and the encoder shift amount deviate greatly, and the estimated shift amount is large, the possibility of the above-mentioned image quality degradation increases, so the cost is calculated using the unweighted distortion amount. The cost next to the prediction mode with the lowest cost selected (skip mode) is determined by calculating the cost using the weighted distortion amount. It is decided to use a prediction mode selected as a small one.

次に、推定変移量と符号化器変移量との乖離度を評価する評価関数、および推定変移量の大きさを評価する評価関数について説明する。 Next, an evaluation function for evaluating the degree of deviation between the estimated shift amount and the encoder shift amount, and an evaluation function for evaluating the magnitude of the estimated shift amount will be described.

［推定変位量の参照フレームと符号化器変移量の参照フレームとが同一の場合］
以下では、マクロブロックに対する推定変移量を“＾ｄ＝（＾ｄ_x，＾ｄ_y）”と表し、符号化器変移量を“ｄ＝（ｄｘ，ｄｙ）”と表す。 [When the estimated displacement reference frame is the same as the encoder displacement reference frame]
The following represents a macro block for the estimated displacement amount _{"^ d = (^ d x} , ^ d y)" and the encoder displacement amount "d = (dx, dy) " represents a.

次の２つの条件（１)(２）を満たす場合、歪み量に対する重み付けは行わないこととする。 When the following two conditions (1) and (2) are satisfied, no weighting is applied to the distortion amount.

ここで、ｄ（）は２つのベクトルの乖離度を表す関数であり、例えば、下記の式（１）で表される２つのベクトルの内積や、下記の式（２）で表される２つのベクトルの距離や、下記の式（３）で表される２つのベクトルの距離などを用いる。 Here, d () is a function representing the degree of divergence between two vectors. For example, the inner product of two vectors represented by the following expression (1) or two expressions represented by the following expression (2): A vector distance, a distance between two vectors represented by the following equation (3), or the like is used.

また、条件（２）の左辺は、下記の式（４）で表されるベクトルの絶対値ノルムである。ここで、θ，ψは外部から与えられる閾値である。 The left side of the condition (2) is the absolute value norm of the vector represented by the following formula (4). Here, θ and ψ are thresholds given from the outside.

［推定変移量の参照フレームと符号化器変移量の参照フレームとが異なる場合］
処理対象のマクロブロックが第ｔフレームに存在し、推定変移量が第ｔ−ｒ_eフレームを参照し、符号化器変移量が第ｔ−ｒ_cフレームを参照する場合にあって、次の２つの条件を満たす場合、歪み量に対する重み付けは行わないこととする。 [When the estimated transition amount reference frame is different from the encoder transition amount reference frame]
Macroblock to be processed is present in the t-th frame, the estimated displacement amount refers to the t-th-r _e frame, the encoder displacement amount is in the case of referring to the t-r _c frame, the next 2 When one condition is satisfied, weighting is not performed on the distortion amount.

つまり、推定変移量の参照フレームと符号化器変移量の参照フレームとが異なる場合、２つの変移量を隣接フレーム間の変移量に正規化した値を評価に用いる。また、ｒ_c，ｒ_eは正負いずれの値もとりうる。正値の場合は前方予測、負値の場合は後方予測にあたる。このため、各変移量をｒ_c，ｒ_eで除算するのは、ベクトルの長さの正規化とあわせて、ベクトルの方向が異なる場合に、両ベクトルの方向をそろえる意味も有る。 That is, when the estimated shift amount reference frame and the encoder shift amount reference frame are different, a value obtained by normalizing the two shift amounts to the shift amount between adjacent frames is used for evaluation. Further, r _c, r _e can take both positive and negative values. A positive value corresponds to forward prediction, and a negative value corresponds to backward prediction. Thus, for dividing each displacement amount in r _c, r _e, together with the length normalization of the vector, when the direction of the vector are different, also it means there align the direction of both vectors.

［Ｂフレームにおけるスキップモードの場合］
Ｂフレームにおけるスキップモードの場合、符号化器変移量の算出方法として２種類の方法（spatial directとtemporal direct)が規定されている。 [Skip mode in B frame]
In the case of the skip mode in the B frame, two types of methods (spatial direct and temporal direct) are defined as methods for calculating the encoder shift amount.

（ｉ）temporal direct の場合
符号化器変移量は、アンカーピクチャ（通常、表示順序で符号化対象フレームの後方の一番近い参照フレーム）において、符号化対象ブロックと同一位置にあるブロックであるアンカーブロックの変移量（アンカー変移量と呼ぶ）を用いて設定する。アンカー変移量をｄ_Colとし、アンカーピクチャとその参照フレームとの間の時間間隔をｔ_dとし、同参照フレームと符号化対象フレームとの間の時間間隔をｔ_bとして、双方向予測の動きベクトルは以下のように求められる。 (I) In the case of temporal direct The encoder shift amount is an anchor which is a block located at the same position as the encoding target block in the anchor picture (usually, the nearest reference frame behind the encoding target frame in the display order). This is set using the block shift amount (referred to as anchor shift amount). Bi-predicted motion vector, where d _Col is the anchor displacement, t _d is the time interval between the anchor picture and its reference frame, and t _b is the time interval between the reference frame and the encoding target frame Is obtained as follows.

双方向予測の動きベクトルは、いずれも、アンカー変移量から求まることが上式より分かる。そこで、次の２つの条件を満たす場合、歪み量に対する重み付けは行わないこととする。 It can be seen from the above equation that the bi-directional motion vector can be obtained from the anchor shift amount. Therefore, when the following two conditions are satisfied, the distortion amount is not weighted.

見かけ上、双方向予測の動きベクトルは２本であっても、２つのベクトルの方向はアンカー変移量と従属な関係にある。このため、推定変移量とアンカー変移量とについて評価すれば、双方向予測の２つのベクトルと推定変移量との各乖離度を得ることができる。このため、上式では、推定変移量とアンカー変移量の乖離度を評価尺度として用いている。 Apparently, even if there are two bi-directional motion vectors, the directions of the two vectors are dependent on the anchor shift amount. For this reason, if the estimated shift amount and the anchor shift amount are evaluated, the respective divergence degrees between the two vectors of bidirectional prediction and the estimated shift amount can be obtained. For this reason, in the above equation, the degree of deviation between the estimated shift amount and the anchor shift amount is used as an evaluation scale.

（ii）spatial directの場合
アンカー変移量の値によって、以下のいずれかを決定する。 (Ii) Spatial direct One of the following is determined according to the value of the anchor displacement.

・双方向予測の動きベクトルを零ベクトルに設定する
・近傍マクロブロックの動きベクトルから導出する
双方向予測の動きベクトルをｄ_L0及びｄ_L1とし、各参照フレームを第ｔ−ｒ_c0フレーム及び第ｔ−ｒ_c1フレームとすると、次の条件を満たす場合、歪み量に対する重み付けは行わないこととする。 - a motion vector of the bidirectional prediction to be derived from the motion vectors of-neighboring macroblocks to set the motion vector of the bidirectional prediction to zero vector and d _L0 and d _L1, each reference frame the t-r _c0 frame and the t Assuming that _{−rc 1} frame is used, weighting is not applied to the distortion amount when the following condition is satisfied.

本発明では、このような評価関数を用いて、符号化する際に用いる変位量（符号化器変移量）と符号化に先立って算出した推定変位量との乖離度と、その推定変位量の大きさとを評価して、それに基づいて、スキップモードで符号化したのでは大きな画質劣化を含んでいる可能性が高いことを判断する場合は、スキップモード以外の予測モードをそのブロックの最適な予測モードとして決定するようにするのである。 In the present invention, using such an evaluation function, the degree of deviation between the displacement amount (encoder displacement amount) used for encoding and the estimated displacement amount calculated prior to encoding, and the estimated displacement amount When the size is evaluated and it is determined that coding in the skip mode is likely to include large image quality degradation based on the size, a prediction mode other than the skip mode is selected as the optimal prediction of the block. The mode is determined.

本発明では、時空間コントラスト感度関数に基づく歪み量への重み付けを行った場合、スキップモードが大きな画質劣化を含んでいるにもかかわらず、最適な予測モードとして選択されることがないよう、スキップモードの主観画質を適切に評価した符号化歪みの尺度を導入することで、画質劣化の誘発を回避している。 In the present invention, when weighting is applied to the distortion amount based on the spatio-temporal contrast sensitivity function, the skip mode is selected so as not to be selected as the optimum prediction mode even though the skip mode includes a large image quality degradation. By introducing a measure of coding distortion that appropriately evaluates the subjective image quality of the mode, image quality degradation is avoided.

これにより、本発明によれば、スキップモードにおける画質劣化を回避し、スキップモードの符号量削減のメリットを最大限享受することが可能となる。つまり、動きベクトルとして予測ベクトルを用いることを指示するスキップモードに対しても、視覚的には検知され難い領域に対して符号量の削減を行うため、復号画像の主観画質を保ちながら、効率的に符号量を削減できるようになる。 As a result, according to the present invention, it is possible to avoid image quality deterioration in the skip mode and enjoy the maximum benefit of code amount reduction in the skip mode. In other words, even in the skip mode that instructs to use a prediction vector as a motion vector, the code amount is reduced for a region that is difficult to detect visually, so that it is efficient while maintaining the subjective image quality of the decoded image. Therefore, the amount of codes can be reduced.

以下、実施の形態に従って本発明を詳細に説明する。 Hereinafter, the present invention will be described in detail according to embodiments.

図１に、本発明を具備する映像符号化装置１の装置構成を図示する。 FIG. 1 shows a device configuration of a video encoding device 1 having the present invention.

本発明を具備する映像符号化装置１は、Ｈ．２６４に従って動画像を符号化する処理を行うものであり、この図に示すように、符号化対象マクロブロックの符号化に用いる符号化パラメータを決定する符号化パラメータ決定部１０と、符号化パラメータ決定部１０の決定した符号化パラメータを使って符号化対象マクロブロックを符号化する符号化部２０とを備える。 The video encoding apparatus 1 provided with the present invention is an H.264 standard. H.264 is used to perform processing for encoding a moving image. As shown in this figure, an encoding parameter determination unit 10 that determines an encoding parameter used for encoding an encoding target macroblock, and an encoding parameter determination And an encoding unit 20 that encodes the encoding target macroblock using the encoding parameter determined by the unit 10.

この符号化パラメータ決定部１０は、符号化パラメータの１つである予測モードを決定するために、符号化対象フレーム信号と参照フレーム信号と量子化パラメータとを入力として、符号化対象マクロブロックの予測モードを決定するという処理を行う予測モード決定部１１を備えるものであり、そして、この予測モード決定部１１は、その決定を行うために、符号化対象マクロブロックの推定変位量を算出する推定変位量算出部１２を備える。 The encoding parameter determination unit 10 receives the encoding target frame signal, the reference frame signal, and the quantization parameter as inputs to determine a prediction mode that is one of the encoding parameters, and predicts the encoding target macroblock. A prediction mode determination unit 11 that performs a process of determining a mode, and the prediction mode determination unit 11 calculates an estimated displacement of an encoding target macroblock in order to perform the determination. An amount calculation unit 12 is provided.

図２に、符号化パラメータ決定部１０の実行するフローチャートの一例を図示する。ここで、このフローチャートでは、符号化パラメータ決定部１０が量子化パラメータおよび予測モードを決定することを想定している。 FIG. 2 illustrates an example of a flowchart executed by the encoding parameter determination unit 10. Here, in this flowchart, it is assumed that the encoding parameter determination unit 10 determines the quantization parameter and the prediction mode.

符号化パラメータ決定部１０は、符号化対象マクロブロックの符号化に用いる量子化パラメータおよび予測モードを決定する場合には、図２のフローチャートに示すように、先ず最初に、ステップＳ１０で、レジスタＣに対して大きな値を示す初期コストを格納するとともに、レジスタＭに対して意味のない値を格納することで、レジスタＣおよびレジスタＭを初期化する。 When determining the quantization parameter and the prediction mode to be used for encoding the encoding target macroblock, the encoding parameter determination unit 10 firstly, in step S10, in the register C, as shown in the flowchart of FIG. Register C and register M are initialized by storing an initial cost indicating a large value for, and storing a meaningless value for register M.

続いて、ステップＳ１１で、量子化パラメータの値を格納する変数ＱＰに、量子化パラメータの最小値ＱＰmin を設定する。 Subsequently, in step S11, the minimum value QPmin of the quantization parameter is set to the variable QP that stores the value of the quantization parameter.

続いて、ステップＳ１２で、変数ＱＰに設定した量子化パラメータを指定して予測モード決定部１１を起動することで、変数ＱＰに設定した量子化パラメータにおける最適な予測モードを決定する。このとき実行する予測モードの決定処理については、図３及び図５のフローチャートに従って後述する。 Subsequently, in step S12, the quantization mode set in the variable QP is designated and the prediction mode determination unit 11 is activated to determine the optimum prediction mode for the quantization parameter set in the variable QP. The prediction mode determination process executed at this time will be described later according to the flowcharts of FIGS. 3 and 5.

続いて、ステップＳ１３で、変数ＱＰに設定した量子化パラメータと、ステップＳ１２で決定した予測モードとを用いて符号化する場合のコストを算出する。 Subsequently, in step S13, the cost for encoding using the quantization parameter set in the variable QP and the prediction mode determined in step S12 is calculated.

続いて、ステップＳ１４で、ステップＳ１３で算出したコストがレジスタＣに格納されるコストよりも小さいのか否かを判断する。 Subsequently, in step S14, it is determined whether or not the cost calculated in step S13 is smaller than the cost stored in the register C.

この判断処理に従って、ステップＳ１３で算出したコストの方が小さいことを判断するときには、ステップＳ１５に進んで、ステップＳ１３で算出したコストをレジスタＣに格納し、続くステップＳ１６で、変数ＱＰに設定した量子化パラメータと、ステップＳ１２で決定した予測モードの識別情報との組情報をレジスタＭに格納する。一方、ステップＳ１３で算出したコストの方が大きいことを判断するときには、このステップＳ１５，１６の処理を省略する。 When it is determined that the cost calculated in step S13 is smaller according to this determination process, the process proceeds to step S15, the cost calculated in step S13 is stored in the register C, and the variable QP is set in the subsequent step S16. The set information of the quantization parameter and the identification information of the prediction mode determined in step S12 is stored in the register M. On the other hand, when it is determined that the cost calculated in step S13 is larger, the processes in steps S15 and S16 are omitted.

続いて、ステップＳ１７で、変数ＱＰに設定した量子化パラメータの値がその最大値ＱＰmax を超えたのか否かを判断して、最大値ＱＰmax を超えていないことを判断するときには、ステップＳ１８に進んで、変数ＱＰに設定した量子化パラメータの値を規定量ΔＱＰだけ増分させてから、ステップＳ１２の処理に戻る。 Subsequently, in step S17, it is determined whether or not the value of the quantization parameter set in the variable QP exceeds the maximum value QPmax. When determining that the value does not exceed the maximum value QPmax, the process proceeds to step S18. Thus, the value of the quantization parameter set in the variable QP is incremented by the specified amount ΔQP, and the process returns to step S12.

このようにして、ステップＳ１２〜ステップＳ１８の処理を繰り返していくことで、ステップＳ１７で、変数ＱＰに設定した量子化パラメータの値がその最大値ＱＰmax を超えたことを判断すると、ステップＳ１９に進んで、レジスタＭに格納される量子化パラメータおよび予測モードを用いて符号化することを決定して、量子化パラメータおよび予測モードの決定処理を終了する。 When it is determined in this way that the value of the quantization parameter set in the variable QP exceeds the maximum value QPmax in step S17 by repeating the processing in steps S12 to S18, the process proceeds to step S19. Thus, it is determined that the encoding is performed using the quantization parameter and the prediction mode stored in the register M, and the determination process of the quantization parameter and the prediction mode is ended.

〔１〕第１の実施形態例
図３に、予測モード決定部１１の実行するフローチャートの一実施形態例を図示する。 [1] First Embodiment FIG. 3 illustrates an embodiment of a flowchart executed by the prediction mode determination unit 11.

次に、このフローチャートに従って、本実施形態例において予測モード決定部１１が実行する予測モードの決定処理について詳細に説明する。 Next, according to this flowchart, the prediction mode determination process executed by the prediction mode determination unit 11 in the present embodiment will be described in detail.

予測モード決定部１１は、符号化対象マクロブロックについて量子化パラメータを指定して予測モードの決定要求が発行されると、図３のフローチャートに示すように、先ず最初に、ステップＳ１００で、レジスタＸに対して予測モードの初期値（初期値となる予測モードの識別情報）を設定し、さらに、Ｒ−Ｄコストを格納することになる２つのレジスタＣ０，Ｃ１に対して大きな値を示す初期コストを格納するとともに、予測モードの識別情報を格納することになる２つのレジスタＭ０，Ｍ１に対して意味のない値を格納することで、レジスタＸ，Ｃ０，Ｃ１，Ｍ０，Ｍ１を初期化する。 When a prediction mode determination request is issued by designating a quantization parameter for the encoding target macroblock, the prediction mode determination unit 11 first, as shown in the flowchart of FIG. Is set to the initial value of the prediction mode (prediction mode identification information as the initial value), and the initial cost indicating a large value for the two registers C0 and C1 that store the RD cost. And the registers X, C0, C1, M0, and M1 are initialized by storing meaningless values in the two registers M0 and M1 that store the prediction mode identification information.

なお、以下に説明するように、これらのレジスタＸ，Ｃ０，Ｃ１，Ｍ０，Ｍ１の他に、符号量を格納するレジスタαと、未定乗数を格納するレジスタβと、重み付き歪み量を格納するレジスタγ０と、重みなし歪み量を格納するレジスタγ１という４つのレジスタを使用している。 As will be described below, in addition to these registers X, C0, C1, M0, and M1, a register α that stores a code amount, a register β that stores an undetermined multiplier, and a weighted distortion amount are stored. Four registers are used: a register γ0 and a register γ1 that stores an unweighted distortion amount.

続いて、ステップＳ１０１で、符号化対象マクロブロックの変移量を推定する。この推定手法については、外部より与えられるものとする。例えば、Ｈ．２６４の参照ソフトＪＭが算出する動きベクトルを、以下で使用する変移量の推定値として用いることも可能である。あるいは、符号化対象マクロブロックと参照マクロブロックとの絶対値誤差和を最小化する規範に従って、推定変位量を算出することも可能である。 Subsequently, in step S101, the shift amount of the encoding target macroblock is estimated. This estimation method is assumed to be given from the outside. For example, H.M. It is also possible to use a motion vector calculated by the H.264 reference software JM as an estimated value of the shift amount used below. Alternatively, it is also possible to calculate the estimated displacement amount according to a standard for minimizing the sum of absolute value errors between the encoding target macroblock and the reference macroblock.

続いて、ステップＳ１０２で、レジスタＸの予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、その予測モードを用いて符号化する場合の符号量を算出し、その算出した値をレジスタαに書き出す。具体的な算出方法は、Ｈ．２６４の参照ソフトＪＭの方法に従う。 Subsequently, in step S102, the prediction mode of the register X, the prediction vector, the quantization parameter, the encoding target frame signal, and the reference frame signal are input, and the code amount when encoding using the prediction mode is calculated. The calculated value is written to the register α. The specific calculation method is as follows. It follows the method of H.264 reference software JM.

続いて、ステップＳ１０３で、レジスタＸの予測モード、量子化パラメータを入力として、その予測モードを用いて符号化する場合の未定乗数を算出し、その算出した値をレジスタβに書き出す。具体的な算出方法は、Ｈ．２６４の参照ソフトＪＭの方法に従う。 Subsequently, in step S103, the prediction mode and the quantization parameter of the register X are input, an undetermined multiplier for encoding using the prediction mode is calculated, and the calculated value is written to the register β. The specific calculation method is as follows. It follows the method of H.264 reference software JM.

続いて、ステップＳ１０４で、最初に、ステップＳ１０１で算出した推定変位量に基づいて重みを決定し、次に、レジスタＸの予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、それらの入力信号とその決定した重みとに基づいて、その予測モードを用いて符号化する場合の重み付き歪み量を算出し、その算出した値をレジスタγ０に書き出す。具体的な算出方法は、前述した式（Ｄ)(〔数９〕に示した式）に従う。 Subsequently, in step S104, a weight is first determined based on the estimated displacement calculated in step S101, and then the prediction mode, prediction vector, quantization parameter, encoding target frame signal, reference frame of the register X are determined. Based on the input signal and the determined weight, the weighted distortion amount when encoding using the prediction mode is calculated, and the calculated value is written to the register γ0. A specific calculation method follows the above-described formula (D) (the formula shown in [Equation 9]).

続いて、ステップＳ１０５で、レジスタαに格納される符号量と、レジスタβに格納される未定乗数と、レジスタγ０に格納される重み付き歪み量とを読み出して、Ｒ−Ｄコストを算出する。具体的な算出方法は、前述した式（Ｃ)(〔数８〕に示した式）に従う。 Subsequently, in step S105, the code amount stored in the register α, the undetermined multiplier stored in the register β, and the weighted distortion amount stored in the register γ0 are read to calculate the RD cost. A specific calculation method follows the above-described formula (C) (the formula shown in [Equation 8]).

続いて、ステップＳ１０６で、その算出したＲ−ＤコストとレジスタＣ０の値とを比較して、その算出したＲ−Ｄコストの方がレジスタＣ０の値よりも小さいことを判断するときには、ステップＳ１０７に進んで、その算出したＲ−ＤコストをレジスタＣ０に格納し、続くステップＳ１０８で、レジスタＸに格納されている予測モードの識別情報をレジスタＭ０に格納する。一方、ステップＳ１０６で、算出したＲ−Ｄコストの方がレジスタＣ０の値よりも大きいことを判断するときには、このステップＳ１０７，１０８の処理を省略する。 Subsequently, in step S106, when the calculated RD cost is compared with the value of the register C0 and it is determined that the calculated RD cost is smaller than the value of the register C0, step S107 is performed. Then, the calculated RD cost is stored in the register C0, and in step S108, the prediction mode identification information stored in the register X is stored in the register M0. On the other hand, when it is determined in step S106 that the calculated RD cost is larger than the value of the register C0, the processes in steps S107 and 108 are omitted.

このステップＳ１０４〜ステップＳ１０８の処理と並列処理する形で、ステップＳ１０４ｘ〜ステップＳ１０８ｘの処理を実行する。 The processes in steps S104x to S108x are executed in parallel with the processes in steps S104 to S108.

すなわち、ステップＳ１０４ｘで、レジスタＸの予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、その予測モードを用いて符号化する場合の重みなし歪み量を算出し、その算出した値をレジスタγ１に書き出す。具体的な算出方法は、前述した式（Ｄ)(〔数９〕に示した式）で重み付けがない（Ｗ_k,l ^Y＝Ｗ_k,l ^U＝Ｗ_k,l ^V＝１）ものとしたものに従うか、あるいは、前述した式（Ｂ)(〔数２〕に示した式）に従う。 That is, in step S104x, the prediction mode, the prediction vector, the quantization parameter, the encoding target frame signal, and the reference frame signal of the register X are input, and the unweighted distortion amount when encoding using the prediction mode is calculated. The calculated value is written in the register γ1. As a specific calculation method, there is no weighting (W _{k, l} ^Y = W _{k, l} ^U = W _{k, l} ^V = 1) in the above-described formula (D) (the formula shown in [Equation 9]). Or according to the above-described formula (B) (the formula shown in [Equation 2]).

続いて、ステップＳ１０５ｘで、レジスタαに格納される符号量と、レジスタβに格納される未定乗数と、レジスタγ１に格納される重みなし歪み量とを読み出して、Ｒ−Ｄコストを算出する。具体的な算出方法は、前述した式（Ｃ)(〔数８〕に示した式）で歪み量を重みなしの値を用いるものに従うか、あるいは、前述した式（Ａ)(〔数１〕に示した式）に従う。 Subsequently, in step S105x, the code amount stored in the register α, the undetermined multiplier stored in the register β, and the unweighted distortion amount stored in the register γ1 are read to calculate the RD cost. The specific calculation method is based on the above-described equation (C) (the equation shown in [Equation 8]) using a distortion-free value, or the above-described equation (A) ([Equation 1]). Follow the formula shown in

続いて、ステップＳ１０６ｘで、その算出したＲ−ＤコストとレジスタＣ１の値とを比較して、その算出したＲ−Ｄコストの方がレジスタＣ１の値よりも小さいことを判断するときには、ステップＳ１０７ｘに進んで、その算出したＲ−ＤコストをレジスタＣ１に格納し、続くステップＳ１０８ｘで、レジスタＸに格納されている予測モードの識別情報をレジスタＭ１に格納する。一方、ステップＳ１０６ｘで、算出したＲ−Ｄコストの方がレジスタＣ１の値よりも大きいことを判断するときには、このステップＳ１０７ｘ，１０８ｘの処理を省略する。 Subsequently, in step S106x, when the calculated RD cost is compared with the value of the register C1, and it is determined that the calculated RD cost is smaller than the value of the register C1, step S107x Then, the calculated RD cost is stored in the register C1, and in step S108x, the prediction mode identification information stored in the register X is stored in the register M1. On the other hand, when it is determined in step S106x that the calculated RD cost is larger than the value of the register C1, the processes in steps S107x and 108x are omitted.

ステップＳ１０８，ステップＳ１０８ｘの処理を終了すると、続いて、ステップＳ１０９で、全ての予測モードを処理したのか否かを判断して、全ての予測モードを処理してないことを判断するときには、ステップＳ１１０に進んで、予め定められる順番に従って未処理の予測モードの中から予測モードを１つ選択して、その選択した予測モードの識別情報をレジスタＸに格納してから、ステップＳ１０２に処理に戻る。 When the processes of step S108 and step S108x are finished, subsequently, in step S109, it is determined whether or not all prediction modes have been processed, and when it is determined that not all prediction modes have been processed, step S110 is performed. Then, one prediction mode is selected from the unprocessed prediction modes in accordance with a predetermined order, the identification information of the selected prediction mode is stored in the register X, and the process returns to step S102.

このようにして、ステップＳ１０２〜ステップＳ１１０の処理を繰り返していくことで、ステップＳ１０９で、全ての予測モードを処理したことを判断すると、ステップＳ１１１に進んで、レジスタＭ０に格納される予測モードの識別情報がスキップモードであることを示しているのか否かを判断する。 In this manner, by repeating the processing of step S102 to step S110, when it is determined in step S109 that all prediction modes have been processed, the process proceeds to step S111, and the prediction mode stored in the register M0 is determined. It is determined whether or not the identification information indicates a skip mode.

すなわち、重み付き歪み量を用いて評価したＲ−Ｄコスト最小の予測モードがスキップモードであるのか否かを判断するのである。 That is, it is determined whether or not the prediction mode with the minimum RD cost evaluated using the weighted distortion amount is the skip mode.

このステップＳ１１１の判断処理に従って、レジスタＭ０に格納される予測モードの識別情報がスキップモードでないことを判断するときには、スキップモードによる画質劣化が問題とならないことから、ステップＳ１１５に進んで、レジスタＭ０に格納される予測モードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 When it is determined that the prediction mode identification information stored in the register M0 is not the skip mode according to the determination process in step S111, the image quality deterioration due to the skip mode does not matter, so the process proceeds to step S115 and is stored in the register M0. The stored prediction mode is output as the optimum prediction mode for the encoding target block, and the process is terminated.

一方、ステップＳ１１１の判断処理に従って、レジスタＭ０に格納される予測モードの識別情報がスキップモードであることを判断するときには、ステップＳ１１２に進んで、レジスタＭ１に格納される予測モードの識別情報がスキップモードであることを示しているのか否かを判断する。 On the other hand, when it is determined that the prediction mode identification information stored in the register M0 is the skip mode according to the determination process in step S111, the process proceeds to step S112, and the prediction mode identification information stored in the register M1 is skipped. It is determined whether or not the mode is indicated.

すなわち、重みなし歪み量を用いて評価したＲ−Ｄコスト最小の予測モードがスキップモードであるのか否かを判断するのである。 That is, it is determined whether or not the prediction mode with the minimum RD cost evaluated using the unweighted distortion amount is the skip mode.

このステップＳ１１２の判断処理に従って、レジスタＭ１に格納される予測モードの識別情報がスキップモードであることを判断するときには、重み付き歪み量を用いても、重みなし歪み量を用いてもスキップモードが選択されたことから、ステップＳ１１５に進んで、レジスタＭ０に格納される予測モードであるスキップモードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 When it is determined that the prediction mode identification information stored in the register M1 is the skip mode according to the determination process in step S112, the skip mode is set regardless of whether the weighted distortion amount or the unweighted distortion amount is used. Since it has been selected, the process proceeds to step S115, the skip mode, which is the prediction mode stored in the register M0, is output as the optimum prediction mode for the block to be encoded, and the process is terminated.

一方、このステップＳ１１２の判断処理に従って、レジスタＭ１に格納される予測モードの識別情報がスキップモードでないことを判断するときは、ステップＳ１１３に進んで、ステップＳ１０１で推定した推定変位量と符号化対象ブロックの予測ベクトルとの乖離度が閾値より大きく、かつ、その推定変位量の大きさが閾値より大きいのか否かの判定を行う。 On the other hand, when it is determined that the prediction mode identification information stored in the register M1 is not the skip mode according to the determination process in step S112, the process proceeds to step S113, and the estimated displacement amount and the encoding target estimated in step S101 are processed. It is determined whether the degree of deviation from the predicted vector of the block is greater than a threshold and whether the estimated displacement is greater than the threshold.

すなわち、ステップＳ１１１でＲ−Ｄコスト最小の予測モードがスキップモードである判断していることで、実際に符号化する場合には周辺のマクロブロックの動きベクトルから求められる予測ベクトルを動きベクトルとして用いることになることから、ステップＳ１０１で推定した推定変位量とその予測ベクトルとを判断対象として、２つの変位量の乖離度が閾値より大きく、かつ、推定変位量の大きさが閾値より大きいのか否かの判定を行うのである。 That is, since it is determined in step S111 that the prediction mode with the minimum RD cost is the skip mode, the prediction vector obtained from the motion vectors of the surrounding macroblocks is used as the motion vector when actually encoding. Therefore, whether or not the difference between the two displacement amounts is larger than the threshold value and the estimated displacement amount is larger than the threshold value with the estimated displacement amount estimated in step S101 and the predicted vector as a determination target. This determination is made.

このステップＳ１１３の判断処理に従って、２つの変位量の乖離度が閾値より大きく、かつ、推定変位量の大きさが閾値より大きいという条件が成立することを判断するときには、ステップＳ１１４に進んで、レジスタＭ１に格納される予測モードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 When it is determined that the condition that the degree of deviation between the two displacement amounts is greater than the threshold value and the estimated displacement amount is greater than the threshold value according to the determination process in step S113, the process proceeds to step S114 and the register is registered. The prediction mode stored in M1 is output as the optimal prediction mode for the encoding target block, and the process is terminated.

一方、このステップＳ１１３の判断処理に従って、２つの変位量の乖離度が閾値より大きく、かつ、推定変位量の大きさが閾値より大きいという条件が成立しないことを判断するときには、ステップＳ１１５に進んで、レジスタＭ０に格納される予測モードであるスキップモードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 On the other hand, when it is determined that the condition that the degree of deviation between the two displacement amounts is greater than the threshold value and the estimated displacement amount is greater than the threshold value is not satisfied according to the determination process in step S113, the process proceeds to step S115. Then, the skip mode, which is the prediction mode stored in the register M0, is output as the optimum prediction mode for the encoding target block, and the process is terminated.

このようにして、本実施形態例では、予測モード決定部１１は、符号化対象マクロブロックについて、重み付きの歪み量を使って求められたＲ−Ｄコスト最小の予測モードがスキップモードであっても、スキップモードで符号化したのでは大きな画質劣化を含んでいる可能性が高いことを判断する場合には、重みなしの歪み量を使って求められたＲ−Ｄコスト最小の予測モードを最適な予測モードとして決定するように処理するのである。 Thus, in this embodiment, the prediction mode determination unit 11 determines that the prediction mode with the minimum RD cost obtained using the weighted distortion amount for the encoding target macroblock is the skip mode. However, when it is determined that coding in the skip mode is likely to include large image quality deterioration, the prediction mode with the minimum RD cost obtained using the unweighted distortion amount is optimal. It is processed so as to be determined as a proper prediction mode.

図４に、本実施形態例を実現する予測モード決定部１１の装置構成の一例を図示する。 FIG. 4 illustrates an example of a device configuration of the prediction mode determination unit 11 that realizes the present embodiment.

ここで、１００は変位量記憶部、１０１は予測ベクトル算出部、１０２は予測ベクトル記憶部、１０３は初期モード設定部、１０４はモード記憶部、１０５は符号量算出部、１０６は符号量記憶部、１０７は重み付き歪み量算出部、１０８は重み付き歪み量記憶部、１０９は重みなし歪み量算出部、１１０は重みなし歪み量記憶部、１１１は未定乗数算出部、１１２は未定乗数記憶部、１１３はコスト算出部、１１４はコスト記憶部、１１５は重みなしコスト算出部、１１６は重みなしコスト記憶部、１１７は最小コスト判定部、１１８は最小コスト記憶部、１１９は重みなし最小コスト記憶部、１２０はモード更新部、１２１は最適モード記憶部、１２２は重みなし最適モード記憶部、１２３は最終モード判定部、１２４はモード設定部、１２５は最適モード出力部である。 Here, 100 is a displacement amount storage unit, 101 is a prediction vector calculation unit, 102 is a prediction vector storage unit, 103 is an initial mode setting unit, 104 is a mode storage unit, 105 is a code amount calculation unit, and 106 is a code amount storage unit 107 is a weighted distortion amount calculation unit, 108 is a weighted distortion amount storage unit, 109 is an unweighted distortion amount calculation unit, 110 is an unweighted distortion amount storage unit, 111 is an undetermined multiplier calculation unit, and 112 is an undetermined multiplier storage unit. 113 is a cost calculation unit, 114 is a cost storage unit, 115 is an unweighted cost calculation unit, 116 is an unweighted cost storage unit, 117 is a minimum cost determination unit, 118 is a minimum cost storage unit, and 119 is an unweighted minimum cost storage. , 120 is a mode update unit, 121 is an optimal mode storage unit, 122 is an unweighted optimal mode storage unit, 123 is a final mode determination unit, 124 is a mode setting unit, 25 is the optimal mode output unit.

次に、これらの各処理部について説明する。 Next, each of these processing units will be described.

〔変位量記憶部１００〕
変位量記憶部１００は、図示しない推定変位量算出部の算出した符号化対象マクロブロックについての推定変位量を格納する。この推定変位量の推定方法としては、例えば、Ｈ．２６４の参照ソフトＪＭが算出する動きベクトルを変移量の推定値として用いることも可能である。あるいは、符号化対象マクロブロックと参照マクロブロックとの絶対値誤差和を最小化する規範に従って、推定変位量を算出することも可能である。 [Displacement storage unit 100]
The displacement amount storage unit 100 stores the estimated displacement amount for the encoding target macroblock calculated by the estimated displacement amount calculation unit (not shown). As an estimation method of the estimated displacement amount, for example, H.264. It is also possible to use a motion vector calculated by the H.264 reference software JM as an estimated value of the shift amount. Alternatively, it is also possible to calculate the estimated displacement amount according to a standard for minimizing the sum of absolute value errors between the encoding target macroblock and the reference macroblock.

〔予測ベクトル算出部１０１〕
予測ベクトル算出部１０１は、符号化対象マクロブロックに隣接するマクロブロックの動きベクトル、マクロブロックの分割情報を入力として、符号化対象マクロブロックに対する予測ベクトルを算出し、予測ベクトル記憶部１０２に書き出す。具体的な算出方法は、Ｈ．２６４の規定に従う。 [Predicted vector calculation unit 101]
Prediction vector calculation section 101 receives the motion vector of the macroblock adjacent to the encoding target macroblock and the macroblock division information as input, calculates a prediction vector for the encoding target macroblock, and writes it to prediction vector storage section 102. The specific calculation method is as follows. H.264 is followed.

〔初期モード設定部１０３〕
初期モード設定部１０３は、予測モードの初期値をモード記憶部１０４に書き出す。 [Initial mode setting unit 103]
The initial mode setting unit 103 writes the initial value of the prediction mode in the mode storage unit 104.

〔符号量算出部１０５〕
符号量算出部１０５は、モード記憶部１０４の記憶する予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、その予測モードを用いて符号化する場合の符号量を算出し、その算出した値を符号量記憶部１０６に書き出す。具体的な算出方法は、Ｈ．２６４の参照ソフトＪＭの方法に従う。 [Code amount calculation unit 105]
The code amount calculation unit 105 receives the prediction mode, the prediction vector, the quantization parameter, the encoding target frame signal, and the reference frame signal stored in the mode storage unit 104, and encodes using the prediction mode. And the calculated value is written in the code amount storage unit 106. The specific calculation method is as follows. It follows the method of H.264 reference software JM.

〔重み付き歪み量算出部１０７〕
重み付き歪み量算出部１０７は、変位量記憶部１００に記憶される推定変位量に基づいて重みを決定するとともに、モード記憶部１０４の記憶する予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、それらの入力信号とその決定した重みとに基づいて、その予測モードを用いて符号化する場合の重み付き歪み量を算出し、その算出した値を重み付き歪み量記憶部１０８に書き出す。具体的な算出方法は、前述した式（Ｄ)(〔数９〕に示した式）に従う。 [Weighted distortion amount calculation unit 107]
The weighted distortion amount calculation unit 107 determines the weight based on the estimated displacement amount stored in the displacement amount storage unit 100, and stores the prediction mode, prediction vector, quantization parameter, and encoding target stored in the mode storage unit 104. Calculates the weighted distortion amount when encoding using the prediction mode based on the input signal and the determined weight, with the frame signal and the reference frame signal as input, and the calculated value is weighted. Write to the distortion amount storage unit 108. A specific calculation method follows the above-described formula (D) (the formula shown in [Equation 9]).

〔重みなし歪み量算出部１０９〕
重みなし歪み量算出部１０９は、モード記憶部１０４の記憶する予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、その予測モードを用いて符号化する場合の重みなし歪み量を算出し、その算出した値を重みなし歪み量記憶部１１０に書き出す。具体的な算出方法は、前述した式（Ｄ)(〔数９〕に示した式）で重み付けがないものとしたものに従うか、あるいは、前述した式（Ｂ)(〔数２〕に示した式）に従う。 [Unweighted distortion amount calculation unit 109]
The unweighted distortion amount calculation unit 109 receives the prediction mode, the prediction vector, the quantization parameter, the encoding target frame signal, and the reference frame signal stored in the mode storage unit 104 and performs encoding using the prediction mode. The unweighted distortion amount is calculated, and the calculated value is written in the unweighted distortion amount storage unit 110. The specific calculation method is based on the above formula (D) (the formula shown in [Equation 9]) with no weighting, or the above formula (B) (shown in [Equation 2]). Follow the formula.

〔未定乗数算出部１１１〕
未定乗数算出部１１１は、モード記憶部１０４の記憶する予測モード、量子化パラメータを入力として、その予測モードを用いて符号化する場合の未定乗数を算出し、その算出した値を未定乗数記憶部１１２に書き出す。具体的な算出方法は、Ｈ．２６４の参照ソフトＪＭの方法に従う。 [Undetermined multiplier calculation unit 111]
The undetermined multiplier calculation unit 111 receives the prediction mode and quantization parameter stored in the mode storage unit 104, calculates an undetermined multiplier when encoding using the prediction mode, and calculates the calculated value to the undetermined multiplier storage unit. Write to 112. The specific calculation method is as follows. It follows the method of H.264 reference software JM.

〔コスト算出部１１３〕
コスト算出部１１３は、符号量記憶部１０６に記憶される符号量と、重み付き歪み量記憶部１０８に記憶される重み付き歪み量と、未定乗数記憶部１１２に記憶される未定乗数とを読み出して、Ｒ−Ｄコストを算出して、その算出した値をコスト記憶部１１４に書き出す。具体的な算出方法は、前述した式（Ｃ)(〔数８〕に示した式）に従う。 [Cost calculation unit 113]
The cost calculation unit 113 reads the code amount stored in the code amount storage unit 106, the weighted distortion amount stored in the weighted distortion amount storage unit 108, and the undetermined multiplier stored in the undetermined multiplier storage unit 112. Then, the RD cost is calculated, and the calculated value is written in the cost storage unit 114. A specific calculation method follows the above-described formula (C) (the formula shown in [Equation 8]).

〔重みなしコスト算出部１１５〕
重みなしコスト算出部１１５は、符号量記憶部１０６に記憶される符号量と、重みなし歪み量記憶部１１０に記憶される重みなし歪み量と、未定乗数記憶部１１２に記憶される未定乗数とを読み出して、Ｒ−Ｄコストを算出し、その算出した値を重みなしコスト記憶部１１６に書き出す。具体的な算出方法は、前述した式（Ｃ)(〔数８〕に示した式）で歪み量を重みなしの値を用いるものに従うか、あるいは、前述した式（Ａ)(〔数１〕に示した式）に従う。 [Unweighted cost calculation unit 115]
The weightless cost calculation unit 115 includes a code amount stored in the code amount storage unit 106, an unweighted distortion amount stored in the unweighted distortion amount storage unit 110, and an undetermined multiplier stored in the undetermined multiplier storage unit 112. , The RD cost is calculated, and the calculated value is written in the unweighted cost storage unit 116. The specific calculation method is based on the above-described equation (C) (the equation shown in [Equation 8]) using a distortion-free value, or the above-described equation (A) ([Equation 1]). Follow the formula shown in

〔最小コスト判定部１１７〕
最小コスト判定部１１７は、コスト記憶部１１４に記憶されるＲ−Ｄコストと、最小コスト記憶部１１８に記憶される最小コストとを読み出して、そのＲ−Ｄコストがその最小コストよりも小さいのか否かの判定を行い、そのＲ−Ｄコストがその最小コストよりも小さい場合には、そのＲ−Ｄコストを最小コスト記憶部１１８に書き出すとともに、モード更新部１２０に制御を渡す。一方、そのＲ−Ｄコストがその最小コストよりも大きい場合には、最終モード判定部１２３に制御を渡す。 [Minimum cost determination unit 117]
The minimum cost determination unit 117 reads out the RD cost stored in the cost storage unit 114 and the minimum cost stored in the minimum cost storage unit 118, and whether the RD cost is smaller than the minimum cost. If the RD cost is smaller than the minimum cost, the RD cost is written to the minimum cost storage unit 118 and control is passed to the mode update unit 120. On the other hand, if the RD cost is greater than the minimum cost, control is passed to the final mode determination unit 123.

そして、重みなしコスト記憶部１１６に記憶されるＲ−Ｄコストと、重みなし最小コスト記憶部１１９に記憶される最小コストとを読み出して、そのＲ−Ｄコストがその最小コストよりも小さいのか否かの判定を行い、そのＲ−Ｄコストがその最小コストよりも小さい場合には、そのＲ−Ｄコストを重みなし最小コスト記憶部１１９に書き出すとともに、モード更新部１２０に制御を渡す。一方、そのＲ−Ｄコストがその最小コストよりも大きい場合には、最終モード判定部１２３に制御を渡す。 Then, the RD cost stored in the non-weighted cost storage unit 116 and the minimum cost stored in the non-weighted minimum cost storage unit 119 are read, and whether or not the RD cost is smaller than the minimum cost. If the RD cost is smaller than the minimum cost, the RD cost is written to the weightless minimum cost storage unit 119 and control is passed to the mode update unit 120. On the other hand, if the RD cost is greater than the minimum cost, control is passed to the final mode determination unit 123.

〔モード更新部１２０〕
モード更新部１２０は、最小コスト判定部１１７が最小コスト記憶部１１８にＲ−Ｄコストを書き出すときに、そのＲ−Ｄコストの算出元となった予測モード（その時点でモード記憶部１０４に記憶されている予測モード）の識別情報を最適モード記憶部１２１に書き出してから、最終モード判定部１２３に制御を渡す。 [Mode update unit 120]
When the minimum cost determination unit 117 writes the RD cost to the minimum cost storage unit 118, the mode update unit 120 predicts the prediction mode that is the calculation source of the RD cost (stored in the mode storage unit 104 at that time). Is written to the optimum mode storage unit 121, and control is passed to the final mode determination unit 123.

そして、最小コスト判定部１１７が重みなし最小コスト記憶部１１９にＲ−Ｄコストを書き出すときに、そのＲ−Ｄコストの算出元となった予測モード（その時点でモード記憶部１０４に記憶されている予測モード）の識別情報を重みなし最適モード記憶部１２２に書き出してから、最終モード判定部１２３に制御を渡す。 Then, when the minimum cost determination unit 117 writes the RD cost to the weightless minimum cost storage unit 119, the prediction mode that is the calculation source of the RD cost (stored in the mode storage unit 104 at that time) Is written to the optimum mode storage unit 122 without weight, and then the control is passed to the final mode determination unit 123.

〔最終モード判定部１２３〕
最終モード判定部１２３は、最小コスト判定部１１７やモード更新部１２０から制御が渡されると、モード設定部１２４が全ての予測モードの設定を終了したのか否かを判断して、全ての予測モードの設定を終了したことを判断するときには、最適モード出力部１２５に対して最適な予測モードの出力を指示し、全ての予測モードの設定を終了していないことを判断するときには、モード設定部１２４に対して次の予測モードの設定を指示する。 [Final mode determination unit 123]
When control is passed from the minimum cost determination unit 117 or the mode update unit 120, the final mode determination unit 123 determines whether or not the mode setting unit 124 has finished setting all prediction modes, and determines all prediction modes. When determining that the setting of the prediction mode has been completed, the optimal mode output unit 125 is instructed to output the optimum prediction mode. When determining that all the prediction modes have not been set, the mode setting unit 124 is selected. Is instructed to set the next prediction mode.

〔モード設定部１２４〕
モード設定部１２４は、最終モード判定部１２３から予測モードの設定指示があると、モード記憶部１０４に対して次の予測モードを設定する。 [Mode setting unit 124]
When there is a prediction mode setting instruction from the final mode determination unit 123, the mode setting unit 124 sets the next prediction mode in the mode storage unit 104.

〔最適モード出力部１２５〕
最適モード出力部１２５は、最終モード判定部１２３から最適な予測モードの出力指示があると、最適モード記憶部１２１に記憶される予測モードがスキップモードであるのか否かの判定を行い、スキップモードでない場合には、最適モード記憶部１２１に記憶される予測モードを符号化対象ブロックの最適な予測モードとして出力する。 [Optimum mode output unit 125]
The optimal mode output unit 125 determines whether or not the prediction mode stored in the optimal mode storage unit 121 is the skip mode when there is an instruction to output the optimal prediction mode from the final mode determination unit 123. Otherwise, the prediction mode stored in the optimum mode storage unit 121 is output as the optimum prediction mode of the encoding target block.

一方、最適モード記憶部１２１に記憶される予測モードがスキップモードである場合には、重みなし最適モード記憶部１２２に記憶される予測モードがスキップモードであるのか否かの判定を行い、スキップモードである場合には、最適モード記憶部１２１に記憶されるスキップモードを符号化対象ブロックの最適な予測モードとして出力する。 On the other hand, when the prediction mode stored in the optimal mode storage unit 121 is the skip mode, it is determined whether or not the prediction mode stored in the unweighted optimal mode storage unit 122 is the skip mode. If it is, the skip mode stored in the optimum mode storage unit 121 is output as the optimum prediction mode of the encoding target block.

一方、最適モード記憶部１２１に記憶される予測モードがスキップモードで、重みなし最適モード記憶部１２２に記憶される予測モードがスキップモードでない場合には、変位量記憶部１００に記憶される推定変位量と、予測ベクトル記憶部１０２に記憶される予測ベクトルとを読み出して、その推定変位量とその予測ベクトルとの乖離度を算出するとともに、変移推定量の大きさを算出して、その乖離度が閾値よりも大きく、かつ、その変移推定量の大きさが閾値よりも大きいのか否かの判定を行う。そして、その判定結果に基づいて、推定変位量と予測ベクトルとの乖離度が大きく、かつ、変移推定量が大きいという条件が成立する場合には、重みなし最適モード記憶部１２２に記憶される予測モードを符号化対象ブロックの最適な予測モードとして出力し、その条件が成立しない場合には、最適モード記憶部１２１に記憶されるスキップモードを符号化対象ブロックの最適な予測モードとして出力する。 On the other hand, when the prediction mode stored in the optimal mode storage unit 121 is the skip mode and the prediction mode stored in the unweighted optimal mode storage unit 122 is not the skip mode, the estimated displacement stored in the displacement amount storage unit 100 The amount and the prediction vector stored in the prediction vector storage unit 102 are read out, and the degree of deviation between the estimated displacement amount and the prediction vector is calculated, and the magnitude of the displacement estimation amount is calculated, and the degree of deviation is calculated. Is larger than the threshold value, and it is determined whether or not the magnitude of the transition estimation amount is larger than the threshold value. Then, based on the determination result, when the condition that the degree of deviation between the estimated displacement amount and the prediction vector is large and the displacement estimation amount is large is satisfied, the prediction stored in the unweighted optimum mode storage unit 122 The mode is output as the optimal prediction mode of the encoding target block, and when the condition is not satisfied, the skip mode stored in the optimal mode storage unit 121 is output as the optimal prediction mode of the encoding target block.

この図４に示す構成に従って、予測モード決定部１１は、図３のフローチャートの処理を実行することで、符号化対象マクロブロックについて、重み付きの歪み量を使って求められたＲ−Ｄコスト最小の予測モードがスキップモードであっても、スキップモードで符号化したのでは大きな画質劣化を含んでいる可能性が高いことを判断する場合には、重みなしの歪み量を使って求められたＲ−Ｄコスト最小の予測モードを符号化対象ブロックの最適な予測モードとして決定するように処理するのである。 According to the configuration shown in FIG. 4, the prediction mode determination unit 11 performs the processing of the flowchart of FIG. 3 to minimize the RD cost obtained using the weighted distortion amount for the encoding target macroblock. Even if the prediction mode is the skip mode, when it is determined that there is a high possibility that the image is encoded in the skip mode, the image quality is greatly deteriorated. -D The processing is performed so as to determine the prediction mode with the lowest cost as the optimum prediction mode of the encoding target block.

〔２〕第２の実施形態例
図５に、予測モード決定部１１の実行するフローチャートの他の実施形態例を図示する。 [2] Second Embodiment FIG. 5 illustrates another embodiment of the flowchart executed by the prediction mode determination unit 11.

予測モード決定部１１は、符号化対象マクロブロックについて量子化パラメータを指定して予測モードの決定要求が発行されると、図５のフローチャートに示すように、先ず最初に、ステップＳ２００で、レジスタＸに対して予測モードの初期値（初期値となる予測モードの識別情報）を設定し、さらに、Ｒ−Ｄコストを格納することになる３つのレジスタＣ０，Ｃ１，Ｃ２に対して大きな値を示す初期コストを格納するとともに、予測モードの識別情報を格納することになる３つのレジスタＭ０，Ｍ１，Ｍ２に対して意味のない値を格納することで、レジスタＸ，Ｃ０〜Ｃ２，Ｍ０〜Ｍ２を初期化する。 When a prediction mode determination request is issued by designating a quantization parameter for a macroblock to be encoded and a prediction mode determination request is issued, first, as shown in the flowchart of FIG. Is set to the initial value of the prediction mode (prediction mode identification information to be an initial value), and the three registers C0, C1, and C2 that store the RD cost have large values. By storing an insignificant value for the three registers M0, M1, and M2 that store the initial cost and the identification information of the prediction mode, the registers X, C0 to C2, and M0 to M2 are stored. initialize.

なお、以下に説明するように、これらのレジスタＸ，Ｃ０〜Ｃ２，Ｍ０〜Ｍ２の他に、符号量を格納するレジスタαと、未定乗数を格納するレジスタβと、重み付き歪み量を格納するレジスタγ０と、重みなし歪み量を格納するレジスタγ１という４つのレジスタを使用している。 As will be described below, in addition to these registers X, C0 to C2, and M0 to M2, a register α that stores a code amount, a register β that stores an undetermined multiplier, and a weighted distortion amount are stored. Four registers are used: a register γ0 and a register γ1 that stores an unweighted distortion amount.

続いて、ステップＳ２０１で、符号化対象マクロブロックの変移量を推定する。この推定手法については、外部より与えられるものとする。例えば、Ｈ．２６４の参照ソフトＪＭが算出する動きベクトルを、以下で使用する変移量の推定値として用いることも可能である。あるいは、符号化対象マクロブロックと参照マクロブロックとの絶対値誤差和を最小化する規範に従って、推定変位量を算出することも可能である。 Subsequently, in step S201, the shift amount of the encoding target macroblock is estimated. This estimation method is assumed to be given from the outside. For example, H.M. It is also possible to use a motion vector calculated by the H.264 reference software JM as an estimated value of the shift amount used below. Alternatively, it is also possible to calculate the estimated displacement amount according to a standard for minimizing the sum of absolute value errors between the encoding target macroblock and the reference macroblock.

続いて、ステップＳ２０２で、レジスタＸの予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、その予測モードを用いて符号化する場合の符号量を算出し、その算出した値をレジスタαに書き出す。具体的な算出方法は、Ｈ．２６４の参照ソフトＪＭの方法に従う。 Subsequently, in step S202, the prediction mode, the prediction vector, the quantization parameter, the encoding target frame signal, and the reference frame signal of the register X are input, and the code amount when encoding using the prediction mode is calculated. The calculated value is written to the register α. The specific calculation method is as follows. It follows the method of H.264 reference software JM.

続いて、ステップＳ２０３で、レジスタＸの予測モード、量子化パラメータを入力として、その予測モードを用いて符号化する場合の未定乗数を算出し、その算出した値をレジスタβに書き出す。具体的な算出方法は、Ｈ．２６４の参照ソフトＪＭの方法に従う。 Subsequently, in step S203, the prediction mode of the register X and the quantization parameter are input, an undetermined multiplier for encoding using the prediction mode is calculated, and the calculated value is written to the register β. The specific calculation method is as follows. It follows the method of H.264 reference software JM.

続いて、ステップＳ２０４で、最初に、ステップＳ２０１で算出した推定変位量に基づいて重みを決定し、次に、レジスタＸの予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、それらの入力信号とその決定した重みとに基づいて、その予測モードを用いて符号化する場合の重み付き歪み量を算出し、その算出した値をレジスタγ０に書き出す。具体的な算出方法は、前述した式（Ｄ)(〔数９〕に示した式）に従う。 Subsequently, in step S204, a weight is first determined based on the estimated displacement calculated in step S201, and then the prediction mode, prediction vector, quantization parameter, encoding target frame signal, reference frame of the register X are determined. Based on the input signal and the determined weight, the weighted distortion amount when encoding using the prediction mode is calculated, and the calculated value is written to the register γ0. A specific calculation method follows the above-described formula (D) (the formula shown in [Equation 9]).

続いて、ステップＳ２０５で、レジスタαに格納される符号量と、レジスタβに格納される未定乗数と、レジスタγ０に格納される重み付き歪み量とを読み出して、Ｒ−Ｄコストを算出する。具体的な算出方法は、前述した式（Ｃ)(〔数８〕に示した式）に従う。 Subsequently, in step S205, the code amount stored in the register α, the undetermined multiplier stored in the register β, and the weighted distortion amount stored in the register γ0 are read, and the RD cost is calculated. A specific calculation method follows the above-described formula (C) (the formula shown in [Equation 8]).

続いて、ステップＳ２０６で、その算出したＲ−ＤコストとレジスタＣ０の値とを比較して、その算出したＲ−Ｄコストの方がレジスタＣ０の値よりも小さいことを判断するときには、ステップＳ２０７に進んで、その算出したＲ−ＤコストをレジスタＣ０に格納し、続くステップＳ２０８で、レジスタＸに格納されている予測モードの識別情報をレジスタＭ０に格納する。 Subsequently, in step S206, when the calculated RD cost is compared with the value of the register C0 and it is determined that the calculated RD cost is smaller than the value of the register C0, step S207 is performed. Then, the calculated RD cost is stored in the register C0, and in step S208, the prediction mode identification information stored in the register X is stored in the register M0.

一方、ステップＳ２０６の判断処理で、ステップＳ２０５で算出したＲ−Ｄコストの方がレジスタＣ０の値よりも大きいことを判断するときには、ステップＳ２０９に進んで、その算出したＲ−ＤコストとレジスタＣ２の値とを比較し、その算出したＲ−Ｄコストの方がレジスタＣ２の値よりも小さいことを判断するときには、ステップＳ２１０に進んで、その算出したＲ−ＤコストをレジスタＣ２に格納し、続くステップＳ２１１で、レジスタＸに格納されている予測モードの識別情報をレジスタＭ１に格納する。一方、ステップＳ２０９で、その算出したＲ−Ｄコストの方が大きいことを判断するときには、このステップＳ２１０，２１１の処理を省略する。 On the other hand, when it is determined in step S206 that the RD cost calculated in step S205 is larger than the value in the register C0, the process proceeds to step S209, where the calculated RD cost and the register C2 are registered. When it is determined that the calculated RD cost is smaller than the value of the register C2, the process proceeds to step S210, and the calculated RD cost is stored in the register C2. In subsequent step S211, the prediction mode identification information stored in the register X is stored in the register M1. On the other hand, when it is determined in step S209 that the calculated RD cost is higher, the processes in steps S210 and 211 are omitted.

このようにして、レジスタＣ０には、これまでの処理の求められた最小コストが格納されるとともに、それに対応して、レジスタＭ０には、その最小コストを実現する予測モードの識別情報が格納され、そして、レジスタＣ２には、これまでの処理の求められた最小コストに続く小さなコストが格納されるとともに、それに対応して、レジスタＭ０には、その最小コストに続く小さなコストを実現する予測モードの識別情報が格納されることになる。 In this way, the register C0 stores the minimum cost that has been obtained so far, and correspondingly, the register M0 stores prediction mode identification information that realizes the minimum cost. In addition, the register C2 stores a small cost following the required minimum cost of the processing so far, and correspondingly, the register M0 has a prediction mode for realizing the small cost following the minimum cost. The identification information is stored.

このステップＳ２０４〜ステップＳ２１１の処理と並列処理する形で、ステップＳ２０４ｘ〜ステップＳ２０８ｘの処理を実行する。 The processes in steps S204x to S208x are executed in parallel with the processes in steps S204 to S211.

すなわち、ステップＳ２０４ｘで、レジスタＸの予測モード、予測ベクトル、量子化パラメータ、符号化対象フレーム信号、参照フレーム信号を入力として、その予測モードを用いて符号化する場合の重みなし歪み量を算出し、その算出した値をレジスタγ１に書き出す。具体的な算出方法は、前述した式（Ｄ)(〔数９〕に示した式）で重み付けがないものとしたものに従うか、あるいは、前述した式（Ｂ)(〔数２〕に示した式）に従う。 That is, in step S204x, the prediction mode, the prediction vector, the quantization parameter, the encoding target frame signal, and the reference frame signal of the register X are input, and the unweighted distortion amount when encoding using the prediction mode is calculated. The calculated value is written in the register γ1. The specific calculation method is based on the above formula (D) (the formula shown in [Equation 9]) with no weighting, or the above formula (B) (shown in [Equation 2]). Follow the formula.

続いて、ステップＳ２０５ｘで、レジスタαに格納される符号量と、レジスタβに格納される未定乗数と、レジスタγ１に格納される重みなし歪み量とを読み出して、Ｒ−Ｄコストを算出する。具体的な算出方法は、前述した式（Ｃ)(〔数８〕に示した式）で歪み量を重みなしの値を用いるものに従うか、あるいは、前述した式（Ａ)(〔数１〕に示した式）に従う。 Subsequently, in step S205x, the code amount stored in the register α, the undetermined multiplier stored in the register β, and the unweighted distortion amount stored in the register γ1 are read to calculate the RD cost. The specific calculation method is based on the above-described equation (C) (the equation shown in [Equation 8]) using a distortion-free value, or the above-described equation (A) ([Equation 1]). Follow the formula shown in

続いて、ステップＳ２０６ｘで、その算出したＲ−ＤコストとレジスタＣ１の値とを比較して、その算出したＲ−Ｄコストの方がレジスタＣ１の値よりも小さいことを判断するときには、ステップＳ２０７ｘに進んで、その算出したＲ−ＤコストをレジスタＣ１に格納し、続くステップＳ２０８ｘで、レジスタＸに格納されている予測モードの識別情報をレジスタＭ１に格納する。一方、ステップＳ２０６ｘで、算出したＲ−Ｄコストの方がレジスタＣ１の値よりも大きいことを判断するときには、このステップＳ２０７ｘ，２０８ｘの処理を省略する。 Subsequently, in step S206x, when the calculated RD cost is compared with the value of the register C1, and it is determined that the calculated RD cost is smaller than the value of the register C1, step S207x Then, the calculated RD cost is stored in the register C1, and the identification information of the prediction mode stored in the register X is stored in the register M1 in the subsequent step S208x. On the other hand, when it is determined in step S206x that the calculated RD cost is larger than the value of the register C1, the processes in steps S207x and 208x are omitted.

ステップＳ２０８，ステップＳ２０８ｘ，ステップＳ２１１の処理を終了すると、続いて、ステップＳ２１２で、全ての予測モードを処理したのか否かを判断して、全ての予測モードを処理してないことを判断するときには、ステップＳ２１３に進んで、予め定められる順番に従って未処理の予測モードの中から予測モードを１つ選択し、その選択した予測モードの識別情報をレジスタＸに格納してから、ステップＳ２０２に処理に戻る。 When the processing of step S208, step S208x, and step S211 is completed, then, in step S212, it is determined whether or not all prediction modes have been processed, and it is determined that not all prediction modes have been processed. Then, the process proceeds to step S213, one prediction mode is selected from the unprocessed prediction modes according to a predetermined order, the identification information of the selected prediction mode is stored in the register X, and the process proceeds to step S202. Return.

このようにして、ステップＳ２０２〜ステップＳ２１３の処理を繰り返していくことで、ステップＳ２１２で、全ての予測モードを処理したことを判断すると、ステップＳ２１４に進んで、レジスタＭ０に格納される予測モードの識別情報がスキップモードであることを示しているのか否かを判断する。 In this manner, by repeating the processing of step S202 to step S213, when it is determined in step S212 that all prediction modes have been processed, the process proceeds to step S214, and the prediction mode stored in the register M0 is determined. It is determined whether or not the identification information indicates a skip mode.

このステップＳ２１４の判断処理に従って、レジスタＭ０に格納される予測モードの識別情報がスキップモードでないことを判断するときには、スキップモードによる画質劣化が問題とならないことから、ステップＳ２１８に進んで、レジスタＭ０に格納される予測モードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 When it is determined that the prediction mode identification information stored in the register M0 is not the skip mode according to the determination process in step S214, the image quality deterioration due to the skip mode does not matter, so the process proceeds to step S218, and the register M0 is stored. The stored prediction mode is output as the optimum prediction mode for the encoding target block, and the process is terminated.

一方、ステップＳ２１４の判断処理に従って、レジスタＭ０に格納される予測モードの識別情報がスキップモードであることを判断するときには、ステップＳ２１５に進んで、レジスタＭ１に格納される予測モードの識別情報がスキップモードであることを示しているのか否かを判断する。 On the other hand, when it is determined that the prediction mode identification information stored in the register M0 is the skip mode according to the determination process in step S214, the process proceeds to step S215, and the prediction mode identification information stored in the register M1 is skipped. It is determined whether or not the mode is indicated.

このステップＳ２１５の判断処理に従って、レジスタＭ１に格納される予測モードの識別情報がスキップモードであることを判断するときには、重み付き歪み量を用いても、重みなし歪み量を用いてもスキップモードが選択されたことから、ステップＳ２１８に進んで、レジスタＭ０に格納される予測モードであるスキップモードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 When it is determined that the prediction mode identification information stored in the register M1 is the skip mode according to the determination process in step S215, the skip mode is set regardless of whether the weighted distortion amount or the unweighted distortion amount is used. Since it has been selected, the process proceeds to step S218, the skip mode, which is the prediction mode stored in the register M0, is output as the optimum prediction mode for the block to be encoded, and the process is terminated.

一方、このステップＳ２１５の判断処理に従って、レジスタＭ１に格納される予測モードの識別情報がスキップモードでないことを判断するときは、ステップＳ２１６に進んで、ステップＳ２０１で推定した推定変位量と符号化対象ブロックの予測ベクトルとの乖離度が閾値より大きく、かつ、その推定変位量の大きさが閾値より大きいのか否かの判定を行う。 On the other hand, when it is determined that the prediction mode identification information stored in the register M1 is not the skip mode according to the determination process in step S215, the process proceeds to step S216, and the estimated displacement amount and the encoding target estimated in step S201 are processed. It is determined whether the degree of deviation from the predicted vector of the block is greater than a threshold and whether the estimated displacement is greater than the threshold.

すなわち、ステップＳ２１４でＲ−Ｄコスト最小の予測モードがスキップモードである判断していることで、実際に符号化する場合には周辺のマクロブロックの動きベクトルから求められる予測ベクトルを動きベクトルとして用いることになることから、ステップＳ２０１で推定した推定変位量とその予測ベクトルとを判断対象として、２つの変位量の乖離度が閾値より大きく、かつ、推定変位量の大きさが閾値より大きいのか否かの判定を行うのである。 That is, since it is determined in step S214 that the prediction mode with the minimum RD cost is the skip mode, the prediction vector obtained from the motion vectors of the surrounding macroblocks is used as the motion vector when actually encoding. Therefore, whether or not the difference between the two displacement amounts is larger than the threshold value and the estimated displacement amount is larger than the threshold value with the estimated displacement amount estimated in step S201 and the predicted vector as a determination target. This determination is made.

このステップＳ２１６の判断処理に従って、２つの変位量の乖離度が閾値より大きく、かつ、推定変位量の大きさが閾値より大きいという条件が成立することを判断するときには、ステップＳ２１７に進んで、レジスタＭ２に格納される予測モードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 When it is determined that the condition that the degree of deviation between the two displacement amounts is greater than the threshold value and the estimated displacement amount is greater than the threshold value according to the determination processing in step S216, the process proceeds to step S217, where The prediction mode stored in M2 is output as the optimal prediction mode for the encoding target block, and the process is terminated.

一方、このステップＳ２１６の判断処理に従って、２つの変位量の乖離度が閾値より大きく、かつ、推定変位量の大きさが閾値より大きいという条件が成立しないことを判断するときには、ステップＳ２１８に進んで、レジスタＭ０に格納される予測モードであるスキップモードを符号化対象ブロックの最適な予測モードとして出力して、処理を終了する。 On the other hand, when it is determined that the condition that the degree of deviation between the two displacement amounts is greater than the threshold value and the estimated displacement amount is greater than the threshold value is not satisfied according to the determination process in step S216, the process proceeds to step S218. Then, the skip mode, which is the prediction mode stored in the register M0, is output as the optimum prediction mode for the encoding target block, and the process is terminated.

このようにして、本実施形態例では、予測モード決定部１１は、符号化対象マクロブロックについて、重み付きの歪み量を使って求められたＲ−Ｄコスト最小の予測モードがスキップモードであっても、スキップモードで符号化したのでは大きな画質劣化を含んでいる可能性が高いことを判断する場合には、その次に小さなＲ−Ｄコストの予測モードを最適な予測モードとして決定するのである。 Thus, in this embodiment, the prediction mode determination unit 11 determines that the prediction mode with the minimum RD cost obtained using the weighted distortion amount for the encoding target macroblock is the skip mode. However, when it is determined that coding in the skip mode is likely to include large image quality degradation, the prediction mode with the next smallest RD cost is determined as the optimum prediction mode. .

図６に、本実施形態例を実現する予測モード決定部１１の装置構成の一例を図示する。 FIG. 6 illustrates an example of a device configuration of the prediction mode determination unit 11 that realizes the present embodiment.

この図６に示す装置構成と図４に示す装置構成との違いは、この図６に示す構成では、新たに準最小コスト記憶部２００および準最適モード記憶部２０１を備えるとともに、図４に示す最小コスト判定部１１７とは異なる処理を実行する最小コスト判定部１１７αと、図４に示すモード更新部１２０とは異なる処理を実行するモード更新部１２０αと、図４に示す最適モード出力部１２５とは異なる処理を実行する最適モード出力部１２５αとを備えるという点である。 The difference between the device configuration shown in FIG. 6 and the device configuration shown in FIG. 4 is that the configuration shown in FIG. A minimum cost determination unit 117α that executes processing different from the minimum cost determination unit 117, a mode update unit 120α that executes processing different from the mode update unit 120 shown in FIG. 4, and an optimum mode output unit 125 shown in FIG. Is provided with an optimum mode output unit 125α that executes different processes.

次に、これらの各処理部について説明するが、変位量記憶部１００、予測ベクトル算出部１０１、初期モード設定部１０３、符号量算出部１０５、重み付き歪み量算出部１０７、重みなし歪み量算出部１０９、未定乗数算出部１１１、コスト算出部１１３および重みなしコスト算出部１１５の処理については、図４で説明したものと同じであるので、その説明を省略する。 Next, each of these processing units will be described. The displacement amount storage unit 100, the prediction vector calculation unit 101, the initial mode setting unit 103, the code amount calculation unit 105, the weighted distortion amount calculation unit 107, and the unweighted distortion amount calculation. The processing of unit 109, undetermined multiplier calculation unit 111, cost calculation unit 113, and unweighted cost calculation unit 115 is the same as that described with reference to FIG.

〔最小コスト判定部１１７α〕
最小コスト判定部１１７αは、コスト記憶部１１４に記憶されるＲ−Ｄコストと、最小コスト記憶部１１８に記憶される最小コストとを読み出して、そのＲ−Ｄコストがその最小コストよりも小さいのか否かの判定を行い、そのＲ−Ｄコストがその最小コストよりも小さい場合には、そのＲ−Ｄコストを最小コスト記憶部１１８に書き出すとともに、モード更新部１２０αに制御を渡す。 [Minimum cost determination unit 117α]
The minimum cost determination unit 117α reads out the RD cost stored in the cost storage unit 114 and the minimum cost stored in the minimum cost storage unit 118, and whether the RD cost is smaller than the minimum cost. If the RD cost is smaller than the minimum cost, the RD cost is written to the minimum cost storage unit 118 and control is passed to the mode update unit 120α.

一方、そのＲ−Ｄコストがその最小コストよりも大きい場合には、コスト記憶部１１４に記憶されるＲ−Ｄコストと、準最小コスト記憶部２００に記憶される準最小コスト（２番目に小さなコスト）とを読み出して、そのＲ−Ｄコストがその準最小コストよりも小さいのか否かの判定を行い、そのＲ−Ｄコストがその準最小コストよりも小さい場合には、そのＲ−Ｄコストを準最小コスト記憶部２００に書き出すとともに、モード更新部１２０αに制御を渡す。一方、そのＲ−Ｄコストがその準最小コストよりも大きい場合には、最終モード判定部１２３に制御を渡す。 On the other hand, when the RD cost is larger than the minimum cost, the RD cost stored in the cost storage unit 114 and the quasi-minimum cost stored in the quasi-minimum cost storage unit 200 (second smallest) Cost), and it is determined whether or not the RD cost is smaller than the quasi-minimum cost. If the RD cost is smaller than the quasi-minimum cost, the RD cost is read. Is written to the quasi-minimum cost storage unit 200 and control is passed to the mode update unit 120α. On the other hand, if the RD cost is greater than the quasi-minimum cost, control is passed to the final mode determination unit 123.

そして、重みなしコスト記憶部１１６に記憶されるＲ−Ｄコストと、重みなし最小コスト記憶部１１９に記憶される最小コストとを読み出して、そのＲ−Ｄコストがその最小コストよりも小さいのか否かの判定を行い、そのＲ−Ｄコストがその最小コストよりも小さい場合には、そのＲ−Ｄコストを重みなし最小コスト記憶部１１９に書き出すとともに、モード更新部１２０αに制御を渡す。一方、そのＲ−Ｄコストがその最小コストよりも大きい場合には、最終モード判定部１２３に制御を渡す。 Then, the RD cost stored in the non-weighted cost storage unit 116 and the minimum cost stored in the non-weighted minimum cost storage unit 119 are read, and whether or not the RD cost is smaller than the minimum cost. If the RD cost is smaller than the minimum cost, the RD cost is written to the weightless minimum cost storage unit 119 and control is passed to the mode update unit 120α. On the other hand, if the RD cost is greater than the minimum cost, control is passed to the final mode determination unit 123.

〔モード更新部１２０α〕
モード更新部１２０αは、最小コスト判定部１１７αが最小コスト記憶部１１８にＲ−Ｄコストを書き出すときに、そのＲ−Ｄコストの算出元となった予測モード（その時点でモード記憶部１０４に記憶されている予測モード）の識別情報を最適モード記憶部１２１に書き出してから、最終モード判定部１２３に制御を渡す。 [Mode update unit 120α]
When the minimum cost determination unit 117α writes the RD cost to the minimum cost storage unit 118, the mode update unit 120α stores the prediction mode that is the calculation source of the RD cost (stored in the mode storage unit 104 at that time). Is written to the optimum mode storage unit 121, and control is passed to the final mode determination unit 123.

そして、最小コスト判定部１１７αが準最小コスト記憶部２００にＲ−Ｄコストを書き出すときに、そのＲ−Ｄコストの算出元となった予測モード（その時点でモード記憶部１０４に記憶されている予測モード）の識別情報を準最適モード記憶部２０１に書き出してから、最終モード判定部１２３に制御を渡す。 Then, when the minimum cost determination unit 117α writes the RD cost to the quasi-minimum cost storage unit 200, the prediction mode that is the calculation source of the RD cost (stored in the mode storage unit 104 at that time) After the identification information of (prediction mode) is written in the suboptimal mode storage unit 201, control is passed to the final mode determination unit 123.

そして、最小コスト判定部１１７αが重みなし最小コスト記憶部１１９にＲ−Ｄコストを書き出すときに、そのＲ−Ｄコストの算出元となった予測モード（その時点でモード記憶部１０４に記憶されている予測モード）の識別情報を重みなし最適モード記憶部１２２に書き出してから、最終モード判定部１２３に制御を渡す。 When the minimum cost determination unit 117α writes the RD cost to the unweighted minimum cost storage unit 119, the prediction mode that is the calculation source of the RD cost (stored in the mode storage unit 104 at that time) Is written to the optimum mode storage unit 122 without weight, and then the control is passed to the final mode determination unit 123.

〔最終モード判定部１２３〕
最終モード判定部１２３は、最小コスト判定部１１７αやモード更新部１２０αから制御が渡されると、モード設定部１２４が全ての予測モードの設定を終了したのか否かを判断して、全ての予測モードの設定を終了したことを判断するときには、最適モード出力部１２５αに対して最適な予測モードの出力を指示し、全ての予測モードの設定を終了していないことを判断するときには、モード設定部１２４に対して次の予測モードの設定を指示する。 [Final mode determination unit 123]
When the control is passed from the minimum cost determination unit 117α or the mode update unit 120α, the final mode determination unit 123 determines whether or not the mode setting unit 124 has finished setting all prediction modes, and determines all prediction modes. When determining that the setting of the prediction mode has been completed, the optimal mode output unit 125α is instructed to output the optimal prediction mode. When determining that all the prediction modes have not been set, the mode setting unit 124 Is instructed to set the next prediction mode.

〔最適モード出力部１２５α〕
最適モード出力部１２５αは、最終モード判定部１２３から最適モードの出力指示があると、最適モード記憶部１２１に記憶される予測モードがスキップモードであるのか否かの判定を行い、スキップモードでない場合には、最適モード記憶部１２１に記憶される予測モードを符号化対象ブロックの最適な予測モードとして出力する。 [Optimum mode output unit 125α]
When there is an instruction to output the optimal mode from the final mode determination unit 123, the optimal mode output unit 125α determines whether or not the prediction mode stored in the optimal mode storage unit 121 is the skip mode. In this case, the prediction mode stored in the optimum mode storage unit 121 is output as the optimum prediction mode of the encoding target block.

一方、最適モード記憶部１２１に記憶される予測モードがスキップモードで、重みなし最適モード記憶部１２２に記憶される予測モードがスキップモードでない場合には、変位量記憶部１００に記憶される推定変位量と、予測ベクトル記憶部１０２に記憶される予測ベクトルとを読み出して、その推定変位量とその予測ベクトルとの乖離度を算出するとともに、変移推定量の大きさを算出して、その乖離度が閾値よりも大きく、かつ、その変移推定量の大きさが閾値よりも大きいのか否かの判定を行う。そして、その判定結果に基づいて、推定変位量と予測ベクトルとの乖離度が大きく、かつ、変移推定量が大きいという条件が成立する場合には、準最適モード記憶部２０１に記憶される予測モードを符号化対象ブロックの最適な予測モードとして出力し、その条件が成立しない場合には、最適モード記憶部１２１に記憶されるスキップモードを符号化対象ブロックの最適な予測モードとして出力する。 On the other hand, when the prediction mode stored in the optimal mode storage unit 121 is the skip mode and the prediction mode stored in the unweighted optimal mode storage unit 122 is not the skip mode, the estimated displacement stored in the displacement amount storage unit 100 The amount and the prediction vector stored in the prediction vector storage unit 102 are read out, and the degree of deviation between the estimated displacement amount and the prediction vector is calculated, and the magnitude of the displacement estimation amount is calculated, and the degree of deviation is calculated. Is larger than a threshold value, and it is determined whether or not the magnitude of the transition estimation amount is larger than the threshold value. Then, based on the determination result, when the condition that the deviation between the estimated displacement amount and the prediction vector is large and the displacement estimation amount is large is satisfied, the prediction mode stored in the suboptimal mode storage unit 201 Is output as the optimal prediction mode of the encoding target block, and when the condition is not satisfied, the skip mode stored in the optimal mode storage unit 121 is output as the optimal prediction mode of the encoding target block.

この図６に示す構成に従って、予測モード決定部１１は、図５のフローチャートの処理を実行することで、符号化対象マクロブロックについて、重み付きの歪み量を使って求められたＲ−Ｄコスト最小の予測モードがスキップモードであっても、スキップモードで符号化したのでは大きな画質劣化を含んでいる可能性が高いことを判断する場合には、その次に小さなＲ−Ｄコストの予測モードを最適な予測モードとして決定するように処理するのである。 In accordance with the configuration shown in FIG. 6, the prediction mode determination unit 11 executes the processing of the flowchart in FIG. 5 to minimize the RD cost obtained using the weighted distortion amount for the encoding target macroblock. Even if the prediction mode is the skip mode, when it is determined that there is a high possibility that the image is coded in the skip mode, the image quality is likely to contain a large deterioration in image quality. Processing is performed so as to determine the optimum prediction mode.

〔３〕本発明の有効性を検証するために行った実験の実験結果について
本発明の有効性を検証するために、本発明をＨ．２６４の参照ソフトウェアＪＳＭＶ（version 8.0.1)に実装し、デフォルトのＪＳＭＶとの比較実験を行った。 [3] Experimental results of experiments conducted to verify the effectiveness of the present invention In order to verify the effectiveness of the present invention, It was implemented in H.264 reference software JSMV (version 8.0.1) and compared with the default JSMV.

この実験で用いた符号化対象のシーケンスは、サイズ３５２×２８８[pixles]の“Mobile＆Calender", "City”である。また、いずれのシーケンスもフレームレート３０[fps] である。ＧＯＰ構造はＩ，Ｐピクチャからなり、Ｉピクチャの挿入間隔を１５フレームとした。量子化パラメータはＩ，Ｐピクチャいずれに対してもＱＰ＝２８とした。 The encoding target sequence used in this experiment is “Mobile & Calender”, “City” of size 352 × 288 [pixles]. Each sequence has a frame rate of 30 [fps]. The GOP structure consists of I and P pictures, and the I picture insertion interval is 15 frames. The quantization parameter is QP = 28 for both I and P pictures.

この実験により得られた符号量の比較結果を下記の表に示す。 The following table shows the comparison results of the code amounts obtained by this experiment.

この実験結果から、本発明は、ＪＳＭＶに対して１．１５〜４．１４％の符号量低減を実現していることを確認できた。これにより本発明の有効性を検証することができた。なお、この実験で得た両手法の復号画像には、主観的な画質の差が認められないことを確認している。 From this experimental result, it has been confirmed that the present invention realizes a code amount reduction of 1.15 to 4.14% with respect to JSMV. Thus, the effectiveness of the present invention could be verified. Note that it has been confirmed that there is no subjective difference in image quality between the decoded images of both methods obtained in this experiment.

本発明は、動きベクトルとして予測ベクトルを用いることを指示するスキップモードを持つ動画像符号化に適用できるものであり、本発明を適用することで、スキップモードにおける画質劣化を回避しつつ、スキップモードの符号量削減のメリットを最大限享受することができるようになる。 The present invention can be applied to moving picture coding having a skip mode instructing to use a prediction vector as a motion vector. By applying the present invention, skipping image quality deterioration in the skip mode can be avoided. It is possible to enjoy the maximum benefit of reducing the amount of codes.

本発明を具備する映像符号化装置の装置構成図である。It is an apparatus block diagram of the video coding apparatus which comprises this invention. 符号化パラメータ決定部の実行するフローチャートである。It is a flowchart which an encoding parameter determination part performs. 予測モード決定部の実行するフローチャートである。It is a flowchart which a prediction mode determination part performs. 予測モード決定部の装置構成図である。It is an apparatus block diagram of a prediction mode determination part. 予測モード決定部の実行するフローチャートである。It is a flowchart which a prediction mode determination part performs. 予測モード決定部の装置構成図である。It is an apparatus block diagram of a prediction mode determination part.

Explanation of symbols

１映像符号化装置
１０符号化パラメータ決定部
１１予測モード決定部
１２推定変位量算出部
２０符号化部
１００変位量記憶部
１０１予測ベクトル算出部
１０２予測ベクトル記憶部
１０３初期モード設定部
１０４モード記憶部
１０５符号量算出部
１０６符号量記憶部
１０７重み付き歪み量算出部
１０８重み付き歪み量記憶部
１０９重みなし歪み量算出部
１１０重みなし歪み量記憶部
１１１未定乗数算出部
１１２未定乗数記憶部
１１３コスト算出部
１１４コスト記憶部
１１５重みなしコスト算出部
１１６重みなしコスト記憶部
１１７最小コスト判定部
１１８最小コスト記憶部
１１９重みなし最小コスト記憶部
１２０モード更新部
１２１最適モード記憶部
１２２重みなし最適モード記憶部
１２３最終モード判定部
１２４モード設定部
１２５最適モード出力部
２００準最小コスト記憶部
２０１準最適モード記憶部 DESCRIPTION OF SYMBOLS 1 Video coding apparatus 10 Encoding parameter determination part 11 Prediction mode determination part 12 Estimated displacement amount calculation part 20 Encoding part 100 Displacement amount memory | storage part 101 Prediction vector calculation part 102 Prediction vector memory | storage part 103 Initial mode setting part 104 Mode storage part 105 Code amount calculation unit 106 Code amount storage unit 107 Weighted distortion amount calculation unit 108 Weighted distortion amount storage unit 109 Unweighted distortion amount calculation unit 110 Unweighted distortion amount storage unit 111 Undetermined multiplier calculation unit 112 Undetermined multiplier storage unit 113 Cost Calculation unit 114 Cost storage unit 115 Weightless cost calculation unit 116 Weightless cost storage unit 117 Minimum cost determination unit 118 Minimum cost storage unit 119 Weightless minimum cost storage unit 120 Mode update unit 121 Optimal mode storage unit 122 Weightless optimal mode storage Part 123 Final mode Tough 124 mode setting unit 125 optimum mode output unit 200 sub-minimal cost storage unit 201 suboptimal mode storage unit

Claims

An encoding parameter determination method for determining an encoding parameter used for image encoding for performing information compression by transform encoding and quantization for a prediction error signal obtained by intra-frame prediction and inter-frame prediction,
For a block to be encoded, a process of calculating an estimated displacement amount indicating temporal movement of an image signal prior to the encoding process;
A block whose coding mode is the skip prediction mode selected by the cost calculation using the weighted distortion amount using the sensitivity coefficient indicating the spatio-temporal visual sensitivity is used as a determination target, and the block is encoded. A process of determining whether or not a block is likely to have a high image quality degradation when encoded in the skip mode, based on the displacement amount used at the time and the estimated displacement amount;
For the block to be determined that is determined not to be the corresponding block, the skip mode is determined as the optimum prediction mode of the block, and for the block to be determined that is determined to be the corresponding block, the weighting is determined. Determining a least cost prediction mode selected by calculating a cost using an undistorted amount as an optimal prediction mode of the block,
A characteristic encoding parameter determination method.

An encoding parameter determination method for determining an encoding parameter used for image encoding for performing information compression by transform encoding and quantization for a prediction error signal obtained by intra-frame prediction and inter-frame prediction,
For a block to be encoded, a process of calculating an estimated displacement amount indicating temporal movement of an image signal prior to the encoding process;
A block whose coding mode is the skip prediction mode selected by the cost calculation using the weighted distortion amount using the sensitivity coefficient indicating the spatio-temporal visual sensitivity is used as a determination target, and the block is encoded. A process of determining whether or not a block is likely to have a high image quality degradation when encoded in the skip mode, based on the displacement amount used at the time and the estimated displacement amount;
For the block to be determined that is not the corresponding block, the skip mode is determined as the optimum prediction mode of the block, and the cost minimum is determined for the block to be determined that the block is determined to be the corresponding block. Determining a prediction mode selected as the next lowest cost mode as the optimal prediction mode of the block,
A characteristic encoding parameter determination method.

The encoding parameter determination method according to claim 1 or 2,
In the determining process, the displacement amount is compared with the estimated displacement amount, and the difference between the two displacement amounts is larger than a prescribed threshold value, and the estimated displacement amount is larger than the prescribed threshold value. In this case, it is determined that the block is likely to have a large image quality deterioration when encoded in the skip mode.
A characteristic encoding parameter determination method.

In the encoding parameter determination method according to any one of claims 1 to 3,
For a block whose prediction mode with the lowest cost selected by calculating the cost using the weighted distortion amount is not the skip mode, the step of determining the prediction mode as the optimum prediction mode of the block,
A characteristic encoding parameter determination method.

The encoding parameter determination method according to any one of claims 1 to 4,
In the case where the prediction mode with the minimum cost selected by calculating the cost using the unweighted distortion amount is also the skip mode for the determination target block, the skip mode is performed without performing the determination process. Comprising determining the optimal prediction mode for the block,
A characteristic encoding parameter determination method.

An encoding parameter determination device that determines an encoding parameter used for image encoding for performing information compression by transform encoding and quantization for a prediction error signal obtained by intra-frame prediction and inter-frame prediction,
For an encoding target block, prior to encoding processing, means for calculating an estimated displacement amount indicating temporal movement of an image signal;
A block whose coding mode is the skip prediction mode selected by the cost calculation using the weighted distortion amount using the sensitivity coefficient indicating the spatio-temporal visual sensitivity is used as a determination target, and the block is encoded. Means for determining whether or not the block corresponds to a block that is highly likely to deteriorate image quality when encoded in the skip mode, based on the displacement amount used at the time and the estimated displacement amount;
For the block to be determined that is determined not to be the corresponding block, the skip mode is determined as the optimum prediction mode of the block, and for the block to be determined that is determined to be the corresponding block, the weighting is determined. Means for determining a prediction mode with the lowest cost selected by calculating a cost using an undistorted amount as an optimal prediction mode of the block,
An encoding parameter determination device as a feature.

An encoding parameter determination device that determines an encoding parameter used for image encoding for performing information compression by transform encoding and quantization for a prediction error signal obtained by intra-frame prediction and inter-frame prediction,
For an encoding target block, prior to encoding processing, means for calculating an estimated displacement amount indicating temporal movement of an image signal;
A block whose coding mode is the skip prediction mode selected by the cost calculation using the weighted distortion amount using the sensitivity coefficient indicating the spatio-temporal visual sensitivity is used as a determination target, and the block is encoded. Means for determining whether or not the block corresponds to a block that is highly likely to deteriorate image quality when encoded in the skip mode, based on the displacement amount used at the time and the estimated displacement amount;
For the block to be determined that is not the corresponding block, the skip mode is determined as the optimum prediction mode of the block, and the cost minimum is determined for the block to be determined that the block is determined to be the corresponding block. Determining a prediction mode selected as the next lowest cost mode of the skip mode as an optimal prediction mode of the block,
An encoding parameter determination device as a feature.

The encoding parameter determination apparatus according to claim 6 or 7,
The means for determining compares the displacement amount with the estimated displacement amount, and the difference between the two displacement amounts is greater than a prescribed threshold value, and the magnitude of the estimated displacement amount is greater than a prescribed threshold value. In this case, it is determined that the block is likely to have a large image quality deterioration when encoded in the skip mode.
An encoding parameter determination device as a feature.

In the encoding parameter determination device according to any one of claims 6 to 8,
For a block in which the prediction mode with the lowest cost selected by calculating the cost using the weighted distortion amount is not the skip mode, means for determining the prediction mode as the optimum prediction mode of the block,
An encoding parameter determination device as a feature.

The encoding parameter determination device according to any one of claims 6 to 9,
In the case where the prediction mode with the minimum cost selected by calculating the cost using the unweighted distortion amount is also the skip mode for the determination target block, the skip mode is performed without performing the determination process. Comprising means for determining as the optimal prediction mode for the block,
An encoding parameter determination device as a feature.

An encoding parameter determination program for causing a computer to execute processing used to realize the encoding parameter determination method according to any one of claims 1 to 5.

A computer-readable recording medium on which an encoding parameter determination program for causing a computer to execute processing used to realize the encoding parameter determination method according to any one of claims 1 to 5 is recorded.