JP2009225163A

JP2009225163A - Video encoder and video encoding method

Info

Publication number: JP2009225163A
Application number: JP2008068070A
Authority: JP
Inventors: Hiromitsu Nakayama; 啓満中山; Shunichi Ishiwatari; 俊一石渡; Shuo Nomura; 周央野村; Takaya Ogawa; 貴也小川
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2008-03-17
Filing date: 2008-03-17
Publication date: 2009-10-01

Abstract

PROBLEM TO BE SOLVED: To curtail a computational complexity required to decide a prediction mode of encoding. SOLUTION: This video encoder comprises: an intra-prediction image generating part 41 for searching a prediction image in an intra-prediction for decision based on an encode cost; an inter-prediction image generating part 42 for layering search processes of the prediction image in an inter-prediction in response to search accuracy, successively switching the search accuracy from the low accuracy to the high accuracy for each layer, and searching and determining the prediction image in the inter-prediction for decision based on the encode cost for each layer; and a comparing part 46 for comparing the encode cost of the prediction image decided by the intra-prediction image generating part with the encode cost of the prediction image decided by the inter-prediction image generating part for each layer, deciding either prediction mode of the intra-prediction or inter-prediction for each layer based on a comparison result or reserving the decision, and stopping the search processes of the subsequent layers in the inter-prediction image generating part when the prediction mode is decided to a mode for intra-prediction. COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、予測符号化及び算術符号化を採用した動画像符号化装置及び動画像符号化方法に関する。 The present invention relates to a moving picture coding apparatus and a moving picture coding method that employ predictive coding and arithmetic coding.

近年、画像処理のディジタル化が普及している。ディジタル化された画像データは、符号量を削減するために、Ｈ．２６４、ＡＶＣ規格等の動画像符号化方法を採用して符号化後に伝送や記録が行われることが多い。 In recent years, digitization of image processing has become widespread. In order to reduce the amount of codes, digitized image data is recorded in H.264 format. In many cases, transmission and recording are performed after encoding using a moving image encoding method such as H.264 and AVC standards.

Ｈ．２６４規格等においては、複数の予測モードで複数のブロック形状の予測符号化を採用しており、画像ブロック単位に多数の予測モードの中から最適なモードを選択して、予測符号化を行う。最適な予測モードは、画像ブロックの性質毎に相違し、適切なモードを選択することで、符号化効率を向上させることができる。 H. In the H.264 standard and the like, predictive encoding of a plurality of block shapes is employed in a plurality of prediction modes, and an optimal mode is selected from a large number of prediction modes for each image block, and predictive encoding is performed. The optimum prediction mode differs depending on the property of the image block, and the encoding efficiency can be improved by selecting an appropriate mode.

複数の予測モードから最適なモードを選択する手法として、ラグランジュの未定乗数法に基づくレート−歪み最適化手法が知られている。このレート−歪み最適化手法では、各予測モードで符号化した際の符号化歪みＤ（現画像と再生画像の２乗誤差）と発生符号量とから、符号化コストを算出する。画像符号化における符号化効率は、いかに少ない符号量で、いかに画質劣化の少ない符号化ができるかということを意味している。つまり符号化コストが小さいほど符号化効率が高くなる。従って、与えられた量子化パラメータに対して、符号化コストが最小となる予測モードを画像ブロック毎に選択することで、符号化効率を最大にすることができる。 A rate-distortion optimization method based on Lagrange's undetermined multiplier method is known as a method for selecting an optimum mode from a plurality of prediction modes. In this rate-distortion optimization method, the encoding cost is calculated from the encoding distortion D (the square error between the current image and the reproduced image) and the generated code amount when encoding is performed in each prediction mode. The encoding efficiency in image encoding means how much encoding can be performed with a small amount of code and with little deterioration in image quality. That is, the encoding efficiency increases as the encoding cost decreases. Therefore, the encoding efficiency can be maximized by selecting, for each image block, a prediction mode that minimizes the encoding cost for a given quantization parameter.

ところで、Ｈ．２６４では、画面内予測によって符号量を削減するイントラ予測と、画面間予測によって符号量を削減するインター予測とが行われる。イントラ予測及びインター予測は、種々のモードを含んでおり、最小の符号化コストのモードを決定するための計算量は極めて大きい。特に、Ｈ．２６４におけるインター予測においては、参照画像の探索精度として１／４画素精度の予測モードを選択することが可能であり、符号化コスト算出のためには膨大な計算が必要である。 H. In H.264, intra prediction for reducing the code amount by intra prediction and inter prediction for reducing the code amount by inter prediction are performed. Intra prediction and inter prediction include various modes, and the amount of calculation for determining the mode with the minimum coding cost is extremely large. In particular, H.C. In the inter prediction in H.264, it is possible to select a prediction mode of 1/4 pixel accuracy as the reference image search accuracy, and enormous calculation is necessary for calculating the coding cost.

そこで、従来、モードの決定に必要な符号化コストの算出のための計算量を削減する技術が採用されている。例えば、インター予測において、各探索精度の演算を階層化する技術がある。この技術では、最も低い探索精度、つまり２画素精度において所定の探索範囲を設定して参照画像ブロックの候補を求める。そして、求めた候補の参照画像ブロックを中心にして、比較的狭い探索範囲を新たに設定し、この探索範囲で次の探索精度つまり、１画素精度の探索を行う。以後同様にして、各探索精度で求めた候補のブロックを中心にして、次の探索精度の探索範囲を設定するのである。探索範囲を狭くすることができるので、参照画像ブロックの候補の探索に必要な計算量を低減することが可能である。こうして、最終的には、１／４画素精度で参照画像ブロックの候補を求めて、符号化コストを算出する。こうして得たインター予測における最小符号化コストと、イントラ予測における最小符号化コストとを比較して、符号化モードを決定するようになっている。 Therefore, conventionally, a technique for reducing the amount of calculation for calculating the coding cost necessary for mode determination has been adopted. For example, in inter prediction, there is a technique for hierarchizing each search accuracy calculation. In this technique, a reference image block candidate is obtained by setting a predetermined search range with the lowest search accuracy, that is, two-pixel accuracy. Then, a relatively narrow search range is newly set centering on the obtained candidate reference image block, and the next search accuracy, that is, a one-pixel accuracy search is performed within this search range. Thereafter, similarly, the search range of the next search accuracy is set around the candidate block obtained with each search accuracy. Since the search range can be narrowed, it is possible to reduce the amount of calculation required for searching for reference image block candidates. Thus, finally, a candidate for a reference image block is obtained with 1/4 pixel accuracy, and the encoding cost is calculated. The encoding mode is determined by comparing the minimum encoding cost in the inter prediction thus obtained and the minimum encoding cost in the intra prediction.

しかしながら、例え階層化したとしても、高い探索精度、例えば１／４画素精度の探索には、膨大な計算量が必要であり、全体として計算量を十分に削減することはできない。 However, even if hierarchization is performed, a search with high search accuracy, for example, 1/4 pixel accuracy, requires a huge amount of calculation, and the calculation amount as a whole cannot be reduced sufficiently.

また、特許文献１においては、符号化対象画像と参照画像とから計算された類似度に基づいてインター予測及びイントラ予測のいずれかを選択する技術が開示されている。しかしながら、特許文献１の提案では、類似度のみによってインター予測とイントラ予測との切り換えを行っており、最適な予測モードの判定に誤りが生じる可能性がある。また、インター予測を選択した場合には、高精度の探索処理が行われるので、計算量を低減することはできないという問題があった。
特開２００６−２０２１７号公報 Patent Document 1 discloses a technique for selecting either inter prediction or intra prediction based on the similarity calculated from an encoding target image and a reference image. However, in the proposal of Patent Document 1, switching between inter prediction and intra prediction is performed based only on similarity, and there is a possibility that an error occurs in the determination of the optimal prediction mode. In addition, when inter prediction is selected, there is a problem that the calculation amount cannot be reduced because high-precision search processing is performed.
JP 2006-20217 A

本発明は、最適な予測モードを少ない演算量で決定して高効率の符号化を行うことができる動画像符号化装置及び動画像符号化方法を提供することを目的とする。 An object of the present invention is to provide a moving picture coding apparatus and a moving picture coding method capable of determining an optimum prediction mode with a small amount of calculation and performing highly efficient coding.

本発明の一態様に係る動画像符号化装置は、符号化コストに基づいてイントラ予測における予測画像を探索して決定するイントラ予測画像生成部と、インター予測における予測画像の探索処理を探索精度に応じて階層化し、探索精度を低い精度から高い精度に階層毎に順次切り換えて、各階層毎に符号化コストに基づいてインター予測における予測画像を探索して決定するインター予測画像生成部と、前記イントラ予測画像生成部が決定した予測画像の符号化コストと前記インター予測画像生成部が決定した予測画像の符号化コストとを前記階層毎に比較し、比較結果に基づいて前記階層毎にイントラ予測とインター予測とのいずれかの予測モードを決定し又は決定を保留し、予測モードをイントラ予測のモードに決定した場合には、前記インター予測画像生成部における以後の階層の探索処理を停止させる比較部とを具備したことを特徴とするものであり、
本発明の一態様に係る動画像符号化方法は、符号化コストに基づいてイントラ予測における予測画像を探索して決定し、インター予測における予測画像の探索処理を探索精度に応じて階層化し、探索精度を低い精度から高い精度に階層毎に順次切り換えて、各階層毎に符号化コストに基づいてインター予測における予測画像を探索して決定し、前記イントラ予測における予測画像の符号化コストと前記インター予測における予測画像の符号化コストとを前記階層毎に比較し、比較結果に基づいて前記階層毎にイントラ予測とインター予測とのいずれかの予測モードを決定し又は決定を保留し、予測モードをイントラ予測のモードに決定した場合には、前記インター予測における以後の探索処理を停止させることを特徴とする。 A video encoding apparatus according to an aspect of the present invention provides an intra-predicted image generation unit that searches for and determines a predicted image in intra prediction based on a coding cost, and search processing for a predicted image in inter prediction with search accuracy. The inter prediction image generation unit for sequentially searching for and determining the prediction image in the inter prediction based on the coding cost for each layer, by layering according to the layer, sequentially switching the search accuracy from low accuracy to high accuracy for each layer, and The encoding cost of the prediction image determined by the intra prediction image generation unit and the encoding cost of the prediction image determined by the inter prediction image generation unit are compared for each layer, and intra prediction is performed for each layer based on the comparison result. If the prediction mode is determined as the intra prediction mode or the prediction mode is determined as the intra prediction mode or the inter prediction mode is determined. Which is characterized by comprising a comparator unit stopping the search process following layer in the terpolymer predicted image generation unit,
A moving image encoding method according to an aspect of the present invention searches for and determines a prediction image in intra prediction based on an encoding cost, hierarchizes search processing for a prediction image in inter prediction according to search accuracy, and performs search The accuracy is sequentially switched from low accuracy to high accuracy for each layer, and the prediction image in the inter prediction is searched and determined based on the encoding cost for each layer, and the encoding cost of the prediction image in the intra prediction and the inter prediction are determined. The coding cost of the prediction image in the prediction is compared for each layer, and the prediction mode of either intra prediction or inter prediction is determined for each layer based on the comparison result, or the determination is suspended, and the prediction mode is set. When the intra prediction mode is determined, the subsequent search process in the inter prediction is stopped.

本発明によれば、最適な予測モードを少ない演算量で決定して高効率の符号化を行うことができるという効果を有する。 According to the present invention, it is possible to determine an optimal prediction mode with a small amount of calculation and perform highly efficient encoding.

以下、図面を参照して本発明の実施の形態について詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は本発明の実施の形態に係る動画像符号化装置に採用される予測信号生成部を示すブロック図である。また、図２は動画像符号化装置の全体構成を示すブロック図である。 FIG. 1 is a block diagram showing a prediction signal generation unit employed in a video encoding apparatus according to an embodiment of the present invention. FIG. 2 is a block diagram showing the overall configuration of the moving picture coding apparatus.

図２において、符号化部１１には画像信号が入力される。この画像信号は符号化部１１の減算器１２及び予測信号生成部２３に供給される。減算器１２は、後述するＭＢ予測モード選択部２４から予測画像信号が与えられており、入力された画像信号と予測画像信号との差分値（予測誤差）を出力する。なお、予測誤差は、予測モードに応じたサイズのブロック単位で求められる。 In FIG. 2, an image signal is input to the encoding unit 11. This image signal is supplied to the subtracter 12 and the prediction signal generation unit 23 of the encoding unit 11. The subtracter 12 receives a predicted image signal from an MB prediction mode selection unit 24 described later, and outputs a difference value (prediction error) between the input image signal and the predicted image signal. The prediction error is obtained in units of blocks having a size corresponding to the prediction mode.

減算器１２からの予測誤差は、直交変換部１３に与えられる。直交変換部１３は、入力された予測誤差を直交変換して、ブロック単位で空間周波数成分に変換する。これにより空間的な相関成分が削減可能となる。直交変換部１３からの変換係数は量子化部１４に与えられる。量子化部１４は、入力された変換係数を量子化することにより、ブロックの信号の冗長度を削減して出力する。 The prediction error from the subtracter 12 is given to the orthogonal transform unit 13. The orthogonal transform unit 13 performs orthogonal transform on the input prediction error and transforms it into a spatial frequency component in units of blocks. Thereby, spatial correlation components can be reduced. The transform coefficient from the orthogonal transform unit 13 is given to the quantization unit 14. The quantization unit 14 quantizes the input transform coefficient, thereby reducing the redundancy of the signal of the block and outputting it.

量子化部１４からの量子化出力は、ＣＡＢＡＣ符号化部１５に供給される。ＣＡＢＡＣ符号化部１５は、２値化部１６及び算術符号化部１７によって構成されている。２値化部１６は、量子化出力であるシンタックス要素が入力されると共に、ＭＢ予測モード選択部２４から予測モード情報も入力され、シンタックス要素を可変長の２値のシンボル列に変換する。算術符号化部１７は、２値化部１６からの２値シンボル列を、シンボルの発生確率に応じて算術符号化する。こうして、ＣＡＢＡＣ符号化部１５は、符号量を一層削減した符号化出力を出力バッファ１８に出力するようになっている。 The quantized output from the quantizing unit 14 is supplied to the CABAC encoding unit 15. The CABAC encoding unit 15 includes a binarizing unit 16 and an arithmetic encoding unit 17. The binarization unit 16 receives a syntax element as a quantization output and also receives prediction mode information from the MB prediction mode selection unit 24, and converts the syntax element into a binary symbol string of variable length. . The arithmetic encoding unit 17 arithmetically encodes the binary symbol sequence from the binarizing unit 16 according to the occurrence probability of the symbol. In this way, the CABAC encoding unit 15 outputs the encoded output with the code amount further reduced to the output buffer 18.

また、ＭＢ予測モード選択部２４から出力される量子化係数等の予測方法に関する情報は、可変長符号化部２５に与えられる。可変長符号化部２５は入力された情報を可変長符号化して出力バッファ１８に出力する。出力バッファ１８はＣＡＢＡＣ符号化部１５の出力及び可変長符号化部２５の出力を、所定のビットレートで符号化データとして出力する。 In addition, information regarding a prediction method such as a quantization coefficient output from the MB prediction mode selection unit 24 is given to the variable length encoding unit 25. The variable length coding unit 25 performs variable length coding on the input information and outputs it to the output buffer 18. The output buffer 18 outputs the output of the CABAC encoder 15 and the output of the variable length encoder 25 as encoded data at a predetermined bit rate.

符号化制御部３１は、発生符号量のフィードバック制御及び量子化特性制御等によって発生符号量を制御するレート制御、予測モードの決定処理の制御の他、符号化処理全体の制御等を行う。 The encoding control unit 31 performs control of the entire encoding process in addition to rate control for controlling the generated code amount by feedback control of generated code amount, quantization characteristic control, and the like, control of prediction mode determination processing, and the like.

量子化部１４の出力は、予測画像信号を生成するために逆量子化部１９にも与えられる。逆量子化部１９、逆直交変換部２０、加算器２１及び参照画像メモリ２２によって局部復号器が構成される。逆量子化部１９は入力された量子化出力を逆量子化して、量子化前のデータ（変換係数）に戻して逆直交変換部２０に出力する。逆直交変換部２０は入力された変換係数を逆直交変換することにより、直交変換前の予測誤差を得る。この予測誤差は加算器２１に与えられる。 The output of the quantization unit 14 is also supplied to the inverse quantization unit 19 in order to generate a predicted image signal. The inverse quantization unit 19, the inverse orthogonal transform unit 20, the adder 21 and the reference image memory 22 constitute a local decoder. The inverse quantization unit 19 inversely quantizes the input quantized output, returns the data before the quantization (transform coefficient), and outputs it to the inverse orthogonal transform unit 20. The inverse orthogonal transform unit 20 obtains a prediction error before orthogonal transform by performing inverse orthogonal transform on the input transform coefficient. This prediction error is given to the adder 21.

加算器２１はＭＢ予測モード選択部２４から予測画像信号も与えられており、予測誤差と予測画像信号との加算によって、入力画像を復元する。加算器２１からの復元画像は、参照画像として参照画像メモリ２２に与えられる。 The adder 21 is also supplied with the prediction image signal from the MB prediction mode selection unit 24, and restores the input image by adding the prediction error and the prediction image signal. The restored image from the adder 21 is given to the reference image memory 22 as a reference image.

予測信号生成部２３には入力画像信号も入力されている。予測信号生成部２３は、後述するように、各予測モードに応じた予測画像信号を生成してＭＢ予測モード選択部２４に出力する共に、いずれの予測モードに決定するかを指示するためのモード指示信号を発生してＭＢ予測モード選択部２４に出力するようになっている。ＭＢ予測モード選択部２４は、予測信号生成部２３から、各予測モード毎の予測画像信号及びモード指示信号が与えられる。ＭＢ予測モード選択部２４は、予測モードを示すモード指示信号に基づいて、予測画像信号を選択して、減算器１２及び加算器２１に出力するようになっている。 An input image signal is also input to the prediction signal generation unit 23. As will be described later, the prediction signal generation unit 23 generates a prediction image signal corresponding to each prediction mode, outputs the prediction image signal to the MB prediction mode selection unit 24, and instructs a prediction mode to be determined. An instruction signal is generated and output to the MB prediction mode selection unit 24. The MB prediction mode selection unit 24 is provided with a prediction image signal and a mode instruction signal for each prediction mode from the prediction signal generation unit 23. The MB prediction mode selection unit 24 selects a prediction image signal based on the mode instruction signal indicating the prediction mode, and outputs the prediction image signal to the subtracter 12 and the adder 21.

図１は図２中の予測信号生成部２３の具体的な構成を示している。 FIG. 1 shows a specific configuration of the prediction signal generation unit 23 in FIG.

予測信号生成部２３は、イントラ予測及びインター予測に基づく予測画像信号を生成することができる。また、予測信号生成部２３は、インター予測においては複数の画素精度、例えば、２画素精度、１画素精度、１／２画素精度及び１／４画素精度で予測画像信号を求めることができる。本実施の形態においては、後述するように、予測信号生成部２３は、予測モードの決定に要する計算量を削減することが可能である。 The prediction signal generation unit 23 can generate a prediction image signal based on intra prediction and inter prediction. In addition, in the inter prediction, the prediction signal generation unit 23 can obtain a prediction image signal with a plurality of pixel accuracy, for example, two pixel accuracy, one pixel accuracy, ½ pixel accuracy, and ¼ pixel accuracy. In the present embodiment, as will be described later, the prediction signal generation unit 23 can reduce the amount of calculation required for determining the prediction mode.

イントラ予測画像生成部４１には、入力画像信号が供給される。イントラ予測画像生成部４１は、入力画像信号が与えられて、画面内の画像を用いて予測画像（イントラ予測画像）を生成する。例えば、イントラ予測画像生成部４１は、符号化の対象のブロックに隣接する画素を用いて、イントラ予測画像を生成する。イントラ予測画像生成部４１は、生成したイントラ予測画像を差分画像生成部４４に出力すると共に、ＭＢ予測モード選択部２４にも出力する。なお、最終的には、後述するように、イントラ予測画像生成部４１は、イントラ予測の各モードのうち最も小さい符号化コストが得られるモードのイントラ予測画像をＭＢ予測モード選択部２４に出力するようになっている。 An input image signal is supplied to the intra predicted image generation unit 41. The intra predicted image generation unit 41 is provided with an input image signal and generates a predicted image (intra predicted image) using an image in the screen. For example, the intra predicted image generation unit 41 generates an intra predicted image using pixels adjacent to the block to be encoded. The intra predicted image generation unit 41 outputs the generated intra predicted image to the difference image generation unit 44 and also outputs it to the MB prediction mode selection unit 24. Finally, as will be described later, the intra-predicted image generation unit 41 outputs an intra-predicted image of a mode in which the lowest coding cost is obtained among the intra-prediction modes to the MB prediction mode selection unit 24. It is like that.

インター予測画像生成部４２は、参照画像信号が与えられ、他の画面の画像（参照画像）を用いて予測画像（インター予測画像）を生成する。この場合において、インター予測画像生成部４２は、符号化コストが最も小さくなるように、インター予測画像を生成する。 The inter prediction image generation unit 42 is given a reference image signal, and generates a prediction image (inter prediction image) using an image (reference image) of another screen. In this case, the inter prediction image generation unit 42 generates an inter prediction image so that the encoding cost is minimized.

本実施の形態においては、インター予測画像生成部４２は、後述する比較部４６に制御されて、インター予測画像の生成のための探索処理を階層化する。即ち、インター予測画像生成部４２は、最も低い探索精度から高い探索精度に向かって探索精度を階層毎に切換えて順次探索を行う。なお、本実施の形態をＨ．２６４に適用した場合には、探索精度としては、２画素精度、１画素精度、１／２画素精度及び１／４画素精度が採用される。 In the present embodiment, the inter-predicted image generation unit 42 is controlled by the comparison unit 46 described later, and stratifies search processing for generating an inter-predicted image. In other words, the inter predicted image generation unit 42 performs a search sequentially by switching the search accuracy for each hierarchy from the lowest search accuracy to the higher search accuracy. The present embodiment is referred to as H.264. When applied to H.264, 2-pixel accuracy, 1-pixel accuracy, 1 / 2-pixel accuracy, and 1 / 4-pixel accuracy are employed as search accuracy.

例えば、インター予測画像生成部４２は、階層毎に、参照画像中に探索範囲を設定し、設定した探索範囲内で、当該階層の探索精度に応じてブロック化位置を順次変化させながらマクロブロックを選択してインター予測画像として差分画像生成部４４に出力する。そして、インター予測画像生成部４２は、最終的には、後述する比較部４６から符号化コストについての情報が与えられて、当該階層において最も符号化コストが小さくなるインター予測画像をＭＢ予測モード選択部２４に出力する。 For example, the inter prediction image generation unit 42 sets a search range in the reference image for each layer, and within the set search range, the macro block is changed while sequentially changing the blocking position according to the search accuracy of the layer. It selects and outputs to the difference image generation part 44 as an inter estimated image. Then, the inter prediction image generation unit 42 finally receives information on the encoding cost from the comparison unit 46 described later, and selects an inter prediction image having the lowest encoding cost in the hierarchy as an MB prediction mode selection. To the unit 24.

差分画像生成部４４は、入力画像信号も入力され、イントラ予測画像信号と入力画像信号との差分を求めて、差分画像を符号化コスト計算部４５に出力する。また、差分画像生成部４４は、インター予測画像信号と入力画像信号との差分を求めて、差分画像を符号化コスト計算部４５に出力する。 The difference image generation unit 44 also receives an input image signal, obtains a difference between the intra predicted image signal and the input image signal, and outputs the difference image to the encoding cost calculation unit 45. Further, the difference image generation unit 44 obtains a difference between the inter predicted image signal and the input image signal, and outputs the difference image to the encoding cost calculation unit 45.

符号化コスト計算部４５は、差分画像生成部４４の出力に基づいて、符号化コストを算出する。即ち、符号化コスト計算部４５は、イントラ予測画像と入力画像との差分から、イントラ予測画像の符号化コストを算出する。また、符号化コスト計算部４５は、インター予測画像と入力画像との差分から、インター予測画像の符号化コストを算出する。 The encoding cost calculation unit 45 calculates the encoding cost based on the output of the difference image generation unit 44. That is, the encoding cost calculation unit 45 calculates the encoding cost of the intra predicted image from the difference between the intra predicted image and the input image. Also, the encoding cost calculation unit 45 calculates the encoding cost of the inter predicted image from the difference between the inter predicted image and the input image.

比較部４６は、メモリ４７を有しており、入力されるイントラ予測画像の符号化コスト及びインター予測画像の符号化コストをメモリ４７に記憶させながら比較を行う。即ち、比較部４６は、イントラ予測における各モードのイントラ予測画像毎に符号化コストを比較し、最も低い符号化コストが得られるモードをイントラ予測画像生成部４１に指示する。また、比較部４６は、インター予測の各階層におけるイントラ予測画像の符号化コストを比較し、各階層毎に、最も低い符号化コストが得られるインター予測画像（マクロブロック）をイントラ予測画像生成部４２に指示する。 The comparison unit 46 includes a memory 47, and performs comparison while storing the encoding cost of the input intra prediction image and the encoding cost of the inter prediction image in the memory 47. That is, the comparison unit 46 compares the encoding costs for each intra prediction image of each mode in the intra prediction, and instructs the intra prediction image generation unit 41 which mode can obtain the lowest encoding cost. In addition, the comparison unit 46 compares the encoding cost of the intra prediction image in each layer of the inter prediction, and the inter prediction image (macroblock) from which the lowest encoding cost is obtained for each layer is the intra prediction image generation unit. 42.

図３はインター予測における階層的な探索処理を説明するための説明図である。図３（ａ）は最も低い探索精度の探索処理を示し、図３（ｂ）は２番目に低い探索精度の探索処理を示し、図３（ｃ）は３番目に低い探索精度の探索処理を示している。 FIG. 3 is an explanatory diagram for explaining hierarchical search processing in inter prediction. 3A shows the search processing with the lowest search accuracy, FIG. 3B shows the search processing with the second lowest search accuracy, and FIG. 3C shows the search processing with the third lowest search accuracy. Show.

いま、図３の対象となるマクロブロック（処理ＭＢ）５２について、インター予測画像となるマクロブロック（以下、インター予測画像マクロブロックという）を求めるものとする。先ず、インター予測画像生成部４２は、最も低い探索精度で探索を行う。即ち、インター予測画像生成部４２は、入力画像の処理ＭＢ５２と同一位置の参照画像のマクロブロックを中心として、所定の探索範囲Ｒ１を設定する。インター予測画像生成部４２は、探索範囲Ｒ１内で探索精度に応じてブロック化位置を変化させながら、インター予測画像マクロブロックを順次選択する。 Assume that a macroblock (hereinafter referred to as an inter-predicted image macroblock) that is an inter-predicted image is obtained for the macroblock (processing MB) 52 in FIG. First, the inter prediction image generation unit 42 performs a search with the lowest search accuracy. In other words, the inter predicted image generation unit 42 sets a predetermined search range R1 around the macroblock of the reference image at the same position as the processing MB52 of the input image. The inter prediction image generation unit 42 sequentially selects inter prediction image macroblocks while changing the blocking position according to the search accuracy within the search range R1.

インター予測画像生成部４２からのインター予測画像は差分画像生成部４４に与えられて、入力画像との差分画像が求められる。符号化コスト計算部４５は、差分画像からインター予測画像の符号化コストを求める。符号化コストは比較部４６に順次供給される。比較部４６は、探索範囲Ｒ１内で、最小の符号化コストが得られるマクロブロックを候補Ｃ１（×印）とする。 The inter prediction image from the inter prediction image generation unit 42 is given to the difference image generation unit 44, and a difference image from the input image is obtained. The encoding cost calculation unit 45 obtains the encoding cost of the inter prediction image from the difference image. The encoding cost is sequentially supplied to the comparison unit 46. The comparison unit 46 sets a macroblock that provides the minimum coding cost within the search range R1 as a candidate C1 (x mark).

インター予測画像生成部４２は、比較部４６から２番目に低い探索精度での探索が指示されると、候補Ｃ１のマクロブロックを中心として、所定の探索範囲Ｒ２を設定する。インター予測画像生成部４２は、探索範囲Ｒ２内で、２番目に低い探索精度に応じてブロック化位置を変化させながら、順次インター予測画像マクロブロックを選択する。インター予測画像生成部４２からのインター予測画像の符号化コストは、比較部４６によって比較される。こうして、最も低い符号化コストが得られるマクロブロックが候補Ｃ２として求められる。 When the search with the second lowest search accuracy is instructed from the comparison unit 46, the inter predicted image generation unit 42 sets a predetermined search range R2 around the macroblock of the candidate C1. The inter prediction image generation unit 42 sequentially selects inter prediction image macroblocks while changing the blocking position according to the second lowest search accuracy within the search range R2. The encoding cost of the inter prediction image from the inter prediction image generation unit 42 is compared by the comparison unit 46. In this way, the macroblock that provides the lowest coding cost is obtained as the candidate C2.

更に、インター予測画像生成部４２は、比較部４６から３番目に低い探索精度での探索が指示されると、候補Ｃ２のマクロブロックを中心として、所定の探索範囲Ｒ３を設定する。インター予測画像生成部４２は、探索範囲Ｒ３内で３番目に低い探索精度に応じてブロック化位置を変化させながら、順次インター予測画像マクロブロックを選択する。インター予測画像生成部４２は、比較部４６から最も低い符号化コストが得られるマクロブロックが指定されて、当該マクロブロックを候補Ｃ３とする。 Further, when a search with the third lowest search accuracy is instructed by the comparison unit 46, the inter predicted image generation unit 42 sets a predetermined search range R3 centering on the macroblock of the candidate C2. The inter prediction image generation unit 42 sequentially selects inter prediction image macroblocks while changing the blocking position according to the third lowest search accuracy within the search range R3. The inter prediction image generation unit 42 is designated by the comparison unit 46 as a macroblock that provides the lowest coding cost, and sets the macroblock as a candidate C3.

以後、同様にして、インター予測画像生成部４２は、比較部４６の指示に応じて、低い探索精度から高い探索精度に切り換えながら階層的にインター予測画像の探索を行う。 Thereafter, similarly, the inter-predicted image generation unit 42 searches for inter-predicted images hierarchically while switching from low search accuracy to high search accuracy in accordance with an instruction from the comparison unit 46.

本実施の形態においては、比較部４６は、イントラ予測における符号化コストと、インター予測における符号化コストとを、インター予測の各階層毎に比較するようになっている。比較部４６は、各探索精度の探索毎に閾値を設定し、所定の探索精度の探索時に、イントラ予測画像の符号化コストがインター予測画像の符号化コストより所定の閾値を超えて小さい場合には、イントラ予測モードを設定するためのモード指示信号を出力する共に、イントラ予測画像生成部４１から当該イントラ予測画像信号をＭＢ予測モード選択部２４に出力させる。 In the present embodiment, the comparison unit 46 compares the encoding cost in intra prediction and the encoding cost in inter prediction for each layer of inter prediction. The comparison unit 46 sets a threshold value for each search with each search accuracy, and when searching with a predetermined search accuracy, the coding cost of the intra prediction image is smaller than the coding cost of the inter prediction image and is smaller than the predetermined cost. Outputs a mode instruction signal for setting the intra prediction mode, and causes the MB prediction mode selection unit 24 to output the intra prediction image signal from the intra prediction image generation unit 41.

また、比較部４６は、所定の探索精度の探索時に、イントラ予測画像の符号化コストとインター予測画像の符号化コストとの差が所定の閾値以内である場合には、当該探索精度よりも高い探索精度の探索を継続して行うように、インター予測画像生成部４２に指示を与える。 In addition, when the difference between the coding cost of the intra-predicted image and the coding cost of the inter-predicted image is within a predetermined threshold during the search with a predetermined search accuracy, the comparing unit 46 is higher than the search accuracy. An instruction is given to the inter-predicted image generation unit 42 so as to continue searching for search accuracy.

また、比較部４６は、所定の探索精度の探索時に、インター予測画像の符号化コストがイントラ予測画像の符号化コストより所定の閾値を超えて小さい場合には、インター予測モードを設定するためのモード指示信号を出力する共に、当該所定の探索精度の探索時に求めたインター予測画像信号をインター予測画像生成部４２からＭＢ予測モード選択部２４に出力させるようになっている。またこの場合には、比較部４６は、以後、インター予測画像生成部４２の探索処理（インター予測画像の生成処理）を停止させるようになっている。 In addition, the comparison unit 46 sets the inter prediction mode when the coding cost of the inter prediction image is smaller than the coding cost of the intra prediction image at a predetermined search accuracy and exceeds a predetermined threshold. While outputting the mode instruction signal, the inter prediction image generation unit 42 outputs the inter prediction image signal obtained at the time of the search with the predetermined search accuracy to the MB prediction mode selection unit 24. In this case, the comparison unit 46 stops the search process (inter prediction image generation process) of the inter prediction image generation unit 42 thereafter.

本実施の形態においては、インター予測画像生成部４２は、比較部４６によって、インター予測画像の探索処理が制御されるようになっている。即ち、インター予測画像生成部４２は、各探索精度毎のインター予測画像の算出を比較部４６からの指示に従って行い、比較部４６からイントラ予測モードを決定するモード指示信号が出力されると、以後の階層における演算を停止するようになっている。 In the present embodiment, the inter prediction image generation unit 42 is controlled by the comparison unit 46 to search for the inter prediction image. That is, the inter prediction image generation unit 42 performs the calculation of the inter prediction image for each search accuracy according to the instruction from the comparison unit 46, and when the mode instruction signal for determining the intra prediction mode is output from the comparison unit 46, The operation in the hierarchy is stopped.

次にこのように構成された実施の形態の動作について図４乃至図９を参照して説明する。図４は予測モードの決定処理を説明するためのフローチャートである。図５は図４中のイントラ予測候補決定処理を具体的に示すフローチャートである。図６乃至図９はインター予測における予測画像生成処理（探索処理）を示しており、夫々２画素精度、１画素精度、１／２画素精度又は１／４画素精度による探索処理を用いた例を示している。なお、図５乃至図９は図１に示す予測信号生成部２３の各部の処理をソフトウェアによって実現した場合のフローを示している。 Next, the operation of the embodiment configured as described above will be described with reference to FIGS. FIG. 4 is a flowchart for explaining prediction mode determination processing. FIG. 5 is a flowchart specifically showing the intra prediction candidate determination process in FIG. 6 to 9 show predicted image generation processing (search processing) in inter prediction, and examples using search processing with 2-pixel accuracy, 1-pixel accuracy, 1 / 2-pixel accuracy, or 1 / 4-pixel accuracy, respectively. Show. 5 to 9 show a flow when the processing of each unit of the prediction signal generation unit 23 shown in FIG. 1 is realized by software.

入力画像信号は予測信号生成部２３に与えられる。予測信号生成部２３は、図４のステップＳ１において、イントラ予測による符号化コストが最小となる予測画像の候補を決定する。即ち、入力画像信号は予測信号生成部２３内のイントラ予測画像生成部４１に与えられて、各モードにおける予測画像が生成される。差分画像生成部４４は入力画像と予測画像との差分画像を求め、符号化コスト計算部４５は差分画像の符号化コストを計算する。比較部４６は、イントラ予測における各モードの符号化コストを比較し、最小の符号化コストが得られる予測画像を求める。 The input image signal is given to the prediction signal generator 23. In step S1 of FIG. 4, the prediction signal generation unit 23 determines a prediction image candidate that minimizes the coding cost by intra prediction. That is, the input image signal is given to the intra prediction image generation unit 41 in the prediction signal generation unit 23, and a prediction image in each mode is generated. The difference image generation unit 44 obtains a difference image between the input image and the predicted image, and the encoding cost calculation unit 45 calculates the encoding cost of the difference image. The comparison unit 46 compares the coding costs of the respective modes in intra prediction, and obtains a predicted image that can obtain the minimum coding cost.

予測信号生成部２３はコンピュータシステムを用いて構成することができ、図５はＣＰＵによるイントラ予測画像の生成処理を示している。図５のステップＳ１１において、ＣＰＵは、先ず、最小の符号化コストの変数ｉｎｔｒａＣｏｓｔを、符号化コストとして取り得る最大値に初期化する。次に、ステップＳ１２において、所定の予測モードによるイントラ予測画像ｐｒｅｄ［ｘ，ｙ］を生成する。次に、イントラ予測画像と入力画像との差分が求められ（ステップＳ１３）、差分画像ｄｉｆｆ［ｘ，ｙ］に対して符号化コストが計算される。 The prediction signal generation unit 23 can be configured using a computer system, and FIG. 5 shows a process of generating an intra prediction image by the CPU. In step S11 of FIG. 5, the CPU first initializes a variable intraCost of the minimum encoding cost to a maximum value that can be taken as the encoding cost. Next, in step S12, an intra predicted image pred [x, y] in a predetermined prediction mode is generated. Next, the difference between the intra-predicted image and the input image is obtained (step S13), and the coding cost is calculated for the difference image diff [x, y].

符号化コストは、差分画像のコストである係数コストｃｏｅｆｆＣｏｓｔと予測モードの情報を符号化した場合のコストである予測モードコストｍｏｄｅＣｏｓｔとの和で与えられる。ステップＳ１４においては係数コストが計算され、ステップＳ１５では予測モードコストが計算される。なお、係数コストｃｏｅｆｆＣｏｓｔは、例えば下記（１）式に示す絶対値和の計算式で求められる。 The encoding cost is given as the sum of the coefficient cost coeffCost that is the cost of the difference image and the prediction mode cost modeCost that is the cost when the prediction mode information is encoded. In step S14, the coefficient cost is calculated, and in step S15, the prediction mode cost is calculated. The coefficient cost coeffCost is obtained, for example, by the absolute value sum calculation formula shown in the following formula (1).

ｃｏｅｆｆＣｏｓｔ＝ΣΣａｂｓ（ｄｉｆｆ［ｘ，ｙ］） …（１）
なお、ａｂｓ（）は、（）内の値の絶対値を示す。 coeffCost = ΣΣabs (diff [x, y]) (1)
Note that abs () indicates the absolute value of the value in ().

次に、ステップＳ１６において、ＣＰＵは、イントラ予測時の各モードの符号化コストの比較に用いる変数ｔｍｐＣｏｓｔに、係数コストの値ｃｏｅｆｆＣｏｓｔと予測モードコストの値ｍｏｄｅＣｏｓｔの和を代入する。ステップＳ１７では、ＣＰＵは、ｔｍｐＣｏｓｔと最小の符号化コストの変数ｉｎｔｒａＣｏｓｔとの大小を判定する。ｔｍｐＣｏｓｔの方がｉｎｔｒａＣｏｓｔよりも小さい場合には、ＣＰＵは、ｉｎｔｒａＣｏｓｔにｔｍｐＣｏｓｔを代入して更新し（ステップＳ１８）、そうでない場合には、ｉｎｔｒａＣｏｓｔを更新しない。 Next, in step S 16, the CPU substitutes the sum of the coefficient cost value coeffCost and the prediction mode cost value modeCost into a variable tmpCost used for comparing the coding cost of each mode at the time of intra prediction. In step S 17, the CPU determines the magnitude of tmpCost and the variable intraCost of the minimum encoding cost. If tmpCost is smaller than intraCost, the CPU assigns tmpCost to intraCost and updates it (step S18). Otherwise, it does not update intraCost.

ＣＰＵは、ステップＳ１２〜Ｓ１８を、各予測モード毎に繰返す。こうして、ｉｎｔｒａＣｏｓｔには、全予測モードの最も小さい符号化コストの値が格納される。ステップＳ１９では、ＣＰＵは、イントラ予測時の符号化コストとインター予測時の符号化コストとの比較を行う場合の調整パラメータｉｎｔｒａＣｏｓｔＯｆｆｓｅｔをｉｎｔｒａＣｏｓｔに加算する。こうして、最小の符号化コストｉｎｔｒａＣｏｓｔ及び最小の符号化コストを得るイントラ予測画像が求められる。 The CPU repeats steps S12 to S18 for each prediction mode. In this way, the value of the smallest coding cost of all prediction modes is stored in intraCost. In step S 19, the CPU adds an adjustment parameter intraCostOffset for comparison between the encoding cost at the time of intra prediction and the encoding cost at the time of inter prediction, to intraCost. In this way, an intra prediction image that obtains the minimum coding cost intraCost and the minimum coding cost is obtained.

次に、インター予測による候補を決定する。いま、インター予測画像の探索精度が２画素精度、１画素精度、１／２画素精度又は１／４画素精度であるものとする。本実施の形態においてはインター予測画像の探索は階層的に行われる。 Next, a candidate by inter prediction is determined. Now, the search accuracy of the inter prediction image is assumed to be 2 pixel accuracy, 1 pixel accuracy, 1/2 pixel accuracy, or 1/4 pixel accuracy. In the present embodiment, the search for inter predicted images is performed hierarchically.

先ず、図４のステップＳ２において、最も低い探索精度による探索処理を実施する。予測信号生成部２３は、インター予測においても、符号化コストが最小となる予測画像の候補を決定する。インター予測画像生成部４２は、参照画像信号が与えられて、探索範囲内で最も低い探索精度に応じてブロック化位置を変化させながらマクロブロックを選択して、インター予測画像として順次差分画像生成部４４に出力する。差分画像生成部４４は入力画像とインター予測画像との差分画像を求め、符号化コスト計算部４５は差分画像の符号化コストを計算する。比較部４６は、各インター予測画像の符号化コストを比較し、最小の符号化コストｉｎｔｅｒＣｏｓｔ０が得られるインター予測画像を求める。 First, in step S2 of FIG. 4, a search process with the lowest search accuracy is performed. The predicted signal generation unit 23 also determines a predicted image candidate that minimizes the coding cost even in inter prediction. The inter prediction image generation unit 42 is provided with a reference image signal, selects a macroblock while changing the blocking position according to the lowest search accuracy within the search range, and sequentially generates a difference image generation unit as an inter prediction image. 44. The difference image generation unit 44 obtains a difference image between the input image and the inter prediction image, and the encoding cost calculation unit 45 calculates the encoding cost of the difference image. The comparison unit 46 compares the encoding costs of the inter-predicted images and obtains an inter-predicted image that provides the minimum encoding cost interCost0.

図６はＣＰＵによる２画素精度の探索処理を示している。図６のステップＳ２１において、ＣＰＵは、先ず、最小の符号化コストの変数ｉｎｔｅｒＣｏｓｔ０を、符号化コストとして取り得る最大値に初期化する。次に、ステップＳ２２において、インター予測画像を生成する。即ち、ＣＰＵは、所定の探索範囲内で、２画素精度で動きベクトルを探索し、動きベクトル候補ＭＶ候補Ｍｖ０を決定する。次に、ＣＰＵは、２画素精度の動きベクトル候補からインター予測画像ｐｒｅｄ［ｘ，ｙ］を生成する。次に、インター予測画像と入力画像との差分が求められ（ステップＳ２３）、差分画像ｄｉｆｆ［ｘ，ｙ］に対して符号化コストが計算される。 FIG. 6 shows a two-pixel precision search process by the CPU. In step S21 in FIG. 6, the CPU first initializes the variable coding cost interCost0 of the minimum coding cost to the maximum value that can be taken as the coding cost. Next, in step S22, an inter prediction image is generated. That is, the CPU searches for a motion vector with 2-pixel accuracy within a predetermined search range, and determines a motion vector candidate MV candidate Mv0. Next, the CPU generates an inter predicted image pred [x, y] from the motion vector candidates with 2-pixel accuracy. Next, the difference between the inter predicted image and the input image is obtained (step S23), and the coding cost is calculated for the difference image diff [x, y].

インター予測における符号化コストは、差分画像ｄｉｆｆ［ｘ，ｙ］のコストである係数コストｃｏｅｆｆＣｏｓｔとインター予測画像のブロック化位置を示す動きベクトルの情報を符号化した場合のコストであるＭＶコストｍｖＣｏｓｔとの和で与えられる。ステップＳ２４においては係数コストが計算され、ステップＳ２５ではＭＶコストが計算される。 The encoding cost in the inter prediction is a coefficient cost coeffCost that is a cost of the difference image diff [x, y] and an MV cost mvCost that is a cost in the case of encoding motion vector information indicating the blocking position of the inter prediction image. Is given as the sum of In step S24, the coefficient cost is calculated, and in step S25, the MV cost is calculated.

次に、ステップＳ２６において、インター予測時の各モードの符号化コストの比較に用いる変数ｔｍｐＣｏｓｔに、係数コストの値ｃｏｅｆｆＣｏｓｔとＭＶコストの値ｍｖＣｏｓｔの和を代入する。ステップＳ２７では、ｔｍｐＣｏｓｔと最小の符号化コストの変数ｉｎｔｅｒＣｏｓｔ０との大小を判定する。ｔｍｐＣｏｓｔの方がｉｎｔｅｒＣｏｓｔ０よりも小さい場合には、ｉｎｔｅｒＣｏｓｔ０にｔｍｐＣｏｓｔを代入して更新し（ステップＳ２８）、そうでない場合には、ｉｎｔｅｒＣｏｓｔ０を更新しない。 Next, in step S26, the sum of the coefficient cost value coeffCost and the MV cost value mvCost is substituted into a variable tmpCost used for comparison of the coding cost of each mode during inter prediction. In step S27, it is determined whether tmpCost is the minimum encoding cost variable interCost0. If tmpCost is smaller than interCost0, tmpCost is substituted and updated for interCost0 (step S28). Otherwise, interCost0 is not updated.

ステップＳ２２〜Ｓ２８を繰返して、探索範囲内で２画素精度でブロック化位置を変化させながら、最小となる符号化コストを求める。こうして、ｉｎｔｅｒＣｏｓｔ０には、探索範囲内で最も小さい符号化コストの値が格納される。ステップＳ２９では、イントラ予測時の符号化コストとインター予測時の符号化コストとの比較を行う場合の調整パラメータｉｎｔｅｒＣｏｓｔＯｆｆｓｅｔをｉｎｔｅｒＣｏｓｔ０に加算する。こうして、探索精度が２画素精度の場合において、最小の符号化コスト及び最小の符号化コストを得るインター予測画像が求められる。 Steps S22 to S28 are repeated to obtain the minimum encoding cost while changing the blocking position with two-pixel accuracy within the search range. Thus, interCost0 stores the smallest coding cost value within the search range. In step S29, the adjustment parameter interCostOffset for comparing the coding cost at the time of intra prediction and the coding cost at the time of inter prediction is added to interCost0. Thus, when the search accuracy is two-pixel accuracy, an inter prediction image that obtains the minimum coding cost and the minimum coding cost is obtained.

なお、本実施の形態においては、後述するように、イントラ予測の符号化コストとインター予測の符号化コストとの比較時に所定の閾値を用いているので、この閾値を適宜設定することにより、調整パラメータｉｎｔｅｒＣｏｓｔＯｆｆｓｅｔの加算は、省略してもよい。 In this embodiment, as will be described later, since a predetermined threshold is used when comparing the encoding cost of intra prediction and the encoding cost of inter prediction, adjustment can be made by appropriately setting this threshold. The addition of the parameter interCostOffset may be omitted.

次に、図４のステップＳ３において、ＣＰＵは、ステップＳ２の探索処理が最も探索精度が高い探索処理であるか否かを判定する。最も高い探索精度の探索処理でない場合には、処理をステップＳ４に移行して、イントラ予測における符号化コスト（イントラコスト）が十分に小さいか否かを判定する。例えば、ＣＰＵは、下記（２）式に示すように、イントラ予測における符号化コストｉｎｔｒａＣｏｓｔがインター予測における符号化コストｉｎｔｅｒＣｏｓｔ（この場合には、２画素精度で求めたｉｎｔｅｒＣｏｓｔ０）より所定の閾値ｔｈＩｎｔｒａ０を超えて小さいか否かを判定する。 Next, in step S3 in FIG. 4, the CPU determines whether or not the search process in step S2 is the search process with the highest search accuracy. If it is not the search process with the highest search accuracy, the process proceeds to step S4, and it is determined whether or not the encoding cost (intra cost) in intra prediction is sufficiently small. For example, as shown in the following equation (2), the CPU sets a predetermined threshold thIntra0 from the encoding cost intraCost in intra prediction based on the encoding cost interCost in inter prediction (in this case, interCost0 obtained with two-pixel accuracy). It is determined whether or not it is too small.

ｉｎｔｒａＣｏｓｔ＜ｉｎｔｅｒＣｏｓｔ０−ｔｈＩｎｔｒａ０ …（２）
上記（２）式を満足しない場合には、次のステップＳ６において、インターコストが十分に小さいか否かが判定される。例えば、ＣＰＵは、下記（３）式に示すように、インター予測における符号化コストｉｎｔｅｒＣｏｓｔがイントラ予測における符号化コストｉｎｔｒａＣｏｓｔ（この場合には、ｉｎｔｅｒＣｏｓｔ０）より所定の閾値ｔｈＩｎｔｅｒ０を超えて小さいか否かを判定する。 intraCost <interCost0-thIntra0 (2)
If the above equation (2) is not satisfied, it is determined in the next step S6 whether or not the inter cost is sufficiently small. For example, as shown in the following equation (3), the CPU determines whether or not the coding cost interCost in inter prediction is smaller than the coding cost intraCost in intra prediction (in this case, interCost0) exceeding a predetermined threshold thInter0. Determine.

ｉｎｔｅｒＣｏｓｔ０＜ｉｎｔｒａＣｏｓｔ−ｔｈＩｎｔｅｒ０ …（３）
上記（３）式を満足すると判定された場合、即ち、インター予測モードの方がイントラ予測モードよりも十分に低い符号化コストが得られると判定された場合には、ＣＰＵは処理をステップＳ７に移行して、従来と同様に、次に高い探索精度から最高探索精度まで順次探索処理を実施する。 interCost0 <intraCost-thInter0 (3)
If it is determined that the above equation (3) is satisfied, that is, if it is determined that the inter prediction mode can provide a sufficiently lower encoding cost than the intra prediction mode, the CPU proceeds to step S7. In the same manner as in the prior art, the search processing is sequentially performed from the next highest search accuracy to the highest search accuracy.

図７はＣＰＵによる１画素精度の探索処理を示している。１画素精度の探索処理においては、図３において示したように、探索範囲は、２画素精度の探索処理における候補を含む範囲に設定される。従って、１画素精度において求められるイントラ予測の符号化コストは、必ず、２画素精度において求められるイントラ予測の符号化コスト以下の値となる。そこで、図７のステップＳ３１において、ＣＰＵは、１画素精度の探索処理における最小の符号化コストの変数ｉｎｔｅｒＣｏｓｔ１を、符号化コストとして取り得る最大値であるｉｎｔｅｒＣｏｓｔ０に初期化する。 FIG. 7 shows search processing with one pixel accuracy by the CPU. In the search process with one pixel accuracy, as shown in FIG. 3, the search range is set to a range including candidates in the search process with two pixel accuracy. Therefore, the coding cost of intra prediction obtained with one pixel accuracy is always a value equal to or lower than the coding cost of intra prediction obtained with two pixel accuracy. Therefore, in step S31 of FIG. 7, the CPU initializes the variable interCost1 of the minimum encoding cost in the search process with one pixel accuracy to the interCost0 that is the maximum value that can be taken as the encoding cost.

次に、ステップＳ３２において、インター予測画像を生成する。即ち、ＣＰＵは、所定の探索範囲内で、１画素精度で動きベクトルを探索し、動きベクトル候補を決定し、決定した動きベクトル候補からインター予測画像ｐｒｅｄ［ｘ，ｙ］を生成する。以後のステップＳ３３〜Ｓ３９の処理は、図６の２画素精度の探索処理と同様である。こうして、ステップＳ２９において、１画素精度の探索処理における最小の符号化コストｉｎｔｅｒＣｏｓｔ１が求められる。 Next, in step S32, an inter prediction image is generated. That is, the CPU searches for a motion vector with one pixel accuracy within a predetermined search range, determines a motion vector candidate, and generates an inter-predicted image pred [x, y] from the determined motion vector candidate. The subsequent steps S33 to S39 are the same as the two-pixel precision search process of FIG. Thus, in step S29, the minimum encoding cost interCost1 in the search process with one pixel accuracy is obtained.

次に、ＣＰＵは図８に示す１／２画素精度の探索処理を行う。１／２画素精度の探索処理においては、ステップＳ４１において、ＣＰＵは、１／２画素精度の探索処理における最小の符号化コストの変数ｉｎｔｅｒＣｏｓｔ２を、符号化コストとして取り得る最大値であるｉｎｔｅｒＣｏｓｔ１に初期化する。 Next, the CPU performs a search process with 1/2 pixel accuracy shown in FIG. In the search process with 1/2 pixel accuracy, in step S41, the CPU initially sets a variable interCost2 of the minimum encoding cost in the search process with 1/2 pixel accuracy to interCost1, which is the maximum value that can be taken as the encoding cost. Turn into.

次に、ステップＳ４２において、インター予測画像を生成する。即ち、ＣＰＵは、所定の探索範囲内で、１／２画素精度で動きベクトルを探索し、動きベクトル候補を決定し、決定した動きベクトル候補からインター予測画像ｐｒｅｄ［ｘ，ｙ］を生成する。以後のステップＳ４３〜Ｓ４９の処理は、図６の２画素精度の探索処理と同様である。こうして、ステップＳ４９において、１／２画素精度の探索処理における最小の符号化コストｉｎｔｅｒＣｏｓｔ２が求められる。 Next, in step S42, an inter prediction image is generated. That is, the CPU searches for a motion vector with a 1/2 pixel accuracy within a predetermined search range, determines a motion vector candidate, and generates an inter predicted image pred [x, y] from the determined motion vector candidate. The subsequent steps S43 to S49 are the same as the two-pixel precision search process of FIG. Thus, in step S49, the minimum encoding cost interCost2 in the search process with 1/2 pixel accuracy is obtained.

次に、ＣＰＵは図９に示す１／４画素精度の探索処理を行う。１／４画素精度の探索処理においては、ステップＳ５１において、ＣＰＵは、１／４画素精度の探索処理における最小の符号化コストの変数ｉｎｔｅｒＣｏｓｔ３を、符号化コストとして取り得る最大値であるｉｎｔｅｒＣｏｓｔ２に初期化する。 Next, the CPU performs a search process with 1/4 pixel accuracy shown in FIG. In the 1/4 pixel precision search process, in step S51, the CPU initially sets a variable coding cost interCost3 of the minimum coding cost in the 1/4 pixel precision search process to interCost2, which is the maximum value that can be taken as the coding cost. Turn into.

次に、ステップＳ５２において、インター予測画像を生成する。即ち、ＣＰＵは、所定の探索範囲内で、１／４画素精度で動きベクトルを探索し、動きベクトル候補を決定し、決定した動きベクトル候補からインター予測画像ｐｒｅｄ［ｘ，ｙ］を生成する。以後のステップＳ５３〜Ｓ５９の処理は、図６の２画素精度の探索処理と同様である。こうして、ステップＳ５９において、１／４画素精度の探索処理における最小の符号化コストｉｎｔｅｒＣｏｓｔ３が求められる。 Next, in step S52, an inter prediction image is generated. That is, the CPU searches for a motion vector with 1/4 pixel accuracy within a predetermined search range, determines a motion vector candidate, and generates an inter predicted image pred [x, y] from the determined motion vector candidate. The subsequent processing of steps S53 to S59 is the same as the two-pixel precision search processing of FIG. Thus, in step S59, the minimum encoding cost interCost3 in the search process with 1/4 pixel accuracy is obtained.

次のステップＳ８において、ＣＰＵは、インター予測モードを設定する。即ち、ステップＳ７の処理が行われる場合には、インター予測において、最低探索精度から最高探索精度までの全ての探索精度での処理が行われることになる。 In the next step S8, the CPU sets an inter prediction mode. That is, when the process of step S7 is performed, in the inter prediction, the process is performed with all the search precisions from the lowest search precision to the highest search precision.

これに対し、ステップＳ４において、上記（２）式を満足するものと判定されて、処理がステップＳ５に移行する場合には、最低探索精度から最高探索精度までの全ての探索精度での処理は行われない。例えば、インター予測の最低探索精度の探索処理において求められた符号化コストが十分に小さいと判定された場合には、ＣＰＵは、処理をステップＳ４からステップＳ５に移行して、イントラ予測モードを設定する。即ち、この場合には、インター予測の探索処理としては、最低探索精度の処理のみが行われることになる。 On the other hand, if it is determined in step S4 that the above equation (2) is satisfied and the process proceeds to step S5, the process with all search accuracy from the minimum search accuracy to the maximum search accuracy is performed. Not done. For example, if it is determined that the coding cost obtained in the search process with the minimum search accuracy for inter prediction is sufficiently small, the CPU shifts the process from step S4 to step S5 to set the intra prediction mode. To do. That is, in this case, only the process with the lowest search accuracy is performed as the inter prediction search process.

また、ステップＳ４において、上記（２）式を満足しないものと判定され、更に、ステップＳ６において上記（３）式も満足しないと判定された場合には、イントラ予測モードとインター予測モードとのいずれのモードにするかの決定を行わずに、ステップＳ９において次に高い探索精度の探索処理を行う。 In step S4, if it is determined that the above equation (2) is not satisfied, and further, if it is determined in step S6 that the above equation (3) is not satisfied, either the intra prediction mode or the inter prediction mode is selected. In step S9, the search processing with the next highest search accuracy is performed without determining whether to select the mode.

例えば、２画素精度の探索処理によって得られた符号化コストに基づいて、ステップＳ４，Ｓ６が判定されてステップＳ９に処理が移行した場合には、ステップＳ９において１画素精度の探索処理が行われる。そして、１画素精度の探索処理によって得られた符号化コストに基づいて、ステップＳ４，Ｓ６の判定が行われる。 For example, when steps S4 and S6 are determined based on the coding cost obtained by the search process with two pixel accuracy and the process proceeds to step S9, the search process with one pixel accuracy is performed at step S9. . Then, determinations in steps S4 and S6 are made based on the encoding cost obtained by the search process with one pixel accuracy.

例えば、ステップＳ４，Ｓ６では、下記（４），（５）式を満足するか否かが判定される。なお、ＣＰＵは、各階層毎に、ステップＳ４，Ｓ６に用いる閾値を再設定する。インター予測の符号化コストは、探索精度が高精度になる程小さくなるので、閾値も探索精度に応じて小さくした方がよい。 For example, in steps S4 and S6, it is determined whether or not the following expressions (4) and (5) are satisfied. In addition, CPU resets the threshold value used for step S4, S6 for every hierarchy. Since the encoding cost of inter prediction becomes smaller as the search accuracy becomes higher, it is better to reduce the threshold according to the search accuracy.

ｉｎｔｒａＣｏｓｔ＜ｉｎｔｅｒＣｏｓｔ１−ｔｈＩｎｔｒａ１ …（４）
ｉｎｔｅｒＣｏｓｔ１＜ｉｎｔｒａＣｏｓｔ−ｔｈＩｎｔｅｒ１ …（５）
上記（５）式を満足する場合には、次のステップＳ７が実行されることになり、上記（４）式を満足する場合には、次のステップＳ５が実行されることになる。ステップＳ５が実行される場合には、インター予測の２画素精度及び１画素精度の探索処理のみによって、イントラ予測モードが決定されることになる。 intraCost <interCost1-thIntra1 (4)
interCost1 <intraCost-thInter1 (5)
When the above equation (5) is satisfied, the next step S7 is executed, and when the above equation (4) is satisfied, the next step S5 is executed. When step S5 is executed, the intra prediction mode is determined only by the search process with the 2-pixel accuracy and the 1-pixel accuracy of inter prediction.

同様に、ステップＳ９において１／２画素精度の探索処理が行われる場合には、１／２画素精度の探索処理によって得られた符号化コストに基づいて、ステップＳ４，Ｓ６の判定が行われる。この場合には、ＣＰＵは、ステップＳ４，Ｓ６において、符号化コストが下記（６），（７）式を満足するか否かを判定する。 Similarly, when the search process with 1/2 pixel accuracy is performed in step S9, the determinations of steps S4 and S6 are made based on the coding cost obtained by the search process with 1/2 pixel accuracy. In this case, the CPU determines whether or not the encoding cost satisfies the following expressions (6) and (7) in steps S4 and S6.

ｉｎｔｒａＣｏｓｔ＜ｉｎｔｅｒＣｏｓｔ２−ｔｈＩｎｔｒａ２ …（６）
ｉｎｔｅｒＣｏｓｔ２＜ｉｎｔｒａＣｏｓｔ−ｔｈＩｎｔｅｒ２ …（７）
上記（７）式を満足する場合には、次のステップＳ７が実行されることになり、上記（６）式を満足する場合には、次のステップＳ５が実行されることになる。ステップＳ５が実行される場合には、インター予測の２画素精度〜１／２画素精度の探索処理によって、イントラ予測モードが決定されることになる。 intraCost <interCost2-thIntra2 (6)
interCost2 <intraCost-thInter2 (7)
When the above equation (7) is satisfied, the next step S7 is executed, and when the above equation (6) is satisfied, the next step S5 is executed. When step S5 is executed, the intra prediction mode is determined by the search process with the inter-prediction 2-pixel accuracy to 1 / 2-pixel accuracy.

ステップＳ９において１／４画素精度の探索処理が行われる場合には、ＣＰＵは、処理をステップＳ３からステップＳ１０に移行させて、１／４画素精度の探索処理によって得られた符号化コストとイントラコストとの比較を行う。この場合には、ＣＰＵは、ステップＳ１０において、例えば符号化コストが下記（８）式を満足するか否かを判定する。 When the 1/4 pixel precision search process is performed in step S9, the CPU shifts the process from step S3 to step S10, and the encoding cost and the intra obtained by the 1/4 pixel precision search process. Compare with cost. In this case, the CPU determines in step S10 whether, for example, the encoding cost satisfies the following expression (8).

ｉｎｔｅｒＣｏｓｔ３＞ｉｎｔｒａＣｏｓｔ …（８）
上記（８）式を満足する場合には、ステップＳ５においてイントラ予測モードが設定され、上記（８）式を満足しない場合には、ステップＳ８においてインター予測モードが設定されることになる。 interCost3> intraCost (8)
If the expression (8) is satisfied, the intra prediction mode is set in step S5. If the expression (8) is not satisfied, the inter prediction mode is set in step S8.

ステップＳ５においてイントラ予測モードが設定される場合には、ＣＰＵは以降の探索処理を停止する。なお、図１では比較部４６がインター予測画像生成部４２を制御することで探索処理を停止させることができる。即ち、ステップＳ４の比較処理において、イントラ予測モードが選択された場合には、その階層以降の探索処理が停止されることになり、その分の計算を省略することができる。つまり、インター予測における演算量の削減が可能である。 When the intra prediction mode is set in step S5, the CPU stops the subsequent search process. In FIG. 1, the search process can be stopped by the comparison unit 46 controlling the inter predicted image generation unit 42. That is, when the intra prediction mode is selected in the comparison process in step S4, the search process after the hierarchy is stopped, and the calculation for that level can be omitted. That is, it is possible to reduce the amount of calculation in inter prediction.

このように本実施の形態においては、インター予測の探索処理を探索精度毎に階層化して行い、各階層毎にイントラ予測とインター予測の符号化コストによるモードの決定処理を行い、イントラ予測モードの決定を行った場合には、次の階層の探索処理を停止させるようになっている。これにより、インター予測における演算量を削減することができ、消費電力を削減させることが可能である。また、インター予測の演算量が少なくなるので、モード決定に要する時間を短縮することもできる。 As described above, in the present embodiment, the inter prediction search process is hierarchized for each search accuracy, and the mode determination process is performed for each hierarchy based on the intra prediction and the inter prediction encoding cost. When the determination is made, the search process for the next hierarchy is stopped. Thereby, the calculation amount in the inter prediction can be reduced, and the power consumption can be reduced. In addition, since the amount of calculation for inter prediction is reduced, the time required for mode determination can be shortened.

なお、本発明は上記各実施の形態に限定されるものではなく、種々の変形が可能である。例えば、符号化装置の構成及び符号化手法は、特に限定されるものではない。 In addition, this invention is not limited to said each embodiment, A various deformation | transformation is possible. For example, the configuration of the encoding device and the encoding method are not particularly limited.

本発明の実施の形態に係る動画像符号化装置に採用される予測信号生成部を示すブロック図。The block diagram which shows the prediction signal production | generation part employ | adopted for the moving image encoder which concerns on embodiment of this invention. 動画像符号化装置の全体構成を示すブロック図。The block diagram which shows the whole structure of a moving image encoder. インター予測における階層的な探索処理を説明するための説明図。Explanatory drawing for demonstrating the hierarchical search process in inter prediction. 図４は予測モードの決定処理を説明するためのフローチャート。FIG. 4 is a flowchart for explaining prediction mode determination processing. 図５は図４中のイントラ予測候補決定処理を具体的に示すフローチャート。FIG. 5 is a flowchart specifically showing intra prediction candidate determination processing in FIG. 4. インター予測における２画素精度の予測画像生成処理を示すフローチャート。The flowchart which shows the prediction image generation process of 2 pixel precision in inter prediction. インター予測における１画素精度の予測画像生成処理を示すフローチャート。The flowchart which shows the prediction image generation process of 1 pixel precision in inter prediction. インター予測における１／２画素精度の予測画像生成処理を示すフローチャート。The flowchart which shows the prediction image generation process of the 1/2 pixel precision in inter prediction. インター予測における１／４画素精度の予測画像生成処理を示すフローチャート。The flowchart which shows the prediction image generation process of the 1/4 pixel precision in inter prediction.

Explanation of symbols

２３…予測信号生成部、４１…イントラ予測画像生成部、４２…インター予測画像生成部、４４…差分画像生成部、４５…符号化コスト計算部、４６…比較部。 DESCRIPTION OF SYMBOLS 23 ... Prediction signal generation part, 41 ... Intra prediction image generation part, 42 ... Inter prediction image generation part, 44 ... Difference image generation part, 45 ... Coding cost calculation part, 46 ... Comparison part.

Claims

An intra-prediction image generation unit that searches and determines a prediction image in intra prediction based on the coding cost;
Search processing for predicted images in inter prediction is hierarchized according to search accuracy, search accuracy is sequentially switched from low to high accuracy for each layer, and prediction images in inter prediction are searched based on coding cost for each layer An inter-prediction image generation unit to be determined,
The encoding cost of the prediction image determined by the intra-prediction image generation unit and the encoding cost of the prediction image determined by the inter-prediction image generation unit are compared for each hierarchy, and the intra cost is calculated for each hierarchy based on the comparison result. When the prediction mode of prediction or inter prediction is determined or suspended, and the prediction mode is determined to be the intra prediction mode, the search process of the subsequent layers in the inter prediction image generation unit is stopped. A moving picture coding apparatus comprising: a comparison unit.

The comparison unit includes:
When the encoding cost of the prediction image determined by the intra prediction image generation unit is smaller than the encoding cost of the prediction image determined by the inter prediction image generation unit by exceeding a predetermined threshold, the prediction mode is changed to the intra prediction mode. Decide
When the encoding cost of the prediction image determined by the inter prediction image generation unit is smaller than the encoding cost of the prediction image determined by the intra prediction image generation unit exceeding a predetermined threshold, the prediction mode is set to the inter prediction mode. Decide
When the difference between the encoding cost of the prediction image determined by the intra prediction image generation unit and the encoding cost of the prediction image determined by the inter prediction image generation unit is within a predetermined threshold, the prediction mode is determined. The video encoding apparatus according to claim 1, wherein the video encoding apparatus holds the video.

The moving image encoding apparatus according to claim 1 or 2, wherein the inter predicted image generation unit sets a search range including a predicted image determined in the immediately preceding hierarchy as a search range of a predicted image. .

The moving image encoding apparatus according to claim 2, wherein the comparison unit resets the predetermined threshold for each of the hierarchies.

Search and determine the prediction image in intra prediction based on the coding cost,
Search processing for predicted images in inter prediction is hierarchized according to search accuracy, search accuracy is sequentially switched from low to high accuracy for each layer, and prediction images in inter prediction are searched based on coding cost for each layer Decide
Comparing the encoding cost of the prediction image in the intra prediction and the encoding cost of the prediction image in the inter prediction for each layer,
Based on the comparison result, determine any prediction mode of intra prediction and inter prediction for each tier or hold the decision,
A moving picture coding method characterized by stopping subsequent search processing in the inter prediction when the prediction mode is determined to be an intra prediction mode.