JP2005142986A

JP2005142986A - Moving image encoding method, moving image encoding apparatus and moving image encoding program

Info

Publication number: JP2005142986A
Application number: JP2003379448A
Authority: JP
Inventors: Makoto Hagai; 誠羽飼
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-11-10
Filing date: 2003-11-10
Publication date: 2005-06-02

Abstract

<P>PROBLEM TO BE SOLVED: To provide a method for reducing an inter-frame prediction throughput by stopping detecting a motion vector using an intra-frame prediction evaluation value. <P>SOLUTION: In the image encoding apparatus comprising a motion detection unit ME for calculating an inter-frame prediction evaluation value InterCost, an intra-frame prediction evaluation value calculation unit IntraCostCalc, a mode selection unit ModeSel for selecting either inter-frame prediction or intra-frame prediction as a prediction mode, and a variable length encoding unit VLC for outputting an encoded signal BitStm, the image encoding apparatus includes an inter-frame prediction evaluation value estimation unit for calculating an inter-frame prediction evaluation estimate before the end of the calculation in InterCost, and an intra-frame prediction determination unit IntraDet for instructing the end of calculation of the inter-frame prediction evaluation value in the motion detection unit ME by selecting intra-frame prediction as a prediction mode when the intra-frame prediction evaluation value and the inter-frame prediction evaluation value estimate fulfill predetermined conditions. Thus, a problem is solved that huge throughput is required for motion detection since it has to detect a motion vector from a number of motion vector candidates in motion detection processing used to inter-frame prediction of moving image encoding. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は動画像符号化に用いられるフレーム間予測処理の高速化方法、並びにそれをソフトウェアで実施するためのプログラムに係るものである。 The present invention relates to a method for speeding up an inter-frame prediction process used for moving picture coding, and a program for implementing the method by software.

近年、マルチメディアアプリケーションの発展に伴い、画像・音声・テキストなど、あらゆるメディアの情報を統一的に扱うことが一般的になってきた。この時、全てのメディアをディジタル化することにより統一的にメディアを扱うことが可能になる。しかしながら、ディジタル化された画像は膨大なデータ量を持つため、蓄積・伝送のためには、画像の情報圧縮技術が不可欠である。一方で、圧縮した画像データを相互運用するためには、圧縮技術の標準化も重要である。画像圧縮技術の標準規格としては、ＩＴＵ（国際電気通信連合電気通信標準化部門）のＨ．２６１、Ｈ．２６３、ＩＳＯ（国際標準化機構）のＭＰＥＧ（Moving Picture Experts Group）−１、ＭＰＥＧ−２、ＭＰＥＧ−４などがある。 In recent years, with the development of multimedia applications, it has become common to handle all media information such as images, sounds and texts in a unified manner. At this time, the media can be handled uniformly by digitizing all the media. However, since a digitized image has an enormous amount of data, image information compression technology is indispensable for storage and transmission. On the other hand, in order to interoperate compressed image data, standardization of compression technology is also important. Image compression technology standards include H.264 of ITU (International Telecommunication Union Telecommunication Standardization Sector). 261, H.H. H.263, ISO (International Organization for Standardization) MPEG (Moving Picture Experts Group) -1, MPEG-2, MPEG-4, and the like.

これらの動画像符号化方式に共通の技術として動き補償を伴うフレーム間予測がある。フレーム間予測はフレーム間の相関を利用した予測方法で、符号化済み画像の画素値から入力画素の予測をする方法である。このとき、画面内の物体の動きに対応するため、参照する符号化済み画像（以降、参照画像と呼ぶ）のどの位置の画素を予測画素としたかを示す動きベクトル情報を符号化信号に付加しておく。動きベクトルは、ＭＰＥＧ等では入力画像を所定サイズに分割した矩形領域（以降、ブロックと呼ぶ）毎に付加される。 As a technique common to these video coding systems, there is inter-frame prediction with motion compensation. Inter-frame prediction is a prediction method using correlation between frames, and is a method of predicting input pixels from pixel values of an encoded image. At this time, in order to correspond to the motion of the object in the screen, motion vector information indicating which pixel in the encoded image to be referred (hereinafter referred to as a reference image) is the predicted pixel is added to the encoded signal. Keep it. A motion vector is added to each rectangular area (hereinafter referred to as a block) obtained by dividing an input image into a predetermined size in MPEG or the like.

ところが、新たな物体がフレームインする場合や、物体が移動した際に後ろに隠れていた背景が現れる場合など、参照画像中に対応する部分が存在しない場合がある。このような場合には、フレーム間予測よりもフレーム内予測の予測効率が高い場合がある。 However, there are cases where there is no corresponding part in the reference image, such as when a new object enters the frame or when a background that has been hidden behind when the object moves appears. In such a case, the prediction efficiency of intra-frame prediction may be higher than inter-frame prediction.

これに対応するため、ＭＰＥＧ等ではブロック毎にフレーム間予測とフレーム内予測が切り替え可能になっている。選択した予測モードを示す信号（予測モード情報）が符号化信号中に格納されるので、画像復号装置はこの情報から画像符号化装置と同じ予測モードを選択することができる。画像符号化装置では各モードでの予測誤差を計算・比較し、予測モードを選択する。動きベクトルMVに対するフレーム間予測の評価値としては、例えば、式１で示した２乗誤差が使用される場合や式２で示した絶対値誤差が使用される。 In order to cope with this, in MPEG or the like, inter-frame prediction and intra-frame prediction can be switched for each block. Since a signal indicating the selected prediction mode (prediction mode information) is stored in the encoded signal, the image decoding apparatus can select the same prediction mode as the image encoding apparatus from this information. In the image coding apparatus, the prediction error in each mode is calculated and compared, and the prediction mode is selected. As the evaluation value of the inter-frame prediction for the motion vector MV, for example, the square error shown in Equation 1 is used or the absolute value error shown in Equation 2 is used.

一方、フレーム内予測の評価値としては、例えば、式３で示した分散値や式４で示した値が使用される。 On the other hand, as the evaluation value for intra-frame prediction, for example, the variance value shown in Expression 3 and the value shown in Expression 4 are used.

これらの評価値は予測誤差を表すため、値が小さい程、予測は正確になり符号化効率が高くなる。従って、例えば、式５で示すようにフレーム間予測とフレーム内予測の評価値を比較し、評価値がより小さいモードを選択すれば、より符号化効率が高くなる。 Since these evaluation values represent prediction errors, the smaller the value, the more accurate the prediction and the higher the encoding efficiency. Therefore, for example, as shown in Equation 5, if the evaluation values of inter-frame prediction and intra-frame prediction are compared and a mode with a smaller evaluation value is selected, the encoding efficiency becomes higher.

なお、式１〜４は評価値の一例であり、他の計算式で定義される評価値を用いてもよい。 Expressions 1 to 4 are examples of evaluation values, and evaluation values defined by other calculation expressions may be used.

なお、式５はモード判定式の一例であり、フレーム間予測評価値とフレーム内予測評価値を変数とし、符号化効率が高い予測モードを選択するのに適した他の判定式を用いてもよい。 Formula 5 is an example of the mode determination formula, and other determination formulas suitable for selecting a prediction mode with high encoding efficiency using the inter-frame prediction evaluation value and the intra-frame prediction evaluation value as variables may be used. Good.

図１０は従来の画像符号化装置である。図１０の従来の画像符号化装置は、画像信号Imgをブロックに分割し、そのブロック毎に処理を行う。減算器Subは、画像符号化装置に入力した画像信号Imgから予測画像信号Predを減算し、残差信号Resとして出力する。残差符号化部ResEncは、残差信号Resを入力し、ＤＣＴ変換・量子化などの符号化処理を行い、量子化済ＤＣＴ係数などを含む残差符号化信号EResを出力する。残差復号部ResDecは、残差符号化信号EResを入力し、逆量子化・逆ＤＣＴ変換などの画像復号処理を行い、残差復号信号DResを出力する。加算器Addは、残差復号信号DResと予測画像信号Predを加算し、復号画像信号DecImgとして出力する。復号画像信号DecImgで、以降のフレーム間予測で参照される可能性がある信号は、フレームメモリFMに格納される。画素予測部PredictorはフレームメモリFMに蓄積された符号化済画像を参照画像信号Refとして入力し予測画像信号Predを出力する。画素予測部Predictorは、画素予測を行い、予測モードを示す予測モード情報Modeとフレーム間予測時の動きベクトルMVを出力する。可変長符号化部VLCは、残差信号Resと予測モード情報Modeと動きベクトルMVを可変長符号化し、符号化信号BitStrmとして出力する。 FIG. 10 shows a conventional image coding apparatus. The conventional image encoding device shown in FIG. 10 divides the image signal Img into blocks and performs processing for each block. The subtracter Sub subtracts the predicted image signal Pred from the image signal Img input to the image encoding device, and outputs it as a residual signal Res. The residual encoding unit ResEnc receives the residual signal Res, performs encoding processing such as DCT transformation and quantization, and outputs a residual encoded signal ERes including quantized DCT coefficients. The residual decoding unit ResDec receives the residual encoded signal ERes, performs image decoding processing such as inverse quantization and inverse DCT transform, and outputs a residual decoded signal DRes. The adder Add adds the residual decoded signal DRes and the predicted image signal Pred, and outputs the result as a decoded image signal DecImg. In the decoded image signal DecImg, a signal that may be referred to in the subsequent inter-frame prediction is stored in the frame memory FM. The pixel prediction unit Predictor inputs the encoded image stored in the frame memory FM as a reference image signal Ref and outputs a predicted image signal Pred. The pixel prediction unit Predictor performs pixel prediction and outputs prediction mode information Mode indicating a prediction mode and a motion vector MV at the time of inter-frame prediction. The variable length coding unit VLC performs variable length coding on the residual signal Res, the prediction mode information Mode, and the motion vector MV, and outputs the result as a coded signal BitStrm.

次に画素予測部Predictorの詳細説明を行う。図１１は画素予測部Predictorのブロック図である。動き検出部MEは参照画像信号Refと画像信号Imgから動きベクトルMVを検出・出力し、動きベクトルMVでフレーム間予測を行った場合の評価値であるフレーム間予測評価値InterCostも出力する。フレーム内予測評価値計算部IntraCostCalcは、画像信号Imgを入力しフレーム内予測を行った場合の評価値であるフレーム内予測評価値IntraCostを出力する。モード判定部ModeSelは、フレーム間予測評価値InterCostとフレーム内予測評価値IntraCostを入力比較し、予測モードを選択し、選択した予測モードを予測モード情報Modeとして出力する。動き補償部MCは、動きベクトルMVを使用してフレーム間予測を行い、予測結果をフレーム間予測画像信号MCPredとして出力する。フレーム内予測部IntraPredictorは、フレーム内予測を行い、予測結果をフレーム内予測画像信号IPredとして出力する。予測モード情報Modeがフレーム間予測を示す場合にはスイッチSW1は”１”側に切り替わり、フレーム間予測画像信号MCPredを予測画像信号Predとして出力し、予測モード情報Modeがフレーム内予測を示す場合にはスイッチSW1は”０”側に切り替わり、フレーム内予測画像信号IPredを予測画像信号Predとして出力する。 Next, the pixel prediction unit Predictor will be described in detail. FIG. 11 is a block diagram of the pixel prediction unit Predictor. The motion detection unit ME detects and outputs a motion vector MV from the reference image signal Ref and the image signal Img, and also outputs an inter-frame prediction evaluation value InterCost that is an evaluation value when inter-frame prediction is performed using the motion vector MV. The intra-frame prediction evaluation value calculation unit IntraCostCalc outputs an intra-frame prediction evaluation value IntraCost that is an evaluation value when the image signal Img is input and intra-frame prediction is performed. The mode determination unit ModeSel inputs and compares the inter-frame prediction evaluation value InterCost and the intra-frame prediction evaluation value IntraCost, selects a prediction mode, and outputs the selected prediction mode as prediction mode information Mode. The motion compensation unit MC performs inter-frame prediction using the motion vector MV, and outputs a prediction result as an inter-frame prediction image signal MCPred. The intra-frame prediction unit IntraPredictor performs intra-frame prediction and outputs a prediction result as an intra-frame prediction image signal IPred. When the prediction mode information Mode indicates inter-frame prediction, the switch SW1 switches to the “1” side, outputs the inter-frame prediction image signal MCPred as the prediction image signal Pred, and the prediction mode information Mode indicates intra-frame prediction. The switch SW1 switches to the “0” side and outputs the intra-frame predicted image signal IPred as the predicted image signal Pred.

フレーム間予測では多数の動きベクトル候補から１つの動きベクトルを選択することになる。そのため、動き検出部MEは複数の動きベクトル候補から評価値が最小となる１つの動きベクトルを選択する。各動きベクトル候補に対し、例えば、式１、式２で示した評価値を計算し、評価値を最小とする動きベクトル候補を選択することで動きベクトルを検出する。選択した動きベクトルに対応した評価値がフレーム間予測評価値InterCostとなる。 In inter-frame prediction, one motion vector is selected from a large number of motion vector candidates. Therefore, the motion detection unit ME selects one motion vector having the smallest evaluation value from a plurality of motion vector candidates. For each motion vector candidate, for example, the evaluation values shown in Equations 1 and 2 are calculated, and the motion vector is detected by selecting the motion vector candidate that minimizes the evaluation value. The evaluation value corresponding to the selected motion vector is the inter-frame prediction evaluation value InterCost.

図１２は画素予測部Predictorの処理フローを示した図である。S11では動き検出部MEにより動きベクトル検出を行い、動きベクトルMVとフレーム間予測評価値InterCostを得る。S12ではフレーム内予測評価値計算部IntraCostCalcによりフレーム内予測評価値IntraCostを計算する。S13ではモード判定部ModeSelにより、フレーム間予測評価値InterCostとフレーム内予測評価値IntraCostの比較を行い、予測モードを判定する。 FIG. 12 is a diagram illustrating a processing flow of the pixel prediction unit Predictor. In S11, a motion vector is detected by the motion detection unit ME to obtain a motion vector MV and an inter-frame prediction evaluation value InterCost. In S12, the intra-frame prediction evaluation value IntraCost is calculated by the intra-frame prediction evaluation value calculation unit IntraCostCalc. In S13, the mode determination unit ModeSel compares the inter-frame prediction evaluation value InterCost and the intra-frame prediction evaluation value IntraCost to determine the prediction mode.

図１３は、図１１の従来の画像符号化装置に対応する画像復号装置の構成を示したブロック図である。可変長復号部VLDは、符号化信号BitStrmを入力し可変長復号を行い、残差符号化信号EResと動きベクトルMVと予測モード情報Modeを出力する。残差復号部ResDecは、残差符号化信号EResを入力し、逆量子化・逆ＤＣＴ変換などの復号処理を行い、残差復号信号DResを出力する。動き補償部MCは、フレームメモリFMから復号済み画像を参照画像信号Refとして入力し、動きベクトルMVを使用して動き補償を伴うフレーム間予測した結果をフレーム間予測画像信号MCPredとして出力する。フレーム内予測部IntraPredictorはフレーム内予測画像信号IPredを出力する。予測モード情報Modeがフレーム内予測を示す場合には、スイッチSW1は“０”側に切り替わりフレーム内予測画像信号IPredを予測画像信号Predとして出力する。予測モード情報Modeがフレーム間予測を示す場合には、スイッチSW1は“１”側に切り替わりフレーム間予測画像信号MCPredを予測画像信号Predとして出力する。加算器Addは、残差復号信号DResと予測画像信号Predを加算し、復号画像信号DecImgとして画像復号装置外に出力する。フレームメモリFMは、フレーム間予測のために復号画像信号DecImgを格納する。以上、説明した処理により画像復号装置は符号化信号BitStrmを復号し復号画像信号DecImgとして出力する。 FIG. 13 is a block diagram showing a configuration of an image decoding apparatus corresponding to the conventional image encoding apparatus of FIG. The variable length decoding unit VLD receives the encoded signal BitStrm, performs variable length decoding, and outputs a residual encoded signal ERes, a motion vector MV, and prediction mode information Mode. The residual decoding unit ResDec receives the residual encoded signal ERes, performs decoding processing such as inverse quantization and inverse DCT transform, and outputs a residual decoded signal DRes. The motion compensation unit MC inputs a decoded image from the frame memory FM as a reference image signal Ref, and outputs a result of inter-frame prediction with motion compensation using the motion vector MV as an inter-frame prediction image signal MCPred. The intra-frame prediction unit IntraPredictor outputs an intra-frame prediction image signal IPred. When the prediction mode information Mode indicates intra-frame prediction, the switch SW1 switches to “0” and outputs the intra-frame prediction image signal IPred as the prediction image signal Pred. When the prediction mode information Mode indicates inter-frame prediction, the switch SW1 switches to the “1” side and outputs the inter-frame prediction image signal MCPred as the prediction image signal Pred. The adder Add adds the residual decoded signal DRes and the predicted image signal Pred, and outputs the result as a decoded image signal DecImg outside the image decoding apparatus. The frame memory FM stores the decoded image signal DecImg for inter-frame prediction. As described above, the image decoding apparatus decodes the encoded signal BitStrm and outputs it as a decoded image signal DecImg by the processing described above.

ところで、フレーム間予測では、各動きベクトル候補に対し式１、式２で示したような評価値を計算しなければならないため、動きベクトル候補が多数あると計算時間が膨大になる。例えば、動きベクトルの各成分が−16〜16の範囲内の全画素位置を動きベクトル候補とすると、候補数は33×33＝1089個となる。処理能力が低い演算器でリアルタイムの符号化処理を行う場合には全候補の評価を行うのは処理量の観点から不可能な場合がある。そのため、これまでも、評価を行う動きベクトル候補数を減らすことで、動き検出の処理量を減らす方法が検討されてきた。その場合、全候補を評価するわけではないので評価値を最小とする動きベクトルを選択できるとは限らず、予測誤差は大きくなり符号化効率が落ちる。そのため、評価する動きベクトルの数を減らしても符号化効率が殆ど劣化しない動き検出方法が望ましい。その方法の１つとして局所探索による動き検出がある。以下、局所探索による動き検出について説明する。 By the way, in the inter-frame prediction, since the evaluation values as shown in Expression 1 and Expression 2 must be calculated for each motion vector candidate, if there are many motion vector candidates, the calculation time becomes enormous. For example, if all pixel positions within the range of −16 to 16 in each component of the motion vector are motion vector candidates, the number of candidates is 33 × 33 = 1089. When real-time encoding processing is performed with an arithmetic unit having low processing capability, it may be impossible to evaluate all candidates from the viewpoint of processing amount. Therefore, methods for reducing the amount of motion detection processing by reducing the number of motion vector candidates to be evaluated have been studied. In this case, since not all candidates are evaluated, it is not always possible to select a motion vector that minimizes the evaluation value, and the prediction error increases and coding efficiency decreases. Therefore, it is desirable to use a motion detection method in which the coding efficiency hardly deteriorates even if the number of motion vectors to be evaluated is reduced. One of the methods is motion detection by local search. Hereinafter, motion detection by local search will be described.

局所探索は、予測誤差が真の動きベクトル（予測誤差が最小となる動きベクトル）を中心に、距離に応じて予測誤差が大きくなる傾向を利用し、予測誤差が小さくなる方向に順に動きベクトル候補を評価していく方法である。ここで局所探索の一例を示す。図１４は局所探索の概念図である。局所探索ではまず探索開始点（探索中心の初期値）を選択し評価する。ここでは番号０（動きベクトルPMVが指す点）の点を探索開始点とする。次に探索中心の周辺の画素位置を指す動きベクトルを評価する。図では探索中心の4近傍を評価する場合を示す。探索中心とその4近傍の点の中で、探索中心以外の点での評価値が最小となる場合にはその点に探索中心を移動する。探索中心の評価値が最小であれば、その時点で動きベクトル検出を終了し、探索中心を指す動きベクトル候補を動きベクトルMVとする。図の例では、０，２，５，９の順で探索中心が移動している。局所探索において評価すべき動きベクトル候補数を減らすためには、できるだけ真の動きベクトルに近い探索開始点を選択することが重要である。そのような探索開始点の一例として、予測ベクトル（周辺ブロックの動きベクトルから予測した動きベクトル）等が使用される。また、探索開始点を選択する際に、予測ベクトルやゼロベクトル等の複数の候補を評価して、その中で評価値を最小とする点を探索開始点とする方法もある。 The local search uses a tendency that the prediction error increases according to the distance, centering on a motion vector with a true prediction error (motion vector that minimizes the prediction error), and motion vector candidates in order of decreasing prediction error. It is a method to evaluate. Here, an example of local search is shown. FIG. 14 is a conceptual diagram of local search. In the local search, a search start point (initial value of the search center) is first selected and evaluated. Here, the point of number 0 (point indicated by the motion vector PMV) is set as the search start point. Next, a motion vector indicating a pixel position around the search center is evaluated. The figure shows the case where four neighborhoods of the search center are evaluated. When the evaluation value at the point other than the search center is the minimum among the search center and the four neighboring points, the search center is moved to that point. If the evaluation value at the search center is minimum, the motion vector detection is terminated at that time, and a motion vector candidate indicating the search center is set as a motion vector MV. In the illustrated example, the search center moves in the order of 0, 2, 5, and 9. In order to reduce the number of motion vector candidates to be evaluated in the local search, it is important to select a search start point that is as close to the true motion vector as possible. As an example of such a search start point, a prediction vector (a motion vector predicted from motion vectors of surrounding blocks) or the like is used. Further, when selecting a search start point, there is also a method of evaluating a plurality of candidates such as a prediction vector and a zero vector, and setting a point that minimizes an evaluation value as a search start point.

図１５は局所探索を行う画素予測部Predictorを示したブロック図である。図１１と同じ動作をするユニットおよび同じ動作の信号は同じ記号を付し、説明を省略する。探索開始点選択部ME1は、探索開始点を選択し、選択した探索開始点を指す動きベクトルを初期動きベクトルMV1として出力する。局所探索部ME2は、初期動きベクトルMV1を入力し、局所探索を行い、動きベクトルMVと動きベクトルMVに対する評価値をフレーム間予測評価値InterCostとして出力する。図１６は図１５の画素予測部Predictorの処理フローを示した図である。図１２と同じ処理ステップは同じ記号を付し、説明を省略する。S14で探索開始点選択部ME1により探索開始点を選択する。S15では局所探索部ME2により局所探索を行う。 FIG. 15 is a block diagram illustrating a pixel prediction unit Predictor that performs local search. Units that operate in the same manner as in FIG. 11 and signals that operate in the same manner are denoted by the same symbols, and description thereof is omitted. The search start point selection unit ME1 selects a search start point, and outputs a motion vector indicating the selected search start point as an initial motion vector MV1. Local search unit ME2 receives initial motion vector MV1, performs local search, and outputs motion vector MV and an evaluation value for motion vector MV as inter-frame prediction evaluation value InterCost. FIG. 16 is a diagram illustrating a processing flow of the pixel prediction unit Predictor of FIG. The same processing steps as those in FIG. 12 are denoted by the same symbols, and description thereof is omitted. In S14, the search start point is selected by the search start point selection unit ME1. In S15, the local search is performed by the local search unit ME2.

また、複数の予測モードを切り替える画像符号化装置において、モード判定を高速化する方法についての従来技術が知られている（例えば、特許文献１参照。）。この従来技術では、第１符号化モードの符号化効率の推定値と設定値とを比較し、比較結果が第１符号化モードで十分符号量を小さくできることが保証される条件を満たす時には、第１符号化モードを選択し第２符号化モードの符号化効率の推定を行わないことで処理量を削減する方法を示している。
特開平８−１８６７２２（実施例７） In addition, in an image encoding apparatus that switches between a plurality of prediction modes, a conventional technique for a method for speeding up mode determination is known (see, for example, Patent Document 1). In this prior art, the estimated value of the encoding efficiency in the first encoding mode is compared with the set value, and when the comparison result satisfies a condition that the code amount can be sufficiently reduced in the first encoding mode, This shows a method of reducing the processing amount by selecting one encoding mode and not estimating the encoding efficiency of the second encoding mode.
JP-A-8-186722 (Example 7)

従来の画像符号化装置のフレーム間予測処理の処理量軽減では、動きベクトル検出処理のみに注目して高速化していたが、本発明では、フレーム内予測の評価値を使用して動きベクトル検出を打ち切ることで、フレーム間予測の処理量を軽減する方法を提供する。 In the conventional image coding apparatus, the amount of processing of inter-frame prediction processing is reduced by focusing only on the motion vector detection processing. In the present invention, however, motion vector detection is performed using the evaluation value of intra-frame prediction. A method of reducing the processing amount of inter-frame prediction by truncating is provided.

前記従来の課題を解決するために、本発明では、フレーム間予測評価値を計算する動き検出手段と、フレーム内予測評価値を計算するフレーム内予測評価値計算手段と、フレーム間予測評価値とフレーム内予測評価値からフレーム間予測とフレーム内予測の何れかを予測モードとして選択するモード判定手段と、モード判定手段が選択した予測モードを示す予測モード情報を多重化し符号化信号として出力する可変長符号化手段とを備えたことを特徴とする画像符号化装置において、フレーム間予測評価値の計算終了前にフレーム間予測評価値の推定値を計算するフレーム間予測評価値推定手段と、フレーム内予測評価値とフレーム間予測評価値の推定値とが所定の条件を満たす場合には、フレーム内予測を予測モードとして選択し、動き検出手段のフレーム間予測評価値の計算の終了を指示するフレーム内予測判定手段とを有す。 In order to solve the conventional problem, in the present invention, motion detection means for calculating an inter-frame prediction evaluation value, intra-frame prediction evaluation value calculation means for calculating an intra-frame prediction evaluation value, inter-frame prediction evaluation value, A mode determination unit that selects either inter-frame prediction or intra-frame prediction from the intra-frame prediction evaluation value as a prediction mode, and a variable that multiplexes prediction mode information indicating the prediction mode selected by the mode determination unit and outputs it as an encoded signal In the image encoding device, comprising: a long encoding unit; an inter-frame prediction evaluation value estimation unit that calculates an estimation value of an inter-frame prediction evaluation value before the end of calculation of an inter-frame prediction evaluation value; When the intra prediction evaluation value and the estimated value of the inter-frame prediction evaluation value satisfy a predetermined condition, intra-frame prediction is selected as the prediction mode, and motion detection is performed. Having a frame prediction judgment means for instructing termination of the calculation of the inter-frame prediction evaluation value means.

本構成によって、最終的にブロックがフレーム内予測とモード判定される可能性が高い場合には、その時点で動き検出処理を打ち切ることができ、従来の画像符号化装置から符号化効率を殆ど劣化させずに動き検出の処理量を削減できる。 With this configuration, when there is a high possibility that a block will be finally determined to be intra-frame prediction, the motion detection process can be aborted at that point, and the coding efficiency is almost degraded from the conventional image coding device. The amount of motion detection processing can be reduced without doing so.

本発明の画像符号化装置によれば、フレーム間予測の処理量を軽減でき、結果として画像符号化装置の処理量を軽減できる。 According to the image encoding device of the present invention, the processing amount of inter-frame prediction can be reduced, and as a result, the processing amount of the image encoding device can be reduced.

以下本発明の実施の形態について、図面を参照しながら説明する。本発明のポイントはフレーム間予測の評価値を推定し使用することで、最終的にブロックがフレーム内予測とモード判定される可能性が高い場合には、その時点で動き検出処理を打ち切ることである。これにより従来の画像符号化装置から符号化効率を殆ど劣化させずに動き検出の処理量を削減できる。本発明は、フレーム内予測と判定されるブロックが多数ある場合に特に効果が高くなる。例えば、カメラを大きく動かした場合や大きな物体がフレームインする場合では、フレーム間相関が低くなり、フレーム内予測となるブロックが多くなる。このような場合に本発明の効果が高くなる。 Embodiments of the present invention will be described below with reference to the drawings. The point of the present invention is to estimate and use the evaluation value of inter-frame prediction. When there is a high possibility that the block is finally determined to be intra-frame prediction and mode determination, the motion detection process is terminated at that point. is there. As a result, it is possible to reduce the amount of motion detection processing with almost no deterioration in encoding efficiency from the conventional image encoding apparatus. The present invention is particularly effective when there are a large number of blocks determined to be intra-frame prediction. For example, when the camera is moved greatly or when a large object enters the frame, the inter-frame correlation becomes low, and the number of blocks for intra-frame prediction increases. In such a case, the effect of the present invention is enhanced.

（実施の形態１）
本発明の画像符号化装置は図１０で示す従来の画像符号化装置に対して、画素予測部Predictorのみ異なる。そこで、ここでは画素予測部Predictorについて説明し、その他の部分は従来の画像符号化装置と同じとし、説明を省略する。 (Embodiment 1)
The image coding apparatus of the present invention differs from the conventional image coding apparatus shown in FIG. 10 only in the pixel prediction unit Predictor. Therefore, here, the pixel prediction unit Predictor will be described, and the other parts are assumed to be the same as those of the conventional image encoding device, and description thereof will be omitted.

図１は実施の形態１の画素予測部Predictorを示したブロック図である。図１１と同じ動作をするユニットおよび同じ動作の信号は同じ記号を付し、説明を省略する。図１１の従来の画像符号化装置の画素予測部Predictorとの相違点は、フレーム間予測評価値推定部InterCostEstとフレーム内予測判定部IntraDetとスイッチSW2が追加されていることである。フレーム間予測評価値推定部InterCostEstはフレーム間予測評価値を推定し、フレーム間予測評価値推定値EInterCostを出力する。フレーム内予測判定部IntraDetは、フレーム内予測評価値IntraCostとフレーム間予測評価値推定値EInterCostを入力し、所定の条件を満たす場合には動き検出を行わないように動き検出部MEに指示し、スイッチSW2を“０”に切り替える。所定の条件を満たさない場合には、スイッチSW2を“１”に切り替え、従来の画像符号化装置と同じ処理を行う。 FIG. 1 is a block diagram illustrating a pixel prediction unit Predictor according to the first embodiment. Units that operate in the same manner as in FIG. 11 and signals that operate in the same manner are denoted by the same symbols, and description thereof is omitted. The difference from the pixel prediction unit Predictor of the conventional image encoding device in FIG. 11 is that an inter-frame prediction evaluation value estimation unit InterCostEst, an intra-frame prediction determination unit IntraDet, and a switch SW2 are added. The inter-frame prediction evaluation value estimation unit InterCostEst estimates an inter-frame prediction evaluation value, and outputs an inter-frame prediction evaluation value estimation value EInterCost. The intra-frame prediction determination unit IntraDet inputs the intra-frame prediction evaluation value IntraCost and the inter-frame prediction evaluation value estimation value EInterCost, and instructs the motion detection unit ME not to perform motion detection when a predetermined condition is satisfied, Switch SW2 is set to “0”. When the predetermined condition is not satisfied, the switch SW2 is switched to “1” and the same processing as that of the conventional image encoding device is performed.

図２は図１の画素予測部Predictorの処理フローを示した図である。図１２と同じ処理ステップは同じ記号を付し、説明を省略する。本発明ではS12のフレーム内予測評価値計算とS16のフレーム間予測評価値推定をS11の動きベクトル探索前に行い、S17のフレーム内予測判定で所定の条件を満たす場合には、そこで予測モードをフレーム内予測と判定し終了する。ここでは、S12、S16の順で処理を行っているが、この順番は逆でもよい。 FIG. 2 is a diagram illustrating a processing flow of the pixel prediction unit Predictor of FIG. The same processing steps as those in FIG. 12 are denoted by the same symbols, and description thereof is omitted. In the present invention, the intra-frame prediction evaluation value calculation of S12 and the inter-frame prediction evaluation value estimation of S16 are performed before the motion vector search of S11, and when the predetermined condition is satisfied in the intra-frame prediction determination of S17, the prediction mode is set there. The determination is made as intra-frame prediction, and the process ends. Here, the processing is performed in the order of S12 and S16, but this order may be reversed.

フレーム内予想判定で使用する条件には、例えば、式６で示す条件が考えられる。 As a condition used in the intra-frame prediction determination, for example, a condition represented by Expression 6 can be considered.

ここでフレーム間予測評価値推定値Eは、フレーム間予測評価値の推定値である。式６は、モード判定部ModeSelで使用される判定式（式５）でフレーム内予測と判定される条件式のフレーム間予測評価値を推定値Eで置き換えたものに相当する。従って、当該ブロックのフレーム間予測評価値に近い値を推定値Eとして選択できれば、選択される予測モードは従来の画像符号化装置と殆ど同じ結果となるはずである。フレーム間予測評価値推定値Eの計算処理量が、動き検出部MEによるフレーム間予測評価値の計算処理量より小さければ、画素予測部Predictorの処理量は減少することになる。このような、フレーム間予測評価値の推定値Eの例として、直前のブロックのフレーム間予測評価値等、過去に符号化したブロックの符号化で得られた値を使用することが考えられる。 Here, the inter-frame prediction evaluation value estimation value E is an estimation value of the inter-frame prediction evaluation value. Expression 6 corresponds to the inter-frame prediction evaluation value of the conditional expression determined as intra-frame prediction in the determination expression (Expression 5) used in the mode determination unit ModeSel replaced with the estimated value E. Therefore, if a value close to the inter-frame prediction evaluation value of the block can be selected as the estimated value E, the selected prediction mode should have almost the same result as that of the conventional image encoding device. If the calculation processing amount of the inter-frame prediction evaluation value estimation value E is smaller than the calculation processing amount of the inter-frame prediction evaluation value by the motion detection unit ME, the processing amount of the pixel prediction unit Predictor decreases. As an example of such an estimated value E of the inter-frame prediction evaluation value, it is conceivable to use a value obtained by encoding a block encoded in the past, such as an inter-frame prediction evaluation value of the immediately preceding block.

以上のように本実施の形態によれば、従来の画像符号化装置から符号化効率を殆ど劣化させずに動き検出の処理量を削減できる。 As described above, according to the present embodiment, it is possible to reduce the amount of motion detection processing with almost no deterioration in encoding efficiency from the conventional image encoding apparatus.

背景技術で示した特許文献１の方法ではフレーム内予測評価値を予め設定された値と比較するのに対して、本発明では過去に符号化したブロックの情報等からフレーム間予測評価値を推定し使用するため、より高い精度でフレーム内予測を予め判定できる。 In the method of Patent Document 1 shown in the background art, an intra-frame prediction evaluation value is compared with a preset value. In the present invention, an inter-frame prediction evaluation value is estimated from information of a block encoded in the past. Therefore, intra-frame prediction can be determined in advance with higher accuracy.

なお、式６はフレーム内予測判定式の一例であり、フレーム間予測評価値推定値とフレーム内予測評価値を変数とした他の判定式を用いてもよい。 Expression 6 is an example of an intra-frame prediction determination expression, and other determination expressions using the inter-frame prediction evaluation value estimated value and the intra-frame prediction evaluation value as variables may be used.

なお、本実施の形態は、２種類の予測モードのみの場合を説明したが、３種類以上の予測モードの場合にも、１種類以上の予測モードに対し評価値を推定し、各予測モードの評価値もしくは評価値の推定値を比較することで、本実施の形態と同様の処理を行うことができる。 In addition, although this Embodiment demonstrated the case of only two types of prediction modes, also in the case of three or more types of prediction modes, an evaluation value is estimated with respect to one or more types of prediction modes, By comparing the evaluation value or the estimated value of the evaluation value, the same processing as in the present embodiment can be performed.

なお、本実施の形態では、予測モードとしてフレーム間予測とフレーム内予測を例に説明したが、他の予測方法を使用してもよい。 In the present embodiment, inter-frame prediction and intra-frame prediction have been described as examples of the prediction mode, but other prediction methods may be used.

（実施の形態２）
実施の形態１の式６で説明したフレーム間予測の推定値について、動き検出の処理の一部を行い、その結果からフレーム間予測評価値の推定することで、よりフレーム間予測評価値の推定精度を上げる方法も考えられる。以下、説明する。 (Embodiment 2)
A part of the motion detection process is performed on the estimated value of the inter-frame prediction described in Equation 6 of Embodiment 1, and the estimation of the inter-frame prediction evaluation value is estimated from the result, thereby further estimating the inter-frame prediction evaluation value. A method of increasing the accuracy is also conceivable. This will be described below.

本発明の画像符号化装置は図１０で示す従来の画像符号化装置に対して、画素予測部Predictorのみ異なる。そこで、ここでは画素予測部Predictorについて説明し、その他の部分は従来の画像符号化装置と同じとし、説明を省略する。 The image coding apparatus of the present invention is different from the conventional image coding apparatus shown in FIG. 10 only in the pixel prediction unit Predictor. Therefore, here, the pixel prediction unit Predictor will be described, and the other parts are the same as those of the conventional image encoding device, and description thereof will be omitted.

図３は実施の形態２の画素予測部Predictorを示したブロック図である。図１、図１５と同じ動作をするユニットおよび同じ動作の信号は同じ記号を付し、説明を省略する。フレーム内予測判定部IntraDetは、初期動きベクトルMV1によるフレーム間予測評価値InterCost1とフレーム内予測評価値IntraCostを入力し、所定の条件を満たす場合には動き検出を行わないように動き検出部MEに指示し、スイッチSW2を“０”に切り替える。所定の条件を満たさない場合には、スイッチSW2を“１”に切り替え、従来の画像符号化装置と同じ処理を行う。初期動きベクトルMV1によるフレーム間予測評価値InterCost1は、実施の形態１におけるフレーム間予測評価値推定値EInterCostに相当する。 FIG. 3 is a block diagram illustrating the pixel predictor Predictor according to the second embodiment. Units that operate in the same manner as in FIG. 1 and FIG. The intra-frame prediction determination unit IntraDet inputs the inter-frame prediction evaluation value InterCost1 and the intra-frame prediction evaluation value IntraCost based on the initial motion vector MV1, and when the predetermined condition is satisfied, the motion detection unit ME does not perform motion detection. And switch SW2 to “0”. When the predetermined condition is not satisfied, the switch SW2 is switched to “1” and the same processing as that of the conventional image encoding device is performed. The inter-frame prediction evaluation value InterCost1 based on the initial motion vector MV1 corresponds to the inter-frame prediction evaluation value estimated value EInterCost in the first embodiment.

図４は図３の画素予測部Predictorの処理フローを示した図である。図１６と同じ処理ステップは同じ記号を付し、説明を省略する。S12のフレーム内予測評価値計算とS16のフレーム間予測評価値推定をS15での局所探索の前に行い、S17のフレーム内予測判定で所定の条件を満たす場合には、そこで予測モードをフレーム内予測と判定し終了する。ここでは、S14、S12、S16の順で処理を行っているが、これらの処理の順序は変えても構わない。 FIG. 4 is a diagram showing a processing flow of the pixel prediction unit Predictor of FIG. The same processing steps as those in FIG. 16 are denoted by the same symbols, and description thereof is omitted. When the intraframe prediction evaluation value calculation in S12 and the interframe prediction evaluation value estimation in S16 are performed before the local search in S15, and the predetermined condition is satisfied in the intraframe prediction determination in S17, the prediction mode is set in the frame It is determined as prediction and the process ends. Here, the processes are performed in the order of S14, S12, and S16, but the order of these processes may be changed.

本発明の所定の条件は、実施の形態１と同様に式６で判定できる。最終的にフレーム内予測と判定される場合には、探索開始点の位置に関わらず常に式６はフレーム内予測となるはずである。従って、問題は、最終的にフレーム間予測と判定されるのに関わらず、式６でフレーム内予測と判定される場合である。ただし、最終的にフレーム間予測と判定される場合には、初期動きベクトル時点でフレーム間予測評価値interCost1はフレーム内予測評価値IntraCostに対して十分に小さな値をとることが期待できる。そのため、前記のような問題は殆ど発生せず、従って、従来の画像符号化装置に対する符号化効率劣化は殆どない。 The predetermined condition of the present invention can be determined by Expression 6 as in the first embodiment. When it is finally determined to be intraframe prediction, Equation 6 should always be intraframe prediction regardless of the position of the search start point. Therefore, the problem is when the intra-frame prediction is determined by Equation 6 regardless of the final determination of inter-frame prediction. However, when it is finally determined to be inter-frame prediction, it can be expected that the inter-frame prediction evaluation value interCost1 takes a sufficiently small value with respect to the intra-frame prediction evaluation value IntraCost at the time of the initial motion vector. For this reason, the above-described problems hardly occur, and therefore there is almost no deterioration in encoding efficiency with respect to the conventional image encoding apparatus.

なお、実施の形態１の判定と実施の形態２の判定を組み合わせてもよい。例えば、両方の判定式がフレーム内予測を示す場合のみフレーム内予測としてもよい。 Note that the determination in Embodiment 1 and the determination in Embodiment 2 may be combined. For example, the intra-frame prediction may be performed only when both determination formulas indicate intra-frame prediction.

（実施の形態３）
さらに、上記各実施の形態で示した画像符号化方法の構成を実現するためのプログラムを、フレキシブルディスク等の記憶媒体に記録するようにすることにより、上記各実施の形態で示した処理を、独立したコンピュータシステムにおいて簡単に実施することが可能となる。 (Embodiment 3)
Furthermore, by recording a program for realizing the configuration of the image encoding method shown in each of the above embodiments on a storage medium such as a flexible disk, the processing shown in each of the above embodiments is performed. It can be easily implemented in an independent computer system.

図５は実施の形態１、２の画像符号化方法をコンピュータシステムにより実現するためのプログラムを格納するための記憶媒体についての説明図である。 FIG. 5 is an explanatory diagram of a storage medium for storing a program for realizing the image coding methods of the first and second embodiments by a computer system.

図５（ｂ）は、フレキシブルディスクの正面からみた外観、断面構造、及びフレキシブルディスクを示し、図５（ａ）は、記録媒体本体であるフレキシブルディスクの物理フォーマットの例を示している。フレキシブルディスクＦＤはケースＦ内に内蔵され、該ディスクの表面には、同心円状に外周からは内周に向かって複数のトラックＴｒが形成され、各トラックは角度方向に１６のセクタＳｅに分割されている。従って、上記プログラムを格納したフレキシブルディスクでは、上記フレキシブルディスクＦＤ上に割り当てられた領域に、上記プログラムとしての画像符号化方法が記録されている。 FIG. 5B shows an appearance, a cross-sectional structure, and a flexible disk as viewed from the front of the flexible disk, and FIG. 5A shows an example of a physical format of the flexible disk that is a recording medium body. The flexible disk FD is built in the case F, and on the surface of the disk, a plurality of tracks Tr are formed concentrically from the outer periphery toward the inner periphery, and each track is divided into 16 sectors Se in the angular direction. ing. Therefore, in the flexible disk storing the program, an image encoding method as the program is recorded in an area allocated on the flexible disk FD.

また、図５（ｃ）は、フレキシブルディスクＦＤに上記プログラムの記録再生を行うための構成を示す。上記プログラムをフレキシブルディスクＦＤに記録する場合は、コンピュータシステムＣｓから上記プログラムとしての画像符号化方法をフレキシブルディスクドライブＦＤＤを介して書き込む。また、フレキシブルディスク内のプログラムにより上記画像符号化方法をコンピュータシステム中に構築する場合は、フレキシブルディスクドライブによりプログラムをフレキシブルディスクから読み出し、コンピュータシステムに転送する。 FIG. 5C shows a configuration for recording and reproducing the program on the flexible disk FD. When recording the program on the flexible disk FD, the image encoding method as the program is written from the computer system Cs via the flexible disk drive FDD. When the image encoding method is built in a computer system by a program in a flexible disk, the program is read from the flexible disk by a flexible disk drive and transferred to the computer system.

なお、上記説明では、記録媒体としてフレキシブルディスクを用いて説明を行ったが、光ディスクを用いても同様に行うことができる。また、記録媒体はこれに限らず、ＩＣカード、ＲＯＭカセット等、プログラムを記録できるものであれば同様に実施することができる。 In the above description, a flexible disk is used as the recording medium, but the same can be done using an optical disk. Further, the recording medium is not limited to this, and any recording medium such as an IC card or a ROM cassette capable of recording a program can be similarly implemented.

（実施の形態４）
本発明の実施の形態４として、上記で説明した動画像符号化方法、動画像符号化装置および動画像符号化プログラムの応用例とそれを用いたシステムを図６〜図９を用いて説明する。 (Embodiment 4)
As a fourth embodiment of the present invention, application examples of the moving picture coding method, the moving picture coding apparatus, and the moving picture coding program described above and a system using the same will be described with reference to FIGS. .

図６は、コンテンツ配信サービスを実現するコンテンツ供給システムｅｘ１００の全体構成を示すブロック図である。通信サービスの提供エリアを所望の大きさに分割し、各セル内にそれぞれ固定無線局である基地局ｅｘ１０７〜ｅｘ１１０が設置されている。 FIG. 6 is a block diagram showing an overall configuration of a content supply system ex100 that implements a content distribution service. The communication service providing area is divided into desired sizes, and base stations ex107 to ex110, which are fixed wireless stations, are installed in each cell.

このコンテンツ供給システムｅｘ１００は、例えば、インターネットｅｘ１０１にインターネットサービスプロバイダｅｘ１０２および電話網ｅｘ１０４、および基地局ｅｘ１０７〜ｅｘ１１０を介して、コンピュータｅｘ１１１、ＰＤＡ（personal digital assistant）ｅｘ１１２、カメラｅｘ１１３、携帯電話ｅｘ１１４、カメラ付きの携帯電話ｅｘ１１５などの各機器が接続される。 The content supply system ex100 includes, for example, a computer ex111, a PDA (personal digital assistant) ex112, a camera ex113, a mobile phone ex114, a camera via the Internet ex101, the Internet service provider ex102, the telephone network ex104, and the base stations ex107 to ex110. Each device such as the attached mobile phone ex115 is connected.

しかし、コンテンツ供給システムｅｘ１００は図６のような組合せに限定されず、いずれかを組み合わせて接続するようにしてもよい。また、固定無線局である基地局ｅｘ１０７〜ｅｘ１１０を介さずに、各機器が電話網ｅｘ１０４に直接接続されてもよい。 However, the content supply system ex100 is not limited to the combination as shown in FIG. 6 and may be connected in combination. Also, each device may be directly connected to the telephone network ex104 without going through the base stations ex107 to ex110 which are fixed wireless stations.

カメラｅｘ１１３はデジタルビデオカメラ等の動画撮影が可能な機器である。また、携帯電話は、ＰＤＣ（Personal Digital Communications）方式、ＣＤＭＡ（Code Division Multiple Access）方式、Ｗ−ＣＤＭＡ（Wideband-Code Division Multiple Access）方式、若しくはＧＳＭ（Global System for Mobile Communications）方式の携帯電話機、またはＰＨＳ（Personal Handyphone System）等であり、いずれでも構わない。 The camera ex113 is a device capable of shooting a moving image such as a digital video camera. The mobile phone is a PDC (Personal Digital Communications) system, a CDMA (Code Division Multiple Access) system, a W-CDMA (Wideband-Code Division Multiple Access) system, or a GSM (Global System for Mobile Communications) system mobile phone, Alternatively, PHS (Personal Handyphone System) or the like may be used.

また、ストリーミングサーバｅｘ１０３は、カメラｅｘ１１３から基地局ｅｘ１０９、電話網ｅｘ１０４を通じて接続されており、カメラｅｘ１１３を用いてユーザが送信する符号化処理されたデータに基づいたライブ配信等が可能になる。撮影したデータの符号化処理はカメラｅｘ１１３で行っても、データの送信処理をするサーバ等で行ってもよい。また、カメラｅｘ１１６で撮影した動画データはコンピュータｅｘ１１１を介してストリーミングサーバｅｘ１０３に送信されてもよい。カメラｅｘ１１６はデジタルカメラ等の静止画、動画が撮影可能な機器である。この場合、動画データの符号化はカメラｅｘ１１６で行ってもコンピュータｅｘ１１１で行ってもどちらでもよい。また、符号化処理はコンピュータｅｘ１１１やカメラｅｘ１１６が有するＬＳＩｅｘ１１７において処理することになる。なお、画像符号化・復号化用のソフトウェアをコンピュータｅｘ１１１等で読み取り可能な記録媒体である何らかの蓄積メディア（ＣＤ−ＲＯＭ、フレキシブルディスク、ハードディスクなど）に組み込んでもよい。さらに、カメラ付きの携帯電話ｅｘ１１５で動画データを送信してもよい。このときの動画データは携帯電話ｅｘ１１５が有するＬＳＩで符号化処理されたデータである。 In addition, the streaming server ex103 is connected from the camera ex113 through the base station ex109 and the telephone network ex104, and live distribution or the like based on the encoded data transmitted by the user using the camera ex113 becomes possible. The encoded processing of the captured data may be performed by the camera ex113, or may be performed by a server or the like that performs data transmission processing. The moving image data shot by the camera ex116 may be transmitted to the streaming server ex103 via the computer ex111. The camera ex116 is a device such as a digital camera that can shoot still images and moving images. In this case, the moving image data may be encoded by the camera ex116 or the computer ex111. The encoding process is performed in the LSI ex117 included in the computer ex111 and the camera ex116. Note that image encoding / decoding software may be incorporated into any storage medium (CD-ROM, flexible disk, hard disk, or the like) that is a recording medium readable by the computer ex111 or the like. Furthermore, you may transmit moving image data with the mobile phone ex115 with a camera. The moving image data at this time is data encoded by the LSI included in the mobile phone ex115.

このコンテンツ供給システムｅｘ１００では、ユーザがカメラｅｘ１１３、カメラｅｘ１１６等で撮影しているコンテンツ（例えば、音楽ライブを撮影した映像等）を符号化処理してストリーミングサーバｅｘ１０３に送信する一方で、ストリーミングサーバｅｘ１０３は要求のあったクライアントに対して上記コンテンツデータをストリーム配信する。クライアントとしては、符号化処理されたデータを復号化することが可能な、コンピュータｅｘ１１１、ＰＤＡｅｘ１１２、カメラｅｘ１１３、携帯電話ｅｘ１１４等がある。このようにすることでコンテンツ供給システムｅｘ１００は、符号化されたデータをクライアントにおいて受信して再生することができ、さらにクライアントにおいてリアルタイムで受信して復号化し、再生することにより、個人放送をも実現可能になるシステムである。 In the content supply system ex100, the content (for example, a video image of music live) captured by the user with the camera ex113, the camera ex116, and the like is encoded and transmitted to the streaming server ex103, while the streaming server ex103. Distributes the content data to the requested client. Examples of the client include a computer ex111, a PDA ex112, a camera ex113, and a mobile phone ex114 that can decode the encoded data. In this way, the content supply system ex100 can receive and play back the encoded data at the client, and can also receive a private broadcast by receiving, decoding, and playing back at the client in real time. It is a system that becomes possible.

コンテンツの符号化処理あるいは復号化処理に際して、上記実施形態で説明した動画像符号化方法、動画像符号化装置および動画像符号化プログラムを用いてもよい。例えば、コンピュータｅｘ１１１、ＰＤＡｅｘ１１２、カメラｅｘ１１３、携帯電話ｅｘ１１４等は、上記実施形態で示した動画像符号化装置を備え、動画像符号化方法、動画像符号化プログラムを実現できるものであってよい。 In the content encoding process or decoding process, the moving picture encoding method, the moving picture encoding apparatus, and the moving picture encoding program described in the above embodiment may be used. For example, the computer ex111, the PDA ex112, the camera ex113, the mobile phone ex114, and the like may be provided with the moving picture coding apparatus described in the above embodiment and realize the moving picture coding method and the moving picture coding program.

一例として携帯電話について説明する。
図７は、上記実施形態の動画像符号化方法、動画像符号化装置および動画像符号化プログラムを用いた携帯電話ｅｘ１１５を示す図である。携帯電話ｅｘ１１５は、基地局ｅｘ１１０との間で電波を送受信するためのアンテナｅｘ２０１、ＣＣＤカメラ等の映像、静止画を撮ることが可能なカメラ部ｅｘ２０３、カメラ部ｅｘ２０３で撮影した映像、アンテナｅｘ２０１で受信した映像等が復号化されたデータを表示する液晶ディスプレイ等の表示部ｅｘ２０２、操作キーｅｘ２０４群から構成される本体部、音声出力をするためのスピーカ等の音声出力部ｅｘ２０８、音声入力をするためのマイク等の音声入力部ｅｘ２０５、撮影した動画もしくは静止画のデータ、受信したメールのデータ、動画のデータもしくは静止画のデータ等、符号化されたデータまたは復号化されたデータを保存するための記録メディアｅｘ２０７、携帯電話ｅｘ１１５に記録メディアｅｘ２０７を装着可能とするためのスロット部ｅｘ２０６を有している。記録メディアｅｘ２０７はＳＤカード等のプラスチックケース内に電気的に書換えや消去が可能な不揮発性メモリであるＥＥＰＲＯＭ（Electrically Erasable and Programmable Read Only Memory）の一種であるフラッシュメモリ素子を格納したものである。 A mobile phone will be described as an example.
FIG. 7 is a diagram showing a mobile phone ex115 using the moving picture coding method, the moving picture coding apparatus, and the moving picture coding program according to the embodiment. The mobile phone ex115 includes an antenna ex201 for transmitting and receiving radio waves to and from the base station ex110, a video from a CCD camera, a camera unit ex203 capable of taking a still image, a video shot by the camera unit ex203, and an antenna ex201. A display unit ex202 such as a liquid crystal display that displays data obtained by decoding received video and the like, a main body unit composed of a group of operation keys ex204, an audio output unit ex208 such as a speaker for audio output, and audio input To store encoded data or decoded data such as a voice input unit ex205 such as a microphone, captured video or still image data, received mail data, video data or still image data, etc. Recording media ex207 and mobile phone ex115 with recording media ex207 And a slot portion ex206 to ability. The recording medium ex207 stores a flash memory element, which is a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory), which is a nonvolatile memory that can be electrically rewritten and erased, in a plastic case such as an SD card.

さらに、携帯電話ｅｘ１１５について図８を用いて説明する。携帯電話ｅｘ１１５は表示部ｅｘ２０２および操作キーｅｘ２０４を備えた本体部の各部を統括的に制御するようになされた主制御部ｅｘ３１１に対して、電源回路部ｅｘ３１０、操作入力制御部ｅｘ３０４、画像符号化部ｅｘ３１２、カメラインターフェース部ｅｘ３０３、ＬＣＤ（Liquid Crystal Display）制御部ｅｘ３０２、画像復号化部ｅｘ３０９、多重分離部ｅｘ３０８、記録再生部ｅｘ３０７、変復調回路部ｅｘ３０６および音声処理部ｅｘ３０５が同期バスｅｘ３１３を介して互いに接続されている。 Further, the cellular phone ex115 will be described with reference to FIG. The mobile phone ex115 controls the main control unit ex311 which controls the respective units of the main body unit including the display unit ex202 and the operation key ex204. The power supply circuit unit ex310, the operation input control unit ex304, and the image encoding Unit ex312, camera interface unit ex303, LCD (Liquid Crystal Display) control unit ex302, image decoding unit ex309, demultiplexing unit ex308, recording / reproducing unit ex307, modulation / demodulation circuit unit ex306, and audio processing unit ex305 via a synchronization bus ex313 Are connected to each other.

電源回路部ｅｘ３１０は、ユーザの操作により終話および電源キーがオン状態にされると、バッテリパックから各部に対して電力を供給することによりカメラ付ディジタル携帯電話ｅｘ１１５を動作可能な状態に起動する。 When the end of call and the power key are turned on by a user operation, the power supply circuit ex310 starts up the camera-equipped digital mobile phone ex115 in an operable state by supplying power from the battery pack to each unit. .

携帯電話ｅｘ１１５は、ＣＰＵ、ＲＯＭおよびＲＡＭ等でなる主制御部ｅｘ３１１の制御に基づいて、音声通話モード時に音声入力部ｅｘ２０５で集音した音声信号を音声処理部ｅｘ３０５によってディジタル音声データに変換し、これを変復調回路部ｅｘ３０６でスペクトラム拡散処理し、送受信回路部ｅｘ３０１でディジタルアナログ変換処理および周波数変換処理を施した後にアンテナｅｘ２０１を介して送信する。また携帯電話ｅｘ１１５は、音声通話モード時にアンテナｅｘ２０１で受信した受信信号を増幅して周波数変換処理およびアナログディジタル変換処理を施し、変復調回路部ｅｘ３０６でスペクトラム逆拡散処理し、音声処理部ｅｘ３０５によってアナログ音声信号に変換した後、これを音声出力部ｅｘ２０８を介して出力する。 The cellular phone ex115 converts the voice signal collected by the voice input unit ex205 in the voice call mode into digital voice data by the voice processing unit ex305 based on the control of the main control unit ex311 including a CPU, a ROM, a RAM, and the like. The modulation / demodulation circuit unit ex306 performs spread spectrum processing, the transmission / reception circuit unit ex301 performs digital analog conversion processing and frequency conversion processing, and then transmits the result via the antenna ex201. In addition, the cellular phone ex115 amplifies the received signal received by the antenna ex201 in the voice call mode, performs frequency conversion processing and analog-digital conversion processing, performs spectrum despreading processing by the modulation / demodulation circuit unit ex306, and analog audio by the voice processing unit ex305 After conversion into a signal, this is output via the audio output unit ex208.

さらに、データ通信モード時に電子メールを送信する場合、本体部の操作キーｅｘ２０４の操作によって入力された電子メールのテキストデータは操作入力制御部ｅｘ３０４を介して主制御部ｅｘ３１１に送出される。主制御部ｅｘ３１１は、テキストデータを変復調回路部ｅｘ３０６でスペクトラム拡散処理し、送受信回路部ｅｘ３０１でディジタルアナログ変換処理および周波数変換処理を施した後にアンテナｅｘ２０１を介して基地局ｅｘ１１０へ送信する。 Further, when an e-mail is transmitted in the data communication mode, text data of the e-mail input by operating the operation key ex204 of the main body is sent to the main control unit ex311 via the operation input control unit ex304. The main control unit ex311 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex306, performs digital analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex301, and then transmits the text data to the base station ex110 via the antenna ex201.

データ通信モード時に画像データを送信する場合、カメラ部ｅｘ２０３で撮像された画像データをカメラインターフェース部ｅｘ３０３を介して画像符号化部ｅｘ３１２に供給する。また、画像データを送信しない場合には、カメラ部ｅｘ２０３で撮像した画像データをカメラインターフェース部ｅｘ３０３およびＬＣＤ制御部ｅｘ３０２を介して表示部ｅｘ２０２に直接表示することも可能である。 When transmitting image data in the data communication mode, the image data captured by the camera unit ex203 is supplied to the image encoding unit ex312 via the camera interface unit ex303. When image data is not transmitted, the image data captured by the camera unit ex203 can be directly displayed on the display unit ex202 via the camera interface unit ex303 and the LCD control unit ex302.

画像符号化部ｅｘ３１２は、カメラ部ｅｘ２０３から供給された画像データを圧縮符号化することにより符号化画像データに変換し、これを多重分離部ｅｘ３０８に送出する。また、このとき同時に携帯電話ｅｘ１１５は、カメラ部ｅｘ２０３で撮像中に音声入力部ｅｘ２０５で集音した音声を音声処理部ｅｘ３０５を介してディジタルの音声データとして多重分離部ｅｘ３０８に送出する。 The image encoding unit ex312 converts the image data supplied from the camera unit ex203 into encoded image data by compression encoding, and sends this to the demultiplexing unit ex308. At the same time, the cellular phone ex115 sends the sound collected by the voice input unit ex205 during imaging by the camera unit ex203 to the demultiplexing unit ex308 via the voice processing unit ex305 as digital voice data.

多重分離部ｅｘ３０８は、画像符号化部ｅｘ３１２から供給された符号化画像データと音声処理部ｅｘ３０５から供給された音声データとを所定の方式で多重化し、その結果得られる多重化データを変復調回路部ｅｘ３０６でスペクトラム拡散処理し、送受信回路部ｅｘ３０１でディジタルアナログ変換処理および周波数変換処理を施した後にアンテナｅｘ２０１を介して送信する。 The demultiplexing unit ex308 multiplexes the encoded image data supplied from the image encoding unit ex312 and the audio data supplied from the audio processing unit ex305 by a predetermined method, and the multiplexed data obtained as a result is a modulation / demodulation circuit unit Spread spectrum processing is performed in ex306, digital analog conversion processing and frequency conversion processing are performed in the transmission / reception circuit unit ex301, and then transmitted through the antenna ex201.

データ通信モード時にホームページ等にリンクされた動画像ファイルのデータを受信する場合、アンテナｅｘ２０１を介して基地局ｅｘ１１０から受信した受信信号を変復調回路部ｅｘ３０６でスペクトラム逆拡散処理し、その結果得られる多重化データを多重分離部ｅｘ３０８に送出する。 When receiving data of a moving image file linked to a homepage or the like in the data communication mode, the received signal received from the base station ex110 via the antenna ex201 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex306, and the resulting multiplexing is obtained. Data is sent to the demultiplexing unit ex308.

また、アンテナｅｘ２０１を介して受信された多重化データを復号化するには、多重分離部ｅｘ３０８は、多重化データを分離することにより画像データの符号化ビットストリームと音声データの符号化ビットストリームとに分け、同期バスｅｘ３１３を介して当該符号化画像データを画像復号化部ｅｘ３０９に供給すると共に当該音声データを音声処理部ｅｘ３０５に供給する。 In addition, in order to decode the multiplexed data received via the antenna ex201, the demultiplexing unit ex308 separates the multiplexed data to generate an encoded bitstream of image data and an encoded bitstream of audio data. The encoded image data is supplied to the image decoding unit ex309 via the synchronization bus ex313, and the audio data is supplied to the audio processing unit ex305.

次に、画像復号化部ｅｘ３０９は、画像データの符号化ビットストリームを復号することにより再生動画像データを生成し、これをＬＣＤ制御部ｅｘ３０２を介して表示部ｅｘ２０２に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まれる動画データが表示される。このとき同時に音声処理部ｅｘ３０５は、音声データをアナログ音声信号に変換した後、これを音声出力部ｅｘ２０８に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まる音声データが再生される。 Next, the image decoding unit ex309 generates reproduction moving image data by decoding the encoded bit stream of the image data, and supplies this to the display unit ex202 via the LCD control unit ex302. The moving image data included in the moving image file linked to the home page is displayed. At the same time, the audio processing unit ex305 converts the audio data into an analog audio signal, and then supplies the analog audio signal to the audio output unit ex208. Thus, for example, the audio data included in the moving image file linked to the home page is reproduced. The

以上の構成において、画像符号化部ｅｘ３１２は、上記実施形態の動画像符号化装置を備えている。 In the above configuration, the image encoding unit ex312 includes the moving image encoding apparatus of the above embodiment.

なお、上記システムの例に限られず、最近は衛星、地上波によるディジタル放送が話題となっており、図９に示すようにディジタル放送用システムにも上記実施形態で説明した動画像符号化方法、動画像符号化装置および動画像符号化プログラムを組み込むことができる。具体的には、放送局ｅｘ４０９では映像情報の符号化ビットストリームが電波を介して通信または放送衛星ｅｘ４１０に伝送される。これを受けた放送衛星ｅｘ４１０は、放送用の電波を発信し、この電波を衛星放送受信設備をもつ家庭のアンテナｅｘ４０６で受信し、テレビ（受信機）ｅｘ４０１またはセットトップボックス（ＳＴＢ）ｅｘ４０７などの装置により符号化ビットストリームを復号化してこれを再生する。ここで、テレビ（受信機）ｅｘ４０１またはセットトップボックス（ＳＴＢ）ｅｘ４０７などの装置が上記実施形態で説明した動画像符号化装置を備えていてもよい。また、上記実施形態の動画像符号化方法を用いるものであってもよい。さらに、動画像符号化プログラムを備えていてもよい。また、記録媒体であるＣＤやＤＶＤ等の蓄積メディアｅｘ４０２に記録した符号化ビットストリームを読み取り、復号化する再生装置ｅｘ４０３にも上記実施形態で説明した動画像符号化方法、動画像符号化装置および動画像符号化プログラムを実装することが可能である。この場合、再生された映像信号はモニタｅｘ４０４に表示される。また、ケーブルテレビ用のケーブルｅｘ４０５または衛星／地上波放送のアンテナｅｘ４０６に接続されたセットトップボックスｅｘ４０７内に上記実施形態で説明した動画像符号化方法、動画像符号化装置および動画像符号化プログラムを実装し、これをテレビのモニタｅｘ４０８で再生する構成も考えられる。このときセットトップボックスではなく、テレビ内に上記実施形態で説明した動画像符号化装置を組み込んでも良い。また、アンテナｅｘ４１１を有する車ｅｘ４１２で衛星ｅｘ４１０からまたは基地局ｅｘ１０７等から信号を受信し、車ｅｘ４１２が有するカーナビゲーションｅｘ４１３等の表示装置に動画を再生することも可能である。 It should be noted that the present invention is not limited to the above system, and recently, digital broadcasting by satellite and terrestrial has become a hot topic, and as shown in FIG. A moving picture coding apparatus and a moving picture coding program can be incorporated. Specifically, in the broadcasting station ex409, a coded bit stream of video information is transmitted to a communication or broadcasting satellite ex410 via radio waves. Receiving this, the broadcasting satellite ex410 transmits a radio wave for broadcasting, and receives the radio wave with a home antenna ex406 having a satellite broadcasting receiving facility, such as a television (receiver) ex401 or a set top box (STB) ex407. The device decodes the encoded bit stream and reproduces it. Here, a device such as the television (receiver) ex401 or the set-top box (STB) ex407 may include the moving image encoding device described in the above embodiment. Moreover, you may use the moving image encoding method of the said embodiment. Furthermore, a moving image encoding program may be provided. The playback apparatus ex403 that reads and decodes the encoded bitstream recorded on the storage medium ex402 such as a CD or DVD that is a recording medium also includes the moving picture encoding method, the moving picture encoding apparatus, and the like described in the above embodiment. It is possible to implement a moving image encoding program. In this case, the reproduced video signal is displayed on the monitor ex404. In addition, the moving picture coding method, the moving picture coding apparatus, and the moving picture coding program described in the above embodiment are included in the set-top box ex407 connected to the cable ex405 for cable television or the antenna ex406 for satellite / terrestrial broadcasting. A configuration is also possible in which the above is implemented and this is played back on a television monitor ex408. At this time, the moving picture coding apparatus described in the above embodiment may be incorporated in the television instead of the set top box. It is also possible to receive a signal from the satellite ex410 or the base station ex107 by the car ex412 having the antenna ex411 and reproduce the moving image on a display device such as the car navigation ex413 that the car ex412 has.

更に、画像信号を符号化し、記録媒体に記録することもできる。具体例としては、ＤＶＤディスクｅｘ４２１に画像信号を記録するＤＶＤレコーダや、ハードディスクに記録するディスクレコーダなどのレコーダｅｘ４２０がある。更にＳＤカードｅｘ４２２に記録することもできる。レコーダｅｘ４２０が上記実施形態の復号化装置を備えていれば、ＤＶＤディスクｅｘ４２１やＳＤカードｅｘ４２２に記録した画像信号を補間して再生し、モニタｅｘ４０８に表示することができる。 Furthermore, an image signal can be encoded and recorded on a recording medium. Specific examples include a recorder ex420 such as a DVD recorder that records image signals on a DVD disk ex421 and a disk recorder that records images on a hard disk. Further, it can be recorded on the SD card ex422. If the recorder ex420 includes the decoding device of the above embodiment, the image signal recorded on the DVD disc ex421 or the SD card ex422 can be interpolated and reproduced and displayed on the monitor ex408.

なお、カーナビゲーションｅｘ４１３の構成は例えば図８に示す構成のうち、カメラ部ｅｘ２０３とカメラインターフェース部ｅｘ３０３、画像符号化部ｅｘ３１２を除いた構成が考えられ、同様なことがコンピュータｅｘ１１１やテレビ（受信機）ｅｘ４０１等でも考えられる。 For example, the configuration of the car navigation ex413 may be a configuration excluding the camera unit ex203, the camera interface unit ex303, and the image encoding unit ex312 in the configuration illustrated in FIG. 8, and the same applies to the computer ex111 and the television (receiver). ) Ex401 can also be considered.

また、上記携帯電話ｅｘ１１４等の端末は、符号化器・復号化器を両方持つ送受信型の端末の他に、符号化器のみの送信端末、復号化器のみの受信端末の３通りの実装形式が考えられる。 In addition to the transmission / reception type terminal having both the encoder and the decoder, the terminal such as the mobile phone ex114 has three mounting formats, that is, a transmitting terminal having only an encoder and a receiving terminal having only a decoder. Can be considered.

このように、上記実施形態で説明した動画像符号化方法、動画像符号化装置および動画像符号化プログラムを上述したいずれの機器・システムに用いることは可能であり、上記実施形態で説明した効果を得ることができる。 As described above, the moving picture coding method, the moving picture coding apparatus, and the moving picture coding program described in the above embodiment can be used in any of the devices and systems described above, and the effects described in the above embodiment. Can be obtained.

本発明にかかる装置は、フレーム間予測の処理量を軽減でき、結果として画像符号化装置の処理量を軽減できる効果を有し、例えば、動画像符号化装置等に有用である。 The device according to the present invention can reduce the processing amount of inter-frame prediction, and as a result, can reduce the processing amount of the image encoding device, and is useful for, for example, a moving image encoding device.

実施の形態１の画素予測部のブロック図Block diagram of the pixel prediction unit of the first embodiment 実施の形態１の画素予測部の処理フローProcess Flow of Pixel Prediction Unit of Embodiment 1 実施の形態２の画素予測部のブロック図Block diagram of a pixel prediction unit according to the second embodiment 実施の形態２の画素予測部の処理フローProcess Flow of Pixel Prediction Unit of Embodiment 2 実施の形態１〜実施の形態２の画像符号化方法をコンピュータシステムにより実現するためのプログラムを格納するための記憶媒体についての説明図Explanatory drawing about the storage medium for storing the program for implement | achieving the image coding method of Embodiment 1- Embodiment 2 by a computer system コンテンツ供給システムの全体構成について説明するブロック図Block diagram explaining the entire configuration of the content supply system 本発明の動画像符号化装置を搭載する携帯電話の例Example of a mobile phone equipped with the moving picture encoding apparatus of the present invention 携帯電話の構成について説明するブロック図Block diagram explaining the structure of a mobile phone ディジタル放送用システムの例Example of digital broadcasting system 従来の画像符号化装置のブロック図Block diagram of a conventional image encoding device 従来の画素予測部のブロック図Block diagram of a conventional pixel prediction unit 従来の画素予測部の処理フローProcessing flow of conventional pixel prediction unit 画像復号装置のブロック図Block diagram of image decoding device 局所探索の概念図Conceptual diagram of local search 局所探索を行う従来の画像符号化装置内の画素予測部のブロック図Block diagram of a pixel prediction unit in a conventional image encoding device that performs local search 局所探索を行う従来の画素予測部の処理フローProcessing flow of conventional pixel prediction unit that performs local search

Explanation of symbols

Img 画像信号
BitStrm 符号化信号
Res 残差信号
ERes 残差符号化信号
DRes 残差復号信号
DecImg 復号画像信号
Ref 参照画像信号
Pred 予測画像信号
MV 動きベクトル
Mode 予測モード情報
InterCost フレーム間予測評価値
IntraCost フレーム内予測評価値
MCPred フレーム間予測画像信号
IPred フレーム内予測画像信号
MV1 初期動きベクトル
InterCost1 初期動きベクトルのフレーム間予測評価値
EInterCost フレーム間予測評価値推定値
Sub 減算器
Add 加算器
ResEnc 残差符号化部
ResDec 残差復号部
FM フレームメモリ
Predictor 画素予測部
VLC 可変長符号化部
SW1,SW2 スイッチ
ME 動き検出部
IntraCostCalc フレーム内予測評価値計算部
ModeSel モード判定部
MC 動き補償部
IntraPredictor フレーム内予測部
VLD 可変長復号部
ME1 探索開始点選択部
ME2 局所探索部
IntraDet フレーム内予測判定部
InterCostEst フレーム間予測評価値推定部
Cs コンピュータ・システム
FD フレキシブルディスク
FDD フレキシブルディスクドライブ Img image signal
BitStrm encoded signal
Res residual signal
ERes residual encoded signal
DRes residual decoded signal
DecImg Decoded image signal
Ref reference image signal
Pred Predictive image signal
MV motion vector
Mode Prediction mode information
InterCost interframe prediction evaluation value
IntraCost Intra-frame prediction evaluation value
MCPred interframe prediction image signal
IPred intra-frame prediction image signal
MV1 initial motion vector
InterCost1 Initial motion vector interframe prediction evaluation value
EInterCost Interframe prediction evaluation value estimate
Sub subtractor
Add Adder
ResEnc Residual encoder
ResDec residual decoder
FM frame memory
Predictor pixel prediction unit
VLC variable length encoder
SW1, SW2 switch
ME motion detector
IntraCostCalc Intra-frame prediction evaluation value calculator
ModeSel Mode judgment part
MC motion compensation unit
IntraPredictor Intraframe prediction unit
VLD variable length decoder
ME1 Search start point selection section
ME2 Local search unit
IntraDet Intraframe prediction determination unit
InterCostEst inter-frame prediction evaluation value estimation unit
Cs computer system
FD flexible disk
FDD flexible disk drive

Claims

In a video encoding method for selecting one prediction mode from two or more types of prediction modes and encoding type information indicating the selected one prediction mode,
If one or more of the two or more types of prediction modes are set as the first prediction mode and a mode other than the first prediction mode is set as the second prediction mode, the first prediction mode is set before the evaluation value calculation in the first prediction mode is completed. When the estimated value of the evaluation value and the evaluation value of the second prediction mode are calculated, and the estimated value of the evaluation value of the first prediction mode and the evaluation value of the second prediction mode satisfy a predetermined condition The moving picture coding method characterized in that the evaluation value calculation in the first prediction mode is terminated and the second prediction mode is selected as the one prediction mode.

The moving image encoding method according to claim 1, wherein an estimated value of the evaluation value of the first prediction mode and an evaluation value of the second prediction mode are calculated before the evaluation value calculation of the first prediction mode is started, When the estimated value of the evaluation value of the first prediction mode and the evaluation value of the second prediction mode satisfy a predetermined condition, the evaluation value calculation of the first prediction mode is terminated, and the second prediction mode is changed to the second prediction mode. A moving picture encoding method, characterized by selecting as one prediction mode.

The video encoding method according to claim 1, wherein the evaluation value calculation in the first prediction mode is interrupted halfway, the evaluation value in the first prediction mode is estimated from the calculation intermediate result, and the evaluation value in the first prediction mode is determined. If the estimated value and the evaluation value of the second prediction mode satisfy a predetermined condition, the evaluation value calculation of the first prediction mode is terminated and the second prediction mode is selected as the one prediction mode. A video encoding method characterized by the above.

4. The moving image encoding method according to claim 1, wherein a prediction mode indicating inter-frame prediction is the first prediction mode, and a prediction mode indicating intra-frame prediction is the second prediction mode. Encoding method.

Motion detection means for calculating an inter-frame prediction evaluation value, intra-frame prediction evaluation value calculation means for calculating an intra-frame prediction evaluation value, inter-frame prediction and intra-frame prediction based on the inter-frame prediction evaluation value and the intra-frame prediction evaluation value A mode determination unit that selects one of predictions as a prediction mode; and a variable-length encoding unit that multiplexes prediction mode information indicating the prediction mode selected by the mode determination unit and outputs the multiplexed signal as an encoded signal. In the image encoding device,
Inter-frame prediction evaluation value estimation means for calculating an estimated value of the inter-frame prediction evaluation value before the calculation of the inter-frame prediction evaluation value, and the intra-frame prediction evaluation value and the estimated value of the inter-frame prediction evaluation value An intra-frame prediction determining unit that selects intra-frame prediction as a prediction mode when the predetermined condition is satisfied, and instructs the end of calculation of the inter-frame prediction evaluation value of the motion detection unit; A moving image encoding device.

A program for selecting a prediction mode from two or more types of prediction modes by a computer and performing a moving picture encoding method for encoding type information indicating the selected one prediction mode,
If one or more of the two or more types of prediction modes are set as the first prediction mode and a mode other than the first prediction mode is set as the second prediction mode, the first prediction mode is set before the evaluation value calculation in the first prediction mode is completed. When the estimated value of the evaluation value and the evaluation value of the second prediction mode are calculated, and the estimated value of the evaluation value of the first prediction mode and the evaluation value of the second prediction mode satisfy a predetermined condition A moving image characterized by causing a computer to perform a moving image encoding method, wherein the evaluation value calculation in the first prediction mode is terminated and the second prediction mode is selected as the one prediction mode. Encoding program.