JP4676513B2

JP4676513B2 - Encoded picture type determination method, apparatus, program thereof, and recording medium thereof

Info

Publication number: JP4676513B2
Application number: JP2008148104A
Authority: JP
Inventors: 淳清水; 隆一谷田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2008-06-05
Filing date: 2008-06-05
Publication date: 2011-04-27
Anticipated expiration: 2028-06-05
Also published as: JP2009296328A

Description

The present invention relates to a method for determining the encoded picture type in video encoding.

In video encoding schemes such as MPEG-2 and H.264, pictures can be encoded while switching among picture types that differ in the encoding tools they use. These include the I picture, which is encoded entirely within the picture without motion-compensated prediction; the P picture, which performs inter-picture prediction by referring to a temporally preceding encoded picture; and the B picture, which can refer to past or future encoded pictures. These picture types are combined to form a GOP (Group of Pictures), which is used as the unit for random access and bit rate control.

The GOP structure is described by the parameters N and M. In general, N is the encoding interval of I pictures and M is the encoding interval of P pictures. FIG. 8 shows examples of the reference relationships among the picture types. FIGS. 8(A) to 8(D) show structures commonly used in conventional video encoding schemes. FIG. 8(E) shows an example that uses a B picture that can be referenced from other pictures (hereinafter, Bs), as adopted in H.264.
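To make the roles of N and M concrete, the following minimal sketch lists the picture types of one GOP in display order. It assumes the common convention that the GOP starts with the I picture and that every M-th picture after it is a P picture; the function name `gop_display_pattern` is hypothetical and is not taken from this document, and the exact structures of FIG. 8 may differ.

```python
def gop_display_pattern(n: int, m: int) -> list[str]:
    """Return the picture types of one GOP in display order.

    n: encoding interval of I pictures (GOP length).
    m: encoding interval of P pictures.
    Assumption: picture 0 is the I picture and every m-th picture
    after it is a P picture; all remaining pictures are B pictures.
    """
    if n <= 0 or m <= 0:
        raise ValueError("n and m must be positive")
    types = []
    for i in range(n):
        if i == 0:
            types.append("I")
        elif i % m == 0:
            types.append("P")
        else:
            types.append("B")
    return types


if __name__ == "__main__":
    # A shorter P interval (smaller M) shortens the inter-picture distance.
    for m in (1, 2, 3):
        print(m, "".join(gop_display_pattern(12, m)))
```

With M = 1 no B pictures appear, which corresponds to the shortest possible reference distance.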

The optimal reference relationship in the GOP structure depends on the video to be encoded. For example, in fast-moving video the correlation between pictures is small, so a reference relationship with a short inter-picture distance is advantageous (for example, FIG. 8(A) or FIG. 8(B)). Methods for determining a more efficient GOP structure have therefore been proposed.

For example, the scene-adaptive video encoding device described in Patent Document 1 determines the picture type by examining, in advance, the efficiency of motion-compensated prediction for the picture to be encoded. That is, the device detects the motion-compensated prediction characteristics before the encoding process and uses them to determine the picture type of the encoding target picture.

FIG. 9 shows a flowchart of the prior art, and FIG. 10 shows a block diagram of the prior art.

The pre-analysis unit 200 in FIG. 10 detects the motion-compensated prediction characteristics of the encoding target picture from the input image signal before the encoding process (step S100). The picture type determination unit 201 determines the picture type of the encoding target picture on the basis of the motion-compensated prediction characteristics detected by the pre-analysis unit 200 (step S101). The picture rearrangement unit 202 rearranges the pictures according to the determined picture type, changing the encoding order (step S102). The encoding processing unit 203 performs predictive encoding on the image signals of the pictures rearranged by the picture rearrangement unit 202 (step S103).

The predictive encoding performed in the encoding processing unit 203 by the subtractor 204, orthogonal transform unit 205, quantization unit 206, information source encoding unit 207, inverse quantization unit 208, inverse orthogonal transform unit 209, adder 210, frame memory 211, motion search unit 212, and motion compensation unit 213 is well known, so a detailed description is omitted.

According to such a method, the optimal picture type can be selected for the input video, and encoding efficiency improves compared with a scheme in which the GOP structure is fixed.
Japanese Unexamined Patent Application Publication No. 2002-77924 (JP 2002-77924 A)

In the technique of Patent Document 1 described above, motion detection and motion-compensated prediction are performed on the encoding target picture before the encoding process, so that the motion-compensated prediction efficiency is estimated and the picture type is selected. Here, the motion-compensated prediction characteristics are obtained from results inside the encoding loop.

With this method, when the picture type is changed, the encoding order must also be changed, which complicates the processing. For example, when a B picture is selected on the basis of the motion-compensated prediction characteristics, the picture to be encoded next must be a P picture, so the information in the encoding loop has to be discarded. The conventional method thus suffers from increased computational cost and a more complicated processing structure.

An object of the present invention is to solve the above problems by determining the optimal picture type (GOP structure) for the encoding target picture without examining motion compensation efficiency or the like before encoding, thereby eliminating the need for a prior motion vector search on the encoding target picture, reducing computational cost, and simplifying the processing structure.

In a video encoding scheme having multiple inter-picture prediction modes, encoding results such as motion vectors and prediction mode selections vary with the encoding efficiency of the picture type for the input video and with the properties of the video. The properties of the encoding target picture are therefore estimated from the encoding results of already encoded pictures. The encoding results used are information available on the encoder side, such as the selection ratios of the prediction modes and the norms of the motion vectors.

That is, to solve the above problems, the present invention focuses on the prediction modes and motion vectors of previously encoded inter-picture predicted pictures when switching the picture type for inter-picture prediction. Before the encoding target picture is encoded, a statistic is calculated solely from the encoding results of inter-picture predicted pictures in the same video that were encoded before the encoding target picture; a value obtained from the statistic is compared with a predetermined threshold; and the picture type of the encoding target picture is determined from the comparison result. The value obtained from the statistic may be the statistic itself or a value derived from it, such as a cost value.

In the prior art, the efficiency of motion-compensated prediction and the like is calculated for the encoding target picture before the encoding process in order to determine the picture type, so the signal of the encoding target picture is needed to determine its picture type. In contrast, the present invention determines the picture type of the encoding target picture from the encoding results of already encoded pictures, so the encoding target picture itself need not be input. This is the major difference from the prior art. As a result, no prior motion vector search on the encoding target picture is needed before encoding, and both the computational cost and the circuit scale of the pre-analysis unit can be reduced.

Furthermore, determining the picture type using only the results of bidirectionally predicted pictures is also effective for selecting the optimal picture type.

In the above invention, it is also preferable to determine the picture type using one or more of the motion vector, prediction mode, prediction block size, and prediction error power of the encoded inter-picture predicted pictures.

In the above invention, when the picture type is determined using multiple encoding results of encoded pictures (motion vector, prediction mode, prediction block size, prediction error power, and so on), good results are also obtained by setting a predetermined threshold for each type of encoding result used, comparing each encoding result with its threshold, and determining the picture type of the encoding target picture from the comparison results.

In the above invention, when the picture type is determined using multiple encoding results of encoded pictures (motion vector, prediction mode, prediction block size, prediction error power, and so on), it is also preferable to use means for calculating a cost by substituting the encoding results into a cost function that takes them as parameters and means for comparing the calculated cost with a predetermined threshold, and to determine the picture type of the encoding target picture from the comparison result.

In the above invention, when the motion vectors of encoded inter-picture predicted pictures are used, it is also preferable to use means for calculating statistics such as the mean, maximum, and variance of each motion vector component or of the motion vector norm, and to determine the picture type using the calculated statistics.

In the above invention, when the prediction modes and prediction block sizes of encoded inter-picture predicted pictures are used, it is also preferable to use means for calculating, as the statistic, the selection ratio of each prediction mode or prediction block size, and to determine the picture type using the calculated selection ratios.

In the above invention, good results are also obtained by determining the picture type in stages using prediction mode statistics. This uses means for calculating, from encoded bidirectionally predicted pictures, the selection ratio of the bidirectional prediction mode and the reduced-overhead prediction modes; means for comparing the calculated selection ratio with predetermined thresholds; means for calculating, when the comparison result falls within a predetermined range, the selection ratio of prediction modes other than forward prediction and intra prediction; and means for comparing that selection ratio with a predetermined threshold.

In the above invention, it is also preferable to determine the picture type using the encoding results of both encoded unidirectionally predicted pictures and encoded bidirectionally predicted pictures. Good results are also obtained by using means for measuring the motion vector statistics of encoded unidirectionally predicted pictures and means for measuring the prediction mode statistics of bidirectionally predicted pictures, and determining the picture type from the calculated statistics.

In the above invention, when the thresholds are set, means for calculating them from the picture size and frame rate of the input video and from the encoding bit rate or the code amount allocated to each picture is used, so that the thresholds vary with the encoding conditions and the input video when the picture type is determined. Switching the thresholds automatically in this way allows thresholds appropriate to the situation to be used, enabling even more efficient picture type selection.

According to such a method, an efficient picture type can be selected without performing motion-compensated prediction or the like on the encoding target picture before encoding. Because the encoding efficiency of the encoding target picture need not be examined in advance, a reduction in computational cost and circuit scale can be expected.

Embodiments of the present invention are described below with reference to the drawings. FIG. 1 shows the basic flowchart of an embodiment of the present invention.

First, statistics of motion vectors, prediction modes, and the like are calculated for pictures that use inter-picture prediction, such as P pictures and B pictures (step S1).

Examples of statistics are as follows.
(1) The mean, maximum, variance, and so on of the horizontal and vertical components of the motion vectors
(2) The mean, maximum, variance, and so on of the motion vector norms
(3) The selection ratio of each prediction mode and prediction block size
(4) The prediction error power
Among these statistics, the mean, maximum, and norm of the motion vectors indicate how fast the video is moving, and the variance of the motion vectors indicates how much the motion varies. The selection ratios of the prediction modes and prediction block sizes reveal characteristics of the video. For example, in a B picture, the prediction direction shows the similarity to the preceding and following reference pictures. The prediction block size allows the size of moving regions to be estimated. When small prediction block sizes such as 4×4 are selected frequently, the motion within the picture may not be uniform. The prediction error power can be used to gauge prediction efficiency.
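The statistics listed above can be sketched as follows. This is only an illustration under stated assumptions: motion vectors are represented as `(x, y)` tuples, one per prediction block, the population variance is used, and the names `motion_vector_stats` and `selection_ratios` are hypothetical. The source does not specify whether signed or absolute component values are used; the sketch uses signed values.

```python
import math
from collections import Counter
from statistics import mean, pvariance


def summarize(values):
    """Mean, maximum, and population variance of a list of numbers."""
    return {"mean": mean(values), "max": max(values), "var": pvariance(values)}


def motion_vector_stats(mvs):
    """Statistics (1) and (2): per-component and norm statistics.

    mvs: non-empty list of (x, y) motion vectors of an encoded picture.
    Components are kept signed (an assumption of this sketch).
    """
    xs = [x for x, _ in mvs]
    ys = [y for _, y in mvs]
    norms = [math.hypot(x, y) for x, y in mvs]
    return {"x": summarize(xs), "y": summarize(ys), "norm": summarize(norms)}


def selection_ratios(labels):
    """Statistic (3): fraction of blocks that selected each label.

    labels: one prediction mode (or block size) label per block.
    """
    total = len(labels)
    return {label: count / total for label, count in Counter(labels).items()}


if __name__ == "__main__":
    stats = motion_vector_stats([(3, 4), (0, 0)])
    print(stats["norm"])
    print(selection_ratios(["bi", "bi", "fwd", "skip"]))
```

A large norm mean suggests fast motion, while a large variance suggests non-uniform motion, matching the interpretation given in the text.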

When calculating a statistic, a threshold may also be used to compute a decision ratio. For motion vectors, for example, a threshold for judging the speed of motion can be set, and the ratio of motion vectors judged to be fast can be used as the statistic.
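A minimal sketch of such a decision ratio is shown below. It assumes motion vectors are `(x, y)` tuples and that "fast" means the norm strictly exceeds the threshold; the name `fast_motion_ratio` and the handling of an empty input are assumptions of this sketch.

```python
import math


def fast_motion_ratio(mvs, speed_threshold):
    """Fraction of motion vectors whose norm exceeds speed_threshold.

    Returns 0.0 when no motion vectors are available (an assumption;
    the source does not define this case).
    """
    if not mvs:
        return 0.0
    fast = sum(1 for x, y in mvs if math.hypot(x, y) > speed_threshold)
    return fast / len(mvs)
```

For instance, with the vectors `(3, 4)`, `(0, 1)`, `(6, 8)`, `(1, 1)` and a threshold of 4.0, two of the four norms (5 and 10) exceed the threshold, so the ratio is 0.5.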

Next, the picture type of the encoding target picture is determined on the basis of the statistics obtained from the encoding results of the encoded pictures (step S2). The determination uses the statistics of one specific encoded picture type or of several picture types.

(1) Using the statistics of a specific picture type
The statistics of a single picture type, such as the B picture, are used to judge the suitability of the encoding target picture, and the picture type of the encoding target picture is determined accordingly.

(2) Using the statistics of multiple picture types
The statistics of the individual picture types are compared, the picture type expected to give the higher encoding efficiency is selected, and the picture type of the encoding target picture is determined accordingly.

One statistic, or a combination of several, is used to determine the picture type. When several statistics are combined, the following methods are available:
(i) calculating a cost using a cost function that takes each statistic as a parameter;
(ii) applying a threshold decision to each statistic.

When a cost function is used, a cost value is calculated from multiple statistics. For example, a cost Cost is obtained from the prediction error power D and the generated code amount R, and the picture type is determined from this cost. Here λ is a Lagrange undetermined multiplier.

Cost = D + λ · R   (Formula 1)

In the case of threshold decision, a threshold is defined for each statistic, and the picture type is determined from the magnitude relationship between each statistic and its threshold. The statistics to be compared can also be given priorities, and the threshold of a later statistic can be changed according to the outcome of an earlier comparison. For example, when statistic A and statistic B are used, the threshold for statistic B can be switched to TH_B1 if statistic A is greater than or equal to the threshold TH_A, and to TH_B2 if statistic A is less than TH_A.
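The two ways of combining statistics can be sketched as follows. The first function evaluates Formula 1 directly; the second selects the threshold for statistic B depending on statistic A, as in the TH_A / TH_B1 / TH_B2 example. The function names are hypothetical, and the source does not state how statistic B is then compared with the selected threshold, so that final comparison is left to the caller.

```python
def rd_cost(d: float, r: float, lam: float) -> float:
    """Formula 1: Cost = D + lambda * R.

    d: prediction error power D.
    r: generated code amount R.
    lam: Lagrange multiplier lambda.
    """
    return d + lam * r


def threshold_for_b(stat_a: float, th_a: float,
                    th_b1: float, th_b2: float) -> float:
    """Select the threshold applied to statistic B.

    TH_B1 is used when statistic A >= TH_A, otherwise TH_B2.
    """
    return th_b1 if stat_a >= th_a else th_b2


if __name__ == "__main__":
    print(rd_cost(100.0, 20.0, 0.5))
    print(threshold_for_b(0.7, 0.5, 0.3, 0.6))
```

Keeping the threshold selection separate from the comparison makes it easy to prioritize statistics: the higher-priority statistic A is evaluated first and only influences which threshold the later statistic B is tested against.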

By determining the picture type from the encoding results of encoded pictures in this way, the picture type of the encoding target picture can be determined without analyzing the input video before encoding.

FIG. 2 is a block diagram showing a configuration example of an encoding apparatus using the present invention. In FIG. 2, the statistic calculation unit 10 is means for calculating statistics of the encoding results of encoded pictures. The picture type determination unit 11 uses the statistics calculated by the statistic calculation unit 10, compares a value obtained from the statistics with a predetermined threshold, and determines the picture type of the encoding target picture from the comparison result. The picture rearrangement unit 12 switches the picture type for inter-picture prediction on the basis of the determined picture type and changes the encoding order to match the reference relationship.

The encoding processing unit 100 has the same configuration as a conventional encoding processing unit using motion-compensated prediction, except that it outputs the encoding results of encoded pictures to the statistic calculation unit 10. The encoding processing unit 100 receives the image signals of the encoding target pictures rearranged by the picture rearrangement unit 12.

In the encoding processing unit 100, the subtractor 101 calculates the prediction error between the input image signal and the predicted image signal. The orthogonal transform unit 102 applies an orthogonal transform such as the DCT to the prediction error signal, and the quantization unit 103 quantizes the output of the orthogonal transform unit 102. The information source encoding unit 104 applies variable-length encoding to the quantized signal and outputs it as encoded data.

Meanwhile, the output of the quantization unit 103 is inversely quantized by the inverse quantization unit 105, and the inverse orthogonal transform unit 106 applies an inverse orthogonal transform such as the IDCT to the inversely quantized values. The adder 107 adds the predicted image signal to the prediction error obtained by the inverse orthogonal transform to obtain a decoded image signal for reference, and stores it in the frame memory 108.

The motion search unit 109 performs motion estimation from the input image signal and the decoded image signal stored in the frame memory 108, obtains motion vectors, and outputs them to the motion compensation unit 110. The motion compensation unit 110 generates a predicted image signal from the motion vectors and outputs it to the subtractor 101. It also outputs the predicted image signal to the adder 107 for local decoding.

The motion vector and prediction mode information obtained by the motion search unit 109 is encoded by the information source encoding unit 104 and output as encoded data. This encoding result information is also output to the statistic calculation unit 10.

[Example 1]
Specific examples of the present invention are described below. Example 1 assumes picture type switching in H.264 and switches between M = 2 of FIG. 8(B) and M = 4 of FIG. 8(E). Only the statistics of encoded B pictures are used for the switching, and the following two statistics are used.
(1) BMC_bi: the selection ratio of bidirectional prediction plus the reduced-overhead prediction modes
(2) BMC_all: the selection ratio of prediction modes other than forward prediction and intra prediction
The reduced-overhead prediction modes are the skip mode and direct mode used in H.264. The selection ratio BMC_all of modes other than forward prediction and intra prediction equals BMC_bi plus the selection ratio of the backward prediction mode.
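The two selection ratios can be sketched as follows. The sketch assumes that the encoder reports, for an encoded B picture, how many blocks selected each prediction mode, and that the ratios are taken over all blocks of the picture; the dictionary keys and the name `bmc_ratios` are hypothetical labels, not names from this document.

```python
def bmc_ratios(mode_counts: dict) -> tuple:
    """Return (BMC_bi, BMC_all) for one encoded B picture.

    mode_counts maps a prediction mode label to the number of blocks
    that selected it. Assumed labels: "bidirectional", "skip",
    "direct", "backward", "forward", "intra".
    """
    total = sum(mode_counts.values())
    if total == 0:
        return 0.0, 0.0  # no blocks counted (assumption of this sketch)
    # Bidirectional prediction plus the reduced-overhead modes
    # (skip and direct).
    bi_like = (mode_counts.get("bidirectional", 0)
               + mode_counts.get("skip", 0)
               + mode_counts.get("direct", 0))
    # BMC_all additionally counts backward prediction, i.e. everything
    # except forward prediction and intra prediction.
    all_like = bi_like + mode_counts.get("backward", 0)
    return bi_like / total, all_like / total
```

By construction BMC_all is never smaller than BMC_bi, which is why it is only consulted when BMC_bi alone is inconclusive.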

Picture type switching is performed by threshold decision, switching between M = 2 and M = 4 in units of four pictures. FIG. 3 shows an example of switching the M value in units of four pictures.

In this example, because the M value is switched in units of four pictures, the following M-value switching process is executed after the last B picture in each four-picture unit has been processed.

FIG. 4 shows a flowchart of this example.
[Step S10] Measure the prediction mode statistics of the B picture:
The statistics of the encoded B picture 20 shown in FIG. 3 are measured.
[Step S11] Determine the M value:
The M value is determined from the measured statistics. The details of this process are described later with reference to FIG. 5.
[Step S12] Determine the picture types:
The picture types of the next four frames are determined on the basis of the M value determined in step S11.
[Step S13] Rearrange the pictures:
The encoding order is changed to match the reference relationship.
[Step S14] Encoding process:
The pictures are encoded in order according to the picture types determined in step S12.
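Steps S12 and S13 can be illustrated with the sketch below, which maps an M value to the picture types of one four-picture unit and to an encoding order. The layouts are assumptions: for M = 2 the unit is taken to be B, P, B, P in display order, and for M = 4 the referenceable B picture (Bs) is assumed to sit in the middle of the three B pictures; the actual arrangements are those of FIG. 8(B) and FIG. 8(E), which are not reproduced here. The function names are hypothetical.

```python
def unit_picture_types(m: int) -> list:
    """Step S12 (sketch): picture types of a 4-picture unit, display order."""
    if m == 2:
        return ["B", "P", "B", "P"]
    if m == 4:
        # Assumed hierarchical layout with the referenceable Bs in the middle.
        return ["B", "Bs", "B", "P"]
    raise ValueError("this sketch only handles M = 2 and M = 4")


def unit_coding_order(m: int) -> list:
    """Step S13 (sketch): display indices listed in encoding order.

    A picture is encoded only after the pictures it references.
    """
    if m == 2:
        return [1, 0, 3, 2]   # each P before the B that precedes it
    if m == 4:
        return [3, 1, 0, 2]   # P first, then Bs, then the remaining B pictures
    raise ValueError("this sketch only handles M = 2 and M = 4")


if __name__ == "__main__":
    for m in (2, 4):
        types = unit_picture_types(m)
        print(m, [types[i] for i in unit_coding_order(m)])
```

Because the decision is made once per unit, the encoding order only has to be rearranged at unit boundaries, not after each picture.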

The method for determining the M value in step S11 is as follows. First, the selection ratio BMC_bi of bidirectional prediction plus the reduced-overhead prediction modes is compared with thresholds. When BMC_bi lies between the two thresholds TH_H and TH_L (where TH_H > TH_L), the selection ratio BMC_all of prediction modes other than forward prediction and intra prediction is compared with the threshold TH_H. FIG. 5 shows a flowchart of the process for determining the M value.
[Step S20] Calculate BMC_bi:
The selection ratio BMC_bi of bidirectional prediction plus the reduced-overhead prediction modes is calculated.
[Step S21] BMC_bi > TH_H:
The selection ratio BMC_bi is compared with the threshold TH_H. If BMC_bi exceeds TH_H, the process proceeds to step S25 (M = 4); if it is less than or equal to TH_H, the process proceeds to the next step S22.
[Step S22] BMC_bi > TH_L:
The selection ratio BMC_bi is compared with the threshold TH_L. If BMC_bi is less than or equal to TH_L, the process proceeds to step S26 (M = 2); if it exceeds TH_L, the process proceeds to the next step S23.
[Step S23] Calculate BMC_all:
The selection ratio BMC_all of prediction modes other than forward prediction and intra prediction is calculated.
[Step S24] BMC_all > TH_H:
The selection ratio BMC_all is compared with the threshold TH_H. If BMC_all exceeds TH_H, the process proceeds to step S25 (M = 4); if it is less than or equal to TH_H, the process proceeds to step S26 (M = 2).
[Step S25] M = 4:
The encoding interval M of P pictures is set to 4.
[Step S26] M = 2:
The encoding interval M of P pictures is set to 2.
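Steps S20 to S26 amount to a small decision function, sketched below. The name `decide_m` is hypothetical, and for simplicity both ratios are passed in as arguments, whereas the flowchart computes BMC_all only when step S23 is reached.

```python
def decide_m(bmc_bi: float, bmc_all: float, th_h: float, th_l: float) -> int:
    """Return the P picture interval M (2 or 4) from B picture statistics.

    bmc_bi:  selection ratio of bidirectional + reduced-overhead modes.
    bmc_all: selection ratio of modes other than forward and intra.
    th_h, th_l: upper and lower thresholds, with th_h > th_l.
    """
    if th_h <= th_l:
        raise ValueError("TH_H must be greater than TH_L")
    if bmc_bi > th_h:        # step S21 -> step S25
        return 4
    if bmc_bi <= th_l:       # step S22 -> step S26
        return 2
    # TH_L < BMC_bi <= TH_H: fall back to BMC_all (steps S23 and S24).
    return 4 if bmc_all > th_h else 2
```

For example, with TH_H = 0.8 and TH_L = 0.5, a BMC_bi of 0.6 is inconclusive on its own; M = 4 is then chosen only if BMC_all exceeds 0.8.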

FIG. 6 shows the relationship between the statistics and the GOP structure, in particular the relationship between each selection ratio and the M value. As shown in FIG. 6, for example, the M value is determined by the positional relationship of the two selection ratios.

[Example 2]
Example 2 is described next. As in Example 1, Example 2 switches between M = 2 of FIG. 8(B) and M = 4 of FIG. 8(E). In addition to the encoding results of the B picture used above, the norm of the motion vectors is used as a statistic of the P picture. A threshold TH_MVnorm is set for the motion vector norm, and the ratio MV_f of motion vectors whose norm exceeds this threshold is calculated. The picture type is switched using this motion vector ratio MV_f.

FIG. 7 shows a flowchart of Example 2.
[Step S30] Measure the motion vector ratio of the P picture:
The norm of each motion vector of the encoded P picture is calculated, and it is determined whether the norm exceeds the threshold TH_MVnorm. The ratio MV_f of motion vectors exceeding the threshold TH_MVnorm is then calculated.
[Step S31] Compare the motion vector ratio MV_f with the threshold TH_r:
If the motion vector ratio MV_f is greater than the threshold TH_r, the process proceeds to step S32; otherwise, it proceeds to step S33.
[Step S32] M = 2:
If the motion vector ratio MV_f is greater than the threshold TH_r, the motion is judged to be fast, M = 2 is selected, and the process proceeds to step S35.
[Step S33] Measure the prediction mode statistics of the B picture:
The statistics of the encoded B picture 20 shown in FIG. 3 are measured.
[Step S34] Determine the M value:
The M value is determined from the measured statistics. Here, for example, the M value is determined by the same algorithm as the M value determination of Example 1 described with reference to FIG. 5.
[Step S35] Determine the picture types:
Based on the M value determined in step S32 or S34, the picture types of up to four frames ahead are determined.
[Step S36] Rearrange the pictures:
The coding order is rearranged to match the reference relationships.
[Step S37] Encoding:
Encoding is performed in order, according to the picture types determined in step S35 and the coding order set in step S36.
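The branch in steps S31 to S34 can be sketched as below. This is only an outline under stated assumptions: `decide_m` and `example1_stub` are hypothetical names, and the stub stands in for the Example 1 algorithm of FIG. 5, which is not reproduced here.

```python
def decide_m(mv_ratio, th_r, decide_m_from_b_stats):
    """Choose the M value following steps S31 to S34.

    mv_ratio: MV_f measured on the encoded P picture (step S30).
    th_r: threshold TH_r for the ratio.
    decide_m_from_b_stats: callable that returns M from B-picture prediction
        mode statistics (hypothetical hook for the Example 1 algorithm).
    """
    if mv_ratio > th_r:
        # S31 -> S32: motion is judged fast, so M = 2 is chosen directly.
        return 2
    # S33 -> S34: otherwise fall back to the B-picture statistics.
    return decide_m_from_b_stats()

def example1_stub():
    """Placeholder only; a real encoder would evaluate B-picture statistics."""
    return 4

print(decide_m(0.6, th_r=0.3, decide_m_from_b_stats=example1_stub))  # 2
print(decide_m(0.1, th_r=0.3, decide_m_from_b_stats=example1_stub))  # 4
```

Passing the fallback as a callable means the B-picture statistics are measured only when the fast-motion test fails, which matches the order of steps S31 and S33.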

In this example, the speed of motion is checked using the motion vectors of the P picture. For video with fast motion, M = 2, which is efficient for fast motion, is selected; otherwise, the M value is determined by the same method as in Example 1.

This example uses the norm of the motion vectors of the P picture, but that value changes with the distance between the picture and its reference picture. When motion vectors are used, it is therefore better to normalize them by the distance between reference pictures. Normalization allows the values to be handled in the same way even when the distance between reference pictures changes.
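A minimal sketch of this normalization follows. It assumes the distance is counted in pictures and that the Euclidean norm is used; the name `normalized_norm` is hypothetical.

```python
import math

def normalized_norm(mv, ref_distance):
    """Return the motion vector norm per one-picture interval.

    mv: (dx, dy) motion vector.
    ref_distance: number of pictures between the picture and its reference
        (assumed unit; the source only says "distance between reference pictures").
    """
    if ref_distance <= 0:
        raise ValueError("ref_distance must be positive")
    dx, dy = mv
    return math.hypot(dx, dy) / ref_distance

# The same motion speed seen at reference distances 2 and 4 normalizes to one value.
print(normalized_norm((6, 8), ref_distance=2))    # 5.0
print(normalized_norm((12, 16), ref_distance=4))  # 5.0
```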

In determining the encoded picture type in the above examples, it is also suitable to determine the picture type while changing the thresholds according to the encoding conditions and the input video. For this purpose, means is provided for calculating a threshold, when it is set, from the picture size and frame rate of the input video and from the encoding bit rate or the code amount allocated to each picture. The thresholds are then changed, for example, as follows.

(1) Picture size
Because the block size changes when the picture size changes, the statistics change. When the picture size is large, the threshold is lowered; when the picture size is small, the threshold is raised.

(2) Frame rate
When the frame rate changes, the distance between frames changes and the statistics change. When the frame rate is high, the inter-frame correlation is high, so the threshold is lowered; when the frame rate is low, the inter-frame correlation is low, so the threshold is raised.

(3) Encoding bit rate
The bit rate changes the proportion of overhead cost, which changes the selection ratio of the prediction modes and therefore the statistics. When the bit rate is high, the threshold is lowered; when the bit rate is low, the threshold is raised.

(4) Code amount allocated to each picture
As with the encoding bit rate, the selection ratio of the prediction modes changes with the allocated code amount. For example, when the proportion allocated to B pictures is large, the threshold is lowered; when the proportion is small, the threshold is raised.

When the inter-frame correlation is small or the code amount allocated to B pictures is small, control that reduces the overhead cost takes effect, so direct mode and skip occur frequently. For this reason, even if these prediction modes are selected often, the optimal picture type is not necessarily selected.

Parameters such as the picture size, the frame rate, the encoding bit rate, or the code amount allocated to each picture can also be compared in terms of the code amount allocated per pixel. The number of pixels per unit time is "picture size × frame rate", and dividing the bit rate by this number of pixels gives bit/pel. The threshold can be determined by creating in advance, for example, a table relating the code amount per pixel to the threshold and looking up this table with the code amount per pixel.
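The bit/pel calculation and the table lookup can be sketched as follows. The table boundaries and threshold values are invented purely for illustration, and the names `bits_per_pixel`, `BPP_UPPER_BOUNDS`, `THRESHOLDS`, and `threshold_for` are hypothetical. The table is assumed to lower the threshold as bit/pel rises, following item (3) above; the source does not give actual values.

```python
import bisect

def bits_per_pixel(bit_rate, width, height, frame_rate):
    """Return bit/pel: bit rate divided by pixels per unit time."""
    pixels_per_second = width * height * frame_rate  # picture size x frame rate
    return bit_rate / pixels_per_second

# Hypothetical table: bit/pel below 0.05 -> 0.60, below 0.10 -> 0.50,
# below 0.20 -> 0.40, and 0.30 otherwise. Values are illustrative only.
BPP_UPPER_BOUNDS = [0.05, 0.10, 0.20]
THRESHOLDS = [0.60, 0.50, 0.40, 0.30]

def threshold_for(bpp):
    """Look up the threshold for a given bit/pel value in the table."""
    return THRESHOLDS[bisect.bisect_right(BPP_UPPER_BOUNDS, bpp)]

# 6 Mbit/s at 1920x1080, 30 frames per second is roughly 0.096 bit/pel.
bpp = bits_per_pixel(6_000_000, 1920, 1080, 30)
print(round(bpp, 4), threshold_for(bpp))  # 0.0965 0.5
```

A single table indexed by bit/pel folds picture size, frame rate, and bit rate into one lookup, which is the simplification the paragraph describes.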

The encoded picture type determination processing described above can also be realized by a computer and a software program. The program can be provided by recording it on a computer-readable recording medium, or it can be provided through a network.

FIG. 1 is a flowchart of an embodiment of the present invention.
FIG. 2 is a block diagram showing a configuration example of an encoding apparatus using the present invention.
FIG. 3 is a diagram showing an example of switching the M value in units of four pictures.
FIG. 4 is a flowchart of Example 1.
FIG. 5 is a flowchart of the processing for determining the M value.
FIG. 6 is a diagram showing the relationship between the statistics and the GOP structure.
FIG. 7 is a flowchart of Example 2.
FIG. 8 is a diagram showing examples of the reference relationships (GOP structures) of the picture types.
FIG. 9 is a flowchart of the prior art.
FIG. 10 is a block diagram of the prior art.

Explanation of symbols

10 Statistic calculation unit
11 Picture type determination unit
12 Picture rearrangement unit
100 Encoding processing unit
101 Subtractor
102 Orthogonal transform unit
103 Quantization unit
104 Information source encoding unit
105 Inverse quantization unit
106 Inverse orthogonal transform unit
107 Adder
108 Frame memory
109 Motion search unit
110 Motion compensation unit

Claims

An encoded picture type determination method in a video coding scheme having a plurality of inter-picture prediction modes, the method comprising:
calculating, before the encoding of an encoding target picture, a statistic of only the encoding results of encoded inter-picture prediction pictures that were encoded before the encoding target picture in the same encoding target video;
comparing a value obtained from the statistic with a predetermined threshold, and determining the picture type of the encoding target picture from the comparison result; and
switching the picture type for inter-picture prediction based on the determined picture type.

The encoded picture type determination method according to claim 1,
wherein, when only the encoding results of the encoded inter-picture prediction pictures are used, only the encoding results of bidirectional prediction pictures are used.

The encoded picture type determination method according to claim 1,
wherein one or more values among a motion vector, a prediction mode, a prediction block size, and prediction error power are used as the encoding result of the encoded inter-picture prediction picture.

The encoded picture type determination method according to claim 3,
wherein, when the picture type is determined using a plurality of encoding results of the encoded inter-picture prediction picture, a threshold set in advance for each type of encoding result to be used is used, the value is compared with the threshold corresponding to the type of the encoding result, and the picture type of the encoding target picture is determined from the comparison result.

The encoded picture type determination method according to claim 3,
comprising, when the picture type is determined using a plurality of encoding results of the encoded inter-picture prediction picture, a step of calculating a cost value by substituting the statistics of the encoding results to be used as parameters,
wherein the cost value is compared with the threshold, and the picture type of the encoding target picture is determined from the comparison result.

The encoded picture type determination method according to claim 1,
comprising, when a motion vector is used as the encoding result of the encoded inter-picture prediction picture, a step of calculating, as the statistic, an average value, a maximum value, or a variance of each component or of the norm of the motion vector.

The encoded picture type determination method according to claim 1,
comprising, when a prediction mode or a prediction block size is used as the encoding result of the encoded inter-picture prediction picture, a step of calculating, as the statistic, the selection ratio of each prediction mode or each prediction block size.

The encoded picture type determination method according to claim 7, comprising:
calculating, from the bidirectional prediction pictures among the encoded inter-picture prediction pictures, a first selection ratio of the bidirectional prediction mode and the overhead-saving prediction mode;
comparing the first selection ratio with a predetermined threshold;
calculating, when the comparison result falls within a predetermined range, a second selection ratio obtained by adding the selection ratio of the backward prediction mode to the first selection ratio; and
comparing the second selection ratio with a predetermined threshold,
wherein the picture type is determined step by step using the prediction mode statistics.

The encoded picture type determination method according to claim 1,
wherein the encoding results of both picture types, encoded unidirectional prediction pictures and bidirectional prediction pictures, are used as the encoding result of the encoded inter-picture prediction picture.

The encoded picture type determination method according to claim 9, comprising:
measuring a statistic of the motion vectors of the encoded unidirectional prediction picture; and
measuring a statistic of the prediction modes of the bidirectional prediction picture,
wherein the picture type is determined from the calculated statistics.

The encoded picture type determination method according to claim 1,
comprising a step of calculating the threshold from the picture size of the input video, the frame rate, and the encoding bit rate or the code amount allocated to each picture, or from the code amount allocated per pixel,
wherein the calculated threshold is used for the comparison with the value obtained from the statistic, and the picture type is determined while changing the threshold according to the encoding conditions or the input video.

An encoded picture type determination apparatus in a video coding scheme having a plurality of inter-picture prediction modes, the apparatus comprising:
means for calculating, before the encoding of an encoding target picture, a statistic of only the encoding results of encoded inter-picture prediction pictures that were encoded before the encoding target picture in the same encoding target video;
means for comparing a value obtained from the statistic with a predetermined threshold, and determining the picture type of the encoding target picture from the comparison result; and
means for switching the picture type for inter-picture prediction based on the determined picture type.

The encoded picture type determination apparatus according to claim 12,
wherein, when only the encoding results of the encoded inter-picture prediction pictures are used, only the encoding results of bidirectional prediction pictures are used.

The encoded picture type determination apparatus according to claim 12,
wherein one or more values among a motion vector, a prediction mode, a prediction block size, and prediction error power are used as the encoding result of the encoded inter-picture prediction picture.

The encoded picture type determination apparatus according to claim 14,
wherein, when the picture type is determined using a plurality of encoding results of the encoded inter-picture prediction picture, a threshold set in advance for each type of encoding result to be used is used, the value is compared with the threshold corresponding to the type of the encoding result, and the picture type of the encoding target picture is determined from the comparison result.

The encoded picture type determination apparatus according to claim 14,
comprising means for calculating, when the picture type is determined using a plurality of encoding results of the encoded inter-picture prediction picture, a cost value by substituting the statistics of the encoding results to be used as parameters,
wherein the cost value is compared with the threshold, and the picture type of the encoding target picture is determined from the comparison result.

The encoded picture type determination apparatus according to claim 12,
comprising means for calculating, when a motion vector is used as the encoding result of the encoded inter-picture prediction picture, an average value, a maximum value, or a variance of each component or of the norm of the motion vector as the statistic.

The encoded picture type determination apparatus according to claim 12,
comprising means for calculating, when a prediction mode or a prediction block size is used as the encoding result of the encoded inter-picture prediction picture, the selection ratio of each prediction mode or each prediction block size as the statistic.

The encoded picture type determination apparatus according to claim 18, comprising:
means for calculating, from the bidirectional prediction pictures among the encoded inter-picture prediction pictures, a first selection ratio of the bidirectional prediction mode and the overhead-saving prediction mode;
means for comparing the first selection ratio with a predetermined threshold;
means for calculating, when the comparison result falls within a predetermined range, a second selection ratio obtained by adding the selection ratio of the backward prediction mode to the first selection ratio; and
means for comparing the second selection ratio with a predetermined threshold,
wherein the picture type is determined step by step using the prediction mode statistics.

The encoded picture type determination apparatus according to claim 12,
wherein the encoding results of both picture types, encoded unidirectional prediction pictures and bidirectional prediction pictures, are used as the encoding result of the encoded inter-picture prediction picture.

The encoded picture type determination apparatus according to claim 20, comprising:
means for measuring a statistic of the motion vectors of the encoded unidirectional prediction picture; and
means for measuring a statistic of the prediction modes of the bidirectional prediction picture,
wherein the picture type is determined from the calculated statistics.

The encoded picture type determination apparatus according to claim 12,
comprising means for calculating the threshold from the picture size of the input video, the frame rate, and the encoding bit rate or the code amount allocated to each picture, or from the code amount allocated per pixel,
wherein the calculated threshold is used for the comparison with the value obtained from the statistic, and the picture type is determined while changing the threshold according to the encoding conditions or the input video.

An encoded picture type determination program for causing a computer to execute the encoded picture type determination method according to any one of claims 1 to 11.

A computer-readable recording medium on which the encoded picture type determination program according to claim 23 is recorded.