JP2007318617A

JP2007318617A - Image encoder and image encoding program

Info

Publication number: JP2007318617A
Application number: JP2006148030A
Authority: JP
Inventors: Shohei Saito; 昇平齋藤; Toru Yokoyama; 徹横山; Ren Imaoka; 連今岡; Seiji Mochizuki; 誠二望月; Muneaki Yamaguchi; 宗明山口
Original assignee: Renesas Technology Corp
Current assignee: Renesas Technology Corp
Priority date: 2006-05-29
Filing date: 2006-05-29
Publication date: 2007-12-06

Abstract

<P>PROBLEM TO BE SOLVED: To encode original image data after an encoded picture without preliminarily reading it while substantially holding image quality in quantization control for determining quantization steps. <P>SOLUTION: The image encoder has: a picture judgment part which detects whether a picture to be coded is any of I, P, B pictures; and a target code amount determination part which determines a target code amount, based on the generated code amount of the I picture encoded in the past when it is judged that the picture is the I picture by the picture judgment part and determines the target code amount of the encoded picture from predicted error information of the encoded picture when the picture is judged as the P or B picture. Then, the image encoder has an activity operation part which detects complexity of design of a block in a screen and a quantization control part which determines a quantization parameter in the picture from the determined target code amount and the activity information. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、画像データを効率的に符号化するための画像符号化装置、及び画像符号化プログラムに関する。 The present invention relates to an image encoding device and an image encoding program for efficiently encoding image data.

従来、画像データを効率的に符号化するための画像符号化の技術としては、MPEG（Moving Picture Experts Group）に代表される符号化方式がある。図１に一般的なMPEG規格における画像符号化装置の構成を示す。 Conventionally, as an image encoding technique for efficiently encoding image data, there is an encoding method represented by MPEG (Moving Picture Experts Group). FIG. 1 shows a configuration of an image encoding device in a general MPEG standard.

図１に示す量子化制御部では、符号化されたデータを所定のデータ量で伝送するため、各ピクチャに対する割り当てデータ量（目標符号量）を制御する。例えば、MPEG符号化方式では、性質の異なる３種類のピクチャ(I、P、Bピクチャ)が存在するため、各ピクチャの種類によって目標符号量を決定する。即ちIまたはPピクチャをPまたはBピクチャの参照ピクチャとして用いる場合は、I、Pピクチャの目標符号量をBピクチャよりも多く割り当てる。 The quantization control unit shown in FIG. 1 controls the allocated data amount (target code amount) for each picture in order to transmit the encoded data with a predetermined data amount. For example, in the MPEG encoding method, there are three types of pictures (I, P, B pictures) having different properties, and therefore the target code amount is determined according to the type of each picture. That is, when an I or P picture is used as a reference picture for a P or B picture, a larger target code amount for the I or P picture is assigned than for a B picture.

上記のような量子化制御方式としては、例えば特許文献１のように、符号化ピクチャの量子化パラメータを決定する際に、符号化ピクチャを含めた1GOP分の原画像データを先読みして、イントラ予測処理とインター予測誤処理を実施し、イントラ予測処理とインター予測誤処理によって得られる予測誤差の大きさに応じて各ピクチャへ符号量を割り当てる量子化制御方式が知られている。また、非特許文献２のように、符号化ピクチャの予測誤差情報と発生符号量の相関関係に着目し、符号化ピクチャの目標符号量を決定する量子化制御方式が知られている。 As the quantization control method as described above, for example, as in Patent Document 1, when determining the quantization parameter of an encoded picture, the original image data for 1 GOP including the encoded picture is pre-read and intra-coded. There is known a quantization control method that performs prediction processing and inter prediction error processing, and assigns a code amount to each picture according to the magnitude of prediction error obtained by intra prediction processing and inter prediction error processing. Also, as in Non-Patent Document 2, a quantization control method is known that determines the target code amount of a coded picture by paying attention to the correlation between the prediction error information of the coded picture and the generated code amount.

特開2006-25077号公報JP 2006-25077 A MPEG-2, Test Model 5(TM5), Doc.ISO/IEC JTC1/SC29/WG11/N0400, Test Model Editing Committee, Apr.1993MPEG-2, Test Model 5 (TM5), Doc.ISO / IEC JTC1 / SC29 / WG11 / N0400, Test Model Editing Committee, Apr.1993 “画像性質とバッファ制約を考慮したH.264レート制御方式”, 電子情報通信学会論文誌,D-II,Vol.J88-D-II, 2005年, p.1114-1125“H.264 Rate Control Method Considering Image Properties and Buffer Constraints”, IEICE Transactions, D-II, Vol.J88-D-II, 2005, p.1114-1125

以下、MPEG-2規格にて用いられているMPEG-2 Test Model 5方式を例に量子化制御部での動作を簡単に説明する。なお、詳細な動作説明は、後述する。 Hereinafter, the operation in the quantization control unit will be briefly described by taking the MPEG-2 Test Model 5 method used in the MPEG-2 standard as an example. Detailed operation description will be described later.

MPEG-2 Test Model 5では、各ピクチャへの符号量割り当てを行うステップ１、仮想バッファを用いて、ピクチャ内の各ブロックへの量子化パラメータを決定するステップ２、ピクチャ内の視覚特性を考慮して各ブロックの量子化パラメータに重み付けを行うステップ３の３ステップの処理が行われている（非特許文献１参照）。 In MPEG-2 Test Model 5, Step 1 performs code amount allocation to each picture, Step 2 determines quantization parameters for each block in the picture using a virtual buffer, and considers visual characteristics in the picture. Thus, the three-step process of step 3 for weighting the quantization parameter of each block is performed (see Non-Patent Document 1).

ところで、上記MPEG-2 Test Model 5方式では、上記ステップ２の処理とステップ３の処理が相反して動作する場合がある。例えばステップ２において、符号化マクロブロックの符号量が多く発生した場合には、次に符号化するマクロブロックにおいて発生符号量を抑えようと量子化ステップが大きくなる方向に働く。そのとき、次に符号化するマクロブロックのアクティビティが小さい場合には、量子化ステップが小さくなる方向に働き、ステップ２とステップ３の制御が相殺されてしまうといった問題があった。 By the way, in the MPEG-2 Test Model 5 method, the processing in step 2 and the processing in step 3 may operate in conflict. For example, when a large amount of code is generated in the encoded macroblock in step 2, the quantization step increases in order to suppress the generated code amount in the macroblock to be encoded next. At that time, when the activity of the macroblock to be encoded next is small, there is a problem that the quantization step is reduced and the control of step 2 and step 3 is offset.

また、特許文献1の方式では、符号化ピクチャを含めて1GOP分の画像データを先読みして、イントラ予測またはインター予測誤処理を実施する必要があるため、処理時間が大幅にかかる。特に、HD(High Definition:高品位)画像などのサイズの大きい動画像データを符号化する際には、実時間で符号化処理を行うことが困難であるといった問題があった。 In addition, in the method of Patent Document 1, it is necessary to pre-read image data for 1 GOP including an encoded picture and to perform intra prediction or inter prediction error processing, so that processing time is significantly increased. In particular, when moving image data having a large size such as an HD (High Definition) image is encoded, there is a problem that it is difficult to perform encoding processing in real time.

また、非特許文献２の方式においても、符号化ピクチャの画像データを先読みして、符号化ピクチャ内の全マクロブロックの予測誤差を算出した後に、符号化ピクチャの目標符号量を決定するため、マクロブロックごとに符号化処理を行うパイプライン処理には不適であるといった問題があった。 Also in the method of Non-Patent Document 2, in order to determine the target code amount of the encoded picture after prefetching the image data of the encoded picture and calculating the prediction error of all the macroblocks in the encoded picture, There is a problem that it is not suitable for pipeline processing in which encoding processing is performed for each macroblock.

従って、本発明の目的は、かかる問題を解消し、画質を実質的に保持しながら、符号化ピクチャ以降の原画像データを先読みすることなく量子化制御を実施することを可能とした画像符号化装置および符号化方法を提供することである。 Accordingly, an object of the present invention is to provide an image coding that eliminates such a problem and can perform quantization control without prefetching the original image data after the coded picture while substantially maintaining the image quality. An apparatus and an encoding method are provided.

上記課題を解決するために、本発明の画像符号化装置は、符号化するピクチャがI、P、Bピクチャのいずれであるかを検出するピクチャ判定部とそのピクチャ判定部にてピクチャがIピクチャと判定された場合には、過去に符号化されたIピクチャの発生符号量に基づいて目標符号量を決定し、PまたはBピクチャと判定された場合には、符号化済みピクチャの予測誤差情報から、符号化ピクチャの目標符号量を決定する目標符号量決定部を備える。そして、画面内ブロックの絵柄の複雑さを検出するアクティビティ演算部を備え、上記決定された目標符号量とアクティビティ情報からピクチャ内の量子化パラメータを決定する量子化制御部を備える。 In order to solve the above problems, an image encoding apparatus according to the present invention includes a picture determination unit that detects whether a picture to be encoded is an I, P, or B picture, and the picture determination unit includes a picture that is an I picture. If it is determined, the target code amount is determined based on the generated code amount of the I picture encoded in the past, and if it is determined as the P or B picture, the prediction error information of the encoded picture And a target code amount determination unit for determining a target code amount of the encoded picture. Then, an activity calculation unit for detecting the complexity of the picture of the block in the screen is provided, and a quantization control unit for determining a quantization parameter in the picture from the determined target code amount and activity information.

本発明の画像符号化方法は、シーンチェンジなどの符号化ピクチャと符号化済みピクチャとの相関が低い場合には、符号化済みピクチャの予測誤差情報と符号化ピクチャの発生符号量を推定する関係式を補正するように制御することが好ましい。 In the image coding method of the present invention, when the correlation between a coded picture such as a scene change and a coded picture is low, the prediction error information of the coded picture and the generated code amount of the coded picture are estimated. It is preferable to control to correct the equation.

本発明によれば、MPEG規格をはじめとする動画像符号化方式における量子化ステップ決定する量子化制御において、画質を実質的保持しつつ、符号化ピクチャ以降の原画像データを先読みすることなくて符号化することが可能となる。 According to the present invention, in the quantization control for determining the quantization step in the moving picture coding system such as the MPEG standard, the image quality after the coded picture is substantially maintained without prefetching the original image data after the coded picture. It becomes possible to encode.

図１に一般的なMPEG規格における画像符号化装置の構成を示す。同図において、101は入力された画像を記憶する原画像メモリである。102は、原画像データと過去に符号化されたデータから生成された予測画像データの差分を取る加算器である。103は加算器102で演算された差分データを周波数領域に変換する直交変換処理部であり、104は、103で直交変換されたデータを量子化する量子化部である。 FIG. 1 shows a configuration of an image encoding device in a general MPEG standard. In the figure, reference numeral 101 denotes an original image memory for storing input images. Reference numeral 102 denotes an adder that takes the difference between the original image data and predicted image data generated from data encoded in the past. Reference numeral 103 denotes an orthogonal transformation processing unit that transforms the difference data calculated by the adder 102 into the frequency domain, and reference numeral 104 denotes a quantization unit that quantizes the data orthogonally transformed in 103.

また、105は、104で量子化されたデータを符号化する符号化部である。106は、105で符号化されたデータを伝送するために、蓄積されるバッファであり、107は104で量子化されたデータを逆量子化する逆量子化部である。108は107で逆量子化されたデータを逆変換する逆直交変換部である。109は、108にて逆直交変換されたデータに、フレームメモリに格納されている過去のピクチャデータを加算する加算器である。 Reference numeral 105 denotes an encoding unit that encodes the data quantized in 104. Reference numeral 106 denotes a buffer that is stored in order to transmit the data encoded in 105. Reference numeral 107 denotes an inverse quantization unit that inversely quantizes the data quantized in 104. Reference numeral 108 denotes an inverse orthogonal transform unit that inversely transforms the data inversely quantized at 107. Reference numeral 109 denotes an adder that adds past picture data stored in the frame memory to the data inversely orthogonally transformed in 108.

また、110は、加算器109で演算された後の復号画像データを記憶するフレームメモリであり、111は、原画像データと復号画像データの近似している部分を検出し、その動き検出によって生成された動きベクトルに基づき、フレームメモリから原画像データに最も近い復号画像データを読み出して予測画像を生成する動き検出・動き補償部である。113は、原画像をある領域にブロック分割し、そのブロックの特徴量を決定するアクティビティ演算を行うアクティビティ演算部である。112は、量子化ステップを制御し、目標符号量および符号化画質の品質を決定する量子化制御部である。114は符号化ブロックの周囲の画素から予測画像を生成するイントラ予測部である。 Reference numeral 110 denotes a frame memory for storing the decoded image data calculated by the adder 109, and 111 detects an approximate portion of the original image data and the decoded image data, and is generated by detecting the motion. This is a motion detection / compensation unit that reads out the decoded image data closest to the original image data from the frame memory based on the motion vector thus generated, and generates a predicted image. Reference numeral 113 denotes an activity calculation unit that performs an activity calculation that divides an original image into blocks and determines a feature amount of the block. A quantization control unit 112 controls the quantization step and determines the target code amount and the quality of the encoded image quality. Reference numeral 114 denotes an intra prediction unit that generates a prediction image from pixels around the coding block.

次に、MPEG-2 Test Model 5における各ステップの処理を説明する。 Next, processing of each step in MPEG-2 Test Model 5 will be described.

ステップ１では、まず、符号化ピクチャの発生符号量と画面の複雑さを示すパラメータXi、Xp、Xbを次式により求める。 In step 1, first, parameters Xi, Xp, and Xb indicating the amount of generated code of a coded picture and the complexity of the screen are obtained by the following equations.

ここで、Si、Sp、Sbはそれぞれ、I、P、Bピクチャ符号化時の発生符号量、Qi、Qp、QbはそれぞれI、P、Bピクチャの量子化パラメータの平均値である。また、パラメータXi、Xp、Xbの初期値は、次式で示される値が用いられる。 Here, Si, Sp, and Sb are generated code amounts when I, P, and B pictures are encoded, and Qi, Qp, and Qb are average values of quantization parameters of I, P, and B pictures, respectively. In addition, as the initial values of the parameters Xi, Xp, and Xb, values represented by the following expressions are used.

ここで、bit_rateは目標ビットレートである。 Here, bit_rate is a target bit rate.

次に、Iピクチャの量子化パラメータとP、Bピクチャの各量子化パラメータとの比率Kp、Kbを決定し、各ピクチャの目標符号量を決定する。Kp、Kｂの値はそれぞれ次式に示す値が用いられる。 Next, ratios Kp and Kb between the quantization parameter of the I picture and the quantization parameters of the P and B pictures are determined, and the target code amount of each picture is determined. The values shown in the following equations are used for the values of Kp and Kb, respectively.

GOP(Group of Picture)内の各I、P、Bピクチャに対する目標符号量をTi、Tp、Tbとすると、それぞれ次式で表される。 Assuming that the target code amount for each I, P, and B picture in a GOP (Group of Picture) is Ti, Tp, and Tb, they are respectively expressed by the following equations.

ここで、Np、NbはGOP内でまだ符号化されていないP、Bピクチャの枚数であり、frame_rateはフレームレートである。また、Rは、GOP内に対して割り当てられる符号量であり、次式で更新される。 Here, Np and Nb are the numbers of P and B pictures that have not been encoded in the GOP, and frame_rate is the frame rate. R is a code amount assigned to the GOP and is updated by the following equation.

また、GOP内の最初のピクチャを符号化する際には、次式が用いられる。 Further, the following equation is used when encoding the first picture in the GOP.

NはGOP内のピクチャ数である。またシーケンスの最初でのRの初期値は0である。 N is the number of pictures in the GOP. The initial value of R at the beginning of the sequence is 0.

ステップ２では、上記ステップ１で求められた各ピクチャに対する目標符号量と発生符号量とを一致させるため、ピクチャタイプごとに独立に設定した３種類の仮想バッファ容量に基づき、マクロブロック(MB: ピクチャ内を16×16画素単位で分割したブロック)ごとの量子化パラメータを決定する。I、P、Bピクチャの仮想バッファ占有量は、次式により求められる。 In step 2, in order to match the target code amount and the generated code amount for each picture obtained in step 1 above, a macroblock (MB: picture) is based on three types of virtual buffer capacities set independently for each picture type. The quantization parameter for each block) is determined. The virtual buffer occupancy of I, P, and B pictures is obtained by the following equation.

ここで、Bjはピクチャ先頭からj番目のマクロブロックまでの発生符号量、はピクチャ内のマクロブロック数である。ｊ番目のマクロブロックの量子化パラメータを次式により計算する。 Here, Bj is the amount of generated code from the beginning of the picture to the jth macroblock, and is the number of macroblocks in the picture. The quantization parameter of the jth macroblock is calculated by the following equation.

ここで、rはフィードバックの応答速度を制御するパラメータであり次式で与えられる。 Here, r is a parameter for controlling the feedback response speed, and is given by the following equation.

なお、仮想バッファ占有量の初期値は、次式で与えられる。 Note that the initial value of the virtual buffer occupation amount is given by the following equation.

ステップ３では、平坦部では歪が目立ちやすく、絵柄の複雑な部分では目立ちにくいという人間の視覚特性を利用し、その判別のために、アクティビティと呼ばれる分散値を用いる。そして、アクティビティに基づいて量子化ステップを変更している。具体的な処理としては、マクロブロックを4分割し、8×8画素ブロック単位で輝度信号の分散値を計算する。分散値は、数式11の計算式で求める。ここで、変数nはマクロブロック内の8×8ブロックの位置であり、左上の8×8ブロックを1右上の8×8ブロックを2、左下の8×8ブロックを3、右下の8×8ブロックを4とする。また、Pn(x,y)は、ピクチャ内の(x,y)位置の輝度信号を示している。 In step 3, a human visual characteristic that distortion is conspicuous in a flat part and inconspicuous in a complicated part of a pattern is used, and a variance value called activity is used for the discrimination. Then, the quantization step is changed based on the activity. As a specific process, the macroblock is divided into four, and the variance value of the luminance signal is calculated in units of 8 × 8 pixel blocks. The variance value is obtained by the calculation formula of Formula 11. Here, the variable n is the position of the 8 × 8 block in the macro block, the upper left 8 × 8 block is 1 the upper right 8 × 8 block is 2, the lower left 8 × 8 block is 3, the lower right 8 × 8 blocks is set to 4. Pn (x, y) indicates the luminance signal at the (x, y) position in the picture.

アクティビティは、数式12に示す式で算出する。 The activity is calculated by the formula shown in Formula 12.

ここで、最小値をとるのは、マクロブロック内の一部だけでも平坦部のある場合には量子化を細かくするためである。さらに量子化ステップの重み付け係数Nactを、下記の式で算出する。 Here, the minimum value is taken in order to make the quantization fine when only a part of the macroblock has a flat portion. Further, the weighting coefficient Nact of the quantization step is calculated by the following formula.

ここで、avg_actは、直前に符号化したピクチャのアクティビティの平均値である。
続いて、ステップ２で算出された量子化パラメータQPjに重み付け係数Ｎact を乗算符号化ブロックの量子化パラメータQPを算出する。 Here, avg_act is an average value of the activity of the picture encoded immediately before.
Subsequently, the quantization parameter QP of the coded block is calculated by multiplying the quantization parameter QPj calculated in step 2 by the weighting coefficient Nact.

このような方法により、視覚的劣化の目立ちにくい絵柄の複雑な部分では、荒く量子化を行い、また視覚的劣化の目立ちやすい平坦な領域においては細かく量子化して、発生符号量を制御する。 By such a method, the amount of generated code is controlled by performing rough quantization on a complicated portion of a pattern that is not easily noticeable visually, and finely quantizing on a flat area where the image is easily visually deteriorated.

以下に、図を用いて、本発明の実施例を詳細に述べる。
＜実施例１＞
図２は、本発明による画像符号化装置の一実施例の構成を示すブロック構成図である。同図に示すように、本画像符号化装置は、従来の画像符号化装置のように、原画像メモリ101、加算器102、直交変換部103、量子化部104、符号化部105、バッファ106、逆量子化部107、逆直交変換部108、フレームメモリ110、動き検出・動き補償部111、量子化制御部112、アクティビティ演算部113、イントラ予測部114、シーンチェンジ検出部201、ピクチャタイプ判定部202、予測誤差情報算出部203、目標符号量決定部204から構成される。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
<Example 1>
FIG. 2 is a block configuration diagram showing the configuration of an embodiment of an image encoding device according to the present invention. As shown in the figure, the present image encoding apparatus is an original image memory 101, an adder 102, an orthogonal transform unit 103, a quantization unit 104, an encoding unit 105, a buffer 106, as in a conventional image encoding device. , Inverse quantization unit 107, inverse orthogonal transform unit 108, frame memory 110, motion detection / compensation unit 111, quantization control unit 112, activity calculation unit 113, intra prediction unit 114, scene change detection unit 201, picture type determination Section 202, prediction error information calculation section 203, and target code amount determination section 204.

上記構成において、シーンチェンジを検出するシーンチェンジ検出部201、符号化ピクチャの種類を判定するピクチャタイプ判定部202、イントラ予測または、インター予測処理によって選択された予測モードに基づいて生成された予測画像と原画像の差分平均値を更新する予測誤差情報更新部203、ピクチャごとの目標符号量を決定する目標符号量決定部204を除いては、従来知られている画像符号化装置と実質的に同じである。そこで、従来知られている処理部においては、図１と同じ番号を付し、説明を省略する。 In the above configuration, a scene change detection unit 201 that detects a scene change, a picture type determination unit 202 that determines the type of an encoded picture, a prediction image generated based on a prediction mode selected by intra prediction or inter prediction processing Except for the prediction error information updating unit 203 that updates the difference average value of the original image and the target code amount determining unit 204 that determines the target code amount for each picture. The same. Therefore, conventionally known processing units are denoted by the same reference numerals as those in FIG. 1 and description thereof is omitted.

以下、図３、図４、図５、図６を参照して本発明の主要部であるシーンチェンジ検出部201、ピクチャ判定部202、予測誤差情報算出部203、目標符号量決定部204における処理を中心に本実施例を説明する。図３は、本実施例における符号化処理の流れを示すフローチャート図である。 Hereinafter, with reference to FIG. 3, FIG. 4, FIG. 5, and FIG. 6, the processing in the scene change detection unit 201, the picture determination unit 202, the prediction error information calculation unit 203, and the target code amount determination unit 204, which are the main parts of the present invention The present embodiment will be described focusing on the above. FIG. 3 is a flowchart showing the flow of the encoding process in this embodiment.

まず、ステップS301にて、符号化ピクチャと符号化ピクチャの直前に符号化されたピクチャとの相関をチェックし、シーンチェンジが起こったかどうかを判定する。シーンチェンジの判定においては、例えば符号化ピクチャと符号化ピクチャの直前に符号化されたピクチャとの輝度差分値を算出し、この輝度差分値がある閾値よりも大きい場合は、シーンチェンジが起こったものと判定する。ステップS301にてシーンチェンジが検出されなかった場合には、符号化ピクチャの目標符号量決定するステップS302に移る。以下、符号化ピクチャの目標符号量を決定する方法について説明する。 First, in step S301, the correlation between the encoded picture and the picture encoded immediately before the encoded picture is checked to determine whether a scene change has occurred. In the determination of a scene change, for example, a luminance difference value between an encoded picture and a picture encoded immediately before the encoded picture is calculated. If this luminance difference value is greater than a certain threshold, a scene change has occurred. Judge that it is. If no scene change is detected in step S301, the process proceeds to step S302 for determining the target code amount of the encoded picture. Hereinafter, a method for determining the target code amount of the encoded picture will be described.

動画像符号化では、予測誤差が大きくなると予測誤差を符号化するのに必要な情報量が増加するため、発生符号量が増える。また、量子化ステップ幅が小さいほど、細かく量子化されることになり発生符号量が増える。以下、この量子化幅を決定するパラメータを量子化パラメータQPと呼ぶことにする。一例として、量子化パラメータQP1、QP2、QP3の関係が、QP1<QP2<QP3となっているとき、予測誤差と発生符号量の関係は図４のように近似できる。 In moving picture encoding, the amount of generated code increases because the amount of information required to encode the prediction error increases as the prediction error increases. Further, the smaller the quantization step width, the finer the quantization, and the larger the generated code amount. Hereinafter, the parameter for determining the quantization width is referred to as a quantization parameter QP. As an example, when the relationship between the quantization parameters QP1, QP2, and QP3 is QP1 <QP2 <QP3, the relationship between the prediction error and the generated code amount can be approximated as shown in FIG.

このような画像符号化処理の特性に基づき、符号化ピクチャの前に符号化されたピクチャの予測誤差と発生符号量の関係を、予め統計的に求め、次式のように定式化しておく。 Based on such characteristics of the image encoding process, the relationship between the prediction error of the picture encoded before the encoded picture and the generated code amount is statistically obtained in advance and formulated as the following equation.

ここで、Tは予測される符号化ピクチャの発生符号量、SADは過去に符号化されたピクチャ予測誤差平均値、a、bは量子化パラメータによって決まる変数である。各量子化パラメータに対応する変数a、bは、図１５に示すような対応表を作成し、目標符号量決定部内のメモリに記憶しておく。なお、数式15は１次式で表したが、２次式以上の多項式で表しても良い。その際、係数は次数に比例して増えることになる。そして、数式15を用いて算出される値をピクチャごとの目標符号量とする。 Here, T is a generated code amount of a predicted encoded picture, SAD is a picture prediction error average value encoded in the past, and a and b are variables determined by quantization parameters. For the variables a and b corresponding to each quantization parameter, a correspondence table as shown in FIG. 15 is created and stored in the memory in the target code amount determination unit. In addition, although Formula 15 was represented by the linear expression, you may represent it by the polynomial more than a quadratic expression. At that time, the coefficient increases in proportion to the order. Then, the value calculated using Equation 15 is set as the target code amount for each picture.

ここで、予測誤差情報を用いる過去の符号化ピクチャと符号化ピクチャが異なるピクチャタイプの場合、過去に符号化されたピクチャの予測誤差平均値と符号化ピクチャにおいて発生する符号量との相関が低くなるため、数式15を用いて目標符号量を決定することが難しくなる場合がある。そこで、例えば、図５に示すように、符号化ピクチャがIピクチャ(I1)であるとき、目標符号量をTiとすると、既に符号化済みのＩピクチャ（I0）の発生符号量とQPの平均値を乗算した値Xiから目標符号量Tiを次式のように求める。 Here, when the past coded picture using the prediction error information is different from the coded picture, the correlation between the prediction error average value of the previously coded picture and the code amount generated in the coded picture is low. Therefore, it may be difficult to determine the target code amount using Equation 15. Therefore, for example, as shown in FIG. 5, when the encoded picture is an I picture (I1), if the target code amount is Ti, the generated code amount of the already encoded I picture (I0) and the average of the QP The target code amount Ti is obtained from the value Xi obtained by multiplying the values as follows:

ここで、ave(QP)は、符号化するIピクチャ(I1)の直前に符号化されたピクチャ（P1）の量子化パラメータの平均値である。また、aはIピクチャの発生符号量を補正する重み付け係数である。 Here, ave (QP) is an average value of quantization parameters of a picture (P1) encoded immediately before the I picture (I1) to be encoded. Further, a is a weighting coefficient for correcting the generated code amount of the I picture.

一方、符号化ピクチャがPピクチャ（P2）のとき、Pピクチャ（P2）符号化前に符号化されたピクチャがIピクチャ（I1）であるため、この場合は、Iピクチャ(I1)符号化前に符号化されたピクチャ（P1）の予測誤差平均値を用いて、数式15により目標符号量を決定する。同様に符号化するピクチャがBピクチャ（B2）のときは、Bピクチャ（B2）符号化前に符号化されたBピクチャ(B1)の予測誤差平均値を用いて、数式15により目標符号量を決定する。上記目標符号量は、次GOP全体の目標符号量決定に用いることもできる。例えば、過去に符号化されたピクチャの目標符号量をピクチャタイプ毎にメモリに格納しておく。そして、次GOP内のピクチャタイプの構成（I,B,B,P,B,B,P・・・）により、各ピクチャの枚数を計算し、上記目標符号量を加算することで、次GOP全体の目標符号量を決定することができる。 On the other hand, when the encoded picture is a P picture (P2), the picture encoded before the P picture (P2) encoding is the I picture (I1). In this case, before the I picture (I1) encoding The target code amount is determined by Equation 15 using the prediction error average value of the picture (P1) encoded in. Similarly, when the picture to be encoded is a B picture (B2), the target code amount is calculated by Equation 15 using the prediction error average value of the B picture (B1) encoded before the B picture (B2) encoding. decide. The target code amount can also be used for determining the target code amount of the entire next GOP. For example, a target code amount of a picture encoded in the past is stored in a memory for each picture type. Then, the number of each picture is calculated according to the configuration of the picture type in the next GOP (I, B, B, P, B, B, P...), And the target code amount is added to the next GOP. The overall target code amount can be determined.

なお、上記予測誤差は、H.264符号化方式のように複数の予測モードが使用されている場合、イントラ予測またはインター予測処理によって選択された予測モードの中で、最も符号化効率が良くなる予測モードで生成された予測画像と原画像の差分絶対値和を用いる。なお、予測誤差は前記差分絶対値和に重み付けを行った値を用いても良い。また、上記、予測誤差は予測画像と原画像の差分絶対値和に代えて例えば予測誤差の二乗和を適用する場合等、必要に応じて種々のパラメータを広く適用することができる。また上記予測誤差平均値は、ピクチャ内の全マクロブロックの予測誤差平均ではなく、1マクロブロック置きに算出した予測誤差の平均値を用いても良い。また、上記予測誤差平均値は、イントラ予測画像を生成する際に用いるイントラ予測画像の予測誤差平均値とインター予測処理によって生成される予測誤差平均値をそれぞれ求め、状況に応じて切り替えても良い。 Note that, when a plurality of prediction modes are used as in the H.264 encoding method, the prediction error has the highest encoding efficiency among the prediction modes selected by intra prediction or inter prediction processing. The sum of absolute differences between the predicted image generated in the prediction mode and the original image is used. The prediction error may be a value obtained by weighting the sum of absolute differences. In addition, for the prediction error, various parameters can be widely applied as necessary, for example, when a sum of squares of prediction errors is applied instead of the sum of absolute differences between the prediction image and the original image. In addition, as the prediction error average value, the prediction error average value calculated every other macro block may be used instead of the prediction error average of all the macroblocks in the picture. In addition, the prediction error average value may be switched according to the situation by obtaining the prediction error average value of the intra prediction image used when generating the intra prediction image and the prediction error average value generated by the inter prediction process, respectively. .

上記の方法により、符号化ピクチャの目標符号量を決定した後、次に、アクティビティ演算部113にて符号化マクロブロックのアクティビティを算出する（S303）。アクティビティの算出方法としては、例えば、数式11、12を用いて求める。なお、アクティビティは輝度値の分散値ではなく、輝度値の絶対差分和を用いても良い。 After determining the target code amount of the encoded picture by the above method, the activity calculation unit 113 calculates the activity of the encoded macroblock (S303). As an activity calculation method, for example, it is obtained using Formulas 11 and 12. The activity may use the absolute difference sum of luminance values instead of the variance value of luminance values.

次に、ステップS302にて決定された目標符号量とステップS303にて算出されたアクティビティ情報に基づき、マクロブロックごとの量子化パラメータを決定する(S304)。マクロブロックごとの量子化パラメータの決定では、まず、ステップ302にて決定された目標符号量から、符号化ピクチャの量子化パラメータの基準値QPinitを求める。この量子化パラメータの基準値Qpinitの算出方法を述べる。 Next, a quantization parameter for each macroblock is determined based on the target code amount determined in step S302 and the activity information calculated in step S303 (S304). In determining the quantization parameter for each macroblock, first, the reference value QPinit of the quantization parameter of the encoded picture is obtained from the target code amount determined in step 302. A method for calculating the reference value Qpinit of the quantization parameter will be described.

まず、図６に示すようにステップS302にて決定された目標符号量（BIT）と、符号化ピクチャを符号化前に符号化されたピクチャの予測誤差平均値（AVG_SAD）との交点Tを求める。次に、量子化パラメータとa、bの関係を記憶した対応表から、数式15示される直線を求め、交点Tをはさむ2直線に対応する量子化パラメータQP１、QP2を探索する。この量子化パラメータQP１とQP2をTの位置で内分した値を、量子化パラメータ基準値QPinitの値とする。ここで、図６に示す変数Dは、量子化パラメータと係数a、bの対応付けを行う対応表における記憶する量子化パラメータの間隔であり、記憶する量子化パラメータの値が5の場合はD=5となる。このように量子化パラメータ基準値Qpinitを決定した後、ステップS303にて算出されたアクティビティ情報に基づき、マクロブロックごとの量子化パラメータを決定する。具体的には、数式13および数式14により、量子化パラメータの重み付け係数Nactを算出し、量子化パラメータ基準値Qpinitに対してNactの重み付けをおこない、符号化マクロブロックの量子化パラメータを決定する。 First, as shown in FIG. 6, the intersection T between the target code amount (BIT) determined in step S302 and the prediction error average value (AVG_SAD) of the picture encoded before encoding the encoded picture is obtained. . Next, from the correspondence table storing the relationship between the quantization parameter and a and b, the straight line represented by Equation 15 is obtained, and the quantization parameters QP1 and QP2 corresponding to the two straight lines sandwiching the intersection T are searched. A value obtained by internally dividing the quantization parameters QP1 and QP2 at the position of T is set as a quantization parameter reference value QPinit. Here, the variable D shown in FIG. 6 is the interval between the quantization parameters stored in the correspondence table for associating the quantization parameter with the coefficients a and b. If the value of the stored quantization parameter is 5, D = 5. After determining the quantization parameter reference value Qpinit in this way, the quantization parameter for each macroblock is determined based on the activity information calculated in step S303. Specifically, the quantization parameter weighting coefficient Nact is calculated by Expression 13 and Expression 14, and the quantization parameter reference value Qpinit is weighted with Nact to determine the quantization parameter of the encoded macroblock.

次に、ステップS304にて符号化マクロブロックの量子化パラメータが決定した後、その量子化パラメータを用いて、従来の符号化装置と同様に、イントラ予測部114にてイントラ予測処理を、動き検出・動き補償部111にてインター予測処理を行い、予測誤差を直交変換部103にて直交変換、量子化部104にて量子化、符号化部105にて符号化を行うといった一連の符号化処理を行う（S305）。 Next, after the quantization parameter of the encoded macroblock is determined in step S304, intra prediction processing is performed by the intra prediction unit 114 using the quantization parameter in the same manner as in the conventional encoding device. A series of coding processes in which inter prediction processing is performed by the motion compensation unit 111, orthogonal transformation is performed on the prediction error by the orthogonal transformation unit 103, quantization is performed by the quantization unit 104, and coding is performed by the coding unit 105. (S305).

次に、予測誤差情報更新部203にてピクチャの先頭マクロブロックから符号化マクロブロックまでの予測誤差の平均値を算出する（S306）。S303〜S306の処理をピクチャ内の全てのマクロブロックが終了するまで繰り返す。そして、ピクチャ内の全てのマクロブロックにおいて上記処理が終了した後、ステップS301に戻る。 Next, the prediction error information updating unit 203 calculates an average value of prediction errors from the first macroblock of the picture to the encoded macroblock (S306). The processes of S303 to S306 are repeated until all macroblocks in the picture are completed. Then, after the above processing is completed for all macroblocks in the picture, the process returns to step S301.

次に、ステップ301にてシーンチェンジが検出された場合について説明する。シーンチェンジが検出された場合には、符号化ピクチャと過去の符号化ピクチャとの相関が低くなることが予想される。そのため、符号化ピクチャがPまたはBピクチャの場合には、イントラ予測モードが多く選択されることになる。そこで、このような場合には、数式16を用いて目標符号量を決定することにする。そして、シーンチェンジ後の符号化ピクチャにて目標符号量と実際の発生符号量との差が閾値THを超えたピクチャが続く場合には、図１５に示す係数a、bの値を目標符号量と実際の発生符号量との差が小さくなるように変更して、数式15の関係式を補正する（S309）。S301〜S309の処理をピクチャ内の全てのマクロブロックが終了するまで繰り返す。 Next, a case where a scene change is detected in step 301 will be described. When a scene change is detected, it is expected that the correlation between the coded picture and the past coded picture will be low. Therefore, when the encoded picture is a P or B picture, many intra prediction modes are selected. Therefore, in such a case, the target code amount is determined using Equation 16. Then, when a picture in which the difference between the target code amount and the actual generated code amount exceeds the threshold TH continues in the encoded picture after the scene change, the values of the coefficients a and b shown in FIG. And the relational expression of Equation 15 is corrected (S309). The processing from S301 to S309 is repeated until all macroblocks in the picture are completed.

本実施例では、符号化ピクチャの過去に符号化されたピクチャの予測誤差平均値を使用しているため、符号化ピクチャの画像データを先読みせずに、目標符号量と量子化パラメータを決定することができる。また、各ピクチャの目標符号量を決定した後に、アクティビティの大きさに応じたマクロブロックごとの量子化パラメータ決定処理に移るため、MPEG-2 Test Model 5方式のように、ステップ２とステップ３の制御が相殺されるということは起こらない。
＜実施例２＞
本実施例では、ピクチャごとに発生する符号量の上限値と下限値を設定することにより、バッファ破綻を回避して安定して符号化することが可能な画像符号化装置について述べる。本実施例の画像符号化装置は図２と同様の構成である。 In this embodiment, since the prediction error average value of the previously encoded picture of the encoded picture is used, the target code amount and the quantization parameter are determined without prefetching the image data of the encoded picture. be able to. In addition, after determining the target code amount of each picture, in order to move to the quantization parameter determination process for each macroblock corresponding to the size of the activity, the steps 2 and 3 are performed as in the MPEG-2 Test Model 5 method. It does not happen that control is offset.
<Example 2>
In the present embodiment, an image coding apparatus capable of stably coding by avoiding buffer failure by setting an upper limit value and a lower limit value of the code amount generated for each picture will be described. The image coding apparatus of the present embodiment has the same configuration as that shown in FIG.

量子化制御処理では、デコーダのバッファ状態を仮想的にモデル化したものを破綻(オーバーフローまたはアンダーフロー)しないように符号化データを生成する必要がある。図７にバッファの概念図を示す。横軸は経過時間、縦軸はバッファ占有量であり、この仮想バッファを破綻しないように量子化パラメータを決定する必要がある。 In the quantization control process, it is necessary to generate encoded data so that a virtual model of the buffer state of the decoder is not broken (overflow or underflow). FIG. 7 shows a conceptual diagram of the buffer. The horizontal axis is the elapsed time, and the vertical axis is the buffer occupancy, and it is necessary to determine the quantization parameter so that this virtual buffer does not fail.

図８は、本実施例における符号化処理の流れを示すフローチャート図である。まずステップS801にて、符号化直前のバッファ占有量の状況を確認する。一般にバッファ占有量は、情報量の多いIフレームにおいてバッファレベルが下がり、その後のPまたはBフレームにおいてバッファレベルを回復するように制御することが望ましい。そこで、例えば図７に示すようにバッファ占有量がオーバーフローリミット（Overflow Limit）とアンダーフローリミット(Underflow Limit)の間で推移するように量子化パラメータを決定する。 FIG. 8 is a flowchart showing the flow of the encoding process in this embodiment. First, in step S801, the state of buffer occupancy immediately before encoding is confirmed. In general, it is desirable to control the buffer occupancy so that the buffer level decreases in an I frame with a large amount of information and then recovers in the subsequent P or B frame. Therefore, for example, as shown in FIG. 7, the quantization parameter is determined so that the buffer occupation amount changes between the overflow limit (Overflow Limit) and the underflow limit (Underflow Limit).

次にステップS802、S804にて符号化ピクチャの目標符号量と許容する発生符号量の上限値、下限値を決定する。以下、目標符号量および許容する発生符号量の上限値、下限値の決定方法について述べる。
Ｉピクチャ符号化時には、Ｉピクチャ符号化後のバッファ残量が、最小バッファレベルをなるべく下回らないように、目標符号量と許容する発生符号量の上限値と下限値を決定する。 Next, in steps S802 and S804, an upper limit value and a lower limit value of the target code amount of the encoded picture and the allowable generated code amount are determined. Hereinafter, a method for determining the upper limit value and lower limit value of the target code amount and the allowable generated code amount will be described.
At the time of I picture encoding, the upper limit value and the lower limit value of the target code amount and the allowable generated code amount are determined so that the remaining buffer capacity after the I picture encoding does not fall below the minimum buffer level as much as possible.

図９にＩピクチャの目標符号量、許容する発生符号量の上限値と下限値の決定方法を示す。Bは、Ｉピクチャの符号化直前のバッファ占有量である。Ｉピクチャでの目標符号量Tiは、実施例1と同様に過去に符号化されたＩピクチャの発生符号量とQPの平均値を乗算した値Xiから数式16を用いて算出する。次に、ステップS801にて確認したバッファ占有量が最小バッファレベル(MinLevel)付近に位置する場合には、以下のように目標符号量を決定する。まず、B-Tiが最小バッファレベル(MinLevel)を下回らない場合（図９中(a)）には、目標符号量Tiは変更せずに、許容する発生符号量の上限値をa×Ti、許容する発生符号量の下限値をb×Tiとする。ここで、係数a、bはa>1、0<b<1の値をとる。一方、B-Tiが最小バッファレベルを下回る場合（図９中(b)）は、B-MinLevelを許容する発生符号量の上限値とし、ｃ×(B-MinLevel)を目標符号量、d×(B-MinLevel)を許容する発生符号量の下限値とする。ここで、係数c、dは、1>c>d>0である。 FIG. 9 shows a method for determining the target code amount of the I picture and the upper limit value and lower limit value of the allowable generated code amount. B is the buffer occupancy immediately before encoding the I picture. The target code amount Ti for the I picture is calculated using Equation 16 from the value Xi obtained by multiplying the generated code amount of the I picture encoded in the past and the average value of QP, as in the first embodiment. Next, when the buffer occupation amount confirmed in step S801 is located near the minimum buffer level (MinLevel), the target code amount is determined as follows. First, when B-Ti does not fall below the minimum buffer level (MinLevel) ((a) in FIG. 9), the target code amount Ti is not changed, and the upper limit value of the generated generated code amount is set to a × Ti, Let b × Ti be the lower limit of the amount of generated code allowed. Here, the coefficients a and b have values of a> 1, 0 <b <1. On the other hand, when B-Ti is lower than the minimum buffer level ((b) in FIG. 9), B-MinLevel is set as the upper limit value of the generated code amount, c × (B-MinLevel) is the target code amount, d × Let (B-MinLevel) be the lower limit value of the generated code amount that is allowed. Here, the coefficients c and d are 1> c> d> 0.

次にP、Bピクチャ符号化時の目標符号量および、許容する発生符号量の上限値、下限値の決定法について説明する。
P、Bピクチャ符号化時には、P、Bピクチャの発生符号量が、アンダーフローリミットを下回らないようにP、Bピクチャの目標符号量および許容する発生符号量の上限値、許容する発生符号量の下限値を決定する。 Next, a method for determining the target code amount at the time of P and B picture encoding and the upper limit value and lower limit value of the allowable generated code amount will be described.
When encoding P and B pictures, the target code amount for P and B pictures, the upper limit value of the allowable generated code amount, and the allowable generated code amount so that the generated code amount of P and B pictures does not fall below the underflow limit. Determine the lower limit.

図１０にP、Bピクチャの目標符号量、許容する発生符号量の上限値、許容する発生符号量の下限値の決定方法を示す。Bは、P、Bピクチャの符号化直前のバッファ占有量である。P、Bピクチャでは、符号化ピクチャを符号化する前に符号化されたピクチャの予測誤差平均値と符号化ピクチャの発生符号量の相関が高いため、数式15を用いて算出された値を目標符号量Tとする。 FIG. 10 shows a method for determining the target code amount of P and B pictures, the upper limit value of the allowable generated code amount, and the lower limit value of the allowable generated code amount. B is the buffer occupancy immediately before encoding the P and B pictures. For P and B pictures, since the correlation between the prediction error average value of a picture encoded before encoding the encoded picture and the generated code amount of the encoded picture is high, the value calculated using Equation 15 is the target. The code amount is T.

次に、ステップS801にて確認したバッファ占有量がアンダーフローリミット境界付近に位置する場合には、以下のように目標符号量を決定する。まず、B-Tがバッファのアンダーフローリミット(UFL)を下回らない場合（図１０中(a)）には、目標Tは変更せずに、許容する発生符号量の上限値をa×T、許容する発生符号量の下限値をb×Tとする。ここで、係数a、bはa>1、0<b<1の値をとる。一方、B-Tがバッファのアンダーフローリミット(UFL)を下回る場合（図10中(b)）には、(B-UFL)を許容する発生符号量の上限値とし、c×(B-UFL)を目標符号量、d×(B-UFL)を許容する発生符号量の下限値とする。ここで、係数c、dは、1>c>d>0の値をとる。 Next, when the buffer occupation amount confirmed in step S801 is located near the underflow limit boundary, the target code amount is determined as follows. First, when BT does not fall below the buffer underflow limit (UFL) ((a) in FIG. 10), the target T is not changed, and the upper limit of the generated code amount to be allowed is allowed to be a × T. The lower limit value of the generated code amount is b × T. Here, the coefficients a and b have values of a> 1, 0 <b <1. On the other hand, when BT falls below the buffer underflow limit (UFL) ((b) in Fig. 10), (B-UFL) is set as the upper limit of the amount of generated code, and c x (B-UFL) is The target code amount, d × (B-UFL), is set as the lower limit value of the generated code amount. Here, the coefficients c and d have values of 1> c> d> 0.

次に、ステップS801にて確認したバッファ占有量Bが、オーバーフローリミット付近に位置する場合の処理について説明する。図１１にオーバーフローリミット付近での各ピクチャの目標符号量および許容する発生符号量の上限値、下限値の決定方法を示す。各ピクチャの目標符号量Tは、数式15または数式16を用いて算出された値を用いる。そして、B-Tがオーバーフローリミット(OFL)を上回らない場合は、目標符号量Tは変更せずに、許容する発生符号量の上限値をa×T、許容する発生符号量の下限値をb×T (図１１中(a))。ここで、係数a、bはa>1、0<b<1である。一方、B-Tがオーバーフローリミット(OFL)を上回る場合（図１１中(b)）は、B-OFLを許容する発生符号量の下限値とし、e×(B-UFL)を目標符号量、f×(B-UFL)を許容する発生符号量の上限値とする。ここで、係数e、fは、1<e<fである。 Next, processing when the buffer occupation amount B confirmed in step S801 is located near the overflow limit will be described. FIG. 11 shows a method for determining the target code amount of each picture in the vicinity of the overflow limit and the upper limit value and lower limit value of the allowable generated code amount. As the target code amount T of each picture, a value calculated using Equation 15 or Equation 16 is used. If BT does not exceed the overflow limit (OFL), the target code amount T is not changed, the upper limit value of the allowable generated code amount is a × T, and the lower limit value of the allowable generated code amount is b × T. ((A) in FIG. 11). Here, the coefficients a and b are a> 1, 0 <b <1. On the other hand, when BT exceeds the overflow limit (OFL) ((b) in FIG. 11), B-OFL is set as the lower limit value of the generated code amount, e × (B-UFL) is the target code amount, and f × Let (B-UFL) be the upper limit of the amount of generated code that is allowed. Here, the coefficients e and f are 1 <e <f.

上述した方法により、符号化ピクチャの目標符号量および許容する符号量の上限値、下限値を決定した後、目標符号量から符号化ピクチャの量子化パラメータの基準値を決定し（S803）、許容する発生符号量の上限値、下限値から符号化ピクチャの量子化パラメータの最小値、最大値をそれぞれ決定する(S805)。符号化ピクチャの量子化パラメータの基準値、最大値、最小値を決定方法は、実施例１で述べた目標符号量から基準値QPinitを求める方法と同様の処理により求めることができる。 After determining the target code amount of the encoded picture and the upper limit value and lower limit value of the allowable code amount by the method described above, the reference value of the quantization parameter of the encoded picture is determined from the target code amount (S803). The minimum value and the maximum value of the quantization parameter of the coded picture are determined from the upper limit value and lower limit value of the generated code amount (S805). The method for determining the reference value, maximum value, and minimum value of the quantization parameter of the coded picture can be obtained by the same process as the method for obtaining the reference value QPinit from the target code amount described in the first embodiment.

以上のようにして決定されたピクチャ内の量子化パラメータの基準値、最大値、最小値に基づき、マクロブロックごとの量子化パラメータを決定する。(S806)
マクロブロックごとの量子化パラメータの決定では、実施例１と同様に量子化パラメータの基準値QPinitに対して、アクティビティの大きさに応じて重み付けを行い、マクロブロックごとのQPを決定する。即ち、数式11〜数式14の処理を適用してマクロブロックごとの量子化パラメータを算出する。このとき、算出されたマクロブロックの量子化パラメータQPがステップS805によって求められたピクチャ内の量子化パラメータの最大値QPmaxを超える場合は、QP=QPmaxとし、ピクチャ内の最小値QPminを下回る場合は、QP=QPminとする。本実施例により、バッファ破綻を回避して安定した量子化制御を行うことが可能となる。
＜実施例３＞
本実施例では、各マクロブロックの量子化パラメータを決定する方法について述べる。本実施例は、上記実施例にて述べた符号化装置と同様の構成である。まず、実施例２のステップS802、S804にて述べた方法により、目標符号量および、許容する発生符号量の上限値、下限値を求める(S1201)。次に、ステップS803、S805と同様の処理を実施することにより、符号化ピクチャの量子化パラメータ基準値Qpinitおよび最大値QPmax、最小値QPminを求める。次にアクティビティ演算部113にて算出されるアクティビティ情報に基づき、アクティビティと量子化パラメータとの特性を決定する（S1202）。 Based on the reference value, maximum value, and minimum value of the quantization parameter in the picture determined as described above, the quantization parameter for each macroblock is determined. (S806)
In the determination of the quantization parameter for each macroblock, the quantization parameter reference value QPinit is weighted according to the size of the activity in the same manner as in the first embodiment, and the QP for each macroblock is determined. That is, the quantization parameter for each macroblock is calculated by applying the processing of Equations 11 to 14. At this time, if the calculated quantization parameter QP of the macroblock exceeds the maximum value QPmax of the quantization parameter in the picture obtained in step S805, QP = QPmax, and if below the minimum value QPmin in the picture QP = QPmin. This embodiment makes it possible to perform stable quantization control while avoiding buffer failure.
<Example 3>
In this embodiment, a method for determining the quantization parameter of each macroblock will be described. This embodiment has the same configuration as that of the encoding apparatus described in the above embodiment. First, the target code amount and the upper limit value and lower limit value of the allowable generated code amount are obtained by the method described in steps S802 and S804 of the second embodiment (S1201). Next, by performing the same processing as steps S803 and S805, the quantization parameter reference value Qpinit, the maximum value QPmax, and the minimum value QPmin of the encoded picture are obtained. Next, based on the activity information calculated by the activity calculation unit 113, the characteristics of the activity and the quantization parameter are determined (S1202).

具体的には、図１３に示すように量子化パラメータの最大値Qpmaxに対して符号化ピクチャを符号化する前に符号化されたピクチャのアクティビティの最大値Act_maxを、量子化パラメータの最小値QPminに対して符号化ピクチャを符号化する前に符号化されたピクチャのアクティビティの最小値Act_minを対応付ける。次にアクティビティ演算部113にて算出される符号化マクロブロックのアクティビティの大きさに応じて、量子化パラメータQPを決定する（S1203）。即ち、図１３に示すように、アクティビティの値ACTが領域Aにある場合は、QP=QPmin、領域Cにある場合には、QP=QPmaxとし、領域Bにある場合には、次式を用いて求める。 Specifically, as shown in FIG. 13, the maximum value Act_max of the activity of the encoded picture before encoding the encoded picture with respect to the maximum value Qpmax of the quantization parameter is set as the minimum value QPmin of the quantization parameter. Is associated with the minimum value Act_min of the activity of the encoded picture before encoding the encoded picture. Next, the quantization parameter QP is determined according to the activity size of the encoded macroblock calculated by the activity calculation unit 113 (S1203). That is, as shown in FIG. 13, when the activity value ACT is in the region A, QP = QPmin, when it is in the region C, QP = QPmax, and when it is in the region B, the following equation is used: Ask.

量子化パラメータQPを決定した後、アクティビティ演算部113にて算出されるアクティビティの最大値、およびアクティビティの最小値を更新する(S1205)。上記ステップS1202〜S1205の処理をピクチャ内のすべてのマクロブロックに対して実施する。なお、アクティビティの最大値、最小値は、アクティビティの値にフィルタをかけた後の値を用いてもよい。 After determining the quantization parameter QP, the activity maximum value and the activity minimum value calculated by the activity calculation unit 113 are updated (S1205). The processes in steps S1202 to S1205 are performed on all macroblocks in the picture. Note that, as the maximum value and the minimum value of the activity, values obtained by filtering the activity value may be used.

上記ステップ1202において決定するアクティビティとQP特性は、図１４に示すような特性にしても良い。即ち、符号化ピクチャの量子化パラメータ基準値QPinit、最大値QPmax、最小値QPminにおいて、量子化パラメータの基準値Qpinitに対して符号化ピクチャを符号化する前のピクチャのアクティビティ平均値Act_avgを、最大値QPmaxに対して、符号化ピクチャを符号化する前のピクチャのアクティビティ最大値ACT_maxを、最小値QPminに対して符号化ピクチャを符号化する前のピクチャのアクティビティ最小値ACT_minを対応付ける。 The activity and QP characteristics determined in step 1202 may be the characteristics shown in FIG. That is, in the quantized parameter reference value QPinit, maximum value QPmax, and minimum value QPmin of the coded picture, the activity average value Act_avg of the picture before coding the coded picture with respect to the quantization parameter reference value Qpinit is maximized. The activity maximum value ACT_max of the picture before coding the coded picture is associated with the value QPmax, and the activity minimum value ACT_min of the picture before coding the coded picture is associated with the minimum value QPmin.

そして、アクティビティ演算部113にて算出される符号化マクロブロックのアクティビティの大きさに応じて、量子化パラメータQPを決定する。即ち図１４に示すように、アクティビティの値ACTが領域Aにある場合は、QP=QPmin、領域Dにある場合には、QP=QPmaxとし、領域Bにある場合には、次式を用いて求める。 Then, the quantization parameter QP is determined according to the activity size of the encoded macroblock calculated by the activity calculation unit 113. That is, as shown in FIG. 14, when the activity value ACT is in the region A, QP = QPmin, when it is in the region D, QP = QPmax, and when it is in the region B, Ask.

また、領域Cにある場合には、次式を用いて求める。 If it is in region C, it is obtained using the following equation.

大抵の自然画像におけるアクティビティの分布の特徴としては、大抵のアクティビティ値は低い領域に集中する。本実施例により、アクティビティの高い領域と低い領域とで、アクティビティ値の変動に対する量子化パラメータの感度を変えることで、全ての領域のアクティビティ値に対して、アクティビティの差を反映した量子化パラメータの設定を行うことが可能となる。また、図１４中の点Qの位置を調整することで、ピクチャごとの発生符号量の増減が可能となる。即ち、Qが左上に移動するほど全体的にQPが高くなり発生符号量が抑制される。従って、ピクチャごとに目標符号量と発生符号量の差をフィードバックさせて符号量の調整を行うことができ、発生符号量の目標ビットレートへの追従性を向上させることが可能となる。
＜実施例４＞
本実施例においては、上記実施例における符号化処理を実行するステップ手順を記録したプログラムを作成することによりコンピュータで動作させることができる。なお、このような符号化処理を実行するプログラムを、インターネット等のネットワークを介してユーザがダウンロードして使用することができる。また記録媒体に記録して使用することができる。また、このような記録媒体としては、光ディスク、光磁気ディスク等の記録媒体に広く適用することができる。 As a feature of the distribution of activity in most natural images, most activity values are concentrated in a low area. According to the present embodiment, by changing the sensitivity of the quantization parameter to the fluctuation of the activity value between the high activity region and the low activity region, the quantization parameter that reflects the difference in the activity is reflected in the activity values of all regions. Settings can be made. Further, by adjusting the position of the point Q in FIG. 14, the amount of generated code for each picture can be increased or decreased. That is, as Q moves to the upper left, QP increases as a whole and the amount of generated code is suppressed. Accordingly, it is possible to adjust the code amount by feeding back the difference between the target code amount and the generated code amount for each picture, and to improve the followability of the generated code amount to the target bit rate.
<Example 4>
In this embodiment, the computer can be operated by creating a program that records the step procedure for executing the encoding process in the above embodiment. Note that a program that executes such encoding processing can be downloaded and used by a user via a network such as the Internet. It can also be used by being recorded on a recording medium. Such a recording medium can be widely applied to recording media such as an optical disk and a magneto-optical disk.

また、上記実施例では、入力画像の単位をピクチャとして説明してきたが、ピクチャ内を複数の分割したスライス単位で符号化する場合にも適用できる。また、インタレース画像を入力データとして符号化する場合、フィールドとして符号化する場合においても適用できる。また、上記実施例においては、マクロブロック(16×16画素)単位で処理を行う場合について説明したが、32×32画素や8×8画素など、任意の大きさのブロックで処理を行う場合にも適用できる。本発明はH.264符号化方式をはじめとして、各種の画像データ、符号化方式において、広く適用することができる。 In the above embodiment, the unit of the input image has been described as a picture. However, the present invention can also be applied to the case where encoding is performed in units of a plurality of divided slices. Further, the present invention can also be applied to the case where an interlaced image is encoded as input data and is encoded as a field. In the above embodiment, the case where processing is performed in units of macroblocks (16 × 16 pixels) has been described. However, when processing is performed using blocks of an arbitrary size such as 32 × 32 pixels or 8 × 8 pixels. Is also applicable. The present invention can be widely applied to various image data and encoding methods including the H.264 encoding method.

従来の画像符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the conventional image coding apparatus. 本発明による画像符号化装置の第１、第２、第３の実施形態を示すブロック構成図である。It is a block block diagram which shows 1st, 2nd, 3rd embodiment of the image coding apparatus by this invention. 第１の実施形態での量子化ステップの設定のための処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process for the setting of the quantization step in 1st Embodiment. 符号化処理における予測誤差と発生符号量の関係を示した概念図である。It is the conceptual diagram which showed the relationship between the prediction error and generated code amount in an encoding process. 本発明において、符号化ピクチャと目標符号量算出に用いるピクチャとの位置関係を示した図である。In this invention, it is the figure which showed the positional relationship of the encoding picture and the picture used for target code amount calculation. 本発明において、目標符号量から量子化パラメータを算出する決定する方法を示した図である。In this invention, it is the figure which showed the method to determine which calculates a quantization parameter from target code amount. バッファ占有量の推移を示したバッファの概念図である。It is the conceptual diagram of the buffer which showed transition of the buffer occupation amount. 第２の実施形態での量子化ステップの設定のための処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process for the setting of the quantization step in 2nd Embodiment. 第２の実施形態におけるアンダーフロー境界でのIピクチャの目標符号量および許容する発生符号量の決定方法を示した図である。It is the figure which showed the determination method of the target code amount of I picture in the underflow boundary in 2nd Embodiment, and the generation | occurrence | production code amount to accept | permit. 第２の実施形態におけるアンダーフロー境界でのP、Bピクチャの目標符号量および許容する発生符号量の決定方法を示した図である。It is the figure which showed the determination method of the target code amount of P and B pictures in the underflow boundary in 2nd Embodiment, and the generation | occurrence | production code amount to accept | permit. 第２の実施形態におけるオーバーフロー境界での各ピクチャの目標符号量および許容する発生符号量の決定方法を示した図である。It is the figure which showed the determination method of the target code amount of each picture and the allowable generated code amount in the overflow boundary in 2nd Embodiment. 第３の実施形態での量子化ステップの設定のための処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process for the setting of the quantization step in 3rd Embodiment. 第３の実施形態おけるアクティビティと量子化パラメータの特性を示した図である。It is the figure which showed the characteristic of the activity and quantization parameter in 3rd Embodiment. 第３の実施形態おけるアクティビティと量子化パラメータの特性を示した図である。It is the figure which showed the characteristic of the activity and quantization parameter in 3rd Embodiment. 本発明における予測誤差と発生符号量の関係を決定する対応表の一例である。It is an example of the correspondence table which determines the relationship between the prediction error and the generated code amount in the present invention.

Explanation of symbols

101…原画像メモリ、
102、109…加算器、
103…直行変換部、
104…量子化部、
105…符号化部、
106…バッファ、
107…逆量子化部、
108…逆直行変換部、
110…フレームメモリ、
111…動き検出・動き補償部、
112…量子化制御部、
113…アクティビティ演算部、
114…イントラ予測部、
201…シーンチェンジ検出部、
202…ピクチャタイプ判定部、
203…予測誤差情報更新部、
204…目標符号量決定部。
101 ... Original image memory,
102, 109 ... adder,
103… Direct conversion part,
104 ... Quantization part,
105: Encoding unit,
106 ... buffer,
107: Inverse quantization unit,
108 ... Inverse transformation unit,
110… Frame memory,
111: Motion detection / compensation unit,
112 ... Quantization control unit,
113… Activity calculation part,
114 ... Intra prediction part,
201 ... Scene change detection unit,
202 ... Picture type determination unit,
203 ... Prediction error information update unit,
204: Target code amount determination unit.

Claims

An image encoding device that encodes moving image information including a series of pictures,
Original image storage means for storing an original image of a picture input from the outside;
Based on the original image, intra prediction means for performing intra prediction and inter prediction means for performing inter prediction,
Feature amount calculating means for calculating a feature amount of the picture to be encoded,
Prediction error information updating means for updating prediction error information based on the difference between the prediction image generated by the intra prediction means or the inter prediction means and the original image;
Target code amount determining means for determining a target code amount for each picture to be encoded according to the updated prediction error information and the generated code amount of the encoded picture;
A quantization control unit configured to determine a quantization width of each of the plurality of blocks to be encoded based on the feature amount and the target code amount in the picture to be encoded divided into a plurality of blocks; An image encoding device.

The image encoding device according to claim 1,
A buffer for storing the generated code amount;
The image coding apparatus, wherein the target code amount is determined based on a storage capacity state of the buffer and prediction error information.

The image encoding device according to claim 1,
The image coding apparatus according to claim 1, wherein the prediction error information is a value calculated based on a past picture coded before the picture to be coded.

The image encoding device according to claim 1,
Means for multiplying a generated code amount of the past picture by an average value of quantization widths in the picture;
When the picture to be coded is intra prediction, an image coding apparatus is characterized in that a target code amount is determined based on the multiplication value.

The image encoding device according to claim 1,
Means for formulating a relationship between a prediction error of the past picture and a generated code amount of the picture to be encoded;
When the picture to be encoded is inter-screen prediction, a target code amount is determined based on the relational expression.

In any one of the image coding apparatuses of Claim 1 thru | or 5,
A scene change detecting means for detecting a difference value between an attribute of the picture to be encoded and an attribute of a picture encoded immediately before the picture to be encoded, and detecting the presence or absence of a scene change;
When it is determined that the difference value is smaller than a predetermined threshold, a target code amount is determined based on a relational expression between prediction error information of the past picture and a generated code amount of the picture to be encoded. Image encoding device.

The image encoding device according to claim 6, wherein
When the scene change is detected, an image coding apparatus, wherein a relational expression between prediction error information of the past picture and a generated code amount of the picture to be coded is updated.

The image encoding device according to claim 6, wherein
An image encoding apparatus comprising: a picture type determination unit that determines a type of a picture to be encoded.

The image encoding device according to claim 2, wherein
An image encoding apparatus, wherein the target code amount is determined by a relational expression prepared in advance based on a relationship between a prediction error of a picture encoded immediately before the picture to be encoded and a generated code amount.

The image encoding device according to claim 1,
Provide an allowable range of generated code amount of the picture to be encoded,
An image encoding apparatus that determines an upper limit value and a lower limit value of the quantization width based on the allowable range.

The image encoding device according to claim 10.
Means for detecting a generated code amount situation for each picture;
The image encoding device, wherein the allowable range is determined based on the generated code amount situation.

The image encoding device according to claim 1,
An image encoding apparatus, wherein the maximum value, the minimum value, and the average value of the feature amount are calculated in units of pictures.

The image encoding device according to claim 12, wherein
An image code characterized by determining the quantization width of the block to be encoded by associating the maximum value and minimum value of the quantization width with the maximum value and minimum value of the feature amount, respectively. Device.

The image encoding device according to claim 12, wherein
Determining the quantization width of the block to be encoded by associating the maximum value, minimum value, and reference value of the quantization width with the maximum value, minimum value, and average value of the feature amount, respectively; An image encoding device characterized by the above.

A video encoding program having a step of encoding video information consisting of a series of pictures,
Original image storage means for storing an original image of a picture input from the outside;
Based on the original image, an intra prediction step for performing intra-picture prediction and an inter prediction step for performing screen prediction;
Calculating a feature amount for calculating a feature amount of the picture to be encoded, and
Updating prediction error information based on the difference between the prediction image generated by the intra prediction step or the inter prediction step and the original image;
Determining a target code amount for determining a target code amount for each picture to be encoded, according to the updated prediction error information and a generated code amount of the encoded picture;
A moving image for causing a computer to execute a step of controlling a quantization width of each of the plurality of blocks to be encoded based on the feature amount and the target code amount in the picture to be encoded divided into a plurality of blocks Encoding program.

In the moving image encoding program according to claim 15,
A moving picture encoding program comprising a picture type determination step for determining a type of picture to be encoded.

In the moving image encoding program according to claim 16,
Detecting a difference value between an attribute of the picture to be encoded and an attribute of a picture encoded immediately before the picture to be encoded, and detecting the presence or absence of a scene change,
And determining a target code amount based on a relational expression between the prediction error information of the past picture and the generated code amount of the picture to be encoded when it is determined that the difference value is larger than a predetermined threshold. A moving image encoding program as a feature.