JP5272940B2

JP5272940B2 - Image encoding device

Info

Publication number: JP5272940B2
Application number: JP2009168783A
Authority: JP
Inventors: 章弘屋森; 幸二山田; 潔酒井
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2009-07-17
Filing date: 2009-07-17
Publication date: 2013-08-28
Anticipated expiration: 2020-11-07
Also published as: JP2009239969A

Abstract

PROBLEM TO BE SOLVED: To provide an image encoding apparatus that applies predictive encoding to an input image and adaptively controls encoding parameters in units of pictures so as to achieve high-efficiency encoding.

SOLUTION: The image encoding apparatus, which applies predictive encoding to an input image, is provided with: an acquisition means that acquires at least one of first statistical information denoting the properties of an input image read from a frame memory 1 (for example, by an input image information statistics acquisition unit 14), second statistical information based on correlation information in the process of predictive encoding (for example, by a motion information statistics acquisition unit 15), and third statistical information obtained as a result of encoding or in the process of encoding (for example, by an encoded information statistics acquisition unit 16); and a control means, such as an encoding control unit 12, that uses a scene determination unit 17 or the like to determine the scene on the basis of the statistical information acquired by the acquisition means, thereby adaptively controlling the encoding parameters of an encoder 6 in units of frames or fields.

COPYRIGHT: (C)2010, JPO&INPIT

Description

The present invention relates to an image encoding apparatus that performs high-efficiency encoding by switching encoding parameters in units of pictures on the basis of the properties of the input image and statistical information obtained during the encoding process.

MPEG-2 (Moving Picture Experts Group-2) has been internationally standardized as one of the moving-image encoding schemes and is applied to fields such as DVD (Digital Video Disc / Digital Versatile Disc) video content and digital broadcasting. Before MPEG-2 was internationally standardized, MPEG-1 had been standardized as an encoding scheme targeting recording media such as CD-ROM (Video CD) and lines of up to about 1.5 Mbps.

Compared with the earlier MPEG-1, MPEG-2 incorporates measures for improving image quality. Typical examples are the encoding of higher-resolution images and, as a measure for interlaced image encoding, encoding with field-aware motion prediction. Encoding has also become more general-purpose. These functions are multiplexed into the stream (data sequence) as additional functions (extensions). Basically, MPEG-2 is upward compatible with MPEG-1, and among the options that can be specified in each MPEG-2 extension layer, one is the same as in MPEG-1.

FIG. 9 shows an example of picture_coding_extension, one of the extension layers in MPEG-2. Some of its fields are as follows:
f_code: range of representable motion vectors,
intra_dc_precision: precision of the DCT DC component of intra-coded blocks,
picture_structure: coding structure,
top_field_first: input order of the fields within one frame,
frame_pred_frame_dct: flag restricting coding to frame prediction and frame DCT,
q_scale_type: quantization scale type,
intra_vlc_format: selection of the variable-length code table for DCT coefficients,
alternate_scan: input order of DCT coefficients to the variable-length coding unit,
progressive_frame: selection of whether the input signal is progressive.

In the MPEG standard, when the quantization coefficient has 5 bits and its step width is 1, the quantization level takes an integer value in the range 1 to 31. A means has therefore been proposed in which the step width is set to, for example, the standard value of 1, 0.25 (one quarter of the standard value), or 4 (four times the standard value), so that the quantization level can be selected from one of the ranges 0.25 to 7.75, 1 to 31, and 4 to 124 (see, for example, Patent Document 1).

Patent Document 1: Japanese Unexamined Patent Application Publication No. H7-131789 (JP-A-7-131789)

The parameters selectable in the extension layers added to MPEG-2 are basically intended to improve efficiency in the kinds of moving-image encoding that MPEG-1 handled poorly. Depending on the properties of the input image and the encoding rate, however, there are cases where adaptively using the MPEG-1 encoding parameters yields more efficient encoding. In conventional image encoding apparatuses, these extension-layer parameters are set to preset fixed values when predictive encoding of the input image is performed. The image encoding means proposed in Patent Document 1 performs quantization by selecting the step width of a quantization coefficient with a fixed number of bits, but it is insufficient for performing optimal encoding according to the properties of the input image.

An object of the present invention is to provide an encoding means that can change encoding parameters picture by picture according to the properties of the input image, and that yields high-quality reproduced images at the same encoding rate.

The image encoding apparatus of the present invention is an image encoding apparatus having an encoding means that performs predictive encoding of an input image, and comprises: an acquisition means that acquires at least one of first statistical information indicating the properties of the input image, second statistical information based on correlation information in the process of predictive encoding, and third statistical information obtained from the encoding result or in the encoding process based on the encoding parameters; and an encoding control means that performs scene determination on the basis of the statistical information acquired by the acquisition means and adaptively controls the encoding parameters of the encoding means in units of frames or fields. The acquisition means is configured to acquire, as the third statistical information, the average value of the effective coefficients after quantization per macroblock in a picture. The encoding control means has a control configuration that switches the variable-length code table in the encoding means, and causes variable-length encoding to be performed, on the basis of a scene determination result obtained by comparing the average value of the effective coefficients with a preset coefficient.

The acquisition means may also be configured to acquire, as the second statistical information, the average and variance of the horizontal component and the average and variance of the vertical component of the motion vectors, and the encoding means may have a control configuration that switches the input scan order for variable-length encoding in the encoding means on the basis of a scene determination result obtained by comparing the second statistical information with a preset coefficient.

The acquisition means may also be configured to acquire an average quantization value as the third statistical information, and the encoding control means may have a control configuration that switches the quantization table in the encoding means, and causes quantization to be performed, on the basis of a scene determination result obtained by comparing the average quantization value with a preset coefficient.

The present invention uses at least one, or several, of the following to adaptively control the encoding parameters of the encoding means in units of frames or fields: first statistical information such as the activity indicating the properties of the input image; second statistical information such as motion vectors, which represent correlation information in the process of predictive encoding; and third statistical information such as the amount of encoded information in the encoding result based on the encoding parameters, or the average quantization value in the encoding process. This has the advantage that encoding suited to the input image can be performed without increasing the amount of encoded information.

FIG. 1 is an explanatory diagram of Embodiment 1 of the present invention.
FIG. 2 is an explanatory diagram of motion search.
FIG. 3 is a flowchart of the determination processing in the embodiment of the present invention.
FIG. 4 is an explanatory diagram of a variable-length code table.
FIG. 5 is an explanatory diagram of a variable-length code table.
FIG. 6 is an explanatory diagram of variable-length code table selection.
FIG. 7 is an explanatory diagram of the input scan.
FIG. 8 is an explanatory diagram of a quantization table.
FIG. 9 is an explanatory diagram of an example of the additional functions.

Described with reference to FIG. 1, the image encoding apparatus of the present invention is an image encoding apparatus having an encoding means that performs predictive encoding of an input image, and comprises: an acquisition means that acquires at least one of first statistical information indicating the properties of the input image, second statistical information based on correlation information in the process of predictive encoding, and third statistical information obtained from the encoding result or in the encoding process based on the encoding parameters, the acquisition means including, for example, an input image information statistics acquisition unit 14, a motion information statistics acquisition unit 15, and an encoded information statistics acquisition unit 16; and an encoding control means that performs scene determination on the basis of the statistical information acquired by the acquisition means and adaptively controls the encoding parameters of the encoding means in units of frames or fields. The acquisition means is configured to acquire, as the third statistical information, the average value of the effective coefficients after quantization per macroblock in a picture. The encoding control means has a control configuration that switches the variable-length code table in the encoding means, and causes variable-length encoding to be performed, on the basis of a scene determination result obtained by comparing the average value of the effective coefficients with a preset coefficient.

FIG. 1 is an explanatory diagram of Embodiment 1 of the present invention, in which 1 is a frame memory, 2 is an original-image MB (macroblock) reading unit, 3 is a reference block reading unit, 4 is a motion vector searcher, 5 is a prediction determination unit, 6 is an encoder, 7 is a local decoder, 8 and 9 are switching units, 10 is an adder, 11 is a subtractor, 12 is an encoding control unit, 13 is a header information generation unit, 14 is an input image information statistics acquisition unit, 15 is a motion information statistics acquisition unit, 16 is an encoded information statistics acquisition unit, and 17 is a scene determination unit.

The frame memory 1 includes an area for storing input image information and an area for storing reference image information. The original-image MB reading unit 2 reads the input image area in units of macroblocks, the reference block reading unit 3 reads macroblocks within the search range of the reference image area, and the motion vector searcher 4 obtains a motion vector and inputs it to the prediction determination unit 5. The encoder 6 includes the functions of orthogonal transform by DCT (Discrete Cosine Transform), quantization, and variable-length encoding. In the MPEG-2 scheme, the macroblock (MB) size is 16 × 16 pixels, and the block on which the DCT is performed is 8 × 8 pixels, obtained by dividing the macroblock into four.

The switching units 8 and 9 are switched to the adder 10 and subtractor 11 side for intra-frame and inter-frame encoding, and to the side opposite the adder 10 and subtractor 11 for intra-field and inter-field encoding. The local decoder 7 includes the functions of inverse quantization and inverse DCT. It performs decoding using the quantized output of the encoder 6 taken from the stage before variable-length encoding, reconstructs the reference image, and stores it in the reference image information area of the frame memory 1.

The DCT in the encoder 6 is a two-dimensional DCT and, as described above, is performed on blocks of 8 × 8 pixels in MPEG-2. Letting the block be f(x, y) and the coefficients of the DCT result be F(u, v), the transform is given by equation (1).
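The image of equation (1) is not reproduced in this text. For reference, the standard two-dimensional DCT for an 8 × 8 block, written with the symbols f(x, y) and F(u, v) defined above, has the following form. The normalization function C(·) is notation introduced here and is an assumption about how the original equation was written.

```latex
F(u,v) = \frac{1}{4}\, C(u)\, C(v) \sum_{x=0}^{7} \sum_{y=0}^{7}
         f(x,y)\, \cos\frac{(2x+1)u\pi}{16}\, \cos\frac{(2y+1)v\pi}{16},
\qquad
C(k) = \begin{cases} 1/\sqrt{2}, & k = 0 \\ 1, & k \neq 0 \end{cases}
```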

This DCT converts the image information of each block into frequency components. Because the significant components concentrate on the low-frequency side, the amount of encoded information can be reduced. Let the DCT coefficients of a block be Rec[x], the quantization scale Qs, the quantization matrix value Qm[x], the quantization result Level[x], and the intermediate value Level'[x]. The computations that yield the DC component (Intra DC) and the AC component (Intra AC) of the quantization result in intra coding, and the quantization result in non-intra coding (Non Intra), can then be expressed by equation (2). The values p = 3 and q = 4 are generally used for p and q in the equation. "//" denotes division with the result rounded to the nearest integer, and "/" denotes division with the fraction truncated.
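Equation (2) itself is not reproduced in this text. The sketch below shows one plausible reading, modeled on the quantizer of the MPEG-2 Test Model 5, which uses the same p, q, "//" and "/" conventions. The function names, the DC divisor of 8, and rounding halves away from zero are assumptions made for illustration, not the patent's stated formulas.

```python
def div_round(a: int, b: int) -> int:
    """'//' in the text: divide and round to the nearest integer.
    Halves are rounded away from zero (an assumption)."""
    sign = -1 if (a < 0) != (b < 0) else 1
    return sign * ((2 * abs(a) + abs(b)) // (2 * abs(b)))


def div_trunc(a: int, b: int) -> int:
    """'/' in the text: divide and discard the fraction (toward zero)."""
    sign = -1 if (a < 0) != (b < 0) else 1
    return sign * (abs(a) // abs(b))


def quant_intra_dc(rec: int, dc_step: int = 8) -> int:
    """Intra DC: a fixed step (8 is assumed here, i.e. 8-bit DC precision)."""
    return div_round(rec, dc_step)


def quant_intra_ac(rec: int, qm: int, qs: int, p: int = 3, q: int = 4) -> int:
    """Intra AC: Level' = (16 * Rec) // Qm, then add a rounding offset
    (p * Qs) // q in the direction of the sign and divide by 2 * Qs."""
    level_prime = div_round(16 * rec, qm)
    sign = (level_prime > 0) - (level_prime < 0)
    return div_trunc(level_prime + sign * div_round(p * qs, q), 2 * qs)


def quant_non_intra(rec: int, qm: int, qs: int) -> int:
    """Non-intra: same scaling, but no rounding offset (dead zone)."""
    return div_trunc(div_round(16 * rec, qm), 2 * qs)
```

With Qm = 16 and Qs = 4, a coefficient of 100 gives Level' = 100 and an offset of 3, so the intra AC path truncates 103 / 8 while the non-intra path truncates 100 / 8.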

To perform inter-frame encoding, inter-frame difference information is needed. For this purpose, the local decoder 7 decodes the encoded data by inverse quantization and inverse DCT, reconstructs the image, and stores it in the frame memory 1 as a reference image. As described above, the local decoder 7 includes inverse quantization and inverse DCT processing functions; inverse quantization can be performed by the processing shown in equation (3), and the inverse DCT by the processing shown in equation (4).

When an input image is actually encoded, the first picture (frame or field) has no picture to refer to, so it is intra-picture encoded, and inter-picture encoding is performed from the next picture onward. Intra-picture encoding also serves as a periodic refresh and is generally performed at predetermined intervals. In inter-picture encoding, the motion vector searcher 4 performs motion prediction. For example, as shown in FIG. 2, it compares the macroblock 22 of the original image 21 with the macroblocks within the search range 24 of the reference image 23, searches for the position at which the accumulated absolute difference over the pixels is smallest, obtains the motion vector, and multiplexes it into the encoded information.
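The search for the position with the smallest accumulated absolute difference can be sketched as an exhaustive block-matching search. This is a minimal illustration only: the function names, the full-search strategy, the list-of-lists image layout, and the small synthetic test images are all assumptions, and a real motion vector searcher would use 16 × 16 macroblocks and a hardware-friendly search pattern.

```python
def sad(cur, ref, bx, by, dx, dy, block):
    """Sum of absolute differences between the current block at (bx, by)
    and the reference block displaced by (dx, dy)."""
    total = 0
    for y in range(block):
        for x in range(block):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def find_motion_vector(cur, ref, bx, by, block=16, search=7):
    """Full search over a window of +/- search pixels around the zero vector.
    Returns (dx, dy, sad) for the first candidate with the smallest SAD."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            x0, y0 = bx + dx, by + dy
            # Skip candidates whose block would fall outside the reference image.
            if x0 < 0 or y0 < 0 or x0 + block > width or y0 + block > height:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, block)
            if best is None or cost < best[2]:
                best = (dx, dy, cost)
    return best


def pattern(x, y):
    """Synthetic luminance pattern used only for the demonstration."""
    return x * x + 3 * y * y + x * y


# The current picture is the reference shifted by 2 pixels in x and 1 in y.
ref = [[pattern(x, y) for x in range(12)] for y in range(12)]
cur = [[pattern(x + 2, y + 1) for x in range(12)] for y in range(12)]
result = find_motion_vector(cur, ref, bx=4, by=4, block=4, search=2)
```

Here `result` holds the displacement into the reference image that reproduces the current block exactly, together with its SAD.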

The functional parts denoted by reference numerals 1 to 11 in FIG. 1 constitute the encoding means, and the encoding control unit 12, which controls the encoding parameters of the encoder 6, constitutes the encoding control means. The header information generation unit 13 generates header information including the encoding parameters, and this header information is added to the picture-by-picture encoded data from the encoder 6 and sent out.

The input image information statistics acquisition unit 14 is a means for acquiring the first statistical information indicating the properties of the input image. For example, it obtains statistical information on the luminance signal (activity) as feature information of the picture to be encoded that is stored in the frame memory 1. In this case, the luminance values of the pixels in the input frame are accumulated and divided by the number of accumulated pixels to obtain the frame luminance average, which can be used as the first statistical information. That is, letting the set of pixels in the frame be U, the luminance be Pixel_i, the number of pixels be Num_i, the frame luminance average be AveY, and the frame luminance variance be VarY, these are expressed by equations (5) and (6). A in equation (6) denotes the frame luminance average AveY.
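Equations (5) and (6) are not reproduced in this text. Based on the description (accumulate the luminance values and divide by the pixel count, then average the squared deviations from that mean), they can be sketched as follows. The function name and the flat list of luminance samples are assumptions for illustration.

```python
def luminance_stats(pixels):
    """Return (AveY, VarY) for the luminance samples of one frame.

    AveY = sum(Pixel_i) / Num_i
    VarY = sum((Pixel_i - AveY) ** 2) / Num_i   (population variance)
    """
    num = len(pixels)
    ave_y = sum(pixels) / num
    var_y = sum((p - ave_y) ** 2 for p in pixels) / num
    return ave_y, var_y
```

A flat frame gives VarY close to zero, which is the low-activity case discussed later when choosing the variable-length code table.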

The motion information statistics acquisition unit 15 is a means for acquiring the second statistical information indicating correlation information between frames or between fields. For example, the motion vectors obtained for each macroblock by the motion vector searcher 4 are accumulated and divided by the number of macroblocks to obtain the average motion vector. Alternatively, the sum of squared differences between each motion vector and the average is divided by the number of macroblocks to obtain the variance of the motion vectors.

That is, letting the set of macroblocks in the frame be V, the horizontal and vertical components of each motion vector be VecH_i and VecV_i, the horizontal component average be AveHV, the horizontal component variance be VerHV, the vertical component average be AveVV, and the vertical component variance be VerVV, these are expressed by equations (7) to (10).

AH in equation (8) denotes the horizontal component average AveHV, and AV in equation (10) denotes the vertical component average AveVV.
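Equations (7) to (10) are not reproduced in this text. Following the verbal description (accumulate per-macroblock vectors, divide by the macroblock count, and average the squared deviations), a minimal sketch is given below. The function name, the tuple layout of the vectors, and the dictionary return value are assumptions for illustration.

```python
def motion_vector_stats(vectors):
    """vectors: list of (VecH_i, VecV_i), one motion vector per macroblock.

    Returns the four statistics named in the text:
    AveHV, VerHV (horizontal mean and variance) and
    AveVV, VerVV (vertical mean and variance).
    """
    n = len(vectors)
    ave_hv = sum(h for h, _ in vectors) / n
    ave_vv = sum(v for _, v in vectors) / n
    ver_hv = sum((h - ave_hv) ** 2 for h, _ in vectors) / n
    ver_vv = sum((v - ave_vv) ** 2 for _, v in vectors) / n
    return {"AveHV": ave_hv, "VerHV": ver_hv, "AveVV": ave_vv, "VerVV": ver_vv}
```

A picture in which every macroblock moves by nearly the same amount, such as a camera pan, shows a large mean with a small variance.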

The encoded information statistics acquisition unit 16 is a means for acquiring the third statistical information in the encoding process. For example, it accumulates the information resulting from encoding each macroblock and obtains the amount of generated information, the average quantization value, and so on. Letting the set of macroblocks in the frame be V, the amount of information generated by each macroblock be Bit_i, the amount of information generated by the picture be SumB, the quantization scale value of each macroblock be Qs_i, and the average quantization value be AveQ, the picture's generated information amount SumB and average quantization value AveQ are expressed by equations (11) and (12).

Letting the effective coefficients of each macroblock after quantization be Coef_i and the average value of effective coefficients per macroblock in the picture be AveC, the average AveC is expressed by equation (13).
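Equations (11) to (13) are likewise not reproduced here. From the definitions above they amount to a sum and two per-macroblock averages, sketched below. The function name and argument layout are assumptions, and Coef_i is taken to be the count of nonzero quantized coefficients in macroblock i, which is one reading of "effective coefficients".

```python
def coding_stats(bits, qscales, coefs):
    """Per-picture coding statistics from per-macroblock values.

    bits[i]    : Bit_i, bits generated by macroblock i
    qscales[i] : Qs_i, quantization scale used for macroblock i
    coefs[i]   : Coef_i, effective (assumed: nonzero) coefficients in macroblock i

    Returns (SumB, AveQ, AveC).
    """
    n = len(bits)
    sum_b = sum(bits)
    ave_q = sum(qscales) / n
    ave_c = sum(coefs) / n
    return sum_b, ave_q, ave_c
```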

The scene determination unit 17 performs scene determination on the basis of at least one of the first, second, and third statistical information. It uses one or more of the following: the frame luminance average AveY and variance VarY from the input image information statistics acquisition unit 14, which acquires the first statistical information; the horizontal component average AveHV, horizontal component variance VerHV, vertical component average AveVV, and vertical component variance VerVV from the motion information statistics acquisition unit 15, which acquires the second statistical information; and the picture generated information amount SumB, average quantization value AveQ, effective coefficient average AveC, and so on from the encoded information statistics acquisition unit 16, which acquires the third statistical information. With these it determines, for example, whether the scene has intense motion or flat luminance, and the encoding control unit 12 adaptively controls the encoding parameters so that the encoder 6 encodes the input image accordingly.

FIG. 3 is a flowchart of the determination processing in the embodiment of the present invention. The input image is read from the frame memory 1 (a1), the header information generation unit 13 generates a header for each picture (a2), the motion vector searcher 4 performs motion search (a3), and the encoder 6 performs MB (macroblock) encoding (a4). It is then determined whether the end of the picture has been reached, that is, whether one picture has been completed (a5). If not, the process returns to step (a3); if so, the necessary information is acquired (a10).

Meanwhile, the input image information statistics acquisition unit 14 acquires the first statistical information (a6), the motion information statistics acquisition unit 15 acquires, as the second statistical information, statistics based on the motion vector search results (a7), and the encoded information statistics acquisition unit 16 acquires the third statistical information from the encoding result of the encoder 6 or from the encoding process (a8). Where averages are required, averaging is performed (a9). The scene determination unit 17 takes in the averaged results as the necessary information (a10) and determines whether a predetermined condition is satisfied (a11). Depending on the result, encoding parameter 1 (a12) or encoding parameter 2 (a13) is selected. It is then determined whether encoding has ended (a14); if not, the process returns to step (a1). In the determination step (a11), it is also possible to switch among a larger number of encoding parameters according to multiple kinds of determination conditions.
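The control flow of FIG. 3 can be sketched as a loop in which the statistics gathered while encoding one picture select the parameters for the next. This is only one reading of the flowchart: the function and argument names are hypothetical, the per-picture work (a2 to a9) is hidden behind a caller-supplied `encode_picture`, and the choices that parameter 1 is used for the first picture and when the condition holds are assumptions.

```python
def encode_sequence(pictures, encode_picture, condition, param_1, param_2):
    """Encode pictures in order, switching parameters picture by picture.

    encode_picture(picture, params) -> stats   (steps a2 to a9)
    condition(stats) -> bool                   (step a11)
    Returns the list of parameters actually used for each picture.
    """
    params = param_1          # assumption: start with parameter 1
    used = []
    for picture in pictures:  # a1: read the next input picture
        stats = encode_picture(picture, params)
        used.append(params)
        # a11 to a13: choose the parameters for the following picture.
        params = param_1 if condition(stats) else param_2
    return used               # a14: all pictures processed


# Toy demonstration: the "picture" is just its own AveC value.
def fake_encode(picture, params):
    return {"AveC": picture}


used = encode_sequence(
    pictures=[5, 20, 3],
    encode_picture=fake_encode,
    condition=lambda stats: stats["AveC"] < 10,
    param_1="table0",
    param_2="table1",
)
```

Passing the condition as a function keeps the loop unchanged if several determination conditions are combined, as the text notes is possible at step (a11).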

For example, when intra_vlc_format (selection of the variable-length code table for DCT coefficients) is adaptively controlled as a parameter of the extension (picture_coding_extension) shown in FIG. 9, intra_vlc_format = 0 and intra_vlc_format = 1 select table = 0 of FIG. 4 and table = 1 of FIG. 5, respectively. FIGS. 4 and 5 each show part of a variable-length code table containing the variable-length code, the run, and the level. The final bit s is the sign of the level: 0 indicates positive and 1 indicates negative. The code 1s indicates the first DCT coefficient of a block, and 11s indicates the next DCT coefficient.

Compared with table = 0 of FIG. 4, table = 1 of FIG. 5 assigns variable-length codes so that moderately short code lengths are distributed more evenly. Consequently, when a block contains many effective coefficients, it is more effective to select table = 1 of FIG. 5 for variable-length encoding. In practice, the number of effective coefficients decreases as the average quantization value increases, and it also decreases for flat images with low activity. Even when the average quantization value is large, the number of effective coefficients is small if the activity is small and, conversely, large if the activity is large.

FIG. 6 is an explanatory diagram of variable-length code table selection. When intra_vlc_format = 0, table = 0 is selected for both intra blocks and non-intra blocks. When intra_vlc_format = 1, table = 1 is selected for intra blocks and table = 0 for non-intra blocks, and variable-length encoding is performed accordingly.

Using the average quantization value AveQ obtained by equation (12) and VarY obtained by equation (6) as the activity of the input image, if the condition
AveQ > VarY * α₁ + β₁ … (14)
is satisfied, the number of effective coefficients is small, so table = 0 is selected; if it is not satisfied, table = 1 is selected. The table is switched adaptively in this way. Here α₁ and β₁ are weighting coefficients.

To simplify further, the average value AveC of the effective coefficients obtained by equation (13) can be used. It is determined whether the condition
AveC < α₂ … (15)
is satisfied; if so, table = 0 is selected, and if not, table = 1 is selected, so that variable-length encoding is performed under adaptive switching control. Here α₂ is a coefficient. That is, when the average value AveC of effective coefficients per macroblock is smaller than the preset coefficient α₂, this corresponds to the case where the activity of the input image is small, and switching to table = 0 for variable-length encoding gives better encoding efficiency.
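Conditions (14) and (15), together with the table mapping of FIG. 6, can be sketched as three small functions. The function names are hypothetical, and the thresholds in the usage example are placeholder values, since the text gives no numeric values for α₁, β₁, or α₂.

```python
def intra_vlc_format_from_aveq(ave_q, var_y, alpha1, beta1):
    """Condition (14): if AveQ > VarY * alpha1 + beta1, few effective
    coefficients are expected, so choose format 0; otherwise format 1."""
    return 0 if ave_q > var_y * alpha1 + beta1 else 1


def intra_vlc_format_from_avec(ave_c, alpha2):
    """Condition (15): if AveC < alpha2, choose format 0; otherwise format 1."""
    return 0 if ave_c < alpha2 else 1


def vlc_table(intra_vlc_format, is_intra):
    """Mapping described for FIG. 6: table 1 is used only for intra blocks
    when intra_vlc_format = 1; every other case uses table 0."""
    return 1 if (intra_vlc_format == 1 and is_intra) else 0


# Placeholder thresholds chosen only to exercise both branches.
fmt = intra_vlc_format_from_avec(ave_c=12.0, alpha2=8.0)
table_for_intra = vlc_table(fmt, is_intra=True)
table_for_non_intra = vlc_table(fmt, is_intra=False)
```

Condition (15) needs only one statistic and one threshold, which is why the text presents it as the simpler alternative to (14).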

The parameter alternate_scan sets the order in which DCT coefficients are input to variable-length coding. When alternate_scan = 0, the DCT coefficients are scanned in the zigzag scan (scan type 0) shown in FIG. 7(A); when alternate_scan = 1, they are scanned in the alternate scan (scan type 1) shown in FIG. 7(B). Compared with scan type 0 in (A), scan type 1 in (B) preferentially encodes the coefficients of the vertical frequency components. A large number of effective coefficients arise in the vertical components when, for example, the odd and even field images differ greatly in the coding of an interlaced image, that is, when there is motion such as panning or tilting.
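To illustrate what a scan order does, the sketch below generates the conventional zigzag order for an 8x8 block and reads a block out in that order. It covers only the zigzag case; the alternate scan order is defined by a fixed table (FIG. 7(B) and the MPEG-2 standard) and is deliberately not reproduced here. The function names are hypothetical.

```python
def zigzag_order(n: int = 8) -> list[tuple[int, int]]:
    """Return (row, col) positions of an n x n block in conventional zigzag order."""
    def key(rc):
        r, c = rc
        s = r + c  # index of the anti-diagonal the position lies on
        # Odd diagonals run top-to-bottom, even diagonals bottom-to-top.
        return (s, r if s % 2 else -r)
    return sorted(((r, c) for r in range(n) for c in range(n)), key=key)


def scan_block(block, order):
    """Flatten a 2-D block of coefficients into a 1-D list following `order`."""
    return [block[r][c] for r, c in order]


order = zigzag_order()
raster_indices = [r * 8 + c for r, c in order]
```

Because the zigzag order visits horizontal and vertical frequencies symmetrically, a block whose energy is concentrated in the vertical components yields longer zero runs between effective coefficients than a scan that visits the vertical components earlier.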

MPEG-2 allows inter-field motion prediction that takes fields into account, and it allows field DCT, in which the blocks input to the DCT are cut out of a macroblock on every other line. The motion vector search range is generally a predetermined range 24 centered on the zero vector, as shown in FIG. 2. Consequently, when field prediction is performed and the motion is so large that the search range is insufficient, the distribution of effective coefficients shifts toward the vertical frequency components.

Therefore, using the horizontal component average value AveHV of equation (7) and the horizontal component variance VerHV of equation (8), it is determined whether the conditions
AveHV > α₃ … (16)
VerHV < β₃ … (17)
hold. When they hold, that is, for large motion that is reasonably uniform in the horizontal direction, the vertical components increase, so alternate_scan = 1 is selected; otherwise alternate_scan = 0 is selected, and variable-length coding is performed under this adaptive control. Here α₃ and β₃ are constants; because the motion search range generally differs from device to device, they can be set in advance based on that range.
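Conditions (16) and (17) together amount to a two-threshold test. A minimal sketch, assuming AveHV and VerHV are supplied by the motion information statistics and that both conditions must hold at once; the function name and the sample thresholds are hypothetical.

```python
def choose_alternate_scan(ave_hv: float, ver_hv: float,
                          alpha3: float, beta3: float) -> int:
    """Return alternate_scan = 1 for large, uniform horizontal motion, else 0."""
    large_motion = ave_hv > alpha3    # condition (16)
    uniform_motion = ver_hv < beta3   # condition (17)
    return 1 if (large_motion and uniform_motion) else 0


# Illustrative thresholds only; in practice they depend on the search range.
ALPHA3, BETA3 = 12.0, 9.0
panning = choose_alternate_scan(20.0, 2.0, ALPHA3, BETA3)
scattered = choose_alternate_scan(20.0, 50.0, ALPHA3, BETA3)
still = choose_alternate_scan(1.0, 0.5, ALPHA3, BETA3)
```

Requiring a small variance keeps pictures with large but scattered motion, which is not typical of panning, on the zigzag scan.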

As the third statistical information acquired by the encoding information statistics acquisition unit 16, the average quantization value AveQ can also be used to select the quantization scale type q_scale_type. In the quantization table shown in FIG. 8, when q_scale_type = 0 the quantization scale value changes linearly, and the quantization scale value (quantiser_scale_code) ranges from 2 to 62. When q_scale_type = 1, by contrast, the change is nonlinear so as to cover a wide range of quantization scale values: fine quantization becomes finer and coarse quantization becomes coarser, and the value ranges from 1 to 112.

If the quantization scale value never becomes extremely small or extremely large, it makes little difference which quantization table is selected. However, in situations such as a high coding rate, when the average quantization value is large, control is performed so as to select q_scale_type = 1. For example, for the average quantization value AveQ of equation (12), it is determined whether either of the conditions
AveQ < α₄ … (18)
AveQ > β₄ … (19)
holds. When one of them holds, that is, when the average quantization value is extremely small or extremely large, q_scale_type = 1, in which the quantization scale value changes nonlinearly, is selected; otherwise q_scale_type = 0 is selected under adaptive control.
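The choice between the linear and nonlinear quantization scales by conditions (18) and (19) can be sketched as below. The function name and the example thresholds are hypothetical; the specification does not give values for α₄ and β₄.

```python
def choose_q_scale_type(ave_q: float, alpha4: float, beta4: float) -> int:
    """Return q_scale_type = 1 when AveQ is extremely small or extremely large."""
    if alpha4 > beta4:
        raise ValueError("alpha4 (lower bound) must not exceed beta4 (upper bound)")
    # Condition (18): AveQ < alpha4, or condition (19): AveQ > beta4
    return 1 if (ave_q < alpha4 or ave_q > beta4) else 0


# Illustrative bounds only (assumptions).
ALPHA4, BETA4 = 4.0, 40.0
very_fine = choose_q_scale_type(2.0, ALPHA4, BETA4)
typical = choose_q_scale_type(16.0, ALPHA4, BETA4)
very_coarse = choose_q_scale_type(60.0, ALPHA4, BETA4)
```

Only the two extremes switch to the nonlinear scale; the middle band stays on the linear scale, where the choice of table makes little difference.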

The present invention is not limited to the embodiments described above, and various additions and modifications are possible. Besides the adaptive switching of coding parameters by switching the quantization table or the variable-length code table described above, switching control of other coding parameters is also possible. The statistical information of a picture to be encoded can also be acquired before that picture is encoded, so that encoding is performed under feedforward control. When the invention is applied to storage media, a picture can be provisionally encoded with several kinds of coding parameters, a final decision made from the provisional encoding results, and header information containing the optimal coding parameters added to the encoded data and stored.
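The provisional encoding variant for storage media might look like the following sketch. It assumes that "optimal" means the fewest generated bits and that a caller supplies a trial encoder; the specification states neither, so `trial_encode`, the selection criterion, and all names here are hypothetical.

```python
from itertools import product

PARAM_NAMES = ("intra_vlc_format", "alternate_scan", "q_scale_type")


def candidate_parameter_sets():
    """Enumerate all 0/1 combinations of the three picture-level parameters."""
    return [dict(zip(PARAM_NAMES, values)) for values in product((0, 1), repeat=3)]


def pick_parameters_by_trial(picture, trial_encode, candidates=None):
    """Provisionally encode `picture` with each candidate and keep the cheapest.

    `trial_encode(picture, params)` is a caller-supplied stand-in that returns
    the number of bits produced; it is not an API of any real encoder.
    """
    if candidates is None:
        candidates = candidate_parameter_sets()
    best_params, best_bits = None, None
    for params in candidates:
        bits = trial_encode(picture, params)
        if best_bits is None or bits < best_bits:
            best_params, best_bits = params, bits
    return best_params, best_bits


# Toy cost model used only to exercise the selection loop.
def toy_trial_encode(picture, params):
    return (1000 - 100 * params["alternate_scan"]
            + 50 * params["q_scale_type"] + 10 * params["intra_vlc_format"])


best_params, best_bits = pick_parameters_by_trial(None, toy_trial_encode)
```

The selected parameter set would then be written into the picture header ahead of the encoded data, as the paragraph above describes.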

DESCRIPTION OF SYMBOLS
1 Frame memory
2 Original image MB reading unit
3 Reference block reading unit
4 Motion vector searcher
5 Prediction decision unit
6 Encoder
7 Local decoder
8, 9 Switching units
10 Adder
11 Subtractor
12 Encoding control unit
13 Header information generation unit
14 Input image information statistics acquisition unit
15 Motion information statistics acquisition unit
16 Encoding information statistics acquisition unit
17 Scene determination unit

Claims

1. An image encoding apparatus having encoding means for performing predictive coding of an input moving image, comprising:
acquisition means for acquiring at least one of first statistical information indicating the nature of the input moving image, second statistical information based on correlation information in the process of predictive coding, and third statistical information that is a coding result based on coding parameters or is obtained in the process of coding; and
encoding control means for performing scene determination based on the statistical information acquired by the acquisition means and adaptively controlling the coding parameters in the encoding means in units of frames or fields,
wherein the acquisition means has a configuration for acquiring, as the second statistical information, a horizontal component average value, a horizontal component variance value, a vertical component average value, and a vertical component variance value of the motion vectors, and for acquiring, as the third statistical information, the average value of the effective coefficients after quantization per macroblock of the picture, and
wherein the encoding control means has a control configuration for performing variable-length coding by switching the variable-length code table in the encoding means based on a scene determination result obtained by comparing the average value of the effective coefficients with a preset coefficient, and a control configuration for switching the input scan order for variable-length coding in the encoding means based on a scene determination result obtained by comparing the second statistical information with preset coefficients.

2. The image encoding apparatus according to claim 1, wherein the acquisition means has a configuration for acquiring an average quantization value as the third statistical information, and the encoding control means has a control configuration for performing quantization by switching the quantization table in the encoding means based on a scene determination result obtained by comparing the average quantization value with a preset coefficient.