JP2001339720A

JP2001339720A - Image information conversion apparatus and method for same

Info

Publication number: JP2001339720A
Application number: JP2000155399A
Authority: JP
Inventors: Kazufumi Sato; 数史佐藤; Kuniaki Takahashi; 邦明高橋; Teruhiko Suzuki; 輝彦鈴木; Yoichi Yagasaki; 陽一矢ケ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2000-05-25
Filing date: 2000-05-25
Publication date: 2001-12-07

Abstract

PROBLEM TO BE SOLVED: To correspond to a change of complexity in GOP. SOLUTION: The device of the present invention transcodes MPEG 2 image compression information (bit stream) of interlaced scanning to MPEG 4 image compression information (bit stream) of sequential scanning, wherein average quantization scale calculating unit 25 finds an approximate value of average quantization scale relative to each frame using information quantizer-scale-code contained in slice header within MPEG 2 image compression information (bit stream) that has been taken out by slice header analyzing unit 18, a complexity calculating unit 26 finds an approximate value of complexity relative to each frame using these encoding amount (bit number) assigned to each frame, and MPEG 4 image information encoding unit (I/P-VOP) controls encoding amount of MPEG 4 image compression information (bit stream) using approximated complexity.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、画像情報を変換す
る画像情報変換装置及び方法に関し、詳しくは、離散コ
サイン変換等の直交変換と動き補償によって圧縮された
ＭＰＥＧ等の画像情報（ビットストリーム）を衛星放
送、ケーブルＴＶ、インターネット等のネットワークメ
ディアを介して受信する際に、若しくは光、磁気ディス
クのような記憶メディア上で処理する際に用いられる画
像情報を変換する画像情報変換装置及び方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image information conversion apparatus and method for converting image information, and more particularly, to image information (bit stream) such as MPEG compressed by orthogonal transform such as discrete cosine transform and motion compensation. And a method for converting image information used when receiving an image via a network medium such as satellite broadcasting, cable TV, or the Internet, or when processing the same on a storage medium such as an optical disk or a magnetic disk. .

【０００２】[0002]

【従来の技術】近年、画像情報をディジタルとして取り
扱い、効率の高い情報の伝送、蓄積を目的とし、画像情
報特有の冗長性を利用して、離散コサイン変換等の直交
変換と動き補償により圧縮するＭＰＥＧなどの画像情報
圧縮方式が提供されている。そして、このような画像情
報圧縮方法に準拠した装置は、放送局などの情報配信、
及び一般家庭における情報受信の双方において普及しつ
つある。2. Description of the Related Art In recent years, image information is handled as digital data, and for the purpose of transmitting and storing information with high efficiency, compression is performed by orthogonal transform such as discrete cosine transform and motion compensation using redundancy inherent in image information. An image information compression system such as MPEG is provided. Then, an apparatus conforming to such an image information compression method is used for information distribution of a broadcasting station or the like,
It is becoming widespread in both information reception in general households.

【０００３】特に、ＭＰＥＧ２（ＩＳＯ／ＩＥＣ１３
８１８−２）は、飛び越し走査画像及び順次走査画像の
双方、並びに標準解像度画像及び高精細画像を網羅す
る、汎用画像符号化方式として定義されている。In particular, MPEG2 (ISO / IEC 13
818-2) is defined as a general-purpose image encoding method that covers both interlaced scan images and progressive scan images, as well as standard resolution images and high definition images.

【０００４】すなわち、ＭＰＥＧ２符号化圧縮方式によ
れば、例えば、７２０×４８０画素を持つ標準解像度の
飛び越し走査画像に４〜８Ｍｂｐｓの符号量（ビットレ
ート）を割り当て、１９２０×１０８８画素を持つ高解
像度の飛び越し走査画像に対して１８〜２２Ｍｂｐｓの
符号量（ビットレート）を割り当てることにより、高い
圧縮率と良好な画質の実現が可能となる。That is, according to the MPEG2 encoding and compression system, for example, a code amount (bit rate) of 4 to 8 Mbps is assigned to a standard resolution interlaced scan image having 720 × 480 pixels, and a high resolution having 1920 × 1088 pixels is assigned. By assigning a code amount (bit rate) of 18 to 22 Mbps to the interlaced scan image, a high compression rate and good image quality can be realized.

【０００５】このようなことから、ＭＰＥＧ２は、プロ
フェッショナル用途及びコンシューマー用途の広範なア
プリケーションに今後とも用いられるものと予想され
る。しかし、ＭＰＥＧ２は、主として放送用に適合する
高画質符号化を対象としており、例えばＭＰＥＧ１より
低い符号量（ビットレート）、つまりより高い圧縮率の
符号化方式には対応していなかった。[0005] For these reasons, it is expected that MPEG2 will be used in a wide range of applications for professional use and consumer use. However, MPEG2 is mainly intended for high-quality coding suitable for broadcasting, and does not support, for example, a coding amount (bit rate) lower than that of MPEG1, that is, a coding method with a higher compression rate.

【０００６】一方で、近年の携帯端末の普及により、今
後とも高い圧縮率の符号化方式のニーズは高まると思わ
れ、これに対応して、高い圧縮率を有するＭＰＥＧ４符
号化方式の標準化が行われている。この画像符号化方式
に関しては、１９９８年１２月にＩＳＯ／ＩＥＣ１４
４９６−２として国際標準の規格が承認された。[0006] On the other hand, with the spread of portable terminals in recent years, it is expected that the need for an encoding system with a high compression ratio will increase in the future, and in response to this, standardization of the MPEG4 encoding system with a high compression ratio has been carried out. Have been done. Regarding this image coding method, ISO / IEC 14
International standard was approved as 496-2.

【０００７】ところで、ディジタル放送用に一度符号化
されたＭＰＥＧ２画像圧縮情報（ビットストリーム）
を、携帯端末上等で処理するのにより適した、より低い
符号量（ビットレート）の画像圧縮情報（ビットストリ
ーム）に変換したいというニーズがある。By the way, MPEG2 image compression information (bit stream) once encoded for digital broadcasting
There is a need to convert image compression information (bit stream) having a lower code amount (bit rate), which is more suitable for processing on a mobile terminal or the like.

【０００８】かかる目的を達成するために、“Field-to
-Frame Transcoding with Spatialand Temporal Downsa
mpling”（Susie L Wee,John G.Apostolopoulos, and N
ickFeamster, ICIP 99、以下これを文献１と呼ぶ）にお
いて画像情報変換装置（トランスコーダ）が提供されて
いる。In order to achieve such an object, “Field-to
-Frame Transcoding with Spatialand Temporal Downsa
mpling ”(Susie L Wee, John G. Apostolopoulos, and N
ickFeamster, ICIP99, hereafter referred to as Document 1).
And an image information conversion device (transcoder) is provided.
I have.

【０００９】この文献１において提供された画像情報変
換装置（トランスコーダ）は、図５に示すように、ピク
チャタイプ判別部１と、ＭＰＥＧ２画像情報復号化部
（Ｉ／Ｐピクチャ）２と、間引き部３と、ＭＰＥＧ４画
像情報符号化部（Ｉ／Ｐ−ＶＯＰ）４と、動きベクトル
合成部５と、動きベクトル検出部６とから構成されてい
る。[0009] As shown in FIG. 5, an image information conversion device (transcoder) provided in Document 1 includes a picture type discrimination unit 1, an MPEG2 image information decoding unit (I / P picture) 2, a thinning-out unit. It comprises a unit 3, an MPEG4 image information encoding unit (I / P-VOP) 4, a motion vector synthesizing unit 5, and a motion vector detecting unit 6.

【００１０】この画像情報変換装置には、フレーム内で
符号化されたイントラ符号化画像（Ｉピクチャ；Ｉ）、
表示順序で順方向を参照して予測符号化された順方向予
測符号化画像（Ｐピクチャ；Ｐ）及び表示順序で順方向
及び逆方向を参照して予測符号化された双方向予測符号
化画像（Ｂピクチャ；Ｂ）から構成される飛び越し走査
のＭＰＥＧ２画像圧縮情報（ビットストリーム）が入力
される。[0010] The image information conversion apparatus includes an intra-coded image (I picture; I) encoded in a frame,
A forward predictive coded image (P picture; P) predictively coded by referring to the forward direction in the display order, and a bidirectional predictive coded image predictively coded by referring to the forward and reverse directions in the display order MPEG-2 image compression information (bit stream) of interlaced scanning composed of (B picture; B) is input.

【００１１】このＭＰＥＧ２画像圧縮情報（ビットスト
リーム）は、ピクチャタイプ判別部１において、Ｉ／Ｐ
ピクチャに関するものか、Ｂピクチャに関するものであ
るかを判別され、Ｉ／Ｐピクチャのみ後続のＭＰＥＧ２
画像情報復号化部（Ｉ／Ｐピクチャ）２に出力され、Ｂ
ピクチャは破棄される。[0011] The MPEG2 image compression information (bit stream) is supplied to an I / P
It is determined whether the picture is related to a picture or a B picture.
Output to the image information decoding unit (I / P picture) 2
The picture is discarded.

【００１２】ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピク
チャ）２における処理は通常のＭＰＥＧ２画像情報復号
化装置と同様に、ＭＰＥＧ２画像圧縮情報（ビットスト
リーム）を画像信号に復号するものである。The processing in the MPEG2 image information decoding section (I / P picture) 2 is to decode the MPEG2 image compression information (bit stream) into an image signal, as in a normal MPEG2 image information decoding device.

【００１３】ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピク
チャ）２の出力となる画素値は、間引き部３に入力され
る。間引き部３は、水平方向には１／２の間引き処理を
施し、垂直方向には、第一フィールド若しくは第二フィ
ールドのどちらか一方のデータのみを残し、もう一方を
廃棄する。このような間引きによって、入力となる画像
情報の１／４の大きさを持つ順次走査画像を生成する。A pixel value output from the MPEG2 image information decoding unit (I / P picture) 2 is input to a thinning unit 3. The thinning section 3 performs a 1/2 thinning process in the horizontal direction, and leaves only one of the data of the first field and the second field in the vertical direction, and discards the other. By such thinning, a progressively scanned image having a size of ４ of the input image information is generated.

【００１４】間引き部３によって生成された順次走査画
像はＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）４
によってフレーム内で符号化されたＩ−ＶＯＰ及び表示
順序で順方向を参照して予測符号化されたＰ−ＶＯＰに
符号化され、ＭＰＥＧ４画像圧縮情報（ビットストリー
ム）として出力される。尚、ＶＯＰはVideo object Pla
neを意味し、ＭＰＥＧ２におけるフレームに相当するも
のである。The progressively scanned image generated by the thinning section 3 is converted to an MPEG4 image information encoding section (I / P-VOP) 4
Is encoded into an I-VOP encoded in the frame and a P-VOP predicted and encoded with reference to the forward direction in the display order, and output as MPEG4 image compression information (bit stream). VOP is Video object Pla
ne, which corresponds to a frame in MPEG2.

【００１５】その際、入力となるＭＰＥＧ２画像圧縮情
報（ビットストリーム）中の動きベクトル情報は、動き
ベクトル合成部５において間引き後の画像情報に対する
動きベクトルにマッピングされ、動きベクトル検出部６
においては、動きベクトル合成部５において合成された
動きベクトル値を元に高精度の動きベクトルを検出す
る。At this time, the motion vector information in the input MPEG2 image compression information (bit stream) is mapped to a motion vector for the decimated image information in the motion vector synthesizing unit 5 and the motion vector detecting unit 6
In, a highly accurate motion vector is detected based on the motion vector value synthesized by the motion vector synthesizing unit 5.

【００１６】文献１は、入力となるＭＰＥＧ２画像圧縮
情報（ビットストリーム）の１／２×１／２の大きさを
持つ順次走査画像のＭＰＥＧ４画像圧縮情報（ビットス
トリーム）を生成する画像情報変換装置に関して記述し
ている。すなわち、例えば入力となるＭＰＥＧ２画圧縮
情報（ビットストリーム）がＮＴＳＣ（National Telev
ision System Committee）の規格に準拠したものであっ
た場合、出力となるＭＰＥＧ４画像圧縮情報はＳＩＦサ
イズ（３５２×２４０画素）ということになる。Reference 1 discloses an image information conversion apparatus for generating MPEG4 image compression information (bit stream) of a progressively scanned image having a size of 1/2 × 1/2 of the input MPEG 2 image compression information (bit stream). Is described. That is, for example, the input MPEG2 image compression information (bit stream) is transmitted using NTSC (National Telev.
If it complies with the standards of the ISion Committee, the output MPEG4 image compression information has an SIF size (352 × 240 pixels).

【００１７】ところで、図５に示した画像情報変換装置
においては、ＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−Ｖ
ＯＰ）４における符号量制御が、出力となるＭＰＥＧ４
画像圧縮情報（ビットストリーム）における画質を決定
する大きな要因となる。ＩＳＯ／ＩＥＣ１４４９６−
２においては、符号量制御の方式に関しては特に規定さ
れておらず、各ベンダが、アプリケーションに応じて、
演算量及び出力画質の観点から最適と考えられる方式を
用いることが出来る。以下では、代表的な符号量制御方
式として、ＭＰＥＧ２ＴｅｓｔＭｏｄｅｌ５（ＩＳ
Ｏ／ＩＥＣＪＴＣ１／ＳＣ２９／ＷＧ１１Ｎ０４０
０）で述べられている方式について述べる。In the image information conversion apparatus shown in FIG. 5, an MPEG4 image information encoding unit (I / P-V
OP) 4 is to control the amount of code to output MPEG4
This is a major factor in determining the image quality of the image compression information (bit stream). ISO / IEC 14496-
In No. 2, the code amount control method is not specified, and each vendor has
It is possible to use a method that is considered optimal in terms of the amount of calculation and the output image quality. In the following, MPEG2 Test Model 5 (IS
O / IEC JTC1 / SC29 / WG11 N040
The method described in 0) will be described.

【００１８】この符号量制御のフローを図６に示すフロ
ーを用いて説明する。最初のステップＳ１１において、
画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）４は、目標符号量
（ターゲットビットレート）、及び、ＧＯＰ（ｇroup o
f pictures）構成を入力変数として、各ピクチャヘのビ
ット配分を行う。ここで、ＧＯＰとは、ランダムアクセ
ス可能なピクチャの組である。The flow of this code amount control will be described with reference to the flow shown in FIG. In the first step S11,
The image information encoding unit (I / P-VOP) 4 includes a target code amount (target bit rate) and a GOP (groove).
f pictures) configuration, and allocates bits to each picture. Here, a GOP is a set of pictures that can be randomly accessed.

【００１９】すなわち、ステップＳ１１において、画像
情報符号化部（Ｉ／Ｐ−ＶＯＰ）４は、ＧＯＰ内の各ピ
クチャに対する割り当てビット量を、割り当て対象ピク
チャを含めＧＯＰ内でまだ復号化されていないピクチャ
に対して割り当てられるビット量（以下、これをＲとす
る）を基に配分する。この配分をＧＯＰ内の符号化ピク
チャ順に繰り返す。その際、以下に述べる２つの仮定を
用いて各ピクチャヘの符号量割り当てを行う。That is, in step S11, the image information encoding unit (I / P-VOP) 4 determines the amount of bits allocated to each picture in the GOP, including the picture to be allocated, of the picture not yet decoded in the GOP. Are allocated based on the amount of bits allocated to (hereinafter referred to as R). This distribution is repeated in the order of the coded pictures in the GOP. At this time, the code amount is assigned to each picture using the following two assumptions.

【００２０】まず、第一に、各ピクチャを符号化する際
に用いる平均量子化スケールコードと発生符号量の積
は、画面が変化しない限り、ピクチャタイプ毎に一定値
となると仮定する。そこで、各ピクチャを符号化した
後、各ピクチャタイプ毎に、画面の複雑さを示す変数Ｘ
_i，Ｘ_p，Ｘ_b（grobal complelxity measure）を次の式
（１）によって更新する。First, it is assumed that the product of the average quantization scale code used when encoding each picture and the generated code amount becomes a constant value for each picture type unless the picture changes. Therefore, after encoding each picture, a variable X indicating the complexity of the screen is set for each picture type.
_i, X _p, updated by X _b (grobal complelxity measure) the following equation (1).

【００２１】[0021]

【数６】 (Equation 6)

【００２２】ここでＳ_i，Ｓ_p，Ｓ_bはピクチャ符号化時
の発生符号ビット量であり、Ｑ_i，Ｑ _p，Ｑ_bは、ピクチ
ャ符号化時の平均量子化スケールコードである。また、
初期値は、目標符号量（ターゲットビットレート）ｂｉ
ｔ＿ｒａｔｅ［ｂｉｔｓ／ｓｅｃ］を用いて、式（２）
で示される値とする。Where S_i, S_p, S_bIs for picture encoding
Is the amount of generated code bits, and Q_i, Q _p, Q_bIs a picture
This is an average quantization scale code at the time of key coding. Also,
The initial value is the target code amount (target bit rate) bi
Using t_rate [bits / sec], equation (2)
The value is indicated by

【００２３】[0023]

【数７】 (Equation 7)

【００２４】第二に、Ｉピクチャの量子化スケールコー
ドを基準としたＰ，Ｂピクチャの量子化スケールコード
の比率Ｋ_p，Ｋ_bが式（３）に定めた値となる場合に常に
全体の画質が最適化されると仮定する。Second, when the ratios K _p and K _b of the quantized scale codes of the P and B pictures with respect to the quantized scale code of the I picture are equal to the values defined in the equation (3), the whole is always obtained. Assume that the image quality is optimized.

【００２５】[0025]

【数８】 (Equation 8)

【００２６】すなわち、Ｂピクチャの量子化スケールコ
ードは、Ｉ，Ｐピクチャの量子化スケールコードの常に
１．４倍としている。これは、ＢピクチャをＩ，Ｐピク
チャに比較して多少粗めに符号化することにより、Ｂピ
クチャで節約できる符号量をＩ，Ｐピクチャに加える
と、Ｉ，Ｐピクチャの画質が改善され、これを参照する
Ｂピクチャの画質も改善されることを想定している。That is, the quantization scale code of the B picture is always 1.4 times the quantization scale code of the I and P pictures. This is because the picture quality of the I and P pictures is improved by adding the code amount that can be saved in the B picture to the I and P pictures by coding the B picture somewhat coarsely compared to the I and P pictures. It is assumed that the image quality of the B picture that refers to this is also improved.

【００２７】上記２つの仮定より、ＧＯＰの各ピクチャ
に対する割り当てビット量（Ｔ_i，Ｔ_p，Ｔ_b）は式
（４）に示す値とする。Based on the above two assumptions, the bit amount (T _i , T _p , T _b ) allocated to each picture of the GOP is a value shown in equation (4).

【００２８】[0028]

【数９】 (Equation 9)

【００２９】ここでＮ_p，Ｎ_bはＧＯＰ内でまだ符号化さ
れていないＰ，Ｂピクチャの枚数である。Here, N _p and N _b are the numbers of P and B pictures which have not been encoded in the GOP.

【００３０】このようにして求めた割当符号量を基にし
て、各ピクチャをステップＳ１１，Ｓ１２に従って符号
化する毎に、ＧＯＰ内の未符号化ピクチャに対して割り
当てられるビット量Ｒを式（５）で更新する。Each time each picture is coded according to steps S11 and S12 based on the allocated code amount obtained in this manner, the bit amount R allocated to the uncoded picture in the GOP is calculated by the equation (5). ) To update.

【００３１】[0031]

【数１０】 (Equation 10)

【００３２】また、ＧＯＰの最初のピクチャを符号化す
る際には、式（６）によりＲを更新する。When encoding the first picture of the GOP, R is updated by equation (6).

【００３３】[0033]

【数１１】 [Equation 11]

【００３４】ＮはＧＯＰ内のピクチャ数である。また、
シーケンスの最初でのＲの初期値は０とする。N is the number of pictures in the GOP. Also,
The initial value of R at the beginning of the sequence is 0.

【００３５】次に、ステップＳ１２において、画像情報
符号化装置（Ｉ／Ｐ−ＶＯＰ）４は、仮想バッファを用
いたレート制御を行う。すなわち、ステップＳ１２にお
いて、画像情報符号化装置（Ｉ／Ｐ−ＶＯＰ）４は、ス
テップＳ１１で式（４）により求められた各ピクチャに
対する割当ビット量（Ｔ_i，Ｔ_p，Ｔ_b）と、実際の発生
符号量を一致させるため、各ピクチャ毎に独立に設定し
た３種類の仮想バッファの容量を基に、量子化スケール
コードを、マクロブロック単位のフィードバック制御で
求める。Next, in step S12, the image information encoding device (I / P-VOP) 4 performs rate control using the virtual buffer. That is, in step S12, the image information encoding device (I / P-VOP) 4 allocates bits (T _i , T _p , T _b ) for each picture obtained by equation (4) in step S11, In order to match the actual generated code amounts, the quantization scale code is obtained by macroblock-based feedback control based on the capacity of three types of virtual buffers independently set for each picture.

【００３６】まず、ｊ番目のマクロブロック符号化に先
立ち、仮想バッファの占有量を式（７）によって求め
る。First, prior to encoding the j-th macroblock, the occupancy of the virtual buffer is determined by equation (7).

【００３７】[0037]

【数１２】 (Equation 12)

【００３８】ここで、ｄ₀ ⁱ，ｄ₀ ^p，ｄ₀ ^bは各仮想バッフ
ァの初期占有量、Ｂ_jはピクチャの先頭からｊ番目のマ
クロブロックまでの発生ビット量、ＭＢ＿ｃｎｔは１ピ
クチャ内のマクロブロック数である。各ピクチャ符号化
終了時の仮想バッファ占有量（ｄ_{MB_cnt} ⁱ，ｄ_{MB_cnt} ^p，
ｄ_{MB_cnt} ^b）は、それぞれ同一のピクチャタイプで、次
のピクチャに対する仮想バッファ占有量の初期値
（ｄ₀ ⁱ，ｄ₀ ^p，ｄ₀ ^b）として用いられる。Here, d ₀ ⁱ , d ₀ ^p , and d ₀ ^b are the initial occupancy of each virtual buffer, B _j is the amount of generated bits from the head of the picture to the j-th macroblock, and MB_cnt is the number of bits in one picture. This is the number of macro blocks. Virtual buffer occupancy at the end of each picture encoding (d _{MB_cnt} ⁱ , d _{MB_cnt} ^p ,
d _{MB_cnt} ^b ) is used as an initial value (d ₀ ⁱ , d ₀ ^p , d ₀ ^b ) of the virtual buffer occupancy for the next picture of the same picture type.

【００３９】次に、ｊ番目のマクロブロックに対する量
子化スケールコードを式（８）により計算する。Next, the quantization scale code for the j-th macroblock is calculated by equation (8).

【００４０】[0040]

【数１３】 (Equation 13)

【００４１】ここで、ｒはリアクションパラメーターと
呼ばれるフィードバックループの応答を制御する変数で
あり、式（９）により与えられる。Here, r is a variable called a reaction parameter that controls the response of the feedback loop, and is given by equation (9).

【００４２】[0042]

【数１４】 [Equation 14]

【００４３】尚、符号化開始時における仮想バッファの
初期値は式（１０）で与えられる。The initial value of the virtual buffer at the start of encoding is given by equation (10).

【００４４】[0044]

【数１５】 (Equation 15)

【００４５】最後に、ステップＳ１３において、画像情
報符号化装置（Ｉ／Ｐ−ＶＯＰ）４は、視覚特性を考慮
したマクロブロック毎の適応量子化を行う。すなわち、
ステップＳ１３において、画像情報符号化部（Ｉ／Ｐ−
ＶＯＰ）４は、ステップＳ１２で求められた量子化スケ
ールコードを、視覚的に劣化の目立ちやすい平坦部でよ
り細かく量子化し、劣化の比較的目立ちにくい絵柄の複
雑な部分で粗く量子化するように、各マクロブロック毎
のアクティビティと呼ばれる変数によって変化させてい
る。Finally, in step S13, the image information encoding device (I / P-VOP) 4 performs adaptive quantization for each macroblock in consideration of visual characteristics. That is,
In step S13, the image information encoding unit (I / P-
VOP) 4 is to quantize the quantized scale code obtained in step S12 more finely in a flat portion where the deterioration is conspicuous visually and coarsely in a complicated portion of the picture where the deterioration is relatively inconspicuous. , And is changed by a variable called an activity for each macroblock.

【００４６】アクティビティは、原画の輝度信号画素値
を用い、フレーム離散コサイン変換モードにおける４個
のブロックと、フィールド離散コサイン変換モードにお
ける４個のブロックとの、合計８ブロックの画素値を用
いて式（１１）で与えられる。The activity is calculated by using the pixel value of the luminance signal of the original picture and the pixel values of a total of 8 blocks of 4 blocks in the frame discrete cosine transform mode and 4 blocks in the field discrete cosine transform mode. Given by (11).

【００４７】[0047]

【数１６】 (Equation 16)

【００４８】ここで、Ｐ_kは原画の輝度信号ブロック内
画素値である。式（１１）において最小値を採るのは、
マクロブロック内の一部だけでも平坦部分のある場合に
は量子化を細かくするためである。Here, P _k is the pixel value in the luminance signal block of the original image. The minimum value in the equation (11) is
This is because the quantization is made fine when there is a flat portion even in only a part of the macro block.

【００４９】更に、式（１２）によりその値が０．５〜
２の範囲を取る正規化アクティビティＮａｃｔ_jを求め
る。Further, according to equation (12), the value is 0.5 to
A normalization activity Nact _j having a range of 2 is obtained.

【００５０】[0050]

【数１７】 [Equation 17]

【００５１】ここで、ａｖｇ＿ａｃｔは、直前に符号化
したピクチャでのａｃｔ_jの平均値である。Here, avg_act is the average value of act _j in the picture coded immediately before.

【００５２】視覚特性を考慮した量子化スケールコード
ｍｑｕａｎｔ_jはステップＳ１２で得られた量子化スケ
ールコードＱ_jを基に式（１３）で与えられる。The quantized scale code mquant _j taking into account the visual characteristics is given by equation (13) based on the quantized scale code Q _j obtained in step S12.

【００５３】[0053]

【数１８】 (Equation 18)

【００５４】ＭＰＥＧ２ＴｅｓｔＭｏｄｅｌ５にお
いて定められた上記符号量制御方式には以下の制限のあ
ることが知られており、実際の制御を行う場合には、こ
れらの制限に対する対策が必要となる。すなわち、第一
の制限は、第一ステップＳ１１はシーンチェンジに対応
出来ず、また、シーンチェンジ後には第三ステップＳ１
３で用いる媒介変数ａｖｇ＿ａｃｔが間違った値となる
ということである。第二の制限は、ＭＰＥＧ２及びＭＰ
ＥＧ４において規定されているＶＢＶ（VideoBuffer Ve
rifier）の拘束条件を満たす保証がないことである。It is known that the above-mentioned code amount control method defined in the MPEG2 Test Model 5 has the following restrictions, and when performing actual control, it is necessary to take measures against these restrictions. That is, the first limitation is that the first step S11 cannot respond to a scene change, and the third step S1
This means that the parameter avg_act used in step 3 has an incorrect value. The second limitation is that MPEG2 and MP
VBV (VideoBuffer Ve) specified in EG4
rifier) is not guaranteed.

【００５５】ところで、文献”ＭＰＥＧ圧縮効率の理論
解析とその符号量制御への応用”（信学技報、ＩＥ−９
５，ＤＳＰ９５−１０，１９９５年５月、以下これを
文献２と呼ぶ）でも述べられている通り、ＴｅｓｔＭ
ｏｄｅｌ５で定められている符号量制御方式は、ＭＰ
ＥＧ−２画像符号化装置において、必ずしも良好な画質
を与えるものではない。By the way, the document "Theoretical analysis of MPEG compression efficiency and its application to code amount control" (IEICE Technical Report, IE-9).
5, DSP95-10, May 1995, hereinafter referred to as Reference 2).
The code amount control method defined in model 5 is MP
The EG-2 image encoding device does not always provide good image quality.

【００５６】この文献２では、特に、良好な画質を与え
るための、ＧＯＰ内における各フレーム毎の最適な符号
量配分を与える手法として以下の方式を提案している。
すなわち、Ｎ_I，Ｎ_P，Ｎ_Bを、ＧＯＰ内においてまだ符
号化されていないＩ，Ｐ，Ｂピクチャの枚数として、こ
れらに割り当てられる符号量をＲ_I，Ｒ_P，Ｒ_Bとする。
また、式（１４）で与えられる固定レート条件の下に、
それぞれにおける量子化ステップサイズをＱ_I，Ｑ_P，Ｑ
_Bとし、ｍを、量子化ステップサイズと再生誤差分散を
関係付ける次数（すなわち、量子化ステップサイズをｍ
乗したものの平均値の最小化が再生誤差分散を最低にす
ると仮定する）とする。そして、式（１５）を最小にす
ることを考える。This document 2 proposes the following method as a method for giving an optimal code amount distribution for each frame in a GOP, particularly for giving a good image quality.
That is, let N _I , N _P , and N _B be the numbers of I, P, and B pictures that have not been encoded in the GOP, and let R _I , R _P , and R _B be the code amounts assigned to these.
Also, under the fixed rate condition given by equation (14),
Q _I , Q _P , Q
_B, and m is an order relating the quantization step size and the reproduction error variance (that is, the quantization step size is m
It is assumed that minimizing the average value of the powers will minimize the reproduction error variance). Then, consider minimizing equation (15).

【００５７】[0057]

【数１９】 [Equation 19]

【００５８】[0058]

【数２０】 (Equation 20)

【００５９】尚、それぞれのフレームにおける平均量子
化スケールＱ、及び符号量Ｒは、ＴｅｓｔＭｏｄｅｌ
５でも用いられる媒体変数である各フレームのコンプレ
キシティＸと、式（１６）のように関係づけられる。The average quantization scale Q and the code amount R in each frame are represented by Test Model.
5 is related to the complexity X of each frame, which is also a medium variable used in Expression 5, as shown in Expression (16).

【００６０】[0060]

【数２１】 (Equation 21)

【００６１】式（１６）の関係も考慮しつつ、式（１
４）の拘束条件の元に式（１５）を最小にするＲ_I，
Ｒ_P，Ｒ_Bを、ラグランジェの未定乗数法を用いて算出す
ると、最適なＲ_I，Ｒ_P，Ｒ_Bとして以下の式のような値
が求められる。The formula (1) is also taken into consideration while considering the relationship of the formula (16).
R _I , which minimizes equation (15) under the constraint of 4)
When R _P and R _B are calculated using the Lagrange's undetermined multiplier method, the following values are obtained as optimal R _I , R _P and R _B.

【００６２】[0062]

【数２２】 (Equation 22)

【００６３】α＝１として、式（１７）と、ＭＰＥＧ２
ＴｅｓｔＭｏｄｅｌ５で定められた符号量制御方式
における式（４）との関係は以下の通りであると言え
る。すなわち、式（１７）は、符号量制御の媒介変数で
あるＫ_p，Ｋ_bを、各フレームのコンプレキシティＸ_I，
Ｘ_P，Ｘ_Bに応じて、式（１８）のように適応的に算出し
ていることに他ならない。When α = 1, equation (17) and MPEG2
It can be said that the relationship with Expression (4) in the code amount control method defined in Test Model 5 is as follows. That is, equation (17) uses the parameters K _p and K _b that are the code amount control parameters as the complexity X _I ,
X _P, according to X _B, nothing but that they are adaptively calculated as in Equation (18).

【００６４】[0064]

【数２３】 (Equation 23)

【００６５】文献２では、１／（１＋ｍ）の値として、
０．６〜１．２程度に設定することで良好な画質が得ら
れることが示されている。In Reference 2, as the value of 1 / (1 + m),
It is shown that good image quality can be obtained by setting the value to about 0.6 to 1.2.

【００６６】[0066]

【発明が解決しようとする課題】しかしながら、図５に
示した画像情報変換装置内で、ＭＰＥＧ４画像情報符号
化装置（Ｉ／Ｐ−ＶＯＰ）４において、ＭＰＥＧ２Ｔ
ｅｓｔＭｏｄｅｌ５において定められたのと同様な手
法を用いて符号量制御を行った場合、シーンチェンジ等
に起因する、ＧＯＰ内でのコンプレキシティの変化に対
応することが不可能である。このため、安定した符号量
制御が困難となり、画質劣化を引き起こすことが考えら
れる。また、この画像情報変換装置においては、演算処
理量が多大であるという問題が有る。However, in the image information conversion device shown in FIG. 5, the MPEG2 T information is encoded by the MPEG4 image information encoding device (I / P-VOP) 4.
When code amount control is performed using a method similar to that defined in est Model 5, it is impossible to cope with a change in complexity within the GOP due to a scene change or the like. For this reason, stable code amount control becomes difficult, which may cause image quality degradation. In addition, this image information conversion device has a problem that the amount of calculation processing is large.

【００６７】本発明は、上述の実情に鑑みて提案される
ものであって、ＧＯＰ内でのコンプレキシティの変化に
対応することができ、演算処理量を低減するような画像
情報変換装置及び方法を提供することを目的とする。The present invention has been proposed in view of the above-mentioned circumstances, and is capable of coping with a change in complexity within a GOP, and an image information conversion apparatus and a method for reducing the amount of arithmetic processing. The aim is to provide a method.

【００６８】[0068]

【課題を解決するための手段】上述の課題を解決するた
めに、本発明は、第１の圧縮符号化方式で圧縮された飛
び越し走査の入力画像圧縮情報を、第２の圧縮符号化方
式で圧縮された順次走査の出力画像圧縮情報に変換する
ものであって、上記第１の圧縮符号化方式で圧縮された
入力画像圧縮情報の所定位置に含まれる量子化スケール
情報を用いて、この入力画像圧縮情報の各フレームの平
均量子化スケールの近似値を求め、この近似値及び各フ
レームに割り当てられた符号量を用いて上記フレームの
画像に対するコンプレキシティの近似値を求め、上記近
似値に基づいて出力画像圧縮情報の符号化における符号
量を制御するものである。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, the present invention provides an interlaced scan input image compression information compressed by a first compression encoding system by a second compression encoding system. The input image compression information is converted into compressed progressively scanned output image compression information. The input image compression information is compressed by using the quantization scale information included in a predetermined position of the input image compression information compressed by the first compression encoding method. The approximate value of the average quantization scale of each frame of the image compression information is obtained, and the approximate value of the complexity of the image of the frame is obtained by using the approximate value and the code amount assigned to each frame. The code amount in the coding of the output image compression information is controlled on the basis of this.

【００６９】本発明は、飛び越し走査のＭＰＥＧ２画像
圧縮情報（ビットストリーム）を入力画像圧縮情報と
し、順次操作のＭＰＥＧ４画像圧縮情報（ビットストリ
ーム）を出力画像圧縮情報とする。これらＭＰＥＧ２画
像圧縮情報（ビットストリーム）及びＭＰＥＧ４画像圧
縮情報は、画素ブロック列すなわちスライスから構成さ
れている。このスライスは、複数の画素から構成される
画素ブロックすなわちマクロブロックから構成されてい
る。According to the present invention, MPEG2 image compression information (bit stream) of interlaced scanning is used as input image compression information, and MPEG4 image compression information (bit stream) of sequential operation is used as output image compression information. The MPEG2 image compression information (bit stream) and the MPEG4 image compression information are composed of a pixel block sequence, that is, a slice. This slice is composed of a pixel block composed of a plurality of pixels, that is, a macroblock.

【００７０】本発明は、ＭＰＥＧ２画像圧縮情報の画像
のタイプを判別するピクチャタイプ判別部、ＭＰＥＧ２
画像圧縮情報の圧縮情報を解析する圧縮情報解析部及び
その構成要素であるスライスヘッダを解析するスライス
ヘッダ解析部、ＭＰＥＧ２画像圧縮情報を復号する画像
圧縮情報復号化部（Ｉ／Ｐピクチャ）、画素を間引く間
引き部、ＭＰＥＧ４画像圧縮情報に符号化するＭＰＥＧ
４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）、ＭＰＥＧ２画
像圧縮情報の動きベクトルを合成する動きベクトル合成
部、動きベクトルを検出する動きベクトル検出部、圧縮
情報解析部から送られた圧縮情報を格納する情報バッフ
ァ、平均量子化スケールを算出する平均量子化スケール
算出部、平均量子化スケールに基づいてコンプレキシテ
ィを算出するコンプレキシティ算出部を兼ね備える。The present invention provides a picture type discriminating section for discriminating the type of an image of MPEG2 image compression information.
A compression information analysis unit for analyzing compression information of image compression information, a slice header analysis unit for analyzing a slice header which is a component thereof, an image compression information decoding unit (I / P picture) for decoding MPEG2 image compression information, and a pixel. Thinning unit for thinning, MPEG encoding to MPEG4 image compression information
4 image information coding unit (I / P-VOP), a motion vector synthesizing unit for synthesizing the motion vector of the MPEG2 image compression information, a motion vector detecting unit for detecting the motion vector, and the compression information sent from the compression information analyzing unit. It also has an information buffer for storing, an average quantization scale calculation unit for calculating an average quantization scale, and a complexity calculation unit for calculating complexity based on the average quantization scale.

【００７１】そして、本発明は、各スライスヘッダに存
在するquantizer_scale_codeを用いて画面全体の平均量
子化スケールを近似的に算出し、これを用いて算出され
た、入力となるＭＰＥＧ２画像圧縮情報（ビットストリ
ーム）における各フレームに対するコンプレキシティの
近似値を利用して、より少ない処理量により、各フレー
ムに対する符号量割当が画像に対して最適化された状態
で、順次走査のＭＰＥＧ４画像圧縮情報（ビットストリ
ーム）を出力する手段を提供するものである。The present invention uses the quantizer_scale_code present in each slice header to approximately calculate the average quantization scale of the entire screen, and calculates the input MPEG2 image compression information (bit MPEG4 image compression information (bits) of progressive scanning in a state where the code amount allocation for each frame is optimized for the image with a smaller amount of processing using the approximate value of the complexity for each frame in the stream). Stream).

【００７２】上記構成のピクチャタイプ判別部において
は、入力となるＭＰＥＧ２画像圧縮情報（ビットストリ
ーム）内で、Ｉ／Ｐピクチャに関するものだけ残してＢ
ピクチャに関するものは廃棄し、圧縮情報解析部におい
ては、１ＧＯＰ分の遅延を実現し、また、入力となるＭ
ＰＥＧ２画像圧縮情報（ビットストリーム）内で、各フ
レームに対して割り当てられた符号量（ビット数）を検
出するとともに、圧縮情報解析部内のスライスヘツダ解
析部においては、スライスヘッダ内のquantizer_scale_
codeの値を抽出し、ＭＰＥＧ２画像情報復号化部（Ｉ／
Ｐピクチャ）においては、ピクチャタイプ判別部の出力
となる、Ｉ／Ｐピクチャに関する圧縮情報（ビットスト
リーム）を、水平方向垂直方向ともに、８次の離散コサ
イン係数全てを用いた、若しくはその低域成分のみを用
いた復号処理を行い、間引き装置においては、ＭＰＥＧ
２画像情報復号化部（Ｉ／Ｐピクチャ）の出力である画
像情報の第一フィールド若しくは第二フイールドのみを
取り出して順次走査画像への変換を行うと同時に、所望
の画枠サイズに変換するためのダウンサンプリングを行
い、ＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）に
おいては、間引き装置の出力となる画像情報をＭＰＥＧ
４符号化方式により符号化し、動きベクトル合成部にお
いては、ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピクチ
ャ）で検出された、入力となる画像圧縮情報（ビットス
トリーム）内の動きベクトル値を元に、走査変換後の画
像データに対する動きベクトル値にマッピングを行い、
動きベクトル検出部においては、動きベクトル合成部か
ら出力される動きベクトル値を元に、高精度の動きベク
トル検出を行い、情報バッファにおいては、圧縮情報解
析部において抽出された、入力となるＭＰＥＧ２画像圧
縮情報（ビットストリーム）における、各フレームに割
り当てられた符号量（ビット数）を格納し、平均量子化
スケール算出部においては、スライスヘッダ解析部にお
いて抽出された各スライスに対するquantizer_scale_co
deから、各フレームに対する平均量子化スケールを近似
的に算出し、コンプレキシティ算出部においては、情報
バッファに格納された、入力となる画像圧縮情報（ビッ
トストリーム）における、各フレームに割り当てられた
符号量（ビット数）、及びコンプレキシティ算出部にお
いて近似的に計算された、各フレームに対する平均量子
化スケールから、各フレームに対するコンプレキシティ
を近似的に算出し、ＭＰＥＧ４画像情報符号化部（Ｉ／
Ｐ−ＶＯＰ）へ、符号量制御のための媒介変数として伝
送する。In the picture type discriminating section having the above-described structure, only the I / P picture is left in the input MPEG2 image compression information (bit stream).
Pictures related to pictures are discarded, and the compression information analyzer realizes a delay of one GOP, and outputs M
In the PEG2 image compression information (bit stream), the code amount (bit number) allocated to each frame is detected, and in the slice header analysis unit in the compression information analysis unit, quantizer_scale_ in the slice header is determined.
The value of code is extracted, and the MPEG2 image information decoding unit (I /
(P picture), the compression information (bit stream) relating to the I / P picture, which is output from the picture type discriminating unit, is obtained by using all the 8th-order discrete cosine coefficients in the horizontal and vertical directions, or a low-frequency component thereof. The decoding process using only MPEG-2 is performed.
To extract only the first field or the second field of the image information output from the two-image information decoding unit (I / P picture) and convert it to a sequentially scanned image, and at the same time to convert it to a desired image frame size Is downsampled, and the MPEG4 image information encoding unit (I / P-VOP) converts the image information to be output from the thinning device to MPEG.
In the motion vector synthesizing unit, based on the motion vector value in the input image compression information (bit stream) detected by the MPEG2 image information decoding unit (I / P picture). Perform mapping to the motion vector value for the image data after scan conversion,
The motion vector detecting section performs high-precision motion vector detection based on the motion vector value output from the motion vector synthesizing section. In the information buffer, the input MPEG2 image extracted by the compression information analyzing section. In the compression information (bit stream), the code amount (the number of bits) allocated to each frame is stored. In the average quantization scale calculation unit, the quantizer_scale_co for each slice extracted in the slice header analysis unit is stored.
From the de, the average quantization scale for each frame is approximately calculated, and the complexity calculating unit allocates the average quantization scale to each frame in the input image compression information (bit stream) stored in the information buffer. The complexity for each frame is approximately calculated from the code amount (the number of bits) and the average quantization scale for each frame approximately calculated by the complexity calculation unit, and the MPEG4 image information encoding unit ( I /
(P-VOP) as a parameter for code amount control.

【００７３】[0073]

【発明の実施の形態】以下、本発明の実施の形態につい
て、図面を参照して説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００７４】まず、画像情報変換装置（トランスコー
ダ）の一例について、図１を参照して説明する。First, an example of an image information conversion device (transcoder) will be described with reference to FIG.

【００７５】この画像情報変換装置は、ピクチャタイプ
判別部７と、圧縮情報解析部８と、ＭＰＥＧ２画像情報
復号化部（Ｉ／Ｐピクチャ）９と、間引き部１０と、Ｍ
ＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）１１と、
動きベクトル合成部１２と、動きベクトル検出部１３
と、情報バッファ１４と、コンプレキシティ算出部１５
とから構成される。The image information conversion apparatus includes a picture type discriminating unit 7, a compression information analyzing unit 8, an MPEG2 image information decoding unit (I / P picture) 9, a thinning unit 10,
A PEG4 image information encoding unit (I / P-VOP) 11;
Motion vector synthesis unit 12 and motion vector detection unit 13
, Information buffer 14 and complexity calculating unit 15
It is composed of

【００７６】この画像情報変換装置には、フレーム内で
符号化されたイントラ符号化画像（Ｉピクチャ；Ｉ）、
表示順序で順方向を参照して予測符号化された順方向予
測符号化画像（Ｐピクチャ；Ｐ）及び表示順序で順方向
及び逆方向を参照して予測符号化された双方向予測符号
化画像（Ｂピクチャ；Ｂ）から構成される飛び越し走査
のＭＰＥＧ２画像圧縮情報（ビットストリーム）が入力
される。The image information conversion apparatus includes an intra-coded image (I picture; I) encoded in a frame,
A forward predictive coded image (P picture; P) predictively coded by referring to the forward direction in the display order, and a bidirectional predictive coded image predictively coded by referring to the forward and reverse directions in the display order MPEG-2 image compression information (bit stream) of interlaced scanning composed of (B picture; B) is input.

【００７７】このＭＰＥＧ２画像圧縮情報（ビットスト
リーム）は、ピクチャタイプ判別部７において、Ｉ／Ｐ
ピクチャに関するものか、Ｂピクチャに関するものであ
るかを判別され、Ｉ／Ｐピクチャのみ後続の圧縮情報解
析部８に出力され、Ｂピクチャは破棄される。The MPEG2 image compression information (bit stream) is transmitted to the I / P
It is determined whether it is related to a picture or a B picture, and only the I / P picture is output to the subsequent compression information analyzer 8, and the B picture is discarded.

【００７８】圧縮情報解析部８において、ＭＰＥＧ２画
像情報復号化装置（Ｉ／Ｐピクチャ）８における復号処
理に用いられる量子化スケールのフレーム全体に渡る平
均値Ｑ、及び、入力となるＭＰＥＧ２画像圧縮情報（ビ
ットストリーム）において、当該フレームに割り当てら
れた総符号量（ビット数）Ｂは、情報バッファ１４に格
納される。In the compression information analysis unit 8, the average value Q over the entire frame of the quantization scale used for the decoding processing in the MPEG2 image information decoding device (I / P picture) 8, and the MPEG2 image compression information to be input In the (bit stream), the total code amount (the number of bits) B allocated to the frame is stored in the information buffer 14.

【００７９】ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピク
チャ）９における処理は通常のＭＰＥＧ２画像情報復号
化装置と同様に、ＭＰＥＧ２画像圧縮情報（ビットスト
リーム）を画像信号に復号するものである。ここで、Ｂ
ピクチャに関するデータはピクチャタイプ判別部７にお
いて廃棄されているので、ＭＰＥＧ２画像情報復号化部
（Ｉ／Ｐピクチャ）１は、Ｉ／Ｐピクチャのみを復号化
出来る機能を有すればよい。The processing in the MPEG2 image information decoding section (I / P picture) 9 decodes the MPEG2 image compression information (bit stream) into an image signal, similarly to a normal MPEG2 image information decoding device. Where B
Since data relating to pictures is discarded in the picture type discriminating unit 7, the MPEG2 image information decoding unit (I / P picture) 1 only needs to have a function capable of decoding only I / P pictures.

【００８０】ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピク
チャ）９の出力となる画素値は、間引き部１０に入力さ
れる。間引き部１０は、水平方向には１／２の間引き処
理を施し、垂直方向には、第一フィールド若しくは第二
フィールドのどちらか一方のデータのみを残し、もう一
方を廃棄する。このような間引きによって、入力となる
画像情報の１／４の大きさを持つ順次走査画像を生成す
る。The pixel value output from the MPEG2 image information decoding unit (I / P picture) 9 is input to the thinning unit 10. The thinning section 10 performs a 1/2 thinning process in the horizontal direction, and leaves only one of the data of the first field and the second field in the vertical direction, and discards the other. By such thinning, a progressively scanned image having a size of ４ of the input image information is generated.

【００８１】ところで、間引き部１０から出力された画
像をＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）１
１において１６×１６画素で構成されるマクロブロック
単位で符号化するためには、水平方向、垂直方向とも
に、その画素数が１６の倍数である必要が有る。間引き
部１０においては、このための画素の補填若しくは廃棄
を、間引きと同時に行う。By the way, the image output from the thinning section 10 is converted into an MPEG4 image information encoding section (I / P-VOP) 1
In order to perform encoding in units of macroblocks composed of 16 × 16 pixels in 1, the number of pixels must be a multiple of 16 in both the horizontal and vertical directions. In the thinning section 10, the pixels are supplemented or discarded for this purpose simultaneously with the thinning.

【００８２】例えば、入力となるＭＰＥＧ２画像圧縮情
報（ビットストリーム）がＮＴＳＣ（National Televis
ion System Committee）の規格に準拠したもの、つまり
７２０×４８０画素、３０Ｈｚの飛び越し走査画像であ
った場合、間引き後の画枠はＳＩＦ（３６０×２４０画
素）サイズということになる。この画像に対して、間引
き部３において、例えば水平方向の右端若しくは左端の
８ラインを廃棄して３５２×２４０画素とする。For example, if the input MPEG2 image compression information (bit stream) is NTSC (National Televis)
In the case of an image conforming to the standards of the Ion System Committee, that is, an interlaced scan image of 720 × 480 pixels and 30 Hz, the image frame after thinning out has an SIF (360 × 240 pixels) size. For this image, the thinning unit 3 discards, for example, the eight lines at the right end or the left end in the horizontal direction to obtain 352 × 240 pixels.

【００８３】なお、間引き部１０における動作の変更を
行うことで、これ以外の画枠、例えば上記の例で、約１
／４×１／４の画枠であるＱＳＩＦ（１７６×１１２画
素）サイズの画像に変換することも可能である。By changing the operation of the thinning section 10, other image frames, for example, about 1 in the above example, can be used.
It is also possible to convert the image into an image of QSIF (176 × 112 pixels) which is an image frame of ××.

【００８４】更に、上述した文献１は、ＭＰＥＧ２画像
情報復号化部（Ｉ／Ｐピクチャ）９における処理とし
て、水平方向、垂直方向それぞれについて、入力となる
ＭＰＥＧ２画像圧縮情報（ビットストリーム）内の、８
次の離散コサイン変換係数すべてを用いた復号処理を行
う画像情報変換装置について述べられているが、図１に
示した装置に関してはその限りではなく、水平方向の
み、或いは水平方向、垂直方向ともに、８次の離散コサ
イン変換係数のうちの低域成分のみを用いた復号処理を
行い、画質劣化を最小限に抑えながら、復号処理に伴う
演算量とビデオメモリ容量を削減することが可能であ
る。Further, the above-mentioned reference 1 discloses that the processing in the MPEG2 image information decoding unit (I / P picture) 9 includes, in each of the horizontal and vertical directions, the MPEG2 image compression information (bit stream) in the input. 8
An image information conversion apparatus that performs decoding processing using all of the following discrete cosine transform coefficients is described. However, the description is not limited to the apparatus illustrated in FIG. 1. Only the horizontal direction, or both the horizontal and vertical directions, By performing decoding using only low-frequency components of the eighth-order discrete cosine transform coefficients, it is possible to reduce the amount of computation and video memory capacity involved in decoding while minimizing image quality degradation.

【００８５】間引き部１０によって生成された順次走査
画像はＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）
１１によってフレーム内で符号化されたＩ−ＶＯＰ及び
表示順序で順方向を参照して予測符号化されたＰ−ＶＯ
Ｐに符号化され、ＭＰＥＧ４画像圧縮情報（ビットスト
リーム）として出力される。The progressive scan image generated by the thinning unit 10 is an MPEG4 image information encoding unit (I / P-VOP)
11 and an I-VOP coded in the frame and a P-VO coded predictively with reference to the forward direction in the display order.
P is encoded and output as MPEG4 image compression information (bit stream).

【００８６】尚、ＶＯＰはVideo object Planeを意味
し、ＭＰＥＧ２におけるフレームに相当するものであ
る。また、Ｉ−ＶＯＰはＩピクチャに対応するイントラ
符号化ＶＯＰ、Ｐ−ＶＯＰはＰピクチャに対応する順方
向予測符号化ＶＯＰ、Ｂ−ＶＯＰはＢピクチャに対応す
る双方向予測符号化ＶＯＰである。VOP means Video Object Plane, which corresponds to a frame in MPEG2. Further, I-VOP is an intra-coded VOP corresponding to an I picture, P-VOP is a forward predictive coded VOP corresponding to a P picture, and B-VOP is a bidirectional predicted coded VOP corresponding to a B picture.

【００８７】ＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−Ｖ
ＯＰ）１１における符号化の際には、入力となるＭＰＥ
Ｇ２画像圧縮情報（ビットストリーム）中の動きベクト
ル情報は、動きベクトル合成部１２において間引き後の
画像情報に対する動きベクトルにマッピングされ、動き
ベクトル検出部１３においては、動きベクトル合成部１
２において合成された動きベクトル値を元に高精度の動
きベクトルを検出する。The MPEG4 image information encoding unit (I / P-V
OP) 11, the input MPE
The motion vector information in the G2 image compression information (bit stream) is mapped to a motion vector for the decimated image information in the motion vector synthesizing unit 12, and the motion vector synthesizing unit 1 in the motion vector detecting unit 13.
2, a highly accurate motion vector is detected based on the motion vector value synthesized.

【００８８】また、ＭＰＥＧ４画像情報符号化部（Ｉ／
Ｐ−ＶＯＰ）１１は、コンプレキシティ算出部１５から
コンプレキシティの近似値が与えられるので、この近似
値に基づいて各ＶＯＰに対する目標符号量（ターゲット
ビット）を算出し、この目標符号量（ターゲットビッ
ト）に応じて符号化を行う。The MPEG4 image information encoding unit (I /
The P-VOP) 11 receives an approximate value of the complexity from the complexity calculating unit 15, and calculates a target code amount (target bit) for each VOP based on the approximate value. Encoding is performed according to the target bit).

【００８９】コンプレキシティ算出部１５においては、
情報バッファ１４に格納されたフレーム毎の情報Ｑ及び
Ｂから、当該フレームに対するコンプレキシティＸを式
（１９）により算出する。In the complexity calculating section 15,
From the information Q and B for each frame stored in the information buffer 14, the complexity X for the frame is calculated by equation (19).

【００９０】[0090]

【数２４】 (Equation 24)

【００９１】式（１９）によって算出された、当該フレ
ームに対するコンプレキシティＸは、１ＧＯＶ（group
of VOPs）分バッファリングされた後、ＭＰＥＧ４画像
情報符号化部（Ｉ／Ｐ−ＶＯＰ）１１に符号量制御のた
めの媒介変数として伝送される。このため、１ＧＯＶ分
の遅延が必要となる。この遅延は図示しない遅延バッフ
ァを用いて実現される。ここで、ＧＯＶとは、ランダム
アクセス可能なＶＯＰの組である。The complexity X for the frame calculated by the equation (19) is 1 GOV (group
After being buffered for the amount of VOPs), it is transmitted to the MPEG4 image information encoding unit (I / P-VOP) 11 as a parameter for controlling the code amount. Therefore, a delay of 1 GOV is required. This delay is realized using a delay buffer (not shown). Here, GOV is a set of VOPs that can be randomly accessed.

【００９２】以下では、式（１９）において算出され
た、ＧＯＶ内の各フレームに対するコンプレキシティＸ
が、ＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）１
１においてどのように用いられるかについて述べる。
尚、以下では、ピクチャタイプ判別部７がこの画像情報
変換装置内に存在せず、フレームレートの変換を行わな
い場合をも考慮することにする。In the following, the complexity X for each frame in the GOV calculated in equation (19) will be described.
Is an MPEG4 image information encoding unit (I / P-VOP) 1
1 will be described.
In the following, a case where the picture type determination unit 7 does not exist in the image information conversion apparatus and does not perform the frame rate conversion will be considered.

【００９３】式（１８）によつて求められたＫ_P，Ｋ_Bの
意味するところは、Ｉ−ＶＯＰに対する理想的な平均量
子化スケールＱ_{i_ideal}に対するＰ−ＶＯＰ／Ｂ−ＶＯ
Ｐに対する理想的な平均量子化スケールＱ_{p_ideal}，Ｑ
_{b_ideal}の比が、式（２０）によって与えられるという
ことである。The meaning of K _P and K _B obtained by the equation (18) means that P-VOP / B-VO for the ideal average quantization scale Q _{i_ideal} for I-VOP.
Ideal average quantization scale Q _{p_ideal} , Q for P
_That is, the ratio of _{b_ideal} is given by equation (20).

【００９４】[0094]

【数２５】 (Equation 25)

【００９５】ＭＰＥＧ２ＴｅｓｔＭｏｄｅｌ５にお
いては、式（１８）のように適応的にＫ_p，Ｋ_bを算出す
ることを行わず、式（３）に示したような固定値を用い
ている。The MPEG2 Test Model 5 does not adaptively calculate K _p and K _b as in equation (18), but uses a fixed value as shown in equation (3).

【００９６】式（１８）及び式（２０）から、或るＶＯ
Ｐ１と、或るＶＯＰ２に対するコンプレキシテイをそれ
ぞれＸ₁，Ｘ₂とし、理想的な量子化スケールをＱ
_{1_ideal}，Ｑ_{2_ideal}とすれば、式（２１）のようにな
る。From equations (18) and (20), a certain VO
Let P 1 and the complexity for a certain VOP 2 be X ₁ and X ₂ respectively, and let the ideal quantization scale be Q
_Assuming that _{1_ideal} and Q _{2_ideal} , equation (21) is obtained.

【００９７】[0097]

【数２６】 (Equation 26)

【００９８】或いはまた、ＭＰＥＧ２ＴｅｓｔＭｏｄ
ｅｌ５のように、式（３）に示した固定値を用いたい
場合には、式（２１）に代えて、式（２２）のようにす
れば良い。Alternatively, MPEG2 Test Mod
When it is desired to use the fixed value shown in the equation (3) as in el 5, the equation (22) may be used instead of the equation (21).

【００９９】[0099]

【数２７】 [Equation 27]

【０１００】今、ＧＯＶ内の未符号化されたＶＯＰに対
して割り当てられる総符号量（ビット数）をＲとし、Ｒ
が、各ＶＯＰに対して、Ｒ₁，Ｒ₂，…Ｒ_nといったよう
に割り当てられる時、当該ＧＯＶに対する画質が最適化
されるものとする。ここでＲとＲ₁，Ｒ₂，…Ｒ_nの間に
は式（２３）のような関係式が成り立つ。Now, let R be the total code amount (the number of bits) allocated to the uncoded VOP in the GOV.
Are assigned to each VOP as R ₁ , R ₂ ,... R _n , the image quality for the GOV is optimized. Here, a relational expression such as Expression (23) is established between R and R ₁ , R ₂ ,... R _n .

【０１０１】[0101]

【数２８】 [Equation 28]

【０１０２】或るＶＯＰ_kに対する平均量子化スケール
Ｑ_k、割当符号量Ｒ_k、コンプレキシティＸ_kの間には式
（２４）なる関係があることにも注意して、式（２３）
を変形すれば式（２５）が得られる。[0102] Some VOP average quantization scale for _k Q _k, assigned code amount R _k, between the complexity X _k Note also that there is a relation of equation (24), formula (23)
Equation (25) is obtained by transforming

【０１０３】[0103]

【数２９】 (Equation 29)

【０１０４】[0104]

【数３０】 [Equation 30]

【０１０５】式（２５）において、Ｋ（Ｘ₁，Ｘ₂）に関
しては、式（２１）に示した値を用いても、式（２２）
に示した値を用いても良いが、前者の方が、画像に応じ
た、より最適な符号量配分を実現することが可能であ
る。その際、１／（１＋ｍ）の値を１．０と設定するこ
とで、指数演算を行うことが不要となり、高速な実行が
可能となる。また、１／（１＋ｍ）の値を１．０以外に
設定する場合にも、予めテーブルを持ち、これを参照し
て指数演算を行うことで高速な実行が可能となる。In equation (25), regarding K (X ₁ , X ₂ ), even if the value shown in equation (21) is used, equation (22)
May be used, but the former can realize more optimal code amount distribution according to the image. At this time, by setting the value of 1 / (1 + m) to 1.0, it is not necessary to perform an exponential operation, and high-speed execution is possible. Even when the value of 1 / (1 + m) is set to a value other than 1.0, high-speed execution can be performed by holding a table in advance and performing an exponential operation with reference to the table.

【０１０６】式（２５）における各ＶＯＰに対するコン
プレキシティＸ_kはＭＰＥＧ４画像符号化によるもので
あるが、ＭＰＥＧ２画像符号化による各フレームに対す
るコンプレキシティと、ＭＰＥＧ４画像符号化による各
フレームに対するコンプレキシティが等しいと仮定すれ
ば、コンプレキシティ算出部１５に格納されたＸ_kを用
いることで、式（２５）によって当該ＶＯＰに対する目
標符号量を算出することが可能である。The complexity X _k for each VOP in the equation (25) is based on the MPEG4 image coding. The complexity for each frame based on the MPEG2 image coding and the complexity for each frame based on the MPEG4 image coding are shown. Assuming that the cities are equal, it is possible to calculate the target code amount for the VOP by using equation (25) by using X _k stored in the complexity calculating unit 15.

【０１０７】この目標符号量の算出のフローを図２に示
す。最初のステップＳ２１において、圧縮情報解析部８
は、ＭＰＥＧ２画像情報復号化部９における復号処理に
用いられるＧＯＰ内の各フレームに対する平均量子化ス
ケールＱ，及び割当符号量（ビット数）Ｂを抽出する。FIG. 2 shows a flow of calculating the target code amount. In the first step S21, the compression information analysis unit 8
Extracts an average quantization scale Q and an allocated code amount (number of bits) B for each frame in a GOP used for decoding processing in the MPEG2 image information decoding unit 9.

【０１０８】ステップＳ２２において、コンプレキシテ
ィ算出部１５は、平均量子化スケールＱ及び割当符号量
（ビット数）Ｂの積で与えられるコンプレキシティＸを
算出する。In step S22, the complexity calculating section 15 calculates a complexity X given by the product of the average quantization scale Q and the allocated code amount (number of bits) B.

【０１０９】ステップＳ２３において、ＭＰＥＧ４画像
符号化部（Ｉ／Ｐ−ＶＯＰ）１１は、コンプレキシティ
Ｘに応じた目標符号量（ターゲットビット）を算出す
る。In step S23, the MPEG4 image encoding unit (I / P-VOP) 11 calculates a target code amount (target bit) according to the complexity X.

【０１１０】ＭＰＥＧ２ＴｅｓｔＭｏｄｅｌ５で
は、ＧＯＰ内におけるＩ，Ｐ，Ｂピクチャに対するコン
プレキシティＸ_i，Ｘ_p，Ｘ_bは一定であると仮定してい
るが実際にはシーンチェンジをＧＯＰ内に含む場合や、
ＧＯＰ内で背景が著しく変化する場合等ではこの仮定が
成り立たず、安定した符号量制御の妨げとなり、画質劣
化の要因ともなる。図１に示した画像情報変換装置にお
いては、そのような場合にも、入力となるＭＰＥＧ２画
像圧縮情報（ビットストリーム）における、各フレーム
に対するコンプレキシティに基づいた符号量制御を行う
ため、画質劣化を引き起こすことなく、安定した符号量
制御を行うことが可能である。In MPEG2 Test Model 5, it is assumed that the complexities X _i , X _p , and X _b for the I, P, and B pictures in the GOP are constant, but a scene change is actually included in the GOP. Or
This assumption does not hold when the background changes significantly in the GOP, hinders stable code amount control, and causes image quality deterioration. In the image information conversion apparatus shown in FIG. 1, even in such a case, the code amount control based on the complexity for each frame in the input MPEG2 image compression information (bit stream) is performed. , It is possible to perform stable code amount control.

【０１１１】次に、画像情報変換装置の他の例につい
て、図３を参照して説明する。Next, another example of the image information conversion apparatus will be described with reference to FIG.

【０１１２】この画像情報変換装置は、ピクチャタイプ
判別部１６と、スライスヘッダ解析部１８を備える圧縮
情報解析部１７と、ＭＰＥＧ２画像情報復号化部（Ｉ／
Ｐピクチャ）１９と、間引き部２０と、ＭＰＥＧ４画像
情報符号化部（Ｉ／Ｐ−ＶＯＰ）２１と、動きベクトル
合成部２２と、動きベクトル検出部２３と、情報バッフ
ァ２４と、平均量子化スケール算出部２５と、コンプレ
キシティ算出部２６とから構成される。This image information conversion apparatus includes a picture type discrimination section 16, a compression information analysis section 17 having a slice header analysis section 18, and an MPEG2 image information decoding section (I /
P picture) 19, a thinning-out unit 20, an MPEG4 image information encoding unit (I / P-VOP) 21, a motion vector combining unit 22, a motion vector detecting unit 23, an information buffer 24, an average quantization scale It comprises a calculation unit 25 and a complexity calculation unit 26.

【０１１３】図３において、圧縮情報解析部１７、スラ
イスヘッダ解析部１８、情報バッファ２４、平均量子化
スケール算出部２５、コンプレキシティ算出部２６にお
ける符号量制御以外の動作については、図１に示した画
像情報変換装置と同様であるため、以下ではこれらによ
る符号量制御の動作を中心に述べることにする。In FIG. 3, the operations other than the code amount control in the compression information analysis unit 17, slice header analysis unit 18, information buffer 24, average quantization scale calculation unit 25, and complexity calculation unit 26 are shown in FIG. Since the configuration is the same as that of the image information conversion apparatus shown, the operation of controlling the code amount by these will be mainly described below.

【０１１４】圧縮情報解析部１７においては、ピクチャ
タイプ判別部１６から入力されたＩ／Ｐピクチャのみか
らなるＭＰＥＧ２画像圧縮情報（ビットストリーム）に
対して、各フレームに対して割り当てられた符号量（ビ
ット数）に関する情報Ｂが抽出され、この情報が情報バ
ッファ２４へと伝送される。同時に、圧縮情報解析部１
７に内在するスライスヘッダ解析部１８においては、入
力となるＭＰＥＧ２画像圧縮情報（ビットストリーム）
において、スライスレイヤーのパーシングを行い、スラ
イスヘッダに含まれる情報であるquantizer_scale_code
を抽出し、これを平均量子化スケール算出部２５に伝送
する。In the compression information analysis unit 17, for the MPEG2 image compression information (bit stream) consisting of only the I / P pictures input from the picture type discrimination unit 16, the code amount (bit stream) allocated to each frame Information B on the number of bits is extracted, and this information is transmitted to the information buffer 24. At the same time, the compression information analyzer 1
7, the slice header analysis unit 18 receives the input MPEG2 image compression information (bit stream).
In the, the slice layer is parsed, and the quantizer_scale_code
Is extracted and transmitted to the average quantization scale calculation unit 25.

【０１１５】ここで、スライスとは、ＭＰＥＧ２画像圧
縮情報（ビットストリーム）の画像を構成するマクロブ
ロック列であり、quantizer_scale_codeは、スライスに
含まれる最初のマクロブロックの量子化スケールを示す
ものである。Here, a slice is a macroblock sequence constituting an image of MPEG2 image compression information (bit stream), and quantizer_scale_code indicates a quantization scale of the first macroblock included in the slice.

【０１１６】ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピク
チャ）１９における処理は通常のＭＰＥＧ２画像情報復
号化装置と同様に、ＭＰＥＧ２画像圧縮情報（ビットス
トリーム）を画像信号に復号するものである。The processing in the MPEG2 image information decoding section (I / P picture) 19 is for decoding the MPEG2 image compression information (bit stream) into an image signal, similarly to a normal MPEG2 image information decoding apparatus.

【０１１７】ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピク
チャ）１９の出力となる画素値は、間引き部２０に入力
される。間引き部２０は、水平方向には１／２の間引き
処理を施し、垂直方向には、第一フィールド若しくは第
二フィールドのどちらか一方のデータのみを残し、もう
一方を廃棄する。このような間引きによって、入力とな
る画像情報の１／４の大きさを持つ順次走査画像を生成
する。The pixel value output from the MPEG2 image information decoding section (I / P picture) 19 is input to the thinning section 20. The thinning unit 20 performs 1/2 thinning processing in the horizontal direction, leaves only data of one of the first field and the second field in the vertical direction, and discards the other. By such thinning, a progressively scanned image having a size of ４ of the input image information is generated.

【０１１８】間引き部２０によって生成された順次走査
画像はＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）
２１によってフレーム内で符号化されたＩ−ＶＯＰ及び
表示順序で順方向を参照して予測符号化されたＰ−ＶＯ
Ｐに符号化され、ＭＰＥＧ４画像圧縮情報（ビットスト
リーム）として出力される。The progressive scan image generated by the thinning section 20 is an MPEG4 image information encoding section (I / P-VOP)
21 and an I-VOP encoded in a frame and a P-VO encoded by prediction with reference to the forward direction in the display order.
P is encoded and output as MPEG4 image compression information (bit stream).

【０１１９】ＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−Ｖ
ＯＰ）２１における符号化の際には、入力となるＭＰＥ
Ｇ２画像圧縮情報（ビットストリーム）中の動きベクト
ル情報は、動きベクトル合成部２２において間引き後の
画像情報に対する動きベクトルにマッピングされ、動き
ベクトル検出部２３においては、動きベクトル合成部２
２において合成された動きベクトル値を元に高精度の動
きベクトルを検出する。The MPEG4 image information encoding unit (I / P-V
OP) 21, the input MPE
The motion vector information in the G2 image compression information (bit stream) is mapped to a motion vector for the decimated image information in the motion vector synthesizing unit 22, and in the motion vector detecting unit 23,
2, a highly accurate motion vector is detected based on the motion vector value synthesized.

【０１２０】また、ＭＰＥＧ４画像情報符号化部（Ｉ／
Ｐ−ＶＯＰ）２１は、コンプレキシティ算出部２６から
コンプレキシティの近似値が与えられるので、この近似
値に基づいて各ＶＯＰに対する目標符号量（ターゲット
ビット）を算出し、この目標符号量（ターゲットビッ
ト）に応じて符号化を行う。Further, the MPEG4 image information encoding section (I /
The P-VOP) 21 receives an approximate value of the complexity from the complexity calculating unit 26, and calculates a target code amount (target bit) for each VOP based on the approximate value, and calculates the target code amount (target bit amount). Encoding is performed according to the target bit).

【０１２１】平均量子化スケール算出部２５において
は、各スライスに対するquantizer_scale_codeであるｑ
＿ｓｌｉｃｅ_j、及び各フレームに含まれるスライスの
数Ｎ_sli _ceより、各フレームに対する平均量子化スケー
ルの近似値Ｑ_estiを式（２６）により算出する。In the average quantization scale calculation unit 25, q which is the quantizer_scale_code for each slice
_Slice _j, and than the number N _sli _ce of slices included in each frame, the approximate value Q _ESTI average quantizer scale for each frame is calculated by the equation (26).

【０１２２】[0122]

【数３１】 (Equation 31)

【０１２３】コンプレキシティ算出部２６においては、
情報バッファ２４に格納されている、各フレームに対し
て割り当てられた符号量（ビット数）に関する情報Ｂ、
及び、平均量子化スケール算出部２５において、式（２
６）によって算出された、各フレームに対するＱ_estiを
用いて、各フレームに対するコンプレキシティの近似値
Ｘ_estiを式（２７）により算出する。In the complexity calculating section 26,
Information B on the code amount (the number of bits) allocated to each frame stored in the information buffer 24;
And, in the average quantization scale calculation unit 25, the expression (2)
Using Q _esti for each frame calculated in 6), an approximate value X _esti of the _complexity for each frame is calculated by equation (27).

【０１２４】[0124]

【数３２】 (Equation 32)

【０１２５】コンプレキシティ算出部２６において、式
（２７）によって算出された、１ＧＯＶ内の各ＶＯＰに
対するコンプレキシティＸ_{k_esti}は、ＭＰＥＧ４画像情
報符号化部（Ｉ／Ｐ−ＶＯＰ）２１に伝送され、式（２
５）におけるＸ_kとして、各ＶＯＰに対する目標符号量
（ターゲットビット）を算出するのに用いられる。図４
に、ＭＰＥＧ４画像情報符号化部（Ｉ／Ｐ−ＶＯＰ）１
４における、各ＶＯＰに対する目標符号量（ターゲット
ビット）算出の一連のフローを示す。The complexity calculating section 26 calculates the complexity X _{k_esti} for each VOP in one GOV calculated by the equation (27), and transmits the complexity X _{k_esti} to the MPEG4 image information coding section (I / P-VOP) 21. , Equation (2)
As X _k in 5), used to calculate the target code amount (target bit) for each VOP. FIG.
, An MPEG4 image information encoding unit (I / P-VOP) 1
4 shows a series of flows for calculating a target code amount (target bits) for each VOP in FIG.

【０１２６】最初のステップＳ３１において、圧縮情報
解析部１７は、ＭＰＥＧ２画像圧縮情報（ビットストリ
ーム）の構文解析を行う。ステップＳ３２において、情
報バッファ２４は、ステップＳ３１の構文解析により明
らかになった各フレームに割り当てられた符号量（ビッ
ト数）Ｂを格納する。In the first step S31, the compression information analyzer 17 analyzes the syntax of MPEG2 image compression information (bit stream). In step S32, the information buffer 24 stores the code amount (the number of bits) B assigned to each frame, which is clarified by the syntax analysis in step S31.

【０１２７】一方、ステップＳ３３において、圧縮情報
解析部１７に備えられるスライスヘッダ解析部１８は、
ＭＰＥＧ２画像圧縮情報（ビットストリーム）のスライ
スヘッダにあるquantizer_scale_codeを抽出する。ステ
ップＳ３４においては、平均量子化スケール算出部２５
は、スライスヘッダ解析部１８にて抽出されたquantize
r_scale_codeに基づいて、各フレームに対する平均量子
スケールの近似値Ｑ_es _tiを算出する。On the other hand, in step S33, the slice header analysis unit 18 provided in the compression information analysis unit 17
The quantizer_scale_code in the slice header of the MPEG2 image compression information (bit stream) is extracted. In step S34, the average quantization scale calculation unit 25
Is the quantize extracted by the slice header analysis unit 18.
Based on R_scale_code, to calculate the approximate value Q _es _ti of average quantization scale for each frame.

【０１２８】ステップＳ３５においては、コンプレキシ
ティ算出部２６は、情報バッファ２４に格納された符号
量（ビット数）Ｂ及び算出した各フレームに対する平均
量子化スケールの近似値Ｑ_estiに基づいて、コンプレキ
シティの近似値Ｘ_estiを平均量子化スケールの近似値Ｑ
_esti及び符号量（ビット数）の近似値Ｂ_estiの積Ｑ_es _ti
×Ｂ_estiとして算出する。In step S35, the complexity calculating section 26 calculates the complexity based on the code amount (number of bits) B stored in the information buffer 24 and the calculated approximate value Q _esti of the average quantization scale for each frame. The approximate value X _{esti of the} chisity is converted to the approximate value Q of the average quantization scale.
_ESTI and product Q _es _ti approximation B _ESTI code quantity (bit number)
X Calculate as _Besti .

【０１２９】ステップＳ３６においては、ＭＰＥＧ４画
像情報符号化部（Ｉ／Ｐ−ＶＯＰ）２１は、コンプレキ
シティの近似値Ｘ_estiに応じた目標符号量（ターゲット
ビット）を算出する。In step S36, the MPEG4 image information encoding unit (I / P-VOP) 21 calculates a target code amount (target bit) according to the approximate value X _esti of the complexity.

【０１３０】この画像情報変換装置の他の例は、図１に
示した画像情報変換装置の一例の圧縮画像解析装置８に
おいて各フレームに対する量子化スケールを知るために
行ったマクロブロックレイヤーのためのパーシングの必
要がないので、演算の処理量が低減されている。[0130] Another example of the image information conversion apparatus is a compressed image analysis apparatus 8 as an example of the image information conversion apparatus shown in FIG. Since there is no need for parsing, the amount of calculation processing is reduced.

【０１３１】以上、入力としてＭＰＥＧ２画像圧縮情報
（ビットストリーム）を、出力としてＭＰＥＧ４画像圧
縮情報（ビットストリーム）を対象としてきたが、入
力、出力ともこれに限らず、例えばＭＰＥＧ−１やＨ．
２６３などの画像圧縮情報（ビットストリーム）でも良
い。As described above, the MPEG2 image compression information (bit stream) has been used as the input, and the MPEG4 image compression information (bit stream) has been used as the output. However, the input and output are not limited to this.
H.263 or other image compression information (bit stream).

【０１３２】[0132]

【発明の効果】以上述べてきた様に、本発明は、飛び越
し走査のＭＰＥＧ２画像圧縮情報（ビットストリーム）
を入力とし、入力となるＭＰＥＧ２画像圧縮情報（ビッ
トストリーム）内における各フレームにおいて、スライ
スヘッダに含まれる情報であるquantizer_scale_codeを
用いてフレーム全体に対する平均量子化スケールを近似
的に算出し、この平均量子化スケールの近似値を用いて
算出される、入力となるＭＰＥＧ２画像圧縮情報（ビッ
トストリーム）における各フレームに対するコンプレキ
シティの近似値を利用することで、より少ない処理量で
各ＶＯＰに対する符号量割当を画像に対して最適化し、
順次走査のＭＰＥＧ４画像圧縮情報（ビットストリー
ム）に変換して出力する手段を提供するものである。As described above, the present invention provides MPEG2 image compression information (bit stream) for interlaced scanning.
Is input, and in each frame in the input MPEG2 image compression information (bit stream), an average quantization scale for the entire frame is approximately calculated using quantizer_scale_code which is information included in a slice header. Code amount allocation for each VOP with a smaller processing amount by using the approximate value of the complexity of each frame in the input MPEG2 image compression information (bit stream) calculated using the approximate value of the quantization scale Is optimized for the image,
It provides a means for converting into progressively scanned MPEG4 image compression information (bit stream) and outputting it.

[Brief description of the drawings]

【図１】画像情報処理装置の一例の構成を示すブロック
図である。FIG. 1 is a block diagram illustrating a configuration of an example of an image information processing apparatus.

【図２】コンプレキシティＸを用いて符号量制御を行う
フローチャートである。FIG. 2 is a flowchart for performing code amount control using a complexity X;

【図３】画像情報処理装置の他の例の構成を示すブロッ
ク図である。FIG. 3 is a block diagram illustrating a configuration of another example of the image information processing apparatus.

【図４】ＶＯＰに対する目標符号量（ターゲットビッ
ト）算出を示すフローチャートである。FIG. 4 is a flowchart showing calculation of a target code amount (target bit) for a VOP.

【図５】従来の画像情報変換装置の構成を示すブロック
図である。FIG. 5 is a block diagram showing a configuration of a conventional image information conversion device.

【図６】符号量制御方式の動作原理を示すフローチャー
トである。FIG. 6 is a flowchart showing the operation principle of the code amount control method.

[Explanation of symbols]

７ピクチャタイプ判別部、８圧縮情報解析部、９
ＭＰＥＧ２画像情報復号化部（Ｉ／Ｐピクチャ）、１０
間引き部、１１ＭＰＥＧ４画像情報符号化部（Ｉ／
Ｐ−ＶＯＰ）、１２動きベクトル合成部、１３動き
ベクトル検出部7 picture type discriminator, 8 compression information analyzer, 9
MPEG2 image information decoding unit (I / P picture), 10
Thinning unit, 11 MPEG4 image information encoding unit (I /
P-VOP), 12 motion vector synthesizer, 13 motion vector detector

───────────────────────────────────────────────────── フロントページの続き (72)発明者鈴木輝彦東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者矢ケ崎陽一東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5C059 KK41 LB05 LB07 MA00 MA03 MA05 MA13 MA23 MB01 MC11 NN01 PP05 PP06 PP07 RC14 RC28 SS10 TA60 TB05 TB18 TC04 TC06 TC10 TC18 TD03 TD16 UA02 UA34 5J064 AA02 BB01 BB13 BC14 BC21 BC25 BD01 ──────────────────────────────────────────────────続き Continued on the front page (72) Inventor Teruhiko Suzuki 6-7-35 Kita-Shinagawa, Shinagawa-ku, Tokyo Inside Sony Corporation (72) Inventor Yoichi Yagasaki 6-35, Kita-Shinagawa, Shinagawa-ku, Tokyo Sony Corporation F term (reference) 5C059 KK41 LB05 LB07 MA00 MA03 MA05 MA13 MA23 MB01 MC11 NN01 PP05 PP06 PP07 RC14 RC28 SS10 TA60 TB05 TB18 TC04 TC06 TC10 TC18 TD03 TD16 UA02 UA34 5J064 AA02 BB01 BC25

Claims

[Claims]

An image information conversion apparatus for converting input image compression information of interlaced scanning compressed by a first compression encoding method into output image compression information of progressive scanning compressed by a second compression encoding method. In the above, using the quantization scale information included in a predetermined position of the input image compression information compressed by the first compression encoding method,
The approximate value of the average quantization scale of each frame of the input image compression information is obtained, and the approximate value of the complexity of the image of the frame is obtained using the approximate value and the code amount allocated to each frame. An image information conversion device comprising code amount control means for controlling a code amount in encoding of output image compression information based on a value.

2. A compression information analyzing means for analyzing the input image compression information and extracting at least a code amount allocated to each frame of the input image compression information, and storing the code amount and delaying for a predetermined period. 2. The image information conversion device according to claim 1, further comprising an information buffer for causing the image information to be converted.

3. The input image compression information and the output image compression information are configured in units of pixel blocks constituting a frame, the predetermined position is a header of a pixel block sequence, and the compression information analysis is performed. 3. The image information conversion apparatus according to claim 2, wherein said means comprises a header extracting means for extracting a header of said pixel block sequence.

4. Based on quantization scale information of each pixel block column and the number of pixel blocks included in a frame,
2. The image information conversion apparatus according to claim 1, wherein an approximate value of the average quantization scale for each frame is calculated by the following equation. (Equation 1) Here, the approximate value of the average quantization scale for each frame is Q _esti, and the quantization scale of the j-th pixel block sequence is
Let q_slice _j and the number of pixel blocks included in each frame be N _slice .

5. The image information conversion apparatus according to claim 4, wherein an approximate value of the complexity of each frame of the input image compression information is calculated by the following equation. (Equation 2) Here, the approximate value of the complexity for each frame is X _esti , the approximate value of the average quantization scale for each frame is Q _esti, and the allocated code amount for each frame is B
And

6. The first compression encoding method is MPEG.
2. The second compression encoding method is MPEG4, and VOP ₁ ,... VO corresponding to the image of the output image compression information
Picture information converting apparatus according to claim 5, wherein the target code amount R _1, characterized in that given by the following expression for VOP ₁ in the random accessible unit GOV composed of P _n. (Equation 3) Here, the code amount assigned to the entire image that has not been encoded by the GOV of the output image compression information is R, VOP ₁ ,
VOP _2, ... complexity of the approximate value X _{1_Esti} for _VOP _n, X 2_exti, ... to the _X n_esti, the function whose variable is the complexity X _k and _{_{X l K (X k, X}} l) and.

7. The image information conversion apparatus according to claim 6, wherein said function is given by the following equation. (Equation 4)

8. The image information conversion device according to claim 7, wherein the value of the exponent 1 / (1 + m) of the function is 0.6 to 1.2.

9. The image information conversion apparatus according to claim 7, wherein a value of an exponent 1 / (1 + m) of said function is set to 1.0.

10. The image information conversion apparatus according to claim 7, wherein the operation of the function is performed by referring to a table prepared in advance.

11. The image information conversion apparatus according to claim 6, wherein said function is given by the following equation. (Equation 5) Note that _Kp = 1.0 and _Kb = 1.4.

12. Decoding means for decoding the input image compression information, and rate conversion means for performing a rate conversion by thinning out the image information decoded by the decoding means,
The rate conversion means passes the forward coded image predicted and coded with reference to the forward direction in the intra coded image coded in the frame and the display order in the display order. 2. The image information conversion apparatus according to claim 1, wherein the bidirectionally predicted coded image that has been predictively coded by reference is discarded.

13. An image information conversion method for converting input image compression information of interlaced scanning compressed by a first compression encoding method into output image compression information of progressive scanning compressed by a second compression encoding method. In the above, using the quantization scale information included in a predetermined position of the input image compression information compressed by the first compression encoding method,
Obtaining an approximate value of the average quantization scale of each frame of the input image compression information; and obtaining an approximate value of the complexity of the image of the frame using the approximate value and the code amount allocated to each frame. And controlling a code amount in coding of the output image compression information based on the approximate value.

14. A step of analyzing the input image compression information to extract at least a code amount assigned to each frame of the input image compression information, storing the code amount in the information buffer, and extracting the code amount for a predetermined period. 14. The method according to claim 13, further comprising the step of delaying the input image compression information by delaying.

15. The input image compression information and the output image compression information are configured in units of pixel blocks constituting a frame, wherein the predetermined position is a header of a pixel block sequence, and the pixel block sequence is 15. The method according to claim 14, further comprising the step of:
The described image information conversion method.