JP2002034046A - Method and device for converting image information

Info

Publication number: JP2002034046A
Application number: JP2000215111A
Authority: JP
Inventors: Kazufumi Sato; 数史佐藤; Kuniaki Takahashi; 邦明高橋; Teruhiko Suzuki; 輝彦鈴木; Yoichi Yagasaki; 陽一矢ケ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2000-07-14
Filing date: 2000-07-14
Publication date: 2002-01-31

Abstract

PROBLEM TO BE SOLVED: To simplify processing when macroblocks are reduced. SOLUTION: The apparatus comprises an MPEG2 image information decoding unit (I/P picture 2×8 down decoder) 26 that decodes an interlaced image using only the 2×8 components of the 8×8 discrete cosine transform coefficients of each macroblock making up interlaced-scan MPEG2 image compression information; a scan conversion unit 27 that selects either the first field or the second field of the decoded interlaced image to generate a progressive-scan image; and an MPEG4 image information encoding unit (I/P-VOP) 28 that encodes the generated progressive-scan image into MPEG4 image compression information.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001] [Field of the Invention] The present invention relates to an image information conversion apparatus and method for converting image information. More specifically, it relates to an image information conversion apparatus and method used when image information (a bit stream) compressed by an orthogonal transform such as the discrete cosine transform and by motion compensation, as in MPEG, is received via network media such as satellite broadcasting, cable TV, or the Internet, or is processed on storage media such as optical or magnetic disks.

[0002] [Description of the Related Art] In recent years, image information has come to be handled as digital data, and image compression schemes such as MPEG have been provided that, for efficient transmission and storage, exploit the redundancy peculiar to image information and compress it by an orthogonal transform such as the discrete cosine transform and by motion compensation. Apparatus conforming to such compression schemes is becoming widespread both for information distribution by broadcasting stations and the like and for information reception in ordinary homes.

[0003] In particular, MPEG2 (ISO/IEC 13818-2) is defined as a general-purpose image coding scheme. It is a standard covering both interlaced and progressive scan images as well as standard-resolution and high-definition images, and it is expected to remain in use across a wide range of professional and consumer applications.

[0004] The MPEG2 compression scheme makes it possible to achieve a high compression ratio together with good image quality. For this purpose, a code amount (bit rate) of, for example, 4 to 8 Mbps must be allocated to a standard-resolution interlaced image of 720×480 pixels, and 18 to 22 Mbps to a high-resolution interlaced image of 1920×1088 pixels.

[0005] In digital broadcasting, which is expected to become widespread, image information is likewise transmitted by such a compression scheme. The broadcasting standards include both standard-resolution and high-resolution images, and a receiver is desired to be able to decode both.

[0006] In particular, to coexist with standard-resolution images and to build an inexpensive receiver, high-resolution image information must be decimated in some way while keeping image quality degradation to a minimum. This problem can arise not only in transmission media such as digital broadcasting but also in storage media such as optical disks and flash memory.

[0007] To solve this problem, the present applicant previously proposed a down decoder as shown in Fig. 8. This down decoder is a 4×4 down decoder that converts blocks of 8 components in both the vertical and horizontal directions (hereinafter written 8×8; the same notation applies elsewhere), which make up the coded image of MPEG2 image compression information (a bit stream), into 4×4 blocks.

[0008] This 4×4 down decoder is composed of a code buffer 1, a compression information analysis unit 2, a variable length decoding unit 3, an inverse quantization unit 4, a reduced inverse discrete cosine transform unit (4×4) 5, a reduced inverse discrete cosine transform unit (field separation) 6, an adder 7, a motion compensation unit (field prediction) 8, a motion compensation unit (frame prediction) 9, a video memory 10, and an image frame conversion / phase shift correction unit 11.

[0009] The code buffer 1 temporarily holds the input image compression information, the compression information analysis unit 2 analyzes it, the variable length decoding unit 3 variable-length decodes it, and the inverse quantization unit 4 inversely quantizes the output of the variable length decoding unit 3.

[0010] The reduced inverse discrete cosine transform unit (4×4) 5 applies an inverse discrete cosine transform to only the low-frequency 4×4 components of the 8×8 components output from the inverse quantization unit 4. The reduced inverse discrete cosine transform unit (field separation) 6 separates the first field and the second field that make up the interlaced image.

[0011] The motion compensation unit (field prediction) 8 performs motion compensation on the image supplied from the video memory 10 by predicting motion field by field, and the motion compensation unit (frame prediction) 9 does so frame by frame. The adder 7 adds these outputs to the outputs of the reduced inverse discrete cosine transform unit (4×4) 5 and the reduced inverse discrete cosine transform unit (field separation) 6. The video memory 10 stores the output of the adder 7, and the image frame conversion / phase shift correction unit 11 applies image frame correction and phase shift correction to the image stored in the video memory 10 and outputs the result.

[0012] Depending on whether the discrete cosine transform (DCT) mode of the macroblock is the field DCT mode or the frame DCT mode, the variable length decoding unit 3 may decode only the DCT coefficients needed by the subsequent reduced inverse DCT unit (4×4) 5 or reduced inverse DCT unit (field separation) 6, and then perform no further processing until the EOB (end of block) is detected.

[0013] The operating principle of the variable length decoding unit 3 when the input MPEG2 image compression information (bit stream) uses zigzag scan is described with reference to Fig. 9. The numbers in Fig. 9 indicate the order in which the DCT coefficients are read.

[0014] In the frame DCT mode, as shown in Fig. 9A, only the low-frequency 4×4 DCT coefficients enclosed by the broken line within the 8×8-component macroblock are variable-length decoded for the reduced inverse DCT unit (4×4) 5. In the field DCT mode, as shown in Fig. 9B, only the low-frequency 4×8 DCT coefficients enclosed by the broken line within the 8×8-component macroblock are variable-length decoded for the reduced inverse DCT unit (field separation) 6.
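The coefficient selection described for zigzag scan can be illustrated with a short sketch. It generates the standard 8×8 zigzag order and reports the last scan index that still falls inside the low-frequency region, after which the decoder only needs to wait for the EOB. The function names are hypothetical, and the orientation of the 4×8 region (4 horizontal by 8 vertical components) is an assumption, since Fig. 9 is not reproduced in this text.

```python
def zigzag_order(n=8):
    """Return the (row, col) positions of an n x n block in zigzag scan order."""
    order = []
    for d in range(2 * n - 1):
        # All cells on anti-diagonal d, listed with the row index increasing.
        cells = [(r, d - r) for r in range(n) if 0 <= d - r < n]
        # Even diagonals are traversed from bottom-left to top-right.
        if d % 2 == 0:
            cells.reverse()
        order.extend(cells)
    return order


def last_needed_index(rows, cols, order):
    """Largest scan index whose coefficient lies in the low rows x cols region."""
    return max(i for i, (r, c) in enumerate(order) if r < rows and c < cols)


ORDER = zigzag_order()
# Frame DCT mode: low-frequency 4x4 region.
FRAME_LAST = last_needed_index(4, 4, ORDER)
# Field DCT mode: 4 horizontal x 8 vertical components (assumed orientation).
FIELD_LAST = last_needed_index(8, 4, ORDER)

if __name__ == "__main__":
    print("frame DCT mode: last needed scan index =", FRAME_LAST)
    print("field DCT mode: last needed scan index =", FIELD_LAST)
```

Coefficients at scan indices up to the reported value still have to be parsed, but only those inside the region need to be kept. The alternate scan case would follow the same pattern with a different scan table.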

[0015] The operating principle of the variable length decoding unit 3 when the input MPEG2 image compression information (bit stream) uses alternate scan is described with reference to Fig. 10.

[0016] In the frame DCT mode, as shown in Fig. 10A, only the low-frequency 4×4 DCT coefficients enclosed by the broken line within the 8×8-component macroblock are variable-length decoded for the reduced inverse DCT unit (4×4) 5. In the field DCT mode, as shown in Fig. 10B, only the low-frequency 4×8 DCT coefficients of the 8×8-component macroblock are variable-length decoded for the reduced inverse DCT unit (field separation) 6.

[0017] The DCT coefficients inversely quantized by the inverse quantization unit 4 undergo an inverse DCT in the reduced inverse DCT unit (4×4) 5 when the DCT mode of the macroblock is the frame DCT mode, and in the reduced inverse DCT unit (field separation) 6 when it is the field DCT mode.

[0018] When the macroblock is an intra macroblock, the output of the reduced inverse DCT unit (4×4) 5 or the reduced inverse DCT unit (field separation) 6 is stored in the video memory 10 as it is.

[0019] When the macroblock is an inter macroblock, the output of the reduced inverse DCT unit (4×4) 5 or the reduced inverse DCT unit (field separation) 6 is combined by the adder 7 with a predicted image and output to the video memory 10. The predicted image is generated from reference data in the video memory 10 with interpolation at 1/4-pixel precision in both the horizontal and vertical directions, by the motion compensation unit (field prediction) 8 when the motion compensation mode is the field prediction mode and by the motion compensation unit (frame prediction) 9 when it is the frame prediction mode.

[0020] The pixel values stored in the video memory 10 contain a phase shift between the first field and the second field relative to the pixels of the upper layer, as shown by the upper layer in Fig. 11A and the lower layer in Fig. 11B.

[0021] The upper layer in Fig. 11A shows pixels a1 of the first field and pixels a2 of the second field. The lower layer in Fig. 11B shows pixels b1 of the first field and pixels b2 of the second field. The pixel values of the lower layer in Fig. 11B are obtained by reducing the number of pixels of the upper layer through the reduced inverse DCT, and they contain a phase shift between fields.

[0022] The pixel values stored in the video memory 10 are converted by the image frame conversion / phase shift correction unit 11 to an image frame size suited to the display device, and at the same time the phase shift between fields is corrected.

[0023] The reduced inverse DCT unit (4×4) 5 operates by extracting, in both the horizontal and vertical directions, the low-frequency 4×4 components from the 8×8 DCT coefficients and applying a fourth-order inverse DCT to them.
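As a one-dimensional illustration of this principle, the sketch below keeps the four low-frequency coefficients of an 8-point DCT and applies a 4-point inverse DCT. The orthonormal scaling of the transforms and the 1/√2 factor that compensates for the change in transform length are assumptions made for this example, not values taken from the text; the function names are hypothetical.

```python
import math


def dct(x):
    """Orthonormal DCT-II of a sequence of any length."""
    n = len(x)
    return [
        math.sqrt((1.0 if k == 0 else 2.0) / n)
        * sum(x[i] * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for i in range(n))
        for k in range(n)
    ]


def idct(y):
    """Orthonormal inverse DCT, the inverse of dct() above."""
    n = len(y)
    return [
        sum(
            math.sqrt((1.0 if k == 0 else 2.0) / n)
            * y[k] * math.cos((2 * i + 1) * k * math.pi / (2 * n))
            for k in range(n)
        )
        for i in range(n)
    ]


def reduced_idct_1d(y8):
    """Keep the 4 low-frequency coefficients of 8 and apply a 4-point inverse DCT."""
    low = y8[:4]
    # Dividing by sqrt(2) keeps the signal level unchanged when the transform
    # length drops from 8 to 4 under orthonormal scaling (assumed convention).
    return [v / math.sqrt(2.0) for v in idct(low)]


if __name__ == "__main__":
    samples = [10.0, 12.0, 15.0, 19.0, 24.0, 30.0, 37.0, 45.0]
    print(reduced_idct_1d(dct(samples)))
```

For a two-dimensional block the same operation would be applied separably, first along rows and then along columns.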

[0024] Fig. 12 shows the processing of the reduced inverse DCT unit (field separation type) 6. First, an 8×8 inverse DCT is applied to the DCT coefficients y₁ to y₈, which are the coded data in the input image compression information (bit stream), to obtain decoded data x₁ to x₈. These are then separated into the first-field data x₁, x₃, x₅, x₇ and the second-field data x₂, x₄, x₆, x₈.

[0025] A 4×4 DCT is applied to each of the separated data sequences, giving DCT coefficients z₁, z₃, z₅, z₇ for the first field and z₂, z₄, z₆, z₈ for the second field.

[0026] The DCT coefficients obtained for the first and second fields are then decimated so that only the two low-frequency components remain. That is, z₅ and z₇ are discarded from the first-field coefficients, and z₆ and z₈ from the second-field coefficients. This leaves z₁, z₃ for the first field and z₂, z₄ for the second field.

[0027] A 2×2 inverse DCT is applied to the decimated low-frequency components z₁, z₃ of the first field and z₂, z₄ of the second field, giving reduced pixel values x′₁, x′₃ for the first field and x′₂, x′₄ for the second field.

[0028] These values are recombined into a frame to give the output pixel values x′₁ to x′₄.

[0029] In the actual processing, the pixel values x′₁ to x′₄ are obtained directly by applying to the DCT coefficients y₁ to y₈ a matrix equivalent to this series of operations. This matrix [FS′], obtained by expanding the calculation with the addition theorem, is given by equation (1).
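The series of operations in Fig. 12 and its collapse into a single matrix can be sketched numerically. The code below follows the steps literally (inverse DCT, field separation, per-field DCT, decimation, 2-point inverse DCT, frame synthesis) and then builds a 4×8 matrix by feeding in unit vectors. Orthonormal transforms and the 1/√2 rescaling are assumptions of this sketch, so the resulting entries may differ from the elements A to J of equation (1) by a constant factor; all function names are hypothetical.

```python
import math


def dct(x):
    """Orthonormal DCT-II of a sequence of any length."""
    n = len(x)
    return [
        math.sqrt((1.0 if k == 0 else 2.0) / n)
        * sum(x[i] * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for i in range(n))
        for k in range(n)
    ]


def idct(y):
    """Orthonormal inverse DCT, the inverse of dct() above."""
    n = len(y)
    return [
        sum(
            math.sqrt((1.0 if k == 0 else 2.0) / n)
            * y[k] * math.cos((2 * i + 1) * k * math.pi / (2 * n))
            for k in range(n)
        )
        for i in range(n)
    ]


def field_separated_reduced_idct(y):
    """Step-by-step version of the processing described for Fig. 12."""
    x = idct(y)                          # decoded data x1..x8
    first, second = x[0::2], x[1::2]     # x1,x3,x5,x7 and x2,x4,x6,x8
    z_first, z_second = dct(first), dct(second)
    # Keep the two low-frequency coefficients of each field, then apply a
    # 2-point inverse DCT; 1/sqrt(2) rescales for the 4 -> 2 length change.
    p_first = [v / math.sqrt(2.0) for v in idct(z_first[:2])]
    p_second = [v / math.sqrt(2.0) for v in idct(z_second[:2])]
    # Frame synthesis: interleave the two fields again.
    return [p_first[0], p_second[0], p_first[1], p_second[1]]


def equivalent_matrix():
    """4x8 matrix whose product with y1..y8 equals the step-by-step result."""
    cols = [
        field_separated_reduced_idct([1.0 if j == k else 0.0 for j in range(8)])
        for k in range(8)
    ]
    return [[cols[k][r] for k in range(8)] for r in range(4)]


if __name__ == "__main__":
    for row in equivalent_matrix():
        print(["%.4f" % v for v in row])
```

Because every step is linear, the matrix built this way reproduces the step-by-step result for any input, which is the property the text relies on.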

[0030]

(Equation 1)

[0031] The elements A to J in equation (1) are defined as follows.

[0032]

(Equation 2)

[0033]

(Equation 3)

[0034]

(Equation 4)

[0035]

(Equation 5)

[0036]

(Equation 6)

[0037]

(Equation 7)

[0038]

(Equation 8)

[0039]

(Equation 9)

[0040]

(Equation 10)

[0041]

(Equation 11)

[0042] The 4×4 reduced inverse DCT and the field-separated reduced inverse DCT can both be implemented with a fast algorithm. A method based on Wang's algorithm (reference: Zhong de Wang, "Fast Algorithms for the Discrete W Transform and for the Discrete Fourier Transform", IEEE Trans. ASSP-32, No. 4, pp. 803-816, Aug. 1984) is shown below.

[0043] The matrix representing the 4×4 reduced inverse DCT is decomposed as in equation (2) using Wang's fast algorithm.

[0044]

(Equation 12)

[0045] Equation (2) uses the submatrices and elements defined as follows.

[0046]

(Equation 13)

[0047]

(Equation 14)

[0048]

(Equation 15)

[0049] Fig. 13 shows this configuration. As shown, the device can be implemented with five multipliers and nine adders.

[0050] In Fig. 13, the 0th output element f(0) is obtained by adding the values s2 and s5 in the adder 43.

[0051] Here, the value s2 is the sum of the 0th input element F(0) and the second input element F(2), formed in the adder 31, multiplied by A in the multiplier 34. The value s5 is the first input element F(1) multiplied by C in the multiplier 37, with the value s1 added in the adder 40. The value s1 is the result of subtracting the first input element F(1) from the third input element F(3) in the adder 33, multiplied by D in the multiplier 38.

[0052] The first output element f(1) is obtained by adding the values s3 and s4 in the adder 41.

[0053] Here, the value s3 is the result of subtracting the second input element F(2) from the 0th input element F(0) in the adder 32, multiplied by A in the multiplier 35. The value s4 is the third input element F(3) multiplied by B in the multiplier 36, with the value s1 subtracted in the adder 39.

[0054] The second output element f(2) is obtained by subtracting the value s4 from the value s3 in the adder 42.

[0055] The third output element f(3) is obtained by subtracting the value s5 from the value s2 in the adder 44.
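The network of Fig. 13 can be checked against a direct 4-point inverse DCT with a short sketch. The constants A to D follow the definitions the text gives for the figure, with C_1/8 taken as cos(π/8) by analogy with the stated C_3/8 = cos(3π/8). The unnormalized reference formula in `direct_idct4` is an assumption of this example, and the sketch computes f(2) as s3 − s4, which is the sign that reproduces the reference transform.

```python
import math

C18 = math.cos(math.pi / 8)        # C_1/8, assumed by analogy with C_3/8
C38 = math.cos(3 * math.pi / 8)    # C_3/8 as stated in the text

A = 1 / math.sqrt(2)
B = -C18 + C38
C = C18 + C38
D = C38


def fast_idct4(F):
    """4-point inverse DCT with 5 multiplications and 9 additions."""
    F0, F1, F2, F3 = F
    s1 = D * (F3 - F1)   # adder 33, multiplier 38
    s2 = A * (F0 + F2)   # adder 31, multiplier 34
    s3 = A * (F0 - F2)   # adder 32, multiplier 35
    s4 = B * F3 - s1     # multiplier 36, adder 39
    s5 = C * F1 + s1     # multiplier 37, adder 40
    return [
        s2 + s5,         # f(0), adder 43
        s3 + s4,         # f(1), adder 41
        s3 - s4,         # f(2), adder 42
        s2 - s5,         # f(3), adder 44
    ]


def direct_idct4(F):
    """Reference: f(n) = F(0)/sqrt(2) + sum over k=1..3 of F(k) cos((2n+1)k*pi/8)."""
    return [
        F[0] / math.sqrt(2)
        + sum(F[k] * math.cos((2 * n + 1) * k * math.pi / 8) for k in range(1, 4))
        for n in range(4)
    ]


if __name__ == "__main__":
    coeffs = [3.0, -1.5, 2.25, 0.5]
    print(fast_idct4(coeffs))
    print(direct_idct4(coeffs))
```

The saving comes from sharing s1 between s4 and s5, so the three rotations by cos(π/8) and cos(3π/8) cost three multiplications instead of four.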

[0056] The following quantities are used in the figure.

[0057]
A = 1/√2
B = −C_{1/8} + C_{3/8}
C = C_{1/8} + C_{3/8}
D = C_{3/8}

The expressions above use numbers of the following form; the others are defined in the same way.

[0058]
C_{3/8} = cos(3π/8)

The matrix of equation (1), which represents the field-separated reduced inverse DCT, is decomposed as in equation (3) using Wang's fast algorithm.

[0059]

(Equation 16)

[0060] The submatrices in equation (3) are defined as follows.

[0061]

(Equation 17)

[0062]

(Equation 18)

[0063] The elements A to J are the same as in equation (1). Fig. 14 shows this configuration. As shown, the device can be implemented with ten multipliers and thirteen adders.

[0064] That is, the 0th output element f(0) is the sum of the values s16 and s18, formed in the adder 70.

[0065] Here, the value s16 is the sum of the values s11 and s12, formed in the adder 66. The value s11 is the 0th input element F(0) multiplied by A in the multiplier 51. The value s12 is formed by adding, in the adder 61, the second input element F(2) multiplied by D in the multiplier 52 and the fourth input element F(4) multiplied by F in the multiplier 53, and then adding to that result, in the adder 63, the sixth input element F(6) multiplied by H in the multiplier 54.

[0066] The first output element f(1) is the value s17 with the value s19 subtracted in the adder 73.

[0067] Here, the value s17 is the value s11 with the value s12 subtracted in the adder 67. The value s19 is the sum of the values s13 and s15, formed in the adder 69. The value s13 is the third input element F(3) multiplied by E in the multiplier 55, with the fifth input element F(5) multiplied by G in the multiplier 56 subtracted in the adder 64. The value s15 is the sum, formed in the adder 65, of the first input element F(1) multiplied by C in the multiplier 58 and the seventh input element F(7) multiplied by J in the multiplier 60.

[0068] The second output element f(2) is the sum of the values s17 and s19, formed in the adder 72.

[0069] The third output element f(3) is the value s16 with the value s18 subtracted in the adder 71.

[0070] Here, the value s18 is the sum of the values s13 and s14, formed in the adder 68. The value s14 is the sum, formed in the adder 62, of the first input element F(1) multiplied by B in the multiplier 57 and the seventh input element F(7) multiplied by I in the multiplier 59.
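The structure of the Fig. 14 network can be exercised with a sketch. Because the defining equations for the elements A to J are not reproduced in this text, the code below derives stand-in values numerically from a step-by-step reference computation (orthonormal transforms with a 1/√2 rescaling, both assumptions of this example) and then checks that the network wiring described above reproduces that reference. The dictionary `K` and all function names are hypothetical.

```python
import math


def dct(x):
    """Orthonormal DCT-II of a sequence of any length."""
    n = len(x)
    return [
        math.sqrt((1.0 if k == 0 else 2.0) / n)
        * sum(x[i] * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for i in range(n))
        for k in range(n)
    ]


def idct(y):
    """Orthonormal inverse DCT, the inverse of dct() above."""
    n = len(y)
    return [
        sum(
            math.sqrt((1.0 if k == 0 else 2.0) / n)
            * y[k] * math.cos((2 * i + 1) * k * math.pi / (2 * n))
            for k in range(n)
        )
        for i in range(n)
    ]


def reference(y):
    """Inverse DCT, field split, per-field DCT, keep 2, 2-point IDCT, interleave."""
    x = idct(y)
    halves = []
    for field in (x[0::2], x[1::2]):
        z = dct(field)
        halves.append([v / math.sqrt(2.0) for v in idct(z[:2])])
    return [halves[0][0], halves[1][0], halves[0][1], halves[1][1]]


# Equivalent 4x8 matrix: column k is the response to the k-th unit vector.
_cols = [reference([1.0 if j == k else 0.0 for j in range(8)]) for k in range(8)]
M = [[_cols[k][r] for k in range(8)] for r in range(4)]

# Stand-in values for the elements A..J, read off the matrix (assumption:
# the patent's own definitions may differ by a scale factor).
K = dict(A=M[0][0], B=M[0][1], C=M[2][1], D=M[0][2], E=M[0][3],
         F=M[0][4], G=-M[0][5], H=M[0][6], I=M[0][7], J=M[2][7])


def fast_field_separated(Fin, K):
    """Network of Fig. 14: 10 multiplications and 13 additions."""
    F0, F1, F2, F3, F4, F5, F6, F7 = Fin
    s11 = K["A"] * F0
    s12 = K["D"] * F2 + K["F"] * F4 + K["H"] * F6
    s13 = K["E"] * F3 - K["G"] * F5
    s14 = K["B"] * F1 + K["I"] * F7
    s15 = K["C"] * F1 + K["J"] * F7
    s16 = s11 + s12
    s17 = s11 - s12
    s18 = s13 + s14
    s19 = s13 + s15
    return [s16 + s18, s17 - s19, s17 + s19, s16 - s18]


if __name__ == "__main__":
    y = [10.0, -3.0, 4.5, 1.25, -2.0, 0.75, 3.0, -1.0]
    print(fast_field_separated(y, K))
    print(reference(y))
```

The agreement reflects the symmetry of the DCT basis functions: even-indexed inputs contribute equally to the outer outputs f(0) and f(3), odd-indexed inputs contribute with opposite signs, and the shared terms s12 and s13 are each computed only once.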

[0071] Next, the operation of the motion compensation unit (field prediction) 8 and the motion compensation unit (frame prediction) 9, which correspond to the field motion compensation mode and the frame motion compensation mode, is described. For horizontal interpolation, in both modes, pixels at 1/2-pixel precision are first generated by a 2x interpolation filter such as a half-band filter, and pixels at 1/4-pixel precision are then generated from them by linear interpolation. With a half-band filter, when a pixel value at the same phase as a pixel read from the frame memory is output as the predicted image, no multiply-accumulate operation over the filter taps is needed, so the computation is fast. A half-band filter also allows the division involved in interpolation to be done by a shift operation, which speeds execution further. Alternatively, the pixels needed for motion compensation may be generated directly by 4x interpolation filtering.
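The two-stage horizontal interpolation can be sketched as follows. The 4-tap kernel (−1, 9, 9, −1)/16 is an illustrative half-band interpolation kernel chosen for this example; the text does not specify the taps. Edge handling by clamping and the rounding offsets are likewise assumptions, and the function names are hypothetical.

```python
def _at(samples, j):
    """Read a sample, clamping the index at the borders (assumed edge rule)."""
    return samples[min(max(j, 0), len(samples) - 1)]


def half_pel(samples, i):
    """Value at position i + 1/2 using the illustrative taps (-1, 9, 9, -1)/16."""
    acc = (-_at(samples, i - 1) + 9 * _at(samples, i)
           + 9 * _at(samples, i + 1) - _at(samples, i + 2))
    # Division by 16 is carried out as a right shift, with rounding.
    return (acc + 8) >> 4


def quarter_pel(samples, pos4):
    """Value at position pos4 / 4, where pos4 is given in quarter-pixel units."""
    i, phase = divmod(pos4, 4)
    if phase == 0:
        # Same phase as a stored pixel: no multiply-accumulate is needed.
        return _at(samples, i)
    half = half_pel(samples, i)
    if phase == 2:
        return half
    # Quarter positions: linear interpolation between the nearest stored
    # pixel and the half-pixel value.
    neighbour = _at(samples, i) if phase == 1 else _at(samples, i + 1)
    return (neighbour + half + 1) >> 1


if __name__ == "__main__":
    ramp = [0, 16, 32, 48, 64, 80]
    print([quarter_pel(ramp, p) for p in range(8, 13)])
```

On a linear ramp the interpolated values fall on the ramp itself, which is a convenient sanity check for any choice of taps.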

[0072] Fig. 15 concerns vertical interpolation in the motion compensation unit 8 for the field motion compensation mode. First, according to the value of the motion vector in the input image compression information (bit stream), pixel values containing a phase shift between fields are read from the video memory 10, as in Fig. 15A. The symbols a1 on the left of the figure correspond to pixels of the first field, and the symbols a2 on the right to pixels of the second field. The pixels of the first field and those of the second field are out of phase.

[0073] Next, as in Fig. 15B, pixel values at 1/2-pixel precision are generated within each field using a 2x interpolation filter such as a half-band filter. The pixels generated by 2x interpolation within the first field and the second field are denoted by the symbols b1 and b2, respectively.

[0074] Then, as in Fig. 15C, pixel values at 1/4-pixel precision are generated by linear interpolation within each field. The pixels generated by linear interpolation within the first field and the second field are denoted by the symbols c1 and c2, respectively. With a half-band filter, when a pixel value at the same phase as a pixel read from the frame memory is output as the predicted image, no multiply-accumulate operation over the filter taps is needed, so the computation is fast. Alternatively, pixel values at the phases of Fig. 15C may be generated from the pixel values of Fig. 15A by 4x interpolation filtering.

[0075] For example, if the pixels of the first field are at positions 0, 1, and so on, the pixels produced by 2x interpolation are at positions 0.5 and so on, and the pixels produced by linear interpolation are at positions 0.25, 0.75, and so on. The same applies to the second field. In the figure, the positions of the first field and the second field are offset from each other by 0.25.

[0076] Fig. 16 concerns vertical interpolation in the motion compensation unit 9 for the frame motion compensation mode. First, according to the value of the motion vector in the input image compression information (bit stream), pixel values containing a phase shift between fields are read from the video memory 10, as in Fig. 16A. The symbols a1 on the left of the figure correspond to pixels of the first field, and the symbols a2 on the right to pixels of the second field. The pixels of the first field and those of the second field are out of phase.

[0077] Next, as in Fig. 16B, pixel values at 1/2-pixel precision are generated within each field using a 2x interpolation filter such as a half-band filter. The pixels generated by 2x interpolation within the first field and the second field are denoted by the symbols b1 and b2, respectively.

[0078] Then, as in Fig. 16C, pixel values at 1/4-pixel precision are generated by linear interpolation between the fields. The pixels generated by linear interpolation between pixels of the first field and the second field are denoted by the symbol c.

[0079] For example, if the first-field pixels are at positions 0, 2, and so on, and the second-field pixels are at positions 0.5, 2.5, and so on, then 2x interpolation generates first-field pixels at position 1 and so on, and second-field pixels at position 1.5 and so on. Linear interpolation then generates pixels at positions 0.25, 0.75, 1.25, 1.75, and so on.
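The position arithmetic in this example can be checked with a short sketch. The function names are hypothetical and only sample positions are modeled, not pixel values: each field is first doubled in density by adding midpoints, and linear interpolation between neighbouring samples of the two fields then gives the quarter-precision positions.

```python
def midpoints(positions):
    """Positions halfway between each pair of neighbouring samples."""
    return [(a + b) / 2 for a, b in zip(positions, positions[1:])]


def quarter_pel_positions(field1, field2):
    """Positions produced by 2x in-field interpolation followed by
    linear interpolation between the two fields."""
    dense1 = sorted(field1 + midpoints(field1))   # first field, 1/2 precision
    dense2 = sorted(field2 + midpoints(field2))   # second field, 1/2 precision
    merged = sorted(dense1 + dense2)              # the fields alternate here
    return midpoints(merged)                      # inter-field interpolation


first_field = [0.0, 2.0, 4.0]
second_field = [0.5, 2.5, 4.5]
print(quarter_pel_positions(first_field, second_field)[:4])
# [0.25, 0.75, 1.25, 1.75]
```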

[0080] This interpolation prevents field inversion and field mixing, both of which degrade image quality. In addition, with a half-band filter, when a pixel value having the same phase as a pixel read from the frame memory is output as the predicted image, no multiply-accumulate operation over the filter taps is needed, so the computation is fast.
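The speed advantage of the half-band filter can be illustrated with a minimal sketch. The 4-tap kernel (-1, 9, 9, -1)/16 is an assumption chosen for illustration, since this passage gives no coefficients, and the edge handling is a simple clamp. Output samples that share a phase with an input sample are copied. Only the new half-sample positions need a multiply-accumulate.

```python
# Hypothetical half-band taps for the half-sample positions (divided by 16).
HALF_TAPS = (-1, 9, 9, -1)


def interpolate_2x(samples):
    """Double the sample density of one line of integer pixel values."""
    n = len(samples)
    out = []
    for i in range(n):
        out.append(samples[i])  # same phase as the input: plain copy
        if i + 1 < n:
            # Neighbours x[i-1], x[i], x[i+1], x[i+2], clamped at the edges.
            window = [samples[min(max(i + k, 0), n - 1)] for k in (-1, 0, 1, 2)]
            acc = sum(t * s for t, s in zip(HALF_TAPS, window))
            out.append((acc + 8) >> 4)  # rounded division by 16
    return out


print(interpolate_2x([0, 16, 32, 48])[::2])  # [0, 16, 32, 48]
```

Every second output sample equals the corresponding input sample, so a motion vector that points at an integer position costs no filtering at all.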

[0081] In practice, for both the horizontal and vertical directions, coefficients are prepared in advance so that the two-stage interpolation realized by the 2x interpolation filter and the linear interpolation described above is carried out in a single step, and the processing runs as if it were a one-stage interpolation. Also, in both directions, only the pixel values that are actually needed are generated, according to the value of the motion vector in the input image compression information (bit stream). It is also possible to prepare filter coefficients in advance for each combination of horizontal and vertical motion vector values and to perform the horizontal and vertical interpolation at once.
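Folding the two stages into one set of coefficients can be sketched as follows for a single quarter-sample position in the horizontal direction. The half-band taps are the same hypothetical (-1, 9, 9, -1)/16 kernel, not values taken from the patent. The quarter-position sample is the average of the integer sample and the half-position sample, so its taps are the average of an identity filter and the half-band filter.

```python
from fractions import Fraction

# Hypothetical half-band taps applied to x[n-1], x[n], x[n+1], x[n+2].
HALF_TAPS = [Fraction(t, 16) for t in (-1, 9, 9, -1)]
OFFSETS = (-1, 0, 1, 2)


def combined_quarter_taps():
    """Single-stage taps for the sample at position n + 1/4."""
    identity = [Fraction(0), Fraction(1), Fraction(0), Fraction(0)]  # selects x[n]
    return [(a + b) / 2 for a, b in zip(identity, HALF_TAPS)]


def two_stage(x, n):
    """Half-band interpolation followed by linear interpolation."""
    half = sum(t * x[n + k] for t, k in zip(HALF_TAPS, OFFSETS))
    return (x[n] + half) / 2


def one_stage(x, n):
    """The same result from the precomputed combined taps."""
    return sum(t * x[n + k] for t, k in zip(combined_quarter_taps(), OFFSETS))


line = [3, 7, 20, 5, 11]
assert two_stage(line, 1) == one_stage(line, 1)
print(combined_quarter_taps())  # taps -1/32, 25/32, 9/32, -1/32
```

A table of such taps, indexed by the fractional part of the motion vector, would let a decoder apply one filter per output pixel.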

[0082] When the 2x interpolation filtering is performed, the motion vector value may make it necessary to refer to pixels outside the image frame in the video memory. In that case, either the pixels are reflected symmetrically about the end point for the required number of taps (hereinafter, mirror processing), or pixels with the same value as the end-point pixel are treated as existing outside the image frame for the required number of taps (hereinafter, hold processing).

[0083] FIG. 17A shows the mirror processing. The symbols p in the figure are pixels in the video memory 10, and the symbols q are virtual pixels outside the image frame that are needed for interpolation. These out-of-frame pixels are the in-frame pixels reflected symmetrically about the edge of the image frame.

[0084] FIG. 17B shows the hold processing. The pixels outside the image frame take the same value as the end-point pixel inside the image frame. In the vertical direction, mirror processing or hold processing is performed field by field in both the motion compensation unit (field prediction) 8 and the motion compensation unit (frame prediction) 9. Alternatively, pixel values that fall outside the image frame may be set to a fixed value (for example, 128) in both the horizontal and vertical directions.
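The three ways of handling out-of-frame references can be sketched as a single lookup function. The name `fetch` and its parameters are hypothetical. For vertical use, `line` would hold the samples of one field only, since the processing is done field by field.

```python
def fetch(line, index, mode="mirror", fixed=128):
    """Return the sample at `index`, synthesising it if outside the frame."""
    n = len(line)
    if 0 <= index < n:
        return line[index]
    if mode == "mirror":
        if n == 1:
            return line[0]
        # Reflect about the end point itself: -1 -> 1, n -> n - 2.
        period = 2 * (n - 1)
        r = index % period
        return line[r if r < n else period - r]
    if mode == "hold":
        # Repeat the end-point value.
        return line[0] if index < 0 else line[n - 1]
    if mode == "fixed":
        return fixed
    raise ValueError(f"unknown mode: {mode}")


row = [10, 20, 30, 40]
print([fetch(row, i, "mirror") for i in (-2, -1, 4, 5)])  # [30, 20, 30, 20]
print([fetch(row, i, "hold") for i in (-2, -1, 4, 5)])    # [10, 10, 40, 40]
```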

[0085] MPEG2 was aimed mainly at high-quality coding suitable for broadcasting and did not support coding at bit rates lower than those of MPEG1, that is, at higher compression ratios. With the spread of mobile terminals, demand for such coding is expected to grow, and the MPEG4 coding scheme was standardized in response. Its image coding specification was approved as the international standard ISO/IEC 14496-2 in December 1998.

[0086] There is also a need to convert MPEG2 image compression information (bit stream) that has been encoded once for digital broadcasting into MPEG4 image compression information (bit stream) with a lower bit rate, which is better suited to processing on mobile terminals and the like.

[0087] As an image information conversion apparatus (transcoder) that serves this purpose, the apparatus shown in FIG. 18 is proposed in "Field-to-Frame Transcoding with Spatial and Temporal Downsampling" (Susie J. Wee, John G. Apostolopoulos, and Nick Feamster, ICIP 99). This apparatus comprises a picture type discrimination unit 12, an MPEG2 image information decoding unit (I/P picture) 13, a thinning unit 14, an MPEG4 image information encoding unit (I/P-VOP) 15, a motion vector synthesis unit 16, and a motion vector detection unit 17.

[0088] Interlaced MPEG2 image compression information (bit stream) is input to the picture type discrimination unit 12. The MPEG2 image compression information (bit stream) consists of intra-coded pictures (I pictures), which are coded within a frame; forward predictive-coded pictures (P pictures), which are coded with reference to the forward direction in display order; and bidirectionally predictive-coded pictures (B pictures), which are coded with reference to both the forward and backward directions in display order.

[0089] The picture type discrimination unit 12 determines whether the data of each frame belongs to an I/P picture or to a B picture, and outputs only the former to the subsequent MPEG2 image information decoding unit (I/P picture) 13.
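The picture type discrimination step amounts to a filter on the picture type. A minimal sketch follows, assuming a hypothetical representation in which each coded picture is a dictionary with a `"type"` key. A real implementation would read the picture coding type from the MPEG2 picture header instead.

```python
def keep_i_and_p(pictures):
    """Pass I and P pictures through and drop B pictures."""
    return [p for p in pictures if p["type"] in ("I", "P")]


# A typical display-order pattern, used here only as sample input.
stream = [{"type": t, "index": i} for i, t in enumerate("IBBPBBP")]
kept = keep_i_and_p(stream)
print("".join(p["type"] for p in kept))  # IPP
```

Dropping the B pictures leaves the remaining pictures decodable, because I and P pictures never use a B picture as a reference.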

[0090] Processing in the MPEG2 image information decoding unit (I/P picture) 13 is the same as in an ordinary MPEG2 image information decoding apparatus. However, because data for B pictures is discarded in the picture type discrimination unit 12, the MPEG2 image information decoding unit (I/P picture) 13 only needs to be able to decode I/P pictures.

[0091] The pixel values output from the MPEG2 image information decoding unit (I/P picture) 13 are input to the thinning unit 14. There, 1/2 decimation is applied in the horizontal direction, and in the vertical direction only the data of either the first field or the second field is kept and the other is discarded. This generates a progressively scanned image one quarter the size of the input image.
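The thinning step can be sketched on a frame stored as a list of rows. The function name and the row-list representation are assumptions. The sketch uses plain sample dropping in the horizontal direction, with no low-pass filter applied beforehand.

```python
def thin(frame, keep_field=0):
    """Keep one field (every second row) and every second column."""
    field = frame[keep_field::2]          # rows 0, 2, ... or rows 1, 3, ...
    return [row[::2] for row in field]    # horizontal 1/2 decimation


frame = [[10 * r + c for c in range(4)] for r in range(4)]
print(thin(frame))      # [[0, 2], [20, 22]]
print(thin(frame, 1))   # [[10, 12], [30, 32]]
```

The result has half the width and half the height of the input, which is the one-quarter size described above.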

[0092] The progressively scanned image generated by the thinning unit 14 is encoded by the MPEG4 image information encoding unit (I/P-VOP) 15 and output as MPEG4 image compression information (bit stream). In doing so, the motion vector information in the input MPEG2 image compression information (bit stream) is mapped by the motion vector synthesis unit 16 to motion vectors for the decimated image, and the motion vector detection unit 17 detects high-precision motion vectors based on the motion vector values synthesized by the motion vector synthesis unit 16.

[0093] When the input MPEG2 image compression information (bit stream) conforms to the NTSC standard (720 × 480 pixels, interlaced scanning), the image information conversion apparatus shown in FIG. 18 outputs MPEG4 image compression information (bit stream) of SIF size (352 × 240 pixels, progressive scanning), an image frame of roughly 1/2 × 1/2. However, in a portable information terminal, which is one of the target applications of MPEG4, the resolution of the monitor may be too low to display an SIF-size image. In addition, at the bit rate determined by the capacity of the storage medium or the bandwidth of the transmission path, SIF size may not yield good image quality. In such cases the input MPEG2 image compression information (bit stream) must be converted to QSIF (176 × 112 pixels, progressive scanning), an image frame of roughly 1/4 × 1/4. Furthermore, the MPEG2 image information decoding unit (I/P picture) 13 also processes the information on the high-frequency components of the image, which is discarded in the subsequent stage, so the computation and memory capacity required for decoding are redundant.

[0094] To solve these problems, the present applicant previously proposed an image information conversion apparatus as shown in FIG. 19.

[0095] This image information conversion apparatus comprises a picture type discrimination unit 18, an MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19, a scan conversion unit 20, a thinning unit 21, an MPEG4 image information encoding unit (I/P-VOP) 22, a motion vector synthesis unit 23, and a motion vector detection unit 24.

[0096] The input interlaced MPEG2 image compression information (bit stream) is first input to the picture type discrimination unit 18. Information on I/P pictures is passed on to the MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19, while information on B pictures is discarded. The frame rate is converted in this way. The MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19 is equivalent to that shown in FIG. 18, but because information on B pictures has already been discarded in the picture type discrimination unit 18, it only needs to be able to decode I/P pictures. By decoding with only the low-frequency fourth-order information in both the horizontal and vertical directions, the video memory capacity required by the MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19 in FIG. 19 is 1/4 of that of the MPEG2 image information decoding unit (I/P picture) 13 in FIG. 18. The computation required for the inverse discrete cosine transform is likewise reduced to 1/4 in the field discrete cosine transform mode and to 1/2 in the frame discrete cosine transform mode. Furthermore, in the frame discrete cosine transform mode, replacing some of the 4 × 8 discrete cosine transform coefficients with 0, as shown in FIG. 20, reduces the computation with almost no degradation in image quality. The symbol a in the figure indicates the values that are replaced with 0.

[0097] The output of the MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19 is interlaced pixel data 1/2 × 1/2 the size of the input image compression information (bit stream). The scan conversion unit 20 first keeps only one of the first and second fields and discards the other, converting this data into progressive pixel data 1/2 × 1/4 the size of the input image compression information (bit stream), which it outputs. FIG. 21 shows the principle of operation. Of the first-field pixels a1 and the second-field pixels a2 in FIG. 21A, the second-field pixels a2 are discarded to obtain the pixels b shown in FIG. 21B.

[0098] Next, the progressive pixel data output by the scan conversion unit 20, which is 1/2 × 1/4 the size of the input image compression information (bit stream), is input to the thinning unit 21. There it is downsampled by 1/2 in the horizontal direction and converted into progressive pixel data 1/4 × 1/4 the size of the input image compression information (bit stream). The 1/2 downsampling may use simple decimation or a low-pass filter with several taps. FIG. 22 shows the principle of operation. Downsampling the pixels a in FIG. 22A by 1/2 in the horizontal direction gives the pixels b shown in FIG. 22B. The order of the processing in the scan conversion unit 20 and the thinning unit 21 may be reversed. The progressive pixel data output by the thinning unit 21, which is 1/4 × 1/4 the size of the input image compression information (bit stream), is encoded by the MPEG4 image information encoding unit (I/P-VOP) 22.

[0099] Because the MPEG4 image information encoding unit (I/P-VOP) 22 processes the image block by block, the number of pixels of the luminance component must be a multiple of 16 in both the horizontal and vertical directions. For the color-difference components, when the input image compression information (bit stream) is in 4:2:0 format, a multiple of 8 suffices in both directions. In 4:2:2 format, a multiple of 8 suffices horizontally, but the vertical size must be a multiple of 16. In 4:4:4 format, both must be multiples of 16.
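These size constraints follow from dividing the 16-pixel macroblock dimension by the chroma subsampling factor in each direction. A small sketch under that reading follows. The table and function names are hypothetical.

```python
# (horizontal, vertical) chroma subsampling factors for each format.
CHROMA_SUBSAMPLING = {"4:2:0": (2, 2), "4:2:2": (2, 1), "4:4:4": (1, 1)}


def required_multiples(chroma_format):
    """Required (horizontal, vertical) size multiples per component."""
    h_div, v_div = CHROMA_SUBSAMPLING[chroma_format]
    return {"luma": (16, 16), "chroma": (16 // h_div, 16 // v_div)}


for fmt in CHROMA_SUBSAMPLING:
    print(fmt, required_multiples(fmt)["chroma"])
# 4:2:0 (8, 8)
# 4:2:2 (8, 16)
# 4:4:4 (16, 16)
```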

[0100] The scan conversion unit 20 and the thinning unit 21 adjust the number of pixels in the vertical and horizontal directions, respectively, to meet this requirement. For example, when the luminance component of the input image compression information (bit stream) is 720 × 480 pixels, the image after the scan conversion unit has extracted only the first or second field is 360 × 120 pixels. Since 120 is not a multiple of 16, the pixel data of, for example, the bottom 8 lines is discarded to give 360 × 112 pixels. Processing this image in the thinning unit 21 gives 180 × 112 pixels, but 180 is not a multiple of 16, so, for example, the rightmost 4 columns are discarded to give 176 × 112 pixels.
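The size arithmetic for the 720 × 480 example can be traced step by step. The function names are hypothetical, and the sketch assumes that excess lines and columns are simply dropped from the bottom and right edges. Rounding 120 down to a multiple of 16 removes 8 lines, and rounding 180 down removes 4 columns.

```python
def floor_to_multiple(value, base=16):
    """Largest multiple of `base` that does not exceed `value`."""
    return value - value % base


def qsif_luma_size(width, height):
    """Luminance size after 4x4 down decoding, scan conversion and thinning."""
    w, h = width // 2, height // 2   # 4x4 down decoder: 1/2 x 1/2, interlaced
    h //= 2                          # scan conversion keeps one field
    h = floor_to_multiple(h)         # discard excess bottom lines
    w //= 2                          # horizontal 1/2 downsampling
    w = floor_to_multiple(w)         # discard excess right-hand columns
    return w, h


print(qsif_luma_size(720, 480))  # (176, 112)
```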

[0101] The motion vector information in the input MPEG2 image compression information (bit stream), detected in the MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19, is input to the motion vector synthesis unit 23, where it is mapped to motion vector values for the progressive image after scan conversion. The motion vector detection unit 24 performs high-precision motion detection based on the motion vector values for the scan-converted progressive image output by the motion vector synthesis unit 23.

【０１０２】[0102]

PROBLEMS TO BE SOLVED BY THE INVENTION In the MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19 of the image information conversion apparatus shown in FIG. 19, extracting the low-frequency fourth-order coefficients from the eighth-order discrete cosine transform coefficients and applying a fourth-order inverse discrete cosine transform to them is equivalent to applying a kind of low-pass filter bank. For the horizontal component, the apparatus shown in FIG. 19 therefore applies a low-pass filter first in the MPEG2 image information decoding unit (I/P picture 4×4 down decoder) 19 and then again in the thinning unit 21. Low-pass filtering is thus performed in two stages, which complicates the processing.

[0103] The present invention has been proposed in view of the above circumstances. Its object is to provide an image information conversion apparatus and method that simplify the processing for reducing the pixel blocks that make up the coded pictures of MPEG2 image compression information (bit stream).

【０１０４】[0104]

SUMMARY OF THE INVENTION To solve the above problem, the present invention converts the resolution of image compression information in which an image has been discrete cosine transformed in units of pixel blocks of 8 components in both the horizontal and vertical directions. Of the discrete cosine transform coefficients (8 components in each of the horizontal and vertical directions) of the pixel blocks that make up input image compression information encoding an interlaced image, only the 2 low-frequency components in the horizontal direction and the 4 low-frequency components in the vertical direction are used to decode an interlaced image. One of the first field and the second field of the decoded interlaced image is selected to generate a progressive image, and the generated image is encoded into output image compression information whose resolution is 1/4 of that of the input image compression information in both the horizontal and vertical directions.

[0105] In the present invention, the input image compression information is interlaced MPEG2 image compression information (bit stream), and the output image compression information is progressive MPEG4 image compression information (bit stream).

[0106] MPEG2 image compression information (bit stream) and MPEG4 image compression information (bit stream) are made up of groups of images, namely GOPs (groups of pictures) and GOVs (groups of VOPs), respectively. GOPs and GOVs are in turn made up of multiple coded images, namely pictures and VOPs (video object planes), respectively. Each coded image is made up of pixel blocks consisting of multiple pixels, namely macroblocks.

[0107] That is, to solve the above problem, the present invention provides means that comprises a picture type discrimination unit, an MPEG2 image information decoding unit (I/P picture 2×4 down decoder), a scan conversion unit, an MPEG4 image information encoding unit (I/P-VOP), a motion vector synthesis unit, and a motion vector detection unit, and that outputs progressive MPEG4 image compression information (bit stream) 1/4 × 1/4 the size of the input image compression information (bit stream).

[0108] In this configuration, the picture type discrimination unit keeps only the I/P picture data in the input MPEG2 image compression information (bit stream) and discards the B picture data. The MPEG2 image information decoding unit (I/P picture 2×4 down decoder) partially decodes the I/P picture information output by the picture type discrimination unit, using only the low-frequency second-order information of the eighth-order discrete cosine transform coefficients in the horizontal direction and only the low-frequency fourth-order information of the eighth-order coefficients in the vertical direction. The scan conversion unit keeps only the first-field or second-field data among the pixel values output by the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) and discards the rest, converting the image into a progressive image 1/4 × 1/4 the size of the input image. The MPEG4 image information encoding unit (I/P-VOP) MPEG4-encodes the progressive image data output by the scan conversion unit and outputs it as image compression information (bit stream). The motion vector synthesis unit maps the motion vector values in the input image compression information (bit stream), detected in the MPEG2 image information decoding unit (I/P picture 2×4 down decoder), to motion vector values for the scan-converted image data. The motion vector detection unit performs high-precision motion vector detection based on the motion vector values output by the motion vector synthesis unit.

【０１０９】[0109]

DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention are described below with reference to the drawings.

[0110] An image information conversion apparatus to which the present invention is applied is described with reference to FIG. 1.

[0111] This image information conversion apparatus comprises a picture type discrimination unit 25, an MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26, a scan conversion unit 27, an MPEG4 image information encoding unit (I/P-VOP) 28, a motion vector synthesis unit 29, and a motion vector detection unit 30.

[0112] Interlaced MPEG2 image compression information (bit stream) is input to the picture type discrimination unit 25. This MPEG2 image compression information (bit stream) consists of intra-coded pictures (I pictures), which are coded within a frame; forward predictive-coded pictures (P pictures), which are predictively coded with reference to another picture in the forward direction; and bidirectionally predictive-coded pictures (B pictures), which are predictively coded with reference to other pictures in both the forward and backward directions.

[0113] The picture type discrimination unit 25 keeps only the I pictures and P pictures in the input MPEG2 image compression information (bit stream) and discards the B pictures.

[0114] The MPEG2 image information decoding unit 26 partially decodes the macroblocks that make up the pictures of the MPEG2 image compression information (bit stream). Of their discrete cosine transform (DCT) coefficients, which have 8 components in both the horizontal and vertical directions (hereinafter written 8×8), it uses only the 2 low-frequency components in the horizontal direction and the 4 low-frequency components in the vertical direction (hereinafter written 2×4; the same notation applies elsewhere).
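Partial decoding from the low-frequency corner of a coefficient block can be sketched as follows. This is a direct, unoptimized illustration under stated assumptions, not the decoder's actual implementation: the DCT is orthonormal, coefficients are stored as `coeffs[vertical][horizontal]`, and the `gain` factor that preserves the mean level is a choice made for this sketch. Frame and field DCT modes and any fast algorithm are not modeled.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II basis: row k is frequency k, column i is sample i."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def dct2(block):
    """Forward 2-D DCT of a square block (used only to build sample input)."""
    n = len(block)
    c = dct_matrix(n)
    tmp = [[sum(c[k][i] * block[i][j] for i in range(n)) for j in range(n)]
           for k in range(n)]
    return [[sum(tmp[k][j] * c[l][j] for j in range(n)) for l in range(n)]
            for k in range(n)]


def reduced_idct(coeffs, rows=4, cols=2):
    """Inverse DCT of the low-frequency rows x cols corner of the block."""
    n = len(coeffs)
    cr, cc = dct_matrix(rows), dct_matrix(cols)
    gain = math.sqrt(rows / n) * math.sqrt(cols / n)  # keeps the mean level
    low = [[coeffs[v][u] * gain for u in range(cols)] for v in range(rows)]
    return [[sum(cr[v][y] * low[v][u] * cc[u][x]
                 for v in range(rows) for u in range(cols))
             for x in range(cols)] for y in range(rows)]


flat = [[100.0] * 8 for _ in range(8)]
small = reduced_idct(dct2(flat))
print(len(small), len(small[0]))  # 4 2
```

With the defaults, an 8×8 block becomes 4 rows by 2 columns, which is 1/2 vertically and 1/4 horizontally. Discarding one field then halves the height again.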

[0115] That is, the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 receives from the picture type discrimination unit 25 MPEG2 image compression information (bit stream) consisting of I pictures or P pictures (hereinafter, I/P pictures), and decodes interlaced images from the I/P pictures.

[0116] Of the interlaced image output by the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26, the scan conversion unit 27 keeps either the first field or the second field and discards the other. The scan conversion unit 27 generates a progressive image from the remaining field, which is 1/4 × 1/4 the size of the interlaced input image that makes up the MPEG2 input image compression information (bit stream).

[0117] The MPEG4 image information encoding unit (I/P-VOP) 28 MPEG4-encodes the image output from the scan conversion unit 27, which is 1/4 × 1/4 the size of the input image, and outputs it as MPEG4 image compression information (bit stream).

[0118] This MPEG4 image compression information (bit stream) is made up of video objects (VOs). The video object planes (VOPs), which are the pictures that make up a VO, comprise I-VOPs, which are intra-coded VOPs; P-VOPs, which are forward predictive-coded VOPs; bidirectionally predictive-coded VOPs; and sprite-coded VOPs.

[0119] The MPEG4 image information encoding unit (I/P-VOP) 28 MPEG4-encodes the image output from the scan conversion unit 27 into I-VOPs and/or P-VOPs (I/P-VOPs) and outputs the result as MPEG4 image compression information (bit stream).

[0120] The motion vector synthesis unit 29 maps the motion vector values in the MPEG2 image compression information (bit stream), detected in the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26, to motion vector values for the scan-converted image data.

[0121] The motion vector detection unit 30 performs high-precision motion vector detection based on the motion vector values output by the motion vector synthesis unit 29.

[0122] Next, the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 of the image information conversion apparatus is described with reference to FIG. 2.

[0123] This MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 comprises a code buffer 31, a compression information analysis unit 32, a variable-length decoding unit 33, an inverse quantization unit 34, a reduced inverse discrete cosine transform unit (2×4) 35, a reduced inverse discrete cosine transform unit (field separation) 36, an adder 37, a motion compensation unit (field prediction) 38, a motion compensation unit (frame prediction) 39, and a video memory 40.

[0124] In the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26, the code buffer 31 temporarily holds the input image compression information, the compression information analysis unit 32 analyzes it, the variable-length decoding unit 33 variable-length decodes it, and the inverse quantization unit 34 inverse-quantizes the output of the variable-length decoding unit 33.

[0125] Also in the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26, the reduced inverse discrete cosine transform unit (2×4) 35 applies an inverse discrete cosine transform only to the low-frequency 2×4 components of the 8×8 components output from the inverse quantization unit 34, and the reduced inverse discrete cosine transform unit (field separation) 36 separates the first field and the second field that make up the interlaced-scan image.

[0126] Further, in the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26, the motion compensation unit (field prediction) 38 performs motion compensation by field-based motion prediction on the image supplied from the video memory 40, and the motion compensation unit (frame prediction) 39 performs motion compensation by frame-based motion prediction on the image supplied from the video memory 40. The adder 37 adds these outputs to the outputs of the reduced inverse discrete cosine transform unit (2×4) 35 and the reduced inverse discrete cosine transform unit (field separation) 36, and the video memory 40 stores and outputs the output of the adder 37.

[0127] Next, the operating principle in the horizontal direction of the reduced inverse discrete cosine transform unit (2×4) 35 and the reduced inverse discrete cosine transform unit (field separation) 36 of the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 is described with reference to FIG. 3.

[0128] In the horizontal direction, the reduced inverse discrete cosine transform unit (2×4) 35 and the reduced inverse discrete cosine transform unit (field separation) 36 extract only the two low-frequency coefficients a from the eight-point discrete cosine transform coefficients in the input MPEG2 image compression information (bit stream), and apply a two-point inverse discrete cosine transform to them. This processing is shown in FIG. 3. As a result, the video memory needs a capacity of only 1/4 in the horizontal direction and 1/2 in the vertical direction relative to the picture frame of the input MPEG2 image compression information (bit stream).
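The horizontal reduction described in paragraph [0128] can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names `dct_1d` and `reduced_idct` are hypothetical, the transforms are orthonormal DCT-II/DCT-III, and the `sqrt(keep / n)` rescaling that keeps the DC level unchanged is an assumption that the patent text does not spell out.

```python
import math

def dct_1d(x):
    """Orthonormal N-point DCT-II (stands in for the encoder-side 8-point DCT)."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n)) for i in range(n))
        c = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
        out.append(c * s)
    return out

def reduced_idct(coeffs, keep=2):
    """Keep only the `keep` lowest-frequency coefficients and apply a keep-point inverse DCT.

    The sqrt(keep / n) rescaling is an assumed normalization so that a flat
    block keeps its brightness after the reduction.
    """
    n = len(coeffs)
    scale = math.sqrt(keep / n)
    low = [c * scale for c in coeffs[:keep]]
    out = []
    for i in range(keep):
        s = 0.0
        for k in range(keep):
            c = math.sqrt(1 / keep) if k == 0 else math.sqrt(2 / keep)
            s += c * low[k] * math.cos(math.pi * (2 * i + 1) * k / (2 * keep))
        out.append(s)
    return out

flat = reduced_idct(dct_1d([100] * 8))          # eight flat pixels become two
ramp = reduced_idct(dct_1d(list(range(8))))     # left-to-right ramp stays a ramp
```

Each row of eight input pixels yields two output pixels, which is where the 1/4 horizontal memory requirement comes from.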

[0129] Next, the operating principle in the horizontal direction of the motion compensation unit (field prediction) 38 and the motion compensation unit (frame prediction) 39 of the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 is described with reference to FIG. 4.

[0130] In the motion compensation unit (field prediction) 38 and the motion compensation unit (frame prediction) 39, first, as shown in FIG. 4A, pixel values a are read from the video memory 40 based on the motion vector information for each macroblock in the input MPEG2 image compression information (bit stream). Next, as shown in FIG. 4B, pixel values b with 1/4-pixel accuracy are generated from the pixel values a read from the video memory 40, using a 4× interpolation filter. If a pixel value required for filtering does not exist at any address in the video memory, a virtual pixel value is generated by mirror processing or hold processing. Alternatively, processing may proceed on the assumption that pixels with a fixed value (for example, 128) virtually exist outside the picture frame. Finally, as shown in FIG. 4C, pixel values c with 1/8-pixel accuracy are generated by linear interpolation.
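The three steps of paragraph [0130] (fetch, 4× interpolation with virtual pixels at the frame edge, then linear interpolation to 1/8-pixel accuracy) might look like the Python sketch below. Several points are assumptions rather than anything the patent specifies: the names `fetch`, `quarter_pel`, and `eighth_pel` are hypothetical, a plain two-tap linear filter stands in for the 4× interpolation filter (whose taps are not given in this section), and mirror processing is taken to reflect about the edge sample.

```python
def fetch(row, i, mode="mirror", fill=128):
    """Return row[i], creating a virtual pixel when i lies outside the picture frame."""
    n = len(row)
    if 0 <= i < n:
        return row[i]
    if mode == "hold":                      # repeat the nearest edge pixel
        return row[0] if i < 0 else row[-1]
    if mode == "mirror":                    # assumed: reflect about the edge sample
        if n == 1:
            return row[0]
        while i < 0 or i >= n:
            i = -i if i < 0 else 2 * (n - 1) - i
        return row[i]
    return fill                             # fixed value outside the frame

def quarter_pel(row, mode="mirror"):
    """1/4-pixel samples; a linear filter is a placeholder for the 4x interpolation filter."""
    out = []
    for i in range(len(row)):
        a, b = fetch(row, i, mode), fetch(row, i + 1, mode)
        out.extend(((4 - p) * a + p * b) / 4 for p in range(4))
    return out

def eighth_pel(row, pos8, mode="mirror"):
    """Pixel value at position pos8 (in 1/8-pixel units) by linear interpolation."""
    q = quarter_pel(row, mode)
    idx = pos8 // 2
    if pos8 % 2 == 0:
        return q[idx]
    return (q[idx] + q[min(idx + 1, len(q) - 1)]) / 2

row = [0, 8]
value = eighth_pel(row, 1)   # one eighth of the way from pixel 0 to pixel 1
```

A production decoder would replace the placeholder filter with the actual interpolation taps and would apply the same edge handling in the vertical direction.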

[0131] In actual processing, coefficients equivalent to this series of operations are prepared in advance, and pixel values at the phase corresponding to the horizontal and vertical motion vector components of each macroblock in the input MPEG2 image compression information (bit stream) are generated directly. This enables high-speed execution.
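One way to read paragraph [0131] is that the two interpolation stages are folded into a single table of filter taps, one entry per 1/8-pixel phase, so that each predicted pixel costs one weighted sum. The Python sketch below shows that folding. It is illustrative only: `quarter_pel_taps`, `eighth_pel_table`, and `predict` are hypothetical names, and the first-stage taps are a linear placeholder, not the patent's filter.

```python
def quarter_pel_taps():
    """Placeholder first stage: taps[p] maps pixel offset -> weight for phase p/4."""
    taps = []
    for p in range(4):
        weights = {0: (4 - p) / 4, 1: p / 4}
        taps.append({off: w for off, w in weights.items() if w != 0})
    return taps

def shift(taps, d):
    """Move a tap set d pixels to the right."""
    return {off + d: w for off, w in taps.items()}

def average(t1, t2):
    """Tap set equivalent to linearly interpolating halfway between two phases."""
    out = {}
    for t in (t1, t2):
        for off, w in t.items():
            out[off] = out.get(off, 0.0) + w / 2
    return out

def eighth_pel_table(quarter):
    """Fold 4x interpolation plus linear interpolation into eight tap sets."""
    ext = quarter + [shift(quarter[0], 1)]   # phase 4/4 is the next integer pixel
    table = []
    for p in range(4):
        table.append(ext[p])                          # phases 0/8, 2/8, 4/8, 6/8
        table.append(average(ext[p], ext[p + 1]))     # phases 1/8, 3/8, 5/8, 7/8
    return table

def predict(row, x, phase, table):
    """Directly generate the pixel at integer position x plus phase/8."""
    return sum(w * row[x + off] for off, w in table[phase].items())

table = eighth_pel_table(quarter_pel_taps())
```

The table is built once, and the fractional part of each motion vector component then simply selects a row of it.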

[0132] Next, processing in the variable-length decoding unit 33 of the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 is described with reference to FIG. 5. The variable-length decoding unit 33 may decode only the coefficients required by the subsequent reduced inverse discrete cosine transform unit (2×4) 35 and reduced inverse discrete cosine transform unit (field separation) 36, and then perform no further processing until EOB is detected.

[0133] In field discrete cosine transform mode, the coefficients to be decoded are the 2×4 discrete cosine transform coefficients enclosed by the broken line in FIG. 5. FIG. 5A shows the zigzag scan, and FIG. 5B shows the alternate scan. The numbers in the figure indicate the order in which the discrete cosine transform coefficients are scanned. This reduces the processing load of the variable-length decoding unit 33.

[0134] Similarly, in frame discrete cosine transform mode, the coefficients to be decoded are the 2×4 discrete cosine transform coefficients enclosed by the broken line in FIG. 6. FIG. 6A shows the zigzag scan, and FIG. 6B shows the alternate scan. The numbers in the figure have the same meaning as in FIG. 5.
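The saving described in paragraphs [0132] to [0134] comes from the fact that the needed low-frequency coefficients all appear early in the scan. The Python sketch below illustrates this for the zigzag scan only; the alternate scan would need its own order table, which is not reproduced here. The function names are hypothetical, and the input is assumed to be an already entropy-decoded list in scan order, whereas a real decoder would still parse the remaining codes to locate EOB without storing them.

```python
def zigzag_order(n=8):
    """(row, col) positions of an n x n block in zigzag scan order."""
    order = []
    for s in range(2 * n - 1):
        rows = range(max(0, s - n + 1), min(s, n - 1) + 1)
        if s % 2 == 0:
            rows = reversed(rows)           # even diagonals run bottom-left to top-right
        order.extend((r, s - r) for r in rows)
    return order

def last_needed_index(order, rows=4, cols=2):
    """Scan position of the last coefficient inside the low-frequency region.

    rows=4, cols=2 corresponds to 2 horizontal x 4 vertical components.
    """
    return max(i for i, (r, c) in enumerate(order) if r < rows and c < cols)

def decode_low_band(scanned, order, rows=4, cols=2):
    """Store only the needed coefficients and ignore everything after the last one."""
    stop = last_needed_index(order, rows, cols)
    block = [[0] * cols for _ in range(rows)]
    for i, value in enumerate(scanned[: stop + 1]):
        r, c = order[i]
        if r < rows and c < cols:
            block[r][c] = value
    return block

order = zigzag_order()
stop = last_needed_index(order)
block = decode_low_band(list(range(64)), order)
```

With the zigzag scan, the last needed coefficient sits at scan position 11 (counting from 0), so the remaining positions of the block never have to be stored or inverse-quantized.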

[0135] Also, as shown in FIG. 7, replacing some of the 2×8 discrete cosine transform coefficients (the coefficients a) with 0 reduces the processing in the reduced inverse discrete cosine transform unit (field separation) 36 while keeping image quality degradation to a minimum.

[0136] By the operating principles described above, the MPEG2 image information decoding unit (I/P picture 2×4 down decoder) 26 outputs interlaced-scan image information whose picture frame is 1/4 × 1/2 that of the input interlaced-scan MPEG2 image compression information (bit stream). The scan conversion unit 27 extracts only the first field or the second field of this interlaced-scan image information and discards the rest, thereby outputting progressively scanned image information whose picture frame is 1/4 × 1/4 that of the input interlaced-scan MPEG2 image compression information (bit stream). This eliminates the need for a decimation unit that thins out the image information.
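The scan conversion step of paragraph [0136] amounts to keeping every other line of the decoded picture. The Python sketch below shows that, along with the resulting frame sizes. The names `select_field` and `frame_sizes` are hypothetical, and the 720×480 input size is only an assumed example.

```python
def select_field(frame, field=0):
    """Keep only the lines of one field (0 = first field, 1 = second field)."""
    return [row[:] for row in frame[field::2]]

def frame_sizes(width, height):
    """Sizes after the 2x4 down decoder and after scan conversion."""
    decoded = (width // 4, height // 2)            # 1/4 horizontal, 1/2 vertical
    progressive = (decoded[0], decoded[1] // 2)    # one field kept: 1/4 x 1/4
    return decoded, progressive

frame = [[line] * 3 for line in range(6)]          # six lines, labelled by line number
first = select_field(frame, 0)
second = select_field(frame, 1)
sizes = frame_sizes(720, 480)
```

Because one field is dropped rather than filtered, no separate decimation stage is required to reach the 1/4 × 1/4 output size.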

[0137] The description above has dealt with MPEG2 image compression information (bit stream) as the input and MPEG4 image compression information (bit stream) as the output. However, neither the input nor the output is limited to these; image compression information (bit streams) such as MPEG-1 or H.263 may also be used.

[0138]

[Effects of the Invention] As described above, the present invention provides means that takes interlaced-scan MPEG2 image compression information (bit stream) as input and, with a circuit configuration requiring less computation and less video memory capacity, converts it into progressively scanned MPEG4 image compression information (bit stream) having 1/4 × 1/4 the resolution of the input image compression information (bit stream), and outputs the result.

[0139]

[Brief description of the drawings]

FIG. 1 is a block diagram showing the configuration of the image information apparatus according to the present embodiment.

FIG. 2 is a block diagram showing the configuration of the MPEG2 image information decoding unit (I/P picture 2×4 down decoder).

FIG. 3 is a diagram showing horizontal processing in the reduced inverse discrete cosine transform unit (2×4) and the reduced inverse discrete cosine transform unit (field separation).

FIG. 4 is a diagram showing horizontal processing in the motion compensation unit (field prediction) and the motion compensation unit (frame prediction).

FIG. 5 is a diagram showing the operating principle of the variable-length decoding unit when the input MPEG2 image compression information (bit stream) uses the zigzag scan.

FIG. 6 is a diagram showing the operating principle of the variable-length decoding unit when the input MPEG2 image compression information (bit stream) uses the alternate scan.

FIG. 7 is a diagram showing an example of a technique for reducing the processing amount when the macroblock of the input image compression information (bit stream) is in frame discrete cosine transform mode.

FIG. 8 is a block diagram showing the configuration of an image information decoding apparatus (4×4 down decoder) according to the present embodiment that performs decoding using only the four low-frequency coefficients of the eight-point discrete cosine transform coefficients in both the horizontal and vertical directions.

FIG. 9 is a diagram showing the operating principle of the variable-length decoding unit when the input MPEG2 image compression information (bit stream) uses the zigzag scan.

FIG. 10 is a diagram showing the operating principle of the variable-length decoding unit when the input MPEG2 image compression information (bit stream) uses the alternate scan.

FIG. 11 is a diagram showing the phase of pixels in the video memory.

FIG. 12 is a diagram showing the operating principle of the reduced inverse discrete cosine transform unit (field separation).

FIG. 13 is a diagram showing a technique for implementing the operation of the reduced inverse discrete cosine transform unit (4×4) with a fast algorithm.

FIG. 14 is a diagram showing a technique for implementing the processing of the reduced inverse discrete cosine transform unit (field separation) with a fast algorithm.

FIG. 15 is a diagram showing the operating principle of the motion compensation unit (field prediction).

FIG. 16 is a diagram showing the operating principle of the motion compensation unit (frame prediction).

FIG. 17 is a diagram showing the hold processing and mirror processing techniques used in the motion compensation unit (field prediction) and the motion compensation unit (frame prediction).

FIG. 18 is a diagram showing the configuration of a prior-art image information conversion unit (transcoder) that takes MPEG2 image compression information (bit stream) as input and outputs MPEG4 image compression information (bit stream).

FIG. 19 is a block diagram showing the configuration of an image information conversion apparatus previously proposed by the present applicant.

FIG. 20 is a diagram showing an example of a technique for reducing the processing amount when the macroblock of the input image compression information (bit stream) is in frame discrete cosine transform mode.

FIG. 21 is a diagram explaining the operating principle of the scan conversion unit.

FIG. 22 is a diagram explaining the operating principle of the decimation unit.

[Explanation of symbols]

25: picture type discrimination unit; 26: MPEG2 image information decoding unit (I/P picture 2×4 down decoder); 27: scan conversion unit; 28: MPEG4 image information encoding unit (I/P-VOP); 29: motion vector synthesis unit; 30: motion vector detection unit

Continued from the front page

(72) Inventor: Teruhiko Suzuki, c/o Sony Corporation, 6-7-35 Kita-Shinagawa, Shinagawa-ku, Tokyo
(72) Inventor: Yoichi Yagasaki, c/o Sony Corporation, 6-7-35 Kita-Shinagawa, Shinagawa-ku, Tokyo

F-terms (reference):
5C053 FA20 FA23 FA30 GA11 GB07 GB38 GB40 KA04 KA11 KA21 KA24 LA06 LA14
5C059 KK10 KK15 KK33 LA05 LB16 LB18 MA00 MA04 MA05 MA14 MA23 MB14 MC38 ME01 NN01 SS02 SS08 SS12 TA08 TA18 TA50 TA57 TB08 TC12 TD20 UA02 UA05
5J064 AA02 BA09 BB03 BC01 BC08 BC16 BD03

Claims

[Claims]

1. An image information conversion apparatus for converting the resolution of image compression information obtained by applying a discrete cosine transform to pixel blocks of eight components in both the horizontal and vertical directions, the apparatus comprising: decoding means for decoding an interlaced-scan image using only, among the discrete cosine transform coefficients of eight components in both the horizontal and vertical directions of the pixel blocks constituting input image compression information in which an interlaced-scan image is encoded, the two low-frequency components in the horizontal direction and the four low-frequency components in the vertical direction; scan conversion means for selecting one of the first field and the second field constituting the interlaced-scan image decoded by the decoding means and generating a progressively scanned image; and encoding means for encoding the image converted by the scan conversion means into output image compression information having a resolution of 1/4 in both the horizontal and vertical directions relative to the input image compression information.

2. The image information conversion apparatus according to claim 1, wherein the input image compression information conforms to the MPEG2 standard, and the output image compression information conforms to the MPEG4 standard.

3. The image information conversion apparatus according to claim 1, wherein the input image compression information comprises intra-coded images coded within a frame, forward predictive-coded images predictively coded with reference to another image in the forward direction between frames, and bidirectionally predictive-coded images predictively coded with reference to other images in the forward and backward directions between frames; the apparatus further comprises discriminating means for determining the type of each image constituting the input image compression information, passing the intra-coded images and the forward predictive-coded images, and discarding the bidirectionally predictive-coded images; and the decoding means receives the image compression information that has passed through the discriminating means.

4. The image information conversion apparatus according to claim 3, wherein said decoding means decodes only the intra-coded image and the forward prediction coded image.

5. The image information conversion apparatus according to claim 1, wherein the input image compression information has been variable-length coded; the decoding means comprises variable-length decoding means for variable-length decoding the image compression information, and inverse discrete cosine transform means for applying an inverse discrete cosine transform to the image compression information variable-length decoded by the variable-length decoding means; and the variable-length decoding means variable-length decodes only the discrete cosine transform coefficients required for the inverse discrete cosine transform in the inverse discrete cosine transform means, depending on whether the pixel blocks constituting the input image compression information are in field mode or frame mode.

6. The image information conversion apparatus according to claim 5, wherein, for the field mode, the inverse discrete cosine transform means applies an inverse discrete cosine transform to the discrete cosine transform coefficients of the two low-frequency components in the horizontal direction and the four low-frequency components in the vertical direction, among the discrete cosine transform coefficients of eight components in both the horizontal and vertical directions constituting the pixel block.

7. The image information conversion apparatus according to claim 5, wherein the inverse discrete cosine transform means executes its operation using a predetermined fast algorithm.

8. The image information conversion apparatus according to claim 5, wherein, for the frame mode, the inverse discrete cosine transform means applies an inverse discrete cosine transform to the discrete cosine transform coefficients of the two low-frequency components in the horizontal direction, among the discrete cosine transform coefficients of eight components in both the horizontal and vertical directions constituting the pixel block, and applies a field-separation inverse discrete cosine transform in the vertical direction.

9. The image information conversion apparatus according to claim 8, wherein said inverse discrete cosine transform means executes an operation using a predetermined high-speed algorithm.

10. The image information conversion apparatus according to claim 8, wherein, among the discrete cosine transform coefficients of two low-frequency components in the horizontal direction and eight components in the vertical direction, the inverse discrete cosine transform means uses only the discrete cosine transform coefficients of two low-frequency components in the horizontal direction and four low-frequency components in the vertical direction, together with the discrete cosine transform coefficients of two low-frequency components in both the horizontal and vertical directions, sets the remaining components to 0, and performs the inverse discrete cosine transform.

11. The image information conversion apparatus according to claim 1, wherein the input image compression information has been motion-compensated using motion vector information; the decoding means comprises motion compensation means for motion-compensating an image using motion vector information; and the motion compensation means performs interpolation processing with 1/8-pixel precision in the horizontal direction and 1/4-pixel precision in the vertical direction, based on the motion vector information of the input image compression information.

12. The image information conversion apparatus according to claim 11, wherein, as the interpolation processing in the horizontal direction, the motion compensation means performs interpolation using a 2× interpolation digital filter and further performs interpolation to 1/8-pixel precision by linear interpolation.

13. The image information conversion apparatus according to claim 11, wherein, as the interpolation processing in the vertical direction for pixel blocks in field mode, the motion compensation means performs interpolation to 1/2-pixel precision using a 2× interpolation digital filter and further performs interpolation to 1/4-pixel precision within the field by linear interpolation.

14. The image information conversion apparatus according to claim 11, wherein, as the interpolation processing in the vertical direction for pixel blocks in frame mode, the motion compensation means performs interpolation to 1/2-pixel precision using a 2× interpolation digital filter and further performs interpolation to 1/4-pixel precision between fields by linear interpolation.

15. The image information conversion device according to claim 11, wherein said digital filter is a half-band filter.

16. The image information conversion apparatus according to claim 15, wherein the digital filter calculates in advance coefficients equivalent to the series of interpolation processes, and applies them directly to pixel values according to the value of the motion vector information of the pixel blocks constituting the input image compression information.

17. The image information conversion apparatus according to claim 11, wherein, when the 2× interpolation filtering requires pixels that lie outside the picture frame of the images constituting the input image compression information, the motion compensation means virtually creates the necessary pixels outside the picture frame and then performs the filtering.

18. The image information conversion apparatus according to claim 17, wherein the motion compensation means creates the required pixels outside the picture frame by folding back the existing pixel array at a predetermined position, by extending the existing pixel array, or by using a predetermined value.

19. The image information conversion apparatus according to claim 1, wherein the scan conversion means selects one of the first field and the second field of the interlaced-scan image decoded by the decoding means, thereby converting an interlaced-scan image having a resolution of 1/4 in the horizontal direction and 1/2 in the vertical direction relative to the input image compression information into a progressively scanned image having a resolution of 1/4 in both the horizontal and vertical directions relative to the input image compression information.

20. The image information conversion apparatus according to claim 19, wherein said scan conversion means adjusts the number of pixels in the vertical direction so as to correspond to processing corresponding to a pixel block in said encoding means.

21. The image information conversion apparatus according to claim 1, wherein the output image compression information comprises intra-coded images coded within a frame, forward predictive-coded images predictively coded with reference to another image in the forward direction between frames, bidirectionally predictive-coded images predictively coded with reference to other images in the forward and backward directions between frames, and sprite-coded images; and the encoding means encodes images using the intra-coded images and the forward predictive-coded images.

22. The image information conversion apparatus according to claim 1, wherein the input image compression information has been motion-compensated using motion vector information; the apparatus further comprises motion vector synthesizing means for synthesizing, based on the motion vector information of the input image compression information, motion vector information corresponding to the image output from the decimation means; and the encoding means performs encoding based on the synthesized motion vector information.

23. The image information conversion apparatus according to claim 22, further comprising a motion vector detecting means for detecting motion vector information based on the motion vector information synthesized by said motion vector synthesizing means.

24. An image information conversion method for converting the resolution of image compression information obtained by applying a discrete cosine transform to pixel blocks of eight components in both the horizontal and vertical directions, the method comprising: decoding an interlaced-scan image using only, among the discrete cosine transform coefficients of eight components in both the horizontal and vertical directions of the pixel blocks constituting input image compression information in which an interlaced-scan image is encoded, the two low-frequency components in the horizontal direction and the four low-frequency components in the vertical direction; selecting one of the first field and the second field constituting the decoded interlaced-scan image to generate a progressively scanned image; and encoding the generated image into output image compression information having a resolution of 1/4 in both the horizontal and vertical directions relative to the input image compression information.

25. The image information conversion method according to claim 24, wherein the input image compression information conforms to the MPEG2 standard, and the output image compression information conforms to the MPEG4 standard.

26. The image information conversion method according to claim 24, wherein the input image compression information comprises intra-coded images coded within a frame, forward predictive-coded images predictively coded with reference to another image in the forward direction between frames, and bidirectionally predictive-coded images predictively coded with reference to other images in the forward and backward directions between frames; and the type of each image constituting the input image compression information is determined, the intra-coded images and the forward predictive-coded images are passed, and the bidirectionally predictive-coded images are discarded.

27. The image information conversion method according to claim 26, wherein only the intra-coded image and the forward prediction coded image are decoded.