JP2003319394A

JP2003319394A - Encoding apparatus and method, decoding apparatus and method, recording medium, and program

Info

Publication number: JP2003319394A
Application number: JP2002125296A
Authority: JP
Inventors: Kazufumi Sato; 数史佐藤; Teruhiko Suzuki; 輝彦鈴木; Kuhn Peter; クーンピーター; Yoichi Yagasaki; 陽一矢ヶ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-04-26
Filing date: 2002-04-26
Publication date: 2003-11-07

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image encoding apparatus for receiving image information with an interlaced scanning format, applying field/frame adapted encoding processing to the information at a macro block level and employing the ABT (Adaptive Block Transform) for orthogonal transformation. <P>SOLUTION: A field/frame discrimination section 53 discriminates a field- based encoding of a micro block or a frame-based encoding of the macro block which has a higher encoding efficiency. An ABT block size decision section 55 decides a block size providing the highest encoding efficiency. A field/frame conversion section 56 converts a received image into an image with a field structure before the orthogonal transform processing when the macro block is subjected to field-based encoding. An ABT conversion section 57 applies orthogonal transform processing to the received image according to the decided block size. An inverse encoding section 59 applies inverse encoding to a Frame/ Field Flag or the like. This invention is applicable to an image information encoding apparatus and an image information decoding apparatus. <P>COPYRIGHT: (C)2004,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、符号化装置および
方法、復号装置および方法、記録媒体、並びにプログラ
ムに関し、例えば、画像信号を従来よりも高い圧縮率で
符号化し、伝送または蓄積する場合に用いて好適な符号
化装置および方法、復号装置および方法、記録媒体、並
びにプログラムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an encoding device and method, a decoding device and method, a recording medium, and a program, for example, when an image signal is encoded at a higher compression rate than before and is transmitted or stored. The present invention relates to an encoding device and method suitable for use, a decoding device and method, a recording medium, and a program.

【０００２】[0002]

【背景技術】近年、画像をディジタル信号として取り扱
い、当該ディジタル信号を効率よく伝送、蓄積すること
を目的として、画像情報特有の冗長性を利用して、離散
コサイン変換等の直交変換と動き補償により圧縮するMP
EG(Moving Picture Expert Group）等の方式に準拠した
装置が、放送局などの情報配信、および一般家庭におけ
る情報受信の双方において普及しつつある。BACKGROUND ART Recently, for the purpose of handling an image as a digital signal and efficiently transmitting and accumulating the digital signal, orthogonality transform such as discrete cosine transform and motion compensation are performed by utilizing redundancy peculiar to image information. MP to compress
A device conforming to a method such as EG (Moving Picture Expert Group) is becoming popular both for information distribution at broadcasting stations and for information reception at general homes.

【０００３】特に、MPEG２(ISO/IEC 13818-2)圧縮方式
は、汎用性がある画像圧縮方式として定義された規格で
あり、飛び越し走査画像および順次走査画像の双方、並
びに標準解像度画像および高精細画像を網羅する標準で
あって、例えばDVD(DigitalVersatile Disk)規格に代表
されるように、プロフェッショナル用途およびコンシュ
ーマー用途の広範なアプリケーションに現在広く用いら
れている。In particular, the MPEG2 (ISO / IEC 13818-2) compression method is a standard defined as a versatile image compression method, and includes both interlaced scanning images and progressive scanning images, as well as standard resolution images and high definition images. It is a standard that covers images and is currently widely used in a wide range of applications for professional use and consumer use, as represented by the DVD (Digital Versatile Disk) standard.

【０００４】MPEG２圧縮方式を用いることにより、例え
ば、７２０×４８０画素を持つ標準解像度の飛び越し走
査画像に対しては４乃至８Mbps、１９２０×１０８８画
素を持つ高解像度の飛び越し走査画像に対しては１８乃
至２２Mbpsの符号量（ビットレート）を割り当てること
で、高い圧縮率と良好な画質の実現が可能である。By using the MPEG2 compression method, for example, 4 to 8 Mbps for a standard resolution interlaced scan image having 720 × 480 pixels and 18 for a high resolution interlaced scan image having 1920 × 1088 pixels. By assigning a code amount (bit rate) of 22 Mbps to 22 Mbps, a high compression rate and good image quality can be realized.

【０００５】ところで、MPEG２は、主として放送用に適
合する高画質符号化を対象としていたが、より高い圧縮
率の符号化方式には対応していなかったので、より高い
圧縮率の符号化方式として、MPEG４符号化方式の標準化
が行われた。画像符号化方式に関しては、１９９８年１
２月にISO/IEC 14496-2としてその規格が国際標準に承
認された。By the way, MPEG2 was mainly intended for high-quality coding suitable for broadcasting, but since it did not support a coding method with a higher compression rate, it was used as a coding method with a higher compression rate. , MPEG4 encoding system was standardized. Regarding the image encoding method, 1998 1
In February, the standard was approved as an international standard as ISO / IEC 14496-2.

【０００６】さらに、近年、テレビ会議用の画像符号化
を当初の目的として、国際電気連合の電気通信標準化部
門であるITU-T(International Telecommunication Unio
n −Telecommunication Standardization Sector)によ
るＨ.２６Ｌ（ITU-T Q6/16 VCEG）と称される標準の規
格化が進められている。Further, in recent years, ITU-T (International Telecommunication Unio), which is a telecommunications standardization division of the International Telecommunications Union, has been initially designed for the purpose of image coding for video conferences.
Standardization called H.26L (ITU-T Q6 / 16 VCEG) by the n-Telecommunication Standardization Sector) is under way.

【０００７】Ｈ．２６Ｌは、MPEG２やMPEG４などの従来
の符号化方式に比較して、符号化処理、および復号処理
により多くの演算量が必要となるが、より高い符号化効
率が実現されることが知られている。H. 26L requires a larger amount of calculation in the encoding process and the decoding process as compared with the conventional encoding methods such as MPEG2 and MPEG4, but is known to achieve higher encoding efficiency. There is.

【０００８】またさらに、現在、MPEG４の活動の一環と
してITU-Tと共同で、Ｈ．２６Ｌに基づいた、Ｈ．２６
Ｌではサポートされない機能を取り入れた、より高い符
号化効率を実現する符号化技術の標準化が、Joint Mode
l of Enhanced-CompressionVideo Codingとして進めら
れている。[0008] Furthermore, as a part of the activities of MPEG4, the H.264 standard is currently being collaborated. H.26L based on H.26. 26
Joint Mode is a standardization of coding technology that achieves higher coding efficiency by incorporating functions not supported by L.
l of Enhanced-Compression Video Coding.

【０００９】ここで、離散コサイン変換またはカルーネ
ン・レーベ変換等の直交変換と動き補償とを利用した従
来の画像情報符号化装置について、図１を参照して説明
する。図１は、従来の画像情報符号化装置の構成の一例
を示している。Here, a conventional image information coding apparatus using orthogonal transformation such as discrete cosine transformation or Karhunen-Loeve transformation and motion compensation will be described with reference to FIG. FIG. 1 shows an example of the configuration of a conventional image information encoding device.

【００１０】当該画像情報符号化装置において、アナロ
グ信号である入力画像信号は、Ａ／Ｄ変換部１によって
ディジタル信号に変換された後、画面並べ替えバッファ
２に供給される。画面並べ替えバッファ２は、Ａ／Ｄ変
換部１からの画像情報を、当該画像情報符号化装置が出
力する画像圧縮情報のGOP(Group of Pictures)構造に応
じて、フレームの並べ替えを行う。In the image information coding apparatus, an input image signal which is an analog signal is converted into a digital signal by the A / D converter 1 and then supplied to the screen rearrangement buffer 2. The screen rearrangement buffer 2 rearranges the image information from the A / D converter 1 according to the GOP (Group of Pictures) structure of the image compression information output by the image information encoding apparatus.

【００１１】まず、イントラ（画像内）符号化が行われ
る画像について説明する。画面並び替えバッファ２にお
いて、イントラ符号化が行われる画像については、その
画像情報が加算器３を介して直交変換部４に供給され
る。直交変換部４では、画像情報に対して直交変換（離
散コサイン変換、またはカルーネン・レーベ変換等）が
施され、得られた変換係数が量子化部５に供給される。
量子化部５では、蓄積バッファ７に蓄積された変換係数
のデータ量に基づくレート制御部８からの制御に従い、
直交変換部４から供給された変換係数に対して量子化処
理が施こされる。First, an image in which intra (intra-image) coding is performed will be described. Regarding the image to be intra-coded in the screen rearrangement buffer 2, the image information is supplied to the orthogonal transformation unit 4 via the adder 3. The orthogonal transform unit 4 performs orthogonal transform (discrete cosine transform, Karhunen-Loeve transform, etc.) on the image information, and the obtained transform coefficient is supplied to the quantizer 5.
In the quantization unit 5, according to the control from the rate control unit 8 based on the data amount of the conversion coefficient accumulated in the accumulation buffer 7,
Quantization processing is performed on the transform coefficient supplied from the orthogonal transform unit 4.

【００１２】可逆符号化部６では、量子化部５から供給
された量子化された変換係数や量子化スケール等から符
号化モードが決定され、決定された符号化モードに対し
て可逆符号化（可変長符号化、または算術符号化等）が
施こされ、画像符号化単位のヘッダ部に挿入される情報
が形成される。また、符号化された符号化モードは、蓄
積バッファ７に供給されて蓄積される。蓄積バッファ７
に蓄積された、符号化された符号化モードは、画像圧縮
情報として後段に出力される。In the lossless encoding unit 6, the encoding mode is determined from the quantized transform coefficient and the quantization scale supplied from the quantization unit 5, and the lossless encoding for the determined encoding mode ( Variable length coding, arithmetic coding or the like) is performed to form information to be inserted in the header part of the image coding unit. Further, the encoded coding mode is supplied to and accumulated in the accumulation buffer 7. Accumulation buffer 7
The coded coding mode accumulated in is output to the subsequent stage as image compression information.

【００１３】また、可逆符号化部６では、量子化された
変換係数に対して可逆符号化が施され、符号化された変
換係数が蓄積バッファ７に蓄積させる。蓄積バッファ７
に蓄積された、符号化された変換係数も、画像圧縮情報
として後段に出力される。In addition, the lossless encoding unit 6 performs lossless encoding on the quantized transform coefficient, and stores the encoded transform coefficient in the accumulation buffer 7. Accumulation buffer 7
The coded transform coefficients accumulated in the above are also output to the subsequent stage as image compression information.

【００１４】逆量子化部９では、量子化部５によって量
子化された変換係数が逆量子化される。逆直交変換部１
０では、逆量子化された変換係数に対して逆直交変換処
理が施されて復号画像情報が生成される。ループフィル
タ１１では、生成された復号画像情報のブロック歪が除
去される。ブロック歪が除去された復号画像情報は、フ
レームメモリ１２に蓄積される。The inverse quantization unit 9 inversely quantizes the transform coefficient quantized by the quantization unit 5. Inverse orthogonal transform unit 1
In 0, the inverse quantized transform coefficient is applied to the inversely quantized transform coefficient to generate decoded image information. The loop filter 11 removes the block distortion of the generated decoded image information. The decoded image information from which the block distortion has been removed is stored in the frame memory 12.

【００１５】次に、インター（画像間）符号化が行われ
る画像について説明する。画面並び替えバッファ２にお
いて、インター符号化が行われる画像については、その
画像情報が加算器３および動き予測・補償部１３に供給
される。Next, an image to be inter-coded will be described. Regarding the image to be inter-encoded in the screen rearrangement buffer 2, the image information is supplied to the adder 3 and the motion prediction / compensation unit 13.

【００１６】動き予測・補償部１３では、画面並び替え
バッファ２からのインター符号化が行われる画像に対応
する、参照するための画像情報がフレームメモリ１２か
ら読み出され、動き予測・補償処理を施して参照画像情
報が生成され、加算器３に供給される。また、動き予測
・補償部１３で動き予測・補償処理の際に得られた動き
ベクトル情報は、可逆符号化部６に供給される。In the motion prediction / compensation unit 13, image information for reference corresponding to the image to be inter-encoded from the screen rearrangement buffer 2 is read from the frame memory 12 and the motion prediction / compensation process is performed. Then, the reference image information is generated and supplied to the adder 3. Further, the motion vector information obtained during the motion prediction / compensation processing by the motion prediction / compensation unit 13 is supplied to the lossless coding unit 6.

【００１７】加算器３では、動き予測・補償部１３から
の参照画像情報が、画面並び替えバッファ２からのイン
ター符号化が行われる画像の画像情報との差分信号に変
換される。In the adder 3, the reference image information from the motion prediction / compensation unit 13 is converted into a difference signal from the image information of the image to be inter-encoded from the screen rearrangement buffer 2.

【００１８】インター符号化が行われる画像を処理する
場合、直交変換部４では、差分信号に対して直交変換が
施され、得られる変換係数が量子化部５に供給される。
量子化部５では、レート制御部８からの制御に従い、直
交変換部４から供給された変換係数に対して量子化処理
を施される。When processing an image to be inter-coded, the orthogonal transform unit 4 performs an orthogonal transform on the difference signal, and the obtained transform coefficient is supplied to the quantizer 5.
In the quantizer 5, the transform coefficient supplied from the orthogonal transformer 4 is quantized under the control of the rate controller 8.

【００１９】可逆符号化部６では、量子化部５によって
量子化された変換係数および量子化スケール、並びに動
き予測・補償部１３から供給された動きベクトル情報等
に基づいて符号化モードが決定され、決定された符号化
モードに対して可逆符号化が施され、画像符号化単位の
ヘッダ部に挿入される情報が生成される。符号化された
符号化モードは蓄積バッファ７に蓄積される。蓄積バッ
ファ７に蓄積された符号化された符号化モードは、画像
圧縮情報として出力される。The lossless encoding unit 6 determines the encoding mode based on the transform coefficient and the quantization scale quantized by the quantization unit 5, the motion vector information supplied from the motion prediction / compensation unit 13, and the like. Then, lossless encoding is performed on the determined encoding mode, and information to be inserted into the header portion of the image encoding unit is generated. The encoded coding mode is stored in the storage buffer 7. The encoded encoding mode accumulated in the accumulation buffer 7 is output as image compression information.

【００２０】また、可逆符号化部６では、動き予測・補
償部１３からの動きベクトル情報に対して可逆符号化処
理が施され、画像符号化単位のヘッダ部に挿入される情
報が生成される。Further, the lossless coding unit 6 performs lossless coding processing on the motion vector information from the motion prediction / compensation unit 13 to generate information to be inserted in the header part of the image coding unit. .

【００２１】なお、インター符号化が行われる画像を処
理する場合における逆量子化部９以降の処理について
は、イントラ符号化を施される画像を処理する場合と同
様であるので、その説明を省略する。The processing after the inverse quantizer 9 in the case of processing an image to be inter-coded is similar to that in the case of processing an image to be intra-coded, and therefore its explanation is omitted. To do.

【００２２】次に、図１に示した従来の画像情報符号化
装置が出力する画像圧縮情報を入力とし、画像信号を復
元する従来の画像情報復号装置について、図２を参照し
て説明する。図２は、従来の画像情報復号装置の構成の
一例を示している。Next, a conventional image information decoding apparatus for inputting the image compression information output from the conventional image information encoding apparatus shown in FIG. 1 to restore an image signal will be described with reference to FIG. FIG. 2 shows an example of the configuration of a conventional image information decoding device.

【００２３】当該画像情報復号装置において、入力され
た画像圧縮情報は、蓄積バッファ２１に一時的に格納さ
れた後、可逆復号化部２２に転送される。可逆復号化部
２２は、予め定められている画像圧縮情報のフォーマッ
トに基づき、画像圧縮情報に対して可逆復号（可変長復
号、または算術復号等）を施し、ヘッダ部に格納された
符号化モード情報を取得して逆量子化部２３に供給す
る。また同様に、可逆復号化部２２は、量子化されてい
る変換係数を取得して逆量子化部２３に供給する。さら
に、可逆復号化部２２は、復号するフレームがインター
符号化されたものである場合には、画像圧縮情報のヘッ
ダ部に格納された動きベクトル情報についても復号し、
その情報を動き予測・補償部３０に供給する。In the image information decoding apparatus, the input image compression information is temporarily stored in the accumulation buffer 21 and then transferred to the lossless decoding unit 22. The lossless decoding unit 22 performs lossless decoding (variable length decoding, arithmetic decoding, or the like) on the image compression information based on a predetermined format of the image compression information, and the encoding mode stored in the header section. The information is acquired and supplied to the inverse quantization unit 23. Similarly, the lossless decoding unit 22 acquires the quantized transform coefficient and supplies it to the inverse quantization unit 23. Further, when the frame to be decoded is inter-coded, the lossless decoding unit 22 also decodes the motion vector information stored in the header portion of the image compression information,
The information is supplied to the motion prediction / compensation unit 30.

【００２４】逆量子化部２３は、可逆復号化部２２から
供給された量子化されている変換係数を逆量子化し、得
られる変換係数を逆直交変換部２４に供給する。逆直交
変換部２４は、予め定められている画像圧縮情報のフォ
ーマットに基づき、変換係数に対して逆直交変換（逆離
散コサイン変換、または逆カルーネン・レーベ変換等）
を施す。The dequantization unit 23 dequantizes the quantized transform coefficient supplied from the lossless decoding unit 22, and supplies the obtained transform coefficient to the inverse orthogonal transform unit 24. The inverse orthogonal transform unit 24 performs an inverse orthogonal transform (inverse discrete cosine transform, inverse Karhunen-Loeve transform, etc.) on the transform coefficient based on a predetermined image compression information format.
Give.

【００２５】ここで、対象となるフレームがイントラ符
号化されたものである場合には、逆直交変換が施された
画像情報は、加算器２５を介して画面並べ替えバッファ
２６に格納され、Ｄ／Ａ変換部２７によってアナログ信
号に変換されて後段に出力される。逆直交変換が施され
た画像情報は、ループフィルタ２８によってブロック歪
が除去された後、フレームメモリ２９にも格納される。Here, when the target frame is intra-coded, the image information subjected to the inverse orthogonal transform is stored in the screen rearrangement buffer 26 via the adder 25, and D The signal is converted into an analog signal by the / A converter 27 and output to the subsequent stage. The image information subjected to the inverse orthogonal transformation is stored in the frame memory 29 after the block distortion is removed by the loop filter 28.

【００２６】また、対象となるフレームがインター符号
化されたものである場合には、動き予測・補償部３０で
は、可逆復号化部２２からの動きベクトル情報とフレー
ムメモリ２９に格納された画像情報とに基づいて参照画
像が生成され、加算器２５に供給される。加算器２５で
は、動き予測・補償部２８からの参照画像と逆直交変換
部２５の出力とが合成されて画像情報が生成される。な
お、その他の処理については、イントラ符号化されたフ
レームと同様であるため、説明を省略する。When the target frame is inter-coded, the motion vector information from the lossless decoding unit 22 and the image information stored in the frame memory 29 in the motion prediction / compensation unit 30. A reference image is generated on the basis of and and is supplied to the adder 25. In the adder 25, the reference image from the motion prediction / compensation unit 28 and the output from the inverse orthogonal transform unit 25 are combined to generate image information. Note that the other processing is the same as that of the intra-coded frame, and therefore the description thereof is omitted.

【００２７】ところで、MPEG２においては、入力となる
画像信号が飛び越し走査フォーマットであった場合、マ
クロブロックレベルでフィールド／フレーム適応型符号
化処理が可能とされている。By the way, in the MPEG2, when the input image signal is in the interlaced scanning format, the field / frame adaptive encoding processing can be performed at the macroblock level.

【００２８】現在、Ｈ．２６Ｌにはそのような仕様は定
義されていないが、文献「"Interlace Coding Tools fo
r H.26L Video Coding(L.Wang et al. ,VCEG-O37,Dec.2
001)"」（以下、文献１と記述する）には、Ｈ．２６Ｌ
の仕様を、マクロブロックレベルでフィールド／フレー
ム適応型符号化処理を可能とするように拡張することが
提案されている。Currently, H.264. No such specification is defined in 26L, but the document "" Interlace Coding Tools fo
r H.26L Video Coding (L.Wang et al., VCEG-O37, Dec.2
001) "" (hereinafter referred to as Reference 1) is described in H. 26L
It has been proposed to extend the specification of (1) to enable field / frame adaptive coding processing at the macroblock level.

【００２９】文献１に提案されている、マクロブロック
レベルでのフィールド／フレーム適応型符号化処理につ
いて説明する。The field / frame adaptive encoding process at the macroblock level, which is proposed in Document 1, will be described.

【００３０】現在のＨ．２６Ｌにおいては、マクロブロ
ックにおける動き予測・補償の単位として、図１５に示
すような７種類のモード（mode１乃至７）が定義されて
いる。The current H.264. In 26L, seven types of modes (modes 1 to 7) as shown in FIG. 15 are defined as units of motion prediction / compensation in a macroblock.

【００３１】文献１においては、画像圧縮情報のマクロ
ブロックに対応するシンタクスとして、図４に示すよう
に、RunとMB_typeの間にFrame/Field Flagを持つことが
提案されている。Frame/Field Flagの値が０である場
合、当該マクロブロックはフレームベースの符号化が施
されることを示し、Frame/Field Flagの値が１である場
合、フィールドベースの符号化が施されることを示して
いる。Document 1 proposes that a syntax corresponding to a macroblock of image compression information has a Frame / Field Flag between Run and MB_type as shown in FIG. When the value of Frame / Field Flag is 0, it indicates that the macroblock is subjected to frame-based coding, and when the value of Frame / Field Flag is 1, it is subjected to field-based coding. It is shown that.

【００３２】Frame/Field Flagの値が１である場合（す
なわち、フィールドベースの符号化が施される場合）、
マクロブロック内の画素は、図５に示すように行単位で
画素の並べ替えが行われる。When the value of Frame / Field Flag is 1, that is, when field-based coding is performed,
The pixels in the macroblock are rearranged in units of rows as shown in FIG.

【００３３】Frame/Field Flagの値が１である場合、マ
クロブロックにおける動き予測・補償の単位として、図
３のmode３乃至７に相当する、図６に示す５種類モード
（mode１ａ乃至５ａ）が定義されている。When the value of Frame / Field Flag is 1, five types of modes (modes 1a to 5a) shown in FIG. 6 corresponding to modes 3 to 7 in FIG. 3 are defined as units of motion prediction / compensation in the macroblock. Has been done.

【００３４】例えば、図６のmode２ａにおいて、マクロ
ブロックを４分割した８×８ブロック０乃至３のうち、
ブロック０，１は同一のフィールドパリティに属し、ま
た、ブロック２，３は同一のフィールドパリティに属す
る。また例えば、図６のmode３ａにおいて、マクロブロ
ックを８分割した４×８ブロック０乃至８のうち、ブロ
ック０乃至３は同一のフィールドパリティに属し、ま
た、ブロック４乃至７は同一のフィールドパリティに属
する。For example, in mode 2a of FIG. 6, among 8 × 8 blocks 0 to 3 obtained by dividing a macroblock into four,
Blocks 0 and 1 belong to the same field parity, and blocks 2 and 3 belong to the same field parity. Further, for example, in mode 3a of FIG. 6, out of 4 × 8 blocks 0 to 8 obtained by dividing a macro block into eight, blocks 0 to 3 belong to the same field parity, and blocks 4 to 7 belong to the same field parity. .

【００３５】Frame/Field Flagの値が１である場合のイ
ントラ予測モードについて説明する。例えば、図９に示
した４×４ブロックに位置する画素ａ乃至ｐは、Frame/
Field Flagの値が１である場合においても、隣接する４
×４ブロックに位置する画素Ａ乃至Ｉを用いてイントラ
予測が行われるが、画素ａ乃至ｐ、および画素Ａ乃至Ｉ
が全て同一フィールドパリティに属していることが特徴
である。The intra prediction mode when the value of Frame / Field Flag is 1 will be described. For example, pixels a to p located in the 4 × 4 block shown in FIG.
Even if the value of Field Flag is 1, it is adjacent 4
Intra prediction is performed using the pixels A to I located in the × 4 block, but the pixels a to p and the pixels A to I are used.
Are all belonging to the same field parity.

【００３６】画素Ａ乃至Ｉが、画素ａ乃至ｐと同一のマ
クロブロックに属している場合について、図８を参照し
て説明する。マクロブロックを１６分割した４×４ブロ
ック７に存在する画素ａ乃至ｐは、隣接するブロック
２，３，６の端に存在する画素Ａ乃至Ｉを用いてイント
ラ予測が行われる。A case where the pixels A to I belong to the same macroblock as the pixels a to p will be described with reference to FIG. The pixels a to p existing in the 4 × 4 block 7 obtained by dividing the macroblock into 16 are subjected to intra prediction using the pixels A to I existing at the ends of the adjacent blocks 2, 3, and 6.

【００３７】画素Ａ乃至Ｉが、画素ａ乃至ｐとは異なる
マクロブロックに属する場合について、図９を参照して
説明する。A case where the pixels A to I belong to a macro block different from the pixels a to p will be described with reference to FIG.

【００３８】図９Ａは、処理対象としているマクロブロ
ックの左側のマクロブロックと、上側のマクロブロック
に対するFrame/Field Flagの値がそれぞれ１である場合
を示している。この場合、処理対象としているマクロブ
ロックを１６分割した４×４ブロックＣに存在する画素
のイントラ予測は、左側のマクロブロックを１６分割し
たブ４×４ブロックＡに存在する画素と、上側のマクロ
ブロックを１６分割した４×４ブロックＢに存在する画
素を用いて行われる。４×４ブロックＣ’に存在する画
素のイントラ予測は、４×４ブロックＡ’に存在する画
素と、４×４ブロックＢ’に存在する画素を用いて行わ
れる。FIG. 9A shows a case where the values of the Frame / Field Flags for the macroblock on the left side and the macroblock on the upper side of the macroblock to be processed are 1 respectively. In this case, the intra prediction of the pixels existing in the 4 × 4 block C obtained by dividing the macro block to be processed into 16 is performed by the pixels existing in the 4 × 4 block A obtained by dividing the left macro block into 16 and the upper macro. It is performed using the pixels existing in a 4 × 4 block B obtained by dividing a block into 16 parts. Intra prediction of the pixels existing in the 4 × 4 block C ′ is performed using the pixels existing in the 4 × 4 block A ′ and the pixels existing in the 4 × 4 block B ′.

【００３９】図９Ｂは、処理対象としているマクロブロ
ックに対するFrame/Field Flagの値が１であり、その左
側および上側のマクロブロックに対するFrame/Field Fl
agの値がそれぞれ０である場合を示している。この場
合、処理対象としているマクロブロックを１６分割した
４×４ブロックＣに存在する画素のイントラ予測は、左
側のマクロブロックを１６分割した４×４ブロックＡに
存在する画素と、上側のマクロブロックを１６分割した
４×４ブロックＢに存在する画素を用いて行われる。４
×４ブロックＣ’に存在する画素のイントラ予測は、４
×４ブロックＡ’に存在する画素と、４×４ブロックＢ
に存在する画素を用いて行われる。In FIG. 9B, the value of the Frame / Field Flag for the macroblock to be processed is 1, and the Frame / Field Fl for the macroblocks on the left and upper sides of the Frame / Field Flag.
The case where the value of ag is 0 is shown. In this case, the intra prediction of the pixels existing in the 4 × 4 block C obtained by dividing the macro block to be processed into 16 is performed by the pixels existing in the 4 × 4 block A obtained by dividing the left macro block into 16 and the upper macro block. Is performed by using the pixels existing in the 4 × 4 block B obtained by dividing 16 into 16 blocks. Four
The intra prediction of the pixels existing in the × 4 block C ′ is 4
Pixels existing in × 4 block A ′ and 4 × 4 block B
Is performed using the pixels existing in.

【００４０】次に、色差信号のイントラ予測について、
図１０を参照して説明する。Frame/Field Flagの値が１
である場合、色差信号のイントラ予測モードは１種類だ
けが定義されている。Next, regarding intra prediction of color difference signals,
This will be described with reference to FIG. Frame / Field Flag value is 1
If, then only one type of intra-prediction mode for color difference signals is defined.

【００４１】図１０において、Ａ乃至Ｄは、それぞれ色
差信号の４×４ブロックを示す。ブロックＡ，Ｂは、第
１フィールドに属し、ブロックＣ，Ｄは、第２フィール
ドに属する。ｓ₀乃至ｓ₂は、ブロックＡ乃至Ｄに隣接す
るブロックのうち、第１フィールドパリティに属するブ
ロックに存在する色差信号の合計値である。ｓ₃至ｓ
₅は、ブロックＡ乃至Ｄに隣接するブロックのうち、第
２フィールドパリティに属するブロックに存在する色差
信号の合計値である。In FIG. 10, A to D respectively represent 4 × 4 blocks of color difference signals. Blocks A and B belong to the first field, and blocks C and D belong to the second field. s _{0 to} s ₂ are the total values of the color difference signals existing in the blocks belonging to the first field parity among the blocks adjacent to the blocks A to D. s ₃ to s
₅ is a total value of the color difference signals existing in the blocks belonging to the second field parity among the blocks adjacent to the blocks A to D.

【００４２】ブロックＡ乃至Ｄにそれぞれ対応する予測
値Ａ乃至Ｄは、ｓ₀乃至ｓ₅が全て画枠内に存在する場
合、次式（１）に従って予測される。Ａ＝（ｓ₀＋ｓ₂＋４）／８Ｂ＝（ｓ₁＋２）／４Ｃ＝（ｓ₃＋ｓ₅＋４）／８Ｄ＝（ｓ₄＋２）／４・・・（１）The prediction values A to D respectively corresponding to the blocks A to D are predicted according to the following equation (1) when s _{0 to} s ₅ are all in the image frame. A = (s ₀ + s ₂ +4) / 8 B = (s ₁ +2) / 4 C = (s ₃ + s ₅ +4) / 8 D = (s ₄ +2) / 4 (1)

【００４３】ただし、ｓ₀乃至ｓ₅のうち、ｓ₀，ｓ₁，ｓ
₃，ｓ₄だけが画枠内に存在する場合、ブロックＡ乃至Ｄ
にそれぞれ対応する予測値Ａ乃至Ｄは、次式（２）に従
って予測される。Ａ＝（ｓ₀＋２）／４Ｂ＝（ｓ₁＋２）／４Ｃ＝（ｓ₃＋２）／４Ｄ＝（ｓ₄＋２）／４・・・（２）[0043] However, one of the s ₀ to _{_{_{s 5, s 0, s 1}}} , s
_If only ₃ and s ₄ exist in the image frame, blocks A to D
Predicted values A to D respectively corresponding to are predicted according to the following equation (2). A = (s ₀ +2) / 4 B = (s ₁ +2) / 4 C = (s ₃ +2) / 4 D = (s ₄ +2) / 4 (2)

【００４４】さらに、ｓ₀乃至ｓ₅のうち、ｓ₂ｓ₅だけが
画枠内に存在する場合、ブロックＡ乃至Ｄにそれぞれ対
応する予測値は、次式（３）に従って予測される。Ａ＝（ｓ₂＋２）／４Ｂ＝（ｓ₂＋２）／４Ｃ＝（ｓ₅＋２）／４Ｄ＝（ｓ₅＋２）／４・・・（３）[0044] Furthermore, among the s ₀ to s _5, if only s ₂ s ₅ exist in the picture frame, the predicted values corresponding to the blocks A to D, are predicted according to Expressions (3). A = (s ₂ +2) / 4 B = (s ₂ +2) / 4 C = (s ₅ +2) / 4 D = (s ₅ +2) / 4 (3)

【００４５】図１１は、上述したようにイントラ予測さ
れた後の色差信号の残差成分を符号化する方法を示して
いる。すなわち、それぞれの４×４ブロックに対して直
交変換処理を施した後、第１フィールドおよび第２フィ
ールドの直流成分を用いて図示すような２×２ブロック
が生成され、再び直交変換処理が施される。FIG. 11 shows a method of coding the residual component of the color difference signal after intra prediction as described above. That is, after performing orthogonal transform processing on each 4 × 4 block, a 2 × 2 block as shown in the figure is generated using the DC components of the first field and the second field, and the orthogonal transform processing is performed again. To be done.

【００４６】次に、Frame/Field Flagの値が１である場
合における動き予測・補償処理について説明する。Fram
e/Field Flagの値が１である場合、動き予測補償モード
としては、インター１６×１６モード、インター８×１
６モード、インター８×８モード、インター４×８モー
ド、インター４×４モードの６種類のモードが存在す
る。Next, the motion prediction / compensation process when the value of Frame / Field Flag is 1 will be described. Fram
If the value of e / Field Flag is 1, the motion prediction compensation mode is inter 16 × 16 mode, inter 8 × 1
There are 6 types of modes: 6 mode, inter 8 × 8 mode, inter 4 × 8 mode, and inter 4 × 4 mode.

【００４７】例えば、インター１６×１６モードは、イ
ンター８×１６モードにおける第１フィールドに対する
動きベクトル情報、第２フィールドに対する動きベクト
ル情報、および参照フレームが同等であるモードであ
る。For example, the inter 16 × 16 mode is a mode in which the motion vector information for the first field, the motion vector information for the second field, and the reference frame in the inter 8 × 16 mode are equivalent.

【００４８】これら６種類の動き予測補償モードに対し
て、それぞれCode_Number０乃至５が割り当てられてい
る。Code_Numbers 0 to 5 are assigned to the six types of motion prediction / compensation modes, respectively.

【００４９】現在のＨ．２６Ｌにおいては、図１２に示
すような、複数の参照フレームを設けることができるマ
ルチプルフレーム予測が規定されている。現在のフレー
ムベースのＨ．２６Ｌの規格において、参照フレームに
関する情報は、マクロブロックレベルで定義されてお
り、直前に符号化されたフレームに対し、Code_Number
０が割り当てられており、その１乃至５回前に符号化さ
れたフレームに対し、それぞれCode_Number１乃至５が
割り当てられている。The current H.264. In 26L, multiple frame prediction in which a plurality of reference frames can be provided as shown in FIG. 12 is defined. Current frame-based H.264. In the 26L standard, the information about the reference frame is defined at the macroblock level, and Code_Number is defined for the frame encoded immediately before.
0 is assigned, and Code_Number 1 to 5 are assigned to the frames coded 1 to 5 times before.

【００５０】これに対して、フィールドベース符号化を
行う場合、直前に符号化されたフレームの第１フィール
ドに対してCode_Number０が割り当てられ、第２フィー
ルドに対してCode_Number１が割り当てられる。その１
回前に符号化されたフレームの第１フィールドに対して
Code_Number２が割り当てられ、第２フィールドに対し
てCode_Number３が割り当てられる。さらに１回前に符
号化されたフレームの第１フィールドに対してCode_Num
ber４が割り当てられ、第２フィールドに対してCode_Nu
mber５が割り当てられる。On the other hand, when field-based coding is performed, Code_Number0 is assigned to the first field of the frame coded immediately before, and Code_Number1 is assigned to the second field. Part 1
For the first field of the frame encoded the previous time
Code_Number2 is assigned and Code_Number3 is assigned to the second field. Code_Num for the first field of the frame encoded one more time before
ber4 is assigned and Code_Nu for the second field
mber5 is assigned.

【００５１】また、フィールドベース符号化が行われる
マクロブロックに対しては、第１フィールドに対する参
照フィールドと、第２フィールドに対する参照フィール
ドが別個に規定される。Further, with respect to a macroblock in which field-based coding is performed, a reference field for the first field and a reference field for the second field are separately defined.

【００５２】次に、Frame/Field Flagの値が１である場
合の動きベクトル情報予測方式について説明するが、ま
ず、現在のＨ．２６Ｌにおいて規定されているメディア
ン予測について、図１２を参照して説明する。図１２に
示す１６×１６マクロブロックＥに対応する１６×１
６、８×８、または４×４動きベクトル情報は、隣接す
るマクロブロックＡ乃至Ｃの動きベクトル情報のメディ
アンを用いて予測される。Next, the motion vector information prediction method when the value of Frame / Field Flag is 1 will be described. The median prediction defined in 26L will be described with reference to FIG. 16 × 1 corresponding to the 16 × 16 macroblock E shown in FIG.
6, 8 × 8, or 4 × 4 motion vector information is predicted using the median of the motion vector information of adjacent macroblocks A to C.

【００５３】ただし、マクロブロックＡ乃至Ｃのうち、
画枠内に存在しないものについては、対応する動きベク
トル情報の値は０であるとしてメディアンを算出する。
例えば、マクロブロックＤ，Ｂ，Ｃが画枠内に存在しな
い場合、予測値としてマクロブロックＡに対応する動き
ベクトル情報を用いる。また、マクロブロックＣが画枠
内に存在しない場合、その代わりにマクロブロックＤの
動きベクトル情報を用いてメディアンを算出する。However, among the macro blocks A to C,
For those that do not exist in the image frame, the median is calculated assuming that the value of the corresponding motion vector information is 0.
For example, when the macro blocks D, B, and C do not exist in the image frame, the motion vector information corresponding to the macro block A is used as the prediction value. When the macroblock C does not exist in the image frame, the median is calculated using the motion vector information of the macroblock D instead.

【００５４】なお、マクロブロックＡ乃至Ｄの参照フレ
ームは必ずしも同一でなくてもよい。The reference frames of the macro blocks A to D do not necessarily have to be the same.

【００５５】次に、マクロブロックのブロックサイズ
が、８×１６、１６×８、８×４、または４×８である
場合について、図１４を参照して説明する。なお、注目
するマクロブロックＥとこれに隣接するマクロブロック
Ａ乃至Ｄは、図２４に示すように配置されているとす
る。Next, the case where the block size of the macroblock is 8 × 16, 16 × 8, 8 × 4, or 4 × 8 will be described with reference to FIG. Note that the macroblock E of interest and the macroblocks A to D adjacent to the macroblock E are arranged as shown in FIG.

【００５６】図１４Ａは、マクロブロックＥ１，Ｅ２の
ブロックサイズが８×１６である場合を示している。左
側のマクロブロックＥ１に関しては、左に隣接するマク
ロブロックＡがマクロブロックＥ１と同じフレームを参
照している場合、マクロブロックＡの動きベクトル情報
が予測値として用いられる。左に隣接するマクロブロッ
クＡがマクロブロックＥ１と異なるフレームを参照して
いる場合、上述したメディアン予測が適用される。FIG. 14A shows a case where the block size of the macro blocks E1 and E2 is 8 × 16. Regarding the left macroblock E1, if the left-neighboring macroblock A refers to the same frame as the macroblock E1, the motion vector information of the macroblock A is used as a prediction value. When the macroblock A adjacent to the left refers to a frame different from the macroblock E1, the median prediction described above is applied.

【００５７】右側のマクロブロックＥ２に関しては、右
上に隣接するマクロブロックＣがマクロブロックＥ２と
同じフレームを参照している場合、マクロブロックＣの
動きベクトル情報が予測値として用いられる。右上に隣
接するマクロブロックＣがマクロブロックＥ２と異なる
フレームを参照している場合、上述したメディアン予測
が適用される。Regarding the right macroblock E2, when the macroblock C adjacent to the upper right refers to the same frame as the macroblock E2, the motion vector information of the macroblock C is used as a prediction value. When the macroblock C adjacent to the upper right corner refers to a frame different from the macroblock E2, the median prediction described above is applied.

【００５８】図１４Ｂは、マクロブロックＥ１，Ｅ２の
ブロックサイズが１６×８である場合を示している。上
側のマクロブロックＥ１に関しては、上に隣接するマク
ロブロックＢがマクロブロックＥ１と同じフレームを参
照している場合、マクロブロックＢの動きベクトル情報
が予測値として用いられる。上に隣接するマクロブロッ
クＢがマクロブロックＥ１と異なるフレームを参照して
いる場合、上述したメディアン予測が適用される。FIG. 14B shows the case where the block size of the macro blocks E1 and E2 is 16 × 8. Regarding the upper macroblock E1, when the macroblock B adjacent to the upper macroblock E1 refers to the same frame as the macroblock E1, the motion vector information of the macroblock B is used as a prediction value. If the macroblock B adjacent to the top refers to a frame different from the macroblock E1, the above-described median prediction is applied.

【００５９】下側のマクロブロックＥ２に関しては、左
に隣接するマクロブロックＡがマクロブロックＥ２と同
じフレームを参照している場合、マクロブロックＡの動
きベクトル情報が予測値として用いられる。左に隣接す
るマクロブロックＡがマクロブロックＥ２と異なるフレ
ームを参照している場合、上述したメディアン予測が適
用される。Regarding the lower macroblock E2, when the left-neighboring macroblock A refers to the same frame as the macroblock E2, the motion vector information of the macroblock A is used as a prediction value. When the macroblock A adjacent to the left refers to a frame different from the macroblock E2, the median prediction described above is applied.

【００６０】図１４Ｃは、マクロブロックＥ１乃至Ｅ８
のブロックサイズが８×４である場合を示している。左
側のマクロブロックＥ１乃至Ｅ４に対しては、上述した
メディアン予測が適用され、右側のマクロブロックＥ５
乃至Ｅ８に対しては、左側のマクロブロックＥ１乃至Ｅ
４の動きベクトル情報が予測値として用いられる。FIG. 14C shows the macroblocks E1 to E8.
The block size is 8 × 4. The above median prediction is applied to the left macroblocks E1 to E4, and the right macroblock E5 is applied.
To E8, left macroblocks E1 to E
The motion vector information of 4 is used as the predicted value.

【００６１】図１４Ｄは、マクロブロックＥ１乃至Ｅ８
のブロックサイズが４×８である場合を示している。上
側のマクロブロックＥ１乃至Ｅ４に対しては、上述した
メディアン予測が適用され、下側のマクロブロックＥ５
乃至Ｅ８に対しては、上側のマクロブロックＥ１乃至Ｅ
４の動きベクトル情報が予測値として用いられる。FIG. 14D shows the macroblocks E1 to E8.
The block size is 4 × 8. The above-described median prediction is applied to the upper macroblocks E1 to E4, and the lower macroblock E5 is applied.
To E8, upper macroblocks E1 to E8
The motion vector information of 4 is used as the predicted value.

【００６２】Frame/Field Flagの値が１である場合にお
いても、動きベクトル情報の水平方向成分の予測に関し
ては、図１３および図１４を参照して上述の方式に準ず
る。しかしながら、垂直方向成分に関しては、フィール
ドベースのブロックとフレームベースのブロックが混在
するため、以下のような処理を行う。なお、注目するマ
クロブロックＥとこれに隣接するマクロブロックＡ乃至
Ｄは、図１３に示すように配置されているとする。Even when the value of Frame / Field Flag is 1, the prediction of the horizontal direction component of the motion vector information is based on the method described above with reference to FIGS. 13 and 14. However, with respect to the vertical component, field-based blocks and frame-based blocks coexist, so the following processing is performed. Note that the macroblock E of interest and the macroblocks A to D adjacent thereto are arranged as shown in FIG.

【００６３】マクロブロックＥをフレームベース符号化
する場合であって、隣接するマクロブロックＡ乃至Ｄの
いずれかがフィールドベース符号化されている場合、第
１フィールドに対する動きベクトル情報の垂直方向成分
と、第２フィールドに対する動きベクトル情報の垂直方
向成分の平均値の２倍を算出し、これをフレームベース
の動きベクトル情報に相当するものとして予測処理を行
う。When the macroblock E is frame-based coded and any of the adjacent macroblocks A to D is field-based coded, the vertical component of the motion vector information for the first field, Two times the average value of the vertical component of the motion vector information for the second field is calculated, and the prediction process is performed by assuming that this is equivalent to the frame-based motion vector information.

【００６４】マクロブロックＥをフィールドベース符号
化する場合であって、隣接するブロックＡ乃至Ｄのいず
れかがフレームベース符号化されている場合、動きベク
トル情報の垂直方向成分の値を２で割った商を、フィー
ルドベースの動きベクトルに相当するものとして予測処
理を行う。When the macroblock E is field-based coded and any of the adjacent blocks A to D is frame-based coded, the value of the vertical component of the motion vector information is divided by two. Prediction processing is performed with the quotient corresponding to a field-based motion vector.

【００６５】ところで、現在のＨ．２６Ｌにおいては、
４×４ブロックを単位とした離散コサイン変換処理の
後、量子化処理を行うようになされているが、文献「"A
BT Coding for Higher Resolution Video"(M.Wien et a
l. Joint Video Team of ISO/IEC MPEG & ITU-T VCEG,J
VT-B053)」（以下、文献２と記述する）においては、離
散コサイン変換処理の単位として、ブロックサイズが４
×４、４×８、８×４、および８×８の４種類のブロッ
クを用いることが開示されている。By the way, the present H.264. At 26L,
Quantization processing is performed after discrete cosine transform processing in units of 4 × 4 blocks.
BT Coding for Higher Resolution Video "(M. Wien et a
l. Joint Video Team of ISO / IEC MPEG & ITU-T VCEG, J
VT-B053) ”(hereinafter referred to as Reference 2), the block size is 4 as a unit of discrete cosine transform processing.
It is disclosed that four types of blocks of x4, 4x8, 8x4, and 8x8 are used.

【００６６】以下、文献２に開示されている、ブロック
サイズが４×４、４×８、８×４、および８×８の４種
類のブロックを離散コサイン変換処理の単位として用い
ること（以下、ABTを用いた直交変換とも記述する）に
ついて説明するが、その前に、現在のＨ．２６Ｌで規定
されている、離散コサイン変換処理および量子化処理に
ついて説明する。Hereinafter, use of four types of blocks having block sizes of 4 × 4, 4 × 8, 8 × 4, and 8 × 8 disclosed in Document 2 as a unit of discrete cosine transform processing (hereinafter, (Also referred to as orthogonal transformation using ABT), but before that, the current H.264 standard is described. The discrete cosine transform processing and the quantization processing defined by the standard 26L will be described.

【００６７】例えば量子化されている画素値をａ，ｂ，
ｃ，ｄとし、変換係数をＡ，Ｂ，Ｃ，Ｄとすれば、離散
コサイン変換に相当する処理は次式（４）によって表さ
れる。Ａ＝１３ａ＋１３ｂ＋１３ｃ＋１３ｄＢ＝１７ａ＋７ｂ−７ｃ＋１７ｄＣ＝１３ａ−１３ｂ−１３ｃ＋１３ｄＤ＝７ａ−１７ｂ＋１７ｃ−７ｄ・・・（４）For example, if the quantized pixel values are a, b,
If c and d are used and the conversion coefficients are A, B, C, and D, the process corresponding to the discrete cosine transform is expressed by the following equation (4). A = 13a + 13b + 13c + 13d B = 17a + 7b-7c + 17d C = 13a-13b-13c + 13d D = 7a-17b + 17c-7d (4)

【００６８】さらに、変換後の係数をａ’，ｂ’，
ｃ’，ｄ’とすれば、逆離散コサイン変換に相当する処
理は次式（５）によって表される。ａ’＝１３Ａ＋１７Ｂ＋１３Ｃ＋１７Ｄｂ’＝１３Ａ＋７Ｂ−１３Ｃ−１７Ｄｃ’＝１３Ａ−７Ｂ−１３Ｃ＋１７Ｄｄ’＝１３Ａ−１７Ｂ＋１３Ｃ−７Ｄ・・・（５）Further, the coefficients after conversion are a ′, b ′,
Assuming c ′ and d ′, the process corresponding to the inverse discrete cosine transform is expressed by the following equation (5). a '= 13A + 17B + 13C + 17D b' = 13A + 7B-13C-17D c '= 13A-7B-13C + 17D d' = 13A-17B + 13C-7D ... (5)

【００６９】ここで、量子化されている画素値ａと変換
後の係数ａ’は、次式（６）に示す関係となる。ａ’＝６７６ａ・・・（６）Here, the quantized pixel value a and the coefficient a'after conversion have the relationship shown in the following expression (6). a '= 676a (6)

【００７０】このような量子化されている画素値ａと変
換後の係数ａ’との関係は、式（４）および（５）に正
規化のプロセスが含まれていないことに起因する。な
お、正規化のプロセスは、逆量子化後の最後のシフト演
算によって、後述するように実行される。The relationship between the quantized pixel value a and the coefficient a'after conversion is due to the fact that the normalization process is not included in the equations (4) and (5). Note that the normalization process is executed as will be described later by the final shift operation after dequantization.

【００７１】Ｈ．２６Ｌにおいては、量子化／逆量子化
を行うための媒介変数として、０乃至３１の値を取り得
る量子化パラメータＱＰが定義されている。量子化ステ
ップは、量子化パラメータＱＰの値が１増加する毎に、
１２％ずつ増加するように設計されている。すなわち、
量子化パラメータＱＰの値が６だけ増加した場合、量子
化ステップサイズは２倍になることになる。H. In 26L, a quantization parameter QP that can take values from 0 to 31 is defined as a parameter for performing quantization / inverse quantization. In the quantization step, each time the value of the quantization parameter QP increases by 1,
Designed to increase by 12%. That is,
If the value of the quantization parameter QP is increased by 6, the quantization step size will be doubled.

【００７２】画像圧縮情報中に埋め込まれている量子化
パラメータＱＰは、輝度信号に対するものである。そこ
で、以下、輝度信号に対する量子化パラメータＱＰと区
別するために、色差信号に対する量子化パラメータをＱ
Ｐ_chromaと記述する。輝度信号に対する量子化パラメー
タＱＰと色差信号に対する量子化パラメータをＱＰ
_chromaとの対応関係は以下のとおりである。すなわち、
例えば、輝度信号に対する量子化パラメータＱＰの値０
乃至１７は、それぞれ、色差信号に対する量子化パラメ
ータをＱＰ_chromaの値０乃至１７に対応する。また例え
ば、輝度信号に対する量子化パラメータＱＰの値２０
は、それぞれ、色差信号に対する量子化パラメータをＱ
Ｐ_chromaの値１９に対応する。ＱＰ 0,1,2,・・・,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31 ＱＰ_chroma 0,1,2,・・・,17,17,18,19,20,20,21,22,22,23,23,24,24,25,25The quantization parameter QP embedded in the image compression information is for the luminance signal. Therefore, hereinafter, in order to distinguish from the quantization parameter QP for the luminance signal, the quantization parameter for the color difference signal is set to Q.
_Described as P _chroma . Quantization parameter QP for the luminance signal and QP for the color difference signal
The correspondence with _chroma is as follows. That is,
For example, the value 0 of the quantization parameter QP for the luminance signal
To 17 correspond to the QP _chroma values 0 to 17 as the quantization parameters for the color difference signals, respectively. Further, for example, the value of the quantization parameter QP for the luminance signal is 20
Respectively set the quantization parameter for the color difference signal to Q.
Corresponds to a value of 19 for P _chroma . QP 0,1,2, ..., 17,18,19,20,21,22,23,24,25,26,27,28,29,30,31 QP _chroma 0,1,2, ...・, 17,17,18,19,20,20,21,22,22,23,23,24,24,25,25

【００７３】Ｈ．２６Ｌにおいては、量子化／逆量子化
のため配列Ａ（ＱＰ），Ｂ（ＱＰ）が定義されている。Ａ（ＱＰ＝０，１，・・・，３１）６２０，５５３，４９２，４３９，３９１，３４８，３
１０，２７６，２４６，２１９，１９５，１７４，１５
５，１３８，１２３，１１０，９８，８７，７８，６
９，６２，５５，４９，４４，３９，３５，３１，２
７，２４，２２，１９，１７Ｂ（ＱＰ＝０，１，・・・，３１）３８８１，４３５１，４８９０，５４８１，６１５４，
６９１４，７７６１，８７１８，１０９８７，１２３３
９，１３８２８，１５５２３，１７４３５，１９５６
１，２１８７３，２４５５２，２７６５６，３０８４
７，３４８７０，３８８０７，４３７４７，４９１０
３，５４６８３，６１６９４，６８７４５，７７６１
５，８９１１３，１００２５３，１０９３６６，１２６
６３５，１４１５３３H. In 26L, arrays A (QP) and B (QP) are defined for quantization / dequantization. A (QP = 0,1, ..., 31) 620,553,492,439,391,348,3
10,276,246,219,195,174,15
5,138,123,110,98,87,78,6
9,62,55,49,44,39,35,31,2
7, 24, 22, 19, 17 B (QP = 0, 1, ..., 31) 3881, 4351, 4890, 5481, 6154,
6914, 7761, 8718, 10987, 1233
9,13828,15523,17435,1956
1,1873,24552,27665,3084
7,34870,38807,43747,4910
3,54683,61694,68745,7761
5,89113,100253,109366,126
635,141533

【００７４】なお、配列Ａ（ＱＰ）と配列Ｂ（ＱＰ）に
は、次式（７）に示す関係が成立する。Ａ（ＱＰ）×Ｂ（ＱＰ）×６７６²＝２⁴⁰ ・・・（７）The relationship shown in the following equation (7) is established between the array A (QP) and the array B (QP). A (QP) × B (QP) × 676 ² = 2 ⁴⁰ (7)

【００７５】例えば、係数Ｋを量子化する処理は、次式
（８）にしたがって実行される。 LEVEL＝（Ｋ×Ａ（ＱＰ）＋ｆ×２²⁰）／２²⁰ ・・・（８）ただし、式（８）のｆに関し、｜ｆ｜は、０から０．５
までの値であり、ｆの符号は係数Ｋに一致する。For example, the process of quantizing the coefficient K is executed according to the following equation (8). LEVEL = (K × A (QP) + f × 2 ²⁰ ) / 2 ²⁰ (8) However, with respect to f in the formula (8), | f | is 0 to 0.5.
And the sign of f coincides with the coefficient K.

【００７６】また例えば、逆量子化の処理は、次式
（９）にしたがって実行される。Ｋ’＝LEVEL×Ｂ（ＱＰ）・・・（９）Further, for example, the inverse quantization process is executed according to the following equation (9). K '= LEVEL x B (QP) ... (9)

【００７７】次に、文献２に開示されているABT(Adapti
ve Block Transform)を用いた直交変換を含む符号化方
式について説明する。Next, the ABT (Adapti
An encoding method including orthogonal transformation using ve block transform) will be described.

【００７８】インター符号化されるマクロブロックに関
して定義されている７種類のモードを図１５に示す。す
なわち、モード１は、動き補償・予測のブロックサイズ
が１６×１６であり、離散コサイン変換のブロックサイ
ズが８×８である。モード２は、動き補償・予測のブロ
ックサイズが８×１６であり、離散コサイン変換のブロ
ックサイズが８×８である。モード３は、動き補償・予
測のブロックサイズが１６×８であり、離散コサイン変
換のブロックサイズが８×８である。モード４は、動き
補償・予測のブロックサイズと離散コサイン変換のブロ
ックサイズがともに８×８である。モード５は、動き補
償・予測のブロックサイズと離散コサイン変換のブロッ
クサイズがともに８×４である。モード６は、動き補償
・予測のブロックサイズと離散コサイン変換のブロック
サイズがともに４×８である。モード７は、動き補償・
予測のブロックサイズと離散コサイン変換のブロックサ
イズがともに４×４である。FIG. 15 shows the seven types of modes defined for inter-coded macroblocks. That is, in mode 1, the block size for motion compensation / prediction is 16 × 16, and the block size for discrete cosine transform is 8 × 8. In mode 2, the block size for motion compensation / prediction is 8 × 16, and the block size for discrete cosine transform is 8 × 8. In mode 3, the block size for motion compensation / prediction is 16 × 8, and the block size for discrete cosine transform is 8 × 8. In mode 4, both the block size for motion compensation / prediction and the block size for discrete cosine transform are 8 × 8. In mode 5, both the block size for motion compensation / prediction and the block size for discrete cosine transform are 8 × 4. In mode 6, both the block size for motion compensation / prediction and the block size for discrete cosine transform are 4 × 8. Mode 7 is motion compensation
Both the block size for prediction and the block size for discrete cosine transform are 4 × 4.

【００７９】なお、イントラ符号化されるマクロブロッ
クに関しても、同様に、図１５に示した７種類のモード
が定義されている。ただし、モード４乃至７において
は、Ｈ．２６Ｌで規定されているイントラ予測が行われ
るが、モード０乃至３においては、イントラＤＣ予測だ
けが行われる。Similarly, for macro blocks to be intra-coded, seven types of modes shown in FIG. 15 are defined. However, in modes 4 to 7, H.264. The intra prediction specified in 26L is performed, but in modes 0 to 3, only intra DC prediction is performed.

【００８０】ブロックサイズが４×４以上である場合、
Directional予測を実行する前にブロックの境界に位置
する画素値にフィルタリングを施すことにより、Direct
ional予測の効率を高めることができる。このフィルタ
リングは、長さ８画素のブロック境界に適用される。こ
こで、ブロックの上部境界に位置する画素に対するフィ
ルタリング処理について説明する。なお、ブロックの左
部境界に位置する画素値に関する処理も同様である。When the block size is 4 × 4 or more,
Directing by filtering the pixel values located at the boundaries of blocks before performing Directional prediction.
The efficiency of ional forecast can be increased. This filtering is applied to block boundaries that are 8 pixels in length. Here, the filtering process for the pixels located at the upper boundary of the block will be described. The same applies to the processing for the pixel value located at the left boundary of the block.

【００８１】例えば、Ｎ×Ｍブロック（Ｎ，Ｍ＝４また
は８）に存在する要素をＰ（ｉ，ｊ）（ｉ＝０乃至Ｎ−
１；ｊ＝０乃至Ｍ−１）とし、当該ブロックＮ×Ｍブロ
ックの上部境界に位置する、既に復号されている画素を
Ｐ（ｎ，−１）（ｎ＝０乃至Ｎ−１）とすれば、画素Ｐ
（ｎ，−１）には、次式（１０）に示す３タップのＦＩ
Ｒフィルタが施され、得られる画素Ｐ’（ｎ，−１）を
用いてDirectional予測が行われる。［７，１８，７］//３２・・・（１０）For example, an element existing in an N × M block (N, M = 4 or 8) is defined as P (i, j) (i = 0 to N-).
1; j = 0 to M−1), and the already-decoded pixels located at the upper boundary of the block N × M block are P (n, −1) (n = 0 to N−1). If pixel P
In (n, -1), the 3-tap FI shown in the following equation (10) is used.
Directional prediction is performed using the pixel P ′ (n, −1) obtained by applying the R filter. [7, 18, 7] // 32 (10)

【００８２】次に、文献２に開示されているABTを用い
た直交変換について説明する。ABTを用いた直交変換
は、次式（１１）に示す４×４行列Ｔ₄と、次式（１
２）に示す８×８行列Ｔ₈が用いられた次式（１３）に
したがって行われる。Ｔ₄＝ 13 13 13 13 17 7 -7 -17 13 -13 -13 13 7 -17 17 -7 ・・・（１１）Next, the orthogonal transformation using the ABT disclosed in Document 2 will be described. The orthogonal transformation using the ABT is performed by using the 4 × 4 matrix T ₄ shown in the following equation (11) and the following equation (1
This is performed according to the following equation (13) using the 8 × 8 matrix T ₈ shown in 2). T ₄ = 13 13 13 13 17 7 -7 -17 13 -13 -13 13 7 -17 17 -7 ・・・ (11)

【００８３】Ｔ₈＝ 17 17 17 17 17 17 17 17 24 20 12 6 -6 -12 -20 -24 23 7 -7 -23 -23 -7 7 23 20 -6 -24 -12 12 24 6 -20 17 -17 -17 17 17 -17 -17 17 12 -24 6 20 -20 -6 24 -12 7 -23 23 -7 -7 23 -23 7 6 -12 20 -24 24 -20 12 -6 ・・・（１２）T ₈ = 17 17 17 17 17 17 17 17 24 20 12 6 -6 -12 -20 -24 23 7 -7 -23 -23 -7 7 23 20 -6 -24 -12 12 24 6 -20 17 -17 -17 17 17 -17 -17 17 12 -24 6 20 -20 -6 24 -12 7 -23 23 -7 -7 23 -23 7 6 -12 20 -24 24 -20 12 -6 ...・ (12)

【００８４】［Ｃ_n×m］＝［Ｔ_m］×［Ｂ_n×m］×［Ｔ_n］^T ・・・（１３）ただし、式（１３）において、［Ｂ_n×m］は、ｎ×ｍブ
ロック（ｎ，ｍ＝４または８）の画素であり、［Ｔ_n］
は、水平方向に対する直交変換行列であり、［Ｔ_m］
は、垂直方向に対する直交変換行列である。［Ｃ_n×m］
は、直交変換係数であるが、正規化は、後述するように
量子化処理と同時に施される。[C _{n × m} ] = [T _m ] × [B _{n × m} ] × [T _n ] ^T (13) However, in the equation (13), [B _{n × m} ] is n Pixels of × m blocks (n, m = 4 or 8), and [T _n ].
Is an orthogonal transformation matrix in the horizontal direction, and [T _m ]
Is an orthogonal transformation matrix in the vertical direction. [C _{n × m} ]
Is an orthogonal transform coefficient, but the normalization is performed at the same time as the quantization processing as described later.

【００８５】文献２に開示されているABTを用いた直交
変換を含む符号化方式においては、現在のＨ．２６Ｌに
対応して、量子化パラメータＱＰを媒介変数とする変数
Ａ_n _×m（ＱＰ）および変数Ｂ_n×m（ＱＰ）が、以下のよ
うに定義されている。In the coding method including orthogonal transformation using ABT disclosed in Document 2, the current H.264 standard is used. Corresponding to 26L, a variable A _n _{× m} (QP) and a variable B _{n × m} (QP) having the quantization parameter QP as a parameter are defined as follows.

【００８６】Ａ_4×4（ＱＰ＝０乃至３１）＝620,553,49
2,439,391,348,310,276,246,219,195,174,155,138,123,
110,98,87,78,69,62,55,49,44,39,35,31,27,24,22,19,1
7 Ｂ_4×4（ＱＰ＝０乃至３１）＝3881,4351,4890,5481,61
54,6914,7761,8718,9781,10987,12339,13828,15523,174
35,19561,21873,24552,27656,30847,34870,38807,4374
7,49103,54683,61694,68745,77615,89113,100253,10936
6,126635, 141533 Ａ_8×4（ＱＰ＝０乃至３１）＝335,299,266,237,211,18
8,168,149,133,118,105,94,84,75,67,59,53,47,42,37,3
4,30,26,24,21,19,17,15,13,12,10,9 Ｂ_8×4（ＱＰ＝０乃至３１）＝2100,2353,2645,2968,33
34,3742,4188,4721,5289,5962,6700,7484,8375,9380,10
500,11924,13274,14968,16750,19014,20691,23450,2705
8,29313,33500,37026,41382,46900,54116,58625,70350,
78167 Ａ_8×8（ＱＰ＝０乃至３１）＝181,162,144,128,114,10
2,91,81,72,64,57,51,45,40,36,32,29,25,23,20,18,16,
14,13,11,10,9,8,7,6,6,5 Ｂ_8×8（ＱＰ＝０乃至３１）＝1136,1270,1428,1607,18
04,2017,2260,2539,2857,3214,3609,4033,4571,5142,57
14,6428,7093,8228,8943,10285,11428,12856,14693,158
23,18700,20570,22855,25712,29385,34283,34283,41139A _{4 × 4} (QP = 0 to 31) = 620,553,49
2,439,391,348,310,276,246,219,195,174,155,138,123,
110,98,87,78,69,62,55,49,44,39,35,31,27,24,22,19,1
7 B _{4 × 4} (QP = 0 to 31) = 3881,4351,4890,5481,61
54,6914,7761,8718,9781,10987,12339,13828,15523,174
35,19561,21873,24552,27656,30847,34870,38807,4374
7,49103,54683,61694,68745,77615,89113,100253,10936
6,126635, 141533 A _{8 × 4} (QP = 0 to 31) = 335,299,266,237,211,18
8,168,149,133,118,105,94,84,75,67,59,53,47,42,37,3
4,30,26,24,21,19,17,15,13,12,10,9 B _{8 × 4} (QP = 0 to 31) = 2100,2353,2645,2968,33
34,3742,4188,4721,5289,5962,6700,7484,8375,9380,10
500,11924,13274,14968,16750,19014,20691,23450,2705
8,29313,33500,37026,41382,46900,54116,58625,70350,
78167 A _{8 × 8} (QP = 0 to 31) = 181,162,144,128,114,10
2,91,81,72,64,57,51,45,40,36,32,29,25,23,20,18,16,
14,13,11,10,9,8,7,6,6,5 B _{8 × 8} (QP = 0 to 31) = 1136,1270,1428,1607,18
04,2017,2260,2539,2857,3214,3609,4033,4571,5142,57
14,6428,7093,8228,8943,10285,11428,12856,14693,158
23,18700,20570,22855,25712,29385,34283,34283,41139

【００８７】なお、変数Ａ_n×m（ＱＰ）と変数Ｂ
_n×m（ＱＰ）には、次式（１４）に示す関係が成立す
る。Ａ_n×m（ＱＰ）×Ｂ_n×m（ＱＰ）×Ｎ² _n×m＝２⁴⁰ ・・・（１４）ただし、式（１４）において、Ｎ² _n×mは、図１６に示
すとおりである。The variable A _{n × m} (QP) and the variable B
_{For n × m} (QP), the relationship shown in the following equation (14) is established. A _{n × m} (QP) × B _{n × m} (QP) × N ² _{n × m} = 2 ⁴⁰ (14) However, in the formula (14), N ² _{n × m} is as shown in FIG. Is.

【００８８】なお、量子化処理および逆量子化処理は、
現在のＨ．２６Ｌと同様に、式（８），（９）に従って
実行される。The quantization processing and the inverse quantization processing are
Current H. Similar to 26L, it is executed according to equations (8) and (9).

【００８９】式（１３）に従って直交変換された２次元
配列［Ｃ_n×m］は、そのブロックサイズに応じて、図１
７に示す４種類のスキャン方式のいずれかによって１次
元配列に並べ替えられる。The two-dimensional array [C _{n × m} ] that has been orthogonally transformed according to the equation (13) corresponds to the block size shown in FIG.
It is rearranged into a one-dimensional array by any of the four scanning methods shown in FIG.

【００９０】ABTを用いた直交変換を含む符号化方式に
用いるループフィルタは、現在のＨ．２６Ｌにおいて定
められている方式と同様であるが、直交変換のブロック
サイズが４×８、８×４、または８×８であった場合、
ブロック内の４×４サブブロック境界においては、フィ
ルタリング処理を行わないようになされている。The loop filter used in the coding method including orthogonal transformation using ABT is the current H.264 standard. 26L is the same as the method defined in 26L, but when the block size of orthogonal transformation is 4 × 8, 8 × 4, or 8 × 8,
The filtering process is not performed at the 4 × 4 sub-block boundary in the block.

【００９１】可逆符号化に関しては、現在のＨ．２６Ｌ
においては、可変長符号化の一種であるUVLC(Universal
Variable Length Code)と、算術符号化の一種であるCA
BAC(Context-based adaptive binary arithmetic codin
g)の２種類が定義されており、ユーザは可逆符号化方式
にUVLCまたはCABACの一方を選択して適用することが可
能である。文献２のABTの符号化方式においては、ABTに
対するCABACが規定されている。Regarding lossless encoding, the current H.264 standard is used. 26L
In the case of UVLC (Universal
Variable Length Code) and CA, which is a type of arithmetic coding
BAC (Context-based adaptive binary arithmetic codin
Two types of g) are defined, and the user can select and apply either UVLC or CABAC to the lossless encoding method. In the ABT encoding method of Document 2, CABAC for ABT is specified.

【００９２】ABTを用いた直交変換を含む符号化方式に
おいては、図１８に示すようにCode_Numberが割り当て
られている、イントラABTモードを符号化する必要があ
り、このイントラABTモードは、現在のＨ．２６Ｌにお
いて、インター符号化されるフレームにおけるMB_type
に対して定義されているコンテキストモデルを用いて符
号化される。In the coding method including orthogonal transformation using ABT, it is necessary to code the intra ABT mode to which Code_Number is assigned as shown in FIG. 18, and this intra ABT mode is the current H ． MB_type in a frame to be inter-coded at 26L
It is encoded using the context model defined for.

【００９３】４×４、４×８、８×４、および８×８ブ
ロックに関しては、イントラ予測モードが符号化される
必要があるが、現在のＨ．２６Ｌにおいて定義されてい
るイントラ予測モードがそのまま適用される。For 4x4, 4x8, 8x4, and 8x8 blocks, the intra-prediction mode needs to be coded, but the current H.264 standard. The intra prediction mode defined in 26L is applied as it is.

【００９４】CBPYに関しては、現在のＨ．２６Ｌと同様
の処理が、ABTの符号化方式においても適用される。Regarding CBPY, the current H.264 standard is used. The same processing as that of 26L is also applied to the ABT encoding method.

【００９５】LEVELに関しては、図１５に示した７種類
のモードのうち、モード５乃至７においては、現在の
Ｈ．２６Ｌと同様の手法が適用される。モード１乃至４
においては、CBP＝１であるということは、少なくとも
１つの非零係数が８×８ブロック内に存在することにな
るため、第１のLEVELに関しては、第１のEOB(End Of Bl
ock)が存在することなく符号化される。Regarding LEVEL, in the modes 5 to 7 among the seven types of modes shown in FIG. A method similar to that of 26L is applied. Modes 1 to 4
, CBP = 1 means that at least one non-zero coefficient exists in the 8 × 8 block, so that for the first LEVEL, the first EOB (End Of Bl)
ock) is encoded without being present.

【００９６】RUNに関しては、４×４ブロックにおいて
は、現在のＨ．２６Ｌで定められている手法が適用され
るが、４×８、８×４、および８×８ブロックにおいて
は、図１９に示すようなテーブルに基づいて２値化が行
われる。これらのRUN情報を、CABACを用いて符号化する
ためのコンテキストモデルを図２０に示す。Regarding RUN, in the 4 × 4 block, the current H.264 is set. Although the method defined by 26L is applied, binarization is performed on 4 × 8, 8 × 4, and 8 × 8 blocks based on a table shown in FIG. FIG. 20 shows a context model for encoding these RUN information using CABAC.

【００９７】動き予測に関しては、現在のＨ．２６Ｌに
おいては、複数の４×４ブロックに対して、アダマール
変換が施される。また、動き予測により生成された差分
値を直交変換して得られる係数の絶対値和SATD(the sum
of the absolute values ofthe transformed block di
fferences)を用いて、整数画素精度以下の動き予測が行
われる。Regarding motion estimation, the current H.264 standard is used. In 26L, Hadamard transform is applied to a plurality of 4 × 4 blocks. Also, the absolute value sum SATD (the sum) of the coefficients obtained by orthogonally transforming the difference value generated by motion estimation
of the absolute values of the transformed block di
(fferences) is used to perform motion prediction with an integer pixel precision or less.

【００９８】ABTを用いた直交変換を含む符号化処理に
おいても、同様にアダマール変換が用いられる。ただ
し、正規化プロセスには√２で除算する演算が必要とな
るが、文献２においては、１／√８の近似値として１８
１／５１２を用いている。図２１に、アダマール変換後
の正規化ファクタを示す。The Hadamard transform is also used in the encoding process including the orthogonal transform using the ABT. However, the normalization process requires an operation of dividing by √2, but in Reference 2, it is 18 as an approximate value of 1 / √8.
1/512 is used. FIG. 21 shows the normalization factor after Hadamard transform.

【００９９】イントラモードの選択について説明する。
Low Complexity Modeにおいては、上述した絶対値和SAT
Dを最小とするようなイントラモードが選択される。Hig
h Complexity Modeにおいては、次式（１５）に示すコ
スト関数Ｊを用いたモード判定が行われる。Ｊ＝Ｄ＋λ_modeＲ・・・（１５）ただし、式（１５）において、Ｄは歪であり、Ｒはビッ
トコストである。また、λ_modeは現在のＨ．２６Ｌにお
いて用いられているラグランジェ乗数である。The selection of the intra mode will be described.
In Low Complexity Mode, the above-mentioned absolute value sum SAT
The intra mode that minimizes D is selected. Hig
In h Complexity Mode, mode determination using the cost function J shown in the following equation (15) is performed. J = D + λ _mode R (15) However, in Expression (15), D is distortion and R is bit cost. Also, λ _mode is the current H.264 standard. It is a Lagrange multiplier used in 26L.

【０１００】具体的には、まず、４×４、４×８、８×
４、および８×８の各ABTブロックに対して、最適なイ
ントラ予測モードが選択される。４×４以外のABTブロ
ックにおいてはＤＣ予測のみが適用される。このそれぞ
れに対し、式（１５）を用いてコスト関数Ｊの値が計算
され、最適なABTブロックが選択される。Specifically, first, 4 × 4, 4 × 8, 8 ×
The optimal intra prediction mode is selected for each of the 4 and 8 × 8 ABT blocks. Only DC prediction is applied to ABT blocks other than 4 × 4. For each of these, the value of the cost function J is calculated using equation (15), and the optimum ABT block is selected.

【０１０１】[0101]

【発明が解決しようとする課題】ところで、文献２に提
案されている方法では、入力となる画像情報が飛び越し
走査フォーマットである場合、フィールド構造およびフ
レーム構造の一方の画像情報だけしか符号化することは
できないが、文献１に提案されている、マクロブロック
レベルでのフィールド／フレーム適応型符号化処理を組
み合わせることによって、より高い符号化効率を実現で
きることが期待される。By the way, in the method proposed in Document 2, when the input image information is the interlaced scanning format, only one of the image information of the field structure and the frame structure is encoded. However, it is expected that higher coding efficiency can be realized by combining the field / frame adaptive coding processing at the macroblock level proposed in Reference 1.

【０１０２】本発明はこのような状況に鑑みてなされた
ものであり、飛び越し走査フォーマットの画像情報を入
力とし、マクロブロックレベルでフィールド／フレーム
適応型符号化処理を行い、直交変換にはABTを用いる画
像符号化装置を実現することを目的とする。The present invention has been made in view of such a situation. The image information of the interlaced scanning format is input, the field / frame adaptive encoding processing is performed at the macroblock level, and the ABT is used for orthogonal transformation. It is an object to realize an image encoding device to be used.

【０１０３】[0103]

【課題を解決するための手段】本発明の符号化装置は、
予め設定されている複数の直交変換用ブロックサイズの
いずれかにマクロブロックを分割して直交変換する直交
変換手段と、最も符号化効率が高くなるように、直交変
換用ブロックサイズを決定するブロックサイズ決定手段
と、直交変換手段の処理結果を量子化する量子化手段
と、量子化手段の処理結果を可逆符号化する可逆符号化
手段とを含むことを特徴とする。The encoding device of the present invention is
Orthogonal transform means for dividing a macroblock into any of a plurality of preset orthogonal transform block sizes and performing orthogonal transform, and block size for determining the orthogonal transform block size so as to maximize the coding efficiency. It is characterized by including a determining means, a quantizing means for quantizing the processing result of the orthogonal transforming means, and a lossless encoding means for losslessly encoding the processing result of the quantizing means.

【０１０４】前記直交変換用ブロックサイズは、４×
４，４×８，８×４、および８×８の４種類であるよう
にすることができる。The orthogonal transform block size is 4 ×
There can be four types, 4,4 × 8, 8 × 4, and 8 × 8.

【０１０５】前記ブロックサイズ決定手段は、マクロブ
ロックがインターマクロブロックであってフィールドベ
ースの符号化処理が施される場合、動き予測・補償用ブ
ロックサイズに対応して直交変換用ブロックサイズを決
定するようにすることができる。The block size determining means determines the orthogonal transform block size in correspondence with the motion prediction / compensation block size when the macro block is an inter macro block and is subjected to field-based coding processing. You can

【０１０６】前記ブロックサイズ決定手段は、マクロブ
ロックがインターマクロブロックであってフィールドベ
ースの符号化処理が施される場合、動き予測・補償用ブ
ロックサイズが１６×８であるとき、直交変換用ブロッ
クサイズを８×８に決定するようにすることができる。When the macroblock is an inter macroblock and is subjected to field-based coding processing, and the block size for motion prediction / compensation is 16 × 8, the block size determining means determines the block for orthogonal transform. The size can be determined to be 8 × 8.

【０１０７】前記ブロックサイズ決定手段は、マクロブ
ロックがインターマクロブロックであってフィールドベ
ースの符号化処理が施される場合、動き予測・補償用ブ
ロックサイズが８×８であるとき、直交変換用ブロック
サイズも８×８に決定するようにすることができる。When the macroblock is an inter macroblock and is subjected to field-based coding processing, and the block size for motion prediction / compensation is 8 × 8, the block size determining means determines the block for orthogonal transformation. The size can also be set to 8 × 8.

【０１０８】前記ブロックサイズ決定手段は、マクロブ
ロックがインターマクロブロックであってフィールドベ
ースの符号化処理が施される場合、動き予測・補償用ブ
ロックサイズが８×４であるとき、直交変換用ブロック
サイズも８×４に決定するようにすることができる。When the macroblock is an inter macroblock and is subjected to field-based coding processing, and the block size for motion prediction / compensation is 8 × 4, the block size determining means determines the block for orthogonal transformation. The size can also be set to 8 × 4.

【０１０９】前記ブロックサイズ決定手段は、マクロブ
ロックがインターマクロブロックであってフィールドベ
ースの符号化処理が施される場合、動き予測・補償用ブ
ロックサイズが４×８であるとき、直交変換用ブロック
サイズも４×８に決定するようにすることができる。When the macroblock is an inter macroblock and is subjected to field-based coding processing, and the block size for motion prediction / compensation is 4 × 8, the block size determining means determines the block for orthogonal transformation. The size can also be set to 4 × 8.

【０１１０】前記ブロックサイズ決定手段は、マクロブ
ロックがインターマクロブロックであってフィールドベ
ースの符号化処理が施される場合、動き予測・補償用ブ
ロックサイズが４×４であるとき、直交変換用ブロック
サイズも４×４に決定するようにすることができる。When the macroblock is an inter-macroblock and the field-based coding process is performed, and the block size for motion prediction / compensation is 4 × 4, the block size determining means determines the orthogonal transform block. The size can also be set to 4 × 4.

【０１１１】本発明の符号化装置は、マクロブロックレ
ベルでの符号化処理を、フィールドベースとするかフレ
ームベースとするかを判定するフィールド／フレーム判
定手段をさらに含むことができる。The coding apparatus of the present invention can further include field / frame judging means for judging whether the coding process at the macroblock level is field-based or frame-based.

【０１１２】前記フィールド／フレーム判定手段による
判定と、前記ブロックサイズ決定手段による決定は、同
時に行われるようにすることができる。The determination by the field / frame determining means and the determination by the block size determining means can be performed simultaneously.

【０１１３】前記フィールド／フレーム判定手段による
判定と、前記ブロックサイズ決定手段による決定は、動
き予測処理によって生成された差分値を直交変換して得
られる係数の絶対値和SATDが最小となるように、同時に
行われるようにすることができる。The determination by the field / frame determination means and the determination by the block size determination means are such that the absolute value sum SATD of the coefficients obtained by orthogonally transforming the difference value generated by the motion prediction processing is minimized. , Can be done at the same time.

【０１１４】前記フィールド／フレーム判定手段による
判定と、前記ブロックサイズ決定手段による決定は、コ
スト関数Ｊの値に基づいて、同時に行われるようにする
ことができる。The determination by the field / frame determining means and the determination by the block size determining means can be performed simultaneously based on the value of the cost function J.

【０１１５】前記フィールド／フレーム判定手段は、フ
レーム内で垂直方向に隣接した画素の差分の自乗和Ｖ
_ar1と、フィールド内で垂直方向に隣接した画素の差分
の自乗和Ｖ_ar2との比較結果に対応して、マクロブロッ
クレベルでの符号化処理を、フィールドベースとするか
フレームベースとするかを判定するようにすることがで
きる。The field / frame determination means sums up the squared sum V of the differences between vertically adjacent pixels in the frame.
and _ar1, in response to a result of comparison between the sum of the squares of the differences between adjacent pixels in the vertical direction in the field V _ar2, the encoding processing at the macroblock level, or to any frame-based and field-based determination You can

【０１１６】前記ブロックサイズ決定手段は、動き予測
処理によって生成された差分値を直交変換して得られる
係数の絶対値和SATDが最小となるように、直交変換用ブ
ロックサイズを決定するようにすることができる。The block size determining means determines the block size for orthogonal transformation so that the sum SATD of the absolute values of the coefficients obtained by orthogonally transforming the difference value generated by the motion prediction processing becomes the minimum. be able to.

【０１１７】前記ブロックサイズ決定手段は、コスト関
数Ｊの値に基づいて、直交変換用ブロックサイズを決定
するようにすることができる。The block size determining means may determine the orthogonal transform block size based on the value of the cost function J.

【０１１８】前記可逆符号化手段は、フィールド／フレ
ーム判定手段による判定結果を示すfield/frame_flag、
およびブロックサイズ決定手段による決定結果を示すMB
_typeも可逆符号化するようにすることができる。The lossless encoding means is a field / frame_flag indicating the determination result by the field / frame determining means,
And MB showing the result of determination by the block size determination means
_type can also be losslessly encoded.

【０１１９】前記可逆符号化手段は、UVLCを用いるよう
にすることができる。The lossless encoding means may use UVLC.

【０１２０】前記可逆符号化手段は、CABACを用いるよ
うにすることができる。The lossless encoding means may use CABAC.

【０１２１】本発明の符号化方法は、予め設定されてい
る複数の直交変換用ブロックサイズのいずれかにマクロ
ブロックを分割して直交変換する直交変換ステップと、
最も符号化効率が高くなるように、直交変換用ブロック
サイズを決定するブロックサイズ決定ステップと、直交
変換ステップの処理結果を量子化する量子化ステップ
と、量子化ステップの処理結果を可逆符号化する可逆符
号化ステップとを含むことを特徴とする。The encoding method of the present invention comprises an orthogonal transformation step of dividing a macroblock into any of a plurality of preset orthogonal transformation block sizes and performing orthogonal transformation.
The block size determination step for determining the block size for orthogonal transformation, the quantization step for quantizing the processing result of the orthogonal transformation step, and the lossless encoding for the processing result of the quantization step so that the coding efficiency is the highest. And a lossless encoding step.

【０１２２】本発明の第１の記録媒体のプログラムは、
予め設定されている複数の直交変換用ブロックサイズの
いずれかにマクロブロックを分割して直交変換する直交
変換ステップと、最も符号化効率が高くなるように、直
交変換用ブロックサイズを決定するブロックサイズ決定
ステップと、直交変換ステップの処理結果を量子化する
量子化ステップと、量子化ステップの処理結果を可逆符
号化する可逆符号化ステップとを含むことを特徴とす
る。The program of the first recording medium of the present invention is
An orthogonal transformation step of dividing a macroblock into one of a plurality of preset orthogonal transformation block sizes and performing orthogonal transformation, and a block size that determines the orthogonal transformation block size so that the coding efficiency is highest. It is characterized by including a determining step, a quantization step for quantizing the processing result of the orthogonal transformation step, and a lossless coding step for losslessly coding the processing result of the quantization step.

【０１２３】本発明の第１のプログラムは、予め設定さ
れている複数の直交変換用ブロックサイズのいずれかに
マクロブロックを分割して直交変換する直交変換ステッ
プと、最も符号化効率が高くなるように、直交変換用ブ
ロックサイズを決定するブロックサイズ決定ステップ
と、直交変換ステップの処理結果を量子化する量子化ス
テップと、量子化ステップの処理結果を可逆符号化する
可逆符号化ステップとをコンピュータに実行させること
を特徴とする。The first program of the present invention comprises an orthogonal transformation step of dividing a macroblock into any of a plurality of preset orthogonal transformation block sizes and performing orthogonal transformation, and the highest coding efficiency. In the computer, a block size determination step for determining a block size for orthogonal transformation, a quantization step for quantizing the processing result of the orthogonal transformation step, and a lossless encoding step for losslessly encoding the processing result of the quantization step are stored in a computer. It is characterized by being executed.

【０１２４】本発明の復号装置は、圧縮画像情報を復号
して、マクロブロックレベルでの符号化処理がフィール
ドベースであるかフレームベースであるかを示すfield/
frame_flag、符号化処理に含まれる直交変換処理でのブ
ロックサイズを示すMB_type、量子化されている直交変
換係数、並びに予測モード情報および動きベクトル情報
を生成する復号手段と、量子化されている直交変換係数
を逆量子化する逆量子化手段と、MB_typeに基づき、逆
量子化手段の処理結果を逆直交変換する逆直交変換手段
と、予測モード情報および動きベクトル情報に従い、復
号画像情報を元にして参照画像を生成する参照画像生成
手段とを含むことを特徴とする。The decoding apparatus of the present invention decodes the compressed image information and indicates whether the encoding process at the macroblock level is field-based or frame-based.
frame_flag, MB_type indicating the block size in the orthogonal transform process included in the encoding process, a quantized orthogonal transform coefficient, a decoding unit that generates prediction mode information and motion vector information, and a quantized orthogonal transform Inverse quantization means for inverse quantizing the coefficient, based on MB_type, inverse orthogonal transformation means for inverse orthogonal transforming the processing result of the inverse quantization means, according to the prediction mode information and motion vector information, based on the decoded image information And a reference image generating means for generating a reference image.

【０１２５】本発明の復号装置は、field/frame_flagに
基づき、逆直交手段の処理結果をフレーム構造に変換す
るフレーム構造変換手段をさらに含むことができる。The decoding apparatus of the present invention can further include frame structure conversion means for converting the processing result of the inverse orthogonal means into a frame structure based on field / frame_flag.

【０１２６】本発明の復号装置は、field/frame_flagに
基づき、参照画像生成手段に供給される前の復号情報を
フィールド構造に変換するフィールド構造変換手段をさ
らに含むことができる。The decoding apparatus of the present invention may further include field structure conversion means for converting the decoding information before being supplied to the reference image generation means into a field structure based on field / frame_flag.

【０１２７】本発明の復号装置の復号方法は、圧縮画像
情報を復号して、マクロブロックレベルでの符号化処理
がフィールドベースであるかフレームベースであるかを
示すfield/frame_flag、符号化処理に含まれる直交変換
処理でのブロックサイズを示すMB_type、量子化されて
いる直交変換係数、並びに予測モード情報および動きベ
クトル情報を生成する復号ステップと、量子化されてい
る直交変換係数を逆量子化する逆量子化ステップと、MB
_typeに基づき、逆量子化手段の処理結果を逆直交変換
する逆直交変換ステップと、予測モード情報および動き
ベクトル情報に従い、復号画像情報を元にして参照画像
を生成する参照画像生成ステップとを含むことを特徴と
する。The decoding method of the decoding apparatus of the present invention decodes compressed image information, and field / frame_flag indicating whether the encoding process at the macroblock level is field-based or frame-based MB_type indicating the block size in the included orthogonal transform processing, a decoding step for generating quantized orthogonal transform coefficients, prediction mode information and motion vector information, and dequantizing the quantized orthogonal transform coefficients Inverse quantization step and MB
_type, an inverse orthogonal transform step of performing an inverse orthogonal transform of the processing result of the inverse quantization means, and a reference image generation step of generating a reference image based on the decoded image information according to prediction mode information and motion vector information. It is characterized by

【０１２８】本発明の第２の記録媒体のプログラムは、
圧縮画像情報を復号して、マクロブロックレベルでの符
号化処理がフィールドベースであるかフレームベースで
あるかを示すfield/frame_flag、符号化処理に含まれる
直交変換処理でのブロックサイズを示すMB_type、量子
化されている直交変換係数、並びに予測モード情報およ
び動きベクトル情報を生成する復号ステップと、量子化
されている直交変換係数を逆量子化する逆量子化ステッ
プと、MB_typeに基づき、逆量子化手段の処理結果を逆
直交変換する逆直交変換ステップと、予測モード情報お
よび動きベクトル情報に従い、復号画像情報を元にして
参照画像を生成する参照画像生成ステップとを含むこと
を特徴とする。The program of the second recording medium of the present invention is
Decodes compressed image information, field / frame_flag indicating whether the encoding process at the macroblock level is field-based or frame-based, MB_type indicating the block size in the orthogonal transform process included in the encoding process, Decoding step for generating the quantized orthogonal transform coefficient, prediction mode information and motion vector information, dequantizing step for dequantizing the quantized orthogonal transform coefficient, and dequantizing based on MB_type An inverse orthogonal transform step of performing an inverse orthogonal transform on the processing result of the means, and a reference image generation step of generating a reference image based on the decoded image information according to the prediction mode information and the motion vector information.

【０１２９】本発明の第２のプログラムは、圧縮画像情
報を復号して、マクロブロックレベルでの符号化処理が
フィールドベースであるかフレームベースであるかを示
すfield/frame_flag、符号化処理に含まれる直交変換処
理でのブロックサイズを示すMB_type、量子化されてい
る直交変換係数、並びに予測モード情報および動きベク
トル情報を生成する復号ステップと、量子化されている
直交変換係数を逆量子化する逆量子化ステップと、MB_t
ypeに基づき、逆量子化手段の処理結果を逆直交変換す
る逆直交変換ステップと、予測モード情報および動きベ
クトル情報に従い、復号画像情報を元にして参照画像を
生成する参照画像生成ステップとをコンピュータに実行
させることを特徴とする。The second program of the present invention decodes the compressed image information and includes field / frame_flag which indicates whether the encoding process at the macroblock level is field-based or frame-based, and is included in the encoding process. MB_type indicating the block size in the orthogonal transform processing, the decoding step of generating the quantized orthogonal transform coefficient, the prediction mode information and the motion vector information, and the inverse step of dequantizing the quantized orthogonal transform coefficient. Quantization step and MB_t
A computer includes an inverse orthogonal transform step for performing an inverse orthogonal transform on the processing result of the inverse quantization means based on ype, and a reference image generation step for generating a reference image based on the decoded image information according to prediction mode information and motion vector information. It is characterized by making it execute.

【０１３０】本発明の符号化装置および方法、並びに第
１のプログラムにおいては、最も符号化効率が高くなる
ように、直交変換用ブロックサイズが決定され、決定さ
れた直交変換用ブロックサイズにマクロブロックが分割
されて直交変換され、直交変換の処理結果が量子化さ
れ、量子化の処理結果が可逆符号化される。In the coding apparatus and method and the first program of the present invention, the orthogonal transform block size is determined so that the coding efficiency is maximized, and the macroblock is set to the determined orthogonal transform block size. Is divided and orthogonally transformed, the result of the orthogonal transformation is quantized, and the result of the quantization is losslessly encoded.

【０１３１】本発明の復号装置および方法、並びに第２
のプログラムにおいては、圧縮画像情報が復号されて、
マクロブロックレベルでの符号化処理がフィールドベー
スであるかフレームベースであるかを示すfield/frame_
flag、符号化処理に含まれる直交変換処理でのブロック
サイズを示すMB_type、量子化されている直交変換係
数、並びに予測モード情報および動きベクトル情報が生
成され、量子化されている直交変換係数が逆量子化さ
れ、MB_typeに基づいて逆量子化の処理結果が逆直交変
換される。また、予測モード情報および動きベクトル情
報に従い、復号画像情報を元にして参照画像が生成され
る。Decoding apparatus and method of the present invention, and second
In the program, compressed image information is decoded,
Field / frame_indicating whether the encoding process at the macroblock level is field-based or frame-based
flag, MB_type indicating the block size in the orthogonal transformation process included in the encoding process, quantized orthogonal transformation coefficient, prediction mode information and motion vector information are generated, and the quantized orthogonal transformation coefficient is reversed. After being quantized, the inverse quantization processing result is subjected to inverse orthogonal transformation based on MB_type. Further, a reference image is generated based on the decoded image information according to the prediction mode information and the motion vector information.

【０１３２】符号化装置および復号装置は、互いに独立
した装置であってもよいし、信号処理装置の符号化処理
および復号処理を行うブロックであってもよい。The encoding device and the decoding device may be devices independent of each other, or may be blocks for performing the encoding process and the decoding process of the signal processing device.

【０１３３】[0133]

【発明の実施の形態】以下、本発明を適用した画像情報
符号化装置について、図２２を参照して説明する。BEST MODE FOR CARRYING OUT THE INVENTION An image information coding apparatus to which the present invention is applied will be described below with reference to FIG.

【０１３４】当該画像情報符号化装置において、Ａ／Ｄ
変換部５１は、アナログ信号である入力画像情報をディ
ジタル信号に変換する。画面並べ替えバッファ５２は、
当該画像情報符号化装置が出力する画像圧縮情報のGOP
構造に対応して、入力される画像を並び替える。In the image information coding apparatus, the A / D
The converter 51 converts the input image information, which is an analog signal, into a digital signal. The screen rearrangement buffer 52 is
GOP of image compression information output by the image information encoding device
The input images are rearranged according to the structure.

【０１３５】フィールド／フレーム判定部５３は、マク
ロブロックをフィールドベースで符号化した場合と、フ
レームベースで符号化した場合の符号化効率が高い方を
判定し、対応するFrame/Field Flagを生成して、フィー
ルド／フレーム変換部５６，６４，６７、および可逆符
号化部５９に出力する。The field / frame determination section 53 determines which of the coding efficiency is higher when the macroblock is field-based and when it is frame-based, and generates a corresponding Frame / Field Flag. And outputs it to the field / frame converters 56, 64, 67 and the lossless encoder 59.

【０１３６】加算器５４は、マクロブロックがインター
符号化される場合、画面並べ替えバッファ５２からフィ
ールド／フレーム判定部５３を介して入力される入力画
像と、動き予測・補償部６８からの参照画像との差分画
像を生成してABTブロックサイズ決定部５５に出力す
る。また、加算器５４は、マクロブロックがイントラ符
号化される場合、画面並べ替えバッファ５２からフィー
ルド／フレーム判定部５３を介して入力される入力画像
をそのままABTブロックサイズ決定部５５に出力する。When the macroblock is inter-coded, the adder 54 inputs the input image from the screen rearrangement buffer 52 via the field / frame determination unit 53 and the reference image from the motion prediction / compensation unit 68. A difference image with respect to is generated and output to the ABT block size determination unit 55. Further, when the macroblock is intra-coded, the adder 54 outputs the input image input from the screen rearrangement buffer 52 via the field / frame determination unit 53 to the ABT block size determination unit 55 as it is.

【０１３７】ABTブロックサイズ決定部５５は、４×
４、４×８、８×４、および８×８の中から最も符号化
効率が高いブロックサイズを決定して後段に通知する。
フィールド／フレーム変換部５６は、マクロブロックが
フィールドベース符号化される場合、ABT変換部５７に
供給される前の入力画像を、図２３に示すように、フレ
ーム構造からフィールド構造に変換する。The ABT block size determining unit 55 is 4 ×
The block size with the highest coding efficiency is determined from 4, 4 × 8, 8 × 4, and 8 × 8, and is notified to the subsequent stage.
When the macroblock is field-based coded, the field / frame conversion unit 56 converts the input image before being supplied to the ABT conversion unit 57 from a frame structure to a field structure, as shown in FIG.

【０１３８】ABT変換部５７は、ABTブロックサイズ決定
部５５によって決定されたブロックサイズに従い、入力
画像を直交変換して、得られたABT変換係数を量子化部
５８に出力する。なお、ABT変換部５７においては、マ
クロブロックをフレームベースで符号化する場合に対応
して、図１５に示した７種類のモードと同一のモードが
定義されている。さらに、マクロブロックをフィールド
ベースで符号化する場合に対応して、図２４に示す５種
類のモードが定義されている。すなわち、モード１は、
動き補償・予測のブロックサイズが１６×８であり、離
散コサイン変換のブロックサイズが８×８である。モー
ド２は、動き補償・予測のブロックサイズと離散コサイ
ン変換のブロックサイズがともに８×８である。モード
３は、動き補償・予測のブロックサイズと離散コサイン
変換のブロックサイズがともに８×４である。モード４
は、動き補償・予測のブロックサイズと離散コサイン変
換のブロックサイズがともに４×８である。モード５
は、動き補償・予測のブロックサイズと離散コサイン変
換のブロックサイズがともに４×４である。The ABT converter 57 orthogonally transforms the input image according to the block size determined by the ABT block size determiner 55, and outputs the obtained ABT transform coefficient to the quantizer 58. In the ABT converter 57, the same modes as the seven types of modes shown in FIG. 15 are defined corresponding to the case where the macroblock is encoded on a frame basis. Further, five types of modes shown in FIG. 24 are defined corresponding to the case where the macroblock is encoded on the field basis. That is, in mode 1,
The block size for motion compensation / prediction is 16 × 8, and the block size for discrete cosine transform is 8 × 8. In mode 2, both the block size for motion compensation / prediction and the block size for discrete cosine transform are 8 × 8. In mode 3, both the block size for motion compensation / prediction and the block size for discrete cosine transform are 8 × 4. Mode 4
Has a block size for motion compensation / prediction and a block size for discrete cosine transform of 4 × 8. Mode 5
Has a block size for motion compensation / prediction and a block size for discrete cosine transform of 4 × 4.

【０１３９】量子化部５８は、ABT変換部５７からのABT
変換係数を量子化する。可逆符号化部５９は、量子化部
５８および動き予測・補償部６８から入力される各シン
タクス要素、並びにフィールド／フレーム判定部５３か
ら入力されるFrame/Field Flagを、UVLCまたはCABACに
よって可逆符号化する。蓄積バッファ６０は、符号化さ
れた各シンタクス要素を蓄積し、画像圧縮情報として後
段に出力する。レート制御部６１は、蓄積バッファ６０
の蓄積量に基づき、当該画像符号化装置から出力される
画像圧縮情報のビットレートが目標値となるように、量
子化部５８をフィードバック制御する。The quantizer 58 uses the ABT from the ABT converter 57.
Quantize the transform coefficients. The lossless encoding unit 59 losslessly encodes each syntax element input from the quantization unit 58 and the motion prediction / compensation unit 68, and Frame / Field Flag input from the field / frame determination unit 53 by UVLC or CABAC. To do. The accumulation buffer 60 accumulates each encoded syntax element and outputs it to the subsequent stage as image compression information. The rate control unit 61 includes a storage buffer 60.
The quantization unit 58 is feedback-controlled so that the bit rate of the image compression information output from the image encoding apparatus becomes the target value based on the storage amount of the.

【０１４０】逆量子化部６２は、量子化されているABT
変換係数を逆量子化する。逆ABT変換部６３は、ABT変換
係数を決定されているブロックサイズで逆ABT変換し、
復号画像情報を生成する。フィールド／フレーム変換部
６４は、マクロブロックがフィールドベース符号化され
ている場合、フレームメモリ６６に格納する前の復号画
像情報を、フレーム構造に変換する。すなわち、図２３
に示したフィールド／フレーム変換部５６の動作とは逆
の動作を実行する。ループフィルタ６５は、フレームメ
モリ６６に格納する前の復号画像情報のブロック歪を除
去する。フレームメモリ６６は、参照画像の元となる復
号画像情報を格納する。The inverse quantizer 62 uses the quantized ABT.
Dequantize the transform coefficients. The inverse ABT conversion unit 63 performs inverse ABT conversion on the ABT conversion coefficient with the determined block size,
Decoded image information is generated. When the macroblock is field-based encoded, the field / frame conversion unit 64 converts the decoded image information before being stored in the frame memory 66 into a frame structure. That is, FIG.
The operation opposite to the operation of the field / frame conversion unit 56 shown in FIG. The loop filter 65 removes block distortion of the decoded image information before being stored in the frame memory 66. The frame memory 66 stores the decoded image information that is the source of the reference image.

【０１４１】フィールド／フレーム変換部６７は、動き
予測・補償部６８によるフィールドモードに対応する動
きベクトルの探索に先立ち、フレームメモリ６６内の復
号画像情報をフレーム構造からフィールド構造に変換す
る。動き予測・補償部６８は、動き予測処理により、最
適な予測モード情報および動きベクトル情報を生成する
とともに、参照画像を生成する。The field / frame conversion unit 67 converts the decoded image information in the frame memory 66 from the frame structure to the field structure before the motion prediction / compensation unit 68 searches for the motion vector corresponding to the field mode. The motion prediction / compensation unit 68 generates the optimum prediction mode information and the motion vector information and also generates the reference image by the motion prediction process.

【０１４２】制御部６９は、記録媒体７０に記録されて
いる制御用プログラムに基づき、当該画像情報符号化装
置の各部を制御する。The control section 69 controls each section of the image information coding apparatus based on the control program recorded in the recording medium 70.

【０１４３】ここで、フィールド／フレーム判定部５３
による、マクロブロックの符号化をフィールドベースと
するかフレームベースとするかの判定について説明す
る。Here, the field / frame determination unit 53
The determination of whether the macroblock is encoded based on the field or the frame will be described.

【０１４４】Low Complexity Modeでは、次式（１６）
に示すように、フレーム内で垂直方向に隣接した画素の
差分の自乗和Ｖ_ar1と、次式（１７）に示すように、フ
ィールド内で垂直方向に隣接した画素の差分の自乗和Ｖ
_ar2を比較し、Ｖ_ar1よりもＶ _ar2が小さい場合、フィー
ルドベースで符号化した方の符号化効率が高いと判定
し、Ｖ_ar1よりもＶ_ar2が小さくない場合、フレームベー
スで符号化した方の符号化効率が高いと判定するように
する。Ｖ_ar1＝Σ_uΣ_v｛（X[2v][u]-X[2v+1][u]）²＋（X[2v+1][u]-X[2v+2][u]）²｝・・・（１６）Ｖ_ar2＝Σ_uΣ_v｛（X[2v][u]-X[2v+2][u]）²＋（X[2v+1][u]-X[2v+3][u]）²｝・・・（１７）In Low Complexity Mode, the following equation (16)
As shown in,
Sum of squared difference V_ar1And as shown in the following equation (17),
Sum of squares of differences between vertically adjacent pixels in the field V
_ar2And compare V_ar1Than V _ar2Is small, fee
It is judged that the coding efficiency is higher in the field-based coding.
And then V_ar1Than V_ar2If the frame is not small,
So that the encoding efficiency of the one encoded with
To do. V_ar1= Σ_uΣ_v{(X [2v] [u] -X [2v + 1] [u])²+ (X [2v + 1] [u] -X [2v + 2] [u])²} ... (16) V_ar2= Σ_uΣ_v{(X [2v] [u] -X [2v + 2] [u])²+ (X [2v + 1] [u] -X [2v + 3] [u])²} ... (17)

【０１４５】ただし、式（１６），（１７）において、
演算子Σ_uは、ｕ＝０乃至１５の総和を示し、演算子Σ_v
は、ｖ＝０乃至６の総和を示している。また、X[v][u]
は、マクロブロック輝度信号の第（ｖ，ｕ）要素を示し
ている。However, in equations (16) and (17),
The operator Σ _u represents the sum of u = 0 to 15, and the operator Σ _v
Indicates the sum total of v = 0 to 6. Also, X [v] [u]
Indicates the (v, u) element of the macroblock luminance signal.

【０１４６】High Complexity Modeでは、マクロブロッ
クの符号化をフィールドベースとするかフレームベース
とするかの判定に、フィールド／フレーム判定部５３だ
けでなく、ABTブロックサイズ決定部２８も動作する。
すなわち、図１５に示したものと同等のフレームモード
符号化に対して定義された７種類と、図２５に示したフ
ィールドモード符号化に対して定義された５種類の合計
１２種類のモードに対して、式（１５）によって定義さ
れたコスト関数Ｊを用いて当該マクロブロックの符号化
効率を最良とするモードを選択するようになされてい
る。In the High Complexity Mode, not only the field / frame determination unit 53 but also the ABT block size determination unit 28 operates to determine whether the macroblock coding is field-based or frame-based.
That is, for a total of 12 types of modes, seven types defined for frame mode encoding equivalent to those shown in FIG. 15 and five types defined for field mode encoding shown in FIG. Then, the cost function J defined by the equation (15) is used to select the mode that maximizes the coding efficiency of the macroblock.

【０１４７】次に、ABTブロックサイズ決定部５５によ
る、最も符号化効率が高いブロックサイズを決定する処
理について説明する。Next, the process of determining the block size with the highest coding efficiency by the ABT block size determining unit 55 will be described.

【０１４８】Low Complexity Modeでは、フィールドベ
ース符号化が行われる場合、およびフレームベース符号
化が行われる場合、それぞれにおいて、背景技術で上述
した絶対値和SATDを最小とするモードが選択されるよう
になされている。In the Low Complexity Mode, when the field-based coding is performed and when the frame-based coding is performed, the mode that minimizes the sum of absolute values SATD described in the background art is selected. Has been done.

【０１４９】なお、Low Complexity Modeにおいても、
フィールド／フレーム判定装置２６と、ＡＢＴブロック
サイズ２８が合わせて動作するようにしてもよい。すな
わち、図１５に示したものと同等のフレームモード符号
化に対して定義された７種類と、図２５に示したフィー
ルドモード符号化に対して定義された５種類の合計１２
種類のモードに対して、それぞれ絶対値和SATDを算出
し、それらの最小値に対応するモードを選択するように
してもよい。Even in Low Complexity Mode,
The field / frame determination device 26 and the ABT block size 28 may operate together. That is, a total of 12 types of 7 types defined for the frame mode coding equivalent to that shown in FIG. 15 and 5 types defined for the field mode coding shown in FIG.
The sum of absolute values SATD may be calculated for each type of mode, and the mode corresponding to the minimum value may be selected.

【０１５０】次に、可逆符号化部５９によるフィールド
／フレーム判定部５３によって生成されたfield/frame_
flagを可逆符号化する処理について説明する。なお、fi
eld/frame_flagの値が１である場合（マクロブロックが
フィールドベース符号化される場合）、Ｐピクチャ／Ｂ
ピクチャに対するMB_typeも符号化される。Next, field / frame_generated by the field / frame determination unit 53 by the lossless encoding unit 59.
The process of losslessly encoding the flag will be described. Note that fi
When the value of eld / frame_flag is 1 (when the macroblock is field-based coded), P picture / B
The MB_type for the picture is also encoded.

【０１５１】可逆符号化部５９においてCABACが用いら
れる場合、field/frame_flag、およびＰピクチャ／Ｂピ
クチャに対するMB_typeに対して以下のようなコンテキ
ストモデルが定義されている。When CABAC is used in the lossless encoding unit 59, the following context model is defined for MB / type for field / frame_flag and P picture / B picture.

【０１５２】図２５に示すように配置されているマクロ
ブロックＡ，Ｂ，Ｃのうち、マクロブロックＣに対する
frame／field flagに関するコンテキストモデルについ
て説明する。マクロブロックＣのframe／field flagに
関するコンテキストモデルctx_fifr_flag(C)は、次式
（２１）によって定義される。 ctx_fifr_flag(C)＝ａ＋２ｂ・・・（１８）ただし、式（１８）において、ａ，ｂは、それぞれマク
ロブロックＡ，Ｂのframe／field flagの値である。Of the macro blocks A, B and C arranged as shown in FIG. 25, the macro block C is
The context model regarding the frame / field flag will be described. The context model ctx_fifr_flag (C) regarding the frame / field flag of the macroblock C is defined by the following expression (21). ctx_fifr_flag (C) = a + 2b (18) However, in Expression (18), a and b are the values of the frame / field flags of the macroblocks A and B, respectively.

【０１５３】次に、Ｉピクチャに対するMB_typeに関す
るコンテキストモデルについて説明する。frame／field
flagが１である場合、図２６に示すように、Ｉピクチ
ャに含まれるマクロブロックＣの５種類のMB_typeに対
し、それぞれCode_Numberが割り当てられている。これ
らのCode_Numberに対応するコンテキストモデルctx_mb_
type_intra_field（C）は次式（１９）によって定義さ
れる。 ctx_mb_type_intra_field（C）＝Ａ＋Ｂ・・・（１９）Next, the context model regarding the MB_type for the I picture will be described. frame / field
When the flag is 1, as shown in FIG. 26, Code_Number is assigned to each of the five MB_types of the macroblock C included in the I picture. Context model ctx_mb_ corresponding to these Code_Number
type_intra_field (C) is defined by the following equation (19). ctx_mb_type_intra_field (C) = A + B (19)

【０１５４】ただし、式（１９）において、Ａは、マク
ロブロックＡがIntra４×４である場合には０であり、I
ntra１６×１６である場合には１である。同様に、Ｂ
は、マクロブロックＢがIntra４×４である場合には０
であり、Intra１６×１６である場合には１である。し
たがって、コンテキストモデルctx_mb_type_intra_fiel
d(C)は、０，１，２のいずれかの値をとる。なお、隣接
するマクロブロックＡ，Ｂは、フィールドベース符号化
されていても、フレームベース符号化されていてもかま
わない。However, in the equation (19), A is 0 when the macroblock A is Intra4 × 4, and I
It is 1 when the size is ntra16 × 16. Similarly, B
Is 0 when the macroblock B is Intra4 × 4
And 1 if Intra 16 × 16. Therefore, the context model ctx_mb_type_intra_fiel
d (C) has a value of 0, 1, or 2. The adjacent macroblocks A and B may be field-based coded or frame-based coded.

【０１５５】次に、Ｐ／Ｂピクチャに対するMB_typeに
関するコンテキストモデルについて説明する。マクロブ
ロックＣがＰピクチャに含まれる場合、マクロブロック
ＣのMB_typeに対応するコンテキストモデルctx_mb_type
_inter_field(C)は、次式（２３）によって定義され
る。また、Ｂピクチャに含まれる場合、次式（２４）に
よって定義される。 ctx_mb_type_inter_field(C)＝((A==skip)?0:1)＋２((B==skip)?0:1) ・・・（２０） ctx_mb_type_inter_field(C)＝((A==Direct)?0:1)＋２((B==Direct)?0:1) ・・・（２１）Next, the context model relating to MB_type for P / B pictures will be described. When the macroblock C is included in the P picture, the context model ctx_mb_type corresponding to the MB_type of the macroblock C
_inter_field (C) is defined by the following expression (23). When included in the B picture, it is defined by the following equation (24). ctx_mb_type_inter_field (C) ＝ ((A == skip)? 0: 1) ＋2 ((B == skip)? 0: 1) ・・・ (20) ctx_mb_type_inter_field (C) ＝ ((A == Direct)? 0 : 1) +2 ((B == Direct)? 0: 1) ・・・ (21)

【０１５６】ただし、式（２０）において、演算子((A=
=Skip)?0:1)は、マクロブロックＡがSkipモードである
場合には０を示し、マクロブロックＡがSkipモードでは
ない場合には１を示すものとする。同様に、演算子((B=
=Skip)?0:1)は、マクロブロックＢがSkipモードである
場合には０を示し、マクロブロックＢがSkipモードでは
ない場合には１を示すものとする。However, in the equation (20), the operator ((A =
= Skip)? 0: 1) indicates 0 when the macroblock A is in the Skip mode and 1 when the macroblock A is not in the Skip mode. Similarly, the operator ((B =
= Skip)? 0: 1) indicates 0 when the macroblock B is in the Skip mode and 1 when the macroblock B is not in the Skip mode.

【０１５７】また、式（２１）において、演算子((A==D
irect)?0:1)は、マクロブロックＡがDirectモードであ
る場合には０を示し、マクロブロックＡがDirectモード
ではない場合には１を示すものとする。演算子((B==Dir
ect)?0:1)は、マクロブロックＢがDirectモードである
場合には０を示し、マクロブロックＢがDirectモードで
はない場合には１を示すものとする。In addition, in the equation (21), the operator ((A == D
irect)? 0: 1) indicates 0 when the macroblock A is in the Direct mode, and 1 when the macroblock A is not in the Direct mode. Operator ((B == Dir
ect)? 0: 1) indicates 0 when the macroblock B is in the Direct mode, and 1 when the macroblock B is not in the Direct mode.

【０１５８】したがって、インターフレームにおけるマ
クロブロックＣのMB_typeに対応するコンテキストモデ
ルctx_mb_type_inter_field(C)は、Ｐピクチャである場
合とＢピクチャである場合に対して、それぞれ３種類の
値を取ることになる。Therefore, the context model ctx_mb_type_inter_field (C) corresponding to the MB_type of the macroblock C in the inter frame has three kinds of values for the P picture and the B picture, respectively.

【０１５９】なお、隣接するマクロブロックＡ，Ｂは、
フィールドベース符号化されていても、フレームベース
符号化されていてもかまわない。The adjacent macroblocks A and B are
It may be field-based coded or frame-based coded.

【０１６０】２値化されていないＰピクチャにおけるMB
_typeは、図２７Ａに示すテーブルによって２値化され
る。また、２値化されていないＢピクチャにおけるMB_t
ypeは、図２７Ｂに示すテーブルによって２値化され
る。MB in non-binarized P picture
_type is binarized by the table shown in FIG. 27A. In addition, MB_t in a B-picture that has not been binarized
ype is binarized by the table shown in FIG. 27B.

【０１６１】フレームベース符号化されるマクロブロッ
クに対しては、Ｐピクチャに属する場合、１０種類のMB
_typeが定義されている。一方、フィールドベース符号
化されるマクロブロックに対しては、Ｐピクチャに属す
る場合、前記１６種類のうち、１６×１６モード、およ
び８×１６モードが定義されていない。すなわち、フィ
ールドベース符号化されるマクロブロックに対しては、
Ｐピクチャに関して８種類のMB_typeが定義されてい
る。For a macroblock to be frame-based coded, if it belongs to a P picture, 10 kinds of MB
_type is defined. On the other hand, for a macroblock to be field-based coded, when belonging to a P picture, 16 × 16 mode and 8 × 16 mode are not defined among the 16 types. That is, for field-based encoded macroblocks,
Eight types of MB_type are defined for P pictures.

【０１６２】フレームベース符号化されるマクロブロッ
クに対しては、Ｂピクチャに関して１８種類のMB_type
が定義されている。一方、フィールドベース符号化され
るマクロブロックに対しては、Ｂピクチャに属する場
合、前記１８種類のうち、前方向１６×１６モード、後
方向１６×１６モード、前方向８×１６モード、および
後方向８×１６モードが定義されていない。すなわち、
フィールドベース符号化されるマクロブロックに対して
は、Ｂピクチャに関して１４種類のMB_typeが定義され
ている。For macroblocks to be frame-based coded, there are 18 MB_types for B pictures.
Is defined. On the other hand, for a macroblock to be field-based coded, when belonging to a B picture, among the 18 types, a forward 16 × 16 mode, a backward 16 × 16 mode, a forward 8 × 16 mode, and a backward 16 × 16 mode Direction 8x16 mode is not defined. That is,
For field-based encoded macroblocks, 14 types of MB_type are defined for B pictures.

【０１６３】以上説明したように、本発明をした画像情
報符号化装置によれば、飛び越し走査フォーマットの画
像情報を入力とし、マクロブロックレベルでフィールド
／フレーム適応型符号化処理を行い、直交変換にはABT
を用いるようにしたので、従来よりも高い符号化効率を
実現できることが可能となる。As described above, according to the image information coding apparatus of the present invention, the image information in the interlaced scanning format is input, the field / frame adaptive coding processing is performed at the macroblock level, and the orthogonal transformation is performed. Is ABT
Is used, it is possible to achieve higher coding efficiency than in the past.

【０１６４】次に、図２８は、図２６の画像情報符号化
装置に対応する画像情報復号装置の構成例を示してい
る。Next, FIG. 28 shows a configuration example of an image information decoding device corresponding to the image information encoding device of FIG.

【０１６５】当該画像情報復号装置において、蓄積バッ
ファ８１は、入力される画像圧縮情報を蓄積して、適
宜、可逆復号化部８２に出力する。可逆復号化部８２
は、UVLCまたはCABACを用いて符号化されている画像圧
縮情報を復号し、得られる量子化されているABT変換係
数を逆量子化部８３に出力し、field/frame_flagをフィ
ールド／フレーム変換部８５に出力し、Ｐピクチャ／Ｂ
ピクチャに対するMB_typeを逆ABT変換部８４に出力し、
予測モード情報および動きベクトル情報を動き予測・補
償部９２に出力する。In the image information decoding apparatus, the accumulation buffer 81 accumulates the input image compression information and outputs it to the lossless decoding unit 82 as appropriate. Lossless decoding unit 82
Decodes the image compression information encoded using UVLC or CABAC, outputs the obtained quantized ABT transform coefficient to the dequantization unit 83, and sets field / frame_flag to the field / frame conversion unit 85. Output to P picture / B
The MB_type for the picture is output to the inverse ABT converter 84,
The prediction mode information and the motion vector information are output to the motion prediction / compensation unit 92.

【０１６６】逆量子化部８３は、量子化されているABT
変換係数を逆量子化する。逆ABT変換部８４は、ABT変換
係数を、４×４、４×８、８×４、および８×８のう
ち、MB_typeによって定められているブロックサイズで
逆直交変換する。The inverse quantizing unit 83 uses the quantized ABT.
Dequantize the transform coefficients. The inverse ABT transform unit 84 performs inverse orthogonal transform on the ABT transform coefficient with a block size defined by MB_type among 4 × 4, 4 × 8, 8 × 4, and 8 × 8.

【０１６７】フィールド／フレーム変換部８５は、マク
ロブロックがフィールドベース符号化されている場合、
加算器８６に入力される前の逆ABT変換処理が施された
直交変換係数をフィールド構造からフレーム構造に変換
する。加算器８６は、マクロブロックがインター符号化
されたものである場合、逆ABT変換が施された差分画像
と、動き予測・補償部９２からの参照画像とを合成して
出力画像を生成し、後段に出力する。また、加算器８６
は、マクロブロックがインター符号化されたものである
場合、逆ABT変換が施された画像を、そのまま出力画像
として後段に出力する。The field / frame conversion unit 85, when the macroblock is field-based coded,
The orthogonal transform coefficient that has been subjected to the inverse ABT transform process before being input to the adder 86 is transformed from the field structure to the frame structure. When the macroblock is inter-coded, the adder 86 combines the difference image subjected to the inverse ABT conversion and the reference image from the motion prediction / compensation unit 92 to generate an output image, Output to the latter stage. Also, the adder 86
When the macroblock is inter-coded, outputs the image subjected to the inverse ABT conversion to the subsequent stage as an output image as it is.

【０１６８】画面並べ替えバッファ８７は、入力された
画像圧縮情報のGOP構造に応じ、加算器８６からの出力
画像を並べ替えてＤ／Ａ変換部８８に出力する。Ｄ／Ａ
変換部８８は、ディジタル信号である出力画像をアナロ
グ信号に変換する。The screen rearrangement buffer 87 rearranges the images output from the adder 86 according to the GOP structure of the input image compression information and outputs the rearranged images to the D / A converter 88. D / A
The conversion unit 88 converts the output image, which is a digital signal, into an analog signal.

【０１６９】ループフィルタ８９は、加算器８６からの
出力画像のブロック歪を除去する。フレームメモリ９０
は、ブロック歪が除去された、参照画像の元となる画像
情報を格納する。フィールド／フレーム変換部９１は、
マクロブロックがフィールドベース符号化されている場
合、動き予測・補償部９２に供給する前に、フレームメ
モリ９０に格納されている画像情報をフィールド構造に
変換する。動き予測・補償部９２は、可逆復号化部８２
からの予測モード情報および動きベクトル情報に基づ
き、フレームメモリ９０格納されている画像情報、また
はフィールド／フレーム変換部９１によってフィールド
構造に変換された画像情報を元にして参照画像を生成し
て加算器８６に出力する。The loop filter 89 removes block distortion of the output image from the adder 86. Frame memory 90
Stores the image information that is the source of the reference image from which the block distortion has been removed. The field / frame converter 91
When the macro block is field-based coded, the image information stored in the frame memory 90 is converted into a field structure before being supplied to the motion prediction / compensation unit 92. The motion prediction / compensation unit 92 includes a lossless decoding unit 82.
Based on the prediction mode information and the motion vector information from the image information stored in the frame memory 90 or the image information converted into the field structure by the field / frame conversion unit 91 to generate a reference image and adder Output to 86.

【０１７０】以上説明したように構成される画像情報復
号装置によれば、図２６の画像情報符号化装置が出力す
る画像圧縮情報を復号し、元の画像情報を得ることがで
きる。According to the image information decoding device configured as described above, the original image information can be obtained by decoding the image compression information output by the image information encoding device in FIG.

【０１７１】上述した一連の処理は、ハードウェアによ
り実行させることもできるが、ソフトウェアにより実行
させることもできる。一連の処理をソフトウェアにより
実行させる場合には、そのソフトウェアを構成するプロ
グラムが、専用のハードウェアに組み込まれているコン
ピュータ、または、各種のプログラムをインストールす
ることで、各種の機能を実行することが可能な、例えば
汎用のパーソナルコンピュータなどに、例えば図２２の
記録媒体７０からインストールされる。The series of processes described above can be executed by hardware, but can also be executed by software. When a series of processes is executed by software, a program that constitutes the software can execute various functions by installing a computer in which dedicated hardware is installed or various programs. It is installed in a possible general-purpose personal computer or the like from the recording medium 70 of FIG. 22, for example.

【０１７２】この記録媒体７０は、コンピュータとは別
に、ユーザにプログラムを提供するために配布される、
プログラムが記録されている磁気ディスク（フレキシブ
ルディスクを含む）、光ディスク（CD-ROM(Compact Dis
c-Read Only Memory)、DVD(Digital Versatile Disc)を
含む）、光磁気ディスク（ＭＤ(Mini Disc)を含む）、
もしくは半導体メモリなどよりなるパッケージメディア
により構成されるだけでなく、コンピュータに予め組み
込まれた状態でユーザに提供される、プログラムが記録
されているROMやハードディスクなどで構成される。This recording medium 70 is distributed separately from the computer to provide the program to the user.
Magnetic disks (including flexible disks) and optical disks (CD-ROM (Compact
c-Read Only Memory), DVD (including Digital Versatile Disc)), magneto-optical disc (including MD (Mini Disc)),
Alternatively, it is not only configured by a package medium such as a semiconductor memory, but also configured by a ROM or a hard disk in which a program is recorded and which is provided to a user in a state where the program is installed in advance in a computer.

【０１７３】なお、本明細書において、記録媒体に記録
されるプログラムを記述するステップは、記載された順
序に従って時系列的に行われる処理はもちろん、必ずし
も時系列的に処理されなくとも、並列的あるいは個別に
実行される処理をも含むものである。In the present specification, the steps for writing the program recorded in the recording medium are not limited to the processing performed in time series according to the order described, but are not necessarily performed in time series, but may be performed in parallel. Alternatively, it also includes processes that are individually executed.

【０１７４】[0174]

【発明の効果】以上のように、第１の本発明によれば、
飛び越し走査フォーマットの画像情報を入力とし、マク
ロブロックレベルでフィールド／フレーム適応型符号化
処理を行い、直交変換にはABTを用いる画像符号化装置
を実現することが可能となる。As described above, according to the first aspect of the present invention,
It is possible to realize an image coding apparatus that receives image information in an interlaced scanning format, performs field / frame adaptive coding processing at the macroblock level, and uses ABT for orthogonal transformation.

【０１７５】また、第２の本発明によれば、飛び越し走
査の画像情報がマクロブロックレベルで適応的にフィー
ルドベースまたはフレームベースで符号化されている圧
縮画像情報を元に前記画像情報を復元する画像復号装置
を実現することが可能となる。Further, according to the second aspect of the present invention, the image information of the interlaced scanning is restored based on the compressed image information which is adaptively coded at the macroblock level on the field basis or the frame basis. It becomes possible to realize an image decoding device.

[Brief description of drawings]

【図１】直交変換と動き補償を用いた従来の画像情報符
号化装置の構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of the configuration of a conventional image information encoding device using orthogonal transformation and motion compensation.

【図２】図１の画像情報復号装置に対応する画像情報復
号装置の構成例を示すブロック図である。FIG. 2 is a block diagram showing a configuration example of an image information decoding device corresponding to the image information decoding device in FIG.

【図３】Ｈ．２６Ｌにおいて定義されている、マクロブ
ロックに対する動き予測・補償の単位を示す７種類のモ
ードを説明するための図である。FIG. It is a figure for demonstrating seven types of modes which show the unit of the motion prediction / compensation with respect to a macroblock defined in 26L.

【図４】マクロブロックレベルのフィールド／フレーム
適応符号化を可能とするように、Ｈ．２６Ｌを拡張した
場合における画像圧縮情報のシンタクスを示した図であ
る。FIG. 4 is a block diagram of H.264 to enable field / frame adaptive coding at the macroblock level. It is a figure showing the syntax of image compression information in the case of expanding 26L.

【図５】マクロブロックをフィールドベース符号化する
場合において、マクロブロックの画素を行単位で並べ替
える処理を説明するための図である。[Fig. 5] Fig. 5 is a diagram for explaining a process of rearranging pixels of a macroblock in units of rows when performing field-based coding on the macroblock.

【図６】マクロブロックをフィールドベース符号化する
場合の動き予測・補償の単位として定義されている７種
類のモードを説明するための図である。[Fig. 6] Fig. 6 is a diagram for describing seven types of modes defined as units of motion prediction / compensation when field-encoding a macroblock.

【図７】文献１に開示されているイントラ予測モードを
説明するための図である。FIG. 7 is a diagram for explaining an intra prediction mode disclosed in Document 1.

【図８】マクロブロックをフィールドベース符号化する
場合において、マクロブロック内でイントラ予測を行う
動作を説明するための図である。[Fig. 8] Fig. 8 is a diagram for describing an operation of performing intra prediction in a macroblock when the macroblock is field-based encoded.

【図９】マクロブロックをフィールドベース符号化する
場合において、複数のマクロブロックに跨ってイントラ
予測を行う動作を説明するための図である。[Fig. 9] Fig. 9 is a diagram for explaining an operation of performing intra prediction across a plurality of macroblocks when the macroblock is field-based encoded.

【図１０】マクロブロックをフィールドベース符号化す
る場合において、色差信号に対するイントラ予測を行う
動作を説明するための図である。[Fig. 10] Fig. 10 is a diagram for explaining an operation of performing intra prediction on a color difference signal when field-based coding a macroblock.

【図１１】マクロブロックをフィールドベース符号化す
る場合において、色差信号の残差成分を符号化する動作
を説明するための図である。[Fig. 11] Fig. 11 is a diagram for describing an operation of encoding a residual component of a color difference signal when field-based encoding a macroblock.

【図１２】Ｈ．２６Ｌにおいて規定されている、マルチ
プルフレーム予測の概念を説明するための図である。FIG. 12: H. It is a figure for demonstrating the concept of multiple frame prediction prescribed | regulated in 26L.

【図１３】マクロブロックをフィールドベース符号化す
る場合における、動きベクトル情報の予測方式を説明す
るための図である。[Fig. 13] Fig. 13 is a diagram for describing a prediction method of motion vector information when field-encoding a macroblock.

【図１４】Ｈ．２６Ｌで定められている、ブロックの予
測モードが８×１６，１６×８，８×４、または４×８
であった場合における、動きベクトル情報の予測値を生
成する方法を説明するための図である。FIG. 14: H. The prediction mode of the block defined by 26L is 8 × 16, 16 × 8, 8 × 4, or 4 × 8.
FIG. 6 is a diagram for explaining a method of generating a predicted value of motion vector information in the case of being.

【図１５】文献２で定められている、インターマクロブ
ロックに対する７種類のモードを説明するための図であ
る。FIG. 15 is a diagram for explaining seven types of modes for inter macroblocks defined in Reference 2.

【図１６】式（１４）におけるＮ² _n×mの具体的な値を
示す図である。FIG. 16 is a diagram showing specific values of N ² _{n × m} in expression (14).

【図１７】文献２で定められている、２次元配列の直交
変換係数を１次元データに並べ替えるスキャン方式を説
明するための図である。FIG. 17 is a diagram for explaining a scanning method that rearranges orthogonal transformation coefficients of a two-dimensional array into one-dimensional data, which is defined in Reference 2.

【図１８】文献２で規定されているイントラABTブロッ
クモードを示す図である。FIG. 18 is a diagram showing an intra ABT block mode defined in Reference 2.

【図１９】文献２で規定されている、４×８、８×４、
および８×８ブロックに対するRUNを２値化するための
テーブルを示す図である。FIG. 19 is 4 × 8, 8 × 4, which is defined in Reference 2.
It is a figure which shows the table for binarizing RUN with respect to and 8x8 block.

【図２０】文献２で規定されている、ABTブロックの各
ブロックサイズにおけるRUNを、CABACを用いて符号化す
るためのコンテキストモデルを示す図である。FIG. 20 is a diagram showing a context model for encoding RUN in each block size of an ABT block, which is defined in Document 2, using CABAC.

【図２１】文献２に開示されている、ABTブロックの各
ブロックサイズに対するアダマール変換後の正規化ファ
クタを示した図である。FIG. 21 is a diagram showing a normalization factor after Hadamard transform for each block size of an ABT block, which is disclosed in Document 2;

【図２２】本発明の一実施の形態である画像情報符号化
装置の構成例を示すブロック図である。[Fig. 22] Fig. 22 is a block diagram illustrating a configuration example of an image information encoding device which is an embodiment of the present invention.

【図２３】マクロブロックの画素を、フレーム構造から
フィールド構造に変換する処理を説明するための図であ
る。FIG. 23 is a diagram for explaining a process of converting pixels of a macro block from a frame structure to a field structure.

【図２４】図２２のABT変換部５７において、マクロブ
ロックがフィールベース符号化されるときに用いられる
５種類のモードを説明するための図である。FIG. 24 is a diagram for explaining five types of modes used when a macroblock is field-based encoded in the ABT conversion unit 57 of FIG. 22.

【図２５】図２２の可逆符号化部５９がCABACによる符
号化を行う場合、マクロブロックのFrame/Field Flagに
対して定義されているコンテキストモデルを説明するた
めの図である。FIG. 25 is a diagram for explaining a context model defined for Frame / Field Flag of a macroblock when the lossless encoding unit 59 of FIG. 22 performs CABAC encoding.

【図２６】マクロブロックがフィールドベース符号化さ
れるときに用いられるイントラABTブロックモードを示
す図である。FIG. 26 is a diagram showing an intra ABT block mode used when a macroblock is field-based encoded.

【図２７】マクロブロックがフィールドベース符号化さ
れる場合における、ＰピクチャおよびＢピクチャに属す
るマクロブロックのMB_typeを２値化するためのテーブ
ルを示す図である。[Fig. 27] Fig. 27 is a diagram illustrating a table for binarizing MB_types of macroblocks belonging to a P picture and a B picture when the macroblock is field-based encoded.

【図２８】図２２の画像情報符号化装置に対応する、本
発明の一実施の形態である画像情報復号装置の構成例を
示すブロック図である。28 is a block diagram illustrating a configuration example of an image information decoding device which is an embodiment of the present invention and corresponds to the image information encoding device in FIG. 22.

[Explanation of symbols]

５３フィールド／フレーム判定部，５５ ABTブロ
ックサイズ決定部，５６フィールド／フレーム変換
部，５７ ABT変換部，５９可逆符号化部，６
３逆ABT変換部，６４フィールド／フレーム変換
部，６５ループフィルタ，６７フィールド／フ
レーム変換部，６９制御部，７０記録媒体，８
２可逆復号部，８４逆ABT変換部，８５フィー
ルド／フレーム変換部，９１フィールド／フレーム
変換部53 field / frame determination unit, 55 ABT block size determination unit, 56 field / frame conversion unit, 57 ABT conversion unit, 59 lossless encoding unit, 6
3 inverse ABT converter, 64 field / frame converter, 65 loop filter, 67 field / frame converter, 69 controller, 70 recording medium, 8
2 lossless decoding unit, 84 inverse ABT conversion unit, 85 field / frame conversion unit, 91 field / frame conversion unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者ピータークーン東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者矢ヶ崎陽一東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5C059 LC03 MA00 MA02 MA03 MA04 MA05 MA14 MA21 MA22 MA23 MC11 MC32 MC34 MC38 ME01 ME11 NN01 NN21 NN28 PP05 PP06 PP07 RC12 RC40 SS07 SS20 TA12 TA13 TA18 TA23 TA24 TA46 TB07 TC02 TC04 TC08 TD02 TD06 UA02 UA32 UA33 UA39 5J064 AA02 BA04 BA09 BA16 BB03 BB12 BC01 BC06 BC08 BC11 BC14 BC16 BC26 BD02 BD03 ─────────────────────────────────────────────────── ─── Continued front page (72) Inventor Peter Kuhn 6-735 Kita-Shinagawa, Shinagawa-ku, Tokyo Soni -Inside the corporation (72) Inventor Yoichi Yagasaki 6-735 Kita-Shinagawa, Shinagawa-ku, Tokyo Soni -Inside the corporation F-term (reference) 5C059 LC03 MA00 MA02 MA03 MA04 MA05 MA14 MA21 MA22 MA23 MC11 MC32 MC34 MC38 ME01 ME11 NN01 NN21 NN28 PP05 PP06 PP07 RC12 RC40 SS07 SS20 TA12 TA13 TA18 TA23 TA24 TA46 TB07 TC02 TC04 TC08 TD02 TD06 UA02 UA32 UA33 UA39 5J064 AA02 BA04 BA09 BA16 BB03 BB12 BC01 BC06 BC08 BC11 BC14 BC16 BC26 BD02 BD03

Claims

[Claims]

1. Inputting image information of interlaced scanning,
In an encoding device that adaptively performs field-based or frame-based encoding processing at the macroblock level, an orthogonal transformation that divides a macroblock into any of a plurality of preset orthogonal transformation block sizes and performs orthogonal transformation Means, a block size determining means for determining the block size for orthogonal transformation so that the coding efficiency is highest, a quantizing means for quantizing the processing result of the orthogonal transforming means, and a processing for the quantizing means An encoding device, comprising: a lossless encoding unit that losslessly encodes a result.

2. The block size for orthogonal transformation is 4 ×
4. The encoding device according to claim 1, wherein there are four types, 4, 4 × 8, 8 × 4, and 8 × 8.

3. The block size determining means, when the macroblock is an inter macroblock and is subjected to the field-based coding processing, the block size determining means for the orthogonal transform corresponding to a block size for motion prediction / compensation. The encoding device according to claim 1, wherein the block size is determined.

4. The block size determination means, when the macroblock is an inter macroblock and the field-based coding process is performed, and when the motion prediction / compensation block size is 16 × 8. The coding apparatus according to claim 3, wherein the block size for orthogonal transformation is determined to be 8x8.

5. The block size determining means, when the macroblock is an inter macroblock and is subjected to the field-based coding process, and when the motion prediction / compensation block size is 8 × 8. The coding apparatus according to claim 3, wherein the block size for orthogonal transformation is also determined to be 8x8.

6. The block size determination means, when the macro block is an inter macro block and the field-based coding process is performed, and when the motion prediction / compensation block size is 8 × 4. The coding apparatus according to claim 3, wherein the block size for orthogonal transformation is also determined to be 8x4.

7. The block size determining means, when the macroblock is an inter macroblock and the field-based coding process is performed, and when the motion prediction / compensation block size is 4 × 8. The coding apparatus according to claim 3, wherein the orthogonal transform block size is also determined to be 4x8.

8. The block size determining means, when the macroblock is an inter macroblock and is subjected to the field-based coding process, and the motion prediction / compensation block size is 4 × 4. The coding apparatus according to claim 3, wherein the block size for orthogonal transformation is also determined to be 4x4.

9. The field / frame determining unit according to claim 1, further comprising a field / frame determining unit that determines whether the encoding process at the macroblock level is the field base or the frame base. Encoding device.

10. The encoding apparatus according to claim 9, wherein the determination by the field / frame determination unit and the determination by the block size determination unit are performed at the same time.

11. The determination by the field / frame determining means and the determination by the block size determining means minimize the sum of absolute values SATD of coefficients obtained by orthogonally transforming the difference values generated by the motion prediction processing. like,
The encoding device according to claim 10, wherein the encoding is performed at the same time.

12. The encoding apparatus according to claim 10, wherein the determination by the field / frame determining unit and the determination by the block size determining unit are performed simultaneously based on the value of the cost function J. .

13. The field / frame determination means compares the square sum V _ar1 of the difference between pixels vertically adjacent in the _frame with the square sum V _ar2 of the difference between pixels vertically adjacent in the field. 10. The encoding apparatus according to claim 9, wherein it is determined whether the encoding process at the macroblock level is based on the field or the frame based on a result.

14. The block size determination means determines the block size for orthogonal transformation so that a sum SATD of absolute values of coefficients obtained by orthogonally transforming a difference value generated by a motion prediction process is minimized. The encoding device according to claim 1, wherein

15. The encoding apparatus according to claim 1, wherein the block size determining means determines the orthogonal transform block size based on a value of a cost function J.

16. The lossless encoding means is a field / fram indicating a determination result by the field / frame determining means.
The encoding device according to claim 1, wherein e_flag and MB_type indicating a determination result by the block size determining unit are also losslessly encoded.

17. The encoding device according to claim 1, wherein the lossless encoding means uses UVLC.

18. The encoding apparatus according to claim 1, wherein the lossless encoding means uses CABAC.

19. The reversible encoding means includes field / fr indicating a determination result by the field / frame determining means.
The encoding device according to claim 18, wherein a context model is defined for each of ame_flag and MB_type indicating a determination result by the block size determining unit.

20. Frame / field fla of macroblock C
The context model ctx_fifr_flag (C) regarding g is defined by the following equation using the values a and b of the frame / field flags of the macroblocks A and B adjacent to the macroblock C. ctx_fifr_flag (C) = The encoding device according to claim 19, wherein a + 2b.

21. A context model ctx_mb_type corresponding to MB_type of a macroblock C included in an I picture.
_intra_field (C) is 0 when each of the macroblocks A and B adjacent to the macroblock C is Intra4 × 4, and is 1 when it is Intra16 × 16.
20. The encoding apparatus according to claim 19, wherein ctx_mb_type_intra_field (C) = A + B defined by the following equation using a value of A, B.

22. A first context model ctx_m corresponding to MB_type of a macroblock C included in a P picture.
b_type_inter_field (C) is an operator (A == Skip) which indicates 0 or 1 depending on whether or not each of the macro blocks A and B adjacent to the macro block C is in the Skip mode?
0: 1), ((B == Skip)? 0: 1) is used to define ctx_mb_type_inter_field (C) ＝ ((A == skip)? 0: 1) ＋2 ((B
== skip)? 0: 1), The encoding device according to claim 19.

23. When the macroblock C included in a P picture is encoded on the field basis, Code_Numbers 0 to 7 corresponding to the MB_type of the macroblock C.
Are 0, 100, 101, 11000 and 11 respectively.
2 in 001, 11010, 11011 or 11100
The encoding device according to claim 22, wherein the encoding device is digitized.

24. A first context model ctx_m corresponding to MB_type of a macroblock C included in a B picture.
b_type_inter_field (C) is an operator (A == Dire) indicating 0 or 1 corresponding to whether or not each of the macro blocks A and B adjacent to the macro block C is in the Direct mode.
ctx_mb_type_inter_field (C) ＝ ((A == Direct)? 0: 1) ＋2 which is defined by the following formula using (ct)? 0: 1) and ((B == Direct)? 0: 1)
The encoding device according to claim 19, wherein ((B == Direct)? 0: 1).

25. When the macroblock C included in a B picture is encoded on the field basis, Code_Numbers 0 to 1 corresponding to MB_type of the macroblock C.
3 is 0, 100, 101, 11000, 1 respectively
1001, 11010, 11011, 11100, 11
1000, 1110001, 1110010, 1110
011, 1110100, 1110101 or 111
The encoding device according to claim 24, wherein the encoding device is binarized to 0110.

26. An encoding method of an encoding device which adaptively performs field-based or frame-based encoding processing at a macroblock level by inputting interlaced scanning image information, wherein a plurality of preset orthogonal transforms are set. Transformation step of dividing a macroblock into any of the block sizes for orthogonal transformation, a block size determination step of determining the orthogonal transformation block size so that the coding efficiency is maximized, and the orthogonal transformation step And a lossless coding step for losslessly coding the processing result of the quantization step.

27. A program for adaptively performing field-based or frame-based encoding processing at a macroblock level by inputting interlaced-scan image information, the method comprising a plurality of preset orthogonal transformation block sizes. An orthogonal transformation step of dividing a macroblock into either one and orthogonal transformation, a block size determination step of determining the orthogonal transformation block size so that the coding efficiency is highest, and a processing result of the orthogonal transformation step. A recording medium having a computer-readable program recorded thereon, comprising: a quantization step of quantizing; and a lossless encoding step of losslessly encoding a processing result of the quantization step.

28. One of a plurality of orthogonal transform block sizes preset in a computer which adaptively performs field-based or frame-based coding processing at a macroblock level using image information of interlaced scanning as input. An orthogonal transformation step of dividing a macroblock into orthogonal transforms, a block size determination step of determining the orthogonal transform block size so that the coding efficiency is maximized, and a quantization result of the orthogonal transformation step. And a reversible encoding step for reversibly encoding the processing result of the quantization step.

29. A decoding device for decompressing the image information based on the compressed image information in which the interlaced scanning image information is adaptively encoded in the field base or frame base at the macroblock level, Decoding, field / frame_flag indicating whether the encoding process at the macroblock level is the field base or the frame base, MB_type indicating the block size in the orthogonal transform process included in the encoding process, Decoding means for generating quantized orthogonal transform coefficient, and prediction mode information and motion vector information, dequantizing means for dequantizing the quantized orthogonal transform coefficient, based on the MB_type, the Inverse orthogonal transform means for inverse orthogonal transforming the processing result of the inverse quantizer means, the prediction mode information and the motion vector According broadcasting, decoding apparatus characterized by comprising a reference image generating means for generating a reference image based on the decoded image information.

30. The decoding apparatus according to claim 28, further comprising frame structure conversion means for converting a processing result of the inverse orthogonal means into a frame structure based on the field / frame_flag.

31. The field structure conversion means according to claim 28, further comprising field structure conversion means for converting the decoding information before being supplied to the reference image generation means into a field structure based on the field / frame_flag. Decoding device.

32. A decoding method of a decoding device, which restores image information based on compressed image information in which interlaced-scan image information is adaptively encoded on a macroblock level on a field-based or frame-based basis. Decoding image information, field / frame_flag indicating whether the encoding process at the macroblock level is the field-based or the frame-based, the block size in the orthogonal transform process included in the encoding process, MB_type shown, a quantized orthogonal transform coefficient, and a decoding step of generating prediction mode information and motion vector information, an inverse quantization step of dequantizing the quantized orthogonal transform coefficient, and to the MB_type An inverse orthogonal transform step of performing an inverse orthogonal transform on the processing result of the inverse quantization means, based on the prediction mode information According spare the motion vector information, decoding method characterized by comprising a reference image generating step of generating a reference image based on the decoded image information.

33. A program for restoring the image information based on the compressed image information in which the interlaced scanning image information is adaptively encoded in the field base or the frame base at the macroblock level, the compressed image information Field / frame_flag indicating whether the encoding process at the macroblock level is the field base or the frame base, MB_type indicating the block size in the orthogonal transform process included in the encoding process. , A quantized orthogonal transform coefficient, and a decoding step of generating prediction mode information and motion vector information, an inverse quantization step of dequantizing the quantized orthogonal transform coefficient, based on the MB_type, An inverse orthogonal transform step of performing an inverse orthogonal transform on the processing result of the inverse quantization means, the prediction mode information and the previous According motion vector information, a recording medium readable program computer, which comprises a reference image generating step of generating a reference image based on the decoded image information is recorded.

34. A computer for restoring the image information based on the compressed image information in which the image information of the interlaced scanning is adaptively encoded in the field base or the frame base at the macroblock level, and decodes the compressed image information. Then, field / frame_flag indicating whether the encoding process at the macroblock level is the field-based or the frame-based, MB_type indicating the block size in the orthogonal transform process included in the encoding process, and a quantum. Decoding step of generating the orthogonal transform coefficient that has been quantized, and prediction mode information and motion vector information, an inverse quantization step of dequantizing the quantized orthogonal transform coefficient, and the inverse step based on the MB_type. An inverse orthogonal transform step of performing an inverse orthogonal transform on the processing result of the quantization means, the prediction mode information and the motion According vector information, a program for executing the reference image generation step of generating a reference image based on the decoded image information.