JP2008219205A

JP2008219205A - Picture information encoder and picture information encoding method

Info

Publication number: JP2008219205A
Application number: JP2007050779A
Authority: JP
Inventors: Junichi Tanaka; 潤一田中; Kazufumi Sato; 数史佐藤; Yoichi Yagasaki; 陽一矢ケ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2007-02-28
Filing date: 2007-02-28
Publication date: 2008-09-18

Abstract

<P>PROBLEM TO BE SOLVED: To minimize quantization degradation of mosquito noise or the like while faithfully reproducing input picture information while reducing a calculation amount. <P>SOLUTION: By an orthogonal transformation size judgement part 13, an orthogonal transformation size is judged utilizing the feature amounts of the input picture information such as macroblock activity, macroblock dispersion and the number of edges included in a macroblock, and the orthogonal transformation size in an orthogonal transformation part 15 is determined beforehand. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、動画画像情報を可変の直交変換サイズ単位で直交変換して符号化した画像圧縮情報を出力する画像情報符号化装置及び画像情報符号化方法に関し、特に、ＭＰＥＧ、Ｈ.２６ｘ等のように、離散コサイン変換若しくはカルーネン・レーベ変換等の直交変換と動き補償によって圧縮された画像情報（ビットストリーム）を、衛星放送、ケーブルＴＶ、インターネット、携帯電話などのネットワークメディアを介して受信する際に、若しくは光、磁気ディスク、フラッシュメモリのような記憶メディア上で処理する際に用いられる画像情報符号化装置及び画像情報符号化方法に関する。 The present invention relates to an image information encoding apparatus and image information encoding method for outputting compressed image information obtained by orthogonally transforming moving image information in units of variable orthogonal transform sizes, and in particular, MPEG, H.26x, etc. As described above, when receiving image information (bitstream) compressed by orthogonal transformation such as discrete cosine transformation or Karhunen-Labe transformation and motion compensation via network media such as satellite broadcasting, cable TV, the Internet, and cellular phones. The present invention also relates to an image information encoding apparatus and an image information encoding method used when processing on a storage medium such as an optical, magnetic disk, or flash memory.

近年、画像情報をデジタルとして取り扱い、その際、効率の高い情報の伝送、蓄積を目的とし、画像情報特有の冗長性を利用して、離散コサイン変換等の直交変換と動き補償により圧縮するＭＰＥＧなどの方式に準拠した装置が、放送局などの情報配信、及び一般家庭における情報受信の双方において普及しつつある。 In recent years, image information has been handled as digital data. At that time, MPEG is used for the purpose of efficient transmission and storage of information, and compression is performed by orthogonal transform such as discrete cosine transform and motion compensation using redundancy unique to image information. An apparatus conforming to the above-described method is becoming widespread in both information distribution of broadcasting stations and information reception in general households.

特に、ＭＰＥＧ２（ＩＳＯ／ＩＥＣ１３８１８−２）は、汎用画像符号化方式として定義されており、飛び越し走査画像及び順次走査画像の双方、並びに標準解像度（ＳＤ：ＳｔａｎｄａｒｄＤｅｆｉｎｉｔｉｏｎ）画像及び高精細（ＨＤ：ＨｉｇｈＤｅｆｉｎｉｔｉｏｎ）画像を網羅する標準で、プロフェッショナル用途及びコンシューマー用途の広範なアプリケーションに現在広く用いられている。ＭＰＥＧ２圧縮方式を用いることにより、例えば７２０×４８０画素を持つ標準解像度の飛び越し走査画像であれば４〜８Ｍｂｐｓ、１９２０×１０８８画素を持つ高解像度の飛び越し走査画像であれば１８〜２２Ｍｂｐｓの符号量（ビットレート）を割り当てることで、高い圧縮率と良好な画質の実現が可能である。 In particular, MPEG2 (ISO / IEC 13818-2) is defined as a general-purpose image coding system, and includes both interlaced scanning images and progressive scanning images, standard definition (SD) images, and high definition (HD: HD). High Definition) A standard that covers images and is currently widely used in a wide range of professional and consumer applications. By using the MPEG2 compression method, for example, a standard resolution interlaced scanning image having 720 × 480 pixels is 4 to 8 Mbps, and a high resolution interlaced scanning image having 1920 × 1088 pixels is 18 to 22 Mbps. (Bit rate) can be assigned to achieve a high compression rate and good image quality.

ＭＰＥＧ２は主として放送用に適合する高画質符号化を対象としていたが、ＭＰＥＧ１より低い符号量（ビットレート）、つまりより高い圧縮率の符号化方式には対応していなかった。携帯端末の普及により、今後そのような符号化方式のニーズは高まると思われ、これに対応してＭＰＥＧ４符号化方式の標準化が行われた。画像符号化方式に関しては、１９９８年１２月にＩＳＯ／ＩＥＣ１４４９６−２としてその規格が国際標準に承認された。 MPEG2 was mainly intended for high-quality encoding suitable for broadcasting, but did not support encoding methods with a lower code amount (bit rate) than MPEG1, that is, a higher compression rate. With the widespread use of mobile terminals, the need for such an encoding system is expected to increase in the future, and the MPEG4 encoding system has been standardized accordingly. Regarding the image coding system, the standard was approved as an international standard as ISO / IEC 14496-2 in December 1998.

さらに、近年、当初テレビ会議用の画像符号化を目的として、Ｈ.２６Ｌ（ＩＴＵ−ＴＱ６／１６ＶＣＥＧ）という標準の規格化が進んでいる。Ｈ．２６ＬはＭＰＥＧ２やＭＰＥＧ４といった従来の符号化方式に比べ、その符号化、復号により多くの演算量が要求されるものの、より高い符号化効率が実現されることが知られている。また、現在、ＭＰＥＧ４の活動の一環として、このＨ．２６Ｌをベースに、Ｈ．２６Ｌではサポートされない機能をも取り入れ、より高い符号化効率を実現する標準化がＪｏｉｎｔＭｏｄｅｌｏｆＥｎｈａｎｃｅｄ−ＣｏｍｐｒｅｓｓｉｏｎＶｉｄｅｏＣｏｄｉｎｇとして行われている。標準化のスケジュールとしては、２００３年３月にはＨ．２６４及びＭＰＥＧ−４Ｐａｒｔ１０（ＡｄｖａｎｃｅｄＶｉｄｅｏＣｏｄｉｎｇ）という名の元に国際標準となった。 Furthermore, in recent years, the standardization of a standard called H.26L (ITU-T Q6 / 16 VCEG) has been advanced for the purpose of image coding for an initial video conference. H. 26L is known to achieve higher encoding efficiency, although a larger amount of computation is required for encoding and decoding than conventional encoding methods such as MPEG2 and MPEG4. In addition, as part of MPEG4 activities, this H.264 Based on H.26L Standardization that incorporates functions not supported by H.26L and achieves higher coding efficiency is performed as Joint Model of Enhanced-Compression Video Coding. As a standardization schedule, in March 2003, H.264 H.264 and MPEG-4 Part 10 (Advanced Video Coding) have become international standards.

ここで、ＡＶＣ規格に基づいた画像圧縮情報を出力とする画像情報符号化装置１００の概略構成を図２のブロック図に示す。 Here, a schematic configuration of the image information encoding apparatus 100 that outputs image compression information based on the AVC standard is shown in a block diagram of FIG.

この画像情報符号化装置１００は、Ａ／Ｄ変換部１０１、画面並べ替えバッファ１０２、加算器１０３、直交変換部１０４、量子化部１０５、可逆符号化部１０６、蓄積バッファ１０７、逆量子化部１０８、逆直交変換部１０９、デブロックフィルタ１１０、フレームメモリ１１１、イントラ予測部１１２、動き予測・補償部１１３、レート制御部１１４などからなる。 The image information encoding apparatus 100 includes an A / D conversion unit 101, a screen rearrangement buffer 102, an adder 103, an orthogonal transformation unit 104, a quantization unit 105, a lossless encoding unit 106, an accumulation buffer 107, and an inverse quantization unit. 108, an inverse orthogonal transform unit 109, a deblocking filter 110, a frame memory 111, an intra prediction unit 112, a motion prediction / compensation unit 113, a rate control unit 114, and the like.

この図２に示す画像情報符号化装置１００において、Ａ／Ｄ変換部１０１は、入力された画像信号をデジタル信号に変換して画面並べ替えバッファ１０２に供給する。そして、画面並べ替えバッファ１０２は、当該画像情報符号化装置１００から出力する画像圧縮情報のＧＯＰ（ＧｒｏｕｐｏｆＰｉｃｔｕｒｅｓ）構造に応じて、フレームの並べ替えを行う。 In the image information encoding apparatus 100 shown in FIG. 2, the A / D conversion unit 101 converts an input image signal into a digital signal and supplies it to the screen rearrangement buffer 102. Then, the screen rearrangement buffer 102 rearranges the frames in accordance with the GOP (Group of Pictures) structure of the compressed image information output from the image information encoding device 100.

ここで、イントラ符号化、すなわち、単一のフレームを用いて符号化が行われる画像情報に関しては、入力画像情報と、イントラ予測部１１２により生成される画素値の差分情報が直交変換部１０４に入力され、直交変換部１０４において、離散コサイン変換、カルーネン・レーベ変換等の直交変換が施される。直交変換部１０４は、直交変換により得られる変換係数を量子化部１０５に供給する。 Here, with respect to image information that is intra-encoded, that is, encoded using a single frame, the difference between the input image information and the pixel value generated by the intra-prediction unit 112 is sent to the orthogonal transform unit 104. The orthogonal transform unit 104 performs orthogonal transform such as discrete cosine transform and Karhunen-Loeve transform. The orthogonal transform unit 104 supplies transform coefficients obtained by the orthogonal transform to the quantization unit 105.

量子化部１０５は、直交変換部１０４から供給される変換係数に対して量子化処理を施し、量子化した変換係数を可逆符号化部１０６に供給する。 The quantization unit 105 performs a quantization process on the transform coefficient supplied from the orthogonal transform unit 104 and supplies the quantized transform coefficient to the lossless encoding unit 106.

可逆符号化部１０６は、量子化部１０５から供給される量子化された変換係数に対して、可変長符号化、算術符号化等の可逆符号化を施す。この可逆符号化部１０６により可逆符号化された変換係数は、蓄積バッファ１０７に蓄積され、画像圧縮情報として出力される。 The lossless encoding unit 106 performs lossless encoding such as variable length encoding and arithmetic encoding on the quantized transform coefficient supplied from the quantization unit 105. The transform coefficient losslessly encoded by the lossless encoding unit 106 is stored in the storage buffer 107 and output as image compression information.

量子化部１０５の挙動は、レート制御部１１４によって制御される。また、量子化部１０５は、量子化した変換係数を逆量子化部１０８に供給する。さらに、逆直交変換部１０９において逆直交変換処理が施されて、復号画像情報となり、デブロックフィルタ１１０においてブロック歪の除去が施された後、その情報はフレームメモリ１１１に蓄積される。イントラ予測部１１２において、当該ブロック／マクロブロックに対して適用されたイントラ予測モードに関する情報は、可逆符号化部１０６に伝送され、画像圧縮情報におけるヘッダ情報の一部として符号化される。 The behavior of the quantization unit 105 is controlled by the rate control unit 114. Further, the quantization unit 105 supplies the quantized transform coefficient to the inverse quantization unit 108. Further, the inverse orthogonal transform unit 109 performs inverse orthogonal transform processing to obtain decoded image information. After the block distortion is removed by the deblocking filter 110, the information is stored in the frame memory 111. In the intra prediction unit 112, information on the intra prediction mode applied to the block / macro block is transmitted to the lossless encoding unit 106 and encoded as part of header information in the image compression information.

一方、インター符号化、すなわち、複数のフレームを用いて符号化が行われる画像情報に関しては、画面並べ替えバッファ１０２からの画像情報は動き予測・補償部１１３に入力される。動き予測・補償部１１３は、同時に参照される画像情報をフレームメモリ１１１から読み出し、動き予測・補償処理を施して参照画像情報を生成し、この参照画像情報を加算器１０３に供給する。加算器１０３は、画面並べ替えバッファ１０２からの画像情報を参照画像情報との差分信号に変換する。動き予測・補償部１１３は、同時に動きベクトル情報を可逆符号化部１０６に供給する。可逆符号化部１０６は、その動きベクトル情報に対して可変長符号化、算術符号化といった可逆符号化処理を施し、画像圧縮情報のヘッダ部に挿入される情報を形成する。その他の処理はイントラ符号化を施される画像圧縮情報と同様である。 On the other hand, with respect to image information that is inter-encoded, that is, encoded using a plurality of frames, the image information from the screen rearrangement buffer 102 is input to the motion prediction / compensation unit 113. The motion prediction / compensation unit 113 reads image information that is simultaneously referenced from the frame memory 111, performs motion prediction / compensation processing, generates reference image information, and supplies the reference image information to the adder 103. The adder 103 converts the image information from the screen rearrangement buffer 102 into a difference signal from the reference image information. The motion prediction / compensation unit 113 supplies motion vector information to the lossless encoding unit 106 at the same time. The lossless encoding unit 106 performs lossless encoding processing such as variable length encoding and arithmetic encoding on the motion vector information, and forms information to be inserted into the header portion of the image compression information. Other processes are the same as the image compression information subjected to intra coding.

次に、離散コサイン変換若しくはカルーネン・レーベ変換等の直交変換と動き補償により画像圧縮を実現する画像情報復号装置２００の概略構成を図３のブロック図を示す。 Next, a block diagram of FIG. 3 shows a schematic configuration of an image information decoding apparatus 200 that realizes image compression by orthogonal transformation such as discrete cosine transformation or Karhunen-Labe transformation and motion compensation.

この画像情報復号装置２００は、蓄積バッファ２０１、可逆復号部２０２、逆量子化部２０３、逆直交変換部２０４、加算器２０５、画面並べ替えバッファ２０６、Ｄ／Ａ変換部２０７、フレームメモリ２０８、動き補償・補償部２０９、イントラ予測部２１０、デブロックフィルタ２１１などからなる。 The image information decoding apparatus 200 includes a storage buffer 201, a lossless decoding unit 202, an inverse quantization unit 203, an inverse orthogonal transform unit 204, an adder 205, a screen rearrangement buffer 206, a D / A conversion unit 207, a frame memory 208, It includes a motion compensation / compensation unit 209, an intra prediction unit 210, a deblock filter 211, and the like.

この図３に示す画像情報復号装置２００において、蓄積バッファ２０１は、入力された画像圧縮情報を一時的に格納して、可逆復号部２０２に転送する。可逆復号部２０２は、蓄積バッファ２０１から転送されてくる画像圧縮情報に対して、定められた画像圧縮情報のフォーマットに基づき、可変長復号、算術復号等の処理を施す。また、可逆復号部２０２は、当該フレームがイントラ符号化されたものである場合、画像圧縮情報のヘッダ部に格納されたイントラ予測モード情報についても復号し、その情報をイントラ予測部２１０に供給する。さらに、可逆復号部２０２は、当該フレームがインター符号化されたものである場合、画像圧縮情報のヘッダ部に格納された動きベクトル情報についても復号し、その情報を動き予測・補償部２０９に供給する。 In the image information decoding device 200 shown in FIG. 3, the accumulation buffer 201 temporarily stores the input image compression information and transfers it to the lossless decoding unit 202. The lossless decoding unit 202 performs processing such as variable length decoding and arithmetic decoding on the compressed image information transferred from the accumulation buffer 201 based on the determined format of the compressed image information. Further, when the frame is intra-coded, the lossless decoding unit 202 also decodes the intra prediction mode information stored in the header portion of the image compression information, and supplies the information to the intra prediction unit 210. . Further, when the frame is inter-coded, the lossless decoding unit 202 also decodes the motion vector information stored in the header portion of the image compression information and supplies the information to the motion prediction / compensation unit 209. To do.

逆量子化部２０３は、可逆復号部２０２から供給される量子化された変換係数を逆量子化し、変換係数として逆直交変換部２０４に供給する。逆直交変換部２０４は、逆量子化部２０３から供給される変換係数に対して、定められた方式に基づき、４次の逆直交変換を施す。 The inverse quantization unit 203 performs inverse quantization on the quantized transform coefficient supplied from the lossless decoding unit 202 and supplies the quantized transform coefficient to the inverse orthogonal transform unit 204 as a transform coefficient. The inverse orthogonal transform unit 204 performs a fourth-order inverse orthogonal transform on the transform coefficient supplied from the inverse quantization unit 203 based on a determined method.

ここで、当該フレームがイントラ符号化されたものである場合には、逆直交変換処理が施された画像情報は、加算器２０５に供給され、イントラ予測部２１０において生成された予測画像情報と合成され、更に、デブロックフィルタ２１１においてブロック歪の除去が施された後、画面並べ替えバッファ２０６に格納され、Ｄ／Ａ変換部２０７によりＤ／Ａ変換処理の後に出力される。 Here, when the frame is intra-coded, the image information subjected to the inverse orthogonal transform process is supplied to the adder 205 and combined with the predicted image information generated by the intra prediction unit 210. Further, after the block distortion is removed by the deblocking filter 211, it is stored in the screen rearrangement buffer 206, and is output after D / A conversion processing by the D / A conversion unit 207.

一方、当該フレームがインター符号化されたものである場合には、動き予測・補償部２０９は、可逆復号部２０２により可逆復号処理が施された動きベクトル情報、及び、フレームメモリ２０８に格納された画像情報を元に参照画像情報を生成し、加算器２０５に供給する。加算器２０５は、この参照画像情報と逆直交変換部２０４の出力とを合成する。その他の処理はイントラ符号化されたフレームと同様である。 On the other hand, if the frame is inter-coded, the motion prediction / compensation unit 209 stores the motion vector information subjected to the lossless decoding process by the lossless decoding unit 202 and the frame memory 208. Reference image information is generated based on the image information and supplied to the adder 205. The adder 205 synthesizes this reference image information and the output of the inverse orthogonal transform unit 204. Other processing is the same as that of the intra-coded frame.

特開２００５−３４１１８４号公報JP 2005-341184 A 特開２００５−３９７４３号公報JP-A-2005-39743 特開２００２−１８５９１４号公報JP 2002-185914 A

ところで、ＡＶＣではそもそも４×４ＤＣＴしか存在しなかった。これは、ＤＣＴサイズを小さくすることでモスキートノイズの伝播が抑えられるという効果があったためである。しかし、標準解像度（ＳＤ：ＳｔａｎｄａｒｄＤｅｆｉｎｉｔｉｏｎ）画像やそれ以下の画像サイズではその効果が高いが、高精細（ＨＤ：ＨｉｇｈＤｅｆｉｎｉｔｉｏｎ）画像では４×４ＤＣＴではＧｒａｉｎｎｏｉｓｅなどのテクスチャーが忠実に再現できず８×８ＤＣＴの方がより忠実に再現できることが後にわかった。 By the way, in AVC, only 4 × 4 DCT existed in the first place. This is because the propagation of mosquito noise can be suppressed by reducing the DCT size. However, although the effect is high in a standard definition (SD) image or an image size smaller than that, a texture such as grain noise cannot be faithfully reproduced in a 4 × 4 DCT in a high definition (HD) image. It was later found that x8DCT can be reproduced more faithfully.

そこで、２００４年には、原画をより忠実に再現することを目的に従来のプロファイルに加えてＦｉｄｅｌｉｔｙＲａｎｇｅＥｘｔｅｎｔｉｏｎとしてＨｉｇｈＰｒｏｆｉｌｅが追加された。従来のＭａｉｎＰｒｏｆｉｌｅとの大きな相違点は、ＤＣＴサイズを８×８と４×４をマクロブロック単位で切り替えることができる、ＳｃａｌｉｎｇＬｉｓｔＭａｔｒｉｘと呼ばれる量子化マトリクスを使用することができるということである。 Therefore, in 2004, High Profile was added as a Fidelity Range Extension in addition to the conventional profile for the purpose of reproducing the original image more faithfully. A major difference from the conventional Main Profile is that a quantization matrix called a scaling list matrix that can switch the DCT size between 8 × 8 and 4 × 4 in units of macroblocks can be used.

８×８ＤＣＴの導入により高精細（ＨＤ：ＨｉｇｈＤｅｆｉｎｉｔｉｏｎ）画像におけるＧｒａｉｎｎｏｉｓｅなどのテクスチャー（模様）の再現性が向上している。ＳｃａｌｉｎｇＬｉｓｔＭａｔｒｉｘの導入により人間の視覚特性に適した画質向上が図られている。 With the introduction of 8 × 8 DCT, the reproducibility of texture (pattern) such as Grain noise in a high definition (HD) image is improved. The introduction of Scaling List Matrix has improved image quality suitable for human visual characteristics.

８×８ＤＣＴと４×４ＤＣＴを切り替えるためには、ＰＰＳ（Ｐｉｃｔｕｒｅｐａｒａｍｅｔｅｒｓｅｔ）における変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）を［１］に設定する必要がある。これによりマクロブロック毎にＤＣＴサイズを切り替えることができる。 In order to switch between 8 × 8 DCT and 4 × 4 DCT, it is necessary to set the transform size flag (transform_size — 8 × 8_flag) in PPS (Picture parameter set) to [1]. As a result, the DCT size can be switched for each macroblock.

マクロブロックのＤＣＴサイズはＭａｃｒｏｂｌｏｃｋｌａｙｅｒｓｙｎｔａｘにおけるｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇにより切り替えることができる。このフラグが［０］であれば４×４ＤＣＴ、［１］であれば８×８ＤＣＴである。８×８ＤＣＴの場合は動き補償のブロックサイズが８×８以上でなければならない。 The DCT size of the macro block can be switched by transform_size_8 × 8_flag in Macroblock layer syntax. If this flag is [0], it is 4 × 4 DCT, and if it is [1], it is 8 × 8 DCT. In the case of 8 × 8 DCT, the block size of motion compensation must be 8 × 8 or more.

このように、ＨｉｇｈＰｒｏｆｉｌｅを追加し、ＤＣＴサイズの切り替えが可能になったのであるが、ＨＤでもエッジ部分もしくは極端に複雑なテクスチャーでは８×８ＤＣＴを使用することによるモスキートノイズが観測され、主観画質が悪いという現象が発生する。入力画像情報をより忠実に再現しつつ、モスキートノイズなどの量子化劣化を最小限にするためにＤＣＴサイズを切り替えなければならないという問題点がある。また８×８ＤＣＴ、４×４ＤＣＴの両方を考慮した動き予測およびイントラ予測を行わなければならないため計算量の増加という問題点がある。 In this way, the High Profile is added and the DCT size can be switched. However, in HD, mosquito noise due to the use of 8 × 8 DCT is observed in the edge portion or extremely complex texture, and the subjective image quality The phenomenon that is bad occurs. There is a problem that the DCT size must be switched in order to minimize the quantization deterioration such as mosquito noise while reproducing the input image information more faithfully. In addition, since motion prediction and intra prediction considering both 8 × 8 DCT and 4 × 4 DCT must be performed, there is a problem that the amount of calculation increases.

そこで、本発明の目的は、上述の如き従来の問題点に鑑み、計算量を削減しつつ、入力画像情報をより忠実に再現しつつ、モスキートノイズなどの量子化劣化を最小限にすることができる画像情報符号化装置及び画像情報符号化方法を提供することにある
本発明の更に他の目的、本発明によって得られる具体的な利点は、以下に説明される実施の形態の説明から一層明らかにされる。 Therefore, in view of the conventional problems as described above, an object of the present invention is to minimize quantization degradation such as mosquito noise while reducing the amount of calculation and reproducing the input image information more faithfully. Another object of the present invention and specific advantages obtained by the present invention are further apparent from the description of embodiments described below. To be.

本発明では、上述の課題を解決するために、直交変換サイズ判定手段により、入力画像情報の特徴量を利用して直交変換サイズの判定を行い、前もって直交変換サイズを決定することで、ＤＣＴサイズ判定部を導入する。特徴量は入力マクロブロックにおける画像のエッジ部分もしくは複雑なテクスチャーを表現できるものであれば良く、例えば、マクロブロックアクティビティ、マクロブロック分散、マクロブロックに含まれるエッジの数などである。 In the present invention, in order to solve the above-described problem, the orthogonal transform size determination means determines the orthogonal transform size using the feature amount of the input image information, and determines the orthogonal transform size in advance, thereby determining the DCT size. Introduce a judgment unit. The feature amount may be anything that can represent the edge portion of the image or a complex texture in the input macroblock, such as macroblock activity, macroblock variance, and the number of edges included in the macroblock.

すなわち、本発明は、動画画像情報を可変の直交変換サイズ単位で直交変換して符号化した画像圧縮情報を出力する画像情報符号化装置であって、入力された動画画像情報について、所定画素数単位の画像特徴量に基づいて、上記直交変換サイズを判定する直交変換サイズ判定手段を備え、この直交変換サイズ判定手段による判定結果に応じて上記直交変換サイズを切り換えて直交変換を行うことを特徴とする。 That is, the present invention is an image information encoding device that outputs image compression information obtained by encoding and transforming moving image information in units of variable orthogonal transform sizes. The input moving image information includes a predetermined number of pixels. An orthogonal transform size determining unit that determines the orthogonal transform size based on a unit image feature amount is provided, and the orthogonal transform is performed by switching the orthogonal transform size according to a determination result by the orthogonal transform size determining unit And

また、本発明は、動画画像情報を可変の直交変換サイズ単位で直交変換して符号化した画像圧縮情報を出力する画像情報符号化方法であって、入力された動画画像情報について、所定画素数単位の画像特徴量に基づいて、上記直交変換サイズを判定し、その判定結果に応じて上記直交変換サイズを切り換えて直交変換を行うことを特徴とする。 The present invention is also an image information encoding method for outputting compressed image information obtained by encoding moving image information by performing orthogonal transform in variable orthogonal transform size units, and for input moving image information, a predetermined number of pixels is provided. The orthogonal transform size is determined based on the unit image feature amount, and the orthogonal transform is performed by switching the orthogonal transform size according to the determination result.

本発明では、直交変換サイズ判定手段により、マクロブロックアクティビティ、マクロブロック分散、マクロブロックに含まれるエッジの数などの入力画像情報の特徴量を利用して直交変換サイズの判定を行い、前もって直交変換サイズを決定することで、計算量を削減しつつ、入力画像情報をより忠実に再現しつつ、モスキートノイズなどの量子化劣化を最小限にすることができる。 In the present invention, the orthogonal transform size determination means determines the orthogonal transform size using the feature amount of the input image information such as the macroblock activity, the macroblock variance, the number of edges included in the macroblock, and the orthogonal transform in advance. By determining the size, it is possible to minimize the deterioration of quantization such as mosquito noise while reproducing the input image information more faithfully while reducing the amount of calculation.

以下、本発明の実施の形態について、図面を参照して詳細に説明する。なお、本発明は以下の例に限定されるものではなく、本発明の要旨を逸脱しない範囲で、任意に変更可能であることは言うまでもない。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. Needless to say, the present invention is not limited to the following examples, and can be arbitrarily changed without departing from the gist of the present invention.

本発明は、例えば図１に示すような構成の画像情報符号化装置１０に適用される。 The present invention is applied to, for example, an image information encoding apparatus 10 configured as shown in FIG.

この画像情報符号化装置１０は、Ａ／Ｄ変換部１１、画面並べ替えバッファ１２、ＤＣＴサイズ判定部１３、加算器１４、直交変換部１５、量子化部１６、可逆符号化部１７、蓄積バッファ１８、逆量子化部１９、逆直交変換部２０、デブロックフィルタ２１、フレームメモリ２２、イントラ予測部２３、動き予測・補償部２４、レート制御部２５などからなる。 The image information encoding device 10 includes an A / D conversion unit 11, a screen rearrangement buffer 12, a DCT size determination unit 13, an adder 14, an orthogonal transformation unit 15, a quantization unit 16, a lossless encoding unit 17, and a storage buffer. 18, an inverse quantization unit 19, an inverse orthogonal transform unit 20, a deblock filter 21, a frame memory 22, an intra prediction unit 23, a motion prediction / compensation unit 24, a rate control unit 25, and the like.

この図１に示す画像情報符号化装置１０において、Ａ／Ｄ変換部１１は、入力された動画像信号をデジタル信号に変換して画面並べ替えバッファ１２に供給する。 In the image information encoding apparatus 10 shown in FIG. 1, the A / D conversion unit 11 converts the input moving image signal into a digital signal and supplies the digital signal to the screen rearrangement buffer 12.

画面並べ替えバッファ１２は、Ａ／Ｄ変換部１１によりデジタル信号に変換された入力画像情報について、当該画像情報符号化装置１０から出力する画像圧縮情報のＧＯＰ（ＧｒｏｕｐｏｆＰｉｃｔｕｒｅｓ）構造に応じて、フレームの並べ替えを行い、並べ替え済みの入力画像情報をＤＣＴサイズ判定部１３を介して加算器１４、直交変換部１５、イントラ予測部２３及び動き予測・補償部２４に供給する。 The screen rearrangement buffer 12 is configured to convert the input image information converted into a digital signal by the A / D conversion unit 11 according to a GOP (Group of Pictures) structure of image compression information output from the image information encoding device 10. The rearranged input image information is supplied to the adder 14, the orthogonal transform unit 15, the intra prediction unit 23, and the motion prediction / compensation unit 24 through the DCT size determination unit 13.

ＤＣＴサイズ判定部１３は、画面並べ替えバッファ１２から供給される入力画像情報について、例えば、マクロブロックアクティビティ、マクロブロック分散、マクロブロックに含まれるエッジの数などの特徴量を用いてＤＣＴサイズを判定し、判定結果に応じてＤＣＴサイズを示す変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）を変更する。 The DCT size determination unit 13 determines the DCT size of the input image information supplied from the screen rearrangement buffer 12 using, for example, feature quantities such as macroblock activity, macroblock variance, and the number of edges included in the macroblock. Then, the conversion size flag (transform_size_8 × 8_flag) indicating the DCT size is changed according to the determination result.

加算器１４は、ＤＣＴサイズ判定部１３を介して供給される入力画像情報とイントラ若しくはインター予測画像情報との差分値をマクロブロック毎に生成する。 The adder 14 generates a difference value between the input image information supplied via the DCT size determination unit 13 and the intra or inter predicted image information for each macroblock.

ここで、イントラ符号化、すなわち、単一のフレームを用いて符号化が行われる画像情報に関しては、入力画像情報と、イントラ予測部２３により生成されるイントラ予測画像情報の差分値が直交変換部１５に入力される。また、インター符号化、すなわち、複数のフレームを用いて符号化が行われる画像情報に関しては、入力画像情報と、動き補償・予測部２４により生成される参照画像情報の差分値が直交変換部１５に入力される。 Here, with respect to image information that is intra-encoded, that is, encoded using a single frame, the difference value between the input image information and the intra-predicted image information generated by the intra-prediction unit 23 is an orthogonal transform unit. 15 is input. Also, for image information that is inter-encoded, that is, encoded using a plurality of frames, the difference value between the input image information and the reference image information generated by the motion compensation / prediction unit 24 is the orthogonal transform unit 15. Is input.

直交変換部１５は、加算器１４から供給されるマクロブロック毎の差分値について、可変の変換サイズ単位で離散コサイン変換、カルーネン・レーベ変換等の直交変換、ここでは、離散コサイン変換（ＤＣＴ）を行い、得られる直交変換（ＤＣＴ）係数を量子化部１６に供給する。この直交変換部１５における可変の変換サイズは、ＤＣＴサイズ判定部１３により与えられる変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）で示される変換サイズに切り替えられる。ここで、上記変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）は、［１］の場合に、直交変換サイズが８×８であることを示し、［０］の場合に、直交変換サイズが４×４であることを示す。 The orthogonal transform unit 15 performs orthogonal transform such as discrete cosine transform and Karhunen-Loeve transform on the difference value for each macroblock supplied from the adder 14 in a variable transform size unit, here discrete cosine transform (DCT). The obtained orthogonal transform (DCT) coefficient is supplied to the quantization unit 16. The variable transform size in the orthogonal transform unit 15 is switched to the transform size indicated by the transform size flag (transform_size_8 × 8_flag) given by the DCT size determination unit 13. Here, the transform size flag (transform_size_8 × 8_flag) indicates that the orthogonal transform size is 8 × 8 in the case of [1], and the orthogonal transform size is 4 × 4 in the case of [0]. It shows that.

量子化部１６は、直交変換部１５から供給される変換係数に対して量子化処理を施し、量子化した変換係数を可逆符号化部１７及び逆量子化部１９に供給する。 The quantization unit 16 performs a quantization process on the transform coefficient supplied from the orthogonal transform unit 15 and supplies the quantized transform coefficient to the lossless encoding unit 17 and the inverse quantization unit 19.

量子化部１６の挙動は、レート制御部２５によって制御されている。 The behavior of the quantization unit 16 is controlled by the rate control unit 25.

可逆符号化部１７は、量子化部１６から供給される量子化された変換係数に対して、可変長符号化、算術符号化等の可逆符号化、例えば、ＣＡＢＡＣ（Ｃｏｎｔｅｘｔ−ＡｄａｐｔｉｖｅＢｉｎａｒｙＡｒｉｔｈｍｅｔｉｃＣｏｄｉｎｇ）符号化を施す。この可逆符号化部１７により可逆符号化された変換係数は、蓄積バッファ１８に蓄積され、画像圧縮情報として出力される。 The lossless encoding unit 17 performs lossless encoding such as variable length encoding and arithmetic encoding on the quantized transform coefficient supplied from the quantization unit 16, for example, CABAC (Context-Adaptive Binary Arithmetic Coding). Encode. The transform coefficient losslessly encoded by the lossless encoding unit 17 is accumulated in the accumulation buffer 18 and output as image compression information.

また、逆量子化部１９は、量子化部１６から供給される量子化された直交変換係数の逆量子化処理を行い、直交変換係数を逆直交変換部２０に供給する。 The inverse quantization unit 19 performs an inverse quantization process on the quantized orthogonal transform coefficient supplied from the quantization unit 16 and supplies the orthogonal transform coefficient to the inverse orthogonal transform unit 20.

逆直交変換部２０は、逆量子化部１９から供給される直交変換係数の逆直交変換処理を行い、得られる復号画像情報をデブロックフィルタ２１を介してフレームメモリ２２に供給する。 The inverse orthogonal transform unit 20 performs an inverse orthogonal transform process on the orthogonal transform coefficient supplied from the inverse quantization unit 19 and supplies the obtained decoded image information to the frame memory 22 via the deblock filter 21.

デブロックフィルタ２１は、復号画像情報に含まれるブロック歪を除去する。 The deblocking filter 21 removes block distortion included in the decoded image information.

フレームメモリ２２は、復号画像情報を蓄積する。 The frame memory 22 stores decoded image information.

イントラ予測部２３は、フレームメモリ２２から、隣接する、既に符号化済の画像情報を取り出して、これを元に、ＤＣＴサイズに適したイントラ予測処理のみを行う。ここで、このイントラ予測部２３は、ＤＣＴサイズ判定部１３により与えられる変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）が［１］の際には、Ｉｎｔｒａ８×８のみのイントラ予測を行う。 The intra prediction unit 23 extracts adjacent, already encoded image information from the frame memory 22 and performs only intra prediction processing suitable for the DCT size based on the extracted image information. Here, when the transform size flag (transform_size_8 × 8_flag) given by the DCT size determination unit 13 is [1], the intra prediction unit 23 performs intra prediction only for Intra8 × 8.

動き予測・補償部２４は、ＤＣＴサイズ判定部１３による判定結果に応じて、必要なブロックサイズのみ、参照画像情報に対して動きベクトルの探索及びインター予測画像情報の生成を行い、ＤＣＴサイズが事前にわからない場合は、ブロックサイズをＤＣＴサイズ判定部１３にフィードバックする。ここで、この動き予測・補償部２４は、ＤＣＴサイズ判定部１３により与えられる変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）が［１］の場合に、上記８×８サイズのみの動き予測を行い、また、ＤＣＴサイズ判定部１３により与えられる変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）が［０］の場合には、Ｉｎｔｒａ４×４又はＩｎｔｒａ１６×１６のイントラ予測を行う。 The motion prediction / compensation unit 24 searches the reference image information for the motion vector and generates the inter prediction image information for only the necessary block size according to the determination result by the DCT size determination unit 13, and the DCT size is set in advance. If it is not known, the block size is fed back to the DCT size determination unit 13. Here, when the transform size flag (transform_size_8 × 8_flag) given by the DCT size determination unit 13 is [1], the motion prediction / compensation unit 24 performs motion prediction only on the 8 × 8 size, and When the transform size flag (transform_size_8 × 8_flag) given by the DCT size determination unit 13 is [0], intra 4 × 4 or intra 16 × 16 intra prediction is performed.

レート制御部２５は、フィードバック制御により量子化部１６の動作の制御を行い、出力となる画像圧縮情報の符号量制御を行う。 The rate control unit 25 controls the operation of the quantization unit 16 by feedback control, and controls the code amount of image compression information to be output.

次に、本発明の特徴となるＤＣＴサイズ判定部１３における動作原理について説明する。 Next, the operation principle of the DCT size determination unit 13 that is a feature of the present invention will be described.

ＤＣＴサイズ判定部１３では、マクロブロック単位の特徴量よりＤＣＴサイズを判定する。特徴量は例えばマクロブロックアクティビティ、マクロブロック分散、マクロブロックに含まれるエッジの数などである。 The DCT size determination unit 13 determines the DCT size from the feature quantity in units of macroblocks. The feature amount is, for example, macroblock activity, macroblock variance, the number of edges included in the macroblock, and the like.

マクロブロックアクティビティＭＢＡＣＴは、次のように計算する。 The macroblock activity MBACT is calculated as follows.

ここで、Ｐ_ｋは原画の輝度信号ブロック内画素値である。ここで、ｓｂｌｋはマクロブロックに含まれる４つの８×８画素ブロックを表している。ＤＣＴサイズの判定は、例えば次のようにある閾値Ｔｈ_ａｃｔを用いて判定する。 Here, P _k is a pixel value in the luminance signal block of the original image. Here, sblk represents four 8 × 8 pixel blocks included in the macroblock. The DCT size is determined using a certain threshold Th _act as follows, for example.

また、マクロブロック分散ＭＢＶＡＲは次のように計算する。 The macroblock distribution MBVAR is calculated as follows.

ここで、Ｐ_ｋは原画の輝度信号ブロック内画素値である。ＤＣＴサイズの判定は、例えば閾値Ｔｈ_ｖａｒを用いて次のように判定する。 Here, P _k is a pixel value in the luminance signal block of the original image. The determination of the DCT size is performed as follows using, for example, the threshold value Th _var .

また、マクロブロックエッジＭＢＥＤＧＥは、次のように計算する。エッジ検出には、どのような方法を使用しても良いが、ここではｓｏｂｅｌオペレーションを示す。 The macroblock edge MBEDGE is calculated as follows. Any method may be used for edge detection, but here a sobel operation is shown.

上記係数を入力画像情報のｉ行ｊ列に乗算し合計したものをｇ_ｈｓｉｊ、ｇ_ｈｖｉｊとしたとき、注目画素のｓｏｂｅｌオペレーション後の画素値ｇ_ｉｊは次の式で表される。 When g _hsij and g _hvij are obtained by multiplying the above-mentioned coefficients by i rows and j columns of the input image information and obtaining the sum as g _hsij and g _hvij , the pixel value g _ij after the sobel operation of the target pixel is expressed by the following equation.

ここで、閾値Ｔｈ_{ｓｏｂｅｌ}以上の値の画素値の数をマクロブロックごとに集計したＭＢＥＤＧＥは次のように表される。 Here, _{MBEDGE in which} the number of pixel values greater than or equal to the threshold value Th _sobel is tabulated for each macroblock is expressed as follows.

ここで、Ｅ_ｋはブロック内ｓｏｂｅｌオペレーション後の画素値が閾値Ｔｈ_{ｓｏｂｅｌ}以上の場合［１］それ以外［０］の値をとるものである。ＤＣＴサイズの判定は例えば次のようにある閾値Ｔｈ_ｅｄｇｅを用いて判定する。 Here, E _k takes the value [1] when the pixel value after the intra-block sobel operation is _{equal to} or greater than the threshold Th _sobel [0]. The DCT size is determined using, for example, a certain threshold Th _edge as follows.

この処理に使用するエッジ検出方法はｓｏｂｅｌオペレーションのみに限定されず、さまざまなエッジ検出方法を利用することができる。また、マクロブロック内のエッジの数だけでなく、縦、横それぞれのエッジの数を別々に評価してよい。 The edge detection method used for this process is not limited to the sobel operation, and various edge detection methods can be used. Further, not only the number of edges in the macroblock but also the number of vertical and horizontal edges may be evaluated separately.

次に、イントラ予測部２３における動作原理について説明する。 Next, the operation principle in the intra prediction unit 23 will be described.

ＤＣＴサイズ判定部１３により与えられる変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）の値が［０］の場合は、ＤＣＴサイズが４×４であるために、ＤＣＴサイズが８×８であるＩｎｔｒａ８×８の予測は行わず、［１］の場合は、ＤＣＴサイズが８×８であるのでＤＣＴサイズが４×４であるＩｎｔｒａ４×４およびＩｎｒａ１６×１６の予測は行わない。これにより計算量の削減が行える。 When the value of the transform size flag (transform_size_8 × 8_flag) given by the DCT size determination unit 13 is [0], since the DCT size is 4 × 4, the intra 8 × 8 prediction with the DCT size of 8 × 8 is performed. In the case of [1], since the DCT size is 8 × 8, prediction of Intra4 × 4 and Inra16 × 16 with a DCT size of 4 × 4 is not performed. Thereby, the amount of calculation can be reduced.

さらに、動き予測・補償部２４における動作原理について説明する。 Further, the operation principle in the motion prediction / compensation unit 24 will be described.

ＤＣＴサイズ判定部１３により与えられる変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）の値が［１］の場合は、ＤＣＴサイズが８×８であるため、動き補償のブロックサイズが８×８以上でなければならない。そのため、８×８未満の動き予測を行う必要がないため計算の削減を行うことが可能になる。 When the value of the transform size flag (transform_size_8 × 8_flag) given by the DCT size determination unit 13 is [1], the DCT size is 8 × 8, and the motion compensation block size must be 8 × 8 or more. . Therefore, since it is not necessary to perform motion prediction of less than 8 × 8, it is possible to reduce calculation.

しかし、ハードウェアの構成によりパイプライン処理によりＤＣＴサイズ判定部１３でのＤＣＴサイズ判定より先に動き補償を行っている場合に関しては、選択された動き補償のブロックサイズが８×８未満の場合は、変換サイズフラグ（ｔｒａｎｓｆｏｒｍ＿ｓｉｚｅ＿８×８＿ｆｌａｇ）は［０］である必要があるため、その情報をＤＣＴサイズ判定部１３にフィードバックする必要がある。これによりシンタックス上矛盾のないＤＣＴサイズ判定を行うことができる。 However, regarding the case where motion compensation is performed prior to the DCT size determination in the DCT size determination unit 13 by pipeline processing due to the hardware configuration, if the selected motion compensation block size is less than 8 × 8, Since the transform size flag (transform_size_8 × 8_flag) needs to be [0], the information needs to be fed back to the DCT size determination unit 13. This makes it possible to perform DCT size determination that is syntactically consistent.

本発明に係る画像情報符号化装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the image information encoding apparatus which concerns on this invention. ＡＶＣ符号化方式に基づく画像圧縮情報を出力する画像情報符号化装置の概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the image information encoding apparatus which outputs the image compression information based on an AVC encoding system. ＡＶＣ符号化方式に基づく画像圧縮情報を入力とする画像情報復号装置の概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the image information decoding apparatus which inputs the image compression information based on an AVC encoding system.

Explanation of symbols

１１Ａ／Ｄ変換部、１２画面並べ替えバッファ、１３ＤＣＴサイズ判定部、１４加算器、１５直交変換部、１６量子化部、１７可逆符号化部、１８蓄積バッファ、１９逆量子化部、２０逆直交変換部、２１デブロックフィルタ、２２フレームメモリ、２３イントラ予測部、２４動き予測・補償部、２５レート制御部、 11 A / D conversion unit, 12 screen rearrangement buffer, 13 DCT size determination unit, 14 adder, 15 orthogonal transform unit, 16 quantization unit, 17 lossless encoding unit, 18 accumulation buffer, 19 inverse quantization unit, 20 Inverse orthogonal transform unit, 21 deblock filter, 22 frame memory, 23 intra prediction unit, 24 motion prediction / compensation unit, 25 rate control unit,

Claims

An image information encoding device that outputs compressed image information obtained by encoding and transforming moving image information in units of variable orthogonal transform sizes,
The input moving image information includes an orthogonal transform size determination unit that determines the orthogonal transform size based on an image feature amount in a predetermined number of pixels.
An image information encoding apparatus, wherein orthogonal transform is performed by switching the orthogonal transform size according to a determination result by the orthogonal transform size determining means.

An image information encoding device that outputs image compression information based on MPEG-4, AVC encoding method, etc.
The orthogonal transform size determination means determines the orthogonal transform size of the input moving image information based on the image feature amount in units of macroblocks, and sets a transform size flag (transform_size_8 × 8_flag) according to the determination result. 2. The image information encoding apparatus according to claim 1, wherein the orthogonal transform size is switched to 4 × 4 or 8 × 8 by changing.

3. The image information encoding apparatus according to claim 2, wherein the orthogonal transform size determination unit switches the orthogonal transform using a macroblock activity MBACT as the image feature amount.

3. The image information encoding apparatus according to claim 2, wherein the orthogonal transform size determination unit switches the orthogonal transform using a macroblock distributed MBVAR as the image feature amount.

3. The image information encoding apparatus according to claim 2, wherein the orthogonal transform size determination unit switches the orthogonal transform using an edge MBEDGE included in a macroblock as an image feature amount.

Predicting / compensating means for performing motion prediction / compensation processing on the moving image information to be orthogonally transformed,
By determining the orthogonal transform size based on the determination result by the orthogonal transform size determination unit before performing the motion prediction by the prediction / compensation unit, the prediction / compensation unit has the orthogonal transform size of 8 × 8. 3. The image information encoding apparatus according to claim 2, wherein when the transform size flag (transform_size — 8 × 8_flag) indicating that it is “1”, motion prediction only for the 8 × 8 size is performed. 4.

Predicting / compensating means for performing motion prediction / compensation processing on the moving image information to be orthogonally transformed,
When the orthogonal transform size is determined after motion prediction is performed by the prediction / compensation unit, the orthogonal transform size determination unit determines that the transform size flag (transform_size_8 × 8_flag) is [1] and is determined by motion prediction. 3. The image information encoding apparatus according to claim 2, wherein the transform size flag (transform_size_8 × 8_flag) is changed to [0] when the size of the motion compensation performed is less than the 8 × 8 block size.

Intra prediction means for performing motion intra prediction processing on the moving image information to be orthogonally transformed,
By determining the DCT size based on the determination result by the orthogonal transform size determination means before performing the intra prediction by the intra prediction means, the intra prediction means has the transform size flag (transform_size_8 × 8_flag) set to [ 3. The image information encoding apparatus according to claim 2, wherein intra prediction of only Intra 8 × 8 is performed in the case of 1].

Intra prediction means for performing motion intra prediction processing on the moving image information to be orthogonally transformed,
By determining the DCT size based on the determination result by the orthogonal transform size determination means before performing the intra prediction by the intra prediction means, the intra prediction means has the transform size flag (transform_size_8 × 8_flag) set to [ 0], the intra prediction of Intra4 × 4 or Intra16 × 16 is performed.

An image information encoding method for outputting compressed image information obtained by orthogonally transforming and encoding moving image information in variable orthogonal transform size units,
For the input video image information, the orthogonal transformation size is determined based on the image feature amount in a predetermined number of pixels,
An image information encoding method, wherein orthogonal transform is performed by switching the orthogonal transform size according to the determination result.