JP5377395B2

JP5377395B2 - Encoding device, decoding device, and program

Info

Publication number: JP5377395B2
Application number: JP2010086549A
Authority: JP
Inventors: 敦郎市ヶ谷; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2010-04-02
Filing date: 2010-04-02
Publication date: 2013-12-25
Anticipated expiration: 2030-04-02
Also published as: JP2011223068A

Abstract

<P>PROBLEM TO BE SOLVED: To provide an encoder, decoder, and program for generating a predicted image corresponding to coding target blocks that are subregions into which a frame image is divided and for encoding differential signals between the coding target blocks and the predicted image. <P>SOLUTION: An encoder 10 provided by the present invention includes an orthogonal converter 13 comprising: a changeover switch for selecting encoding through the application of first orthogonal transformation processing to the differential signal or encoding through the application of second orthogonal transformation processing, in which the n-th base is orthogonal to the n-th base in the first orthogonal transformation processing, to the differential signal; and a cost calculation section for calculating the cost of the encoding of the frame image and controlling the switching means for each differential signal to reduce the amount of code. The encoder 10 also includes a quantization section 14 for applying quantization processing to orthogonal transformation coefficients which can be obtained through the first and/or second orthogonal transformation processing. The quantization section 14 performs quantization by using a normalization coefficient matrix common to the first and/or second orthogonal transformation processing. <P>COPYRIGHT: (C)2012,JPO&INPIT

Description

本発明は、画像符号化技術に関し、特に、非可逆な符号化方式による画像処理装置、符号化装置、復号装置及びプログラムに関する。 The present invention relates to an image encoding technique, and more particularly to an image processing device, an encoding device, a decoding device, and a program using an irreversible encoding method.

フレーム画像の画素値を直交変換によって直交変換係数で表し、この直交変換係数に対して必要に応じて量子化及びエントロピー符号化を施すことによって、少ない信号表現で品質を損なわずに、或いは最小限の品質の低下によって信号を伝送する画像符号化技術が知られている。特に、映像符号化の分野などで利用される技術である。 By representing the pixel values of the frame image with orthogonal transform coefficients by orthogonal transform, and performing quantization and entropy coding on the orthogonal transform coefficients as necessary, the quality can be minimized with minimal signal representation or minimally. There is known an image coding technique for transmitting a signal due to a decrease in quality of the image. In particular, it is a technique used in the field of video encoding.

代表的な直交変換としては、ＤＦＴ（離散フーリエ変換）、ＤＣＴ（離散コサイン変換）、ＤＷＴ（離散ウェーブレット変換）が知られている。特に、小領域にフレーム画像を分割して直交変換を施すブロックベースの映像符号化では、ＤＣＴが最も広く用いられている。 As typical orthogonal transforms, DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), and DWT (Discrete Wavelet Transform) are known. In particular, DCT is most widely used in block-based video coding in which a frame image is divided into small areas and orthogonal transformation is performed.

広く用いられている映像符号化としてＭＰＥＧが知られている。ＭＰＥＧ−２やＭＰＥＧ−４ＡＶＣ／Ｈ．２６４は、フレーム画像を小領域に分割し、例えば８×８画素ブロックごとに直交変換を施し、各直交変換係数を量子化し、エントロピー符号化を施す。このような符号化では、フレーム画像の画像信号と直交変換の特性からＤＣＴが広く用いられている。 MPEG is known as a widely used video encoding. MPEG-2 and MPEG-4 AVC / H. H.264 divides the frame image into small regions, for example, performs orthogonal transform for each 8 × 8 pixel block, quantizes each orthogonal transform coefficient, and performs entropy coding. In such encoding, DCT is widely used because of the characteristics of an image signal of a frame image and orthogonal transformation.

従来からの動画像用の符号化装置（例としてＭＰＥＧ−４ＡＶＣ／Ｈ．２６４）の構成例を図１７に示す。 FIG. 17 shows a configuration example of a conventional moving picture coding apparatus (for example, MPEG-4AVC / H.264).

［符号化装置］
一般的に動画像の符号化では動き補償予測、直交変換、量子化、エントロピー符号化によって画像を符号化する。動き補償予測や画面内予測を用いる符号化方式の場合、復号された画像を予測に用いるため符号化装置内に復号装置を内包している。 [Encoding device]
In general, in encoding a moving image, an image is encoded by motion compensation prediction, orthogonal transformation, quantization, and entropy encoding. In the case of an encoding method using motion compensation prediction or intra prediction, a decoding device is included in an encoding device in order to use a decoded image for prediction.

従来からの符号化装置１００は、ブロック化部１０１と、減算部１０２と、整数精度変換部１０５（直交変換部１０３及び量子化部１０４からなる）と、整数精度逆変換部１０８（逆量子化部１０６及び逆直交変換部１０７からなる）と、加算部１０９と、デブロッキングフィルタ１２０と、メモリ１２１と、フレーム内予測部１２２と、動き補償予測部１２３と、切替スイッチ１２４と、スキャニング部１２５と、エントロピー符号化部１２６とを備える。 A conventional encoding apparatus 100 includes a blocking unit 101, a subtracting unit 102, an integer precision transform unit 105 (consisting of an orthogonal transform unit 103 and a quantization unit 104), and an integer precision inverse transform unit 108 (inverse quantization). , An addition unit 109, a deblocking filter 120, a memory 121, an intra-frame prediction unit 122, a motion compensation prediction unit 123, a changeover switch 124, and a scanning unit 125. And an entropy encoding unit 126.

ブロック化部１０１は、フレーム画像を入力してバッファ（図示せず）に保持し、このバッファから１６×１６画素のマクロブロックに分割し、このマクロブロックを８×８画素に分割するとともに、更に４×４画素に分割して予め定めた順で減算部１０２及び動き補償予測部１２３に送出する。尚、マクロブロックの分割は、８×８画素及び４×４画素に分割した小領域の画素ブロック単位で処理する例を代表的に説明し、以下、符号化対象ブロックと称する。 The blocking unit 101 receives a frame image, holds it in a buffer (not shown), divides the buffer image into 16 × 16 pixel macroblocks, divides the macroblock into 8 × 8 pixels, and further Divided into 4 × 4 pixels and sent to the subtraction unit 102 and the motion compensation prediction unit 123 in a predetermined order. The macroblock division is described as an example of processing in units of pixel blocks of small areas divided into 8 × 8 pixels and 4 × 4 pixels, and is hereinafter referred to as an encoding target block.

フレーム間予測の場合、動き補償予測部１２３は、ブロック化部１０１から供給される符号化対象ブロックに対して、メモリ１２１から取得する参照画像を用いて動きベクトル検出を行い、得られた動きベクトルを用いて動き補償を行い、その結果得られた予測画像を、切替スイッチ１２４を介して減算部１０２及び加算部１０９に出力する。動きベクトルの情報は、エントロピー符号化部１２６に送出される。 In the case of inter-frame prediction, the motion compensation prediction unit 123 performs motion vector detection on the encoding target block supplied from the blocking unit 101 using the reference image acquired from the memory 121, and the obtained motion vector , And the predicted image obtained as a result is output to the subtraction unit 102 and the addition unit 109 via the changeover switch 124. The motion vector information is sent to the entropy encoding unit 126.

フレーム内予測の場合、フレーム内予測部１２２は、メモリ１２１から取得する参照画像を用いて外挿補間を行って、その結果得られた予測画像を、切替スイッチ１２４を介して減算部１０２及び加算部１０９に出力する。 In the case of intra-frame prediction, the intra-frame prediction unit 122 performs extrapolation using the reference image acquired from the memory 121, and the prediction image obtained as a result is added to the subtraction unit 102 and the addition via the changeover switch 124. Output to the unit 109.

以下、フレーム間予測及びフレーム内予測のいずれも同様の処理が施されるため、フレーム間予測の場合を説明する。以下、代表的に、４×４ブロックの符号化について説明するが、８×８ブロックや１６×１６ブロックによる符号化も同様である。 Hereinafter, since the same processing is performed for both inter-frame prediction and intra-frame prediction, the case of inter-frame prediction will be described. Hereinafter, representatively, encoding of 4 × 4 blocks will be described, but the same applies to encoding by 8 × 8 blocks or 16 × 16 blocks.

減算部１０２は、ブロック化部１０１からの符号化対象ブロックと、動き補償予測部１２３からの予測画像との差分信号を生成して直交変換部１０３に送出する。 The subtraction unit 102 generates a difference signal between the encoding target block from the blocking unit 101 and the predicted image from the motion compensated prediction unit 123 and sends the difference signal to the orthogonal transform unit 103.

直交変換部１０３は、減算部１０２から供給される４×４ブロックの差分信号に対して直交変換（整数精度ＤＣＴ）を施し、量子化部１０４に送出する。 The orthogonal transform unit 103 performs orthogonal transform (integer precision DCT) on the difference signal of 4 × 4 blocks supplied from the subtraction unit 102 and sends the result to the quantization unit 104.

量子化部１０４は、直交変換部１０３から供給される４×４ブロックの差分信号の直交変換係数に対応する配列で量子化を行うための正規化係数マトリクス（量子化演算に用いるＭＦ：ＭｕｌｔｉｐｌｉｃａｔｉｏｎＦａｃｔｏｒのマトリクス）を用いて、４×４ブロックの差分信号の直交変換係数に対して量子化処理を行い、得られた量子化ブロックを、スキャニング部１２５を経てエントロピー符号化部１２６へ送出するとともに、逆量子化部１０６に送出する。 The quantization unit 104 is a normalized coefficient matrix (MF: Multiplication Factor used for quantization operation) for performing quantization with an array corresponding to the orthogonal transform coefficient of the 4 × 4 block difference signal supplied from the orthogonal transform unit 103. And the quantization process is performed on the orthogonal transform coefficients of the difference signal of the 4 × 4 block, and the obtained quantized block is sent to the entropy encoding unit 126 via the scanning unit 125, and The data is sent to the inverse quantization unit 106.

スキャニング部１２５は、直交変換（整数精度ＤＣＴ）に対応する予め定めたスキャニングオーダーで量子化ブロックの読出しを行い、エントロピー符号化部１２６へ送出する。 The scanning unit 125 reads out the quantized block with a predetermined scanning order corresponding to orthogonal transform (integer precision DCT), and sends it to the entropy encoding unit 126.

エントロピー符号化部１２６は、量子化部１０４から供給される量子化ブロックについてエントロピー符号化処理（適応的な可変長符号化処理や算術符号化処理を選択可能）を施しビットストリームを生成するとともに、動き補償予測部１２３から供給される動きベクトルの情報もエントロピー符号化処理を施して出力する。 The entropy encoding unit 126 performs entropy encoding processing (selectable adaptive variable length encoding processing and arithmetic encoding processing) on the quantization block supplied from the quantization unit 104 to generate a bitstream, Information on the motion vector supplied from the motion compensation prediction unit 123 is also subjected to entropy coding processing and output.

逆量子化部１０６は、量子化部１０４から供給される量子化ブロックについて逆量子化処理を行って逆直交変換部１０７に出力する。 The inverse quantization unit 106 performs an inverse quantization process on the quantization block supplied from the quantization unit 104 and outputs the result to the inverse orthogonal transform unit 107.

逆直交変換部１０７は、逆量子化部１０６から供給される直交変換係数に対して逆直交変換（整数精度ＩＤＣＴ）を施し、加算部１０９に出力する。 The inverse orthogonal transform unit 107 performs inverse orthogonal transform (integer accuracy IDCT) on the orthogonal transform coefficient supplied from the inverse quantization unit 106 and outputs the result to the addition unit 109.

加算部１０９では、逆直交変換部１０７から得られる逆直交変換した信号と、動き補償予測部１２３を経て得られる予測画像とを加算処理して復号画像を生成し、ブロック歪みを抑制するデブロッキングフィルタ１２０を介してメモリ１２１に格納する。メモリ１２１に格納された画像は、フレーム間予測における参照画像、フレーム内予測における参照画像として用いられる。 The adding unit 109 adds the signal obtained by the inverse orthogonal transform obtained from the inverse orthogonal transform unit 107 and the prediction image obtained through the motion compensation prediction unit 123 to generate a decoded image, and performs deblocking that suppresses block distortion. The data is stored in the memory 121 via the filter 120. The image stored in the memory 121 is used as a reference image in inter-frame prediction and a reference image in intra-frame prediction.

尚、切替スイッチ１２４は、画面内予測と画面間予測の切り替えに用いられる。 The changeover switch 124 is used for switching between intra prediction and inter prediction.

ところで、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、直交変換部１０３及び量子化部１０４、並びに逆量子化部１０６及び逆直交変換部１０７は、それぞれ一括処理されるため、本説明では、それぞれ整数精度変換部１０５及び整数精度逆変換部１０８として図示している。 By the way, MPEG-4 AVC / H. In H.264, since the orthogonal transform unit 103 and the quantization unit 104, and the inverse quantization unit 106 and the inverse orthogonal transform unit 107 are each collectively processed, in this description, the integer precision transform unit 105 and the integer precision inverse transform unit are respectively provided. This is illustrated as 108.

これは、直交変換と量子化を一括処理する際に、正規化係数が用いられるためである。即ち、直交変換行列で４×４ブロックの差分信号を周波数係数列に変換した値に対して正規化係数を乗じると本来の直交変換係数列となるが、この正規化係数に対応させた量子化パラメータｑＰを予め規定しておき、量子化ステップの対数と量子化パラメータｑＰが比例関係（量子化係数ｑＰが６増えると量子化ステップが２倍）となるように、予め正規化係数マトリクスの要素位置ごとの値（後述するα，β，γごとの値）を量子化係数として規定しておく。 This is because the normalization coefficient is used when the orthogonal transform and the quantization are collectively processed. In other words, when a normalization coefficient is multiplied to a value obtained by converting a difference signal of a 4 × 4 block into a frequency coefficient string by an orthogonal transformation matrix, an original orthogonal transformation coefficient string is obtained. A quantization corresponding to the normalization coefficient The parameter qP is defined in advance, and the elements of the normalization coefficient matrix are set in advance so that the logarithm of the quantization step and the quantization parameter qP have a proportional relationship (the quantization step is doubled when the quantization coefficient qP increases by 6). Values for each position (values for α, β, and γ described later) are defined as quantization coefficients.

逆直交変換と逆量子化も同様な逆処理で一括処理される。 Inverse orthogonal transform and inverse quantization are collectively processed by the same inverse process.

次に、従来からの対応する復号装置（例としてＭＰＥＧ−４ＡＶＣ／Ｈ．２６４）の構成例を図１８に示す。上記と同様、フレーム間予測及びフレーム内予測のいずれも同様の処理が施されるため、フレーム間予測の場合を説明する。 Next, FIG. 18 shows a configuration example of a conventional decoding apparatus (for example, MPEG-4AVC / H.264). Similar to the above, since the same processing is performed for both inter-frame prediction and intra-frame prediction, the case of inter-frame prediction will be described.

［復号装置］
従来からの復号装置３００は、エントロピー復号部３０１と、スキャニング部３０２と、整数精度逆変換部３０５（逆量子化部３０３及び逆直交変換部３０４からなる）と、加算部３０６と、デブロッキングフィルタ３０７と、メモリ３０８と、フレーム内予測部３０９と、動き補償予測部３１０と、切替スイッチ３１１と、並べ替え部３１２とを備える。以下、代表的に、４×４ブロックの符号化について説明するが、８×８ブロックや１６×１６ブロックによる符号化も同様である。 [Decoding device]
A conventional decoding apparatus 300 includes an entropy decoding unit 301, a scanning unit 302, an integer precision inverse transform unit 305 (consisting of an inverse quantization unit 303 and an inverse orthogonal transform unit 304), an addition unit 306, a deblocking filter 307, a memory 308, an intra-frame prediction unit 309, a motion compensation prediction unit 310, a changeover switch 311, and a rearrangement unit 312. Hereinafter, representatively, encoding of 4 × 4 blocks will be described, but the same applies to encoding by 8 × 8 blocks or 16 × 16 blocks.

エントロピー復号部３０１は、符号化されたビットストリームを入力して、エントロピー復号処理（符号化装置側で指定される適応的な可変長復号処理や算術符号化の復号処理）を施し、符号化装置側で指定されるスキャニングオーダーで量子化ブロックを読み出し、逆量子化部３０３に送出するとともに、動きベクトルの情報を復号して動き補償予測部３１０に送出する。 The entropy decoding unit 301 receives the encoded bit stream, performs entropy decoding processing (adaptive variable length decoding processing or arithmetic coding decoding processing specified on the encoding device side), and the encoding device The quantization block is read in the scanning order specified on the side, and is sent to the inverse quantization unit 303, and the motion vector information is decoded and sent to the motion compensation prediction unit 310.

逆量子化部３０３は、エントロピー復号部３０１から供給される量子化ブロックの量子化信号に対して逆量子化処理を施して４×４ブロックの差分信号の直交変換係数を取得し、逆直交変換部３０４に送出する。 The inverse quantization unit 303 performs inverse quantization processing on the quantization signal of the quantization block supplied from the entropy decoding unit 301 to obtain orthogonal transform coefficients of the difference signal of 4 × 4 blocks, and performs inverse orthogonal transform To the unit 304.

逆直交変換部３０４は、逆量子化部３０３から供給される４×４ブロックの差分信号の直交変換係数に対して、逆直交変換（整数精度ＩＤＣＴ）を施し、得られる当該４×４ブロックの差分信号を加算部３０６に送出する。 The inverse orthogonal transform unit 304 performs inverse orthogonal transform (integer accuracy IDCT) on the orthogonal transform coefficient of the difference signal of the 4 × 4 block supplied from the inverse quantization unit 303, and obtains the obtained 4 × 4 block. The difference signal is sent to the adding unit 306.

動き補償予測部３１０は、メモリ３１０から得られる参照画像とエントロピー復号部３０１から得られる動きベクトルとを用いて予測画像を生成し、切替スイッチ３１１を介して加算部３０６に出力する。 The motion compensation prediction unit 310 generates a prediction image using the reference image obtained from the memory 310 and the motion vector obtained from the entropy decoding unit 301, and outputs the prediction image to the addition unit 306 via the changeover switch 311.

加算部３０６は、逆直交変換部３０４から得られる当該差分信号と、動き補償予測部３１０から供給される予測画像とを加算して４×４画素ブロックの画像信号を復元し、復元した画像信号を、ブロック歪みを抑制するデブロッキングフィルタ３０７を介して並べ替え部３１２に送出するとともに、メモリ３０８に格納する。 The adding unit 306 adds the difference signal obtained from the inverse orthogonal transform unit 304 and the prediction image supplied from the motion compensation prediction unit 310 to restore the image signal of the 4 × 4 pixel block, and restores the restored image signal. Is transmitted to the rearrangement unit 312 via the deblocking filter 307 that suppresses block distortion and stored in the memory 308.

並べ替え部３１２は、復元した画像信号を表示信号として並べ替えを行う。 The rearrangement unit 312 rearranges the restored image signal as a display signal.

整数精度逆変換部３０５の逆直交変換と逆量子化は、符号化装置側と同様な逆処理で一括処理される。 The inverse orthogonal transform and inverse quantization of the integer precision inverse transform unit 305 are collectively processed by the inverse process similar to that on the encoding device side.

このように、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、動き補償予測又は画面内予測の予測信号と、ブロック分割された画像信号との差分信号に対して、整数精度ＤＣＴの直交変換（整数精度ＩＤＣＴの逆直交変換）が採用されている。 In this way, MPEG-4 AVC / H. In H.264, integer precision DCT orthogonal transformation (integer precision IDCT inverse orthogonal transformation) is employed for a difference signal between a prediction signal of motion compensation prediction or intra prediction and an image signal divided into blocks.

画像信号を周波数領域で表現すると、一般的に低域にエネルギーが集中することが知られており、ＤＣＴを使用することで低周波領域にエネルギーを集中させることができ、例えば８×８画素に対応する符号化対象ブロックでＤＣＴを施した場合、６４個の直交変換係数の信号を形成することができるが、高周波成分の信号が０又は０に近い数値になりやすい。そこで、映像符号化では、このように０と分かっている直交変換係数の信号をわざわざ伝送する必要がないため、高周波成分以外の直交変換係数の信号だけを伝送することで伝送信号の圧縮を実現する。 When an image signal is expressed in the frequency domain, it is generally known that energy is concentrated in the low frequency range. By using DCT, the energy can be concentrated in the low frequency domain, for example, 8 × 8 pixels. When DCT is performed on the corresponding encoding target block, 64 orthogonal transform coefficient signals can be formed, but the high-frequency component signal tends to be 0 or a value close to 0. Therefore, in video coding, it is not necessary to bother to transmit the signal of the orthogonal transform coefficient known as 0 in this way, so the transmission signal is compressed by transmitting only the signal of the orthogonal transform coefficient other than the high frequency component. To do.

しかしながら、例えば吹雪や水しぶき、或いは極端に複雑なテクスチャなどの画像信号は、高周波成分の信号を多く含み、ＤＣＴを用いて周波数領域の信号に変換してもエネルギー集中が十分ではなく、符号化効率が低下する場合があり、復号した際の画像信号の劣化につながることもあることが知られている。 However, for example, image signals such as snowstorms, splashes, and extremely complicated textures contain many high-frequency component signals, and even when converted to frequency domain signals using DCT, energy concentration is not sufficient and coding efficiency is high. It is known that the image signal may decrease, and this may lead to degradation of the image signal at the time of decoding.

このために、ＤＣＴベースの符号化方式では符号化が困難な画像信号を効率的に符号化する技術が望まれている。 For this reason, there is a demand for a technique for efficiently encoding an image signal that is difficult to be encoded by a DCT-based encoding method.

一方、高周波成分の信号に対し良好なエネルギー集中度を示す直交変換としてＤＳＴ（離散コサイン変換）が知られている。ＤＣＴとＤＳＴは、ＤＣＴの変換核がコサイン（ｃｏｓ）で構成されるのに対して、ＤＳＴの変換核がサイン(ｓｉｎ)である点だけが相違する。だたし、ＤＣＴは、ｃｏｓ(０)が直流成分を表すことができるのに対し、ＤＳＴはｓｉｎ(０)が常に０となるために、直流成分を単独の直交変換係数として表すことができないという性質がある。このため、ＤＣＴは、直流成分を多く持つ画像信号に対してエネルギー集中度が高くなり、且つ１つの直交変換係数で直流成分を表すことができるが、ＤＳＴは、直流成分を多く持つ画像信号に対してエネルギー分散を生じ、複数の直交変換係数で直流成分を表すことになる。 On the other hand, DST (Discrete Cosine Transform) is known as an orthogonal transform that shows a good energy concentration level for a high-frequency component signal. DCT and DST are different from each other only in that the conversion nucleus of DCT is constituted by cosine, whereas the conversion nucleus of DST is sine. However, in DCT, cos (0) can represent a direct current component, whereas in DST, since sin (0) is always 0, the direct current component cannot be represented as a single orthogonal transform coefficient. It has the nature of For this reason, DCT has a high energy concentration level for an image signal having a large amount of direct current component and can express the direct current component with one orthogonal transform coefficient. DST is an image signal having a large amount of direct current component. On the other hand, energy dispersion occurs, and a direct current component is represented by a plurality of orthogonal transform coefficients.

このような性質の違いから、一般的に低周波成分を多く含む画像信号の符号化では、ＤＣＴが使用され、ＤＳＴが使用されることは極めて稀である。 Due to such a difference in characteristics, in general, DCT is used and DST is very rarely used for encoding an image signal including many low frequency components.

ところで、このＤＣＴとＤＳＴの性質の違いを利用して、フレーム画像を小領域のブロックに分割し、このブロックをＤＣＴとＤＳＴの双方で直交変換を施し、ブロック内の画素間相関の強さを、ＤＣＴとＤＳＴの各係数の０ランレングスの長さで比較して、いずれが適切な直交変換であるかを判別して、量子化するための直交変換係数を決定する技術が開示されている（例えば、特許文献１参照）。 By utilizing the difference between the characteristics of DCT and DST, the frame image is divided into small area blocks, this block is subjected to orthogonal transform by both DCT and DST, and the correlation strength between pixels in the block is increased. , A technique for comparing orthogonal DCT and DST coefficients with a length of 0 run length to determine which is an appropriate orthogonal transform and determining an orthogonal transform coefficient for quantization is disclosed. (For example, refer to Patent Document 1).

特開平５−２２７１５号公報JP-A-5-22715

前述した特許文献１の技術では、符号化対象ブロック内の画素間相関の強さを、ＤＣＴとＤＳＴとの係数で、単純な０ランレングスの長さで比較し、差分信号を符号化するものではなく、符号化対象ブロックの画素値に対して直交変換を施して得られる直交変換係数を量子化して符号化する技術である。 In the technique of Patent Document 1 described above, the intensity of inter-pixel correlation in a block to be encoded is compared with a coefficient of DCT and DST with a simple length of 0 run length, and a differential signal is encoded. Rather, it is a technique for quantizing and encoding orthogonal transform coefficients obtained by performing orthogonal transform on the pixel values of the encoding target block.

しかしながら、ＤＣＴとＤＳＴの各係数で、単純な０ランレングスの長さで比較するよりも、確実に符号化時の影響を反映させたブロック内の画素間相関の強さに応じた最適な切り替えとしては改善の余地がある。 However, the optimum switching according to the strength of the correlation between the pixels in the block that reliably reflects the effect at the time of encoding, rather than comparing the coefficients of DCT and DST with a simple length of 0 run length. There is room for improvement.

また、上述したように、映像符号化では、動き補償予測技術や画面内予測技術などの発展により現在の符号化方式において実際に直交変換処理の対象となる信号は、これらの差分信号である。例えば、ＭＰＥＧでは、この差分信号に対してＤＣＴの直交変換を行うことが多い。特に、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、動き補償予測又は画面内予測の予測信号と、ブロック分割された画像信号との差分信号（差分信号）に対して、整数精度ＤＣＴの直交変換が採用されている。 Further, as described above, in video coding, signals that are actually subjected to orthogonal transform processing in the current coding scheme due to development of motion compensation prediction technology and intra-screen prediction technology are these differential signals. For example, in MPEG, DCT orthogonal transformation is often performed on this difference signal. In particular, MPEG-4 AVC / H. In H.264, orthogonal transform of integer precision DCT is adopted for a difference signal (difference signal) between a prediction signal of motion compensation prediction or intra prediction and an image signal divided into blocks.

この整数精度ＤＣＴは、後述する変換行列を２次元の画素ブロックに対して水平（この変換行列に入力して直交変換を実行）、垂直（この変換行列を転置した行列に入力して直交変換を実行）に１回ずつ適用することで２次元の直交変換係数を求める。この整数精度ＤＣＴの直交変換は、符号化／復号時の演算精度の違いによる演算誤差（量子化誤差）を極力抑えるべく、正規化係数を用いて量子化演算と一括して処理される。尚、正規化係数は、整数精度直交変換の前後で信号のエネルギーの大きさ（直交変換係数の２乗の総和）が変化しないように補正する係数である。 In this integer precision DCT, a transformation matrix, which will be described later, is horizontally (inputted to this transformation matrix to perform orthogonal transformation) with respect to a two-dimensional pixel block, and is vertical (inputted to a matrix obtained by transposing this transformation matrix to perform orthogonal transformation. To obtain a two-dimensional orthogonal transform coefficient. This orthogonal transform of integer precision DCT is processed together with a quantization operation using a normalization coefficient in order to suppress an operation error (quantization error) due to a difference in operation accuracy during encoding / decoding as much as possible. The normalization coefficient is a coefficient that is corrected so that the magnitude of the signal energy (the sum of the squares of the orthogonal transform coefficients) does not change before and after the integer precision orthogonal transform.

そこで、現行の符号化技術におけるＭＰＥＧ−４ＡＶＣ／Ｈ．２６４に対して、前述した特許文献１の技術を適用した場合、例えば、図１９に示すような構成が考えられる。 Therefore, MPEG-4 AVC / H. For example, when the above-described technique of Patent Document 1 is applied to H.264, a configuration as shown in FIG.

即ち、図１９は、図１６に示す符号装置１００における直交変換部１０３において、入力される４×４ブロックの差分信号に対して整数精度ＤＣＴを施すＤＣＴ１０３１ａ及び整数精度ＤＳＴを施すＤＳＴ１０３１ｂと、ＤＣＴ１０３１ａとＤＳＴ１０３１ｂの各係数に対して、単純な０ランレングスの長さで比較して量子化するための直交変換係数を決定する０ランレングス比較部１０３２と、決定された直交変換係数を量子化するべく切り替えを行う切替スイッチ１０３３とを構成した場合を想定した図である。 That is, FIG. 19 shows a DCT 1031a that performs integer precision DCT, a DST 1031b that performs integer precision DST, and a DCT 1031a that performs integer precision DCT on the input differential signal of 4 × 4 blocks in the orthogonal transform unit 103 in the coding apparatus 100 shown in FIG. For each coefficient of DST 1031b, a 0-run length comparison unit 1032 that determines an orthogonal transform coefficient to be quantized by comparing with a simple 0-run length length, and to quantize the determined orthogonal transform coefficient It is the figure which assumed the case where the change-over switch 1033 which switches was comprised.

上記の差分信号に対してＤＣＴを施した結果は、画素ブロックに直接的にＤＣＴを施した場合よりも、直流成分のエネルギーが減少し、全体的なエネルギー集中度が減少する。一方で、上記の差分信号に対してＤＳＴを施した結果は、画素ブロックに直接的にＤＳＴを施した場合よりも、全体的なエネルギー集中度が増大する。 As a result of applying DCT to the difference signal, the direct current component energy is reduced and the overall energy concentration is reduced as compared with the case where DCT is directly applied to the pixel block. On the other hand, as a result of applying DST to the above difference signal, the overall energy concentration increases as compared with the case where DST is directly applied to the pixel block.

従って、上記の差分信号に対してＤＣＴとＤＳＴの双方で直交変換を施し、ＤＣＴとＤＳＴの各係数の０ランレングスの長さで単純に比較するやり方では、高域に存在する“はずれ値”があるとその影響を受けてしまい必ずしも安定した符号化効率が得られないので、より符号化効率を確実に高める技法が望まれる。 Therefore, when the above-described differential signal is subjected to orthogonal transform in both DCT and DST and simply compared with the length of 0 run length of each coefficient of DCT and DST, the “outlier value” existing in the high band is used. Therefore, a stable encoding efficiency cannot always be obtained, and a technique for reliably increasing the encoding efficiency is desired.

また、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、演算負担を軽減させることを意図して、整数精度ＤＣＴと量子化を一括処理するように規定されており、単にＤＣＴとＤＳＴの双方で直交変換を施すように構成することは現在における映像符号化の趣旨にも反することになりかねず、ＤＣＴとＤＳＴの双方で直交変換を施すとしても如何にして演算負担を軽減させるかが問題となる。 In addition, MPEG-4 AVC / H. H.264 specifies that the integer precision DCT and quantization are batch-processed with the intention of reducing the computation burden, and it is currently possible to simply perform orthogonal transformation on both DCT and DST. This may be contrary to the purpose of video coding, and even if orthogonal transform is performed in both DCT and DST, how to reduce the calculation burden becomes a problem.

本発明の目的は、上述のような問題に鑑みて、演算負担をさほど増大させることなく、最適な直交変換係数を判断して符号化する符号化装置、この符号化した信号を復号する復号装置及びプログラムを提供することにある。 In view of the above-described problems, an object of the present invention is to provide an encoding device that determines and encodes an optimal orthogonal transform coefficient without significantly increasing the calculation burden, and a decoding device that decodes the encoded signal. And providing a program.

本発明では、ＤＣＴとＤＳＴの性質の違いをより的確に利用して符号化効率を高める。１つの例では、ＤＣＴとＤＳＴのエネルギーの総和で比較して、視覚感度の高い信号成分のエネルギーの集中度が高いほうを選択することにより、任意の画像に対して適応した符号化を符号量も鑑みて選択することができ、結果として従来技術よりも符号量も少なく画質もよくなる。
即ち、本発明による符号化装置は、フレーム画像を小領域に分割した符号化対象ブロックに対して予測画像を生成し、該符号化対象ブロックと前記予測画像の差分信号を符号化する符号化装置であって、前記差分信号に対して第１の直交変換処理を施して符号化するか、又は前記差分信号に対して前記第１の直交変換処理に対してｎ次の基底同士が直交関係にある第２の直交変換処理を施して符号化するかを切り替える切替手段と、前記フレーム画像に対して符号化を行う場合のコスト計算を行って、より少ない符号量となるよう前記差分信号ごとに前記切替手段の切り替えを制御するコスト算出手段と、前記第１の直交変換処理及び／又は前記第２の直交変換処理によって得られる直交変換係数に対して量子化処理を施す量子化手段とを備え、前記量子化手段は、前記第１の直交変換処理及び前記第２の直交変換処理に対して共通の正規化係数マトリクスを用いて量子化する手段を有することを特徴とする。 In the present invention, the difference in characteristics between DCT and DST is used more appropriately to increase the coding efficiency. In one example, by comparing the sum of the energy of DCT and DST and selecting the one with the higher concentration of energy of the signal component with high visual sensitivity, coding adapted to an arbitrary image is encoded. In view of this, the code amount can be reduced and the image quality can be improved as compared with the prior art.
That is, an encoding apparatus according to the present invention generates a prediction image for an encoding target block obtained by dividing a frame image into small regions, and encodes a difference signal between the encoding target block and the prediction image. The differential signal is encoded by applying a first orthogonal transform process, or the n-th bases are orthogonal to the first orthogonal transform process with respect to the differential signal. Switching means for switching whether to perform encoding by performing a second orthogonal transformation process, and cost calculation when encoding the frame image, for each difference signal so as to reduce the code amount Cost calculating means for controlling switching of the switching means, and quantization means for performing quantization processing on orthogonal transform coefficients obtained by the first orthogonal transform processing and / or the second orthogonal transform processing. The quantizing means is characterized in that it comprises means for quantizing using a common normalization coefficient matrix to said first orthogonal transform process and the second orthogonal transform of.

即ち、本発明による符号化装置は、前記量子化手段は、前記正規化係数マトリクスの読み出し順を切替えることにより前記切替手段の切り替えによらず、当該共通の正規化係数マトリクスを用いて量子化する手段を有することを特徴とする。 That is, in the encoding device according to the present invention, the quantization means quantizes using the common normalization coefficient matrix without switching the switching means by switching the reading order of the normalization coefficient matrix. It has the means.

また、本発明による符号化装置において、前記量子化手段による量子化後の伝送すべき係数を１次元配列に変換する際のスキャニングオーダーを前記第１の直交変換処理及び前記第２の直交変換処理の切り替えに応じて切り替えるスキャニング手段を有し、前記第１の直交変換処理及び前記第２の直交変換処理における互いの変換行列が、前記第１の直交変換処理及び前記第２の直交変換処理の切り替えに応じて切り替えられるスキャニングオーダーに対応して当該共通の正規化係数マトリクスを用いて量子化することができる直交関係で規定されていることを特徴とする。 Further, in the encoding apparatus according to the present invention, the first orthogonal transform process and the second orthogonal transform process may be used as a scanning order when transforming the coefficient to be transmitted after quantization by the quantizing means into a one-dimensional array. Scanning means for switching according to the switching of the first orthogonal transformation process and the second orthogonal transformation process, the mutual transformation matrix of the first orthogonal transformation process and the second orthogonal transformation process It is defined by an orthogonal relationship that can be quantized using the common normalization coefficient matrix corresponding to a scanning order that is switched according to switching.

また、本発明による符号化装置において、前記コスト算出手段は、前記符号化対象ブロックに対して異なる画素ブロックサイズごとの差分信号についてそれぞれコスト計算を行って、最も少ない符号量となる画素ブロックサイズ及び直交変換処理の組み合わせを決定し、前記符号化対象ブロックの差分信号に対する前記切替手段の切り替えを制御する手段を有することを特徴とする。 Further, in the encoding device according to the present invention, the cost calculation means performs cost calculation on the difference signals for different pixel block sizes with respect to the encoding target block, and the pixel block size and the smallest code amount, A unit for determining a combination of orthogonal transform processing and controlling switching of the switching unit with respect to a differential signal of the encoding target block is provided.

また、本発明による符号化装置において、前記第１の直交変換処理は、前記第２の直交変換処理よりも少ない係数で直流成分を表現する基底に基づいており、前記切替手段は、前記第１の直交変換処理及び前記第２の直交変換処理の前後に切替スイッチを有し、前記コスト算出手段は、前記差分信号の分散値をアクティビティの値として算出して分散値が所定の閾値より大きい場合にのみ前記第２の直交変換処理を選択するか、又は前記差分信号内の隣接画素間の相関係数をアクティビティの値として算出して相関係数が所定の閾値より小さい場合にのみ前記第２の直交変換処理を選択するよう前記切替スイッチを制御する比較手段を有することを特徴とする。 Also, in the encoding device according to the present invention, the first orthogonal transform process is based on a basis expressing a DC component with a smaller number of coefficients than the second orthogonal transform process, and the switching means includes the first A changeover switch before and after the orthogonal transformation process and the second orthogonal transformation process, and the cost calculation means calculates a variance value of the difference signal as an activity value and the variance value is larger than a predetermined threshold value Only when the second orthogonal transform process is selected, or the correlation coefficient between adjacent pixels in the difference signal is calculated as an activity value and the correlation coefficient is smaller than a predetermined threshold. Comparing means for controlling the changeover switch so as to select the orthogonal transformation process is provided.

また、本発明による符号化装置において、前記第１の直交変換処理は、前記第２の直交変換処理よりも少ない係数で直流成分を表現する基底に基づいており、前記切替手段は、前記第１の直交変換処理及び前記第２の直交変換処理の後段に切替スイッチを有し、前記コスト算出手段は、前記第１の直交変換処理によるマクロブロックの直交変換係数のエネルギー分布を算出してエネルギー集中度を算出する第１マクロブロックエネルギー集中度算出手段と、前記第２の直交変換処理によるマクロブロックの直交変換係数のエネルギー分布を算出してエネルギー集中度を算出する第２マクロブロックエネルギー集中度算出手段と、各々で算出したエネルギー集中度を比較して、エネルギー集中度の高いほうの前記第１の直交変換処理又は前記第２の直交変換処理のいずれかを選択するよう前記切替スイッチを制御する手段とを有することを特徴とする。 Also, in the encoding device according to the present invention, the first orthogonal transform process is based on a basis expressing a DC component with a smaller number of coefficients than the second orthogonal transform process, and the switching means includes the first The cost calculation means calculates the energy distribution of the orthogonal transform coefficient of the macroblock by the first orthogonal transform process and concentrates the energy in the latter stage of the orthogonal transform process and the second orthogonal transform process. First macroblock energy concentration calculating means for calculating the degree of energy, and second macroblock energy concentration calculating for calculating the energy concentration by calculating the energy distribution of the orthogonal transform coefficient of the macroblock by the second orthogonal transform processing The energy concentration degree calculated by each of the means and the first orthogonal transform process or the first one having the higher energy concentration degree. And having a means for controlling the changeover switch to select one of the orthogonal transform processing.

また、本発明による符号化装置において、前記第１の直交変換処理は、前記第２の直交変換処理よりも少ない係数で直流成分を表現する基底に基づいており、前記切替手段は、前記第１の直交変換処理の後段処理としての量子化及びエントロピー符号化処理を施す第１信号系統と、前記第２の直交変換処理の後段処理としての量子化及びエントロピー符号化処理を施す第２信号系統の後段に切替スイッチを有し、前記コスト算出手段は、前記第１信号系統の符号化値の歪み量を算出する第１歪み量算出手段と、前記第２信号系統の符号化値の歪み量を算出する第２歪み量算出手段と、各々で算出した歪み量について、レート歪み最適化法に基づくコスト値を比較して効率の良いほうの前記第１信号系統又は前記第２信号系統のいずれかを選択するよう前記切替スイッチを制御する手段とを有することを特徴とする。 Also, in the encoding device according to the present invention, the first orthogonal transform process is based on a basis expressing a DC component with a smaller number of coefficients than the second orthogonal transform process, and the switching means includes the first A first signal system that performs quantization and entropy coding processing as a subsequent process of the orthogonal transform process, and a second signal system that performs quantization and entropy coding process as a subsequent process of the second orthogonal transform process. The cost calculating means includes a first distortion amount calculating means for calculating a distortion amount of the encoded value of the first signal system, and a distortion amount of the encoded value of the second signal system. The second distortion amount calculation means to calculate, and the distortion amount calculated by each of the first signal system or the second signal system, which is more efficient by comparing the cost values based on the rate distortion optimization method choose And having a means for controlling so that the change-over switch.

更に、本発明による復号装置は、フレーム画像を小領域に分割した符号化対象ブロックに対して予測画像を生成し、該符号化対象ブロックと前記予測画像の差分信号を符号化することによって符号化されたフレーム画像を復号する復号装置であって、本発明の符号化装置における比較手段によって生成される、前記第１の直交変換処理及び前記第２の直交変換処理のいずれを選択したかを表す制御信号を取得して、符号化された当該フレーム画像を復号する手段を有し、該手段は、符号化された当該フレーム画像における量子化信号に対して、前記第１の直交変換処理及び前記第２の直交変換処理に共通する正規化係数マトリクスを用いて逆量子化処理を施す手段を有することを特徴とする。 Furthermore, the decoding device according to the present invention generates a prediction image for an encoding target block obtained by dividing a frame image into small regions, and encodes a difference signal between the encoding target block and the prediction image by encoding. A decoding device that decodes a frame image that has been selected, and represents which one of the first orthogonal transformation processing and the second orthogonal transformation processing generated by the comparison means in the encoding device of the present invention is selected. Means for obtaining a control signal and decoding the encoded frame image, the means for the quantized signal in the encoded frame image, the first orthogonal transform process and the The present invention is characterized by comprising means for performing an inverse quantization process using a normalization coefficient matrix common to the second orthogonal transform process.

また、本発明による符号化装置及び復号装置は、コンピュータとして構成することができ、各符号化及び復号処理を実行するプログラムとしての特徴を有する。 Moreover, the encoding apparatus and decoding apparatus by this invention can be comprised as a computer, and have the characteristic as a program which performs each encoding and decoding process.

本発明によれば、演算負担をさほど増大させることなく、ＤＣＴベースの符号化方式ではエネルギー集中度が十分でないような画像信号に対しても、最適な直交変換係数を判断して符号化及び復号することができる。 According to the present invention, encoding and decoding are performed by determining an optimal orthogonal transform coefficient even for an image signal whose degree of energy concentration is not sufficient in the DCT-based encoding method without increasing the calculation burden so much. can do.

本発明による一実施例の符号化装置の概略図である。1 is a schematic diagram of an encoding apparatus according to an embodiment of the present invention. 本発明による一実施例の復号装置の概略図である。FIG. 3 is a schematic diagram of a decoding device according to an embodiment of the present invention. 本発明による一実施例のＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）に対応する整数精度変換部における直交変換部の構成を示す図である。It is a figure which shows the structure of the orthogonal transformation part in the integer precision conversion part corresponding to the method (technique 1) which determines selection of DCT and DST of one Example by this invention by comparing activity by a pixel value level. 本発明による一実施例のＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）に対応する整数精度変換部における直交変換部の構成を示す図である。It is a figure which shows the structure of the orthogonal transformation part in the integer precision conversion part corresponding to the method (technique 2) which calculates and determines the energy distribution of the orthogonal transformation coefficient of both DCT and DST of one Example by this invention. 本発明による一実施例のＤＣＴ及びＤＳＴの双方の直交変換係数の量子化及びエントロピー符号化による符号量を算出し、その際のレート歪み最適化法に基づくコスト値を比較して効率の良いほうを選択する方法（技法３）に対応する構成を示す図である。The code amount by quantization and entropy coding of both orthogonal transform coefficients of DCT and DST of one embodiment according to the present invention is calculated, and the cost value based on the rate distortion optimization method at that time is compared and the more efficient one It is a figure which shows the structure corresponding to the method (technique 3) of selecting. 本発明による一実施例のＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）の双方に適用可能な整数精度逆変換部における逆直交変換部の構成を示す図である。A method of determining DCT and DST selection according to an embodiment of the present invention by comparing activities at a pixel value level (Technique 1) and a method of calculating and determining energy distributions of orthogonal transform coefficients of both DCT and DST ( It is a figure which shows the structure of the inverse orthogonal transformation part in the integer precision inverse transformation part applicable to both of technique 2). 本発明による一実施例のＤＣＴ及びＤＳＴの整数精度の変換行列の例を示す図である。It is a figure which shows the example of the conversion matrix of the integer precision of DCT and DST of one Example by this invention. 本発明による一実施例の量子化部の構成を示す図である。It is a figure which shows the structure of the quantization part of one Example by this invention. 本発明による一実施例の逆量子化部の構成を示す図である。It is a figure which shows the structure of the inverse quantization part of one Example by this invention. 本発明による一実施例の正規化係数マトリクスの例を示す図である。It is a figure which shows the example of the normalization coefficient matrix of one Example by this invention. 本発明による一実施例の量子化係数の例を示す図である。It is a figure which shows the example of the quantization coefficient of one Example by this invention. 本発明による一実施例の正規化係数マトリクスの読み出し順を切替えの説明図である。It is explanatory drawing of switching the reading order of the normalization coefficient matrix of one Example by this invention. 本発明による一実施例のＤＳＴの変換行列の変形例を示す図である。It is a figure which shows the modification of the conversion matrix of DST of one Example by this invention. 本発明による一実施例のＤＳＴの変換行列の変形例を示す図である。It is a figure which shows the modification of the conversion matrix of DST of one Example by this invention. 本発明による一実施例のＤＣＴの場合におけるスキャニングオーダーを示す図である。It is a figure which shows the scanning order in the case of DCT of one Example by this invention. 本発明による一実施例のＤＣＴの場合におけるスキャニングオーダーを示す図である。It is a figure which shows the scanning order in the case of DCT of one Example by this invention. 従来からの動画像用の符号化装置（例としてＭＰＥＧ−４ＡＶＣ／Ｈ．２６４）の構成例を示す図である。It is a figure which shows the structural example of the encoding apparatus for moving images from the past (MPEG-4AVC / H.264 as an example). 従来からの対応する復号装置（例としてＭＰＥＧ−４ＡＶＣ／Ｈ．２６４）の構成例を示す図である。It is a figure which shows the structural example of the corresponding decoding apparatus (MPEG-4AVC / H.264 as an example) from the past. ＤＣＴとＤＳＴの各係数に対して、０ランレングスの長さで比較して切り替えを行う例である。This is an example in which switching is performed by comparing each coefficient of DCT and DST with a length of 0 run length.

以下、本発明による一実施例の符号化装置１０及び復号装置３０を説明する。 Hereinafter, an encoding device 10 and a decoding device 30 according to an embodiment of the present invention will be described.

本発明による符号化装置１０は、フレーム画像を小領域に分割した符号化対象ブロックに対して予測画像を生成し、該符号化対象ブロックと前記予測画像の差分信号を符号化する符号化装置であり、この差分信号に対して第１の直交変換処理（例えば、ＤＣＴ)を施して符号化するか、又は当該差分信号に対して第１の直交変換処理に対してｎ次の基底同士が直交関係にある第２の直交変換処理（例えば、ＤＳＴ)を施して符号化するかを切り替える切替スイッチと、フレーム画像に対して符号化を行う場合のコスト計算を行って、より少ない符号量となるよう前記差分信号ごとに切替スイッチの切り替えを制御するコスト算出部とを備える。第１の直交変換処理（例えば、ＤＣＴ)は、第２の直交変換処理（例えば、ＤＳＴ)よりも少ない係数で直流成分を表現する基底に基づいており、例えば、このＤＣＴ及びＤＳＴの選択基準の方法例として、ＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法１）、画素値レベルでアクティビティを比較して決定する方法（技法２）、及び、ＤＣＴ及びＤＳＴの双方の直交変換係数の量子化及びエントロピー符号化による符号量を算出し、その際のレート歪み最適化法に基づくコスト値を比較して効率の良いほうを選択する方法（技法３）がある。いずれの場合も、最適な直交変換係数を判断して符号化することができる。
尚、レート歪み最適化法に基づくコスト算出は、例えば、
http://www.tom.comm.waseda.ac.jp/~tsukuba/source/avm20050609.pdf
を参照されたい。 An encoding apparatus 10 according to the present invention is an encoding apparatus that generates a prediction image for an encoding target block obtained by dividing a frame image into small regions, and encodes a difference signal between the encoding target block and the prediction image. Yes, this differential signal is encoded by applying a first orthogonal transform process (for example, DCT), or n-order bases are orthogonal to the first orthogonal transform process for the differential signal. A changeover switch for switching whether to perform encoding by performing a second orthogonal transformation process (for example, DST), and a cost calculation when encoding a frame image, a smaller code amount is obtained. A cost calculating unit for controlling switching of the changeover switch for each differential signal. The first orthogonal transform process (for example, DCT) is based on a base that expresses a direct current component with a smaller number of coefficients than the second orthogonal transform process (for example, DST). For example, the selection criterion of the DCT and DST Examples of methods include a method for calculating and determining the energy distribution of both DCT and DST orthogonal transform coefficients (Technique 1), a method for determining by comparing activities at the pixel value level (Technique 2), and DCT and DST. There is a method (Technique 3) for calculating the amount of code by quantization and entropy coding of both orthogonal transform coefficients and comparing cost values based on the rate distortion optimization method at that time to select the more efficient one (Technique 3) . In either case, the optimum orthogonal transform coefficient can be determined and encoded.
The cost calculation based on the rate distortion optimization method is, for example,
http://www.tom.comm.waseda.ac.jp/~tsukuba/source/avm20050609.pdf
Please refer to.

符号化装置１０では、各符号化した伝送信号に、ＤＣＴによる直交変換であるかＤＳＴによる直交変換であるかを識別可能な直交変換種別を表す制御信号（例えば、１ビット）を生成する。この制御信号は、選択した直交変換係数列のヘッダ情報として付加して後段の処理へと伝送するやり方と、制御信号を付加情報として別途伝送するやり方とがある。 The encoding device 10 generates a control signal (for example, 1 bit) representing an orthogonal transformation type that can identify whether the orthogonal transformation is DCT or DST, for each encoded transmission signal. There are a method of adding this control signal as header information of the selected orthogonal transform coefficient sequence and transmitting it to the subsequent processing, and a method of separately transmitting the control signal as additional information.

選択した直交変換係数列のヘッダ情報として付加して後段の処理へと伝送するやり方の場合、局部復号時や復号装置３０では、この制御信号に従って続いて伝送される直交変換係数を、当該制御信号によって示される直交変換種別に応じてＩＤＣＴ又はＩＤＳＴの直交変換処理を切替える。 In the case of a method of adding as header information of the selected orthogonal transform coefficient sequence and transmitting it to subsequent processing, the local transform or decoding apparatus 30 uses the orthogonal transform coefficient that is subsequently transmitted according to this control signal as the control signal. The IDCT or IDST orthogonal transform processing is switched according to the orthogonal transform type indicated by.

制御信号を付加情報として別途伝送するやり方では、この制御信号に応じて処理対象の直交変換係数に対応して適応的に処理する。 In the method of separately transmitting the control signal as additional information, adaptive processing is performed in accordance with the orthogonal transform coefficient to be processed according to the control signal.

以下の説明では、制御信号を付加情報として別途伝送するやり方で説明することで、選択した直交変換係数列のヘッダ情報として付加して後段の処理へと伝送するやり方も容易に理解されるようになる。 In the following description, the method of separately transmitting the control signal as additional information will be described so that the method of adding the control signal as header information of the selected orthogonal transform coefficient sequence and transmitting it to subsequent processing can be easily understood. Become.

図１は、本発明による一実施例の符号化装置１０の概略図である。前述した図１６に示す従来からの符号化装置１００と同様な構成要素には同一の参照番号を付しており、これらの説明は省略する。 FIG. 1 is a schematic diagram of an encoding apparatus 10 according to an embodiment of the present invention. Components similar to those of the conventional encoding device 100 shown in FIG. 16 are given the same reference numerals, and descriptions thereof are omitted.

本実施例の符号化装置１０は、従来の符号化装置１００に対して、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）を適するべく、整数精度変換部１０５（直交変換部１０３及び量子化部１０４）の代わりに、直交変換種別の選択のための制御信号を扱う整数精度変換部１５（直交変換部１３及び量子化部１４）が設けられ、整数精度逆変換部１０８（逆量子化部１０６及び逆直交変換部１０７）の代わりに、直交変換種別の選択のための制御信号を扱う整数精度逆変換部１８（逆量子化部１６及び逆直交変換部１７）が設けられている点で相違する。 The encoding apparatus 10 according to the present embodiment is different from the conventional encoding apparatus 100 in that the selection of DCT and DST is determined by comparing activities at the pixel value level (Technique 1) and orthogonality of both DCT and DST. A control signal for selecting an orthogonal transform type instead of the integer precision transform unit 105 (orthogonal transform unit 103 and quantization unit 104) in order to suit the method (technique 2) for calculating and determining the energy distribution of transform coefficients. Integer precision conversion unit 15 (orthogonal transform unit 13 and quantization unit 14) is provided, and an orthogonal transform type is selected instead of integer precision inverse transform unit 108 (inverse quantization unit 106 and inverse orthogonal transform unit 107). The difference is that an integer precision inverse transform unit 18 (inverse quantization unit 16 and inverse orthogonal transform unit 17) that handles a control signal for use is provided.

また、図２は、本発明による一実施例の復号装置３０の概略図である。前述した図１８に示す従来からの復号装置３００と同様な構成要素には同一の参照番号を付しており、これらの説明は省略する。 FIG. 2 is a schematic diagram of a decoding device 30 according to an embodiment of the present invention. Constituent elements similar to those of the conventional decoding device 300 shown in FIG. 18 described above are denoted by the same reference numerals, and description thereof is omitted.

本実施例の復号装置３０は、従来の復号装置３００に対して、画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）を適するべく、整数精度逆変換部３０５（逆量子化部３０３及び逆直交変換部３０４）の代わりに、直交変換種別の選択のための制御信号を扱う整数精度逆変換部３５（逆量子化部３３及び逆直交変換部３４）が設けられている点で相違する。 The decoding apparatus 30 of the present embodiment calculates a method (Technique 1) for comparing and determining activities at the pixel value level and the energy distribution of orthogonal transform coefficients of both DCT and DST with respect to the conventional decoding apparatus 300. In order to suit the determination method (Technique 2), instead of the integer precision inverse transform unit 305 (inverse quantization unit 303 and inverse orthogonal transform unit 304), the integer precision inverse that handles the control signal for selecting the orthogonal transform type is used. The difference is that a transform unit 35 (an inverse quantization unit 33 and an inverse orthogonal transform unit 34) is provided.

図３は、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）に対応する整数精度変換部１５における直交変換部１３の構成を示しており、図４は、ＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）に対応する整数精度変換部１５における直交変換部１３の構成を示している。 FIG. 3 shows a configuration of the orthogonal transform unit 13 in the integer precision transform unit 15 corresponding to a method (Technique 1) in which selection of DCT and DST is determined by comparing activities at the pixel value level. The structure of the orthogonal transformation part 13 in the integer precision conversion part 15 corresponding to the method (technique 2) which calculates and determines the energy distribution of the orthogonal transformation coefficient of both DCT and DST is shown.

また、図６は、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）の双方に適用可能な整数精度逆変換部１８における逆直交変換部１７を示している。 FIG. 6 shows a method of determining DCT and DST selection by comparing activities at the pixel value level (Technique 1) and a method of calculating and determining the energy distribution of orthogonal transform coefficients of both DCT and DST (Technique). 2 shows an inverse orthogonal transform unit 17 in the integer precision inverse transform unit 18 applicable to both cases 2).

更に、図８は、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）の双方に適用可能な整数精度変換部１５における量子化部１４の構成を示している。 Further, FIG. 8 shows a method for determining DCT and DST selection by comparing activities at the pixel value level (Technique 1) and a method for calculating and determining the energy distribution of orthogonal transform coefficients of both DCT and DST (Technique 1). 2 shows the configuration of the quantization unit 14 in the integer precision conversion unit 15 applicable to both cases.

更に、図９は、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）の双方に適用可能な整数精度逆変換部３５における逆量子化部３３の構成を示している。 Further, FIG. 9 shows a method of determining DCT and DST selection by comparing activities at a pixel value level (Technique 1) and a method of calculating and determining energy distribution of both DCT and DST orthogonal transform coefficients (Technique 1). 2 shows the configuration of the inverse quantization unit 33 in the integer precision inverse transform unit 35 applicable to both cases 2).

尚、復号装置３０における整数精度逆変換部３５（逆量子化部３３及び逆直交変換部３４）は、整数精度逆変換部１８（逆量子化部１６及び逆直交変換部１７）と同様に動作する。 The integer precision inverse transform unit 35 (inverse quantization unit 33 and inverse orthogonal transform unit 34) in the decoding device 30 operates in the same manner as the integer precision inverse transform unit 18 (inverse quantization unit 16 and inverse orthogonal transform unit 17). To do.

以下、具体的にそれぞれのＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）及びＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）について順に説明する。 Hereinafter, specifically, a method of determining the selection of each DCT and DST by comparing activities at the pixel value level (Technique 1) and a method of calculating and determining the energy distribution of orthogonal transform coefficients of both DCT and DST ( Technique 2) will be described in order.

［技法１］
図３（Ａ）を参照するに、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する方法（技法１）に対応する直交変換部１３は、バッファ１３１と、切替スイッチ１３２ａ及び１３２ｂと、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂと、コスト算出部１３４とを備える。 [Technique 1]
Referring to FIG. 3A, the orthogonal transform unit 13 corresponding to the method (Technique 1) for determining the selection of DCT and DST by comparing the activities at the pixel value level includes a buffer 131 and changeover switches 132a and 132b. A DCT 133a and a DST 133b, and a cost calculation unit 134.

ＤＣＴ１３３ａ及びＤＳＴ１３３ｂの処理は、図７に示すような整数精度の変換行列が用いられる。図７は、本発明による一実施例のＤＣＴ及びＤＳＴの整数精度の変換行列の例を示している。 The processing of DCT 133a and DST 133b uses an integer precision conversion matrix as shown in FIG. FIG. 7 shows an example of a DCT and DST integer precision conversion matrix according to an embodiment of the present invention.

ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、動き補償予測又は画面内予測の予測画像の信号と、ブロック分割された符号化対象ブロックの画像信号との差分信号に対して、整数精度ＤＣＴの直交変換が採用されている。 MPEG-4 AVC / H. In H.264, an orthogonal transform of integer precision DCT is adopted for a difference signal between a prediction image signal of motion compensation prediction or intra prediction and an image signal of an encoding target block that is divided into blocks.

整数精度ＤＣＴの変換行列の基底と、整数精度ＤＳＴの変換行列の基底は、それぞれｎ次の基底同士が直交関係にあることが重要である。例えば、整数精度ＤＣＴの変換行列では、０次（１行目）が［１１１１］、１次（２行目）が［２１ −１ −２］、２次（３行目）が［１ −１ −１１］、３次（４行目）が［１ −２２ −１］であり、整数精度ＤＳＴの変換行列では、１次（１行目）が［１２２１］、２次（２行目）が［１１ −１ −１］、３次（３行目）が［２ −１ −１２］、４次（４行目）が［１ −１１ −１］であるから、整数精度ＤＣＴの変換行列の１次の［２１ −１ −２］と、整数精度ＤＳＴの変換行列の１次の［１２２１］が直交関係にある。 It is important that the bases of the integer precision DCT transform matrix and the bases of the integer precision DST transform matrix are orthogonal to each other. For example, in the conversion matrix of integer precision DCT, the 0th order (first line) is [1 1 1 1], the first order (second line) is [2 1 −1 -2], and the second order (third line) is. [1 -1 -1 1], the third order (fourth row) is [1 -2 2 -1], and in the conversion matrix of integer precision DST, the first order (first row) is [1 2 2 1]. The second order (second line) is [1 1 -1 -1], the third order (third line) is [2 -1 -1 2], and the fourth order (fourth line) is [1 -1 1 -1]. ], The first-order [2 1 −1 -2] of the integer precision DCT transformation matrix and the first-order [1 2 2 1] of the integer precision DST transformation matrix are orthogonal to each other.

また、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、４×４直交変換のほかに、８×８直交変換を用いることもできるが、この場合も同様に、整数精度ＤＣＴと整数精度ＤＳＴを定義する。ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４以外の直交変換を用いる場合も、同様に基底ごとに互いに直交関係にある２種類の直交変換の変換行列を規定することができる。 In addition, MPEG-4 AVC / H. In H.264, in addition to 4 × 4 orthogonal transformation, 8 × 8 orthogonal transformation can also be used. In this case as well, integer precision DCT and integer precision DST are defined. MPEG-4 AVC / H. Similarly, when orthogonal transforms other than H.264 are used, two types of orthogonal transformation transform matrices that are orthogonal to each other can be defined for each base.

本実施例では、基底ごとに互いに直交関係にある２種類の直交変換として、ＤＣＴ（整数精度ＤＣＴ）及びＤＳＴ（整数精度ＤＳＴ）を併用するものとして説明するが、基底ごとに互いに直交関係にある２種類の直交変換であれば、同様の効果を得ることができる。 In the present embodiment, two types of orthogonal transforms that are orthogonal to each other are described as being used in combination with DCT (integer precision DCT) and DST (integer precision DST), but each base is orthogonal to each other. Similar effects can be obtained with two types of orthogonal transforms.

図３（Ａ）に示す（技法１）における直交変換部１３は、減算部１０２から供給される差分信号をバッファ１３１に保持して、コスト算出部１３４によって、バッファ１３１から処理対象の差分信号を取り出し、そのアクティビティの値からＤＣＴ１３３ａ及びＤＳＴ１３３ｂのいずれかを選択するよう制御信号を発生させ、切替スイッチ１３２ａ及び１３２ｂで切り替えを行うように構成される。 The orthogonal transform unit 13 in (Technique 1) shown in FIG. 3A holds the difference signal supplied from the subtraction unit 102 in the buffer 131, and the cost calculation unit 134 returns the difference signal to be processed from the buffer 131. A control signal is generated so as to select one of DCT 133a and DST 133b from the taken-out activity value, and switching is performed by change-over switches 132a and 132b.

図３（Ｂ）に示すように、コスト算出部１３４は、バッファ１３１から処理対象の差分信号を取り出し、アクティビティの値を算出するアクティビティ算出部１３４１と、このアクティビティの値と予め定めた閾値と比較してＤＣＴ１３３ａ及びＤＳＴ１３３ｂのいずれかを選択するよう制御信号を発生する比較部１３４２とを備える。 As shown in FIG. 3B, the cost calculation unit 134 extracts the difference signal to be processed from the buffer 131, compares the activity value with a predetermined threshold, and the activity calculation unit 1341 that calculates the activity value. And a comparator 1342 for generating a control signal so as to select either DCT 133a or DST 133b.

コスト算出部１３４は、例えば入力信号（差分信号）の分散値をアクティビティの値として算出して分散値が所定の閾値より大きい場合にのみＤＳＴを選択するか、又は隣接画素間の相関係数をアクティビティの値として算出して相関係数が所定の閾値より小さい場合にのみＤＳＴを選択するように構成される。 For example, the cost calculation unit 134 calculates a variance value of an input signal (difference signal) as an activity value and selects a DST only when the variance value is larger than a predetermined threshold value, or calculates a correlation coefficient between adjacent pixels. The DST is selected only when the activity value is calculated and the correlation coefficient is smaller than a predetermined threshold.

これにより、簡便な方法として、入力信号（差分信号）を直交変換前に分析していずれの直交変換を用いるかを決定することができ、アクティビティ算出よりも処理コストが大きいＤＣＴ１３３ａ及びＤＳＴ１３３ｂの並列処理が不要となり、処理コストの低減を図ることができる。 Thus, as a simple method, the input signal (difference signal) can be analyzed before orthogonal transformation to determine which orthogonal transformation is used, and parallel processing of DCT 133a and DST 133b, which has a higher processing cost than activity calculation. Is unnecessary, and the processing cost can be reduced.

図８は、本発明による一実施例の量子化部の構成を示す図である。図８を参照するに、整数精度変換部１５における量子化部１４は、正規化係数マトリクス記憶部１４１１と、マトリクス読出制御部１４１２と、量子化ステップ算出部１４１３と、量子化演算部１４１４とを備える。量子化部１４の各構成は、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４で採用されている処理構成とほぼ同様であるが、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂの各々用の正規化係数マトリクスを正規化係数マトリクス記憶部１４１１に記憶しておき、マトリクス読出制御部１４１２が、直交変換部１３からの直交変換種別の選択のための制御信号に従って各直交変換に対応する正規化係数マトリクスを読み出すように構成している点で相違する。尚、この制御信号は、ヘッダ情報として付加される場合には、入力される処理対象の信号から抽出する。 FIG. 8 is a diagram illustrating a configuration of a quantization unit according to an embodiment of the present invention. Referring to FIG. 8, the quantization unit 14 in the integer precision conversion unit 15 includes a normalization coefficient matrix storage unit 1411, a matrix read control unit 1412, a quantization step calculation unit 1413, and a quantization calculation unit 1414. Prepare. Each structure of the quantization unit 14 is MPEG-4 AVC / H. However, the normalization coefficient matrix for each of the DCT 133a and DST 133b is stored in the normalization coefficient matrix storage unit 1411, and the matrix read control unit 1412 includes the orthogonal transform unit. 13 is different in that the normalization coefficient matrix corresponding to each orthogonal transform is read out in accordance with the control signal for selecting the orthogonal transform type from 13. When this control signal is added as header information, it is extracted from the input signal to be processed.

量子化ステップ算出部１４１３は、逆量子化でも共通に用いる予め定めた量子化パラメータｑＰに従う量子化係数を正規化係数マトリクスから読み出して量子化ステップを算出し、量子化演算部１４１４が、入力される直交変換係数の信号を、算出した量子化ステップで除算して量子化ブロックの量子化信号を生成して外部に送出する。 The quantization step calculation unit 1413 reads a quantization coefficient according to a predetermined quantization parameter qP that is also commonly used in inverse quantization from the normalized coefficient matrix, calculates a quantization step, and the quantization calculation unit 1414 is input. The orthogonal transform coefficient signal is divided by the calculated quantization step to generate a quantized signal of the quantized block and send it out.

正規化係数マトリクスの例、及び量子化係数の例を、それぞれ図１０及び図１１に示す。この場合の量子化ステップは、量子化パラメータｑＰを６で除算した剰余（０〜５の値）で定まる要素位置ごとの値（α，β，γごとの値）に対して当該除算した商でビット左シフトさせて得られる値に相当する（ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４の方式と同様）。 An example of the normalization coefficient matrix and an example of the quantization coefficient are shown in FIGS. 10 and 11, respectively. The quantization step in this case is a quotient obtained by dividing the value for each element position (value for each α, β, γ) determined by the remainder (value of 0 to 5) obtained by dividing the quantization parameter qP by 6. It corresponds to a value obtained by shifting the bit left (similar to the MPEG-4 AVC / H.264 system).

このように、（技法１）によれば、処理コストを低減させつつ、最適な直交変換係数を判断して符号化することができる。 As described above, according to (Technique 1), it is possible to determine and code the optimal orthogonal transform coefficient while reducing the processing cost.

尚、符号化装置１０の局部復号の整数精度逆変換部１８（逆量子化部１６及び逆直交変換部１７）の動作は、図６及び図９に示すように簡単な構成で処理コストを増大させることなく実現することができる。 The operation of the integer precision inverse transform unit 18 (inverse quantization unit 16 and inverse orthogonal transform unit 17) of the local decoding of the encoding device 10 increases the processing cost with a simple configuration as shown in FIGS. It can be realized without doing.

図９は、本発明による一実施例の逆量子化部の構成を示す図である。図９を参照するに、整数精度逆変換部１８における逆量子化部１６は、正規化係数マトリクス記憶部１６１１と、マトリクス読出制御部１６１２と、量子化ステップ算出部１６１３と、逆量子化演算部１６１４とを備える。逆量子化部１８の各構成は、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４で採用されている処理構成とほぼ同様であるが、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂの各々用の正規化係数マトリクスを正規化係数マトリクス記憶部１６１１に記憶しておき、マトリクス読出制御部１６１２が、直交変換部１３からの直交変換種別の選択のための制御信号に従って各直交変換に対応する正規化係数マトリクスを読み出すように構成している点で相違する。尚、この制御信号は、ヘッダ情報として付加される場合には、入力される処理対象の信号から抽出する。 FIG. 9 is a diagram illustrating a configuration of an inverse quantization unit according to an embodiment of the present invention. Referring to FIG. 9, the inverse quantization unit 16 in the integer precision inverse transform unit 18 includes a normalized coefficient matrix storage unit 1611, a matrix read control unit 1612, a quantization step calculation unit 1613, and an inverse quantization operation unit. 1614. Each configuration of the inverse quantization unit 18 includes MPEG-4 AVC / H. However, the normalization coefficient matrix for each of the DCT 133a and the DST 133b is stored in the normalization coefficient matrix storage unit 1611, and the matrix read control unit 1612 includes the orthogonal transform unit. 13 is different in that the normalization coefficient matrix corresponding to each orthogonal transform is read out in accordance with the control signal for selecting the orthogonal transform type from 13. When this control signal is added as header information, it is extracted from the input signal to be processed.

量子化ステップ算出部１６１３は、量子化でも共通に用いる予め定めた量子化パラメータｑＰに従う量子化係数を正規化係数マトリクスから読み出して量子化ステップを算出し、量子化演算部１６１４が、入力される量子化ブロックの量子化信号を、算出した量子化ステップで乗算して直交変換係数の信号を生成して外部に送出する。 The quantization step calculation unit 1613 calculates a quantization step by reading a quantization coefficient according to a predetermined quantization parameter qP that is also used in quantization from the normalization coefficient matrix, and the quantization operation unit 1614 is input. The quantized signal of the quantized block is multiplied by the calculated quantization step to generate an orthogonal transform coefficient signal and send it out.

図６に示す逆直交変換部１７は、直交変換部１３からの制御信号に従って、逆量子化部１６から供給される対応する逆直交変換のＩＤＣＴ１７１ａ又はＩＤＳＴ１７１ｂのいずれかを選択する切替スイッチ１７２ａ及び１７２ｂで切り替えを行って、適応的に差分信号を復元する。 The inverse orthogonal transform unit 17 illustrated in FIG. 6 switches the changeover switches 172a and 172b for selecting either the corresponding inverse orthogonal transform IDCT 171a or IDST 171b supplied from the inverse quantization unit 16 in accordance with the control signal from the orthogonal transform unit 13. To switch to adaptively restore the differential signal.

整数精度逆変換部１８（逆量子化部１６及び逆直交変換部１７）の動作は、復号装置３０における整数精度逆変換部３５（逆量子化部３３及び逆直交変換部３４）の動作も同様である。 The operations of the integer precision inverse transform unit 18 (the inverse quantization unit 16 and the inverse orthogonal transform unit 17) are the same as the operations of the integer precision inverse transform unit 35 (the inverse quantization unit 33 and the inverse orthogonal transform unit 34) in the decoding device 30. It is.

従って、（技法１）によれば、処理コストを低減させつつ、最適な直交変換係数を判断して符号化することができる。 Therefore, according to (Technique 1), it is possible to determine and code the optimum orthogonal transform coefficient while reducing the processing cost.

次に、ＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）について説明する。 Next, a method (Technique 2) for calculating and determining the energy distribution of orthogonal transform coefficients of both DCT and DST will be described.

［技法２］
図４（Ａ）を参照するに、ＤＣＴ及びＤＳＴの双方の直交変換係数のエネルギー分布を算出して決定する方法（技法２）に対応する直交変換部１３は、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂと、バッファ１３４ａ，１３４ｂと、切替スイッチ１３５と、コスト算出部１３６とを備える。 [Technique 2]
Referring to FIG. 4A, the orthogonal transform unit 13 corresponding to the method (technique 2) for calculating and determining the energy distribution of the orthogonal transform coefficients of both DCT and DST includes DCT 133a and DST 133b, buffers 134a, 134b, a changeover switch 135, and a cost calculation unit 136.

図４（Ａ）に示す（技法３）における直交変換部１３は、減算部１０２から供給される差分信号をＤＣＴ１３３ａ及びＤＳＴ１３３ｂにてそれぞれ直交変換を施し、それぞれバッファ１３４ａ，１３４ｂに格納するとともに、コスト算出部１３６によって、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂの双方の直交変換係数のエネルギー分布からＤＣＴ１３３ａ及びＤＳＴ１３３ｂのいずれかを選択するよう制御信号を発生させ、切替スイッチ１３５でそれぞれのバッファ１３４ａ，１３４ｂからの読出しを制御して切替を行うように構成される。 The orthogonal transformation unit 13 in (Technique 3) shown in FIG. 4A performs orthogonal transformation on the difference signal supplied from the subtraction unit 102 by the DCT 133a and DST 133b, respectively, and stores them in the buffers 134a and 134b. The calculation unit 136 generates a control signal so as to select either DCT 133a or DST 133b from the energy distribution of the orthogonal transform coefficients of both DCT 133a and DST 133b, and the selector switch 135 controls reading from the respective buffers 134a and 134b. Are configured to perform switching.

図４（Ｂ）に示すように、コスト算出部１３６は、ＤＣＴ１３３ａのマクロブロックにおける直交変換係数のエネルギー分布を算出してエネルギー集中度を算出するマクロブロックエネルギー集中度算出部１３６１と、ＤＳＴ１３３ｂのマクロブロックにおける直交変換係数のエネルギー分布を算出してエネルギー集中度を算出するマクロブロックエネルギー集中度算出部１３６２と、マクロブロックエネルギー集中度算出部１３６１，１３６２の各々で算出したエネルギー集中度を比較して、エネルギー集中度の高いほうのＤＣＴ１３３ａ及びＤＳＴ１３３ｂのいずれかを選択するよう制御信号を発生する比較部１３６３とを備える。 As shown in FIG. 4B, the cost calculation unit 136 calculates the energy distribution of orthogonal transform coefficients in the macroblock of the DCT 133a to calculate the energy concentration level, and the macro block energy concentration level calculation unit 1361 and the macro of the DST 133b. The energy concentration calculated by each of the macroblock energy concentration calculating unit 1362 that calculates the energy distribution of the orthogonal transformation coefficient in the block and calculating the energy concentration is compared with the energy concentration calculated by each of the macroblock energy concentration calculating units 1361 and 1362. A comparison unit 1363 that generates a control signal so as to select one of DCT 133a and DST 133b having a higher energy concentration.

尚、整数精度変換部１５における量子化部１４の構成は前述の図８の説明と同様であり、符号化装置１０の局部復号の整数精度逆変換部１８（逆量子化部１６及び逆直交変換部１７）の動作や復号装置３０における整数精度逆変換部３５（逆量子化部３３及び逆直交変換部３４）の動作も、前述の図６及び図９の説明と同様である。 Note that the configuration of the quantization unit 14 in the integer precision conversion unit 15 is the same as the description of FIG. 8 described above, and the integer precision inverse transform unit 18 (inverse quantization unit 16 and inverse orthogonal transform of local decoding of the encoding device 10). The operation of the unit 17) and the operations of the integer precision inverse transform unit 35 (inverse quantization unit 33 and inverse orthogonal transform unit 34) in the decoding device 30 are the same as those described above with reference to FIGS.

これにより、（技法２）によれば、（技法１）と比較してＤＣＴ１３３ａ及びＤＳＴ１３３ｂの並列処理が必要ではあるが、実際に処理した直交変換係数の値からそのエネルギー集中度を比較するため、（技法１）と比較して符号化効率が改善する。 As a result, according to (Technique 2), parallel processing of DCT 133a and DST 133b is necessary as compared with (Technique 1), but in order to compare the energy concentration from the value of the actually processed orthogonal transform coefficient, Compared with (Technique 1), the coding efficiency is improved.

尚、（技法２）の変形例として、ＤＣＴ及びＤＳＴの選択を画素値レベルでアクティビティを比較して決定する代わりに、ＤＣＴ１３３ａ及び量子化部からなる信号系統と、ＤＳＴ１３３ｂ及び量子化部からなる信号系統の出力からマクロブロックにおけるエネルギー分布を算出してエネルギー集中度を算出してエネルギー集中度の高いほうの信号系統を選択するよう制御信号を発生するよう整数精度変換部１５を構成することもできる（図示せず）。 As a modified example of (Technique 2), instead of determining the selection of DCT and DST by comparing activities at the pixel value level, a signal system including DCT 133a and a quantization unit, and a signal including DST 133b and a quantization unit The integer precision conversion unit 15 can also be configured to generate a control signal so as to calculate the energy distribution in the macroblock from the output of the system, calculate the energy concentration, and select the signal system with the higher energy concentration. (Not shown).

上述した（技法１）及び（技法２）では、従来からのＭＰＥＧ−４ＡＶＣ／Ｈ．２６４の方式の符号化装置３００に対して、整数精度変換部１５及び整数精度逆変換部１８，３５を組み替えるのみで、最適な直交変換係数を判断して符号化することができる。一方で、更に符号化効率を向上させたい場合に、ＤＣＴ及びＤＳＴの双方の直交変換係数の量子化及びエントロピー符号化による符号量を算出し、その際のレート歪み最適化法に基づくコスト値を比較して効率の良いほうを選択する方法（技法３）があり、以下、説明する。 In the above (Technique 1) and (Technique 2), the conventional MPEG-4 AVC / H. Only the integer precision conversion unit 15 and the integer precision inverse conversion units 18 and 35 are rearranged with respect to the H.264 encoding apparatus 300, and the optimum orthogonal transform coefficient can be determined and encoded. On the other hand, when it is desired to further improve the coding efficiency, the code amount by quantization and entropy coding of the orthogonal transform coefficients of both DCT and DST is calculated, and the cost value based on the rate distortion optimization method at that time is calculated. There is a method (Technique 3) for selecting the more efficient one, which will be described below.

図５は、ＤＣＴ及びＤＳＴの双方の直交変換係数の量子化及びエントロピー符号化による符号量を算出し、その際のレート歪み最適化法に基づくコスト値を比較して効率の良いほうを選択する方法（技法３）に対応する構成を示している。図５（Ａ）に示すように、ＤＣＴ及びＤＳＴの双方の直交変換係数の量子化及びエントロピー符号化による符号量を算出するよう構成されたコスト算出処理ユニット１９は、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂと、量子化部１３７ａ，１３７ｂと、エントロピー符号化部１３８ａ，１３８ｂと、バッファ１３９ａ，１３９ｂと、切替スイッチ１４０と、コスト算出部１４１とを備える。 FIG. 5 calculates the amount of code by quantization and entropy coding of orthogonal transform coefficients of both DCT and DST, and compares the cost values based on the rate distortion optimization method at that time to select the more efficient one The structure corresponding to the method (Technique 3) is shown. As shown in FIG. 5A, the cost calculation processing unit 19 configured to calculate the code amount by quantization and entropy coding of orthogonal transform coefficients of both DCT and DST includes DCT 133a and DST 133b, Units 137a and 137b, entropy encoding units 138a and 138b, buffers 139a and 139b, a changeover switch 140, and a cost calculation unit 141.

コスト算出処理ユニット１９は、減算部１０２から供給される差分信号をＤＣＴ１３３ａ及びＤＳＴ１３３ｂにてそれぞれ直交変換を施し、さらに量子化部１３７a，１３７ｂによって、それぞれの直交変換ごとに量子化を施し、さらにエントロピー符号化部１３８ａ，１３８ｂによってそれぞれの量子化ブロックの量子化信号ごとにエントロピー符号化を施し、各符号化したビットストリーム信号（符号化値）をそれぞれバッファ１３９ａ，１３９ｂに格納するとともに、コスト算出部１４１によって、ＤＣＴ１３３ａ及びＤＳＴ１３３ｂの双方の信号系統における符号量からＤＣＴ１３３ａ及びＤＳＴ１３３ｂの信号系統のいずれかを選択するよう制御信号を発生させ、切替スイッチ１３５でそれぞれのバッファ１３９ａ，１３９ｂからの読出しを制御して切り替えを行うように構成される。 The cost calculation processing unit 19 performs orthogonal transform on the difference signal supplied from the subtracting unit 102 by the DCT 133a and DST 133b, respectively, further performs quantization for each orthogonal transform by the quantizing units 137a and 137b, and further entropy. The encoding units 138a and 138b perform entropy encoding for each quantized signal of each quantization block, store the encoded bit stream signals (encoded values) in the buffers 139a and 139b, respectively, and cost calculation units 141, a control signal is generated so as to select one of the signal systems of DCT 133a and DST 133b from the code amount in the signal systems of both DCT 133a and DST 133b, and the changeover switch 135 outputs the control signal from each of the buffers 139a and 139b. It is configured to perform switching by controlling reading.

図５（Ｂ）に示すように、コスト算出部１４１は、ＤＣＴ係数に基づく符号化値から歪み量を算出する歪み量算出部１４２１と、ＤＳＴ係数に基づく符号化値から歪み量を算出する歪み量算出部１４２２と、レート歪み最適化法に基づくコスト値を比較して効率の良いほうを選択する比較部１４２３とを備える。 As shown in FIG. 5B, the cost calculation unit 141 includes a distortion amount calculation unit 1421 that calculates a distortion amount from an encoded value based on a DCT coefficient, and a distortion that calculates a distortion amount from an encoded value based on a DST coefficient. An amount calculation unit 1422 and a comparison unit 1423 that compares cost values based on the rate distortion optimization method and selects the more efficient one are provided.

これにより、（技法３）によれば、（技法１）及び（技法２）と比較してＤＣＴ１３３ａ及びＤＳＴ１３３ｂからの各信号系統の並列処理が必要ではあるが、実際に処理した直交変換係数の値からその符号化コストと品質の効率によって比較するため、（技法１）及び（技法２）と比較して符号化効率が改善する。 Thus, according to (Technique 3), the parallel processing of each signal system from DCT 133a and DST 133b is necessary as compared with (Technique 1) and (Technique 2), but the values of the orthogonal transform coefficients actually processed Therefore, the coding efficiency is improved as compared with (Technique 1) and (Technique 2).

尚、当業者であれば、コスト算出処理ユニット１９を、前述した符号化装置１０（又は復号装置３０）に適用可能であることは上記の説明から理解される。 A person skilled in the art understands that the cost calculation processing unit 19 can be applied to the encoding device 10 (or the decoding device 30) described above.

上記の説明では、４×４ブロックの符号化処理についてコスト計算して最適な直交変換（又は直交変換の信号系統）を選択する例について説明したが、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４の方式では、８×８ブロックの符号化処理もサポートされている。従って、本実施例の符号化装置１０において、４×４ブロックと８×８ブロックの双方の符号化処理についてコスト計算して最適な直交変換（又は直交変換の信号系統）を選択するように構成することもできる。 In the above description, an example in which the cost is calculated for the encoding process of 4 × 4 blocks and the optimum orthogonal transform (or signal system of the orthogonal transform) is selected has been described, but MPEG-4 AVC / H. In the H.264 system, 8 × 8 block encoding processing is also supported. Therefore, the encoding apparatus 10 of the present embodiment is configured to calculate the cost for both the 4 × 4 block and the 8 × 8 block encoding processes and select the optimal orthogonal transform (or signal system of the orthogonal transform). You can also

最も好適なコスト算出において、符号化対象ブロックに対して異なる画素ブロックサイズごとの差分信号についてそれぞれコスト計算を行って、最も少ない符号量となる画素ブロックサイズ及び直交変換処理の組み合わせを決定し、符号化対象ブロックの差分信号に対する最適な直交変換（又は直交変換の信号系統）を選択するように切り替えを制御することである。つまり、４×４ブロックと８×８ブロックの双方の符号化処理についてコスト計算することは、前述したいずれの例でも適用可能である。 In the most preferable cost calculation, the cost calculation is performed for each difference signal for different pixel block sizes with respect to the encoding target block, and the combination of the pixel block size and the orthogonal transformation process that provides the smallest code amount is determined. The switching is controlled so as to select the optimal orthogonal transform (or signal system of the orthogonal transform) for the difference signal of the conversion target block. That is, the cost calculation for both the 4 × 4 block and the 8 × 8 block encoding processes can be applied to any of the examples described above.

例えば、ブロック化部１０１によって、１６×１６画素マクロブロックから、８×８画素（８×８ブロック）に分割し、さらに、４×４画素（４×４ブロック）に再分割する例を考える。この場合、８×８ブロックで、直交変換（又は直交変換の信号系統）のコスト計算を行うとともに、４×４ブロックで、直交変換（又は直交変換の信号系統）のコスト計算を行う。続いて、例えばレート歪み最適化法に基づいて算出した全てのコスト値から最小のものを選択し、符号化ビットストリームを最終決定する。 For example, consider an example in which the blocking unit 101 divides a 16 × 16 pixel macroblock into 8 × 8 pixels (8 × 8 blocks), and further subdivides them into 4 × 4 pixels (4 × 4 blocks). In this case, cost calculation of orthogonal transformation (or signal system of orthogonal transformation) is performed with 8 × 8 blocks, and cost calculation of orthogonal transformation (or signal system of orthogonal transformation) is performed with 4 × 4 blocks. Subsequently, for example, the smallest one is selected from all the cost values calculated based on the rate distortion optimization method, and the encoded bit stream is finally determined.

上記の例では、ＤＣＴベースの符号化方式ではエネルギー集中度が十分でないような画像信号に対しても、ブロック内の画素間相関の強さに適切に応じた直交変換係数を判断して符号化する符号化装置１０、この符号化した信号を復号する復号装置３０が実現される。ただし、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、演算負担を軽減させることを意図して、整数精度ＤＣＴと量子化を一括処理するように規定されており、単にＤＣＴとＤＳＴの双方で直交変換を施すように構成することは現在における映像符号化の趣旨にも反することになりかねない。そこで、ＤＣＴとＤＳＴの双方で直交変換を施すとしても如何にして演算負担を軽減させるかについて以下説明する。 In the above example, even for an image signal whose energy concentration is not sufficient in the DCT-based encoding method, encoding is performed by determining an orthogonal transform coefficient appropriately corresponding to the strength of inter-pixel correlation in the block. And the decoding device 30 for decoding the encoded signal are realized. However, MPEG-4 AVC / H. H.264 specifies that the integer precision DCT and quantization are batch-processed with the intention of reducing the computation burden, and it is currently possible to simply perform orthogonal transformation on both DCT and DST. This may be contrary to the purpose of video coding. Therefore, how to reduce the calculation burden even if orthogonal transformation is performed in both DCT and DST will be described below.

通常、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４の方式のままでＤＣＴとＤＳＴを併用すると、正規化係数マトリクス（量子化演算に用いるＭＦ：ＭｕｌｔｉｐｌｉｃａｔｉｏｎＦａｃｔｏｒのマトリクス）として、各直交変換に固有の量子化係数からなるものを用意する必要があり、ＤＣＴとＤＳＴでは個別の正規化係数マトリクスを用いることになる。このような正規化係数マトリクスは符号化装置及び復号装置でメモリに保持し続ける必要があり、２種類の直交変換を採用する場合、量子化のためのメモリ量が２倍になるという問題が生じる。 Usually, MPEG-4 AVC / H. When DCT and DST are used together in the H.264 format, it is necessary to prepare a normalization coefficient matrix (MF: matrix used for quantization operation) consisting of quantization coefficients specific to each orthogonal transform. In DCT and DST, separate normalization coefficient matrices are used. Such a normalization coefficient matrix needs to be kept in the memory by the encoding device and the decoding device. When two types of orthogonal transformation are adopted, there is a problem that the amount of memory for quantization is doubled. .

しかしながら、図１０から明らかなように、整数精度ＤＣＴと整数精度ＤＳＴでは、正規化係数マトリクスの配置が、右下方向に（１，１）ずらした関係にあり、各々の係数値が一致する。 However, as is apparent from FIG. 10, in the integer precision DCT and the integer precision DST, the arrangement of the normalized coefficient matrix is shifted by (1, 1) in the lower right direction, and the coefficient values thereof match.

そこで、マトリクス読出制御部１４１２，１６１２（図８及び図９）によって、ＤＣＴの正規化係数マトリクスのみを正規化係数マトリクス記憶部１４１１，１６１１（図８及び図９）に格納しておき、マトリクス読出制御部１４１２，１６１２（図８及び図９）によって、ＤＣＴとＤＳＴで正規化係数マトリクスの読み出し順を切替えることで、ＤＣＴとＤＳＴで同一の正規化係数マトリクスからそれぞれの量子化係数の値を用いることができるようになり、メモリ量の増大の問題を解決することができる（図１２参照）。 Therefore, the matrix read control units 1412 and 1612 (FIGS. 8 and 9) store only the DCT normalization coefficient matrix in the normalization coefficient matrix storage units 1411 and 1611 (FIGS. 8 and 9) to read the matrix. The control units 1412 and 1612 (FIGS. 8 and 9) use the quantization coefficient values from the same normalization coefficient matrix in DCT and DST by switching the reading order of the normalization coefficient matrix in DCT and DST. As a result, the problem of an increase in the amount of memory can be solved (see FIG. 12).

更に、別の方法として、正規化係数マトリクスの読み出し順を切替える代わりに、マトリクス読出制御部１４１２，１６１２（図８及び図９）によって、ＤＣＴの正規化係数マトリクスのみを正規化係数マトリクス記憶部１４１１，１６１１（図８及び図９）に格納しておき、ＤＳＴの変換行列を図１３及び図１４に示すごとく変形し、ＤＣＴ及びＤＳＴにおける互いの変換行列が、ＤＣＴ及びＤＳＴの切り替えに応じて切り替えられるスキャニングオーダーに対応して当該共通の正規化係数マトリクスを用いて量子化することができる直交関係で規定する。これにより、スキャニング部１２５によって当該直交変換の切り替えを表す制御信号に従って、量子化後の伝送すべき係数を１次元配列に変換する際のスキャニングオーダーをＤＣＴ（図１５）とＤＳＴ（図１６）で切替えることで、ＤＣＴとＤＳＴで同一の正規化係数マトリクスからそれぞれの量子化係数の値を用いることができるようになり、メモリ量の増大の問題を解決することができる。 Furthermore, as another method, instead of switching the reading order of the normalization coefficient matrix, only the normalization coefficient matrix storage part 1411 of the DCT normalization coefficient matrix is obtained by the matrix read control units 1412 and 1612 (FIGS. 8 and 9). 1611 (FIGS. 8 and 9), the DST transformation matrix is transformed as shown in FIGS. 13 and 14, and the mutual transformation matrices in DCT and DST are switched according to the switching of DCT and DST. It is defined by an orthogonal relationship that can be quantized using the common normalization coefficient matrix corresponding to the scanning order to be generated. Accordingly, the scanning order when transforming the coefficient to be transmitted after quantization into a one-dimensional array according to the control signal indicating the switching of the orthogonal transform by the scanning unit 125 is represented by DCT (FIG. 15) and DST (FIG. 16). By switching, it becomes possible to use the values of the respective quantized coefficients from the same normalized coefficient matrix in DCT and DST, and the problem of increase in the memory amount can be solved.

即ち、ＤＳＴの変換行列を変形して直交変換を施し、ＤＣＴと同一の正規化係数マトリクスを用いると、ＤＳＴの最低次数が（０，０）の位置でなく、（１，１）に配置されるため、量子化後の伝送すべき係数を１次元配列に変換する際のスキャニングオーダーを（１，１）から開始するようにする。 That is, when the transformation matrix of DST is transformed and orthogonal transformation is performed, and the same normalization coefficient matrix as DCT is used, the lowest order of DST is placed at (1, 1) instead of (0, 0). Therefore, the scanning order for converting the coefficient to be transmitted after quantization into a one-dimensional array is started from (1, 1).

符号化装置１０における局部復号時や、復号装置３０における逆量子化を行う場合も同様に、ＤＣＴ及びＤＳＴのいずれを選択したかを表す制御信号を取得して、符号化された当該フレーム画像を復号する際に、符号化された当該フレーム画像における量子化信号に対して、ＤＣＴ及びＤＳＴに共通する正規化係数マトリクスを用いて逆量子化処理を施すように構成する。 Similarly, when local decoding is performed in the encoding device 10 or when inverse quantization is performed in the decoding device 30, a control signal indicating which one of DCT and DST is selected is acquired, and the encoded frame image is obtained. At the time of decoding, the quantization signal in the encoded frame image is subjected to inverse quantization processing using a normalization coefficient matrix common to DCT and DST.

また、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４では、画面内予測時に、例えば４×４ブロック（輝度信号）及び２×２ブロック（色差信号）のＤＣＴの直交変換係数の直流成分に対しては、１つの直流成分の画素ブロックを構成してアダマール変換を実行することで空間周波数分解能を改善する技術があるが、本発明に係るＤＳＴの最低次数の成分に対して同様にアダマール変換を実行するよう構成することもできる。 In addition, MPEG-4 AVC / H. In H.264, a pixel block of one DC component is configured for DC components of DCT orthogonal transform coefficients of, for example, 4 × 4 blocks (luminance signals) and 2 × 2 blocks (color difference signals) during intra prediction. Although there is a technique for improving the spatial frequency resolution by executing the Hadamard transform, the Hadamard transform can be similarly performed on the lowest-order component of the DST according to the present invention.

これにより、演算負担をさほど増大させることなく、ＤＣＴベースの符号化方式ではエネルギー集中度が十分でないような画像信号に対しても、ブロック内の画素間相関の強さに適切に応じた直交変換係数を判断して符号化する符号化装置、及び、この符号化した信号を復号する復号装置を実現することができる。 This makes it possible to perform orthogonal transformation in accordance with the strength of inter-pixel correlation within a block, even for image signals whose energy concentration is not sufficient in the DCT-based coding method without increasing the computational burden. An encoding device that determines and encodes a coefficient, and a decoding device that decodes the encoded signal can be realized.

更に、本発明の一態様として、符号化装置１０及び復号装置３０をコンピュータとして構成させることができる。コンピュータに、前述した符号化装置１０及び復号装置３０の各構成要素を実現させるためのプログラムは、コンピュータの内部又は外部に備えられる記憶部に記憶される。そのような記憶部は、外付けハードディスクなどの外部記憶装置、或いはＲＯＭ又はＲＡＭなどの内部記憶装置で実現することができる。コンピュータに備えられる制御部は、中央演算処理装置（ＣＰＵ）などの制御で実現することができる。即ち、ＣＰＵが、各構成要素の機能を実現するための処理内容が記述されたプログラムを、適宜、記憶部から読み込んで、各構成要素の機能をコンピュータ上で実現させることができる。ここで、各構成要素の機能をハードウェアの一部で実現しても良い。 Furthermore, as one aspect of the present invention, the encoding device 10 and the decoding device 30 can be configured as a computer. A program for causing a computer to realize each component of the encoding device 10 and the decoding device 30 described above is stored in a storage unit provided inside or outside the computer. Such a storage unit can be realized by an external storage device such as an external hard disk or an internal storage device such as ROM or RAM. The control unit provided in the computer can be realized by controlling a central processing unit (CPU) or the like. In other words, the CPU can appropriately read from the storage unit a program in which the processing content for realizing the function of each component is described, and realize the function of each component on the computer. Here, the function of each component may be realized by a part of hardware.

また、この処理内容を記述したプログラムを、例えばＤＶＤ又はＣＤ−ＲＯＭなどの可搬型記録媒体の販売、譲渡、貸与等により流通させることができるほか、そのようなプログラムを、例えばネットワーク上にあるサーバの記憶部に記憶しておき、ネットワークを介してサーバから他のコンピュータにそのプログラムを転送することにより、流通させることができる。 In addition, the program describing the processing contents can be distributed by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM, and such a program can be distributed on a server on a network, for example. Can be distributed by transferring the program from the server to another computer via the network.

また、そのようなプログラムを実行するコンピュータは、例えば、可搬型記録媒体に記録されたプログラム又はサーバから転送されたプログラムを、一旦、自己の記憶部に記憶することができる。また、このプログラムの別の実施態様として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよく、更に、このコンピュータにサーバからプログラムが転送される度に、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。 In addition, a computer that executes such a program can temporarily store, for example, a program recorded on a portable recording medium or a program transferred from a server in its own storage unit. As another embodiment of the program, the computer may directly read the program from a portable recording medium and execute processing according to the program, and each time the program is transferred from the server to the computer. In addition, the processing according to the received program may be executed sequentially.

以上、具体例を挙げて本発明の実施例を詳細に説明したが、本発明の特許請求の範囲から逸脱しない限りにおいて、あらゆる変形や変更が可能であることは当業者に明らかである。特に、正規化部における正規化処理は、教師信号による修正処理における信号間の相関をとるためのレベル調整を行うものであればよい。 While the embodiments of the present invention have been described in detail with specific examples, it will be apparent to those skilled in the art that various modifications and changes can be made without departing from the scope of the claims of the present invention. In particular, the normalization process in the normalization unit only needs to perform level adjustment for correlation between signals in the correction process using the teacher signal.

本発明によれば、ＤＣＴベースの符号化方式ではエネルギー集中度が十分でないような画像信号に対しても、最適な直交変換係数を判断して符号化することが可能となるので、映像符号化技術に有用である。 According to the present invention, it is possible to determine and code an optimal orthogonal transform coefficient even for an image signal whose energy concentration is not sufficient in the DCT-based encoding method. Useful for technology.

１０符号化装置
１３直交変換部
１４量子化部
１５整数精度変換部
１６逆量子化部
１７逆直交変換部
１８整数精度逆変換部
１９コスト算出処理ユニット
３０復号装置
３３逆量子化部
３４逆直交変換部
３５整数精度逆変換部
１００符号化装置
１０１ブロック化部
１０２減算部
１０３直交変換部
１０４量子化部
１０５整数精度変換部
１０６逆量子化部
１０７逆直交変換部
１０８整数精度逆変換部
１０９加算部
１２０デブロッキングフィルタ
１２１メモリ
１２２フレーム内予測部
１２３動き補償予測部
１２４切替スイッチ
１２５スキャニング部
１２６エントロピー符号化部
１３１バッファ
１３４コスト算出部
１３４ａ，１３４ｂバッファ
１３５切替スイッチ
１３６コスト算出部
１３７ａ，１３７ｂ量子化部
１３８ａ，１３８ｂエントロピー符号化部
１３９ａ，１３９ｂバッファ
１４０切替スイッチ
１４１コスト算出部
１７１ａ整数精度ＩＤＣＴ
１７１ｂ整数精度ＩＤＳＴ
１７２ａ，１７２ｂ切替スイッチ
３００復号装置
３０１エントロピー復号部
３０２スキャニング部
３０３逆量子化部
３０４逆直交変換部
３０５整数精度逆変換部
３０６加算部
３０７デブロッキングフィルタ
３０８メモリ
３０９フレーム内予測部
３１０動き補償予測部
３１１切替スイッチ
３１２並べ替え部
１０３１ａ整数精度ＤＣＴ
１０３１ｂ整数精度ＤＳＴ
１０３２０ランレングス比較部
１０３３切替スイッチ
１３４１アクティビティ算出部
１３６１，１３６２マクロブロックエネルギー集中度算出部
１３６３比較部
１４１１正規化係数マトリクス記憶部
１４１２マトリクス読出制御部
１４１３量子化ステップ算出部
１４１４量子化演算部
１４２１，１４２２歪み量算出部
１４２３比較部
１６１１正規化係数マトリクス記憶部
１６１２マトリクス読出制御部
１６１３量子化ステップ算出部
１６１４逆量子化演算部 DESCRIPTION OF SYMBOLS 10 Encoding apparatus 13 Orthogonal transformation part 14 Quantization part 15 Integer precision transformation part 16 Inverse quantization part 17 Inverse orthogonal transformation part 18 Integer precision inverse transformation part 19 Cost calculation processing unit 30 Decoding apparatus 33 Inverse quantization part 34 Inverse orthogonal transformation Unit 35 integer precision inverse transform unit 100 encoding device 101 blocking unit 102 subtracting unit 103 orthogonal transform unit 104 quantizing unit 105 integer precision transforming unit 106 inverse quantization unit 107 inverse orthogonal transform unit 108 integer precision inverse transform unit 109 adding unit 120 Deblocking Filter 121 Memory 122 Intraframe Prediction Unit 123 Motion Compensation Prediction Unit 124 Change Switch 125 Scanning Unit 126 Entropy Coding Unit 131 Buffer 134 Cost Calculation Unit 134a, 134b Buffer 135 Change Switch 136 Cost Calculation Unit 137a, 137b Amount Unit 138a, 138b entropy encoding unit 139a, 139b buffer 140 changeover switch 141 cost calculation unit 171a integer precision IDCT
171b Integer precision IDST
172a, 172b changeover switch 300 decoding device 301 entropy decoding unit 302 scanning unit 303 inverse quantization unit 304 inverse orthogonal transform unit 305 integer precision inverse transform unit 306 addition unit 307 deblocking filter 308 memory 309 intra-frame prediction unit 310 motion compensation prediction unit 311 changeover switch 312 rearrangement unit 1031a integer precision DCT
1031b integer precision DST
1032 0-run length comparison unit 1033 changeover switch 1341 activity calculation unit 1361, 1362 macroblock energy concentration calculation unit 1363 comparison unit 1411 normalization coefficient matrix storage unit 1412 matrix read control unit 1413 quantization step calculation unit 1414 quantization operation unit 1421 , 1422 Distortion amount calculation unit 1423 Comparison unit 1611 Normalization coefficient matrix storage unit 1612 Matrix readout control unit 1613 Quantization step calculation unit 1614 Inverse quantization operation unit

Claims

A difference signal between the encoding target block and the prediction image is generated based on the encoding target block obtained by dividing the frame image and the prediction image generated for the encoding target block. And an encoding device for encoding the difference signal ,
An integer conversion process for applying an integer conversion process selected from a first integer conversion process and a second integer conversion process different from the first integer conversion process to the difference signal and performing an integer conversion process on the difference signal And
Using the normalization coefficient matrix common to the difference signal to which the first integer conversion process is applied and the difference signal to which the second integer conversion process is applied, the integer conversion processing unit performs the integer conversion process. And a quantizing unit that quantizes the difference signal .

The encoding apparatus according to claim 1, wherein the first integer transform process is a discrete cosine transform process, and the second integer transform process is a discrete sine transform process .

The integer conversion processing unit selects an integer conversion process to be applied to the other difference signal from the first integer conversion process and the second integer conversion process based on the block size of the encoding target block. The encoding apparatus according to claim 1 or 2 , characterized by the above.

The quantization unit switches the reading order of the normalization coefficient matrix between the difference signal to which the first integer transformation process is applied and the difference signal to which the second integer transformation process is applied. The encoding apparatus according to any one of claims 1 to 3, wherein the encoding apparatus is characterized in that:

A decoding device for decoding an encoded frame image encoded by an encoding device for encoding a frame image,
  The encoding of the frame image is performed based on the encoding target block obtained by dividing the frame image and the prediction image generated for the encoding target block, and the prediction image. Including a procedure for generating a difference signal between and encoding the difference signal,
  An inverse quantization unit that performs inverse quantization of the differential signal;
  The inverse integer transform process selected from the first inverse integer transform process and the second inverse integer transform process different from the first inverse integer transform process is subjected to the inverse quantization by the inverse quantization unit. An inverse integer conversion processing unit that applies the difference signal and performs an inverse integer conversion process on the difference signal;
  The inverse quantization unit uses the normalization coefficient matrix common to the difference signal to which the first inverse integer transform process is applied and the difference signal to which the second inverse integer transform process is applied, to use the difference signal. The decoding apparatus characterized by performing inverse quantization of.

The decoding apparatus according to claim 5, wherein the first inverse integer transform process is an inverse discrete cosine transform process, and the second inverse integer transform process is an inverse discrete sine transform process.

The inverse integer transform processing unit applies an inverse integer transform to be applied to the other difference signal from the first inverse integer transform process and the second inverse integer transform process based on the block size of the encoding target block. The decoding apparatus according to claim 5 or 6, wherein a process is selected.

The inverse quantization unit may change a reading order of the normalized coefficient matrix between the difference signal to which the first inverse integer transform process is applied and the difference signal to which the second inverse integer transform process is applied. 8. The decoding device according to claim 5, wherein switching is performed.