JP4529615B2

JP4529615B2 - Encoding apparatus, encoding method, encoding method program, and recording medium recording the encoding method program

Info

Publication number: JP4529615B2
Application number: JP2004276393A
Authority: JP
Inventors: 数史佐藤; 潤一田中; 陽一矢ヶ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2004-09-24
Filing date: 2004-09-24
Publication date: 2010-08-25
Anticipated expiration: 2024-09-24
Also published as: JP2006094081A

Description

本発明は、符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体に関し、動画による撮像結果を記録するビデオカメラ、電子スチルカメラ、監視装置等に適用することができる。本発明は、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出して画像データを符号化処理する場合に、アクティビティによりコスト値を補正して最適モードを検出することにより、コスト関数によりイントラ予測モード、インター予測モードから最適モードを選択して画像データを符号化処理する場合に、アクティビティの低い領域における画質劣化を防止することができるようにする。 The present invention relates to an encoding apparatus, an encoding method, an encoding method program, and a recording medium on which the encoding method program is recorded, and is applied to a video camera, an electronic still camera, a monitoring apparatus, and the like that record imaging results of moving images. be able to. The present invention detects the optimum mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing the cost values with the cost function indicating the coding efficiency. By detecting the optimal mode by correcting the cost value by the activity, image quality degradation in the low activity area when the optimal mode is selected from the intra prediction mode and the inter prediction mode by the cost function and the image data is encoded. To be able to prevent.

近年、放送局、一般家庭等に係る動画の伝送、記録においては、画像データの冗長性を有効に利用して効率良く画像データを伝送、蓄積する装置が普及しつつあり、このような装置は、例えばＭＰＥＧ（Moving Picture Experts Group ）等の方式に準拠して、離散コサイン変換等の直交変換と動き補償とにより画像データをデータ圧縮するように構成されている。 In recent years, in the transmission and recording of moving images related to broadcasting stations, general homes, etc., devices that efficiently transmit and store image data by effectively using the redundancy of image data are becoming popular. For example, in accordance with a method such as MPEG (Moving Picture Experts Group), image data is compressed by orthogonal transform such as discrete cosine transform and motion compensation.

ここでこのような方式の１つであるＭＰＥＧ２（ISO/IEC 13818-2 ）は、汎用の画像符号化方式として定義された方式であり、飛び越し走査方式、順次走査方式の双方に対応できるように、また標準解像度画像、高精細画像の双方に対応できるように定義され、これらにより現在、プロフェッショナル用途及びコンシューマー用途の広範なアプリケーションに広く用いられている。具体的にＭＰＥＧ２によれば、例えば７２０×４８０画素による標準解像度、飛び越し走査方式の画像データを４〜８〔Ｍｂｐｓ〕のビットレートにデータ圧縮して、また１９２０×１０８８画素による高解像度、飛び越し走査方式の画像データを１８〜２２〔Ｍｂｐｓ〕のビットレートにデータ圧縮して、高画質で高い圧縮率を確保することができる。 Here, MPEG2 (ISO / IEC 13818-2), which is one of such systems, is a system defined as a general-purpose image coding system so that it can handle both the interlace scanning system and the progressive scanning system. In addition, it is defined so as to be compatible with both standard resolution images and high-definition images, and is now widely used in a wide range of applications for professional use and consumer use. Specifically, according to MPEG2, for example, a standard resolution of 720 × 480 pixels and interlaced scanning image data are compressed to a bit rate of 4 to 8 Mbps, and a high resolution of 1920 × 1088 pixels and interlaced scanning are used. The image data of the system can be compressed to a bit rate of 18 to 22 [Mbps], and a high compression rate can be ensured with high image quality.

しかしながらＭＰＥＧ２は、放送用に適合した高画質符号化方式であり、ＭＰＥＧ１より符号量の少ない高圧縮率の符号化方式には対応していない。これに対して近年の携帯端末の普及により、このようなＭＰＥＧ１より符号量の少ない高圧縮率の符号化方式のニーズの高まりが予測される。このためＭＰＥＧ４による符号化方式の規格が、ＩＳＯ／ＩＥＣ（International 0rganization for Standardization／International Electrotechnical Commission ）１４４９６−２により１９９８年１２月に国際標準に承認された。 However, MPEG2 is a high-quality encoding system suitable for broadcasting, and does not support a high compression rate encoding system with a smaller code amount than MPEG1. On the other hand, with the spread of portable terminals in recent years, it is expected that there will be an increasing need for an encoding method with a high compression rate with a smaller code amount than MPEG1. For this reason, an MPEG-4 encoding system standard was approved in December 1998 by ISO / IEC (International Organization for Standardization / International Electrotechnical Commission) 14496-2.

またこのような方式にあっては、当初はテレビ会議用の画像符号化を目的としたものであったＨ２６Ｌ（ITU-T Q6/16 VCEG）の規格化が進み、ＭＰＥＧ２、ＭＰＥＧ４に比して演算量が増大するものの、ＭＰＥＧ２、ＭＰＥＧ４に比して高い符号化効率を確保できるようになり、またＭＰＥＧ４の活動の一環として、このＨ２６Ｌをベースにして各種機能を取り入れ、さらに一段と高い符号化効率を確保する符号化方式の標準化が、Joint Model of Enhanced-Compression Video Codingとして進められ、これらの方式にあっては、２００３年３月に、Ｈ２６４及びＭＰＥＧ−４Ｐａｒｔ１０（ＡＶＣ：Advanced Video Coding ）との名称により国際標準に設定された。 In such a system, the standardization of H26L (ITU-T Q6 / 16 VCEG), which was originally intended for video coding for video conferencing, has progressed, compared to MPEG2 and MPEG4. Although the amount of computation increases, it becomes possible to secure higher encoding efficiency compared to MPEG2 and MPEG4. As part of MPEG4 activities, various functions are incorporated based on this H26L, and the encoding efficiency is even higher. Standardization of coding schemes to ensure the image quality is being promoted as Joint Model of Enhanced-Compression Video Coding. In these schemes, in March 2003, H264 and MPEG-4 Part 10 (AVC: Advanced Video Coding) Was set as an international standard.

ここで図３は、このＡＶＣに基づく符号化装置を示すブロック図である。この符号化装置１は、複数のイントラ予測モードと複数のインター予測モードとから最適な予測モードを選択し、この選択した予測モードによる予測値を画像データから減算して差分データを生成し、この差分データを直交変換処理、量子化処理、可変長符号化処理することにより、この画像データをイントラ符号化、インター符号化により符号化処理する。 Here, FIG. 3 is a block diagram showing an encoding device based on this AVC. The encoding device 1 selects an optimal prediction mode from a plurality of intra prediction modes and a plurality of inter prediction modes, generates a difference data by subtracting a prediction value based on the selected prediction mode from image data, The difference data is subjected to orthogonal transform processing, quantization processing, and variable length coding processing, whereby the image data is coded by intra coding and inter coding.

すなわちこの符号化装置１において、アナログディジタル変換回路（Ａ／Ｄ）２は、ビデオ信号ＳＶをアナログディジタル変換処理して画像データＤ１を出力する。画面並べ替えバッファ３は、このアナログディジタル変換回路２から出力される画像データＤ１を入力し、この符号化装置１の符号化処理に係るＧＯＰ（Group of Pictures ）構造に応じて、この画像データＤ１のフレームを並べ替えて出力する。 That is, in this encoding apparatus 1, the analog-digital conversion circuit (A / D) 2 performs analog-digital conversion processing on the video signal SV and outputs image data D1. The screen rearrangement buffer 3 receives the image data D1 output from the analog-digital conversion circuit 2, and the image data D1 according to the GOP (Group of Pictures) structure related to the encoding process of the encoding device 1. Sort and output the frames.

減算回路４は、この画面並べ替えバッファ３から出力される画像データＤ１を受け、イントラ符号化においては、イントラ予測回路５で生成される予測値との差分データＤ２を生成して出力するのに対し、インター符号化においては、動き予測・補償回路６で生成される予測値との差分データＤ２を生成して出力する。直交変換回路７は、減算回路４の出力データＤ２を入力し、離散コサイン変換、カルーネン・レーベ変換等の直交変換処理を実行し、その処理結果による変換係数データＤ３を出力する。 The subtraction circuit 4 receives the image data D1 output from the screen rearrangement buffer 3, and generates and outputs difference data D2 from the prediction value generated by the intra prediction circuit 5 in intra coding. On the other hand, in inter coding, difference data D2 from the prediction value generated by the motion prediction / compensation circuit 6 is generated and output. The orthogonal transformation circuit 7 receives the output data D2 of the subtraction circuit 4, performs orthogonal transformation processing such as discrete cosine transformation and Karhunen-Labe transformation, and outputs transformation coefficient data D3 based on the processing result.

量子化回路８は、レート制御回路９のレート制御による量子化スケールにより、この変換係数データＤ３を量子化して出力する。可逆符号化回路１０は、この量子化回路８の出力データを可変長符号化、算術符号化等により可逆符号化処理して出力する。また可逆符号化回路１０は、イントラ符号化に係るイントラ予測モードに関する情報、インター符号化に係る動きベクトルに関する情報等をイントラ予測回路５、動き予測・補償回路６から取得し、これらの情報を出力データＤ４のヘッダ情報に設定して出力する。 The quantization circuit 8 quantizes the transform coefficient data D3 by the quantization scale by the rate control of the rate control circuit 9 and outputs it. The lossless encoding circuit 10 performs lossless encoding processing on the output data of the quantization circuit 8 by variable length encoding, arithmetic encoding, or the like, and outputs the result. Further, the lossless encoding circuit 10 acquires information on the intra prediction mode related to intra encoding, information about a motion vector related to inter encoding, and the like from the intra prediction circuit 5 and the motion prediction / compensation circuit 6, and outputs these information. Set in the header information of data D4 and output.

蓄積バッファ１１は、この可逆符号化回路１０の出力データＤ４を蓄積して続く伝送路の伝送速度により出力する。レート制御回路９は、この蓄積バッファ１１の空き容量の監視により符号化処理による発生符号量を監視すると共に、この監視結果により量子化回路８における量子化スケールを切り換え、これによりこの符号化装置１による発生符号量を制御する。 The accumulation buffer 11 accumulates the output data D4 of the lossless encoding circuit 10 and outputs it at the transmission rate of the subsequent transmission path. The rate control circuit 9 monitors the amount of code generated by the encoding process by monitoring the free capacity of the storage buffer 11, and switches the quantization scale in the quantization circuit 8 based on the monitoring result, whereby the encoding device 1 Controls the amount of generated code.

逆量子化回路１３は、量子化回路８の出力データを逆量子化処理し、これにより量子化回路８の入力データを再生する。逆直交変換回路１４は、逆量子化回路１３の出力データを逆直交変換処理し、これにより直交変換回路７の入力データを再生する。デブロックフィルタ１５は、この逆直交変換回路１４の出力データよりブロック歪を除去して出力する。フレームメモリ１６は、このデブロックフィルタ１５の出力データに、適宜、イントラ予測回路５又は動き予測・補償回路６により生成される予測値を加算して参照画像情報として記録する。 The inverse quantization circuit 13 performs inverse quantization processing on the output data of the quantization circuit 8, thereby reproducing the input data of the quantization circuit 8. The inverse orthogonal transform circuit 14 performs inverse orthogonal transform processing on the output data of the inverse quantization circuit 13, thereby reproducing the input data of the orthogonal transform circuit 7. The deblocking filter 15 removes block distortion from the output data of the inverse orthogonal transform circuit 14 and outputs the result. The frame memory 16 appropriately adds a prediction value generated by the intra prediction circuit 5 or the motion prediction / compensation circuit 6 to the output data of the deblocking filter 15 and records it as reference image information.

しかして動き予測・補償回路６は、インター符号化において、このフレームメモリ１６に保持された参照画像情報による予測フレームより画面並べ替えバッファ３から出力される画像データの動きベクトルを検出し、またこの検出した動きベクトルによりフレームメモリ１６に保持した参照画像情報を動き補償して予測画像情報を生成し、この予測画像情報による予測値を減算回路４に出力する。 Therefore, the motion prediction / compensation circuit 6 detects the motion vector of the image data output from the screen rearrangement buffer 3 from the prediction frame based on the reference image information held in the frame memory 16 in the inter coding. Based on the detected motion vector, the reference image information held in the frame memory 16 is motion compensated to generate predicted image information, and a predicted value based on the predicted image information is output to the subtraction circuit 4.

イントラ予測回路５は、イントラ符号化において、フレームメモリ１６に蓄積された参照画像情報に基づいてイントラ予測モードを判定し、この判定結果により参照画像情報から予測画像情報の予測値を生成して減算回路４に出力する。 In the intra coding, the intra prediction circuit 5 determines the intra prediction mode based on the reference image information stored in the frame memory 16, and generates and subtracts a predicted value of the predicted image information from the reference image information based on the determination result. Output to circuit 4.

これらによりこの符号化方式においては、インター符号化とイントラ符号化とでそれぞれインター予測に係る動き補償による差分データＤ２とイントラ予測による差分データＤ２とを生成し、これらの差分データＤ２を直交変換処理、量子化処理、可変長符号化処理して伝送する。 Accordingly, in this encoding method, difference data D2 by motion compensation related to inter prediction and difference data D2 by intra prediction are generated by inter encoding and intra encoding, respectively, and the difference data D2 is subjected to orthogonal transform processing. , Quantization processing, variable length coding processing, and transmission.

図４は、このようにして符号化処理された符号化データＤ４を復号化処理する復号化装置を示すブロック図である。この復号化装置２０において、蓄積バッファ２１は、伝送路を介して入力される符号化データＤ４を一時蓄積して出力する。可逆復号化回路２２は、この蓄積バッファ２１の出力データを可変長復号化、算術復号化等により復号化処理し、符号化装置１における可逆符号化回路１０の入力データを再生する。またこのときこの出力データがイントラ符号化されたものである場合、ヘッダに格納されたイントラ予測モードの情報を復号化してイントラ予測回路２３に伝送するのに対し、この出力データがインター符号化されたものである場合、ヘッダに格納された動きベクトルに関する情報を復号して動き予測・補償回路２４へ転送する。 FIG. 4 is a block diagram showing a decoding apparatus that decodes the encoded data D4 encoded in this way. In the decoding device 20, the accumulation buffer 21 temporarily accumulates and outputs the encoded data D4 input via the transmission path. The lossless decoding circuit 22 decodes the output data of the storage buffer 21 by variable length decoding, arithmetic decoding, etc., and reproduces the input data of the lossless encoding circuit 10 in the encoding device 1. At this time, if the output data is intra-coded, the intra-prediction mode information stored in the header is decoded and transmitted to the intra-prediction circuit 23, whereas the output data is inter-coded. If it is, the information on the motion vector stored in the header is decoded and transferred to the motion prediction / compensation circuit 24.

逆量子化回路２５は、可逆復号化回路２２の出力データを逆量子化処理し、これにより符号化装置１の量子化回路８に入力される変換係数データＤ３を再生する。逆直交変換回路２６は、この逆量子化回路２５から出力される変換係数データを受け、４次の逆直交変換処理を実行し、これにより符号化装置１の直交変換回路７に入力される差分データＤ２を再生する。 The inverse quantization circuit 25 performs inverse quantization processing on the output data of the lossless decoding circuit 22, thereby reproducing the transform coefficient data D <b> 3 input to the quantization circuit 8 of the encoding device 1. The inverse orthogonal transform circuit 26 receives the transform coefficient data output from the inverse quantization circuit 25 and executes a fourth-order inverse orthogonal transform process, whereby the difference input to the orthogonal transform circuit 7 of the encoding device 1. Data D2 is reproduced.

加算器２７は、逆直交変換回路２６から出力される差分データＤ２を受け、イントラ符号化において、イントラ予測回路２３で生成される予測画像による予測値を加算して出力するのに対し、インター符号化において、動き予測・補償回路２４から出力される予測画像による予測値を加算して出力する。これにより加算器２７は、符号化装置１における減算回路４の入力データを再生する。 The adder 27 receives the difference data D2 output from the inverse orthogonal transform circuit 26, adds the predicted value based on the prediction image generated by the intra prediction circuit 23 in intra coding, and outputs the result. In the conversion, the predicted value based on the predicted image output from the motion prediction / compensation circuit 24 is added and output. As a result, the adder 27 reproduces the input data of the subtraction circuit 4 in the encoding device 1.

デブロックフィルタ２８は、この加算器２７の出力データよりブロック歪を除去して出力し、画面並べ替えバッファ２９は、このデブロックフィルタ２８から出力される画像データのフレームをＧＯＰ構造に応じて並べ替えて出力する。ディジタルアナログ変換回路（Ｄ／Ａ）３０は、この画面並べ替えバッファ２９の出力データをディジタルアナログ変換処理して出力する。 The deblock filter 28 removes block distortion from the output data of the adder 27 and outputs the result. The screen rearrangement buffer 29 arranges the frames of the image data output from the deblock filter 28 according to the GOP structure. Change the output. A digital / analog conversion circuit (D / A) 30 performs a digital / analog conversion process on the output data of the screen rearrangement buffer 29 and outputs the result.

フレームメモリ３１は、デブロックフィルタ２８の出力データを参照画像情報として記録して保持する。動き予測・補償回路２４は、インター符号化において、可逆復号化回路２２から通知される動きベクトルの情報によりフレームメモリ３１に保持された参照画像情報を動き補償して予測画像による予測値を生成し、この予測値を加算器２７に出力する。またイントラ予測回路２３は、イントラ符号化において、可逆復号化回路２２から通知されるイントラ予測モードによりフレームメモリ３１に保持された参照画像情報より予測画像による予測値を生成し、この予測値を加算器２７に出力する。 The frame memory 31 records and holds the output data of the deblock filter 28 as reference image information. In inter coding, the motion prediction / compensation circuit 24 performs motion compensation on the reference image information held in the frame memory 31 based on the motion vector information notified from the lossless decoding circuit 22, and generates a predicted value based on the predicted image. The predicted value is output to the adder 27. Also, the intra prediction circuit 23 generates a prediction value based on the prediction image from the reference image information held in the frame memory 31 by the intra prediction mode notified from the lossless decoding circuit 22 in the intra coding, and adds this prediction value. To the device 27.

しかしてこのような一連の処理によるＡＶＣの符号化処理においては、図５に示すように、１つのマクロブロックが、輝度信号Ｙでは１６×１６画素により形成されるのに対し、色差信号Ｃｒ、Ｃｂでは８×８画素により形成され、それぞれマクロブロックを単位にして処理される。すなわちこれらマクロブロックは、数字０〜２５により示す４×４画素による小ブロックに分割され、各小ブロック毎に、差分データＤ２が直交変換処理、量子化処理される。 In the AVC encoding process by such a series of processes, as shown in FIG. 5, one macro block is formed by 16 × 16 pixels in the luminance signal Y, whereas the color difference signals Cr, Cb is formed by 8 × 8 pixels, and each macroblock is processed as a unit. That is, these macroblocks are divided into small blocks of 4 × 4 pixels indicated by numerals 0 to 25, and difference data D2 is subjected to orthogonal transform processing and quantization processing for each small block.

この処理において、色差信号Ｃｒ、Ｃｂは、直交変換処理による係数から直流成分がマクロブロック毎に集められて２×２マトリックスが形成され、この２×２マトリックスが２次のアダマール変換処理の後、量子化処理される。また後述するイントラ１６×１６予測モードによる場合、輝度信号Ｙは、直交変換処理による係数から直流成分がマクロブロック毎に集められて４×４マトリックスが形成され、この４×４マトリックスが４次のアダマール変換処理後、量子化処理される。 In this process, the chrominance signals Cr and Cb are obtained by collecting DC components for each macroblock from the coefficients obtained by the orthogonal transform process to form a 2 × 2 matrix. Quantized. In addition, in the case of the intra 16 × 16 prediction mode described later, the luminance signal Y is obtained by collecting DC components for each macroblock from coefficients obtained by orthogonal transform processing to form a 4 × 4 matrix. After Hadamard transform processing, quantization processing is performed.

しかしてこのような符号化処理に係るイントラ符号化は、輝度信号の処理に関して、イントラ４×４予測モードとイントラ１６×１６予測モードとが用意されている。ここでＡＶＣでは上述したように４×４画素のブロック単位で差分データＤ２を直交変換処理し、イントラ４×４予測モードは、この直交変換処理のブロック単位で、イントラ予測に係る予測値を生成するモードである。これに対してイントラ１６×１６予測モードは、この直交変換処理のブロックの複数個を単位にしてイントラ予測に係る予測値を生成するモードであり、この複数個が水平方向及び垂直方向にそれぞれ４個に設定される。 For intra coding related to such coding processing, an intra 4 × 4 prediction mode and an intra 16 × 16 prediction mode are prepared for luminance signal processing. In the AVC, as described above, the difference data D2 is subjected to orthogonal transform processing in units of 4 × 4 pixels, and the intra 4 × 4 prediction mode generates prediction values related to intra prediction in units of blocks of the orthogonal transform processing. It is a mode to do. On the other hand, the intra 16 × 16 prediction mode is a mode for generating a prediction value related to intra prediction in units of a plurality of blocks of the orthogonal transform process, and the plurality of 4 × 4 in the horizontal direction and the vertical direction, respectively. Set to

このうちイントラ４×４予測モードでは、図６に示すように、予測値を生成する４×４画素ａ〜ｐによるブロックに対して、近傍１３個の画素Ａ〜Ｍの一部が予測値の生成に供する予測画素に設定され、この予測画素より予測値が生成される。なおここでこの１３個の画素Ａ〜Ｍは、このブロックの走査開始端側、垂直方向に隣接する４個の画素Ａ〜Ｄと、この４個の画素Ａ〜Ｄの走査終了端側の画素Ｄに続く４個の画素Ｅ〜Ｆと、このブロックの走査開始端側、水平方向に隣接する４個の画素Ｉ〜Ｌと、この水平方向に隣接する４個の画素Ｉ〜Ｌのうちの走査開始端側の画素Ｉの上方に位置する画素Ｍとにより形成される。 Among them, in the intra 4 × 4 prediction mode, as shown in FIG. 6, a part of the neighboring 13 pixels A to M has a predicted value for a block of 4 × 4 pixels a to p that generate a predicted value. It is set to a prediction pixel to be used for generation, and a prediction value is generated from this prediction pixel. Here, the thirteen pixels A to M are the four pixels A to D adjacent in the vertical direction in the scanning start side of the block, and the pixels on the scanning end side of the four pixels A to D. Among four pixels E to F following D, four pixels I to L adjacent in the horizontal direction on the scanning start end side of this block, and four pixels I to L adjacent in the horizontal direction The pixel M is located above the pixel I on the scanning start end side.

イントラ４×４予測モードでは、これら１３個の予測画素Ａ〜Ｍと、予測値の生成に供する４×４個の画素ａ〜ｐとの相対的な関係により、図７及び図８に示すように、モード０〜モード８の予測モードが定義されている。すなわち図６に示すように、例えばモード０及び１では、予測値の生成に使用する１３個の予測画素Ａ〜Ｍのうち、それぞれ垂直方向及び水平方向に隣接する予測画素Ａ〜Ｄ及びＩ〜Ｌにより予測値を生成する。 In the intra 4 × 4 prediction mode, as shown in FIG. 7 and FIG. 8, depending on the relative relationship between the 13 predicted pixels A to M and the 4 × 4 pixels a to p used for generating a predicted value. In addition, prediction modes of mode 0 to mode 8 are defined. That is, as shown in FIG. 6, for example, in modes 0 and 1, among the 13 prediction pixels A to M used to generate a prediction value, prediction pixels A to D and I to I that are adjacent in the vertical direction and the horizontal direction, respectively. A predicted value is generated by L.

より具体的には、図９（Ａ）において矢印により示すように、モード０は、垂直方向に隣接する予測画素Ａ〜Ｄより予測値を生成するモードであり、次式により示すように、予測値を生成する４×４個の画素ａ〜ｐのうち、垂直方向に連続する１列目の画素ａ、ｅ、ｉ、ｍは、その上方向の画素Ａが予測画素に設定される。また続く２列目の画素ｂ、ｆ、ｊ、ｎは、その上方向の画素Ｂが予測画素に設定され、続く３列目及び４列目の画素ｃ、ｇ、ｋ、ｏ及びｄ、ｈ、ｌ、ｐは、それぞれ上方の画素Ｃ及びＤが予測画素に設定され、これら予測画素Ａ〜Ｄの画素値がそれぞれ対応する画素ａ〜ｐの予測値に設定される。なおモード０は、このモードにおける予測画素Ａ〜Ｄが有意である場合にのみ適用される。 More specifically, as indicated by an arrow in FIG. 9A, mode 0 is a mode for generating a prediction value from prediction pixels A to D adjacent in the vertical direction. Among the 4 × 4 pixels a to p that generate values, the pixels a, e, i, and m in the first column that are continuous in the vertical direction have the pixel A in the upper direction set as a predicted pixel. In the subsequent pixels b, f, j, and n in the second column, the pixel B in the upper direction is set as a predicted pixel, and the pixels c, g, k, o, and d, h in the subsequent third and fourth columns are set. , L, and p, the upper pixels C and D are set as predicted pixels, and the pixel values of the predicted pixels A to D are set to the predicted values of the corresponding pixels a to p, respectively. Note that mode 0 is applied only when the prediction pixels A to D in this mode are significant.

また図９（Ｂ）に示すように、モード１は、水平方向に隣接する予測画素Ｉ〜Ｌより予測値を生成するモードであり、次式により示すように、予測値を生成する４×４個の画素ａ〜ｐのうち、水平方向に連続する１ラインの画素ａ〜ｄは、その左方の画素Ｉが予測画素に設定される。また続く２ライン目の画素ｅ〜ｈは、その左方の画素Ｊが予測画素に設定され、続く３ライン目及び４ライン目の画素ｉ〜ｌ及びｍ〜ｐは、それぞれ左方の画素Ｋ及びＬが予測画素に設定され、これら予測画素Ｉ〜Ｌの画素値がそれぞれ対応する画素ａ〜ｐの予測値に設定される。なおモード１は、このモードにおける予測画素Ｉ〜Ｌが有意である場合にのみ適用される。 As shown in FIG. 9B, mode 1 is a mode for generating a prediction value from the prediction pixels I to L adjacent in the horizontal direction. As shown by the following equation, 4 × 4 for generating a prediction value. Among the pixels a to p, in the pixels a to d on one line continuous in the horizontal direction, the pixel I on the left side is set as the prediction pixel. The pixels e to h in the subsequent second line are set to the pixel J on the left side, and the pixels i to l and mp in the third line and the fourth line are respectively set to the left pixel K. And L are set as prediction pixels, and the pixel values of these prediction pixels I to L are set to the prediction values of the corresponding pixels a to p, respectively. Note that mode 1 is applied only when the prediction pixels I to L in this mode are significant.

これに対してモード２は、図９（Ｃ）に示すように、１３個の予測画素Ａ〜Ｍのうち、このブロックの垂直方向及び水平方向に隣接する画素Ａ〜Ｄ及びＩ〜Ｌより予測値を生成するモードであり、これらの画素Ａ〜Ｄ及びＩ〜Ｌが全て有意な場合に、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, mode 2 is predicted from pixels A to D and I to L adjacent in the vertical and horizontal directions of the block among the 13 prediction pixels A to M, as shown in FIG. This is a mode for generating values, and when these pixels A to D and I to L are all significant, predicted values of the pixels a to p are generated according to the following equations.

なおモード２においては、画素Ａ〜Ｄが全て有意でない場合、予測値は、（４）式により生成され、画素Ｉ〜Ｌが全て有意でない場合、予測値は、（５）式により生成され、画素Ａ〜Ｄ及びＩ〜Ｌが全て有意でない場合、予測値は値１２８に設定される。 In mode 2, when all of the pixels A to D are not significant, the predicted value is generated by the equation (4). When all of the pixels I to L are not significant, the predicted value is generated by the equation (5). If the pixels A to D and I to L are not all significant, the predicted value is set to the value 128.

これに対してモード３は、図９（Ｄ）に示すように、１３個の予測画素Ａ〜Ｍのうち、水平方向に連続する画素Ａ〜Ｈより予測値を生成するモードであり、これらの画素Ａ〜Ｈのうちの画素Ａ〜Ｄと画素Ｉ〜Ｍとが全て有意な場合にのみ適用されて、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, as shown in FIG. 9D, mode 3 is a mode in which predicted values are generated from pixels A to H that are continuous in the horizontal direction among the 13 predicted pixels A to M. This is applied only when the pixels A to D and the pixels I to M among the pixels A to H are significant, and the predicted values of the pixels a to p are generated by the following equation.

これに対してモード４は、図９（Ｅ）に示すように、１３個の予測画素Ａ〜Ｍのうち、４×４個の画素ａ〜ｐによるブロックに隣接する画素Ａ〜Ｄ、Ｉ〜Ｍにより予測値を生成するモードであり、これらの画素Ａ〜Ｄ、Ｉ〜Ｍが全て有意な場合にのみ適用されて、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, in the mode 4, as shown in FIG. 9E, among the 13 prediction pixels A to M, the pixels A to D and I to N adjacent to the block of 4 × 4 pixels ap are used. This is a mode for generating a predicted value by M, and is applied only when these pixels A to D and I to M are all significant, and the predicted value of each pixel a to p is generated by the following equation.

これに対してモード５は、図９（Ｆ）に示すように、１３個の予測画素Ａ〜Ｍのうち、４×４個の画素ａ〜ｐによるブロックに隣接する画素Ａ〜Ｄ、Ｉ〜Ｋ、Ｍとにより予測値を生成するモードであり、予測画素Ａ〜Ｄ、Ｉ〜Ｍが全て有意な場合にのみ適用されて、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, in the mode 5, as shown in FIG. 9F, among the 13 prediction pixels A to M, the pixels A to D and I to N adjacent to the block of 4 × 4 pixels ap are used. This is a mode for generating predicted values by K and M, and is applied only when the predicted pixels A to D and I to M are all significant, and the predicted values of the pixels a to p are generated by the following equations.

これに対してモード６は、図９（Ｇ）に示すように、１３個の予測画素Ａ〜Ｍのうち、４×４個の画素ａ〜ｐによるブロックに隣接する画素Ａ〜Ｃ、Ｉ〜Ｍとにより予測値を生成するモードであり、予測画素Ａ〜Ｄ、Ｉ〜Ｍが全て有意な場合にのみ適用されて、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, in the mode 6, as shown in FIG. 9G, among the 13 prediction pixels A to M, the pixels A to C, I to C adjacent to the block of 4 × 4 pixels a to p are arranged. M is a mode for generating a prediction value, and is applied only when the prediction pixels A to D and I to M are all significant, and the prediction value of each pixel a to p is generated by the following equation.

これに対してモード７は、図９（Ｈ）に示すように、１３個の予測画素Ａ〜Ｍのうち、４×４個の画素ａ〜ｐによるブロックの上方に隣接する４個の画素Ａ〜Ｄと、この４個の画素Ａ〜Ｄに続く４個の画素Ｅ〜Ｇとにより予測値を生成するモードであり、これらのうちの画素Ａ〜Ｄ及び画素Ｉ〜Ｍが全て有意な場合にのみ適用されて、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, in the mode 7, as shown in FIG. 9H, among the 13 prediction pixels A to M, four pixels A adjacent above the block of 4 × 4 pixels ap. To D and the four pixels E to G following the four pixels A to D are modes for generating a prediction value, and the pixels A to D and the pixels I to M are all significant. The prediction value of each pixel ap is generated by the following equation.

これに対してモード８は、図９（Ｉ）に示すように、１３個の予測画素Ａ〜Ｍのうち、４×４個の画素によるブロックの左方に隣接する４個の画素Ｉ〜Ｌにより予測値を生成するモードであり、画素Ａ〜Ｄ及び画素Ｉ〜Ｍが全て有意な場合にのみ適用されて、次式により各画素ａ〜ｐの予測値が生成される。 On the other hand, in the mode 8, as shown in FIG. 9I, four pixels I to L adjacent to the left of the block of 4 × 4 pixels among the 13 prediction pixels A to M. Is a mode for generating predicted values, and is applied only when the pixels A to D and the pixels I to M are all significant, and the predicted values of the pixels a to p are generated by the following equations.

このようなＡＶＣは、イントラ４×４予測モードによる符号化処理においては、ラスタ走査の順序による処理を有効に利用して予測モードを伝送対象に通知する。すなわち４×４予測モードによりイントラ符号化する場合において、図１０に示すように、処理対象であるブロックＣの予測モードIntra 4x4 pred mode Ｃに対して、水平方向及び垂直方向に処理を完了している隣接ブロックＡ及びＢの予測モードIntra 4x4 pred mode Ａ及びIntra 4x4 pred mode Ｂは、高い相関を有する。これによりこれら隣接ブロックＡ及びＢの予測モードIntra 4x4 pred mode Ａ及びIntra 4x4 pred mode Ｂを用いて、次式により、最も可能性の高い予測モードMost Probable Modeを定義する。なおこの（１２）式におけるｍｉｎの判定は、これら予測モードの伝送に供するコードmode numberにより実行し、コードmode number の値の小さい側の予測モードを最も可能性の高い予測モードMost Probable Modeに設定する。 In such an AVC, in the encoding process using the intra 4 × 4 prediction mode, the prediction mode is notified to the transmission target by effectively using the process based on the raster scanning order. That is, in the case of intra-encoding in the 4 × 4 prediction mode, as shown in FIG. The prediction modes Intra 4x4 pred mode A and Intra 4x4 pred mode B of adjacent blocks A and B have high correlation. Thus, the most probable prediction mode Most Probable Mode is defined by the following equation using the prediction modes Intra 4x4 pred mode A and Intra 4x4 pred mode B of the adjacent blocks A and B. The determination of min in this equation (12) is executed by the code mode number used for transmission of these prediction modes, and the prediction mode with the smaller value of the code mode number is set to the most likely prediction mode Most Probable Mode. To do.

またビットストリーム中に、この４×４画素によるブロックに係るパラメータとして、予測モードの伝送の有無を示すフラグprev intra 4x4 pred mode flag[luma 4x4 BlkIdx]と、予測モード rem intra 4x4 pred mode[luma 4x4 BlkIdx] とが定義され、復号側は、Ｃ言語の記述により図１１に示すようにこれら２つのパラメータを処理して、処理対象であるブロックＣの予測モードIntra 4x4 pred mode Ｃを検出する。なおここで[luma 4x4
BlkIdx] は、輝度データに係る対象ブロックを特定するブロック番号である。 In addition, in the bitstream, as a parameter related to the block of 4 × 4 pixels, a flag prev intra 4x4 pred mode flag [luma 4x4 BlkIdx] indicating whether or not the prediction mode is transmitted and a prediction mode rem intra 4x4 pred mode [luma 4x4 BlkIdx] is defined, and the decoding side processes these two parameters as shown in FIG. 11 according to the description in C language, and detects the prediction mode Intra 4x4 pred mode C of the block C to be processed. Here [luma 4x4
BlkIdx] is a block number that identifies a target block related to luminance data.

すなわちこの場合、予測モードの伝送の有無を示すフラグprev intra 4x4 pred mode flag[luma 4x4BlkIdx]が設定されている場合、隣接ブロックＡ及びＢの予測モードIntra 4x4 pred mode Ａ及びIntra 4x4 pred mode Ｂを用いて（１２）式により復号側で検出される最も可能性の高い予測モードMost Probable Modeを処理対象であるブロックＣの予測モードに設定する。またこのフラグprev intra 4x4 pred mode flag[luma 4x4 BlkIdx]が設定されていない場合にあって、最も可能性の高い予測モードMost Probable Modeより伝送された予測モード rem intra 4x4 pred mode[luma 4x4 BlkIdx] のコードmode number が小さい場合、伝送された予測モード rem intra 4x4 pred mode[luma 4x4 BlkIdx] を処理対象ブロックＣの予測モードに設定する。またフラグprev intra 4x4 pred mode flag[luma4x4 BlkIdx] が設定されていない場合にあって、最も可能性の高い予測モードMost Probable Modeより伝送された予測モード rem intra 4x4 pred mode[luma 4x4 BlkIdx] のコードmode number が小さくない場合、伝送された予測モード rem intra 4x4 pred mode[luma 4x4 BlkIdx] のコードmode number に値１を加算したコードmode number の予測モードを処理対象ブロックＣの予測モードに設定する。 That is, in this case, when the flag prev intra 4x4 pred mode flag [luma 4x4BlkIdx] indicating whether or not the prediction mode is transmitted is set, the prediction modes Intra 4x4 pred mode A and Intra 4x4 pred mode B of the adjacent blocks A and B are set. By using the equation (12), the most probable prediction mode Most Probable Mode detected on the decoding side is set as the prediction mode of the block C to be processed. Also, when this flag prev intra 4x4 pred mode flag [luma 4x4 BlkIdx] is not set, the prediction mode transmitted from the most probable prediction mode Most Probable Mode rem intra 4x4 pred mode [luma 4x4 BlkIdx] When the code mode number of is smaller, the transmitted prediction mode rem intra 4x4 pred mode [luma 4x4 BlkIdx] is set as the prediction mode of the processing target block C. Also, when the flag prev intra 4x4 pred mode flag [luma4x4 BlkIdx] is not set, the prediction mode transmitted from the most probable prediction mode Most Probable Mode rem intra 4x4 pred mode [luma 4x4 BlkIdx] code If the mode number is not small, the prediction mode of the code mode number obtained by adding the value 1 to the code mode number of the transmitted prediction mode rem intra 4x4 pred mode [luma 4x4 BlkIdx] is set as the prediction mode of the processing target block C.

これらにより符号化装置１は、最も可能性の高い予測モードMost Probable Modeが処理対象ブロックＣの予測モードと一致する場合、予測モードの伝送の有無を示すフラグprev intra 4x4 pred mode flag[luma 4x4 BlkIdx]を設定して、予測モード rem intra 4x4 pred mode[luma 4x4 BlkIdx] の伝送を中止し、伝送に供するデータ量を削減する。 Accordingly, when the most probable prediction mode Most Probable Mode matches the prediction mode of the processing target block C, the encoding device 1 uses the flag prev intra 4x4 pred mode flag [luma 4x4 BlkIdx ] To stop transmission of the prediction mode rem intra 4x4 pred mode [luma 4x4 BlkIdx] and reduce the amount of data used for transmission.

これに対してイントラ１６×１６予測モードでは、図１２に示すように、予測値を生成する１６×１６個の画素Ｐ（０，１５）〜Ｐ（１５，１５）によるブロックＢに対して、このブロックを構成する画素Ｐ（０，１５）〜Ｐ（１５，１５）と、このブロックＢの上方及び左方に隣接する画素Ｐ（０，−１）〜Ｐ（１５，−１）及びＰ（−１，０）〜Ｐ（−１，１５）が予測画素に設定され、これらの予測画素により予測値が生成される。 On the other hand, in the intra 16 × 16 prediction mode, as shown in FIG. 12, for a block B made up of 16 × 16 pixels P (0, 15) to P (15, 15) for generating a prediction value, Pixels P (0,15) to P (15,15) constituting this block, and pixels P (0, -1) to P (15, -1) and P adjacent above and to the left of this block B (-1, 0) to P (-1, 15) are set as prediction pixels, and a prediction value is generated by these prediction pixels.

イントラ１６×１６予測モードでは、図１３に示すように、モード０〜モード３の予測モードが定義され、このうちモード０は、処理対象ブロックＢの上方に隣接する画素Ｐ（０，−１）〜Ｐ（１５，−１）（Ｐ（ｘ，−１）；ｘ，ｙ＝−１〜１５）が有意な場合にのみ適用されて、次式により示すように、ブロックＢを構成する各画素Ｐ（０，１５）〜Ｐ（１５，１５）の予測値が生成される。これにより図１４（Ａ）に示すように、ブロックＢに隣接する各画素Ｐ（０，−１）〜Ｐ（１５，−１）の画素値によりブロックＢの垂直方向に連続する各画素の予測値が生成される。 In the intra 16 × 16 prediction mode, as shown in FIG. 13, prediction modes of mode 0 to mode 3 are defined. Of these, mode 0 is a pixel P (0, −1) adjacent above the processing target block B. -P (15, -1) (P (x, -1); x, y = -1 to 15) is applied only when it is significant, and as shown by the following equation, each pixel constituting the block B Predicted values of P (0,15) to P (15,15) are generated. As a result, as shown in FIG. 14A, prediction of each pixel continuous in the vertical direction of the block B is performed by the pixel values of the pixels P (0, −1) to P (15, −1) adjacent to the block B. A value is generated.

これに対してモード１は、ブロックＢの左方に隣接する画素Ｐ（−１，０）〜Ｐ（−１，１５）（Ｐ（−１，ｙ）；ｘ，ｙ＝−１〜１５）が有意な場合にのみ適用されて、次式により示すように、ブロックＢを構成する各画素Ｐ（０，１５）〜Ｐ（１５，１５）の予測値が生成され、これにより図１４（Ｂ）に示すように、ブロックＢに隣接する各画素Ｐ（−１，０）〜Ｐ（−１，１５）の画素値によりブロックＢの水平方向に連続する各画素の予測値が生成される。 On the other hand, in the mode 1, the pixels P (−1, 0) to P (−1, 15) adjacent to the left side of the block B (P (−1, y); x, y = −1 to 15). Is applied only when is significant, and predicted values of the pixels P (0,15) to P (15,15) constituting the block B are generated as shown by the following equation, and FIG. ), Predicted values of pixels continuous in the horizontal direction of the block B are generated by the pixel values of the pixels P (−1, 0) to P (−1, 15) adjacent to the block B.

これに対してモード２は、ブロックＢの上方及び左方に隣接する画素Ｐ（０，−１）〜Ｐ（１５，−１）及びＰ（−１，０）〜Ｐ（−１，１５）が全て有意な場合には、次式により予測値が求められ、これにより図１４（Ｃ）に示すように、これらの画素Ｐ（０，−１）〜Ｐ（１５，−１）及びＰ（−１，０）〜Ｐ（−１，１５）による画素値の平均値によりブロックＢを構成する各画素の予測値が生成される。 On the other hand, in the mode 2, the pixels P (0, −1) to P (15, −1) and P (−1, 0) to P (−1, 15) adjacent to the upper side and the left side of the block B are used. Are all significant, a predicted value is obtained by the following equation, and as shown in FIG. 14C, these pixels P (0, -1) to P (15, -1) and P ( A predicted value of each pixel constituting the block B is generated based on the average value of the pixel values of (−1, 0) to P (−1, 15).

なおモード２においては、これらブロックＢの上方及び左方に隣接する画素Ｐ（０，−１）〜Ｐ（１５，−１）及びＰ（−１，０）〜Ｐ（−１，１５）のうち、上方に隣接する画素Ｐ（−１，０）〜Ｐ（−１，１５）が有意でない場合、（１６）式が適用されて有意な側の隣接画素の平均値により各画素の予測値が生成される。また左方に隣接する画素Ｐ（−１，０）〜Ｐ（−１，１５）が有意でない場合、（１７）式が適用され、この場合も有意な側の隣接画素の平均値によりブロックＢを構成する各画素の予測値が生成される。またブロックＢの上方及び左方に隣接する画素Ｐ（０，−１）〜Ｐ（１５，−１）及びＰ（−１，０）〜Ｐ（−１，１５）の全てが有意でない場合、値１２８に予測値が設定される。 In mode 2, pixels P (0, -1) to P (15, -1) and P (-1, 0) to P (-1, 15) adjacent to the upper and left sides of the block B are used. Among them, when the pixels P (−1, 0) to P (−1, 15) adjacent to the upper side are not significant, the predicted value of each pixel is calculated based on the average value of the adjacent pixels on the significant side by applying the equation (16). Is generated. Further, when the pixels P (−1, 0) to P (−1, 15) adjacent to the left are not significant, the equation (17) is applied. In this case as well, the block B is determined by the average value of the adjacent pixels on the significant side. The predicted value of each pixel that constitutes is generated. If all of the pixels P (0, −1) to P (15, −1) and P (−1, 0) to P (−1, 15) adjacent to the upper and left sides of the block B are not significant, A predicted value is set to the value 128.

これに対してモード３は、ブロックＢの上方及び左方に隣接する画素Ｐ（０，−１）〜Ｐ（１５，−１）及びＰ（−１，０）〜Ｐ（−１，１５）が全て有意な場合にのみ適用され、次式により予測値が求められ、これにより図１４（Ｄ）に示すように、斜め方向の演算処理により各画素の予測値が生成される。 On the other hand, in the mode 3, the pixels P (0, −1) to P (15, −1) and P (−1, 0) to P (−1, 15) adjacent to the upper side and the left side of the block B are used. Is applied only when all are significant, and a predicted value is obtained by the following equation. As a result, as shown in FIG. 14D, a predicted value of each pixel is generated by a calculation process in an oblique direction.

このような輝度信号に係る各種のイントラ予測モードに対して、色差信号は、輝度信号におけるイントラ１６×１６予測モードと同様に予測モードが設定される。但し、イントラ１６×１６予測モードが１６×１６画素のマクロブロックが処理対象であるのに対し、色差信号に対するイントラ予測モードは８×８画素のマクロブロックが処理対象であり、また図１５に示すように、輝度信号の場合に比して、モード番号と対応する予測モードとが異なる。また輝度信号と色差信号とでは、予測モードがそれぞれ独立に設定される。 For various intra prediction modes related to such a luminance signal, the prediction mode is set for the color difference signal in the same manner as the intra 16 × 16 prediction mode for the luminance signal. However, while the intra 16 × 16 prediction mode is a macro block of 16 × 16 pixels, the intra prediction mode for the color difference signal is a macro block of 8 × 8 pixels, and is shown in FIG. As described above, the mode number and the corresponding prediction mode are different from those in the case of the luminance signal. The prediction mode is set independently for the luminance signal and the color difference signal.

すなわちモード０においては、画素Ｐ（ｘ，−１）及び画素Ｐ（−１，ｙ）が有意な場合に、次式により予測値が生成される。 That is, in mode 0, when the pixel P (x, −1) and the pixel P (−1, y) are significant, a predicted value is generated by the following equation.

なお画素Ｐ（−１，ｙ）が有意でない場合、（２０）式により、画素Ｐ（ｘ，−１）が有意でない場合、（２１）式により予測値が生成される。 When the pixel P (−1, y) is not significant, the predicted value is generated according to the equation (21) when the pixel P (x, −1) is not significant according to the equation (20).

またモード１においては、画素Ｐ（−１，ｙ）が有意な場合にのみ適用されて、次式により予測値が生成される。 In mode 1, it is applied only when the pixel P (-1, y) is significant, and a predicted value is generated by the following equation.

またモード２においては、Ｐ（ｘ，−１）が有意な場合にのみ適用されて、次式により予測値が生成される。 In mode 2, it is applied only when P (x, -1) is significant, and a predicted value is generated by the following equation.

またモード３においては、画素Ｐ（ｘ，−１）及び画素Ｐ（−１，ｙ）が有意な場合に、次式により予測値が生成される。 In mode 3, when the pixel P (x, -1) and the pixel P (-1, y) are significant, a predicted value is generated by the following equation.

これに対してインター符号化においては、Multiple Reference Frames により、図１６に示すように、処理対象のフレームＯｒｇに対して、複数の参照フレームＲｅｆの何れかを選択して動き補償できるように設定され、これにより直前のフレームにおいて動き補償のブロックに対応する部位が隠れている場合、さらにはフラッシュ等により直前のフレームで一時的に全体の画素値が変動した場合等にあっても、高い精度により動き補償してデータ圧縮効率を向上する。 On the other hand, in inter coding, multiple reference frames are set so that motion compensation can be performed by selecting any of a plurality of reference frames Ref for the processing target frame Org as shown in FIG. Thus, even when the part corresponding to the motion compensation block is hidden in the immediately preceding frame, or even when the entire pixel value temporarily changes in the immediately preceding frame due to flash or the like, it is possible to achieve high accuracy. Data compensation efficiency is improved by motion compensation.

また動き補償に係るブロックにおいては、図１７（Ａ１）に示すように、１６画素×１６画素によるブロックを基準にして動き補償するようになされているものの、variable
MC Block Sizeによりtree-structured motion compensation がサポートされており、これにより図１７（Ａ２）〜（Ａ４）に示すように、１６画素×１６画素によるブロックを水平方向及び又は垂直方向に２分割して、１６画素×８画素、８画素×１６画素、８画素×８画素によるサブマクロブロックによりそれぞれ独立に動きベクトル、参照フレームを設定して動き補償できるように設定されている。また８画素×８画素によるサブマクロブロックについては、図１７（Ｂ１）〜（Ｂ４）に示すように、８画素×８画素、８画素×４画素、４画素×８画素、４画素×４画素によるサブマクロブロックにさらに分割して、それぞれ独立に動きベクトル、参照フレームを設定して動き補償できるように設定されている。 In the block relating to motion compensation, as shown in FIG. 17A1, motion compensation is performed with reference to a block of 16 pixels × 16 pixels.
MC-block size supports tree-structured motion compensation. As shown in FIGS. 17 (A2) to (A4), a block of 16 pixels × 16 pixels is divided into two in the horizontal direction and / or the vertical direction. , 16 pixels × 8 pixels, 8 pixels × 16 pixels, and 8 pixels × 8 pixels are set so that motion compensation can be performed by setting a motion vector and a reference frame independently. In addition, as shown in FIGS. 17B1 to 17B4, the sub-macro block having 8 pixels × 8 pixels has 8 pixels × 8 pixels, 8 pixels × 4 pixels, 4 pixels × 8 pixels, 4 pixels × 4 pixels. Are further divided into sub-macroblocks, and motion vectors and reference frames are set independently so that motion compensation can be performed.

また動き補償においては、６タップのＦＩＲフィルタを用いて１／４画素精度により動き補償できるように設定されている。これにより図１８において、符号Ａにより１画素精度の画素値、符号ｂ〜ｄにより１／２画素精度の画素値、符号ｅ１〜ｅ３により１／４画素精度の画素値を示すように、動き予測・補償回路６は、６タップのＦＩＲフィルタの各タップ入力を値１、−５、２０、２０、−５、１により重み付けして次式の演算処理を実行することにより、水平方向又は垂直方向の連続する画素間に１／２画素精度による画素値ｂ又はｄを計算する。 The motion compensation is set such that motion compensation can be performed with a 1/4 pixel accuracy using a 6-tap FIR filter. Accordingly, in FIG. 18, motion prediction is performed so that a pixel value of 1 pixel accuracy is indicated by the symbol A, a pixel value of 1/2 pixel accuracy is indicated by the symbols b to d, and a pixel value of 1/4 pixel accuracy is indicated by the symbols e1 to e3. The compensation circuit 6 performs horizontal calculation or vertical calculation by weighting each tap input of the 6-tap FIR filter with the values 1, -5, 20, 20, -5, and 1 and executing the following arithmetic processing. A pixel value b or d with 1/2 pixel accuracy is calculated between successive pixels.

またこのようにして計算した１／２画素精度による画素値ｂ又はｄを用いて、６タップのＦＩＲフィルタの各タップ入力を値１、−５、２０、２０、−５、１により重み付けして次式の演算処理を実行することにより、水平方向及び垂直方向の連続する画素間の１／２画素精度による画素値ｃを計算する。 Also, using the pixel value b or d with 1/2 pixel accuracy calculated in this way, each tap input of the 6-tap FIR filter is weighted by the values 1, -5, 20, 20, -5, and 1. By executing the arithmetic processing of the following equation, a pixel value c with a ½ pixel accuracy between consecutive pixels in the horizontal direction and the vertical direction is calculated.

またこのようにして計算した１／２画素精度による画素値ｂ〜ｄを用いて、直線補間による次式の演算処理を実行することにより、１／４画素精度による画素値ｅ１〜ｅ３を計算する。なおこの（２５）式及び（２６）式の重み付け加算に係る正規化の処理においては、垂直方向及び水平方向の全ての補間処理が完了して実行される。 Further, the pixel values e1 to e3 with the ¼ pixel accuracy are calculated by executing the following arithmetic processing by linear interpolation using the pixel values b to d with the ½ pixel accuracy calculated in this way. . Note that in the normalization processing related to the weighted addition of the equations (25) and (26), all the interpolation processes in the vertical direction and the horizontal direction are completed and executed.

このような輝度信号に対する動き補償の処理に対して、色差信号に対する動き補償は、線型補間により実行される。すなわち図１９に示すように、画素ピッチｓによる隣接画素Ａ〜Ｄに対して、水平方向及び垂直方向にそれぞれ内分比ｄx、ｓ−ｄx及びｄy、ｓ−ｄyに係るサンプリング点に設定される画素値νは、次式により表される。 In contrast to such motion compensation processing for luminance signals, motion compensation for color difference signals is executed by linear interpolation. That is, as shown in FIG. 19, the adjacent pixels A to D with the pixel pitch s are set to sampling points with internal division ratios dx, s-dx, dy, and s-dy in the horizontal and vertical directions, respectively. The pixel value ν is expressed by the following equation.

ＡＶＣでは、このようなインター予測に係る符号化の情報である動きベクトルついても、連続するマクロブロック、サブマクロブロック間の相関を有効に利用してデータ伝送量を低減する。すなわちＡＶＣ符号化においては、１つのマクロブロックを複数のサブマクロブロックに分割してそれぞれ動き補償することも可能であることにより、動きベクトルの伝送に供する符号量が増大する。このためブロック毎にそれぞれ水平方向成分及び垂直方向成分について動きベクトル予測値pmv を生成し、この動きベクトル予測値pmv と実際の動きベクトルmvとの間で次式により表される演算処理による計算される差分値の動きベクトル情報ＭＶＤ(Motion Vector Data)を符号化して伝送する。 In AVC, even for a motion vector that is coding information related to such inter prediction, the data transmission amount is reduced by effectively using the correlation between successive macroblocks and sub-macroblocks. That is, in AVC coding, one macroblock can be divided into a plurality of sub-macroblocks and motion compensation can be performed, thereby increasing the amount of code used for motion vector transmission. Therefore, a motion vector prediction value pmv is generated for each horizontal component and vertical component for each block, and the motion vector prediction value pmv and the actual motion vector mv are calculated by the arithmetic processing represented by the following equation. The difference vector motion vector information MVD (Motion Vector Data) is encoded and transmitted.

但し、図２０（Ａ）に示すように、動きベクトルmvに係るブロックが、１つのマクロブロックを水平方向に２分割して形成される２つのサブマクロブロックうちの右側のサブマクロブロックＣの場合であって、動きベクトル予測値mvの検出に係る参照フレームrefIdxE が、残る左側に隣接するサブマクロブロックＡの参照フレームrefIdxAと等しい場合、次式により示すように、この左側に隣接するサブマクロブロックＡで検出された動きベクトルmvA を動きベクトル予測値pmv に設定する。 However, as shown in FIG. 20A, the block related to the motion vector mv is the right sub-macroblock C of the two sub-macroblocks formed by dividing one macroblock into two in the horizontal direction. If the reference frame refIdxE related to the detection of the motion vector prediction value mv is equal to the reference frame refIdxA of the remaining left submacroblock A, the submacroblock adjacent to the left side as shown by the following equation The motion vector mvA detected in A is set as the motion vector prediction value pmv.

またこれとは逆に、動きベクトルmvに係るブロックが、左側のサブマクロブロックＡの場合であって、動きベクトル予測値mvの検出に係る参照フレームrefIdxE が、残る右側に隣接するサブマクロブロックＣの参照フレームrefIdxC と等しい場合、次式により示すように、この右側に隣接するサブマクロブロックＣで検出された動きベクトルmvC を動きベクトル予測値pmv に設定する。 On the other hand, when the block related to the motion vector mv is the left sub-macroblock A, the reference frame refIdxE related to the detection of the motion vector prediction value mv is adjacent to the remaining left sub-macroblock C. If it is equal to the reference frame refIdxC, the motion vector mvC detected in the sub-macroblock C adjacent to the right side is set to the motion vector prediction value pmv as shown by the following equation.

また図２０（Ｂ）に示すように、動きベクトルmvに係るブロックが、１つのマクロブロックを垂直方向に２分割して形成される２つのサブマクロブロックうちの上側のサブマクロブロックＣの場合であって、動きベクトル予測値mvの検出に係る参照フレームrefIdxE が、残る下側に隣接するサブマクロブロックＢの参照フレームrefIdxA と等しい場合、次式により示すように、この下側に隣接するサブマクロブロックＢで検出された動きベクトルmvB を動きベクトル予測値pmv に設定する。 Further, as shown in FIG. 20B, the block related to the motion vector mv is an upper sub-macroblock C of two sub-macroblocks formed by dividing one macroblock into two in the vertical direction. When the reference frame refIdxE related to the detection of the motion vector prediction value mv is equal to the reference frame refIdxA of the remaining lower submacroblock B, as shown by the following equation, The motion vector mvB detected in the block B is set to the motion vector prediction value pmv.

またこれとは逆に、動きベクトルmvに係るブロックが、下側のサブマクロブロックＢの場合であって、動きベクトル予測値mvの検出に係る参照フレームrefIdxE が、残る上側に隣接するサブマクロブロックＡの参照フレームrefIdxA と等しい場合、次式により示すように、この下側に隣接するサブマクロブロックＡで検出された動きベクトルmvA を動きベクトル予測値pmv に設定する。 On the contrary, when the block related to the motion vector mv is the lower sub-macroblock B, the reference frame refIdxE related to the detection of the motion vector prediction value mv is the remaining adjacent sub-macroblock. When equal to the reference frame refIdxA of A, as shown by the following equation, the motion vector mvA detected in the sub macroblock A adjacent on the lower side is set to the motion vector prediction value pmv.

またこれら以外の場合にあっては、図２１（Ａ）に示すように、動き補正に係るブロックＥに対して、隣接するブロックで検出される動きベクトルにより動きベクトルの予測値pmv を生成する。なおここでこの隣接するブロックは、ラスタ走査順序による水平方向の走査開始側に隣接するブロックＡ、ラスタ走査の順序により垂直方向の走査開始側に隣接するブロックＢ、このブロックの左右のブロックＣ、Ｄである。なおこれら隣接するブロックによる動きベクトルの予測値pmvは、図２１（Ｂ）に示すように、この隣接するブロックに属するサブマクロブロックで検出される動きベクトルにも適用される。 In other cases, as shown in FIG. 21A, a motion vector prediction value pmv is generated from a motion vector detected in an adjacent block for a block E related to motion correction. Here, this adjacent block includes a block A adjacent to the horizontal scanning start side according to the raster scanning order, a block B adjacent to the vertical scanning start side according to the raster scanning order, and left and right blocks C of this block, D. Note that the motion vector prediction value pmv by these adjacent blocks is also applied to the motion vector detected by the sub-macroblock belonging to this adjacent block, as shown in FIG.

具体的に、各隣接ブロックの検出に係る参照フレームインデックスrefIdxA 、refIdxB 、refIdxC の値により、動き補正に係るブロックＥとの間で参照フレームが一致する隣接ブロックが存在する場合、次式により、この参照フレームが一致してなる隣接ブロック（N=A or B or C ）による動きベクトルmvN を動きベクトル予測値pmv に設定する。 Specifically, when there is an adjacent block whose reference frame matches with the block E related to motion correction based on the values of the reference frame indexes refIdxA, refIdxB, and refIdxC related to detection of each adjacent block, A motion vector mvN by an adjacent block (N = A or B or C) having a matching reference frame is set as a motion vector prediction value pmv.

またこれ以外の場合には、垂直方向及び水平方向の各成分について、次式により、メディアンフィルタによる処理結果による成分を動きベクトル予測値pmv の各成分に設定する。 In other cases, for each component in the vertical direction and the horizontal direction, the component based on the processing result by the median filter is set as each component of the motion vector prediction value pmv by the following equation.

但し、垂直方向に隣接するブロックＢ、又はこのブロックＢに続くブロックＣの何れかが有意でない場合であって、水平方向に隣接するブロックＡが有意である場合、これら垂直方向に係る隣接ブロックＢ及びＣの動きベクトルmv及び参照フレームインデックスrefIdxは、次式により示すように、ブロックＡによる動きベクトルmvA 及び参照フレームインデックスrefIdxA が代用される。 However, when either the block B adjacent in the vertical direction or the block C following the block B is not significant and the block A adjacent in the horizontal direction is significant, the adjacent block B in the vertical direction The motion vector mvA and the reference frame index refIdx of the block A are substituted for the motion vector mv and the reference frame index refIdx of C and C, as shown by the following equation.

なおＡＶＣでは、Ｂピクチャにおいて、テンポラル（時間）ダイレクトモードと、スペーシャル（空間）ダイレクトモードとによるダイレクトモードが設けられており、このダイレクトモードでは動きベクトルに関する情報の伝送を中止して符号化効率を向上する。 In the AVC, the B picture has a direct mode of a temporal (temporal) direct mode and a spatial (spatial) direct mode. In this direct mode, transmission of information on motion vectors is stopped to improve encoding efficiency. improves.

すなわちスペーシャルダイレクトモードでは、予測ベクトルpmv を動きベクトルに設定して復号化処理を実行する。これに対してテンポラルダイレクトモードは、動きが線形であると仮定して、図２２に示すように、符号化処理を完了した予測フレームＬ１の対応するブロック（Ｃｏ−ＬｏｃａｔｅｄＢｌｏｃｋ）の動きベクトルmvcol を用いた線型補間により、処理対象のＢピクチャに係る動きベクトルＭＶ_l0及びＭＶ_l1を作成するものである。なお、ＡＶＣ画像圧縮情報においては、これらピクチャＬ０、Ｌ１との間の時間情報に係るパラメータＴＤが存在しないことにより、これに代えてPOC (Picture Order Count) が用いられる。 That is, in the spatial direct mode, the decoding process is executed with the prediction vector pmv set as a motion vector. On the other hand, in the temporal direct mode, assuming that the motion is linear, as shown in FIG. 22, the motion vector mvcol of the corresponding block (Co-Located Block) of the prediction frame L1 that has completed the encoding process is obtained. by linear interpolation using, is to create a motion vector MV _l0 and MV _l1 according to B-picture to be processed. In the AVC image compression information, since the parameter TD relating to the time information between these pictures L0 and L1 does not exist, POC (Picture Order Count) is used instead.

ＡＶＣは、これらイントラ及びインター予測に係る予測モードに関して、ＡＶＣに係るＪｏｉｎｔＭｏｄｅｌ（ＡＶＣ参照符号化方式）により、マルチパスエンコードを前提としたＨｉｇｈＣｏｍｐｌｅｘｉｔｙＭｏｄｅと、１パスエンコードを前提としたＬｏｗＣｏｍｐｌｅｘｉｔｙＭｏｄｅとが定義されており、これらの定義に従って最適なモードを選択して符号化処理を実行する。またこれらのモードのうち、ＬｏｗＣｏｍｐｌｅｘｉｔｙＭｏｄｅでは、符号化効率を示すコスト関数を次式により定義し、このコスト関数により得られるコスト値Ｃｏｓｔ（Ｍｏｄｅ）の比較により最適モードを検出する。 With respect to the prediction modes related to intra and inter prediction, AVC uses High Model Mode assuming multi-pass encoding and Low Complexity Mode assuming one-pass encoding by the Joint Model (AVC reference encoding method) related to AVC. Are defined, and an optimum mode is selected according to these definitions, and the encoding process is executed. Of these modes, in Low Complexity Mode, a cost function indicating encoding efficiency is defined by the following equation, and an optimal mode is detected by comparing cost values Cost (Mode) obtained by the cost function.

ここでＳＡ（Ｔ）Ｄは、原画像と予測画像との誤差値であり、これら原画像と予測画像との間の、画素値差分値の絶対値誤差和が適用される。またＳＡ（Ｔ）Ｄ０は、ヘッダビット、モード判定の際の重みとなるコストであり、誤差値ＳＡ（Ｔ）Ｄに与えられるオフセット値であり、動きベクトル等の付加的な情報の伝送に供するデータ量が示される。 Here, SA (T) D is an error value between the original image and the predicted image, and an absolute value error sum of pixel value difference values between the original image and the predicted image is applied. SA (T) D0 is a header bit and a cost as a weight for mode determination, an offset value given to the error value SA (T) D, and is used for transmission of additional information such as a motion vector. The amount of data is shown.

具体的に絶対値誤差和ＳＡＤは、各マクロブロックについて、次式により示され、それぞれ各予測モードＭｏｄｅにおける原画像と予測画像の差分値が適用される。 Specifically, the absolute value error sum SAD is expressed by the following equation for each macroblock, and the difference value between the original image and the predicted image in each prediction mode Mode is applied.

なおここでこの（３８）式による絶対値誤差和ＳＡＤに代えて、次式による得られる差分加算値ＳＡＴＤ（Ｍode ）を用いてもよい。 Here, instead of the absolute value error sum SAD according to the equation (38), a difference addition value SATD (Mode) obtained by the following equation may be used.

なおＨａｄａｍａｒｄ（）は、次式により示すように、対象の行列にアダマール変換行列を掛けるアダマール変換操作である。なおアダマール変換行列は、（４１）式により表され、Ｈ^Tは、アダマール変換行列の転置行列である。 Hadamard () is a Hadamard transform operation for multiplying the target matrix by a Hadamard transform matrix, as shown by the following equation. Note Hadamard transform matrix is expressed by equation (41), H ^T is a transposed matrix of the Hadamard transform matrix.

またオフセット値ＳＡ（Ｔ）Ｄ０は、前予測モードにおいては、次式により示される。なおここでＱＰ０（ＱＰ）は、量子化パラメータＱＰを量子化スケールに変換する関数であり、ＭＶＤＦＷは、前予測に係る動きベクトルであり、Bit to code は、この動きベクトルに係るビットストリーム上の符号量である。 The offset value SA (T) D0 is represented by the following equation in the previous prediction mode. Here, QP0 (QP) is a function for converting the quantization parameter QP into a quantization scale, MFDFW is a motion vector related to the previous prediction, and Bit to code is a bit stream related to this motion vector. Code amount.

またオフセット値ＳＡ（Ｔ）Ｄ０は、後予測モードにおいては、次式により表される。なおここでＭＶＤＢＷは、後予測に係る動きベクトルである。 The offset value SA (T) D0 is expressed by the following equation in the post-prediction mode. Here, MVDBW is a motion vector related to post prediction.

またオフセット値ＳＡ（Ｔ）Ｄ０は、 Bi-Predictive予測モードにおいては、次式により表される。なおここでBit to code forward Blk size、Bit to code backward Blk size は、それぞれ前予測及び後予測に係る動き補償ブロックに関する情報の伝送に必要なビットストリーム上における符号量である。 The offset value SA (T) D0 is expressed by the following equation in the Bi-Predictive prediction mode. Here, Bit to code forward Blk size and Bit to code backward Blk size are code amounts on the bitstream necessary for transmission of information related to the motion compensation block related to the forward prediction and backward prediction, respectively.

またダイレクトモードにおいては、オフセット値ＳＡ（Ｔ）Ｄ０は、次式により求められる。 In the direct mode, the offset value SA (T) D0 is obtained by the following equation.

またイントラ４×４予測モードでは、オフセット値ＳＡ（Ｔ）Ｄ０は、次式により求められる。 In the intra 4 × 4 prediction mode, the offset value SA (T) D0 is obtained by the following equation.

因みに、このコスト関数にあっては、動きベクトルの探索にも適用され、次式により示すように、コスト値Ｃｏｓｔを最小にする動きベクトルが検出される。 Incidentally, this cost function is also applied to a search for a motion vector, and a motion vector that minimizes the cost value Cost is detected as shown by the following equation.

これらによりＬｏｗＣｏｍｐｌｅｘｉｔｙＭｏｄｅにおいて、最適モードを検出する場合、符号化装置１では、イントラ予測回路５及び動き予測・補償回路６において、輝度信号を用いて、それぞれイントラ符号化及びインター符号化の全ての予測モードのコスト値Ｃｏｓｔを計算し、このコスト値Ｃｏｓｔの最も小さな予測モードをそれぞれ選択してイントラ符号化の最適モード及びインター符号化の最適モードを検出する。またこれらイントラ符号化の最適モード及びインター符号化の最適モードにおけるコスト値Ｃｏｓｔの比較により、イントラ符号化、インター符号化を選択すると共に、輝度信号の最適モードを検出する。またこれによりイントラ符号化が選択された場合、色差信号について各イントラ予測モードのコスト値を計算し、このコスト値の比較により最も値の小さなイントラ予測モードが色差信号の最適モードに設定される。なお、インター符号化が選択された場合、色差信号は、輝度信号に係る参照フレーム、動きベクトル、輝度信号に対応する動き補償ブロックにより予測値が生成される。 Accordingly, when the optimum mode is detected in the Low Complexity Mode, the encoding device 1 uses the luminance signal in the intra prediction circuit 5 and the motion prediction / compensation circuit 6, respectively, to perform all of intra coding and inter coding. The cost value Cost of the prediction mode is calculated, the prediction mode having the smallest cost value Cost is selected, and the optimum mode for intra coding and the optimum mode for inter coding are detected. In addition, by comparing the cost value Cost in the optimum mode for intra coding and the optimum mode for inter coding, intra coding and inter coding are selected, and the optimum mode of the luminance signal is detected. When intra coding is selected in this way, the cost value of each intra prediction mode is calculated for the color difference signal, and the intra prediction mode having the smallest value is set as the optimum mode of the color difference signal by comparing the cost values. When inter coding is selected, a predicted value of the color difference signal is generated by a reference frame related to the luminance signal, a motion vector, and a motion compensation block corresponding to the luminance signal.

これらによりＡＶＣでは、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出し、この最適モードにより画像データを符号化処理し、これにより画像データを効率良く符号化処理する。 As a result, in AVC, the optimal mode is detected for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes, and image data is encoded by this optimal mode, thereby efficiently encoding the image data. .

またＡＶＣにおいて、デブロックフィルタ１５、２８は、復号画像におけるブロック歪を除去すると共に、動き補償処理によるブロック歪の伝播を防止する為に適用され、以下のように定義される。なおここで量子化パラメータＱＰは、輝度信号の処理においては、ＱＰＹが適用され、色差信号の処理においては、ＱＰＣが適用される。またデブロックフィルタ処理は、隣接画素に関しては、異なるスライスに属する画素値でも、同一のピクチャに属する場合は有意であるとして処理が実行される。 In AVC, the deblocking filters 15 and 28 are applied to remove block distortion in a decoded image and prevent propagation of block distortion due to motion compensation processing, and are defined as follows. Here, as the quantization parameter QP, QPY is applied in the luminance signal processing, and QPC is applied in the color difference signal processing. In the deblocking filter processing, regarding adjacent pixels, even if pixel values belonging to different slices belong to the same picture, the processing is executed.

ここで図２３に示すように、ブロック境界を間に挟んで連続する画素について、デブロックフィルタによる処理前の画素値をｐ０〜ｐ３、ｑ０〜ｑ３とし、処理後の画素値をｐ０' 〜ｐ３' 、ｑ０' 〜ｑ３' とする。これら処理対象の画素値に対して、図２４に示すように、各画素がイントラマクロブロックに属するか否か等によりブロック境界の強度値（Bs：Boundary Strength）が定義される。 Here, as shown in FIG. 23, for the pixels that are continuous with the block boundary in between, the pixel values before processing by the deblocking filter are p0 to p3 and q0 to q3, and the pixel values after processing are p0 ′ to p3. ', Q0' to q3 '. With respect to the pixel values to be processed, as shown in FIG. 24, block boundary strength values (Bs: Boundary Strength) are defined depending on whether or not each pixel belongs to an intra macroblock.

この定義を前提に、次式により示す関係式が成立する場合に、デブロックフィルタの処理が実行される。 On the premise of this definition, the deblocking filter process is executed when the relational expression expressed by the following expression holds.

なおここで定数α、βは、デフォルトでは次式により示すように、量子化パラメータＱＰにより図２５に示すように設定されるが、矢印Ａにより示すように、画像圧縮情報中のスライスヘッダに含まれるパラメータslice alpha c0 offset div2及びslice beta offset div2により用度を調整することが可能である。なおここで図２６は、α及びβの設定を示す図表であり、この図２６におけるindexAとindexBは、次式により定義され、オフセット値Filter OffsetA 及びFilter OffsetBがユーザによる調整分に相当する。 Here, the constants α and β are set by default as shown in FIG. 25 by the quantization parameter QP as shown by the following expression, but are included in the slice header in the image compression information as shown by the arrow A. The usage can be adjusted by the slice alpha c0 offset div2 and the slice beta offset div2. Note that FIG. 26 is a chart showing the settings of α and β. In FIG. 26, indexA and indexB are defined by the following equations, and the offset values Filter OffsetA and Filter OffsetB correspond to adjustments by the user.

ＡＶＣでは、ブロック境界の強度値Ｂｓが値４以下の場合、次式により示すように、画素値ｐ０' 、ｑ０' が設定される。 In AVC, when the block boundary intensity value Bs is 4 or less, pixel values p0 ′ and q0 ′ are set as shown by the following equations.

ここでｔｃは、クロマエッジフラグ（chroma Edge Flag）が値０の場合、（５１）式による値に設定され、それ以外の場合、（５２）式による値に設定される。またｔｃｏは、indexA、indexBと、Ｂｓとにより図２７に示すように定義される。 Here, tc is set to a value according to equation (51) when the chroma edge flag is 0, and is set to a value according to equation (52) otherwise. Tco is defined by indexA, indexB, and Bs as shown in FIG.

またａｐ及びａｑは、次式により表される。 Ap and aq are expressed by the following equations.

これに対して画素値ｐ１' は、クロマエッジフラグ（chroma Edge Flag）が値０であって、かつａｐの値がｂ以下の場合、（５４）式による値に設定され、それ以外の場合、（５５）式による値に設定される。 On the other hand, the pixel value p1 ′ is set to a value according to the equation (54) when the chroma edge flag is 0 and the value of ap is less than or equal to b, otherwise, The value is set according to equation (55).

また画素値ｑ１' は、クロマエッジフラグ（chroma Edge Flag）が値０であって、かつａｑの値がｂ以下の場合、（５６）式による値に設定され、それ以外の場合、（５７）式による値に設定される。 The pixel value q1 ′ is set to a value according to the expression (56) when the chroma edge flag is 0 and the value of aq is less than or equal to b, and in other cases, (57) Set to an expression value.

また画素値ｐ２' 及びｑ２' は、次式により示すように、処理前の画素値ｐ２及びｑ２に設定される。 The pixel values p2 ′ and q2 ′ are set to the pixel values p2 and q2 before processing, as shown by the following equation.

これに対してブロック境界の強度値Ｂｓが値４の場合、処理後の画素値ｐｉ' （ｉ＝０〜２）は、クロマエッジフラグ（chroma Edge Flag）が値０の場合であって、次式の関係式が成立する場合、（６０）式により示す値に設定される。 On the other hand, when the block boundary intensity value Bs is 4, the processed pixel value pi ′ (i = 0 to 2) is the case where the chroma edge flag is 0, and When the relational expression is established, it is set to a value indicated by the expression (60).

またこのような条件に該当しない場合、次式により示す値に設定される。 If this condition is not met, the value is set according to the following equation.

またブロック境界の強度値Ｂｓが値４の場合、処理後の画素値ｑｉ' （ｉ＝０〜２）は、クロマエッジフラグ（chroma Edge Flag）が値０の場合であって、次式の関係式が成立する場合、（６３）式により示す値に設定される。 When the block boundary intensity value Bs is 4, the processed pixel value qi ′ (i = 0 to 2) is a case where the chroma edge flag is 0, and the relationship of the following equation: When the formula is established, the value is set to a value indicated by the formula (63).

ＡＶＣによる符号化装置１及び復号化装置２０において、デブロックフィルタ２８は、これらにより適宜特性を切り換えて、ブロック歪の発生を防止する。 In the AVC encoding device 1 and decoding device 20, the deblocking filter 28 switches the characteristics accordingly to prevent the occurrence of block distortion.

これに対してレート制御においては、例えばＴＭ５（ＭＰＥＧ−２ＴｅｓｔＭｏｄｅｌ５）による手法が適用される。ここでＴＭ５によるレート制御は、各ピクチャへの目標符号量を設定するビット配分のステップと、仮想バッファ制御を用いたレート制御のステップと、視覚特性を考慮した適応量子化のステップとによる３つの階層から構成される。 On the other hand, in rate control, for example, a technique based on TM5 (MPEG-2 Test Model 5) is applied. Here, the rate control by TM5 has three steps: a bit allocation step for setting a target code amount to each picture, a rate control step using virtual buffer control, and an adaptive quantization step considering visual characteristics. Consists of hierarchies.

これらのステップのうちビット配分のステップでは、１ＧＯＰへの割当ビット量、それまでの発生符号量から、未だ符号化処理されていないピクチャへの目標符号量を計算し、以下の２つの仮定に基づいて、各ピクチャへの符号量割当量を計算する。 Of these steps, in the bit allocation step, the target code amount for a picture that has not yet been encoded is calculated from the bit amount allocated to 1 GOP and the code amount generated so far, and based on the following two assumptions: Thus, the code amount allocation amount for each picture is calculated.

ここで第１の仮定は、各ピクチャを符号化する際に用いる平均量子化スケールと、発生符号量との積は、画面が変化しない限り、ピクチャタイプ毎に一定値であるとの仮定である。これによりこのレート制御においては、各ピクチャを符号化処理した後、各ピクチャタイプ毎に、画面の複雑さを表すパラメータＸi、Ｘp、Ｘb（global complexity measure ) を次式により更新する。これによりＴＭ５によるレート制御においては、これらのパラメータＸi、Ｘp、Ｘbにより、次のピクチャを符号化処理する際の量子化スケールコードと発生符号量との関係を推定する。 Here, the first assumption is that the product of the average quantization scale used when encoding each picture and the generated code amount is a constant value for each picture type unless the screen changes. . Thus, in this rate control, after encoding each picture, parameters Xi, Xp, and Xb (global complexity measure) representing the complexity of the screen are updated by the following equation for each picture type. Thereby, in rate control by TM5, the relationship between the quantization scale code and the generated code amount when the next picture is encoded is estimated by using these parameters Xi, Xp, and Xb.

ここで（６５）式の各変数の添え字は、それぞれＩピクチャ、Ｐピクチャ、Ｂピクチャを示す添え字である。またＳi 、Ｓp 、Ｓb は、各ピクチャの符号化処理による発生符号ビット量であり、Ｑi 、Ｑp 、Ｑb は、各ピクチャの符号化時における平均量子化スケールコードである。またパラメータＸi 、Ｘp 、Ｘb の初期値は、目標符号量bit rate〔bit/sec 〕を用いて、次式により与えられる。 Here, the subscript of each variable in the expression (65) is a subscript indicating an I picture, a P picture, and a B picture, respectively. Si, Sp, and Sb are generated code bit amounts by the encoding process of each picture, and Qi, Qp, and Qb are average quantization scale codes at the time of encoding of each picture. The initial values of the parameters Xi, Xp, and Xb are given by the following equation using the target code amount bit rate [bit / sec].

また第２の仮定は、Ｉピクチャの量子化スケールに対するＰピクチャの量子化スケールコードの比率Ｋp 、Ｉピクチャの量子化スケールに対するＢピクチャの量子化スケールコードの比率Ｋb が、次式の関係に保持されている場合に、常に全体の画質が最良となるとの仮定である。 The second assumption is that the ratio Kp of the quantization scale code of the P picture to the quantization scale of the I picture and the ratio Kb of the quantization scale code of the B picture to the quantization scale of the I picture are held in the relationship of the following equations: It is assumed that the overall image quality is always best when

すなわちこの仮定は、Ｉピクチャ、Ｐピクチャの量子化スケールに対してＢピクチャの量子化スケールを常に１．４倍に設定することにより全体の画質が最良となることを意味するものであり、Ｉピクチャ、Ｐピクチャに比してＢピクチャを粗く量子化してＢピクチャに割当る符号量を節約し、その分、Ｉピクチャ、Ｐピクチャに多くの符号量を振り分けてＩピクチャ、Ｐピクチャの画質を向上すると共に、Ｉピクチャ、Ｐピクチャを参照するＢピクチャの画質も併せて向上し、これらにより全体的に見た画質を最良とするものである。 In other words, this assumption means that the overall picture quality is best when the quantization scale of the B picture is always set to 1.4 times the quantization scale of the I picture and the P picture. Compared to pictures and P pictures, the B picture is roughly quantized to save the code amount assigned to the B picture, and accordingly, a large amount of code amount is allocated to the I picture and P picture to improve the image quality of the I picture and P picture. In addition to the improvement, the image quality of the B picture referring to the I picture and the P picture is also improved, and the overall image quality is thereby optimized.

これらによりＴＭ５では、次式の演算処理により、各ピクチャへの割当ビット量Ｔi、Ｔp、Ｔbを計算する。なおここでＮp、Ｎbは、処理対象であるＧＯＰ内で、未だ符号化されていないＰピクチャ、Ｂピクチャの枚数である。 As a result, TM5 calculates the allocated bit amounts Ti, Tp, and Tb for each picture by the following arithmetic processing. Here, Np and Nb are the number of P pictures and B pictures that have not been encoded in the GOP to be processed.

これによりＴＭ５では、上述した２つの仮定に基づいて、各ピクチャの発生符号量を推定する。このとき符号割当対象とは異なるピクチャタイプのピクチャについては、画質最適化条件の下で、そのピクチャの発生する符号量が、割当対象ピクチャの発生符号量の何倍となるかを推定する。またこの推定により、ＧＯＰ内の未符号化ピクチャが、符号割当対象のピクチャタイプにおける何枚分のピクチャに相当するかを推計し、この推計結果より各ピクチャへの割当ビット量を計算する。なおこの場合に、レート制御回路９は、ヘッダ等の固定的に必要となる符号量を考慮して、その値に下限を設定して割当ビット量を計算する。 Thereby, TM5 estimates the generated code amount of each picture based on the above two assumptions. At this time, for a picture of a picture type different from the code allocation target, it is estimated how many times the code amount generated by the picture is larger than the generated code amount of the allocation target picture under the image quality optimization condition. Also, by this estimation, it is estimated how many pictures in the picture type to be code assigned correspond to the uncoded pictures in the GOP, and the allocated bit amount to each picture is calculated from this estimation result. In this case, the rate control circuit 9 considers the fixedly required code amount such as a header and sets the lower limit to the value and calculates the allocated bit amount.

これに対して続くレート制御のステップでは、ビット配分のステップで求められた各ピクチャへの割当ビット量Ｔi 、Ｔp 、Ｔb と、実際の発生符号量とを一致させるため、各ピクチャタイプ毎に独立に３種類の仮想バッファを設定し、この仮想バッファの容量に基づいて量子化回路８の量子化スケールをマクロブロック単位のフィードバック制御により計算する。 On the other hand, in the subsequent rate control step, the allocated bit amounts Ti, Tp, Tb obtained in the bit allocation step and the actual generated code amount are matched with each other, so that each picture type is independent. The three types of virtual buffers are set in the virtual buffer, and the quantization scale of the quantization circuit 8 is calculated by feedback control in units of macroblocks based on the capacity of the virtual buffer.

ここで始めに、これら３種類の仮想バッファの占有率を、次式の演算式により計算する。なおここでｄ0i、ｄ0p、ｄ0bは、各仮想バッファの初期占有量、Ｂjは、ピクチャ先頭からｊ番目のマクロブロックまでの発生ビット量、ＭＢ＿ｃｎｔは、１ピクチャ内でのマクロブロック数である。 First, the occupation ratios of these three types of virtual buffers are calculated by the following formula. Here, d0i, d0p, and d0b are initial occupancy amounts of each virtual buffer, Bj is the generated bit amount from the beginning of the picture to the jth macroblock, and MB_cnt is the number of macroblocks in one picture.

この（６９）式により計算結果に基づいてｊ番目のマクロブロックに対する量子化スケールを、次式により計算する。 The quantization scale for the jth macroblock is calculated by the following equation based on the calculation result by the equation (69).

なおここでｒは、リアクションパラメータであり、フィードバックの応答を制御するパラメータである。ＴＭ５において、リアクションパラメータｒ及び初期値ｄ0i、ｄ0p、ｄ0bは、次式により与えられる。 Here, r is a reaction parameter, which is a parameter for controlling a feedback response. In TM5, the reaction parameter r and the initial values d0i, d0p, d0b are given by the following equations.

なおシーケンス先頭における仮想バッファの初期値は以下の式により与えられる。 The initial value of the virtual buffer at the beginning of the sequence is given by the following equation.

続く適応量子化のステップでは、レート制御のステップで計算された量子化スケールを視覚特性を考慮して補正し、これにより視覚特性を考慮した最適量子化の処理を実行する。ここでこの最適量子化の処理においては、視覚的に劣化の目立ちやすい平坦部ではより細かく量子化するように、また劣化の比較的目立ちにくい絵柄の複雑な部分でより粗く量子化するように、各マクロブロックの平坦度を示すアクティビティにより、量子化スケールを補正する。 In the subsequent adaptive quantization step, the quantization scale calculated in the rate control step is corrected in consideration of the visual characteristics, thereby executing an optimal quantization process in consideration of the visual characteristics. Here, in this optimal quantization process, in order to quantize more finely in the flat part where deterioration is visually noticeable, and coarser in the complicated part of the pattern where deterioration is relatively inconspicuous, The quantization scale is corrected by the activity indicating the flatness of each macroblock.

ここでアクティビティは、１６×１６画素の大きさによるマクロブロック毎に、このマクロブロックを構成する８×８画素による４個のブロックについて、フレームＤＣＴモードにおける４個のブロックと、フィールドＤＣＴモードにおける４個のブロックとによる計８個のブロックの画素値を用いて、次式により算出され、これにより該当マクロブロックにおける輝度レベルの平滑度を示すようになされている。 Here, for each macroblock having a size of 16 × 16 pixels, the activity is divided into four blocks in the frame DCT mode and four blocks in the field DCT mode for four blocks of 8 × 8 pixels constituting the macroblock. The pixel values of a total of eight blocks are calculated using the following equation, thereby indicating the smoothness of the luminance level in the corresponding macroblock.

なおここでＰk は、原画の輝度信号ブロック内画素値である。この（７３）式において最小値を取るのは、このマクロブロック内の一部だけでも平坦部分のある場合には量子化ステップを細かくして画質劣化を防止するためである。 Here, Pk is the pixel value in the luminance signal block of the original image. The reason why the minimum value is taken in the equation (73) is to prevent the image quality deterioration by making the quantization step finer when only a part of the macroblock has a flat part.

ＴＭ５では、この計算式により求めたアクティビティを次式により正規化し、これにより０．５〜２の範囲で値を取る正規化アクティビティＮａｃｔj を求める。なおここでａｖｇ＿ａｃｔは、直前に符号化したピクチャにおけるアクティビティａｃｔj の平均値である。 In TM5, the activity obtained by this calculation formula is normalized by the following formula, thereby obtaining a normalized activity Nactj having a value in the range of 0.5-2. Here, avg_act is an average value of activity actj in the picture encoded immediately before.

またこの正規化アクティビティＮａｃｔj により次式の演算処理を実行し、レート制御のステップで計算した量子化スケールＱj を補正する。 In addition, the normalization activity Nactj is used to execute the following arithmetic processing to correct the quantization scale Qj calculated in the rate control step.

これらにより符号化装置１では、レート制御回路９によりこれらＴＭ５に係るレート制御の処理を実行して逐次画像データＤ１を符号化処理する。 Thus, in the encoding apparatus 1, the rate control circuit 9 executes the rate control processing related to TM5 and sequentially encodes the image data D1.

このような符号化装置に関しては、例えば特開２００４−５６８２７号公報等に復号化処理等の利便を図る工夫が種々に提案されている。 With regard to such an encoding apparatus, for example, various ideas for convenience of decoding processing and the like have been proposed in Japanese Patent Application Laid-Open No. 2004-56827.

ところでＡＶＣでは、比較的アクティビティの低い領域では、イントラ予測モードによる誤差値ＳＡ（Ｔ）Ｄが小さくなり、（３７）式に示すコスト関数において、動きベクトルに関する情報を伝送しなくて済む分、イントラ予測モードの方がコスト値Ｃｏｓｔ（Ｍｏｄｅ）が小さくなる場合がある。これによりＡＶＣでは、例えば地面のように、比較的アクティビティの低い領域が画像の後半部分で多くを占める場合、インタースライスにおいても、このアクティビティの低い領域で、イントラ予測モードが選択され易くなる。 By the way, in the AVC, the error value SA (T) D due to the intra prediction mode is small in a region where the activity is relatively low. The cost value Cost (Mode) may be smaller in the prediction mode. Accordingly, in AVC, when an area with relatively low activity occupies a large portion in the second half of the image, such as the ground, the intra prediction mode is easily selected in the area with low activity even in the inter slice.

しかしながらこのようなこのアクティビティの低い領域のイントラ予測モードによる符号化処理においては、ノイズの影響により、図８及び図９について上述した９種類のイントラ４×４予測モードで、予測モードを切り換えて符号化処理する場合もあり、この場合には、予測モードの切り換わりがばたついた感じとなって復号した画像に表れる。ここでこのようなばたつきは、フリッカのように見て取られることにより、視聴者の目につきやすく、これによりこのような場合、従来のＡＶＣでは画質が損なわれる問題があった。 However, in such an encoding process using the intra prediction mode in the low activity area, the coding is performed by switching the prediction mode in the nine types of intra 4 × 4 prediction modes described above with reference to FIGS. In this case, the switching of the prediction mode appears as if it fluctuates and appears in the decoded image. Here, such fluttering is easily seen by the viewer by being seen like flicker, and in this case, there is a problem that image quality is impaired in the conventional AVC.

またさらにＡＶＣでは、図２４に示したように、イントラマクロブロックでブロック境界の強度値Ｂｓが大きくなることにより、このようにアクティビティの低い領域をイントラ予測モードにより符号化処理した場合、デブロックフィルタにより過剰にブロック境界歪を抑圧することになる。ここでこのような過剰なブロック境界歪の抑圧にあっては、平坦な部分における局所的な変化が損なわれることにより、解像度が著しく低下したように視聴者に認識される。これによってもＡＶＣでは、画質が損なわれる問題があった。
特開２００４−５６８２７号公報 Furthermore, in the AVC, as shown in FIG. 24, when the block boundary strength value Bs is increased in an intra macroblock, when a region with low activity is encoded in the intra prediction mode, a deblock filter is used. This excessively suppresses block boundary distortion. Here, in the suppression of the excessive block boundary distortion, the viewer recognizes that the resolution is remarkably lowered by losing the local change in the flat portion. This also has the problem that the image quality is impaired in AVC.
JP 2004-56827 A

本発明は以上の点を考慮してなされたもので、コスト関数によりイントラ予測モード、インター予測モードから最適モードを選択して画像データを符号化処理する場合に、アクティビティの低い領域における画質劣化を防止することができる符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体を提案しようとするものである。 The present invention has been made in consideration of the above points. When image data is encoded by selecting an optimal mode from an intra prediction mode and an inter prediction mode by a cost function, image quality degradation in a low activity area is reduced. It is an object of the present invention to propose an encoding apparatus, an encoding method, an encoding method program, and a recording medium on which the encoding method program can be prevented.

かかる課題を解決するため請求項１の発明においては、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出し、前記最適モードにより画像データを符号化処理する符号化装置に適用して、前記マクロブロック毎に、前記画像データによる画像の平坦度を示すアクティビティを計算するアクティビティ計算手段と、前記アクティビティにより前記コスト値を補正して前記最適モードを検出する最適モード検出手段とを備えるようにする。 In order to solve such a problem, in the invention of claim 1, an optimal mode is detected for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values with a cost function indicating coding efficiency. Applied to an encoding device that encodes image data in the optimum mode, activity calculating means for calculating an activity indicating the flatness of the image by the image data for each macroblock, and the cost value by the activity And an optimum mode detecting means for detecting the optimum mode.

また請求項１１の発明においては、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出し、前記最適モードにより画像データを符号化処理する符号化方法に適用して、前記マクロブロック毎に、前記画像データによる画像の平坦度を示すアクティビティを計算するアクティビティ計算のステップと、前記アクティビティにより前記コスト値を補正して前記最適モードを検出する最適モード検出のステップとを有するようにする。 In the invention of claim 11, an optimum mode is detected for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values with a cost function indicating coding efficiency, and an image is obtained by the optimum mode. Applying to an encoding method for encoding data, an activity calculation step for calculating an activity indicating the flatness of the image based on the image data for each macroblock, and correcting the cost value by the activity An optimum mode detecting step for detecting the optimum mode.

また請求項１２の発明においては、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出する最適モード検出のステップと、前記最適モードにより画像データを符号化処理する符号化処理のステップとを有する符号化方法のプログラムに適用して、前記マクロブロック毎に、前記画像データによる画像の平坦度を示すアクティビティを計算するアクティビティ計算のステップとを有し、前記最適モード検出のステップは、前記アクティビティにより前記コスト値を補正して前記最適モードを検出する。 Further, in the invention of claim 12, an optimum mode detection step of detecting an optimum mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values with a cost function indicating coding efficiency; And an encoding method program including an encoding process step for encoding image data in the optimum mode, and calculating an activity indicating the flatness of the image based on the image data for each macroblock. A step of calculating an activity, and the step of detecting the optimum mode detects the optimum mode by correcting the cost value by the activity.

また請求項１３の発明においては、演算処理手段により実行される符号化方法のプログラムを記録した記録媒体に適用して、前記符号化方法のプログラムは、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出する最適モード検出のステップと、前記最適モードにより画像データを符号化処理する符号化処理のステップと、前記マクロブロック毎に、前記画像データによる画像の平坦度を示すアクティビティを計算するアクティビティ計算のステップとを有し、前記最適モード検出のステップは、前記アクティビティにより前記コスト値を補正して前記最適モードを検出する。 According to a thirteenth aspect of the present invention, the program of the encoding method is applied to a recording medium on which a program of the encoding method executed by the arithmetic processing means is recorded. By comparison, a plurality of intra prediction modes, a step of optimal mode detection for detecting an optimal mode for each macroblock from a plurality of inter prediction modes, a step of encoding processing for encoding image data in the optimal mode, An activity calculation step of calculating an activity indicating the flatness of the image based on the image data for each macroblock, and the step of detecting the optimum mode corrects the cost value by the activity and sets the optimum mode. To detect.

請求項１の構成により、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出し、前記最適モードにより画像データを符号化処理する符号化装置に適用して、前記マクロブロック毎に、前記画像データによる画像の平坦度を示すアクティビティを計算するアクティビティ計算手段と、前記アクティビティにより前記コスト値を補正して前記最適モードを検出する最適モード検出手段とを備えるようにすれば、このコスト値の補正によりアクティビティが低い場合にはイントラ予測モードを選択しないようにコスト値を設定することができる。これにより過剰なブロック境界歪の抑圧を防止して解像度の低下を防止することができ、また複数のイントラ予測モードの切り換わりによる画質劣化を防止することができ、これらによりコスト関数によりイントラ予測モード、インター予測モードから最適モードを選択して画像データを符号化処理する場合に、アクティビティの低い領域における画質劣化を防止することができる。 According to the configuration of claim 1, an optimum mode is detected for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values using a cost function indicating coding efficiency, and image data is detected by the optimum mode. Applying to an encoding apparatus that performs encoding processing, activity calculating means for calculating an activity indicating the flatness of the image by the image data for each macroblock, and correcting the cost value by the activity to correct the optimal mode If an optimum mode detecting means for detecting the cost is provided, the cost value can be set so that the intra prediction mode is not selected when the activity is low due to the correction of the cost value. As a result, suppression of excessive block boundary distortion can be prevented to prevent resolution degradation, and image quality deterioration due to switching of a plurality of intra prediction modes can be prevented. When image data is encoded by selecting the optimum mode from the inter prediction mode, it is possible to prevent image quality deterioration in a low activity area.

これにより請求項１１、請求項１２、請求項１３の構成によれば、コスト関数によりイントラ予測モード、インター予測モードから最適モードを選択して画像データを符号化処理する場合に、アクティビティの低い領域における画質劣化を防止することができる符号化方法、符号化方法のプログラム、符号化方法のプログラムを記録した記録媒体を提供することができる。 Thus, according to the configurations of claims 11, 12, and 13, when the optimal mode is selected from the intra prediction mode and the inter prediction mode by the cost function and the image data is encoded, the region with low activity Encoding method, encoding method program, and recording medium recording the encoding method program can be provided.

本発明によれば、コスト関数によりイントラ予測モード、インター予測モードから最適モードを選択して画像データを符号化処理する場合に、アクティビティの低い領域における画質劣化を防止することができる。 According to the present invention, when image data is encoded by selecting an optimal mode from an intra prediction mode and an inter prediction mode by a cost function, it is possible to prevent image quality deterioration in a low activity area.

以下、適宜図面を参照しながら本発明の実施例を詳述する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings as appropriate.

（１）実施例の構成
図１は、本発明の実施例に係るＡＶＣ方式の符号化装置を示すブロック図である。この符号化装置４１において、図３について上述した符号化装置１と同一の構成は、対応する符号を付して示し、重複した説明は省略する。これによりこの符号化装置４１は、順次入力されるビデオ信号Ｓ１をアナログディジタル変換処理して画像データＤ１に変換した後、イントラ予測モード、インター予測モードより最適モードを選択してこの画像データＤ１を符号化する。 (1) Configuration of Embodiment FIG. 1 is a block diagram showing an AVC encoding apparatus according to an embodiment of the present invention. In this encoding device 41, the same components as those of the encoding device 1 described above with reference to FIG. 3 are denoted by the corresponding reference numerals, and redundant description is omitted. As a result, the encoding device 41 performs analog-digital conversion processing on the sequentially input video signal S1 and converts it into image data D1, and then selects an optimum mode from the intra prediction mode and the inter prediction mode to select the image data D1. Encode.

この符号化装置４１において、アクティビティ算出回路４２は、この処理対象の画像データＤ１について、１６×１６画素によるマクロブロック毎に、画像データＤ１による画像の平坦度を示すパラメータを計算し、この実施例では、このパラメータにアクティビティが適用される。これによりアクティビティ算出回路４２は、次式の演算処理の実行により、マクロブロック毎に画素値の分散値を計算してアクティビティａｃｔを計算する。アクティビティ算出回路４２は、このアクティビティａｃｔをマクロブロックのアクティビティＭＢａｃｔに設定して出力する。 In this encoding device 41, the activity calculation circuit 42 calculates a parameter indicating the flatness of the image based on the image data D1 for each macroblock of 16 × 16 pixels for the image data D1 to be processed. Now the activity is applied to this parameter. As a result, the activity calculation circuit 42 calculates the activity act by calculating the variance value of the pixel value for each macroblock by executing the arithmetic processing of the following equation. The activity calculation circuit 42 sets this activity act to the activity MB act of the macro block and outputs it.

レート制御回路４３は、このアクティビティ算出回路４２より得られるアクティビティＭＢａｃｔを用いて上述したＴＭ５の手法によりレート制御の処理を実行する。 The rate control circuit 43 uses the activity MB act obtained from the activity calculation circuit 42 to execute rate control processing by the above-described TM5 method.

動き予測・補償回路４４は、イントラ・インター判定回路４５の制御により、インター符号化に係る輝度信号の全ての予測モードについて、（３７）式のコスト値Ｃｏｓｔ（Ｍｏｄｅ）を計算し、各予測モードにおけるコスト値Ｃｏｓｔ（Ｍｏｄｅ）の比較により、最も値の小さな予測モードを検出する。これにより動き予測・補償回路４４は、インター予測モードより最適モードを検出し、この最適モードのコスト値Ｃｏｓｔ（Ｍｏｄｅ）をイントラ・インター判定回路４５に通知する。またこの通知により得られるイントラ・インター判定回路４５からの指示により、インター符号化処理の場合に、この最適モードによる予測値を輝度信号及び色差信号について生成して減算回路４に出力する。 The motion prediction / compensation circuit 44 calculates the cost value Cost (Mode) of the equation (37) for all prediction modes of the luminance signal related to inter coding under the control of the intra / inter determination circuit 45, and determines each prediction mode. The prediction mode with the smallest value is detected by comparing the cost value Cost (Mode). Thereby, the motion prediction / compensation circuit 44 detects the optimum mode from the inter prediction mode, and notifies the intra / inter determination circuit 45 of the cost value Cost (Mode) of this optimum mode. In addition, according to an instruction from the intra / inter determination circuit 45 obtained by this notification, in the case of inter coding processing, a prediction value in the optimum mode is generated for the luminance signal and the color difference signal and output to the subtraction circuit 4.

イントラ予測回路４６は、同様に、イントラ・インター判定回路４５の制御により、イントラ符号化に係る輝度信号の全ての予測モードについて、（３７）式に関して上述した原画像と予測画像との誤差値ＳＡ（Ｔ）Ｄ、オフセット値ＳＡ（Ｔ）Ｄ０を計算してイントラ・インター判定回路４５に通知する。またこの通知により得られるイントラ・インター判定回路４５からの指示により、イントラ符号化処理の場合に、対応する予測値を減算回路４に出力する。またこの場合、色差信号についてコスト値を計算して最適モードを検出し、この最適モードによる予測値を減算回路４に出力する。 Similarly, the intra prediction circuit 46 controls the error value SA between the original image and the prediction image described above with respect to the expression (37) for all prediction modes of the luminance signal related to intra coding under the control of the intra / inter determination circuit 45. (T) D and the offset value SA (T) D0 are calculated and notified to the intra / inter determination circuit 45. In addition, according to an instruction from the intra / inter determination circuit 45 obtained by this notification, a corresponding predicted value is output to the subtraction circuit 4 in the case of intra coding processing. In this case, the cost value is calculated for the color difference signal to detect the optimum mode, and the predicted value in the optimum mode is output to the subtraction circuit 4.

イントラ・インター判定回路４５は、イントラ予測回路４６から通知される誤差値ＳＡ（Ｔ）Ｄ、オフセット値ＳＡ（Ｔ）Ｄ０を用いて全てのイントラ予測モードについてコスト値Ｃｏｓｔ（Ｍｏｄｅ）を計算し、この計算したコスト値Ｃｏｓｔ（Ｍｏｄｅ）と動き予測・補償回路４４から通知されるコスト値Ｃｏｓｔ（Ｍｏｄｅ）とを比較し、最もコスト値Ｃｏｓｔ（Ｍｏｄｅ）の小さな予測モードを検出する。イントラ・インター判定回路４５は、この予測モードの検出により、マクロブロック毎にイントラ予測、インター予測を選択し、また複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出する。またこの検出した最適モードによる予測値の出力をイントラ予測回路４６、動き予測・補償回路４４に指示する。 The intra / inter determination circuit 45 calculates the cost value Cost (Mode) for all intra prediction modes using the error value SA (T) D and the offset value SA (T) D0 notified from the intra prediction circuit 46, The calculated cost value Cost (Mode) is compared with the cost value Cost (Mode) notified from the motion prediction / compensation circuit 44, and the prediction mode with the smallest cost value Cost (Mode) is detected. The intra / inter determination circuit 45 selects intra prediction and inter prediction for each macroblock by detecting the prediction mode, and detects an optimum mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes. . The output of the predicted value in the detected optimum mode is instructed to the intra prediction circuit 46 and the motion prediction / compensation circuit 44.

この一連の処理において、イントラ・インター判定回路４５は、アクティビティ算出回路４２より得られるアクティビティＭＢａｃｔにより、イントラ４×４予測モードに係るコスト値Ｃｏｓｔ（Ｍｏｄｅ）を補正する。 In this series of processing, the intra / inter determination circuit 45 corrects the cost value Cost (Mode) related to the intra 4 × 4 prediction mode based on the activity MB act obtained from the activity calculation circuit 42.

すなわち（４６）式について上述したように、従来、ＬｏｗＣｏｍｐｌｅｘｉｔｙ
Ｍｏｄｅにおいて、イントラ４×４予測モードにおけるコスト値Ｃｏｓｔ（Ｍｏｄｅ）は、次式により示すように、値２４による定数に、付加情報に関する量子化パラメータから量子化値への変換関数ＱＰ０（ＱＰ）を乗算してオフセット値ＳＡ（Ｔ）Ｄ０が定義され、このオフセット値ＳＡ（Ｔ）Ｄ０によりコスト値Ｃｏｓｔ（Ｍｏｄｅ）をオフセットさせることにより、インタースライスにおけるイントラマクロブロックの発生を低減している。ここでこの値２４は、経験値に基づく値である。 That is, as described above with respect to the equation (46), conventionally, the Low Complexity.
In Mode, the cost value Cost (Mode) in the intra 4 × 4 prediction mode is expressed by the following equation, and a conversion function QP0 (QP) from a quantization parameter related to additional information to a quantization value is added to a constant of value 24. An offset value SA (T) D0 is defined by multiplication, and the cost value Cost (Mode) is offset by this offset value SA (T) D0, thereby reducing the occurrence of intra macroblocks in the inter slice. Here, this value 24 is a value based on experience values.

しかしてこのような設定により、アクティビティの高い領域において、イントラマクロブロックが選択された場合、絵柄に応じた特定の方向のイントラ予測モードが適切に選択され、これにより符号化効率を確保しつつ、画質劣化を有効に回避することができる。しかしながらアクティビティの低い領域においては、ノイズの影響により予測モードが種々に切り換わったり、またデブロックフィルタにより過剰にブロック境界歪を抑圧することになる。 With this setting, when an intra macroblock is selected in a region with high activity, an intra prediction mode in a specific direction corresponding to a picture is appropriately selected, thereby ensuring coding efficiency, Image quality degradation can be effectively avoided. However, in the low activity region, the prediction mode is switched variously due to the influence of noise, and the block boundary distortion is excessively suppressed by the deblocking filter.

これによりイントラ・インター判定回路４５は、この（７７）式に代えて、次式の演算処理によりオフセット値ＳＡ（Ｔ）Ｄ０を設定する。なおここでｆ（ＭＢａｃｔ）は、アクティビティＭＢａｃｔを変数とする関数である。これによりイントラ・インター判定回路４５は、アクティビティＭＢａｃｔにより、マクロブロックにおいて高周波成分が少ない場合、イントラ予測モードが選択され難くなるように、コスト値を補正する。より具体的に、この実施例では、このアクティビティＭＢａｃｔを変数とする関数ｆ（ＭＢａｃｔ）に単調減少関数が適用され、これによりアクティビティの低い領域程、より大きな値のオフセット値を設定して、その分、イントラマクロブロックが選択されに難くする。またこれとは逆に、アクティビティの高い領域では、小さな値のオフセット値が適用され、イントラマクロブロックを選択され易くする。なお、単調減少関数には、例えば出力値が２値の関数、一次関数、種々の関数を広く適用することができる。 Thereby, the intra / inter determination circuit 45 sets the offset value SA (T) D0 by the calculation processing of the following equation instead of the equation (77). Here, f (MB act) is a function having the activity MB act as a variable. Thereby, the intra / inter determination circuit 45 corrects the cost value so that it is difficult to select the intra prediction mode when the high-frequency component is small in the macro block by the activity MB act. More specifically, in this embodiment, a monotonically decreasing function is applied to the function f (MB act) having the activity MB act as a variable, thereby setting a larger offset value in a lower activity region. Therefore, it is difficult to select an intra macroblock. On the other hand, a small offset value is applied to a region with high activity to facilitate selection of an intra macroblock. For example, a function having a binary output value, a linear function, and various functions can be widely applied to the monotonically decreasing function.

しかして図２は、この最適予測モードに係る符号化装置４１の処理手順を示すフローチャートである。符号化装置４１においては、マクロブロック毎に、この処理手順を実行し、ステップＳＰ１によりインター予測モードに係る動き予測の処理を実行し、また続くステップＳＰ２において、各予測モードのコスト値を計算する。また続くステップＳＰ３において、この計算したコスト値の比較により、最適なインター予測モードを検出する。 FIG. 2 is a flowchart showing a processing procedure of the encoding device 41 according to the optimum prediction mode. In the encoding device 41, this processing procedure is executed for each macroblock, the motion prediction process related to the inter prediction mode is executed in step SP1, and the cost value of each prediction mode is calculated in the subsequent step SP2. . In the subsequent step SP3, an optimum inter prediction mode is detected by comparing the calculated cost values.

またこのような動き予測・補償回路４４に係る処理と同時並列的なアクティビティ算出回路４２、イントラ予測回路４６、イントラ・インター判定回路４５の処理により、ステップＳＰ４において、アクティビティを計算した後、続くステップＳＰ５において、このアクティビティによりオフセット値を計算する。また続くステップＳＰ６において、この計算したオフセット値により各イントラ予測モードに係るコスト値を計算し、ステップＳＰ７において、この計算したコスト値とステップＳＰ３で計算したコスト値とを比較し、この比較結果により続くステップＳＰ８において、最適モードを検出する。 In addition, after the activity is calculated in step SP4 by the processes of the activity calculation circuit 42, the intra prediction circuit 46, and the intra / inter determination circuit 45 that are simultaneously and parallel to the process related to the motion prediction / compensation circuit 44, the following steps are performed. In SP5, an offset value is calculated by this activity. In subsequent step SP6, a cost value related to each intra prediction mode is calculated based on the calculated offset value. In step SP7, the calculated cost value is compared with the cost value calculated in step SP3. In the following step SP8, the optimum mode is detected.

（２）実施例の動作
以上の構成において、この符号化装置４１（図１）において、順次入力されるビデオ信号Ｓ１は、アナログディジタル変換回路２により画像データＤ１に変換され、この画像データＤ１が画面並べ替えバッファ３により処理の順序に並べ替えられて減算回路４に入力される。ここで画像データＤ１は、イントラ予測、インター予測による予測値との間で減算されて減算データＤ２が生成され、この減算データＤ２が直交変換回路７、量子化回路８、可逆符号化回路１０で順次処理されて符号化データＤ４に変換され、この符号化データＤ４が例えば記録系により記録媒体に記録される。また量子化回路８の出力データが、画像データに復号されてフレームメモリ１６に参照画像として記録され、この参照画像より動き予測・補償回路４４、イントラ予測回路４６でインター予測、イントラ予測の予測値が生成される。 (2) Operation of the embodiment In the above configuration, in the encoding device 41 (FIG. 1), the video signal S1 sequentially input is converted into the image data D1 by the analog-digital conversion circuit 2, and the image data D1 is converted into the image data D1. The data is rearranged in the processing order by the screen rearrangement buffer 3 and input to the subtraction circuit 4. Here, the image data D1 is subtracted between the prediction values obtained by intra prediction and inter prediction to generate subtraction data D2, and the subtraction data D2 is generated by the orthogonal transform circuit 7, the quantization circuit 8, and the lossless encoding circuit 10. It is sequentially processed and converted into encoded data D4, and this encoded data D4 is recorded on a recording medium by a recording system, for example. Further, the output data of the quantization circuit 8 is decoded into image data and recorded as a reference image in the frame memory 16, and based on this reference image, the motion prediction / compensation circuit 44 and the intra prediction circuit 46 predict inter prediction and intra prediction prediction values. Is generated.

これら一連の処理において、画像データＤ１は、動き予測・補償回路４４、イントラ予測回路４６において、それぞれインター予測、イントラ予測の各予測モードについて、符号化効率を示すコスト関数によりコスト値が求められ、インター予測については、動き予測・補償回路４４におけるコスト値の比較により、最も符号化処理に適した最適モードが検出される。またイントラ・インター判定回路４５において、イントラ予測の各予測モードによるコスト値と、動き予測・補償回路４４で検出されたインター予測に係る最適モードのコスト値との比較により、最適な予測モードが検出される。これにより符号化装置４１では、この最適な予測モードによりイントラ予測、インター予測の何れの予測方式により符号化処理するかが決定され、イントラ予測による場合には、イントラ予測回路４６で最適モードによる予測値が生成されて減算回路４に出力される。またインター予測による場合には、動き予測・補償回路４４で最適モードによる予測値が生成されて減算回路４に出力される。これらにより符号化装置４１では、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードがマクロブロック毎に検出され、この最適モードにより画像データＤ１を順次符号化処理する。 In these series of processing, the image data D1 is obtained in the motion prediction / compensation circuit 44 and the intra prediction circuit 46 for the cost value indicating the coding efficiency for each prediction mode of inter prediction and intra prediction, respectively. For inter prediction, an optimum mode most suitable for encoding processing is detected by comparing cost values in the motion prediction / compensation circuit 44. The intra / inter determination circuit 45 detects the optimal prediction mode by comparing the cost value of each prediction mode of intra prediction with the cost value of the optimal mode related to inter prediction detected by the motion prediction / compensation circuit 44. Is done. As a result, the encoding device 41 determines whether to perform the encoding process by intra prediction or inter prediction based on the optimal prediction mode. In the case of intra prediction, the intra prediction circuit 46 performs prediction based on the optimal mode. A value is generated and output to the subtraction circuit 4. In the case of inter prediction, the motion prediction / compensation circuit 44 generates a prediction value in the optimum mode and outputs it to the subtraction circuit 4. Accordingly, the encoding device 41 detects the optimum mode for each macroblock from the plurality of intra prediction modes and the plurality of inter prediction modes by comparing the cost values with the cost function indicating the coding efficiency. D1 is sequentially encoded.

しかしてこれら各予測モードにおけるコスト値のうち、イントラ４×４予測モードにおけるコスト値は、（７７）式に示すように、従来、値２４による定数に、付加情報に関する量子化パラメータから量子化値への変換関数ＱＰ０（ＱＰ）を乗算してオフセット値ＳＡ（Ｔ）Ｄ０を計算し、このオフセット値ＳＡ（Ｔ）Ｄ０によりコスト値Ｃｏｓｔ（Ｍｏｄｅ）をオフセットさせることにより、インタースライスにおけるイントラマクロブロックの発生を低減するように設定される。 Therefore, among the cost values in each of these prediction modes, the cost value in the intra 4 × 4 prediction mode has conventionally been changed from a quantization parameter related to additional information to a quantized value based on a constant of value 24 as shown in Equation (77). An intra macro block in the inter slice is calculated by multiplying the conversion function QP0 (QP) into the offset value SA (T) D0 and offsetting the cost value Cost (Mode) by the offset value SA (T) D0. Is set so as to reduce the occurrence of.

これによりアクティビティの高い領域においては、イントラマクロブロックが選択された場合に、絵柄に応じた特定の方向のイントラ予測モードが適切に選択され、これにより符号化効率を確保しつつ、画質劣化を有効に回避することができる。しかしながらアクティビティの低い領域においては、ノイズの影響により予測モードが種々に切り換わったり、またデブロックフィルタにより過剰にブロック境界歪を抑圧することになる。 As a result, in areas with high activity, when an intra macroblock is selected, an intra prediction mode in a specific direction according to the design is appropriately selected, thereby ensuring image coding efficiency and effective image quality degradation. Can be avoided. However, in the low activity region, the prediction mode is switched variously due to the influence of noise, and the block boundary distortion is excessively suppressed by the deblocking filter.

このためこの符号化装置４１において、画像データＤ１は、アクティビティ算出回路４２において、画像の平坦度を示すパラメータとしてアクティビティが計算され、このアクティビティによりイントラ・インター判定回路４５でイントラ４×４予測モードのコスト値が補正された後、最適モードが検出される。これによりこの符号化装置４１では、アクティビティに応じて最適モードの選択を制御するように構成され、この構成により適切に最適モードを選択してアクティビティの低い領域における画質劣化を防止することが可能となる。 For this reason, in this encoding device 41, the activity of the image data D1 is calculated by the activity calculation circuit 42 as a parameter indicating the flatness of the image, and the intra / inter determination circuit 45 uses this activity in the intra 4 × 4 prediction mode. After the cost value is corrected, the optimum mode is detected. As a result, the encoding device 41 is configured to control the selection of the optimal mode according to the activity. With this configuration, it is possible to appropriately select the optimal mode and prevent image quality deterioration in a low activity area. Become.

すなわちこの符号化装置４１では、このコスト値の補正により、マクロブロックにおいて高周波成分が少ない場合に、イントラ予測モードが選択され難くなるように設定され、これによりアクティビティの低い領域におけるイントラ４×４予測モードの頻繁な切り換わりによる復号した画像のばたつき感が防止され、フリッカのような画質劣化が防止される。またさらにこのようにアクティビティの低い領域におけるデブロックフィルタによる過剰なブロック境界歪の抑圧を防止することができ、これにより見かけの解像度の低下を防止して画質劣化を防止することができる。 That is, in this encoding device 41, the correction of the cost value is set so that the intra prediction mode becomes difficult to be selected when there are few high-frequency components in the macroblock, so that the intra 4 × 4 prediction in the low activity region is performed. A fluttering feeling of the decoded image due to frequent switching of modes is prevented, and image quality deterioration such as flicker is prevented. Furthermore, it is possible to prevent excessive block boundary distortion suppression by the deblocking filter in the low activity region as described above, thereby preventing the apparent resolution from deteriorating and preventing the image quality from deteriorating.

より具体的に、この実施例では、このイントラ４×４予測モードにおいて、原画像と予測画像との誤差値ＳＡ（Ｔ）Ｄに対してオフセット値ＳＡ（Ｔ）Ｄ０を与える関数により定義されているコスト関数について、アクティビティＭＢａｃｔを変数とする関数ｆ（ＭＢａｃｔ）と、付加情報に関する量子化パラメータから量子化値への変換関数ＱＰ０（ＱＰ）との乗算値をオフセット値ＳＡ（Ｔ）Ｄ０に設定することにより、アクティビティに応じてコスト値を補正するように構成され、これによりこのアクティビティＭＢ
ａｃｔを変数とする関数ｆ（ＭＢａｃｔ）の設定により必要に応じて種々の特性によりコスト値を補正することができ、これにより簡易かつ確実に、かつ種々に画質を向上することができる。 More specifically, in this embodiment, the intra 4 × 4 prediction mode is defined by a function that gives an offset value SA (T) D0 to the error value SA (T) D between the original image and the predicted image. For the cost function, the multiplication value of the function f (MB act) having the activity MB act as a variable and the conversion function QP0 (QP) from the quantization parameter to the quantization value regarding the additional information is set as the offset value SA (T) D0. Is set to correct the cost value according to the activity, and this activity MB
By setting the function f (MB act) with act as a variable, it is possible to correct the cost value with various characteristics as necessary, thereby improving the image quality in a simple and reliable manner.

またこのアクティビティＭＢａｃｔを変数とする関数ｆ（ＭＢａｃｔ）がこの実施例では単調減少関数に設定され、これによりアクティビティの低い領域における画質劣化を確実に防止することができる。 Further, in this embodiment, the function f (MB act) having the activity MB act as a variable is set as a monotonically decreasing function, and thereby it is possible to reliably prevent image quality deterioration in a low activity area.

またこの処理基準であるアクティビティが、画像データＤ１による画素値の分散値により計算されることにより、このような画質劣化を知覚し易い領域で適切にイントラ予測モードの選択を制御することができ、これにより従来に比して一段と画質を向上することができる。 In addition, the activity that is the processing standard is calculated based on the variance value of the pixel values based on the image data D1, so that the selection of the intra prediction mode can be appropriately controlled in a region where such image quality degradation is easily perceived. As a result, the image quality can be further improved as compared with the prior art.

（３）実施例の効果
以上の構成によれば、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出して画像データを符号化処理する場合に、アクティビティによりコスト値を補正して最適モードを検出することにより、コスト関数によりイントラ予測モード、インター予測モードから最適モードを選択して画像データを符号化処理する場合に、アクティビティの低い領域における画質劣化を防止することができる。 (3) Effects of the embodiment According to the above configuration, the optimum mode is detected for each macroblock from the plurality of intra prediction modes and the plurality of inter prediction modes by comparing the cost values with the cost function indicating the coding efficiency. When image data is encoded, the cost value is corrected by activity and the optimal mode is detected, and the optimal mode is selected from the intra prediction mode and the inter prediction mode by the cost function to encode the image data. In this case, it is possible to prevent image quality deterioration in a low activity area.

またこのときこのコスト値の補正が、マクロブロックにおいて高周波成分が少ない場合に、イントラ予測モードが選択され難くなるようにするコスト値の補正であることにより、確実に、アクティビティの低い領域における画質劣化を防止することができる。 Also, at this time, the correction of the cost value is a correction of the cost value that makes it difficult to select the intra prediction mode when there are few high-frequency components in the macroblock. Can be prevented.

またこの処理基準であるアクティビティが、画像データＤ１による画素値の分散値であることにより、画質劣化を知覚し易い領域で適切にイントラ予測モードの選択を制御することができ、これにより従来に比して一段と画質を向上することができる。 In addition, since the activity that is the processing standard is a dispersion value of pixel values based on the image data D1, it is possible to appropriately control the selection of the intra prediction mode in an area where image quality degradation is easily perceived. Thus, the image quality can be further improved.

また原画像と予測画像との誤差値に対してオフセット値を与えるコスト関数によるコスト値について、アクティビティを変数とする関数と、付加情報に関する量子化パラメータから量子化値への変換関数との乗算値をオフセット値に設定し、これによりアクティビティによりコスト値を補正することにより、このアクティビティを変数とする関数の設定により必要に応じて種々の特性によりコスト値を補正することができ、これにより簡易かつ確実に、種々に画質を向上することができる。 Also, for the cost value by the cost function that gives an offset value to the error value between the original image and the predicted image, the product of the function with the activity as a variable and the conversion function from the quantization parameter to the quantization value for the additional information Is set as an offset value, and the cost value is corrected by the activity, so that the cost value can be corrected by various characteristics as required by setting the function using this activity as a variable. Certainly, the image quality can be improved in various ways.

より具体的には、このアクティビティを変数とする関数に単調減少関数を適用することにより、アクティビティの低い領域における画質劣化を確実に防止することができる。 More specifically, by applying a monotone decreasing function to a function having this activity as a variable, it is possible to reliably prevent image quality deterioration in a low activity area.

この実施例においては、実施例１について上述した１６×１６画素によるマクロブロックを単位にしたマクロブロックのアクティビティＭＢａｃｔの直接の計算に代えて、マクロブロックを水平方向及び垂直方向にそれぞれ４分割した４×４画素によるブロックを単位にしてマクロブロックのアクティビティＭＢａｃｔを計算する。なおこの実施例に係る符号化装置は、このアクティビティの検出に係るアクティビティ算出回路の構成が異なる点を除いて、実施例１について上述した符号化装置４１と同一に構成される。 In this embodiment, instead of directly calculating the macroblock activity MB act in units of macroblocks of 16 × 16 pixels described above for the first embodiment, the macroblock is divided into four in the horizontal and vertical directions, respectively. The macro block activity MB act is calculated in units of blocks of 4 × 4 pixels. The encoding apparatus according to the present embodiment is configured in the same manner as the encoding apparatus 41 described above with respect to the first embodiment except that the configuration of the activity calculation circuit related to the detection of this activity is different.

すなわちこの実施例において、アクティビティ算出回路は、マクロブロックを水平方向及び垂直方向にそれぞれ４分割した４×４画素によるブロック毎に、次式の演算処理を実行し、これによりこの４×４画素によるブロックの分散により各４×４画素ブロックのアクティビティａｃｔを検出する。 That is, in this embodiment, the activity calculation circuit executes the following arithmetic processing for each block of 4 × 4 pixels obtained by dividing the macroblock into 4 parts in the horizontal direction and the vertical direction, and thereby the 4 × 4 pixels are used. The activity act of each 4 × 4 pixel block is detected by block distribution.

またアクティビティ算出回路は、このようにして計算される各種ブロックによるアクティビティａｃｔをマクロブロックによりまとめて、マクロブロックのアクティビティＭＢａｃｔを計算する。具体的にアクティビティ算出回路は、次式の演算処理により、マクロブロックを構成する４×４画素ブロックのアクティビティａｃｔより最小値を求めてマクロブロックのアクティビティＭＢａｃｔを計算する。 Further, the activity calculation circuit calculates the activity MB act of the macro block by grouping the activity act of the various blocks calculated in this way by the macro block. Specifically, the activity calculation circuit calculates the minimum value of the activity MB act of the macro block by calculating the minimum value from the activity act of the 4 × 4 pixel block constituting the macro block by the arithmetic processing of the following equation.

この実施例のように、マクロブロックを細分割したブロック毎に、画像データＤ１による画素値の分散値を計算した後、最小値を検出してアクティビティに設定するようにしても、実施例１と同様の効果を得ることができる。 As in this embodiment, after calculating the dispersion value of the pixel value based on the image data D1 for each block obtained by subdividing the macroblock, the minimum value may be detected and set in the activity. Similar effects can be obtained.

この実施例においては、実施例２について上述した４×４画素によるブロックを単位にしたマクロブロックのアクティビティＭＢａｃｔの直接の計算に代えて、マクロブロックを水平方向及び垂直方向にそれぞれ２分割した８×８画素によるブロックを単位にしてマクロブロックのアクティビティＭＢａｃｔを計算する。なおこの実施例に係る符号化装置は、このアクティビティの検出に係るアクティビティ算出回路の構成が異なる点を除いて、実施例２について上述した符号化装置４１と同一に構成される。 In this embodiment, instead of the direct calculation of the activity MBact of the macroblock in units of blocks of 4 × 4 pixels described above for the second embodiment, the macroblock is divided into two in the horizontal direction and the vertical direction, respectively. The activity MB act of the macroblock is calculated in units of blocks of x8 pixels. Note that the encoding apparatus according to this embodiment is configured in the same manner as the encoding apparatus 41 described above with respect to the second embodiment, except that the configuration of the activity calculation circuit for detecting this activity is different.

この実施例のように、マクロブロックを細分割したブロックを８×８画素のブロックに設定して、このブロック毎に、画像データＤ１による画素値の分散値を計算した後、最小値を検出してアクティビティに設定するようにしても、実施例２と同様の効果を得ることができる。 As in this embodiment, a block obtained by subdividing a macroblock is set to a block of 8 × 8 pixels, and a variance value of pixel values based on image data D1 is calculated for each block, and then a minimum value is detected. Even if the activity is set, the same effect as in the second embodiment can be obtained.

この実施例では、上述した平均値を基準にした分散値の計算によるアクティビティＭＢａｃｔの検出に代えて、アダマール変換処理によりアクティビティＭＢａｃｔを計算する。なおこの実施例に係る符号化装置は、このアクティビティの検出に係るアクティビティ算出回路の構成が異なる点を除いて、実施例２について上述した符号化装置と同一に構成される。 In this embodiment, instead of detecting the activity MB act by calculating the variance value based on the average value, the activity MB act is calculated by Hadamard transform processing. The encoding apparatus according to the present embodiment is configured in the same manner as the encoding apparatus described above with respect to the second embodiment, except that the configuration of the activity calculation circuit for detecting the activity is different.

すなわちこの実施例において、アクティビティ算出回路は、マクロブロックを水平方向及び垂直方向にそれぞれ４分割した４×４画素によるブロック毎に、次式の演算処理を実行し、これによりアダマール変換処理により４×４画素ブロックのアクティビティａｃｔを検出する。 In other words, in this embodiment, the activity calculation circuit performs the following arithmetic processing for each block of 4 × 4 pixels obtained by dividing the macroblock into 4 parts in the horizontal direction and the vertical direction, thereby performing 4 × by Hadamard transform processing. The activity act of the 4-pixel block is detected.

なおここでＨ₄は、（８２）式により示す４次のアダマール行列である。また（８１）式の演算処理により得られる行列に対して、直流成分を除いた絶対値和を（８３）式より求め、これを当該４×４画素ブロックのアクティビティａｃｔとする。なおこのような４次のアダマール行列による４×４画素ブロックの処理に代えて、８次又は１６次のアダマール行列による８×８画素ブロック又は１６×１６画素ブロックの処理によりアクティビティを検出するようにしてもよい。 Here, H ₄ is a fourth-order Hadamard matrix expressed by the equation (82). Further, the absolute value sum excluding the DC component is obtained from the equation (83) for the matrix obtained by the arithmetic processing of the equation (81), and this is defined as the activity act of the 4 × 4 pixel block. Instead of such a 4 × 4 pixel block process using a 4th order Hadamard matrix, the activity is detected by an 8 × 8 pixel block process or a 16 × 16 pixel block process using an 8th order or 16th order Hadamard matrix. May be.

またこのようにして計算した４×４画素ブロックによるアクティビティａｃｔを（８０）式の演算処理によりマクロブロックでまとめてアクティビティＭＢａｃｔを検出する。 In addition, the activity act based on the 4 × 4 pixel block calculated in this way is collected in a macro block by the arithmetic processing of the expression (80) to detect the activity MB act.

この実施例のように、アダマール変換処理によりアクティビティＭＢａｃｔを検出するようにしても、実施例１と同様の効果を得ることができる。 Even if the activity MB act is detected by Hadamard transform processing as in this embodiment, the same effect as in the first embodiment can be obtained.

この実施例においては、イントター予測に係る最適モードの検出においても、アクティビティによりコスト関数を補正する。なおこの実施例に係る符号化装置は、このイントター予測の処理に係る動き予測・補償回路の構成が異なる点を除いて、実施例１について上述した符号化装置４１と同一に構成される。 In this embodiment, the cost function is corrected by the activity even in the detection of the optimal mode related to the inter prediction. The encoding apparatus according to the present embodiment is configured in the same manner as the encoding apparatus 41 described above with respect to the first embodiment except that the configuration of the motion prediction / compensation circuit according to the inter prediction process is different.

すなわちこの実施例において、動き予測・補償回路は、予測モードより最適モードを検出する際のコスト値の計算において、アクティビティＭＢａｃｔによりオフセット値を生成し、このオフセット値によりコスト値を補正する。これにより動き予測・補償回路は、動き補償ブロックの大きさの選択、前予測、後予測、双方向予測の選択をアクティビティにより制御する。 That is, in this embodiment, the motion prediction / compensation circuit generates an offset value based on the activity MB act and calculates the cost value based on this offset value in calculating the cost value when detecting the optimum mode from the prediction mode. Thus, the motion prediction / compensation circuit controls the selection of the size of the motion compensation block, the pre-prediction, the post-prediction, and the bi-directional prediction according to the activity.

これによりこの符号化装置では、例えばアクティビティが低い場合には、動き補償ブロックの大きさの頻繁な切り換わり、前予測、後予測、双方向予測の頻繁な切り換わりを防止し、その分、このような切り換わりによるばたついた感じを防止して一段と画質を向上する。 As a result, in this encoding apparatus, for example, when the activity is low, frequent switching of the size of the motion compensation block, frequent switching of the forward prediction, the post prediction, and the bidirectional prediction is prevented. The flickering feeling caused by such switching is prevented and the image quality is further improved.

この実施例によれば、さらに動き補償ブロックの大きさの選択、前予測、後予測、双方向予測の選択をアクティビティにより制御することにより、一段と画質を向上することができる。 According to this embodiment, it is possible to further improve the image quality by further controlling the selection of the size of the motion compensation block, the pre-prediction, the post-prediction, and the bi-directional prediction by the activity.

なお上述の実施例においては、ＡＶＣにおけるＬｏｗＣｏｍｐｌｅｘｉｔｙＭｏｄｅに本発明を適用する場合について述べたが、本発明はこれに限らず、ＨｉｇｈＣｏｍｐｌｅｘｉｔｙＭｏｄｅに適用するようにしてもよい。 In the above-described embodiments, the case where the present invention is applied to the Low Complexity Mode in AVC has been described. However, the present invention is not limited to this, and may be applied to the High Complexity Mode.

また上述の実施例においては、本発明をＡＶＣによる符号化装置に適用する場合について述べたが、本発明はこれに限らず、符号化効率を示すコスト関数によるコスト値の比較により、複数のイントラ予測モード、複数のインター予測モードから最適モードをマクロブロック毎に検出して画像データを符号化処理する場合に広く適用することができる。 In the above-described embodiments, the case where the present invention is applied to an AVC encoding apparatus has been described. However, the present invention is not limited to this, and a plurality of intra-frames can be obtained by comparing cost values using cost functions indicating encoding efficiency. The present invention can be widely applied to a case where an optimal mode is detected for each macroblock from a prediction mode and a plurality of inter prediction modes and image data is encoded.

また上述の実施例においては、本発明をハードウエアの構成に適用する場合について述べたが、本発明はこれに限らず、画像データをソフトウエアにより処理する場合にも適用することができる。なおこのようなソフトウエアに係る符号化処理、復号化処理のプログラムにおいては、例えばインターネット等のネットワークにより提供する場合、光ディスク、磁気ディスク、メモリカード等、種々の記録媒体により提供する場合に、広く適用することができる。 In the above-described embodiments, the case where the present invention is applied to the hardware configuration has been described. However, the present invention is not limited to this, and can also be applied to the case where image data is processed by software. Note that the encoding processing and decoding processing programs related to such software are widely used when provided by a network such as the Internet, when provided by various recording media such as an optical disk, a magnetic disk, and a memory card. Can be applied.

本発明は、符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体に関し、動画による撮像結果を記録するビデオカメラ、電子スチルカメラ、監視装置等に適用することができる。 The present invention relates to an encoding apparatus, an encoding method, an encoding method program, and a recording medium on which the encoding method program is recorded, and is applied to a video camera, an electronic still camera, a monitoring apparatus, and the like that record imaging results of moving images. be able to.

本発明の実施例１に係る符号化装置を示すブロック図である。It is a block diagram which shows the encoding apparatus which concerns on Example 1 of this invention. 図１の符号化装置における最適モード検出の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the optimal mode detection in the encoding apparatus of FIG. ＡＶＣ方式の符号化装置を示すブロック図である。1 is a block diagram illustrating an AVC encoding apparatus. FIG. ＡＶＣ方式の復号化装置を示すブロック図である。It is a block diagram which shows the decoding apparatus of an AVC system. ＡＶＣ方式による係数データの処理の説明に供する略線図である。It is a basic diagram with which it uses for description of the process of the coefficient data by an AVC system. ＡＶＣ方式のイントラ４×４予測モードにおける予測画素の設定の説明に供する略線図である。It is a basic diagram with which it uses for description of the setting of the prediction pixel in the intra 4 * 4 prediction mode of AVC system. イントラ４×４予測モードの説明に供する略線図である。It is a basic diagram with which it uses for description of intra 4x4 prediction mode. イントラ４×４予測モードを示す図表である。It is a chart which shows intra 4x4 prediction mode. イントラ４×４予測モードの各モードの説明に供する略線図である。It is a basic diagram with which it uses for description of each mode of intra 4x4 prediction mode. 予測モードの伝送の説明に供する略線図である。It is a basic diagram with which it uses for description of transmission of prediction mode. Ｃ言語の記述により予測モードの復号処理を示す図表である。It is a graph which shows the decoding process of prediction mode by the description of C language. イントラ１６×１６予測モードの予測画素の説明に供する略線図である。It is a basic diagram with which it uses for description of the prediction pixel of intra 16x16 prediction mode. イントラ１６×１６予測モードを示す図表である。It is a graph which shows intra 16x16 prediction mode. イントラ１６×１６予測モードの説明に供する略線図である。It is a basic diagram with which it uses for description of intra 16x16 prediction mode. 色差信号に係るイントラ予測モードの説明に供する図表である。It is a graph with which it uses for description of the intra prediction mode which concerns on a color difference signal. ＡＶＣ方式の参照フレームの説明に供する略線図である。It is a basic diagram with which it uses for description of the reference frame of an AVC system. ＡＶＣ方式の動き補償の説明に供する略線図である。It is a basic diagram with which it uses for description of the motion compensation of an AVC system. ＡＶＣ方式の動き補償精度の説明に供する略線図である。It is a basic diagram with which it uses for description of the motion compensation precision of an AVC system. 色差信号の動き補償の説明に供する略線図である。It is an approximate line figure used for explanation of motion compensation of a color difference signal. サブマクロブロックに係る動きベクトルの予測値の説明に供する略線図である。It is a basic diagram with which it uses for description of the predicted value of the motion vector which concerns on a submacroblock. 他の例による動きベクトルの予測値の説明に供する略線図である。It is a basic diagram with which it uses for description of the predicted value of the motion vector by another example. テンポラルダイレクトモードの説明に供する略線図である。It is an approximate line figure used for explanation of temporal direct mode. デブロックフィルタの処理の説明に供する略線図である。It is an approximate line figure used for explanation of processing of a deblocking filter. ブロック境界の強度の説明に供する図表である。It is a table | surface used for description of the intensity | strength of a block boundary. デブロックフィルタの強度の調整の説明に供する特性曲線図である。It is a characteristic curve figure with which it uses for description of adjustment | control of the intensity | strength of a deblocking filter. デブロックフィルタの特性の設定に係るパラメータα及びβを示す図表である。6 is a chart showing parameters α and β related to setting of characteristics of a deblocking filter. デブロックフィルタの特性の設定に係るパラメータｔｃｏを示す図表である。It is a graph which shows the parameter tco which concerns on the setting of the characteristic of a deblocking filter.

Explanation of symbols

１……符号化装置、５、２３、４６……イントラ予測回路、６、２４、４４……動き予測・補償回路、１５、２８……デブロックフィルタ、４２……アクティビティ算出回路、イントラ・インター判定回路
DESCRIPTION OF SYMBOLS 1 ... Coding device 5, 23, 46 ... Intra prediction circuit, 6, 24, 44 ... Motion prediction / compensation circuit, 15, 28 ... Deblock filter, 42 ... Activity calculation circuit, Intra / inter Judgment circuit

Claims

An encoding device that detects an optimal mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values with cost functions indicating encoding efficiency, and encodes image data in the optimal mode In
Activity calculating means for calculating an activity indicating the flatness of the image by the image data for each macroblock;
Optimum mode detection means for correcting the cost value by the activity and detecting the optimum mode;
Equipped with a,
The encoding apparatus, wherein the correction of the cost value by the optimum mode detection means is a correction of a cost value that makes it difficult to select the intra prediction mode when there are few high-frequency components in the macroblock.

The encoding apparatus according to claim 1, wherein the activity is a variance value of pixel values based on the image data.

The activity calculating means calculates a variance value of pixel values based on the image data for each block obtained by subdividing the macroblock, and detects a minimum value from the variance value of the subdivided block for each macroblock. The encoding apparatus according to claim 2 , wherein the activity is set to the activity.

The activity calculation means includes:
Performing Hadamard transform for each block obtained by subdividing the macroblock,
By calculating the absolute value sum by removing the coefficient of the DC component from the coefficient data by the processing result, the activity is calculated for each of the subdivided blocks,
The encoding apparatus according to claim 1 , wherein a minimum value is detected from the activity of the subdivided block and set as the activity of the macroblock for each macroblock.

The cost function is a function that gives an offset value with respect to an error value between an original image and a predicted image,
The activity calculation means sets the cost value by the activity by setting a multiplication value of a function having the activity as a variable and a conversion function from a quantization parameter to a quantization value regarding additional information to the offset value. The encoding device according to claim 1, wherein correction is performed.

The encoding apparatus according to claim 5 , wherein the function having the activity as a variable is a monotone decreasing function.

An encoding method for detecting an optimal mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values using a cost function indicating encoding efficiency, and encoding the image data in the optimal mode In
An activity calculating step for calculating an activity indicating the flatness of the image by the image data for each macroblock;
An optimal mode detection step of detecting the optimal mode by correcting the cost value by the activity;
I have a,
The encoding method, wherein the correction of the cost value in the step of detecting the optimum mode is a correction of a cost value that makes it difficult to select the intra prediction mode when there are few high-frequency components in the macroblock .

An optimum mode detection step for detecting an optimum mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing cost values using a cost function indicating coding efficiency, and encoding the image data by the optimum mode An encoding method program comprising:
An activity calculating step for calculating an activity indicating flatness of an image based on the image data for each macroblock;
The step of detecting the optimum mode is a step of detecting the optimum mode by correcting the cost value by the activity ,
The program of the encoding method , wherein the correction of the cost value in the optimum mode detection step is a correction of a cost value that makes it difficult to select the intra prediction mode when there are few high-frequency components in the macroblock .

In a recording medium on which a program of an encoding method executed by arithmetic processing means is recorded,
The encoding method program is:
Optimal mode detection step for detecting an optimal mode for each macroblock from a plurality of intra-prediction modes and a plurality of inter-prediction modes by comparing cost values with a cost function indicating coding efficiency;
A step of encoding processing for encoding image data in the optimum mode;
An activity calculating step for calculating an activity indicating flatness of an image based on the image data for each macroblock;
The step of detecting the optimum mode is a step of detecting the optimum mode by correcting the cost value by the activity ,
A program of an encoding method , wherein the correction of the cost value in the optimal mode detection step is a correction of a cost value that makes it difficult to select the intra prediction mode when there are few high-frequency components in the macroblock. Recorded recording medium.