JP2014135742A - Image processing device and method, and recording medium

Info

Publication number: JP2014135742A
Application number: JP2014030815A
Authority: JP
Inventors: Kazufumi Sato; 数史佐藤
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2014-02-20
Filing date: 2014-02-20
Publication date: 2014-07-24
Anticipated expiration: 2029-04-24
Also published as: JP5776804B2

Abstract

PROBLEM TO BE SOLVED: To improve encoding efficiency in intra prediction. SOLUTION: When the optimal intra prediction mode is mode 0, the adjacent pixels used for prediction of the target block are pixels A0, A1, A2, and A3. From these pixels, a 6-tap FIR filter generates pixels a-0.5, a+0.5, and so on with 1/2-pixel precision, and linear interpolation then generates pixels a-0.75, a-0.25, a+0.25, and a+0.75 with 1/4-pixel precision. The optimal shift amount is then determined, with the values from -0.75 to +0.75, which are the phase differences between an integer pixel and the generated fractional-precision pixels, serving as candidates for the shift amount in the horizontal direction. The present invention may be applied, for example, to an image encoding device that performs encoding using the H.264/AVC system.

Description

The present invention relates to an image processing device and method, and a recording medium, and more particularly to an image processing device and method, and a recording medium, that improve encoding efficiency in intra prediction.

In recent years, devices have become widespread that handle image information as digital data and, for the purpose of efficient transmission and storage of information, compress and encode images using coding schemes that exploit the redundancy specific to image information by means of an orthogonal transform, such as the discrete cosine transform, and motion compensation. Such coding schemes include, for example, MPEG (Moving Picture Experts Group).

In particular, MPEG2 (ISO/IEC 13818-2) is defined as a general-purpose image coding scheme and is a standard that covers both interlaced and progressive images, as well as standard-resolution and high-definition images. MPEG2 is currently widely used in a broad range of professional and consumer applications. With the MPEG2 compression scheme, a code amount (bit rate) of 4 to 8 Mbps is allocated to, for example, a standard-resolution interlaced image of 720×480 pixels, and a code amount (bit rate) of 18 to 22 Mbps is allocated to, for example, a high-resolution interlaced image of 1920×1088 pixels. This makes it possible to achieve a high compression rate and good image quality.

MPEG2 mainly targeted high-quality coding suitable for broadcasting, but did not support coding at a code amount (bit rate) lower than that of MPEG1, that is, at a higher compression rate. With the spread of mobile terminals, the need for such coding schemes was expected to grow, and the MPEG4 coding scheme was standardized in response. Its image coding specification was approved as the international standard ISO/IEC 14496-2 in December 1998.

Furthermore, in recent years, standardization of a scheme called H.26L (ITU-T Q6/16 VCEG), originally intended for image coding for video conferencing, has been progressing. H.26L is known to achieve higher coding efficiency than conventional coding schemes such as MPEG2 and MPEG4, although it requires more computation for encoding and decoding. In addition, as part of the MPEG4 activities, standardization that builds on H.26L and also incorporates functions not supported by H.26L to achieve higher coding efficiency has been carried out as the Joint Model of Enhanced-Compression Video Coding. On the standardization schedule, this became an international standard in March 2003 under the names H.264 and MPEG-4 Part 10 (Advanced Video Coding, hereinafter referred to as H.264/AVC).

Furthermore, as an extension, standardization of FRExt (Fidelity Range Extension), which includes coding tools needed for professional use such as RGB, 4:2:2, and 4:4:4, as well as the 8x8 DCT and quantization matrices specified in MPEG-2, was completed in February 2005. As a result, H.264/AVC became a coding scheme that can also represent well the film noise contained in movies, and it has come to be used in a wide range of applications such as Blu-Ray Disc (trademark).

Recently, however, there has been a growing need for coding at even higher compression rates, for example to compress images of about 4000×2000 pixels, four times the size of a high-definition image, or to deliver high-definition images in environments with limited transmission capacity such as the Internet. For this reason, the aforementioned VCEG (Video Coding Expert Group) under the ITU-T is continuing to study improvements in coding efficiency.

For example, in the MPEG2 scheme, motion prediction/compensation with 1/2-pixel accuracy is performed by linear interpolation. In the H.264/AVC scheme, by contrast, prediction/compensation with 1/4-pixel accuracy is performed using a 6-tap FIR (Finite Impulse Response) filter.
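
The difference between the two interpolation schemes can be illustrated with a minimal sketch. It assumes the 6-tap coefficients (1, -5, 20, 20, -5, 1)/32 and the rounding used by H.264/AVC motion compensation; the coefficients are not stated in this passage, and the function names are hypothetical.

```python
def clip8(v):
    """Clamp a value to the 8-bit sample range."""
    return max(0, min(255, v))

def half_pel_bilinear(row, i):
    """MPEG2-style half-sample value: rounded average of row[i] and row[i + 1]."""
    return (row[i] + row[i + 1] + 1) >> 1

def half_pel_6tap(row, i):
    """Half-sample value between row[i] and row[i + 1] using the assumed
    6-tap FIR (1, -5, 20, 20, -5, 1)/32 over row[i - 2] .. row[i + 3]."""
    taps = (1, -5, 20, 20, -5, 1)
    acc = sum(t * row[i - 2 + k] for k, t in enumerate(taps))
    return clip8((acc + 16) >> 5)

def quarter_pel(a, b):
    """Quarter-sample value: rounded average of two neighbouring
    integer-sample or half-sample values (linear interpolation)."""
    return (a + b + 1) >> 1

# A single bright sample: the longer filter weights the two centre taps more
# heavily than a plain average does.
row = [0, 0, 0, 255, 0, 0, 0, 0]
bilinear = half_pel_bilinear(row, 2)
six_tap = half_pel_6tap(row, 2)
```

The longer filter preserves more high-frequency detail than a two-sample average, which is the usual motivation for using it at fractional positions.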

In recent years, studies have been conducted to improve the efficiency of the H.264/AVC scheme beyond this 1/4-pixel-accuracy prediction/compensation. As one coding scheme for this purpose, Non-Patent Document 1 proposes motion prediction with 1/8-pixel accuracy.

That is, in Non-Patent Document 1, interpolation with 1/2-pixel accuracy is performed by the filter [-3, 12, -39, 158, 158, -39, 12, -3]/256. Interpolation with 1/4-pixel accuracy is performed by the filter [-3, 12, -37, 229, 71, -21, 6, -1]/256, and interpolation with 1/8-pixel accuracy is performed by linear interpolation.
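
As a rough sketch, the two 8-tap filters quoted above can be applied as follows. The coefficients are taken from the text; the rounding offset, the clipping to 8 bits, and the function names are assumptions of this example, not details given in the source.

```python
HALF_TAPS = (-3, 12, -39, 158, 158, -39, 12, -3)   # 1/2-pixel filter, divided by 256
QUARTER_TAPS = (-3, 12, -37, 229, 71, -21, 6, -1)  # 1/4-pixel filter, divided by 256

def fir8(samples, taps):
    """Apply an 8-tap filter to 8 consecutive integer samples.
    Rounding (+128) and clipping to 0..255 are assumptions of this sketch."""
    acc = sum(t * s for t, s in zip(taps, samples))
    return max(0, min(255, (acc + 128) >> 8))

def eighth_pel(a, b):
    """1/8-pixel value by linear interpolation of two neighbouring values."""
    return (a + b + 1) >> 1

# Both filters sum to 256, so a flat region is reproduced unchanged.
window = [20, 20, 20, 20, 120, 120, 120, 120]
half = fir8(window, HALF_TAPS)
quarter = fir8(window, QUARTER_TAPS)
eighth = eighth_pel(window[3], quarter)
```

Because each coefficient set sums to 256, the division by 256 keeps the DC level of the signal unchanged.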

By performing motion prediction using interpolation with higher pixel accuracy in this way, prediction accuracy can be improved and coding efficiency increased, particularly for sequences that have high-resolution texture and relatively slow motion.

One of the factors that allow the H.264/AVC scheme to achieve higher coding efficiency than the conventional MPEG2 scheme and the like is the adoption of the intra prediction scheme described next.

In the H.264/AVC scheme, for luminance signals, nine intra prediction modes in units of 4×4-pixel and 8×8-pixel blocks and four intra prediction modes in units of 16×16-pixel macroblocks are defined. For color difference signals, four intra prediction modes in units of 8×8-pixel blocks are defined. The intra prediction mode for color difference signals can be set independently of the intra prediction mode for luminance signals. The prediction modes correspond to the directions indicated by the numbers 0, 1, and 3 to 8 in FIG. 1. Prediction mode 2 is average value prediction.
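
As an illustration of how a 4×4 block is predicted from its adjacent pixels, the following minimal sketch covers only three of the nine modes. It relies on the H.264/AVC numbering in which mode 0 is vertical, mode 1 is horizontal, and mode 2 is the average (DC) mode; the function name and the list-of-rows representation are hypothetical, and the directional modes 3 to 8 are omitted.

```python
def predict_4x4(mode, top, left):
    """Return a 4x4 prediction (list of rows) from the 4 pixels above the
    block (top) and the 4 pixels to its left (left)."""
    if mode == 0:   # vertical: copy the row above down every column
        return [list(top) for _ in range(4)]
    if mode == 1:   # horizontal: copy the left column across every row
        return [[left[y]] * 4 for y in range(4)]
    if mode == 2:   # average value prediction over the eight neighbours
        dc = (sum(top) + sum(left) + 4) >> 3
        return [[dc] * 4 for _ in range(4)]
    raise ValueError("directional modes 3 to 8 are omitted from this sketch")

vertical = predict_4x4(0, [1, 2, 3, 4], [5, 6, 7, 8])
horizontal = predict_4x4(1, [1, 2, 3, 4], [5, 6, 7, 8])
average = predict_4x4(2, [1, 2, 3, 4], [5, 6, 7, 8])
```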

Adopting such an intra prediction scheme improves prediction accuracy. However, in the H.264/AVC scheme, as indicated by the directions in FIG. 1, intra prediction is performed only in units of 22.5°. Therefore, when the slope of an edge is at any other angle, the improvement in coding efficiency is limited.

Therefore, to further improve coding efficiency, Non-Patent Document 2 proposes performing prediction at angles finer than units of 22.5°.

Non-Patent Document 1: "Motion compensated prediction with 1/8-pel displacement vector resolution", VCEG-AD09, ITU-Telecommunications Standardization Sector STUDY GROUP Question 6 Video coding Experts Group (VCEG), 23-27 Oct 2006
Non-Patent Document 2: Virginie Drugeon, Thomas Wedi, and Torsten Palfner, "High Precision Edge Prediction for Intra Coding", 2008

However, whereas intra prediction in the H.264/AVC scheme uses predetermined adjacent pixels of the block to be encoded, the proposal described in Non-Patent Document 2 must also use pixels other than the adjacent pixels of the block to be encoded.

Therefore, with the proposal described in Non-Patent Document 2, although prediction can be performed at angles finer than units of 22.5°, the number of memory accesses and the amount of processing increase.

The present invention has been made in view of such circumstances, and further improves encoding efficiency in intra prediction without increasing the number of memory accesses or the amount of processing.

An image processing device according to one aspect of the present invention includes: a memory that stores, for an image to be encoded, adjacent pixels that are referred to when intra prediction is performed on the pixels of a target block to be encoded; a selection unit that selects, according to the prediction direction and block size used when intra prediction is performed on the pixels of the target block, whether or not to shift the phase of the adjacent pixels read from the memory; an intra prediction unit that performs intra prediction on the pixels of the target block using the adjacent pixels and generates a predicted image; and an encoding unit that encodes the image using the predicted image generated by the intra prediction unit.

When the selection unit selects shifting the phase of the adjacent pixels read from the memory, the intra prediction unit can perform intra prediction on the pixels of the target block using the phase-shifted adjacent pixels.

When the selection unit selects not shifting the phase of the adjacent pixels read from the memory, the intra prediction unit can perform intra prediction on the pixels of the target block using the adjacent pixels whose phase has not been shifted.

In an image processing method according to one aspect of the present invention, an image processing device, for an image to be encoded, selects, according to the prediction direction and block size used when intra prediction is performed on the pixels of a target block to be encoded, whether or not to shift the phase of adjacent pixels read from a memory that stores the adjacent pixels referred to when intra prediction is performed on the pixels of the target block; performs intra prediction on the pixels of the target block using the adjacent pixels to generate a predicted image; and encodes the image using the generated predicted image.

When shifting the phase of the adjacent pixels read from the memory is selected, intra prediction can be performed on the pixels of the target block using the phase-shifted adjacent pixels.

When not shifting the phase of the adjacent pixels read from the memory is selected, intra prediction can be performed on the pixels of the target block using the adjacent pixels whose phase has not been shifted.

A recording medium according to one aspect of the present invention is a recording medium on which an encoded stream generated by the image processing device according to one aspect of the present invention is recorded.

In one aspect of the present invention, for an image to be encoded, it is selected, according to the prediction direction and block size used when intra prediction is performed on the pixels of a target block to be encoded, whether or not to shift the phase of adjacent pixels read from a memory that stores the adjacent pixels referred to when intra prediction is performed on the pixels of the target block. Intra prediction is then performed on the pixels of the target block using the adjacent pixels, a predicted image is generated, and the image is encoded using the generated predicted image.

Each of the image processing devices described above may be an independent device, or may be an internal block constituting a single image encoding device or image decoding device.

According to the present invention, a predicted image can be generated by intra prediction. Further, according to the present invention, encoding efficiency can be improved without increasing the number of memory accesses or the amount of processing.

FIG. 1 is a diagram illustrating the directions of 4×4-pixel intra prediction.
FIG. 2 is a block diagram showing the configuration of an embodiment of an image encoding device to which the present invention is applied.
FIG. 3 is a diagram illustrating motion prediction/compensation processing with 1/4-pixel accuracy.
FIG. 4 is a diagram illustrating a multi-reference-frame motion prediction/compensation scheme.
FIG. 5 is a diagram illustrating an example of a method of generating motion vector information.
FIG. 6 is a block diagram showing a configuration example of the intra prediction unit and the adjacent pixel interpolation unit.
FIG. 7 is a flowchart illustrating the encoding process of the image encoding device of FIG. 2.
FIG. 8 is a flowchart illustrating the prediction process of step S21 of FIG. 7.
FIG. 9 is a diagram illustrating the processing order in the 16×16-pixel intra prediction mode.
FIG. 10 is a diagram showing the types of 4×4-pixel intra prediction modes for luminance signals.
FIG. 11 is a diagram showing the types of 4×4-pixel intra prediction modes for luminance signals.
FIG. 12 is a diagram illustrating the directions of 4×4-pixel intra prediction.
FIG. 13 is a diagram illustrating 4×4-pixel intra prediction.
FIG. 14 is a diagram illustrating encoding of the 4×4-pixel intra prediction mode for luminance signals.
FIG. 15 is a diagram showing the types of 16×16-pixel intra prediction modes for luminance signals.
FIG. 16 is a diagram showing the types of 16×16-pixel intra prediction modes for luminance signals.
FIG. 17 is a diagram illustrating 16×16-pixel intra prediction.
FIG. 18 is a diagram illustrating an operation for realizing intra prediction with fractional pixel accuracy.
FIG. 19 is a diagram illustrating an example of the effect of intra prediction with fractional pixel accuracy.
FIG. 20 is a flowchart illustrating the intra prediction process of step S31 of FIG. 8.
FIG. 21 is a flowchart illustrating the adjacent pixel interpolation process of step S45 of FIG. 20.
FIG. 22 is a flowchart illustrating the inter motion prediction process of step S32 of FIG. 8.
FIG. 23 is a block diagram showing another configuration example of the intra prediction unit and the adjacent pixel interpolation unit.
FIG. 24 is a flowchart illustrating another example of the intra prediction process of step S31 of FIG. 8.
FIG. 25 is a flowchart illustrating the adjacent pixel interpolation process of step S101 of FIG. 24.
FIG. 26 is a block diagram showing the configuration of an embodiment of an image decoding device to which the present invention is applied.
FIG. 27 is a block diagram showing another configuration example of the intra prediction unit and the adjacent pixel interpolation unit.
FIG. 28 is a flowchart illustrating the decoding process of the image decoding device of FIG. 26.
FIG. 29 is a flowchart illustrating the prediction process of step S138 of FIG. 28.
FIG. 30 is a block diagram showing a configuration example of computer hardware.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.

[Configuration Example of Image Encoding Device]
FIG. 2 shows the configuration of an embodiment of an image encoding device as an image processing device to which the present invention is applied.

This image encoding device 51 compresses and encodes images using, for example, the H.264 and MPEG-4 Part 10 (Advanced Video Coding) scheme (hereinafter referred to as H.264/AVC).

In the example of FIG. 2, the image encoding device 51 comprises an A/D conversion unit 61, a screen rearrangement buffer 62, a calculation unit 63, an orthogonal transform unit 64, a quantization unit 65, a lossless encoding unit 66, an accumulation buffer 67, an inverse quantization unit 68, an inverse orthogonal transform unit 69, a calculation unit 70, a deblocking filter 71, a frame memory 72, a switch 73, an intra prediction unit 74, an adjacent pixel interpolation unit 75, a motion prediction/compensation unit 76, a predicted image selection unit 77, and a rate control unit 78.

The A/D conversion unit 61 performs A/D conversion on an input image and outputs the result to the screen rearrangement buffer 62 for storage. The screen rearrangement buffer 62 rearranges the stored frames from display order into the order used for encoding, in accordance with the GOP (Group of Pictures).
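
The rearrangement from display order to encoding order can be sketched as follows. This is an illustrative example only: it assumes a simple GOP in which every B picture is coded after the next I or P picture in display order, and the function name and the (index, type) tuples are hypothetical.

```python
def display_to_coding_order(frames):
    """Reorder (display_index, picture_type) tuples so that each B picture
    follows the I or P picture that comes after it in display order."""
    coded, pending_b = [], []
    for frame in frames:
        if frame[1] == "B":
            pending_b.append(frame)   # hold until the next reference picture
        else:
            coded.append(frame)       # the I or P picture is coded first
            coded.extend(pending_b)   # then the B pictures displayed before it
            pending_b = []
    return coded + pending_b          # trailing B pictures, if any

gop = [(0, "I"), (1, "B"), (2, "B"), (3, "P"), (4, "B"), (5, "B"), (6, "P")]
coding_order = [index for index, _ in display_to_coding_order(gop)]
```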

The calculation unit 63 subtracts, from the image read from the screen rearrangement buffer 62, the predicted image from the intra prediction unit 74 or the predicted image from the motion prediction/compensation unit 76, whichever has been selected by the predicted image selection unit 77, and outputs the difference information to the orthogonal transform unit 64. The orthogonal transform unit 64 applies an orthogonal transform, such as the discrete cosine transform or the Karhunen-Loeve transform, to the difference information from the calculation unit 63 and outputs the transform coefficients. The quantization unit 65 quantizes the transform coefficients output by the orthogonal transform unit 64.
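
The subtraction, transform, and quantization steps can be illustrated with a minimal sketch. It assumes the 4×4 integer core transform of H.264/AVC as the orthogonal transform and a plain uniform quantizer; the text above names only the general class of transform, so both choices, and all function names, are assumptions of this example.

```python
# Forward 4x4 integer core transform matrix of H.264/AVC (an approximation of the DCT).
CF = [[1, 1, 1, 1],
      [2, 1, -1, -2],
      [1, -1, -1, 1],
      [1, -2, 2, -1]]

def matmul(a, b):
    """Multiply two 4x4 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def transpose(m):
    return [list(r) for r in zip(*m)]

def residual(block, pred):
    """Difference information: input block minus predicted image."""
    return [[block[y][x] - pred[y][x] for x in range(4)] for y in range(4)]

def core_transform(res):
    """Transform coefficients W = CF * X * CF^T."""
    return matmul(matmul(CF, res), transpose(CF))

def quantize(coeffs, step):
    """Uniform quantization truncating toward zero (a simplification; the
    standard's scaling and rounding rules are not reproduced here)."""
    return [[int(c / step) for c in row] for row in coeffs]

block = [[12] * 4 for _ in range(4)]
pred = [[10] * 4 for _ in range(4)]
coeffs = core_transform(residual(block, pred))
levels = quantize(coeffs, 8)
```

For a flat residual, all of the energy ends up in the single DC coefficient, which is what makes the transformed data cheaper to encode.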

The quantized transform coefficients output by the quantization unit 65 are input to the lossless encoding unit 66, where they are compressed by lossless encoding such as variable-length coding or arithmetic coding.

The lossless encoding unit 66 acquires information indicating intra prediction and the like from the intra prediction unit 74, and acquires information indicating the inter prediction mode and the like from the motion prediction/compensation unit 76. Hereinafter, the information indicating intra prediction is also referred to as intra prediction mode information, and the information indicating the inter prediction mode is also referred to as inter prediction mode information.

The lossless encoding unit 66 encodes the quantized transform coefficients and also encodes the information indicating intra prediction, the information indicating the inter prediction mode, and the like, making them part of the header information of the compressed image. The lossless encoding unit 66 supplies the encoded data to the accumulation buffer 67 for accumulation.

For example, the lossless encoding unit 66 performs lossless encoding such as variable-length coding or arithmetic coding. An example of variable-length coding is CAVLC (Context-Adaptive Variable Length Coding) defined in the H.264/AVC scheme. An example of arithmetic coding is CABAC (Context-Adaptive Binary Arithmetic Coding).

The accumulation buffer 67 outputs the data supplied from the lossless encoding unit 66, as a compressed image encoded in the H.264/AVC scheme, to, for example, a recording device or transmission path (not shown) in a subsequent stage.

The quantized transform coefficients output from the quantization unit 65 are also input to the inverse quantization unit 68, where they are inversely quantized, and are then inversely orthogonally transformed by the inverse orthogonal transform unit 69. The calculation unit 70 adds the inversely transformed output to the predicted image supplied from the predicted image selection unit 77 to obtain a locally decoded image. The deblocking filter 71 removes block distortion from the decoded image and then supplies it to the frame memory 72 for accumulation. The image before deblocking by the deblocking filter 71 is also supplied to and accumulated in the frame memory 72.
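
The local decoding loop described above can be sketched end to end for a single 4×4 block. This is a simplified illustration: it assumes a floating-point orthonormal DCT in place of the codec's integer transform and a uniform quantizer, it omits the deblocking filter, and the function name is hypothetical.

```python
import math

N = 4
# Orthonormal DCT-II basis (assumed here for simplicity; rows are basis vectors).
C = [[(math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N))
      * math.cos((2 * n + 1) * k * math.pi / (2 * N))
      for n in range(N)] for k in range(N)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]

def transpose(m):
    return [list(r) for r in zip(*m)]

def encode_and_reconstruct(block, pred, step):
    """Return (quantized levels, locally decoded block) for one 4x4 block."""
    res = [[block[y][x] - pred[y][x] for x in range(N)] for y in range(N)]
    coeffs = matmul(matmul(C, res), transpose(C))                # forward transform
    levels = [[round(c / step) for c in row] for row in coeffs]  # quantization
    deq = [[lv * step for lv in row] for row in levels]          # inverse quantization
    res_hat = matmul(matmul(transpose(C), deq), C)               # inverse transform
    recon = [[max(0, min(255, round(pred[y][x] + res_hat[y][x])))
              for x in range(N)] for y in range(N)]              # add the predicted image
    return levels, recon

block = [[12] * N for _ in range(N)]
pred = [[10] * N for _ in range(N)]
levels, recon = encode_and_reconstruct(block, pred, 4)
```

The encoder reconstructs from the quantized levels rather than from the original residual so that its reference pixels match what a decoder will have available.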

The switch 73 outputs the reference image accumulated in the frame memory 72 to the motion prediction/compensation unit 76 or the intra prediction unit 74.

In this image encoding device 51, for example, I pictures, B pictures, and P pictures from the screen rearrangement buffer 62 are supplied to the intra prediction unit 74 as images to be intra predicted (also referred to as intra processing). B pictures and P pictures read from the screen rearrangement buffer 62 are supplied to the motion prediction/compensation unit 76 as images to be inter predicted (also referred to as inter processing).

The intra prediction unit 74 performs intra prediction in all candidate intra prediction modes, based on the image to be intra predicted read from the screen rearrangement buffer 62 and the reference image supplied from the frame memory 72, and generates predicted images.

The intra prediction unit 74 calculates a cost function value for each intra prediction mode in which a predicted image has been generated, and selects the intra prediction mode whose cost function value is the minimum as the optimal intra prediction mode. The intra prediction unit 74 supplies the adjacent pixels of the target block to be intra predicted and the information on the optimal intra prediction mode to the adjacent pixel interpolation unit 75.
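
Selecting the mode with the minimum cost can be sketched as below. The passage above does not define the cost function, so the sum of absolute differences (SAD) is used here purely as a stand-in; practical encoders often use a cost that also accounts for the bit rate. The function names are hypothetical.

```python
def sad(block, pred):
    """Sum of absolute differences between a block and its prediction."""
    return sum(abs(b - p)
               for block_row, pred_row in zip(block, pred)
               for b, p in zip(block_row, pred_row))

def best_mode(block, candidates):
    """candidates maps a mode number to its predicted block.
    Returns (mode, cost) for the candidate with the minimum cost."""
    costs = {mode: sad(block, pred) for mode, pred in candidates.items()}
    mode = min(costs, key=costs.get)
    return mode, costs[mode]

block = [[1, 2, 3, 4] for _ in range(4)]
candidates = {
    0: [[1, 2, 3, 4] for _ in range(4)],   # e.g. a vertical prediction
    1: [[5] * 4 for _ in range(4)],        # e.g. a horizontal prediction
    2: [[3] * 4 for _ in range(4)],        # e.g. an average value prediction
}
mode, cost = best_mode(block, candidates)
```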

The adjacent pixel interpolation unit 75 shifts the phase of the adjacent pixels by each candidate shift amount, in the shift direction corresponding to the optimal intra prediction mode from the intra prediction unit 74. In practice, the adjacent pixel interpolation unit 75 applies a 6-tap FIR filter to the adjacent pixels along the shift direction corresponding to the optimal intra prediction mode and then performs linear interpolation, thereby shifting the phase of the adjacent pixels to fractional pixel accuracy. Accordingly, for convenience of explanation, adjacent pixels whose phase has been shifted by the 6-tap FIR filter and linear interpolation are referred to below either as interpolated adjacent pixels or as phase-shifted adjacent pixels; the two terms have the same meaning.
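
A minimal sketch of this phase shift for a row of adjacent pixels is given below. It assumes the H.264/AVC coefficients (1, -5, 20, 20, -5, 1)/32 for the 6-tap FIR filter and edge replication at the ends of the row; neither detail is stated in the passage above, and the function names are hypothetical. Shift amounts are expressed in quarter-pixel units, so -3 to +3 corresponds to -0.75 to +0.75 pixel.

```python
def clip8(v):
    return max(0, min(255, v))

def sample(row, i):
    """Integer-position pixel with edge replication (an assumption of this sketch)."""
    return row[max(0, min(len(row) - 1, i))]

def half_sample(row, i):
    """Half-pixel value between row[i] and row[i + 1] from the assumed 6-tap FIR."""
    taps = (1, -5, 20, 20, -5, 1)
    acc = sum(t * sample(row, i - 2 + k) for k, t in enumerate(taps))
    return clip8((acc + 16) >> 5)

def value_at(row, q):
    """Pixel value at position q / 4, where q is in quarter-pixel units."""
    i, frac = divmod(q, 4)   # floor division, so negative q is handled too
    if frac == 0:
        return sample(row, i)
    if frac == 2:
        return half_sample(row, i)
    # Quarter positions: linear interpolation of the nearest integer and half values.
    lo = sample(row, i) if frac == 1 else half_sample(row, i)
    hi = half_sample(row, i) if frac == 1 else sample(row, i + 1)
    return (lo + hi + 1) >> 1

def shift_neighbours(row, start, count, shift_q):
    """Return count adjacent pixels starting at integer index start,
    with their phase shifted by shift_q / 4 pixel."""
    return [value_at(row, 4 * (start + n) + shift_q) for n in range(count)]

ramp = [0, 10, 20, 30, 40, 50, 60, 70, 80, 90]
shifted = shift_neighbours(ramp, 3, 4, -2)   # shift by -0.5 pixel
```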

The adjacent pixel interpolation unit 75 supplies the phase-shifted adjacent pixels to the intra prediction unit 74.

The intra prediction unit 74 determines the optimal phase shift amount for the adjacent pixels using the pixel values of the adjacent pixels from the adjacent image buffer 81 and the pixel values of the adjacent pixels whose phase has been shifted by the adjacent pixel interpolation unit 75. The intra prediction unit 74 also generates a predicted image of the target block using the pixel values of the adjacent pixels whose phase has been shifted by the determined optimal shift amount, and supplies the generated predicted image and the cost function value calculated for the corresponding optimal intra prediction mode to the predicted image selection unit 77.
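
The search for the optimal shift amount can be illustrated, for mode 0 (vertical prediction) only, by the following sketch. It makes several simplifying assumptions that are not in the source: SAD stands in for the cost function, plain linear interpolation stands in for the FIR-based interpolation, and the edges of the row are replicated. All function names are hypothetical.

```python
def at(row, i):
    """Pixel with edge replication (an assumption of this sketch)."""
    return row[max(0, min(len(row) - 1, i))]

def shifted_top(row, start, shift_q):
    """Four pixels above the block, shifted by shift_q / 4 pixel.
    Linear interpolation only: a simplification of the FIR-based interpolation."""
    out = []
    for n in range(4):
        i, frac = divmod(4 * (start + n) + shift_q, 4)
        out.append(((4 - frac) * at(row, i) + frac * at(row, i + 1) + 2) >> 2)
    return out

def sad(block, pred_row):
    """SAD between a 4x4 block and a vertical prediction built from one row."""
    return sum(abs(p - r) for line in block for p, r in zip(line, pred_row))

def best_shift(block, row, start):
    """Try the candidates -3/4 .. +3/4 pixel (0 means no shift) and return
    (shift in quarter-pixel units, cost) for the minimum-cost candidate."""
    return min(((s, sad(block, shifted_top(row, start, s))) for s in range(-3, 4)),
               key=lambda pair: pair[1])

top_row = [0, 40, 80, 120, 160, 200, 240, 240]
block = [[100, 140, 180, 220] for _ in range(4)]   # matches a +0.5 pixel shift
shift_q, cost = best_shift(block, top_row, 2)
```

Because the unshifted case is one of the candidates, the search can never do worse than conventional integer-position prediction under the chosen cost.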

When the predicted image selection unit 77 selects the predicted image generated in the optimal intra prediction mode, the intra prediction unit 74 supplies information indicating the optimal intra prediction mode and information on the optimal shift amount to the lossless encoding unit 66. When this information is sent from the intra prediction unit 74, the lossless encoding unit 66 encodes it and makes it part of the header information of the compressed image.

The motion prediction/compensation unit 76 performs motion prediction/compensation for all candidate inter prediction modes. That is, the motion prediction/compensation unit 76 is supplied with the image to be inter processed, read from the screen rearrangement buffer 62, and with the reference image from the frame memory 72 via the switch 73. Based on the image to be inter processed and the reference image, the motion prediction/compensation unit 76 detects motion vectors for all candidate inter prediction modes, applies compensation to the reference image based on the motion vectors, and generates predicted images.
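
Motion vector detection can be sketched as an exhaustive block-matching search. The example below is only illustrative: it searches integer displacements, uses SAD as the matching criterion, and skips candidates that fall outside the reference picture, none of which is prescribed by the passage above. The function names are hypothetical.

```python
def sad_at(cur, ref, bx, by, dx, dy, size):
    """SAD between the current block at (bx, by) and the reference block
    displaced by (dx, dy)."""
    return sum(abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
               for y in range(size) for x in range(size))

def full_search(cur, ref, bx, by, size, search_range):
    """Return ((dx, dy), cost) minimising SAD over integer displacements."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            x0, y0 = bx + dx, by + dy
            if x0 < 0 or y0 < 0 or x0 + size > width or y0 + size > height:
                continue   # candidate block lies outside the reference picture
            cost = sad_at(cur, ref, bx, by, dx, dy, size)
            if best is None or cost < best[1]:
                best = ((dx, dy), cost)
    return best

# Reference picture with unique pixel values; the current picture is the
# reference displaced by (2, 1), with replication at the border.
ref = [[10 * y + x for x in range(8)] for y in range(8)]
cur = [[ref[min(y + 1, 7)][min(x + 2, 7)] for x in range(8)] for y in range(8)]
vector, cost = full_search(cur, ref, 2, 2, 2, 2)
```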

また、動き予測・補償部７６は、候補となる全てのインター予測モードに対してコスト関数値を算出する。動き予測・補償部７６は、算出したコスト関数値のうち、最小値を与える予測モードを、最適インター予測モードとして決定する。 Also, the motion prediction / compensation unit 76 calculates cost function values for all candidate inter prediction modes. The motion prediction / compensation unit 76 determines the prediction mode that gives the minimum value among the calculated cost function values as the optimal inter prediction mode.

動き予測・補償部７６は、最適インター予測モードで生成された予測画像とそのコスト関数値を、予測画像選択部７７に供給する。動き予測・補償部７６は、予測画像選択部７７により最適インター予測モードで生成された予測画像が選択された場合、最適インター予測モードを示す情報（インター予測モード情報）を可逆符号化部６６に出力する。 The motion prediction/compensation unit 76 supplies the predicted image generated in the optimal inter prediction mode and its cost function value to the predicted image selection unit 77. When the predicted image generated in the optimal inter prediction mode is selected by the predicted image selection unit 77, the motion prediction/compensation unit 76 outputs information indicating the optimal inter prediction mode (inter prediction mode information) to the lossless encoding unit 66.

なお、必要であれば、動きベクトル情報、フラグ情報、参照フレーム情報なども可逆符号化部６６に出力される。可逆符号化部６６は、動き予測・補償部７６からの情報をやはり可変長符号化、算術符号化といった可逆符号化処理し、圧縮画像のヘッダ部に挿入する。 If necessary, motion vector information, flag information, reference frame information, and the like are also output to the lossless encoding unit 66. The lossless encoding unit 66 performs lossless encoding processing such as variable length encoding and arithmetic encoding on the information from the motion prediction / compensation unit 76 and inserts the information into the header portion of the compressed image.

予測画像選択部７７は、イントラ予測部７４または動き予測・補償部７６より出力された各コスト関数値に基づいて、最適イントラ予測モードと最適インター予測モードから、最適予測モードを決定する。そして、予測画像選択部７７は、決定された最適予測モードの予測画像を選択し、演算部６３，７０に供給する。このとき、予測画像選択部７７は、予測画像の選択情報を、イントラ予測部７４または動き予測・補償部７６に供給する。 The predicted image selection unit 77 determines an optimal prediction mode from the optimal intra prediction mode and the optimal inter prediction mode based on each cost function value output from the intra prediction unit 74 or the motion prediction / compensation unit 76. Then, the predicted image selection unit 77 selects a predicted image in the determined optimal prediction mode and supplies the selected predicted image to the calculation units 63 and 70. At this time, the predicted image selection unit 77 supplies the prediction image selection information to the intra prediction unit 74 or the motion prediction / compensation unit 76.

レート制御部７８は、蓄積バッファ６７に蓄積された圧縮画像に基づいて、オーバーフローあるいはアンダーフローが発生しないように、量子化部６５の量子化動作のレートを制御する。 The rate control unit 78 controls the rate of the quantization operation of the quantization unit 65 based on the compressed image stored in the storage buffer 67 so that overflow or underflow does not occur.

［H．264/AVC方式の説明］ [Description of the H.264/AVC format]
図３は、H．264/AVC方式における動き予測・補償のブロックサイズの例を示す図である。H．264/AVC方式においては、ブロックサイズを可変にして、動き予測・補償が行われる。 FIG. 3 is a diagram illustrating examples of block sizes for motion prediction/compensation in the H.264/AVC format. In the H.264/AVC format, motion prediction/compensation is performed with a variable block size.

図３の上段には、左から、１６×１６画素、１６×８画素、８×１６画素、および８×８画素のパーティションに分割された１６×１６画素で構成されるマクロブロックが順に示されている。また、図３の下段には、左から、８×８画素、８×４画素、４×８画素、および４×４画素のサブパーティションに分割された８×８画素のパーティションが順に示されている。 The upper part of FIG. 3 shows, in order from the left, macroblocks of 16 × 16 pixels divided into partitions of 16 × 16 pixels, 16 × 8 pixels, 8 × 16 pixels, and 8 × 8 pixels. The lower part of FIG. 3 shows, in order from the left, 8 × 8 pixel partitions divided into sub-partitions of 8 × 8 pixels, 8 × 4 pixels, 4 × 8 pixels, and 4 × 4 pixels.

すなわち、H．264/AVC方式においては、１つのマクロブロックを、１６×１６画素、１６×８画素、８×１６画素、あるいは８×８画素のいずれかのパーティションに分割して、それぞれ独立した動きベクトル情報を持つことが可能である。また、８×８画素のパーティションに関しては、８×８画素、８×４画素、４×８画素、あるいは４×４画素のいずれかのサブパーティションに分割し、それぞれ独立した動きベクトル情報を持つことが可能である。 That is, in the H.264/AVC format, one macroblock can be divided into partitions of 16 × 16 pixels, 16 × 8 pixels, 8 × 16 pixels, or 8 × 8 pixels, and each partition can have its own independent motion vector information. An 8 × 8 pixel partition can in turn be divided into sub-partitions of 8 × 8 pixels, 8 × 4 pixels, 4 × 8 pixels, or 4 × 4 pixels, and each sub-partition can have its own independent motion vector information.
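The partitioning rule above determines how many independent motion vectors one macroblock can carry. The following Python sketch counts them for a given choice of partition and sub-partition sizes. The names `MB_PARTITIONS`, `SUB_PARTITIONS`, and `count_motion_vectors` are illustrative assumptions and do not appear in the patent.

```python
# Illustrative sketch only; all names here are assumptions, not from the patent.
MB_SIZE = 16
MB_PARTITIONS = [(16, 16), (16, 8), (8, 16), (8, 8)]   # (width, height) in pixels
SUB_PARTITIONS = [(8, 8), (8, 4), (4, 8), (4, 4)]      # applies to 8x8 partitions only


def count_motion_vectors(partition, sub_partitions=None):
    """Return the number of independent motion vectors in one macroblock.

    partition: one entry of MB_PARTITIONS.
    sub_partitions: for the 8x8 partition only, four entries of
        SUB_PARTITIONS, one per 8x8 partition (defaults to no further split).
    """
    if partition not in MB_PARTITIONS:
        raise ValueError("unsupported macroblock partition")
    width, height = partition
    if partition != (8, 8):
        # One motion vector per partition.
        return (MB_SIZE // width) * (MB_SIZE // height)
    if sub_partitions is None:
        sub_partitions = [(8, 8)] * 4
    if len(sub_partitions) != 4 or any(s not in SUB_PARTITIONS for s in sub_partitions):
        raise ValueError("expected four valid sub-partition sizes")
    # One motion vector per sub-partition inside each 8x8 partition.
    return sum((8 // sw) * (8 // sh) for sw, sh in sub_partitions)
```

Under these assumptions a macroblock carries between 1 motion vector (a single 16 × 16 partition) and 16 motion vectors (four 8 × 8 partitions, each split into 4 × 4 sub-partitions), which is why the amount of motion vector information can become large.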

図４は、H．264/AVC方式における１／４画素精度の予測・補償処理を説明する図である。H．264/AVC方式においては、６タップのFIR (Finite Impulse Response Filter)フィルタを用いた１／４画素精度の予測・補償処理が行われている。 FIG. 4 is a diagram explaining prediction/compensation processing with 1/4 pixel accuracy in the H.264/AVC format. In the H.264/AVC format, prediction/compensation processing with 1/4 pixel accuracy is performed using a 6-tap FIR (Finite Impulse Response) filter.

図４の例において、位置Ａは、整数精度画素の位置、位置ｂ，ｃ，ｄは、１／２画素精度の位置、位置ｅ１，ｅ２，ｅ３は、１／４画素精度の位置を示している。まず、以下においては、Clip()を次の式（１）のように定義する。 In the example of FIG. 4, position A indicates the position of an integer precision pixel, positions b, c, and d indicate positions with 1/2 pixel precision, and positions e1, e2, and e3 indicate positions with 1/4 pixel precision. First, in the following, Clip() is defined as in the following equation (1).

なお、入力画像が８ビット精度である場合、max_pixの値は255となる。 When the input image has 8-bit precision, the value of max_pix is 255.

位置ｂおよびｄにおける画素値は、６タップのFIRフィルタを用いて、次の式（２）のように生成される。 The pixel values at positions b and d are generated using the 6-tap FIR filter as in the following equation (2).

位置ｃにおける画素値は、水平方向および垂直方向に６タップのFIRフィルタを適用し、次の式（３）のように生成される。 The pixel value at position c is generated by applying the 6-tap FIR filter in both the horizontal and vertical directions, as in the following equation (3).

なお、Clip処理は、水平方向および垂直方向の積和処理の両方を行った後、最後に１度のみ実行される。 Note that the Clip processing is executed only once, at the end, after both the horizontal and vertical product-sum operations have been performed.

位置ｅ１乃至ｅ３は、次の式（４）のように線形内挿により生成される。 The pixel values at positions e1 to e3 are generated by linear interpolation as in the following equation (4).
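Equations (1) to (4) are not reproduced in this text, so the following Python sketch illustrates the interpolation under stated assumptions: the filter taps (1, -5, 20, 20, -5, 1) and the rounding offsets are those of the standard H.264/AVC half-sample filter, and the function names are hypothetical. Which pairs of samples are averaged for e1 to e3 is specified by equation (4); `quarter_pel` shows only the rounding average itself.

```python
# Assumption: standard H.264/AVC half-sample filter taps; the patent's own
# equations (1)-(4) are not reproduced in this text.
TAPS = (1, -5, 20, 20, -5, 1)


def clip(a, max_pix=255):
    """Equation (1) in spirit: limit a to the range [0, max_pix]."""
    return max(0, min(a, max_pix))


def tap_sum(samples):
    """Unrounded, unclipped 6-tap product-sum over six consecutive samples."""
    if len(samples) != 6:
        raise ValueError("the 6-tap filter needs exactly six samples")
    return sum(t * s for t, s in zip(TAPS, samples))


def half_pel_1d(samples, max_pix=255):
    """Positions b and d: one filter pass, rounding, shift by 5 bits, clip."""
    return clip((tap_sum(samples) + 16) >> 5, max_pix)


def half_pel_2d(block6x6, max_pix=255):
    """Position c: horizontal pass on six rows, then vertical pass.

    The intermediate sums stay unclipped; Clip runs once at the end.
    """
    row_sums = [tap_sum(row) for row in block6x6]
    return clip((tap_sum(row_sums) + 512) >> 10, max_pix)


def quarter_pel(p, q):
    """Positions e1 to e3: rounding average of two neighbouring samples."""
    return (p + q + 1) >> 1
```

Because the taps sum to 32, a flat region is reproduced unchanged: six samples of value 100 give a product-sum of 3200, and (3200 + 16) >> 5 is 100. The 2D case divides by 32 × 32 = 1024, hence the 10-bit shift.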

H．264/AVC方式においては、図３および図４を参照して上述した動き予測・補償処理が行われることにより、膨大な動きベクトル情報が生成され、これをこのまま符号化することは、符号化効率の低下を招いてしまう。これに対して、H．264/AVC方式においては、図５に示す方法により、動きベクトルの符号化情報の低減が実現されている。 In the H.264/AVC format, the motion prediction/compensation processing described above with reference to FIGS. 3 and 4 generates a large amount of motion vector information, and encoding this information as it is lowers the encoding efficiency. To address this, the H.264/AVC format reduces the amount of encoded motion vector information by the method shown in FIG. 5.

図５は、H．264/AVC方式による動きベクトル情報の生成方法について説明する図である。 FIG. 5 is a diagram explaining a method of generating motion vector information in the H.264/AVC format.

図５の例において、これから符号化される対象ブロックＥ（例えば、１６×１６画素）と、既に符号化済みであり、対象ブロックＥに隣接するブロックＡ乃至Ｄが示されている。 In the example of FIG. 5, a target block E to be encoded (for example, 16 × 16 pixels) and blocks A to D that have already been encoded and are adjacent to the target block E are shown.

すなわち、ブロックＤは、対象ブロックＥの左上に隣接しており、ブロックＢは、対象ブロックＥの上に隣接しており、ブロックＣは、対象ブロックＥの右上に隣接しており、ブロックＡは、対象ブロックＥの左に隣接している。なお、ブロックＡ乃至Ｄが区切られていないのは、それぞれ、図２で上述した１６×１６画素乃至４×４画素のうちのいずれかの構成のブロックであることを表している。 That is, block D is adjacent to the upper left of the target block E, block B is adjacent above the target block E, block C is adjacent to the upper right of the target block E, and block A is adjacent to the left of the target block E. Blocks A to D are drawn without subdivisions to indicate that each of them is a block of any one of the configurations from 16 × 16 pixels to 4 × 4 pixels described above with reference to FIG. 2.

例えば、Ｘ（＝Ａ，Ｂ，Ｃ，Ｄ，Ｅ）に対する動きベクトル情報を、mvXで表す。まず、対象ブロックＥに対する予測動きベクトル情報pmvEは、ブロックＡ，Ｂ，Ｃに関する動きベクトル情報を用いて、メディアン予測により次の式（５）のように生成される。 For example, motion vector information for X (= A, B, C, D, E) is represented by mvX. First, the predicted motion vector information pmvE for the target block E is generated by the median prediction using the motion vector information regarding the blocks A, B, and C as shown in the following equation (5).

pmvE = med(mvA,mvB,mvC) ・・・（５）

ブロックＣに関する動きベクトル情報が、画枠の端であったり、あるいは、まだ符号化されていないなどの理由により、利用可能でない（unavailableである）場合がある。この場合には、ブロックＣに関する動きベクトル情報は、ブロックＤに関する動きベクトル情報で代用される。 The motion vector information for block C may be unavailable, for example because the block lies at the edge of the image frame or has not yet been encoded. In that case, the motion vector information for block D is used in place of the motion vector information for block C.

対象ブロックＥに対する動きベクトル情報として、圧縮画像のヘッダ部に付加されるデータmvdEは、pmvEを用いて、次の式（６）のように生成される。 The data mvdE, which is added to the header portion of the compressed image as the motion vector information for the target block E, is generated using pmvE as in the following equation (6).

mvdE = mvE - pmvE ・・・（６）

なお、実際には、動きベクトル情報の水平方向、垂直方向のそれぞれの成分に対して、独立に処理が行われる。 Actually, processing is performed independently for each of the horizontal and vertical components of the motion vector information.

このように、予測動きベクトル情報を生成し、隣接するブロックとの相関で生成された予測動きベクトル情報と動きベクトル情報との差分であるデータmvdEを、動きベクトル情報として、圧縮画像のヘッダ部に付加することにより、動きベクトル情報を低減することができる。 In this way, predicted motion vector information is generated from the correlation with adjacent blocks, and the data mvdE, which is the difference between the actual motion vector information and this predicted motion vector information, is added to the header portion of the compressed image as the motion vector information. This reduces the amount of motion vector information.
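Equations (5) and (6), together with the substitution rule for block C and the per-component processing, can be sketched in Python as follows. The function names and the representation of a motion vector as an `(x, y)` tuple are illustrative assumptions. The sketch covers only the cases described in the text, not every availability rule of the H.264/AVC standard.

```python
def median3(a, b, c):
    """Median of three scalar values."""
    return sorted((a, b, c))[1]


def predict_mv(mv_a, mv_b, mv_c, mv_d=None):
    """Equation (5): pmvE = med(mvA, mvB, mvC), applied per component.

    Motion vectors are (x, y) tuples. If mv_c is None (block C is
    unavailable), the motion vector of block D is used in its place.
    """
    if mv_c is None:
        mv_c = mv_d
    if mv_c is None:
        raise ValueError("neither block C nor block D is available")
    # Horizontal and vertical components are processed independently.
    return tuple(median3(a, b, c) for a, b, c in zip(mv_a, mv_b, mv_c))


def mv_difference(mv_e, pmv_e):
    """Equation (6): mvdE = mvE - pmvE, applied per component."""
    return tuple(e - p for e, p in zip(mv_e, pmv_e))
```

For example, with mvA = (4, -2), mvB = (6, 0), and mvC = (5, 8), the prediction is pmvE = (5, 0). If the actual vector is mvE = (7, 1), only the small difference mvdE = (2, 1) has to be written to the header.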

ここで、図４を参照して上述したH．264/AVC方式における１／４画素精度の予測・補償処理は、動き予測・補償部において実行されるが、図２の画像符号化装置５１においては、１／４画素精度の予測が、イントラ予測においても行われる。この小数画素精度のイントラ予測は、次に説明するイントラ予測部７４および隣接画素内挿部７５により実行される。 Here, the prediction/compensation processing with 1/4 pixel accuracy in the H.264/AVC format described above with reference to FIG. 4 is executed by the motion prediction/compensation unit, but in the image encoding device 51 of FIG. 2, prediction with 1/4 pixel accuracy is also performed in intra prediction. This intra prediction with fractional pixel accuracy is executed by the intra prediction unit 74 and the adjacent pixel interpolation unit 75 described next.

［イントラ予測部および隣接画素内挿部の構成例］ [Configuration Example of Intra Prediction Unit and Adjacent Pixel Interpolation Unit]
図６は、イントラ予測部および隣接画素内挿部の詳細な構成例を示すブロック図である。 FIG. 6 is a block diagram illustrating a detailed configuration example of the intra prediction unit and the adjacent pixel interpolation unit.

図６の例の場合、イントラ予測部７４は、隣接画像バッファ８１、最適モード決定部８２、最適シフト量決定部８３、および予測画像生成部８４により構成されている。 In the case of the example of FIG. 6, the intra prediction unit 74 includes an adjacent image buffer 81, an optimal mode determination unit 82, an optimal shift amount determination unit 83, and a predicted image generation unit 84.

隣接画素内挿部７５は、モード判別部９１、水平方向内挿部９２、および垂直方向内挿部９３により構成されている。 The adjacent pixel interpolation unit 75 includes a mode determination unit 91, a horizontal direction interpolation unit 92, and a vertical direction interpolation unit 93.

隣接画像バッファ８１は、フレームメモリ７２からのイントラ予測の対象ブロックの隣接画素を蓄積する。図６の場合、スイッチ７３の図示は省略されているが、隣接画素は、フレームメモリ７２から、スイッチ７３を介して隣接画像バッファ８１に供給される。 The adjacent image buffer 81 accumulates adjacent pixels of the target block for intra prediction from the frame memory 72. In the case of FIG. 6, the illustration of the switch 73 is omitted, but the adjacent pixels are supplied from the frame memory 72 to the adjacent image buffer 81 via the switch 73.

最適モード決定部８２には、画面並べ替えバッファ６２から読み出されたイントラ予測する画像が入力される。最適モード決定部８２は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ８１から読み出す。 The image for intra prediction read from the screen rearrangement buffer 62 is input to the optimum mode determination unit 82. The optimum mode determination unit 82 reads out adjacent pixels corresponding to the target block to be intra predicted from the adjacent image buffer 81.

最適モード決定部８２は、イントラ予測する対象ブロックの画像と対応する隣接画素を用いて、候補となる全てのイントラ予測モードのイントラ予測処理を行い、予測画像を生成する。最適モード決定部８２は、予測画像を生成したイントラ予測モードに対してコスト関数値を算出し、算出したコスト関数値が最小値を与えるイントラ予測モードを、最適イントラ予測モードとして決定する。決定された予測モードの情報は、モード判別部９１、最適シフト量決定部８３、および予測画像生成部８４に供給される。また、予測画像生成部８４には、供給される予測モードに対応するコスト関数値も供給される。 The optimal mode determination unit 82 performs intra prediction processing of all candidate intra prediction modes using adjacent pixels corresponding to the image of the target block to be intra predicted, and generates a prediction image. The optimal mode determination unit 82 calculates a cost function value for the intra prediction mode in which the predicted image is generated, and determines an intra prediction mode in which the calculated cost function value gives the minimum value as the optimal intra prediction mode. Information on the determined prediction mode is supplied to the mode determination unit 91, the optimum shift amount determination unit 83, and the predicted image generation unit 84. The predicted image generation unit 84 is also supplied with a cost function value corresponding to the supplied prediction mode.

最適シフト量決定部８３には、画面並べ替えバッファ６２から読み出されたイントラ予測する画像と、最適モード決定部８２により最適と決定された予測モードの情報が入力される。また、最適シフト量決定部８３には、最適イントラ予測モードに応じて、水平方向内挿部９２および垂直方向内挿部９３により線形内挿され、位相がシフトされた隣接画素が入力される。最適シフト量決定部８３は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ８１から読み出す。 The optimal shift amount determination unit 83 receives the image for intra prediction read from the screen rearrangement buffer 62 and information on the prediction mode determined to be optimal by the optimal mode determination unit 82. In addition, the optimum shift amount determination unit 83 receives adjacent pixels whose phases are linearly interpolated by the horizontal direction interpolation unit 92 and the vertical direction interpolation unit 93 in accordance with the optimum intra prediction mode. The optimum shift amount determination unit 83 reads adjacent pixels corresponding to the target block to be intra-predicted from the adjacent image buffer 81.

最適シフト量決定部８３は、最適モード決定部８２により決定された予測モードについて、イントラ予測する対象ブロックの画像、対応する隣接画素、および対応する内挿された隣接画素の画素値を用いて、最適なシフト量を決定する。最適シフト量決定部８３は、例えば、予測誤差（残差）などを算出して、算出した予測誤差の小さいものを最適なシフト量として決定する。最適シフト量決定部８３により決定された最適シフト量の情報は、予測画像生成部８４に供給される。 For the prediction mode determined by the optimal mode determination unit 82, the optimal shift amount determination unit 83 determines the optimal shift amount using the image of the target block to be intra predicted, the corresponding adjacent pixels, and the pixel values of the corresponding interpolated adjacent pixels. For example, the optimal shift amount determination unit 83 calculates a prediction error (residual) for each candidate and selects the shift amount that gives the smallest prediction error as the optimal shift amount. Information on the optimal shift amount determined by the optimal shift amount determination unit 83 is supplied to the predicted image generation unit 84.
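The selection performed by the optimal shift amount determination unit 83 can be sketched as a search over candidate shifts. The Python below is a simplified illustration, not the patent's implementation: it handles only vertical prediction (upper adjacent pixels shifted horizontally), measures the prediction error as a sum of absolute differences, and generates fractional samples by plain linear interpolation as a stand-in for the 6-tap FIR filter plus linear interpolation described in the text. Shifts are expressed in quarter-pixel units, so -3 to +3 corresponds to -0.75 to +0.75 pixels. All names are hypothetical.

```python
def shifted_sample(row, i, q):
    """Sample of row at position i + q/4 (q in quarter-pixel units).

    Assumption: plain linear interpolation with edge clamping, used here
    as a simple stand-in for the 6-tap FIR filter plus linear interpolation.
    """
    pos4 = 4 * i + q                 # position in quarter-pixel units
    left = pos4 // 4                 # integer sample at or left of the position
    frac = pos4 - 4 * left           # 0..3 quarter steps towards the right sample
    last = len(row) - 1
    a = row[min(max(left, 0), last)]
    b = row[min(max(left + 1, 0), last)]
    return ((4 - frac) * a + frac * b + 2) >> 2


def predict_vertical(top, width, height, q, offset=0):
    """Vertical prediction: every row repeats the shifted upper neighbours."""
    line = [shifted_sample(top, offset + x, q) for x in range(width)]
    return [line[:] for _ in range(height)]


def sad(block, pred):
    """Sum of absolute differences between a block and its prediction."""
    return sum(abs(b - p) for rb, rp in zip(block, pred) for b, p in zip(rb, rp))


def best_shift(block, top, offset=0, candidates=range(-3, 4)):
    """Return (shift, error) with the smallest error; ties favour small |shift|."""
    height, width = len(block), len(block[0])
    scored = [
        (sad(block, predict_vertical(top, width, height, q, offset)), abs(q), q)
        for q in candidates
    ]
    error, _, q = min(scored)
    return q, error
```

With `top = [10, 20, 30, 40, 50, 60]`, `offset=1`, and a 4 × 4 block whose rows are all `[25, 35, 45, 55]`, the unshifted prediction `[20, 30, 40, 50]` misses every pixel by 5, whereas a shift of +2 (half a pixel) reproduces the block exactly.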

予測画像生成部８４には、最適モード決定部８２により決定された予測モードの情報と対応するコスト関数値、および最適シフト量決定部８３により決定された最適シフト量の情報が入力される。予測画像生成部８４は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ８１から読み出し、予測モードに応じた位相方向に、読みだした隣接画素を最適シフト量で、隣接画素の位相をシフトする。 The predicted image generation unit 84 receives the information on the prediction mode determined by the optimal mode determination unit 82 together with the corresponding cost function value, and the information on the optimal shift amount determined by the optimal shift amount determination unit 83. The predicted image generation unit 84 reads the adjacent pixels corresponding to the target block to be intra predicted from the adjacent image buffer 81, and shifts the phase of the read adjacent pixels by the optimal shift amount in the phase direction corresponding to the prediction mode.

予測画像生成部８４は、位相がシフトされた隣接画素を用いて、最適モード決定部８２により決定された最適イントラ予測モードでイントラ予測を行い、対象ブロックの予測画像を生成する。予測画像生成部８４は、生成された予測画像と、対応するコスト関数値を予測画像選択部７７に出力する。 The predicted image generation unit 84 performs intra prediction in the optimal intra prediction mode determined by the optimal mode determination unit 82 using adjacent pixels whose phases are shifted, and generates a predicted image of the target block. The predicted image generation unit 84 outputs the generated predicted image and the corresponding cost function value to the predicted image selection unit 77.

また、予測画像生成部８４は、予測画像選択部７７により最適イントラ予測モードで生成された予測画像が選択された場合、最適イントラ予測モードを示す情報とシフト量の情報を、可逆符号化部６６に供給する。 When the predicted image generated in the optimal intra prediction mode is selected by the predicted image selection unit 77, the predicted image generation unit 84 also supplies information indicating the optimal intra prediction mode and information on the shift amount to the lossless encoding unit 66.

モード判別部９１は、最適モード決定部８２により決定された予測モードに応じた制御信号を、水平方向内挿部９２および垂直方向内挿部９３に出力する。例えば、内挿処理のＯＮを示す制御信号が予測モードに応じて出力される。 The mode determination unit 91 outputs a control signal corresponding to the prediction mode determined by the optimal mode determination unit 82 to the horizontal direction interpolation unit 92 and the vertical direction interpolation unit 93. For example, a control signal indicating ON of interpolation processing is output according to the prediction mode.

水平方向内挿部９２および垂直方向内挿部９３は、モード判別部９１からの制御信号に応じて、隣接画像バッファ８１より隣接画素をそれぞれ読み出す。水平方向内挿部９２および垂直方向内挿部９３は、読み出した隣接画素に対して、６タップのFIRフィルタおよび線形内挿により、水平方向および垂直方向に、位相をそれぞれシフトする。水平方向内挿部９２および垂直方向内挿部９３により内挿された隣接画素の情報は、最適シフト量決定部８３に供給される。 The horizontal direction interpolation unit 92 and the vertical direction interpolation unit 93 read adjacent pixels from the adjacent image buffer 81 in response to a control signal from the mode determination unit 91. The horizontal direction interpolation unit 92 and the vertical direction interpolation unit 93 shift the phase in the horizontal direction and the vertical direction, respectively, by a 6-tap FIR filter and linear interpolation with respect to the read adjacent pixels. Information on adjacent pixels interpolated by the horizontal direction interpolation unit 92 and the vertical direction interpolation unit 93 is supplied to the optimum shift amount determination unit 83.

［画像符号化装置の符号化処理の説明］ [Description of Encoding Process of Image Encoding Device]
次に、図７のフローチャートを参照して、図２の画像符号化装置５１の符号化処理について説明する。 Next, the encoding process of the image encoding device 51 of FIG. 2 will be described with reference to the flowchart of FIG. 7.

ステップＳ１１において、Ａ／Ｄ変換部６１は入力された画像をＡ／Ｄ変換する。ステップＳ１２において、画面並べ替えバッファ６２は、Ａ／Ｄ変換部６１より供給された画像を記憶し、各ピクチャの表示する順番から符号化する順番への並べ替えを行う。 In step S11, the A / D converter 61 performs A / D conversion on the input image. In step S12, the screen rearrangement buffer 62 stores the image supplied from the A / D conversion unit 61, and rearranges the picture from the display order to the encoding order.

ステップＳ１３において、演算部６３は、ステップＳ１２で並び替えられた画像と予測画像との差分を演算する。予測画像は、インター予測する場合は動き予測・補償部７６から、イントラ予測する場合はイントラ予測部７４から、それぞれ予測画像選択部７７を介して演算部６３に供給される。 In step S13, the calculation unit 63 calculates the difference between the image rearranged in step S12 and the predicted image. The prediction image is supplied from the motion prediction / compensation unit 76 in the case of inter prediction, and from the intra prediction unit 74 in the case of intra prediction, to the calculation unit 63 via the prediction image selection unit 77, respectively.

差分データは元の画像データに較べてデータ量が小さくなっている。したがって、画像をそのまま符号化する場合に較べて、データ量を圧縮することができる。 The difference data has a smaller data amount than the original image data. Therefore, the data amount can be compressed as compared with the case where the image is encoded as it is.

ステップＳ１４において、直交変換部６４は演算部６３から供給された差分情報を直交変換する。具体的には、離散コサイン変換、カルーネン・レーベ変換等の直交変換が行われ、変換係数が出力される。ステップＳ１５において、量子化部６５は変換係数を量子化する。この量子化に際しては、後述するステップＳ２５の処理で説明されるように、レートが制御される。 In step S 14, the orthogonal transform unit 64 performs orthogonal transform on the difference information supplied from the calculation unit 63. Specifically, orthogonal transformation such as discrete cosine transformation and Karhunen-Loeve transformation is performed, and transformation coefficients are output. In step S15, the quantization unit 65 quantizes the transform coefficient. At the time of this quantization, the rate is controlled as described in the process of step S25 described later.

以上のようにして量子化された差分情報は、次のようにして局部的に復号される。すなわち、ステップＳ１６において、逆量子化部６８は量子化部６５により量子化された変換係数を量子化部６５の特性に対応する特性で逆量子化する。ステップＳ１７において、逆直交変換部６９は逆量子化部６８により逆量子化された変換係数を直交変換部６４の特性に対応する特性で逆直交変換する。 The difference information quantized as described above is locally decoded as follows. That is, in step S 16, the inverse quantization unit 68 inversely quantizes the transform coefficient quantized by the quantization unit 65 with characteristics corresponding to the characteristics of the quantization unit 65. In step S 17, the inverse orthogonal transform unit 69 performs inverse orthogonal transform on the transform coefficient inversely quantized by the inverse quantization unit 68 with characteristics corresponding to the characteristics of the orthogonal transform unit 64.

ステップＳ１８において、演算部７０は、予測画像選択部７７を介して入力される予測画像を局部的に復号された差分情報に加算し、局部的に復号された画像（演算部６３への入力に対応する画像）を生成する。ステップＳ１９においてデブロックフィルタ７１は、演算部７０より出力された画像をフィルタリングする。これによりブロック歪みが除去される。ステップＳ２０においてフレームメモリ７２は、フィルタリングされた画像を記憶する。なお、フレームメモリ７２にはデブロックフィルタ７１によりフィルタ処理されていない画像も演算部７０から供給され、記憶される。 In step S18, the calculation unit 70 adds the predicted image input via the predicted image selection unit 77 to the locally decoded difference information, and outputs the locally decoded image (for input to the calculation unit 63). Corresponding image). In step S 19, the deblock filter 71 filters the image output from the calculation unit 70. Thereby, block distortion is removed. In step S20, the frame memory 72 stores the filtered image. Note that an image that has not been filtered by the deblocking filter 71 is also supplied to the frame memory 72 from the computing unit 70 and stored therein.
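The local decoding in steps S16 to S18 exists so that the encoder predicts from the same reconstructed pixels that the decoder will have. The following Python sketch shows that loop in miniature under strong simplifying assumptions: the orthogonal transform (steps S14 and S17) and the deblocking filter are omitted, quantization is a plain uniform quantizer with step `qstep`, and all function names are hypothetical.

```python
def quantize(value, qstep):
    """Uniform quantization with rounding to the nearest level (cf. step S15)."""
    sign = -1 if value < 0 else 1
    return sign * ((abs(value) + qstep // 2) // qstep)


def encode_block(block, pred, qstep):
    """Steps S13 and S15 in miniature: residual, then quantization.

    Assumption: the orthogonal transform of step S14 is omitted.
    """
    return [[quantize(b - p, qstep) for b, p in zip(rb, rp)]
            for rb, rp in zip(block, pred)]


def reconstruct_block(levels, pred, qstep, max_pix=255):
    """Steps S16 and S18 in miniature: inverse quantization, add prediction."""
    return [[max(0, min(p + lv * qstep, max_pix)) for lv, p in zip(rl, rp)]
            for rl, rp in zip(levels, pred)]
```

For a block `[[100, 104], [98, 90]]`, a flat prediction of 100, and `qstep = 4`, the levels are `[[0, 1], [-1, -3]]` and the reconstruction is `[[100, 104], [96, 88]]`. The reconstruction differs from the input by at most half a quantization step, and it is this reconstructed image, not the original, that is stored for later prediction.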

ステップＳ２１において、イントラ予測部７４および動き予測・補償部７６は、それぞれ画像の予測処理を行う。すなわち、ステップＳ２１において、イントラ予測部７４は、イントラ予測モードのイントラ予測処理を行う。動き予測・補償部７６は、インター予測モードの動き予測・補償処理を行う。 In step S21, the intra prediction unit 74 and the motion prediction / compensation unit 76 each perform image prediction processing. That is, in step S21, the intra prediction unit 74 performs an intra prediction process in the intra prediction mode. The motion prediction / compensation unit 76 performs motion prediction / compensation processing in the inter prediction mode.

ステップＳ２１における予測処理の詳細は、図８を参照して後述するが、この処理により、候補となる全ての予測モードでの予測処理がそれぞれ行われ、候補となる全ての予測モードでのコスト関数値がそれぞれ算出される。そして、算出されたコスト関数値に基づいて、最適イントラ予測モードが選択され、最適イントラ予測モードのイントラ予測により生成された予測画像とそのコスト関数値が予測画像選択部７７に供給される。 The details of the prediction process in step S21 will be described later with reference to FIG. 8. With this process, prediction processes in all candidate prediction modes are performed, and cost functions in all candidate prediction modes are obtained. Each value is calculated. Then, based on the calculated cost function value, the optimal intra prediction mode is selected, and the predicted image generated by the intra prediction in the optimal intra prediction mode and its cost function value are supplied to the predicted image selection unit 77.

具体的には、このとき、イントラ予測部７４は、６タップのFIRフィルタおよび線形内挿により、最適イントラ予測モードに応じたシフト方向に、最適シフト量で、位相がシフトされた隣接画素を用いてイントラ予測で生成した予測画像を予測画像選択部７７に供給する。なお、予測画像と一緒に、最適イントラ予測モードについてのコスト関数値も、予測画像選択部７７に供給される。 Specifically, at this time, the intra prediction unit 74 uses adjacent pixels whose phase is shifted by the optimal shift amount in the shift direction according to the optimal intra prediction mode by the 6-tap FIR filter and linear interpolation. Then, the predicted image generated by the intra prediction is supplied to the predicted image selection unit 77. A cost function value for the optimal intra prediction mode is also supplied to the predicted image selection unit 77 together with the predicted image.

一方、算出されたコスト関数値に基づいて、インター予測モードの中から、最適インター予測モードが決定され、最適インター予測モードで生成された予測画像とそのコスト関数値が、予測画像選択部７７に供給される。 On the other hand, the optimal inter prediction mode is determined from the inter prediction modes based on the calculated cost function value, and the predicted image generated in the optimal inter prediction mode and its cost function value are sent to the predicted image selection unit 77. Supplied.

ステップＳ２２において、予測画像選択部７７は、イントラ予測部７４および動き予測・補償部７６より出力された各コスト関数値に基づいて、最適イントラ予測モードと最適インター予測モードのうちの一方を、最適予測モードに決定する。そして、予測画像選択部７７は、決定した最適予測モードの予測画像を選択し、演算部６３，７０に供給する。この予測画像が、上述したように、ステップＳ１３，Ｓ１８の演算に利用される。 In step S 22, the predicted image selection unit 77 optimizes one of the optimal intra prediction mode and the optimal inter prediction mode based on the cost function values output from the intra prediction unit 74 and the motion prediction / compensation unit 76. Determine the prediction mode. Then, the predicted image selection unit 77 selects the predicted image of the determined optimal prediction mode and supplies it to the calculation units 63 and 70. As described above, this predicted image is used for the calculations in steps S13 and S18.

なお、この予測画像の選択情報は、イントラ予測部７４または動き予測・補償部７６に供給される。最適イントラ予測モードの予測画像が選択された場合、イントラ予測部７４は、最適イントラ予測モードを示す情報（すなわち、イントラ予測モード情報）と最適と決定されたシフト量の情報を、可逆符号化部６６に供給する。 The selection information of this predicted image is supplied to the intra prediction unit 74 or the motion prediction/compensation unit 76. When the predicted image of the optimal intra prediction mode is selected, the intra prediction unit 74 supplies information indicating the optimal intra prediction mode (that is, intra prediction mode information) and information on the shift amount determined to be optimal to the lossless encoding unit 66.

最適インター予測モードの予測画像が選択された場合、動き予測・補償部７６は、最適インター予測モードを示す情報と、必要に応じて、最適インター予測モードに応じた情報を可逆符号化部６６に出力する。最適インター予測モードに応じた情報としては、動きベクトル情報やフラグ情報、参照フレーム情報などがあげられる。すなわち、最適インター予測モードとして、インター予測モードによる予測画像が選択されているときには、動き予測・補償部７６は、インター予測モード情報、動きベクトル情報、参照フレーム情報を可逆符号化部６６に出力する。 When the prediction image of the optimal inter prediction mode is selected, the motion prediction / compensation unit 76 sends information indicating the optimal inter prediction mode and, if necessary, information corresponding to the optimal inter prediction mode to the lossless encoding unit 66. Output. Information according to the optimal inter prediction mode includes motion vector information, flag information, reference frame information, and the like. That is, when a prediction image in the inter prediction mode is selected as the optimal inter prediction mode, the motion prediction / compensation unit 76 outputs the inter prediction mode information, motion vector information, and reference frame information to the lossless encoding unit 66. .
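The decision in step S22 and the side information that follows from it can be sketched as below. The `Candidate` class, its field names, and the keys used in `header_info` are illustrative assumptions. The text states only that the mode with the smaller cost function value is chosen and that the corresponding unit then passes its mode information (plus the shift amount for intra, or motion vector and reference frame information for inter) to the lossless encoding unit 66.

```python
from dataclasses import dataclass, field


@dataclass
class Candidate:
    """Best result of one prediction unit (all names are hypothetical)."""
    kind: str                       # "intra" or "inter"
    cost: float                     # cost function value of the optimal mode
    predicted_image: list
    header_info: dict = field(default_factory=dict)


def select_prediction(intra, inter):
    """Step S22 in miniature: pick the candidate with the smaller cost.

    Either argument may be None when that kind of prediction was not run.
    On equal cost the intra candidate is kept (an arbitrary choice here).
    """
    candidates = [c for c in (intra, inter) if c is not None]
    if not candidates:
        raise ValueError("no prediction candidate available")
    return min(candidates, key=lambda c: c.cost)
```

A caller would then forward `chosen.predicted_image` to the residual and reconstruction steps and `chosen.header_info` to the entropy coder, mirroring how only the selected unit outputs its information.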

ステップＳ２３において、可逆符号化部６６は量子化部６５より出力された量子化された変換係数を符号化する。すなわち、差分画像が可変長符号化、算術符号化等の可逆符号化され、圧縮される。このとき、上述したステップＳ２２において可逆符号化部６６に入力された、イントラ予測部７４からのイントラ予測モード情報、または、動き予測・補償部７６からの最適インター予測モードに応じた情報なども符号化され、ヘッダ情報に付加される。 In step S23, the lossless encoding unit 66 encodes the quantized transform coefficient output from the quantization unit 65. That is, the difference image is subjected to lossless encoding such as variable length encoding and arithmetic encoding, and is compressed. At this time, the intra prediction mode information from the intra prediction unit 74 or the information corresponding to the optimal inter prediction mode from the motion prediction / compensation unit 76, which is input to the lossless encoding unit 66 in step S22 described above, is also encoded. And added to the header information.

ステップＳ２４において蓄積バッファ６７は差分画像を圧縮画像として蓄積する。蓄積バッファ６７に蓄積された圧縮画像が適宜読み出され、伝送路を介して復号側に伝送される。 In step S24, the accumulation buffer 67 accumulates the difference image as a compressed image. The compressed image stored in the storage buffer 67 is appropriately read and transmitted to the decoding side via the transmission path.

ステップＳ２５においてレート制御部７８は、蓄積バッファ６７に蓄積された圧縮画像に基づいて、オーバーフローあるいはアンダーフローが発生しないように、量子化部６５の量子化動作のレートを制御する。 In step S 25, the rate control unit 78 controls the quantization operation rate of the quantization unit 65 based on the compressed image stored in the storage buffer 67 so that overflow or underflow does not occur.

［予測処理の説明］ [Description of Prediction Process]
次に、図８のフローチャートを参照して、図７のステップＳ２１における予測処理を説明する。 Next, the prediction process in step S21 of FIG. 7 will be described with reference to the flowchart of FIG. 8.

画面並べ替えバッファ６２から供給される処理対象の画像がイントラ処理されるブロックの画像である場合、参照される復号済みの画像がフレームメモリ７２から読み出され、スイッチ７３を介してイントラ予測部７４に供給される。 When the processing target image supplied from the screen rearrangement buffer 62 is an image of a block to be intra-processed, the decoded image to be referred to is read from the frame memory 72 and supplied to the intra prediction unit 74 via the switch 73.

ステップＳ３１において、イントラ予測部７４は、供給された画像を用いて、処理対象のブロックの画素を、候補となる全てのイントラ予測モードでイントラ予測する。なお、参照される復号済みの画素としては、デブロックフィルタ７１によりデブロックフィルタリングされていない画素が用いられる。 In step S31, the intra prediction unit 74 performs intra prediction on the pixels of the block to be processed in all candidate intra prediction modes, using the supplied image. Note that pixels that have not been deblocked filtered by the deblocking filter 71 are used as decoded pixels that are referred to.

ステップＳ３１におけるイントラ予測処理の詳細は、図２０を参照して後述するが、この処理により、候補となる全てのイントラ予測モードでイントラ予測が行われる。そして、候補となる全てのイントラ予測モードに対してコスト関数値が算出され、算出されたコスト関数値に基づいて、最適イントラ予測モードが決定される。 The details of the intra prediction process in step S31 will be described later with reference to FIG. 20. With this process, intra prediction is performed in all candidate intra prediction modes. Then, cost function values are calculated for all candidate intra prediction modes, and the optimal intra prediction mode is determined based on the calculated cost function values.

そして、６タップのFIRフィルタおよび線形内挿により、決定された最適イントラ予測モードに応じたシフト方向に最適シフト量で、隣接画素の位相がシフトされる。この位相がシフトされた隣接画素が用いられて、最適イントラ予測モードでのイントラ予測により予測画像が生成される。生成された予測画像と最適イントラ予測モードのコスト関数値は、予測画像選択部７７に供給される。 Then, by the 6-tap FIR filter and linear interpolation, the phase of the adjacent pixel is shifted by the optimal shift amount in the shift direction corresponding to the determined optimal intra prediction mode. Adjacent pixels whose phases are shifted are used, and a prediction image is generated by intra prediction in the optimal intra prediction mode. The generated predicted image and the cost function value of the optimal intra prediction mode are supplied to the predicted image selection unit 77.

画面並べ替えバッファ６２から供給される処理対象の画像がインター処理される画像である場合、参照される画像がフレームメモリ７２から読み出され、スイッチ７３を介して動き予測・補償部７６に供給される。これらの画像に基づいて、ステップＳ３２において、動き予測・補償部７６はインター動き予測処理を行う。すなわち、動き予測・補償部７６は、フレームメモリ７２から供給される画像を参照して、候補となる全てのインター予測モードの動き予測処理を行う。 When the processing target image supplied from the screen rearrangement buffer 62 is an image to be inter-processed, the image to be referred to is read from the frame memory 72 and supplied to the motion prediction/compensation unit 76 via the switch 73. Based on these images, the motion prediction/compensation unit 76 performs inter motion prediction processing in step S32. That is, the motion prediction/compensation unit 76 refers to the image supplied from the frame memory 72 and performs motion prediction processing for all candidate inter prediction modes.

ステップＳ３２におけるインター動き予測処理の詳細は、図２２を参照して後述するが、この処理により、候補となる全てのインター予測モードで動き予測処理が行われ、候補となる全てのインター予測モードに対してコスト関数値が算出される。 Details of the inter motion prediction process in step S32 will be described later with reference to FIG. 22. With this process, motion prediction processing is performed in all candidate inter prediction modes, and a cost function value is calculated for each of them.

ステップＳ３３において、動き予測・補償部７６は、ステップＳ３２において算出されたインター予測モードに対してのコスト関数値を比較し、最小値を与える予測モードを、最適インター予測モードとして決定する。そして、動き予測・補償部７６は、最適インター予測モードで生成された予測画像とそのコスト関数値を、予測画像選択部７７に供給する。 In step S33, the motion prediction / compensation unit 76 compares the cost function value for the inter prediction mode calculated in step S32, and determines the prediction mode that gives the minimum value as the optimal inter prediction mode. Then, the motion prediction / compensation unit 76 supplies the predicted image generated in the optimal inter prediction mode and its cost function value to the predicted image selection unit 77.

［Ｈ．２６４／ＡＶＣ方式におけるイントラ予測処理の説明］ [Description of Intra Prediction Processing in the H.264/AVC Format]
次に、Ｈ．２６４／ＡＶＣ方式で定められているイントラ予測の各モードについて説明する。 Next, each mode of intra prediction defined in the H.264/AVC format will be described.

まず、輝度信号に対するイントラ予測モードについて説明する。輝度信号のイントラ予測モードには、イントラ４×４予測モード、イントラ８×８予測モード、およびイントラ１６×１６予測モードの３通りの方式が定められている。これは、ブロック単位を定めるモードであり、マクロブロック毎に設定される。また、色差信号に対しては、マクロブロック毎に輝度信号とは独立したイントラ予測モードを設定することが可能である。 First, the intra prediction mode for the luminance signal will be described. In the luminance signal intra prediction mode, three methods are defined: an intra 4 × 4 prediction mode, an intra 8 × 8 prediction mode, and an intra 16 × 16 prediction mode. This is a mode for determining a block unit, and is set for each macroblock. For color difference signals, an intra prediction mode independent of the luminance signal can be set for each macroblock.

さらに、イントラ４×４予測モードの場合、４×４画素の対象ブロック毎に、９種類の予測モードから１つの予測モードを設定することができる。イントラ８×８予測モードの場合、８×８画素の対象ブロック毎に、９種類の予測モードから１つの予測モードを設定することができる。また、イントラ１６×１６予測モードの場合、１６×１６画素の対象マクロブロックに対して、４種類の予測モードから１つの予測モードを設定することができる。 Further, in the case of the intra 4 × 4 prediction mode, one prediction mode can be set from nine types of prediction modes for each target block of 4 × 4 pixels. In the case of the intra 8 × 8 prediction mode, one prediction mode can be set from nine types of prediction modes for each target block of 8 × 8 pixels. In the case of the intra 16 × 16 prediction mode, one prediction mode can be set from four types of prediction modes for a target macroblock of 16 × 16 pixels.

なお、以下、イントラ４×４予測モード、イントラ８×８予測モード、およびイントラ１６×１６予測モードは、それぞれ、４×４画素のイントラ予測モード、８×８画素のイントラ予測モード、および１６×１６画素のイントラ予測モードとも適宜称する。 Note that, hereinafter, the intra 4 × 4 prediction mode, the intra 8 × 8 prediction mode, and the intra 16 × 16 prediction mode are also referred to, as appropriate, as the 4 × 4 pixel intra prediction mode, the 8 × 8 pixel intra prediction mode, and the 16 × 16 pixel intra prediction mode, respectively.

図９の例において、各ブロックに付されている数字−１乃至２５は、その各ブロックのビットストリーム順（復号側における処理順）を表している。なお、輝度信号については、マクロブロックが４×４画素に分割されて、４×４画素のＤＣＴが行われる。そして、イントラ１６×１６予測モードの場合のみ、−１のブロックに示されるように、各ブロックの直流成分を集めて、４×４行列が生成され、これに対して、さらに、直交変換が施される。 In the example of FIG. 9, the numbers −1 to 25 attached to the blocks indicate the bitstream order (the processing order on the decoding side) of each block. For the luminance signal, the macroblock is divided into 4 × 4 pixel blocks, and a 4 × 4 pixel DCT is performed. Only in the case of the intra 16 × 16 prediction mode, as shown in the block −1, the DC components of the blocks are collected to generate a 4 × 4 matrix, which is further subjected to an orthogonal transform.

一方、色差信号については、マクロブロックが４×４画素に分割され、４×４画素のＤＣＴが行われた後に、１６および１７の各ブロックに示されるように、各ブロックの直流成分を集めて、２×２行列が生成され、これに対して、さらに、直交変換が施される。 On the other hand, for the color difference signal, after the macroblock is divided into 4 × 4 pixels and the DCT of 4 × 4 pixels is performed, the DC components of each block are collected as shown in the blocks 16 and 17. A 2 × 2 matrix is generated, and is further subjected to orthogonal transformation.

なお、このことは、イントラ８×８予測モードに関しては、ハイプロファイルまたはそれ以上のプロファイルで、対象マクロブロックに、８×８直交変換が施される場合についてのみ適用可能である。 Note that this can be applied to the intra 8 × 8 prediction mode only when the target macroblock is subjected to 8 × 8 orthogonal transformation with a high profile or higher profile.

図１０および図１１は、９種類の輝度信号の４×４画素のイントラ予測モード(Intra_4x4_pred_mode)を示す図である。平均値（ＤＣ）予測を示すモード２以外の８種類の各モードは、それぞれ、上述した図１の番号０，１，３乃至８で示される方向に対応している。 FIG. 10 and FIG. 11 are diagrams illustrating the nine types of 4 × 4 pixel intra prediction modes (Intra_4x4_pred_mode) for the luminance signal. Each of the eight modes other than mode 2, which indicates average value (DC) prediction, corresponds to one of the directions indicated by the numbers 0, 1, and 3 to 8 in FIG. 1 described above.

９種類のIntra_4x4_pred_modeについて、図１２を参照して説明する。図１２の例において、画素ａ乃至ｐは、イントラ処理される対象ブロックの画素を表し、画素値Ａ乃至Ｍは、隣接ブロックに属する画素の画素値を表している。すなわち、画素ａ乃至ｐは、画面並べ替えバッファ６２から読み出された処理対象の画像であり、画素値Ａ乃至Ｍは、フレームメモリ７２から読み出され、参照される復号済みの画像の画素値である。 The nine types of Intra_4x4_pred_mode will be described with reference to FIG. 12. In the example of FIG. 12, pixels a to p represent pixels of the target block to be intra-processed, and pixel values A to M represent pixel values of pixels belonging to adjacent blocks. That is, the pixels a to p belong to the image to be processed, read from the screen rearrangement buffer 62, and the pixel values A to M are pixel values of the decoded image that is read from the frame memory 72 and referred to.

図１０および図１１に示す各イントラ予測モードの場合、画素ａ乃至ｐの予測画素値は、隣接ブロックに属する画素の画素値Ａ乃至Ｍを用いて、以下のように生成される。なお、画素値が“available”であるとは、画枠の端であったり、あるいは、まだ符号化されていないなどの理由がなく、利用可能であることを表す。これに対して、画素値が“unavailable”であるとは、画枠の端であったり、あるいは、まだ符号化されていないなどの理由により利用可能ではないことを表す。 In the case of each intra prediction mode shown in FIGS. 10 and 11, the predicted pixel values of the pixels a to p are generated as follows using the pixel values A to M of the pixels belonging to the adjacent blocks. A pixel value being “available” means that it can be used, there being no obstacle such as the pixel lying at the edge of the image frame or not yet having been encoded. In contrast, a pixel value being “unavailable” means that it cannot be used because the pixel lies at the edge of the image frame or has not yet been encoded.

モード０はVertical Prediction modeであり、画素値Ａ乃至Ｄが “available” の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（７）のように生成される。

画素ａ, ｅ, ｉ, ｍの予測画素値＝Ａ
画素ｂ, ｆ, ｊ, ｎの予測画素値＝Ｂ
画素ｃ, ｇ, ｋ, ｏの予測画素値＝Ｃ
画素ｄ, ｈ, ｌ, ｐの予測画素値＝Ｄ・・・（７） Mode 0 is the Vertical Prediction mode, and is applied only when the pixel values A to D are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (7).

Predicted pixel value of pixels a, e, i, m = A
Predicted pixel value of pixels b, f, j, n = B
Predicted pixel value of pixels c, g, k, o = C
Predicted pixel value of pixels d, h, l, and p = D (7)

モード１はHorizontal Prediction modeであり、画素値Ｉ乃至Ｌが “available” の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（８）のように生成される。

画素ａ, ｂ, ｃ, ｄの予測画素値＝Ｉ
画素ｅ, ｆ, ｇ, ｈの予測画素値＝Ｊ
画素ｉ, ｊ, ｋ, ｌの予測画素値＝Ｋ
画素ｍ, ｎ, ｏ, ｐの予測画素値＝Ｌ・・・（８） Mode 1 is a horizontal prediction mode and is applied only when the pixel values I to L are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (8).

Predicted pixel value of pixels a, b, c, d = I
Predicted pixel value of pixels e, f, g, h = J
Predicted pixel value of pixels i, j, k, l = K
Predicted pixel value of pixels m, n, o, p = L (8)
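
Expressions (7) and (8) simply replicate one neighboring pixel along each column or each row. A minimal Python sketch, assuming the block is stored row by row (pixels a to d form row 0) and using made-up sample values; the function names are illustrative and not taken from the text:

```python
def predict_vertical_4x4(top):
    """Mode 0: every row repeats the pixels above the block (A, B, C, D)."""
    return [list(top[:4]) for _ in range(4)]


def predict_horizontal_4x4(left):
    """Mode 1: every row is filled with the pixel to its left (I, J, K, L)."""
    return [[left[row]] * 4 for row in range(4)]


top = [10, 20, 30, 40]    # illustrative values for A, B, C, D
left = [50, 60, 70, 80]   # illustrative values for I, J, K, L
vertical = predict_vertical_4x4(top)
horizontal = predict_horizontal_4x4(left)
```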

モード２はDC Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌが全て “available” である時、予測画素値は式（９）のように生成される。

（Ａ＋Ｂ＋Ｃ＋Ｄ＋Ｉ＋Ｊ＋Ｋ＋Ｌ＋４）＞＞３・・・（９） Mode 2 is a DC Prediction mode. When the pixel values A, B, C, D, I, J, K, and L are all “available”, the predicted pixel value is generated as shown in Expression (9).

(A + B + C + D + I + J + K + L + 4) >> 3 (9)

また、画素値Ａ,Ｂ,Ｃ,Ｄが全て “unavailable” である時、予測画素値は式（１０）のように生成される。

（Ｉ＋Ｊ＋Ｋ＋Ｌ＋２）＞＞２・・・（１０）
Also, when the pixel values A, B, C, and D are all “unavailable”, the predicted pixel value is generated as in Expression (10).

(I + J + K + L + 2) >> 2 (10)

また、画素値Ｉ,Ｊ,Ｋ,Ｌが全て “unavailable” である時、予測画素値は式（１１）のように生成される。

（Ａ＋Ｂ＋Ｃ＋Ｄ＋２）＞＞２・・・（１１）
Further, when the pixel values I, J, K, and L are all “unavailable”, the predicted pixel value is generated as in Expression (11).

(A + B + C + D + 2) >> 2 (11)

なお、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌが全て“unavailable” である時、１２８を予測画素値として用いる。 When the pixel values A, B, C, D, I, J, K, and L are all “unavailable”, 128 is used as the predicted pixel value.
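
The four DC cases of Expressions (9) to (11) and the fallback value 128 can be collected into one function. This is a sketch only; representing an unavailable side by `None` is an assumption of the example, as is the function name.

```python
def predict_dc_4x4(top=None, left=None):
    """Mode 2 (DC): top = [A, B, C, D], left = [I, J, K, L]; None means unavailable."""
    if top is not None and left is not None:
        dc = (sum(top) + sum(left) + 4) >> 3   # Expression (9)
    elif left is not None:
        dc = (sum(left) + 2) >> 2              # Expression (10): top unavailable
    elif top is not None:
        dc = (sum(top) + 2) >> 2               # Expression (11): left unavailable
    else:
        dc = 128                               # neither side available
    return [[dc] * 4 for _ in range(4)]
```

The added constants 4 and 2 are half of the divisors 8 and 4, so each right shift rounds to the nearest integer instead of truncating.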

モード３はDiagonal_Down_Left Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌ,Ｍが“available”の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（１２）のように生成される。

画素ａの予測画素値＝（Ａ＋２Ｂ＋Ｃ＋２）＞＞２
画素ｂ，ｅの予測画素値＝（Ｂ＋２Ｃ＋Ｄ＋２）＞＞２
画素ｃ，ｆ，ｉの予測画素値＝（Ｃ＋２Ｄ＋Ｅ＋２）＞＞２
画素ｄ，ｇ，ｊ，ｍの予測画素値＝（Ｄ＋２Ｅ＋Ｆ＋２）＞＞２
画素ｈ，ｋ，ｎの予測画素値＝（Ｅ＋２Ｆ＋Ｇ＋２）＞＞２
画素ｌ，ｏの予測画素値＝（Ｆ＋２Ｇ＋Ｈ＋２）＞＞２
画素ｐの予測画素値＝（Ｇ＋３Ｈ＋２）＞＞２
・・・（１２） Mode 3 is a Diagonal_Down_Left Prediction mode, and is applied only when the pixel values A, B, C, D, I, J, K, L, and M are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (12).

Predicted pixel value of pixel a = (A + 2B + C + 2) >> 2
Predicted pixel value of pixels b and e = (B + 2C + D + 2) >> 2
Predicted pixel value of pixels c, f, i = (C + 2D + E + 2) >> 2
Predicted pixel value of pixels d, g, j, m = (D + 2E + F + 2) >> 2
Predicted pixel value of pixels h, k, n = (E + 2F + G + 2) >> 2
Predicted pixel value of pixels l and o = (F + 2G + H + 2) >> 2
Predicted pixel value of pixel p = (G + 3H + 2) >> 2
(12)
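
Expression (12) applies the same three-tap (1, 2, 1) filter along the upper pixel row, with the tap position depending only on x + y. A Python sketch under that reading; the function name and the list layout `top8 = [A, ..., H]` are assumptions of the example:

```python
def predict_diagonal_down_left_4x4(top8):
    """Mode 3: top8 = [A, B, C, D, E, F, G, H], the pixels above and above-right."""
    pred = [[0] * 4 for _ in range(4)]
    for y in range(4):
        for x in range(4):
            k = x + y
            if k == 6:
                # Pixel p: H is weighted 3 because no pixel follows it.
                pred[y][x] = (top8[6] + 3 * top8[7] + 2) >> 2
            else:
                pred[y][x] = (top8[k] + 2 * top8[k + 1] + top8[k + 2] + 2) >> 2
    return pred
```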

モード４はDiagonal_Down_Right Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌ,Ｍが “available” の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（１３）のように生成される。

画素ｍの予測画素値＝（Ｊ＋２Ｋ＋Ｌ＋２）＞＞２
画素ｉ，ｎの予測画素値＝（Ｉ＋２Ｊ＋Ｋ＋２）＞＞２
画素ｅ，ｊ，ｏの予測画素値＝（Ｍ＋２Ｉ＋Ｊ＋２）＞＞２
画素ａ，ｆ，ｋ，ｐの予測画素値＝（Ａ＋２Ｍ＋Ｉ＋２）＞＞２
画素ｂ，ｇ，ｌの予測画素値＝（Ｍ＋２Ａ＋Ｂ＋２）＞＞２
画素ｃ，ｈの予測画素値＝（Ａ＋２Ｂ＋Ｃ＋２）＞＞２
画素ｄの予測画素値＝（Ｂ＋２Ｃ＋Ｄ＋２）＞＞２
・・・（１３） Mode 4 is a Diagonal_Down_Right Prediction mode, and is applied only when the pixel values A, B, C, D, I, J, K, L, and M are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (13).

Predicted pixel value of pixel m = (J + 2K + L + 2) >> 2
Predicted pixel value of pixels i and n = (I + 2J + K + 2) >> 2
Predicted pixel value of pixels e, j, o = (M + 2I + J + 2) >> 2
Predicted pixel value of pixels a, f, k, p = (A + 2M + I + 2) >> 2
Predicted pixel value of pixels b, g, l = (M + 2A + B + 2) >> 2
Predicted pixel value of pixels c and h = (A + 2B + C + 2) >> 2
Predicted pixel value of pixel d = (B + 2C + D + 2) >> 2
... (13)
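
In Expression (13) every predicted value is again a (1, 2, 1) filter output, this time taken along the edge that runs from L up through M and across to D, and indexed by x − y. A Python sketch; the function name and argument layout are illustrative assumptions:

```python
def predict_diagonal_down_right_4x4(top, left, corner):
    """Mode 4: top = [A, B, C, D], left = [I, J, K, L], corner = M."""
    # Edge pixels in order L, K, J, I, M, A, B, C, D (indices 0..8).
    edge = list(left[::-1]) + [corner] + list(top)
    pred = [[0] * 4 for _ in range(4)]
    for y in range(4):
        for x in range(4):
            c = 4 + (x - y)   # index of the center tap along the edge
            pred[y][x] = (edge[c - 1] + 2 * edge[c] + edge[c + 1] + 2) >> 2
    return pred
```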

モード５はDiagonal_Vertical_Right Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌ,Ｍが“available”の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（１４）のように生成される。

画素ａ，ｊの予測画素値＝（Ｍ＋Ａ＋１）＞＞１
画素ｂ，ｋの予測画素値＝（Ａ＋Ｂ＋１）＞＞１
画素ｃ，ｌの予測画素値＝（Ｂ＋Ｃ＋１）＞＞１
画素ｄの予測画素値＝（Ｃ＋Ｄ＋１）＞＞１
画素ｅ，ｎの予測画素値＝（Ｉ＋２Ｍ＋Ａ＋２）＞＞２
画素ｆ，ｏの予測画素値＝（Ｍ＋２Ａ＋Ｂ＋２）＞＞２
画素ｇ，ｐの予測画素値＝（Ａ＋２Ｂ＋Ｃ＋２）＞＞２
画素ｈの予測画素値＝（Ｂ＋２Ｃ＋Ｄ＋２）＞＞２
画素ｉの予測画素値＝（Ｍ＋２Ｉ＋Ｊ＋２）＞＞２
画素ｍの予測画素値＝（Ｉ＋２Ｊ＋Ｋ＋２）＞＞２
・・・（１４）
Mode 5 is a Diagonal_Vertical_Right Prediction mode, and is applied only when the pixel values A, B, C, D, I, J, K, L, and M are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (14).

Predicted pixel value of pixels a and j = (M + A + 1) >> 1
Predicted pixel value of pixels b and k = (A + B + 1) >> 1
Predicted pixel value of pixels c and l = (B + C + 1) >> 1
Predicted pixel value of pixel d = (C + D + 1) >> 1
Predicted pixel value of pixels e and n = (I + 2M + A + 2) >> 2
Predicted pixel value of pixels f and o = (M + 2A + B + 2) >> 2
Predicted pixel value of pixels g and p = (A + 2B + C + 2) >> 2
Predicted pixel value of pixel h = (B + 2C + D + 2) >> 2
Predicted pixel value of pixel i = (M + 2I + J + 2) >> 2
Predicted pixel value of pixel m = (I + 2J + K + 2) >> 2
(14)

モード６はHorizontal_Down Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌ,Ｍが“available”の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（１５）のように生成される。

画素ａ，ｇの予測画素値＝（Ｍ＋Ｉ＋１）＞＞１
画素ｂ，ｈの予測画素値＝（Ｉ＋２Ｍ＋Ａ＋２）＞＞２
画素ｃの予測画素値＝（Ｍ＋２Ａ＋Ｂ＋２）＞＞２
画素ｄの予測画素値＝（Ａ＋２Ｂ＋Ｃ＋２）＞＞２
画素ｅ，ｋの予測画素値＝（Ｉ＋Ｊ＋１）＞＞１
画素ｆ，ｌの予測画素値＝（Ｍ＋２Ｉ＋Ｊ＋２）＞＞２
画素ｉ，ｏの予測画素値＝（Ｊ＋Ｋ＋１）＞＞１
画素ｊ，ｐの予測画素値＝（Ｉ＋２Ｊ＋Ｋ＋２）＞＞２
画素ｍの予測画素値＝（Ｋ＋Ｌ＋１）＞＞１
画素ｎの予測画素値＝（Ｊ＋２Ｋ＋Ｌ＋２）＞＞２
・・・（１５） Mode 6 is a Horizontal_Down Prediction mode, and is applied only when the pixel values A, B, C, D, I, J, K, L, and M are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (15).

Predicted pixel value of pixels a and g = (M + I + 1) >> 1
Predicted pixel value of pixels b and h = (I + 2M + A + 2) >> 2
Predicted pixel value of pixel c = (M + 2A + B + 2) >> 2
Predicted pixel value of pixel d = (A + 2B + C + 2) >> 2
Predicted pixel value of pixels e and k = (I + J + 1) >> 1
Predicted pixel value of pixels f and l = (M + 2I + J + 2) >> 2
Predicted pixel value of pixels i and o = (J + K + 1) >> 1
Predicted pixel value of pixels j and p = (I + 2J + K + 2) >> 2
Predicted pixel value of pixel m = (K + L + 1) >> 1
Predicted pixel value of pixel n = (J + 2K + L + 2) >> 2
... (15)

モード７は、Vertical_Left Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌ,Ｍが“available”の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（１６）のように生成される。

画素ａの予測画素値＝（Ａ＋Ｂ＋１）＞＞１
画素ｂ，ｉの予測画素値＝（Ｂ＋Ｃ＋１）＞＞１
画素ｃ，ｊの予測画素値＝（Ｃ＋Ｄ＋１）＞＞１
画素ｄ，ｋの予測画素値＝（Ｄ＋Ｅ＋１）＞＞１
画素ｌの予測画素値＝（Ｅ＋Ｆ＋１）＞＞１
画素ｅの予測画素値＝（Ａ＋２Ｂ＋Ｃ＋２）＞＞２
画素ｆ，ｍの予測画素値＝（Ｂ＋２Ｃ＋Ｄ＋２）＞＞２
画素ｇ，ｎの予測画素値＝（Ｃ＋２Ｄ＋Ｅ＋２）＞＞２
画素ｈ，ｏの予測画素値＝（Ｄ＋２Ｅ＋Ｆ＋２）＞＞２
画素ｐの予測画素値＝（Ｅ＋２Ｆ＋Ｇ＋２）＞＞２
・・・（１６） Mode 7 is a Vertical_Left Prediction mode, and is applied only when the pixel values A, B, C, D, I, J, K, L, and M are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (16).

Predicted pixel value of pixel a = (A + B + 1) >> 1
Predicted pixel value of pixels b and i = (B + C + 1) >> 1
Predicted pixel value of pixels c and j = (C + D + 1) >> 1
Predicted pixel value of pixels d and k = (D + E + 1) >> 1
Predicted pixel value of pixel l = (E + F + 1) >> 1
Predicted pixel value of pixel e = (A + 2B + C + 2) >> 2
Predicted pixel value of pixels f and m = (B + 2C + D + 2) >> 2
Predicted pixel value of pixels g and n = (C + 2D + E + 2) >> 2
Predicted pixel value of pixels h and o = (D + 2E + F + 2) >> 2
Predicted pixel value of pixel p = (E + 2F + G + 2) >> 2
... (16)

モード８は、Horizontal_Up Prediction modeであり、画素値Ａ,Ｂ,Ｃ,Ｄ,Ｉ,Ｊ,Ｋ,Ｌ,Ｍが“available”の場合のみ適用される。この場合、画素ａ乃至ｐの予測画素値は、次の式（１７）のように生成される。

画素ａの予測画素値＝（Ｉ＋Ｊ＋１）＞＞１
画素ｂの予測画素値＝（Ｉ＋２Ｊ＋Ｋ＋２）＞＞２
画素ｃ，ｅの予測画素値＝（Ｊ＋Ｋ＋１）＞＞１
画素ｄ，ｆの予測画素値＝（Ｊ＋２Ｋ＋Ｌ＋２）＞＞２
画素ｇ，ｉの予測画素値＝（Ｋ＋Ｌ＋１）＞＞１
画素ｈ，ｊの予測画素値＝（Ｋ＋３Ｌ＋２）＞＞２
画素ｋ，ｌ，ｍ，ｎ，ｏ，ｐの予測画素値＝Ｌ
・・・（１７） Mode 8 is a Horizontal_Up Prediction mode, and is applied only when the pixel values A, B, C, D, I, J, K, L, and M are “available”. In this case, the predicted pixel values of the pixels a to p are generated as in the following Expression (17).

Predicted pixel value of pixel a = (I + J + 1) >> 1
Predicted pixel value of pixel b = (I + 2J + K + 2) >> 2
Predicted pixel value of pixels c and e = (J + K + 1) >> 1
Predicted pixel value of pixels d and f = (J + 2K + L + 2) >> 2
Predicted pixel value of pixels g and i = (K + L + 1) >> 1
Predicted pixel value of pixels h and j = (K + 3L + 2) >> 2
Predicted pixel value of pixels k, l, m, n, o, p = L
... (17)
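
Expression (17) can be generated from a single index z = x + 2y: even positions average two left pixels, odd positions use the (1, 2, 1) filter, and everything past L is filled with L. A Python sketch under that reading; the function name and the variable z are introduced here for illustration:

```python
def predict_horizontal_up_4x4(left):
    """Mode 8: left = [I, J, K, L], the four pixels to the left of the block."""
    pred = [[0] * 4 for _ in range(4)]
    for y in range(4):
        for x in range(4):
            z = x + 2 * y        # position along the prediction direction
            i = y + (x >> 1)     # index of the first left pixel used
            if z > 5:
                pred[y][x] = left[3]
            elif z == 5:
                pred[y][x] = (left[2] + 3 * left[3] + 2) >> 2
            elif z % 2 == 0:
                pred[y][x] = (left[i] + left[i + 1] + 1) >> 1
            else:
                pred[y][x] = (left[i] + 2 * left[i + 1] + left[i + 2] + 2) >> 2
    return pred
```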

次に、図１３を参照して、輝度信号の４×４画素のイントラ予測モード(Intra_4x4_pred_mode)の符号化方式について説明する。図１３の例において、４×４画素からなり、符号化対象となる対象ブロックＣが示されており、対象ブロックＣに隣接する４×４画素からなるブロックＡおよびブロックＢが示されている。 Next, the encoding method for the 4 × 4 pixel intra prediction mode (Intra_4x4_pred_mode) of the luminance signal will be described with reference to FIG. 13. The example of FIG. 13 shows a target block C, which consists of 4 × 4 pixels and is to be encoded, and blocks A and B, each consisting of 4 × 4 pixels and adjacent to the target block C.

この場合、対象ブロックＣにおけるIntra_4x4_pred_modeと、ブロックＡおよびブロックＢにおけるIntra_4x4_pred_modeとは高い相関があると考えられる。この相関性を用いて、次のように符号化処理を行うことにより、より高い符号化効率を実現することができる。 In this case, it is considered that Intra_4x4_pred_mode in the target block C and Intra_4x4_pred_mode in the block A and the block B have a high correlation. By using this correlation and performing encoding processing as follows, higher encoding efficiency can be realized.

すなわち、図１３の例において、ブロックＡおよびブロックＢにおけるIntra_4x4_pred_modeを、それぞれ、Intra_4x4_pred_modeAおよびIntra_4x4_pred_modeBとして、MostProbableModeを次の式（１８）と定義する。

MostProbableMode=Min(Intra_4x4_pred_modeA, Intra_4x4_pred_modeB)
・・・（１８） That is, in the example of FIG. 13, Intra_4x4_pred_mode in the block A and the block B is set as Intra_4x4_pred_modeA and Intra_4x4_pred_modeB, respectively, and MostProbableMode is defined as the following equation (18).

MostProbableMode = Min (Intra_4x4_pred_modeA, Intra_4x4_pred_modeB)
... (18)

すなわち、ブロックＡおよびブロックＢのうち、より小さなmode_numberを割り当てられている方をMostProbableModeとする。 That is, among blocks A and B, the one to which a smaller mode_number is assigned is referred to as MostProbableMode.

ビットストリーム中には、対象ブロックＣに対するパラメータとして、prev_intra4x4_pred_mode_flag[luma4x4BlkIdx] および rem_intra4x4_pred_mode[luma4x4BlkIdx] という２つの値が定義されており、次の式（１９）に示される擬似コードに基づく処理により、復号処理が行われ、対象ブロックＣに対するIntra_4x4_pred_mode、Intra4x4PredMode[luma4x4BlkIdx] の値を得ることができる。 In the bitstream, two values, prev_intra4x4_pred_mode_flag[luma4x4BlkIdx] and rem_intra4x4_pred_mode[luma4x4BlkIdx], are defined as parameters for the target block C. Decoding is performed by processing based on the pseudo code shown in the following Expression (19), which yields the value of Intra_4x4_pred_mode for the target block C, Intra4x4PredMode[luma4x4BlkIdx].

if(prev_intra4x4_pred_mode_flag[luma4x4BlkIdx])
    Intra4x4PredMode[luma4x4BlkIdx] = MostProbableMode
else
    if(rem_intra4x4_pred_mode[luma4x4BlkIdx] < MostProbableMode)
        Intra4x4PredMode[luma4x4BlkIdx] = rem_intra4x4_pred_mode[luma4x4BlkIdx]
    else
        Intra4x4PredMode[luma4x4BlkIdx] = rem_intra4x4_pred_mode[luma4x4BlkIdx] + 1
・・・（１９） ... (19)
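
The decoding rule of Expressions (18) and (19) can be exercised with a small round trip. In the Python sketch below, `decode_intra4x4_pred_mode` follows the pseudo code; `encode_intra4x4_pred_mode` is a hypothetical inverse written for this example and is not described in the text.

```python
def decode_intra4x4_pred_mode(prev_flag, rem_mode, mode_a, mode_b):
    """Recover Intra4x4PredMode for block C.

    prev_flag stands for prev_intra4x4_pred_mode_flag, rem_mode for
    rem_intra4x4_pred_mode, and mode_a / mode_b for the modes of blocks A and B.
    """
    most_probable_mode = min(mode_a, mode_b)   # Expression (18)
    if prev_flag:
        return most_probable_mode
    if rem_mode < most_probable_mode:
        return rem_mode
    return rem_mode + 1                        # skip over MostProbableMode


def encode_intra4x4_pred_mode(mode, mode_a, mode_b):
    """Hypothetical inverse: return (prev_flag, rem_mode); rem_mode is None if unused."""
    most_probable_mode = min(mode_a, mode_b)
    if mode == most_probable_mode:
        return 1, None
    return 0, mode if mode < most_probable_mode else mode - 1
```

Because MostProbableMode is signaled by the flag alone, the remaining eight modes fit in the range 0 to 7, which is why the decoder adds 1 to values at or above MostProbableMode.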

次に、１６×１６画素のイントラ予測モードについて説明する。図１４および図１５は、４種類の輝度信号の１６×１６画素のイントラ予測モード(Intra_16x16_pred_mode)を示す図である。 Next, the 16 × 16 pixel intra prediction mode will be described. FIG. 14 and FIG. 15 are diagrams showing the four types of 16 × 16 pixel intra prediction modes (Intra_16x16_pred_mode) for the luminance signal.

４種類のイントラ予測モードについて、図１６を参照して説明する。図１６の例において、イントラ処理される対象マクロブロックＡが示されており、P(x,y);x,y=-1,0,…,15は、対象マクロブロックＡに隣接する画素の画素値を表している。 The four types of intra prediction modes will be described with reference to FIG. 16. In the example of FIG. 16, the target macroblock A to be intra-processed is shown, and P(x,y); x,y = -1,0,…,15 represents the pixel values of the pixels adjacent to the target macroblock A.

モード０は、Vertical Prediction modeであり、P(x,-1); x,y=-1,0,…,15が “available” である時のみ適用される。この場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２０）のように生成される。

Pred(x,y) = P(x,-1);x,y=0,…,15
・・・（２０） Mode 0 is a Vertical Prediction mode, and is applied only when P (x, -1); x, y = -1,0,..., 15 is “available”. In this case, the predicted pixel value Pred (x, y) of each pixel of the target macroblock A is generated as in the following equation (20).

Pred (x, y) = P (x, -1); x, y = 0, ..., 15
... (20)

モード１はHorizontal Prediction modeであり、P(-1,y); x,y=-1,0,…,15が “available” である時のみ適用される。この場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２１）のように生成される。

Pred(x,y) = P(-1,y);x,y=0,…,15
・・・（２１） Mode 1 is a horizontal prediction mode and is applied only when P (-1, y); x, y = -1,0,..., 15 is “available”. In this case, the predicted pixel value Pred (x, y) of each pixel of the target macroblock A is generated as in the following equation (21).

Pred (x, y) = P (-1, y); x, y = 0, ..., 15
... (21)

モード２はDC Prediction modeであり、P(x,-1)およびP(-1,y); x,y=-1,0,…,15が全て “available” である場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２２）のように生成される。

Mode 2 is the DC Prediction mode. When P(x,-1) and P(-1,y); x,y = -1,0,…,15 are all “available”, the predicted pixel value Pred(x,y) of each pixel of the target macroblock A is generated as in the following Expression (22).

また、P(x,-1); x,y=-1,0,…,15が “unavailable” である場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２３）のように生成される。

When P(x,-1); x,y = -1,0,…,15 is “unavailable”, the predicted pixel value Pred(x,y) of each pixel of the target macroblock A is generated as in the following Expression (23).

P(-1,y); x,y=-1,0,…,15が “unavailable” である場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２４）のように生成される。

When P(-1,y); x,y = -1,0,…,15 is “unavailable”, the predicted pixel value Pred(x,y) of each pixel of the target macroblock A is generated as in the following Expression (24).

P(x,-1)およびP(-1,y); x,y=-1,0,…,15が全て “unavailable” である場合には、予測画素値として１２８を用いる。 When P (x, −1) and P (−1, y); x, y = −1,0,..., 15 are all “unavailable”, 128 is used as the predicted pixel value.
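
Assuming Expressions (22) to (24) follow the Intra_16x16 DC rule of the H.264/AVC standard, which is a rounded mean of the available neighboring pixels, they can be written as in the sketch below for x, y = 0, …, 15. The summation indices x′ and y′ are notation introduced here.

```latex
\begin{align}
\mathrm{Pred}(x,y) &= \Bigl[\sum_{x'=0}^{15} P(x',-1) + \sum_{y'=0}^{15} P(-1,y') + 16\Bigr] \gg 5 \tag{22}\\
\mathrm{Pred}(x,y) &= \Bigl[\sum_{y'=0}^{15} P(-1,y') + 8\Bigr] \gg 4 \tag{23}\\
\mathrm{Pred}(x,y) &= \Bigl[\sum_{x'=0}^{15} P(x',-1) + 8\Bigr] \gg 4 \tag{24}
\end{align}
```

In each case the added constant is half of the divisor implied by the shift (32 or 16), so the result is rounded to the nearest integer.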

モード３はPlane Prediction modeであり、P(x,-1)及びP(-1,y); x,y=-1,0,…,15が全て “available” の場合のみ適用される。この場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２５）のように生成される。

Mode 3 is a plane prediction mode, and is applied only when P (x, -1) and P (-1, y); x, y = -1,0, ..., 15 are all "available". In this case, the predicted pixel value Pred (x, y) of each pixel of the target macroblock A is generated as in the following Expression (25).
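
Assuming Expression (25) matches the Intra_16x16 plane rule of the H.264/AVC standard, the prediction fits a plane through the neighboring pixels: two gradients are estimated from the upper row and the left column, and the plane is then evaluated at every pixel. The Python sketch below follows that rule; the function name, the argument layout, and the clip to the 8-bit range are assumptions of the example.

```python
def predict_plane_16x16(top, left, corner):
    """Mode 3 (plane): top[x] = P(x,-1), left[y] = P(-1,y), corner = P(-1,-1)."""

    def p_top(x):    # P(x, -1) for x = -1..15
        return corner if x < 0 else top[x]

    def p_left(y):   # P(-1, y) for y = -1..15
        return corner if y < 0 else left[y]

    # Weighted differences across the center of the row and of the column.
    h = sum((i + 1) * (p_top(8 + i) - p_top(6 - i)) for i in range(8))
    v = sum((i + 1) * (p_left(8 + i) - p_left(6 - i)) for i in range(8))
    a = 16 * (left[15] + top[15])
    b = (5 * h + 32) >> 6    # horizontal gradient
    c = (5 * v + 32) >> 6    # vertical gradient

    def clip(value):         # 8-bit samples assumed
        return max(0, min(255, value))

    return [[clip((a + b * (x - 7) + c * (y - 7) + 16) >> 5) for x in range(16)]
            for y in range(16)]
```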

次に、色差信号に対するイントラ予測モードについて説明する。図１７は、４種類の色差信号のイントラ予測モード(Intra_chroma_pred_mode)を示す図である。色差信号のイントラ予測モードは、輝度信号のイントラ予測モードと独立に設定が可能である。色差信号に対するイントラ予測モードは、上述した輝度信号の１６×１６画素のイントラ予測モードに順ずる。 Next, the intra prediction mode for color difference signals will be described. FIG. 17 is a diagram illustrating four types of color difference signal intra prediction modes (Intra_chroma_pred_mode). The color difference signal intra prediction mode can be set independently of the luminance signal intra prediction mode. The intra prediction mode for the color difference signal is in accordance with the 16 × 16 pixel intra prediction mode of the luminance signal described above.

ただし、輝度信号の１６×１６画素のイントラ予測モードが、１６×１６画素のブロックを対象としているのに対し、色差信号に対するイントラ予測モードは、８×８画素のブロックを対象としている。さらに、上述した図１４と図１７に示されるように、両者においてモード番号は対応していない。 However, the 16 × 16 pixel intra prediction mode for the luminance signal is intended for a block of 16 × 16 pixels, whereas the intra prediction mode for a color difference signal is intended for a block of 8 × 8 pixels. Furthermore, as shown in FIGS. 14 and 17 described above, the mode numbers do not correspond to each other.

ここで、図１６を参照して上述した輝度信号の１６×１６画素のイントラ予測モードの対象マクロブロックＡの画素値および隣接する画素値の定義に準じる。例えば、イントラ処理される対象マクロブロックＡ（色差信号の場合は、８×８画素）に隣接する画素の画素値をP(x,y);x,y=-1,0,…,7とする。 Here, the definitions follow those of the pixel values of the target macroblock A and the adjacent pixel values in the 16 × 16 pixel intra prediction mode of the luminance signal described above with reference to FIG. 16. For example, the pixel values of the pixels adjacent to the target macroblock A to be intra-processed (8 × 8 pixels in the case of a color difference signal) are denoted P(x,y); x,y = -1,0,…,7.

モード０はDC Prediction modeであり、P(x,-1)およびP(-1,y); x,y=-1,0,…,7が全て “available” である場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２６）のように生成される。

Mode 0 is the DC Prediction mode. When P(x,-1) and P(-1,y); x,y = -1,0,…,7 are all “available”, the predicted pixel value Pred(x,y) of each pixel of the target macroblock A is generated as in the following Expression (26).

また、P(-1,y) ; x,y=-1,0,…,7が “unavailable” である場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２７）のように生成される。

Further, when P(-1,y); x,y = -1,0,…,7 is “unavailable”, the predicted pixel value Pred(x,y) of each pixel of the target macroblock A is generated as in the following Expression (27).

また、P(x,-1) ; x,y=-1,0,…,7が “unavailable”である場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２８）のように生成される。

When P(x,-1); x,y = -1,0,…,7 is “unavailable”, the predicted pixel value Pred(x,y) of each pixel of the target macroblock A is generated as in the following Expression (28).

モード１はHorizontal Prediction modeであり、P(-1,y) ; x,y=-1,0,…,7が “available” の場合にのみ適用される。この場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（２９）のように生成される。

Pred(x,y) = P(-1,y);x,y=0,…,7
・・・（２９） Mode 1 is a Horizontal Prediction mode, and is applied only when P (-1, y); x, y = -1,0,..., 7 is “available”. In this case, the predicted pixel value Pred (x, y) of each pixel of the target macroblock A is generated as in the following Expression (29).

Pred (x, y) = P (-1, y); x, y = 0, ..., 7
... (29)

モード２はVertical Prediction modeであり、P(x,-1) ; x,y=-1,0,…,7が “available” の場合にのみ適用される。この場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（３０）のように生成される。

Pred(x,y) = P(x,-1);x,y=0,…,7
・・・（３０）
Mode 2 is the Vertical Prediction mode, and is applied only when P (x, -1); x, y = -1,0, ..., 7 is "available". In this case, the predicted pixel value Pred (x, y) of each pixel of the target macroblock A is generated as in the following equation (30).

Pred (x, y) = P (x, -1); x, y = 0, ..., 7
... (30)

モード３はPlane Prediction modeであり、P(x,-1)及びP(-1,y) ; x,y=-1,0,…,7 が “available” の場合にのみ適用される。この場合、対象マクロブロックＡの各画素の予測画素値Pred(x,y)は、次の式（３１）のように生成される。

Mode 3 is a plane prediction mode and is applied only when P (x, -1) and P (-1, y); x, y = -1,0, ..., 7 are "available". In this case, the predicted pixel value Pred (x, y) of each pixel of the target macroblock A is generated as in the following Expression (31).

以上のように、輝度信号のイントラ予測モードには、９種類の４×４画素および８×８画素のブロック単位、並びに４種類の１６×１６画素のマクロブロック単位の予測モードがある。このブロック単位のモードは、マクロブロック単位毎に設定される。色差信号のイントラ予測モードには、４種類の８×８画素のブロック単位の予測モードがある。この色差信号のイントラ予測モードは、輝度信号のイントラ予測モードと独立に設定が可能である。 As described above, the luminance signal intra prediction modes include nine types of 4 × 4 pixel and 8 × 8 pixel block units, and four types of 16 × 16 pixel macroblock unit prediction modes. This block unit mode is set for each macroblock unit. The color difference signal intra prediction modes include four types of prediction modes in units of 8 × 8 pixel blocks. This color difference signal intra prediction mode can be set independently of the luminance signal intra prediction mode.

また、輝度信号の４×４画素のイントラ予測モード（イントラ４×４予測モード）および８×８画素のイントラ予測モード（イントラ８×８予測モード）については、４×４画素および８×８画素の輝度信号のブロック毎に１つのイントラ予測モードが設定される。輝度信号の１６×１６画素のイントラ予測モード（イントラ１６×１６予測モード）と色差信号のイントラ予測モードについては、１つのマクロブロックに対して１つの予測モードが設定される。 In addition, the 4 × 4 pixel intra prediction mode (intra 4 × 4 prediction mode) and the 8 × 8 pixel intra prediction mode (intra 8 × 8 prediction mode) of the luminance signal are 4 × 4 pixels and 8 × 8 pixels. One intra prediction mode is set for each block of luminance signals. For the 16 × 16 pixel intra prediction mode for luminance signals (intra 16 × 16 prediction mode) and the intra prediction mode for color difference signals, one prediction mode is set for one macroblock.

なお、予測モードの種類は、上述した図１の番号０，１，３乃至８で示される方向に対応している。予測モード２は平均値予測である。 Note that the types of prediction modes correspond to the directions indicated by the numbers 0, 1, and 3 to 8 in FIG. 1 described above. Prediction mode 2 is average value prediction.

以上のようにH．264/AVC方式におけるイントラ予測は整数画素精度で行われる。これに対して、画像符号化装置５１においては、小数画素精度のイントラ予測が行われる。 As described above, intra prediction in the H.264/AVC system is performed with integer pixel precision. In contrast, the image encoding device 51 performs intra prediction with fractional pixel precision.

［小数画素精度のイントラ予測の動作］ [Operation of intra prediction with fractional pixel precision]
次に、図１８を参照して、小数画素精度のイントラ予測を実現するための動作について説明する。なお、図１８の例においては、対象ブロックが４×４画素の場合の例が示されている。 Next, the operation for realizing intra prediction with fractional pixel precision will be described with reference to FIG. 18. The example of FIG. 18 shows the case in which the target block is 4 × 4 pixels.

図１８の例の場合、黒丸がイントラ予測の対象ブロックの画素を表しており、白丸が対象ブロックに隣接する隣接画素を表している。さらに詳細には、白丸の隣接画素のうち、対象ブロックの左上部に隣接する左上部隣接画素は、Ａ-1かつＩ-1であり、この画素は、図１２の画素値Ｍの画素に相当する。白丸の隣接画素のうち、対象ブロックの上部に隣接する上部隣接画素は、Ａ0, Ａ1, Ａ2,…であり、これらの画素は、図１２の画素値Ａ乃至Ｈの画素に相当する。白丸の隣接画素のうち、対象ブロックの左部に隣接する左部隣接画素は、Ｉ0, Ｉ1, Ｉ2,…であり、これらの画素は、図１２の画素値Ｉ乃至Ｌの画素に相当する。 In the example of FIG. 18, black circles represent pixels of the target block for intra prediction, and white circles represent adjacent pixels adjacent to the target block. More specifically, among the adjacent pixels shown as white circles, the upper-left adjacent pixel adjacent to the upper left of the target block is both A-1 and I-1, and this pixel corresponds to the pixel having the pixel value M in FIG. 12. The upper adjacent pixels adjacent to the upper part of the target block are A0, A1, A2, …, and these pixels correspond to the pixels having the pixel values A to H in FIG. 12. The left adjacent pixels adjacent to the left part of the target block are I0, I1, I2, …, and these pixels correspond to the pixels having the pixel values I to L in FIG. 12.

また、隣接画素の間に示されるａ-0.5,ａ+0.5,…およびｉ-0.5,ｉ+0.5,…は、１／２画素精度の画素を表している。さらに、ａ-0.5,ａ+0.5,…およびｉ-0.5,ｉ+0.5,…の画素間に示されるａ-0.75,ａ-0.25,ａ+0.25,ａ+0.75,…およびｉ-0.75,ｉ-0.25,ｉ+0.25,ｉ+0.75,…は、１／４画素精度の画素を表している。 Also, a-0.5, a + 0.5,... And i-0.5, i + 0.5,... Shown between adjacent pixels represent pixels with 1/2 pixel accuracy. Further, a-0.75, a-0.25, a + 0.25, a + 0.75,... And i-0.75, i shown between the pixels of a-0.5, a + 0.5, ... and i-0.5, i + 0.5, ... -0.25, i + 0.25, i + 0.75,... Represent pixels with 1/4 pixel accuracy.

まず、第１の動作として、イントラ予測部７４においては、図１２に示される画素値Ａ乃至Ｍを用いて、各イントラ予測モードに対してイントラ予測が行われ、各イントラ予測モードの中から、最適イントラ予測モードが決定される。対象ブロックが４×４の場合、この最適イントラ予測モードは、図１０または図１１の９つの予測モードのいずれかである。 First, as a first operation, the intra prediction unit 74 performs intra prediction for each intra prediction mode using the pixel values A to M shown in FIG. An optimal intra prediction mode is determined. When the target block is 4 × 4, this optimal intra prediction mode is one of the nine prediction modes in FIG. 10 or FIG. 11.

例えば、最適イントラ予測モードとして、モード０（Vertical Prediction mode）が選ばれたとする。このとき、対象ブロックの予測に用いられる隣接画素は、図１２の画素値Ａ乃至Ｄの画素であり、図１８の画素Ａ0, Ａ1, Ａ2, Ａ3となる。 For example, it is assumed that mode 0 (Vertical Prediction mode) is selected as the optimal intra prediction mode. At this time, adjacent pixels used for prediction of the target block are pixels having pixel values A to D in FIG. 12, and are pixels A0, A1, A2, and A3 in FIG.

第２の動作として、隣接画素内挿部７５においては、図４を参照して上述したH．264/AVC方式における６タップのFIRフィルタにより、図１８の１／２画素精度の画素ａ-0.5,ａ+0.5,…が生成される。すなわち、画素ａ-0.5は、次の式（３２）で示される。

ａ-0.5 = (Ａ-3 - 5*Ａ-2 + 20*Ａ-1 + 20*Ａ0 - 5*Ａ1 + Ａ2 + 16) >> 5
・・・（３２）

他の１／２画素精度の画素ａ+0.5, ａ+1.5などに関しても同様である。 As a second operation, the adjacent pixel interpolation unit 75 generates the pixels a-0.5, a+0.5, … with 1/2 pixel precision in FIG. 18 using the 6-tap FIR filter of the H.264/AVC system described above with reference to FIG. 4. That is, the pixel a-0.5, which lies midway between A-1 and A0, is given by the following Expression (32).

a-0.5 = (A-3 - 5*A-2 + 20*A-1 + 20*A0 - 5*A1 + A2 + 16) >> 5
... (32)

The same applies to the other pixels with 1/2 pixel precision, such as a+0.5 and a+1.5.
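
The half-pixel step can be sketched as below, assuming the symmetric H.264/AVC tap order (1, −5, 20, 20, −5, 1) centered on the half-pixel position and a final clip to the 8-bit range. The function name, the list layout, and the edge-replication rule for taps that fall outside the list are assumptions of this sketch. With `row = [A-3, A-2, A-1, A0, A1, A2, ...]`, the call `half_pel(row, 2)` would correspond to the pixel a-0.5.

```python
TAPS = (1, -5, 20, 20, -5, 1)   # 6-tap FIR coefficients; they sum to 32


def half_pel(row, i):
    """Half-pixel value between row[i] and row[i + 1].

    Taps that fall outside the list reuse the nearest edge sample; this
    padding rule is an assumption of the sketch, not taken from the text.
    """
    last = len(row) - 1
    acc = 0
    for k, tap in enumerate(TAPS):
        j = min(max(i - 2 + k, 0), last)
        acc += tap * row[j]
    # Add 16 for rounding, divide by 32, then clip to the 8-bit range.
    return min(255, max(0, (acc + 16) >> 5))
```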

第３の動作として、隣接画素内挿部７５においては、画素Ａ0, Ａ1, Ａ2, Ａ3および画素ａ-0.5,ａ+0.5などから、図１８の１／４画素精度の画素ａ-0.75,ａ-0.25,ａ+0.25,ａ+0.75が、線形内挿により生成される。すなわち、画素ａ+0.25は、次の式（３３）で示される。 As a third operation, the adjacent pixel interpolation unit 75 generates the pixels a-0.75, a-0.25, a+0.25, and a+0.75 with 1/4 pixel precision in FIG. 18 by linear interpolation from the pixels A0, A1, A2, A3, the pixels a-0.5, a+0.5, and so on. That is, the pixel a+0.25 is given by the following Expression (33).

ａ+0.25 = (Ａ0 + ａ+0.5 + 1) >> 1 ・・・（３３）

他の１／４画素精度の画素に関しても同様である。 a+0.25 = (A0 + a+0.5 + 1) >> 1 ... (33)

The same applies to the other pixels with 1/4 pixel precision.
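
Linear interpolation between two neighboring samples is a rounded two-sample average, that is, a right shift by 1 after adding 1. A minimal Python sketch with illustrative function names and made-up sample values:

```python
def quarter_pel(p, q):
    """Rounded average of two neighboring samples (integer or half-pixel)."""
    return (p + q + 1) >> 1


def quarter_positions(a0, a_half, a1):
    """Samples at +0.25 and +0.75 between the integer pixels a0 and a1.

    a_half is the half-pixel sample between them, produced beforehand
    by the 6-tap filter.
    """
    return quarter_pel(a0, a_half), quarter_pel(a_half, a1)
```

For example, with A0 = 100, a+0.5 = 107, and A1 = 112, the sketch yields 104 for a+0.25 and 110 for a+0.75.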

第４の動作として、イントラ予測部７４においては、モード０の場合、整数画素と各小数画素精度の位相差である-0.75,-0.50,-0.25,+0.25,+0.50,+0.75の値を、水平方向のシフト量の候補として、最適なシフト量が決定される。 As a fourth operation, in the case of mode 0, the intra prediction unit 74 determines the optimal shift amount, taking the values -0.75, -0.50, -0.25, +0.25, +0.50, and +0.75, which are the phase differences between an integer pixel and the pixels of each fractional precision, as candidates for the shift amount in the horizontal direction.

例えば、シフト量が+0.25の場合には、画素Ａ0, Ａ1, Ａ2, Ａ3の画素値の代わりに、画素ａ+0.25,ａ+1.25,ａ+2.25,ａ+3.25の画素値が用いられて、イントラ予測が行われる。 For example, when the shift amount is +0.25, the pixel values of the pixels a + 0.25, a + 1.25, a + 2.25, a + 3.25 are used instead of the pixel values of the pixels A0, A1, A2, A3. Intra prediction is performed.

このように、第１の動作で選択された最適イントラ予測モードに対して、最適なシフト量が決定される。例えば、シフト量が0の場合が最適とされて、整数画素の画素値が用いられる場合もあり得る。 In this way, the optimal shift amount is determined for the optimal intra prediction mode selected in the first operation. A shift amount of 0 may also turn out to be optimal, in which case the pixel values of the integer pixels are used.
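
The search over shift candidates for mode 0 can be sketched as follows. The sketch uses the sum of absolute differences (SAD) as a stand-in for the cost function, which the text does not specify at this point; the function names, the `origin` index of A0 within the upper row, and the edge-replication rule in `half_pel` are likewise assumptions.

```python
TAPS = (1, -5, 20, 20, -5, 1)
SHIFTS = (-0.75, -0.5, -0.25, 0.0, 0.25, 0.5, 0.75)


def half_pel(row, i):
    """Half-pixel value between row[i] and row[i + 1] (edges replicated)."""
    last = len(row) - 1
    acc = sum(t * row[min(max(i - 2 + k, 0), last)] for k, t in enumerate(TAPS))
    return min(255, max(0, (acc + 16) >> 5))


def sample_at(row, pos):
    """Sample at a non-negative position that is a multiple of 1/4 pixel."""
    i, frac = divmod(round(pos * 4), 4)   # integer index, quarter phase 0..3
    if frac == 0:
        return row[i]
    h = half_pel(row, i)
    if frac == 2:
        return h
    nearest = row[i] if frac == 1 else row[i + 1]
    return (nearest + h + 1) >> 1         # linear interpolation


def best_vertical_shift(block, row, origin):
    """Return (shift, sad) minimizing the SAD of vertical prediction.

    block: 4x4 original pixels; row: upper neighboring pixel row;
    origin: index of A0 in row (at least 1, with 5 or more samples after it).
    """
    best = None
    for s in SHIFTS:
        pred_row = [sample_at(row, origin + x + s) for x in range(4)]
        sad = sum(abs(block[y][x] - pred_row[x])
                  for y in range(4) for x in range(4))
        if best is None or sad < best[1]:
            best = (s, sad)
    return best
```

Ties are resolved in favor of the first candidate tried; a real encoder would also weigh the bits needed to signal the shift.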

なお、図１０または図１１に示される９つの予測モードのうち、モード２（DC prediction mode）に関しては、平均値処理を行うことになる。したがって、シフトを行ったとしても、符号化効率の向上には直接関与しないため、上述した動作は禁止され、行われない。 Of the nine prediction modes shown in FIG. 10 or FIG. 11, the average value processing is performed for mode 2 (DC prediction mode). Therefore, even if the shift is performed, the above-described operation is prohibited and is not performed because it is not directly related to the improvement of the coding efficiency.

モード０（Vertical Prediction mode）、モード３（Diagonal_Down_Left Prediction mode）、またはモード７（Vertical_Left Prediction mode）に関しては、図１８における上部隣接画素Ａ0, Ａ1, Ａ2,…のみのシフトが候補となる。 For mode 0 (Vertical Prediction mode), mode 3 (Diagonal_Down_Left Prediction mode), or mode 7 (Vertical_Left Prediction mode), only a shift of the upper adjacent pixels A0, A1, A2, … in FIG. 18 is a candidate.

モード１（Horizontal Prediction mode）、またはモード８（Horizontal_Up Prediction mode）に関しては、図１８における左部隣接画素Ｉ0, Ｉ1, Ｉ2,…のみのシフトが候補となる。 For mode 1 (Horizontal Prediction mode) or mode 8 (Horizontal_Up Prediction mode), only a shift of the left adjacent pixels I0, I1, I2, … in FIG. 18 is a candidate.

その他のモード（モード４乃至６）に関しては、上部隣接画素および左部隣接画素の双方に関して、シフトを考慮する必要がある。 For the other modes (modes 4 to 6), it is necessary to consider the shift for both the upper adjacent pixel and the left adjacent pixel.

また、上部隣接画素に関しては、水平方向のシフト量のみが決定され、左部隣接画素に関しては、垂直方向のシフト量のみが決定される。 For the upper adjacent pixel, only the horizontal shift amount is determined, and for the left adjacent pixel, only the vertical shift amount is determined.
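
The rules above amount to a lookup from the 4 × 4 intra prediction mode to the neighboring pixels whose shift is searched. A small Python sketch; the table name, the labels "top" and "left", and the function name are illustrative:

```python
# Neighbors whose shift is a candidate, per 4x4 intra prediction mode.
SHIFT_TARGETS = {
    0: ("top",), 3: ("top",), 7: ("top",),      # upper adjacent pixels only
    1: ("left",), 8: ("left",),                 # left adjacent pixels only
    4: ("top", "left"), 5: ("top", "left"), 6: ("top", "left"),
    2: (),                                      # DC prediction: no shift
}

# Upper pixels are shifted horizontally, left pixels vertically.
SHIFT_AXIS = {"top": "horizontal", "left": "vertical"}


def shift_directions(mode):
    """Return the shift directions that have to be searched for a mode."""
    return [SHIFT_AXIS[neighbor] for neighbor in SHIFT_TARGETS[mode]]
```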

以上の第１乃至第４の動作を行い、最適なシフト量を決定することにより、イントラ予測モードにおいて用いられる画素値の選択肢を増やすことができ、より最適なイントラ予測を行うことができる。これにより、イントラ予測における符号化効率をさらに向上させることが可能である。 By performing the above first to fourth operations and determining the optimum shift amount, the choices of pixel values used in the intra prediction mode can be increased, and more optimum intra prediction can be performed. Thereby, it is possible to further improve the encoding efficiency in intra prediction.

また、H．264/AVC方式においては、図４を参照して上述したように、インター動き予測補償にしか用いられなかった６タップのFIRフィルタの回路を、イントラ予測にも有効活用することができる。これにより、回路を増大させることなく、効率を改善することができる。 In addition, the 6-tap FIR filter circuit, which in the H.264/AVC format was used only for inter motion prediction and compensation as described above with reference to FIG. 4, can also be put to effective use for intra prediction. Efficiency can thereby be improved without increasing the circuit scale.

さらに、H．264/AVC方式において定められているイントラ予測の分解能である22.5°よりさらに細かい分解能でのイントラ予測を行うことができる。 Furthermore, intra prediction can be performed at a resolution finer than 22.5°, which is the intra prediction resolution defined in the H.264/AVC format.

［小数画素精度のイントラ予測の効果例］ [Example of effects of intra prediction with fractional pixel precision]
図１９の例において、点線は、図１を参照して上述したH．264/AVC方式のイントラ予測の予測モードの方向を表している。点線に付された番号は、図１０または図１１に示されている９つの予測モードの番号に対応している。なお、モード２は、平均値予測であるため、その番号が示されていない。 In the example of FIG. 19, the dotted lines indicate the directions of the prediction modes of H.264/AVC intra prediction described above with reference to FIG. 1. The numbers attached to the dotted lines correspond to the numbers of the nine prediction modes shown in FIG. 10 or FIG. 11. Mode 2 is average value prediction, so its number is not shown.

H．264/AVC方式においては、点線に示す22.5°の分解能でしかイントラ予測を行うことができなかった。これに対して、画像符号化装置５１においては、小数画素精度のイントラ予測を行うことにより、太線が表すように22.5°よりさらに細かな分解能でのイントラ予測を行うことができる。これにより、特に、斜めエッジを持つテクスチャに対する符号化効率を向上させることができる。 In the H.264/AVC format, intra prediction could be performed only at the 22.5° resolution indicated by the dotted lines. In contrast, the image encoding device 51 performs intra prediction with fractional pixel precision and can therefore perform intra prediction at a resolution finer than 22.5°, as indicated by the thick lines. This improves encoding efficiency particularly for textures with diagonal edges.

［イントラ予測処理の説明］ [Description of intra prediction processing]
次に、図２０のフローチャートを参照して、上述した動作としてのイントラ予測処理を説明する。なお、このイントラ予測処理は、図８のステップＳ３１におけるイントラ予測処理であり、図２０の例においては、輝度信号の場合を例として説明する。 Next, the intra prediction processing that carries out the operations described above will be described with reference to the flowchart of FIG. 20. This intra prediction processing is the intra prediction processing in step S31 of FIG. 8, and the example of FIG. 20 is described for the case of a luminance signal.

最適モード決定部８２は、ステップＳ４１において、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対してイントラ予測を行う。 In step S41, the optimal mode determination unit 82 performs intra prediction for each of the 4 × 4 pixel, 8 × 8 pixel, and 16 × 16 pixel intra prediction modes.

上述したように、イントラ４×４予測モードとイントラ８×８予測モードには、９種類の予測モードがあり、それぞれブロック毎に１つの予測モードを定義することができる。イントラ１６×１６予測モードと色差信号のイントラ予測モードには、４種類の予測モードがあり、１つのマクロブロックに対して１つの予測モードを定義することができる。 As described above, the intra 4 × 4 prediction mode and the intra 8 × 8 prediction mode have nine types of prediction modes, and one prediction mode can be defined for each block. The intra 16 × 16 prediction mode and the color difference signal intra prediction mode include four types of prediction modes, and one prediction mode can be defined for one macroblock.

最適モード決定部８２は、処理対象のブロックの画素を、隣接画像バッファ８１から読み出される復号済みの隣接画像を参照して、各イントラ予測モードのすべての種類の予測モードで、イントラ予測する。これにより、各イントラ予測モードのすべての種類の予測モードでの予測画像が生成される。なお、参照される復号済みの画素としては、デブロックフィルタ７１によりデブロックフィルタリングされていない画素が用いられる。 The optimum mode determination unit 82 performs intra prediction on the pixels of the processing target block with reference to the decoded adjacent image read from the adjacent image buffer 81 in all types of prediction modes of each intra prediction mode. Thereby, the prediction image in all kinds of prediction modes of each intra prediction mode is generated. Note that pixels that have not been deblocked filtered by the deblocking filter 71 are used as decoded pixels that are referred to.

最適モード決定部８２は、ステップＳ４２において、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対するコスト関数値を算出する。ここで、コスト関数値としては、High Complexity モードか、Low Complexity モードのいずれかの手法に基づいて行う。これらのモードは、H．264/AVC方式における参照ソフトウエアであるＪＭ(Joint Model)で定められている。 In step S42, the optimal mode determination unit 82 calculates cost function values for the 4 × 4 pixel, 8 × 8 pixel, and 16 × 16 pixel intra prediction modes. The cost function values are calculated using the technique of either the High Complexity mode or the Low Complexity mode. These modes are defined in JM (Joint Model), the reference software for the H.264/AVC format.

すなわち、High Complexity モードにおいては、ステップＳ４１の処理として、候補となる全ての予測モードに対して、仮に符号化処理までが行われる。そして、次の式（３４）で表わされるコスト関数値が各予測モードに対して算出され、その最小値を与える予測モードが最適予測モードであるとして選択される。 That is, in the High Complexity mode, as the processing of step S41, all candidate prediction modes are provisionally processed up to and including encoding. The cost function value expressed by the following equation (34) is then calculated for each prediction mode, and the prediction mode that gives the minimum value is selected as the optimal prediction mode.

Cost(Mode) = D + λ・R ・・・（３４）

Dは、原画像と復号画像の差分（歪）、Rは、直交変換係数まで含んだ発生符号量、λは、量子化パラメータQPの関数として与えられるラグランジュ乗数である。 D is the difference (distortion) between the original image and the decoded image, R is the generated code amount including the orthogonal transform coefficients, and λ is a Lagrange multiplier given as a function of the quantization parameter QP.

一方、Low Complexity モードにおいては、ステップＳ４１の処理として、候補となる全ての予測モードに対して、予測画像の生成、および、動きベクトル情報や予測モード情報、フラグ情報などのヘッダビットまでが算出される。そして、次の式（３５）で表わされるコスト関数値が各予測モードに対して算出され、その最小値を与える予測モードが最適予測モードであるとして選択される。 On the other hand, in the Low Complexity mode, as the processing of step S41, prediction images are generated and header bits such as motion vector information, prediction mode information, and flag information are calculated for all candidate prediction modes. The cost function value expressed by the following equation (35) is then calculated for each prediction mode, and the prediction mode that gives the minimum value is selected as the optimal prediction mode.

Cost(Mode) = D + QPtoQuant(QP)・Header_Bit ・・・（３５）

Dは、原画像と復号画像の差分（歪）、Header_Bitは、予測モードに対するヘッダビット、QPtoQuantは、量子化パラメータQPの関数として与えられる関数である。 D is the difference (distortion) between the original image and the decoded image, Header_Bit is the header bits for the prediction mode, and QPtoQuant is a function given as a function of the quantization parameter QP.

Low Complexity モードにおいては、全ての予測モードに対して、予測画像を生成するのみで、符号化処理および復号処理を行う必要がないため、演算量が少なくて済む。 In the Low Complexity mode, only prediction images are generated for all prediction modes, and encoding and decoding need not be performed, so the amount of computation is small.
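Equations (34) and (35) and the minimum-cost selection can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, and the Lagrange multiplier λ and the value of QPtoQuant(QP) are passed in as plain numbers because the text does not give their definitions.

```python
def cost_high_complexity(distortion, rate_bits, lagrange_multiplier):
    """Equation (34): Cost(Mode) = D + lambda * R."""
    return distortion + lagrange_multiplier * rate_bits


def cost_low_complexity(distortion, header_bits, qp_to_quant):
    """Equation (35): Cost(Mode) = D + QPtoQuant(QP) * Header_Bit."""
    return distortion + qp_to_quant * header_bits


def select_min_cost_mode(cost_by_mode):
    """Return the prediction mode whose cost function value is smallest."""
    return min(cost_by_mode, key=cost_by_mode.get)
```

In either mode the selection step is the same; only the terms entering the cost differ, which is why the Low Complexity mode avoids the provisional encoding pass.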

最適モード決定部８２は、ステップＳ４３において、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対して、それぞれ最適モードを決定する。すなわち、上述したように、イントラ４×４予測モードおよびイントラ８×８予測モードの場合には、予測モードの種類が９種類あり、イントラ１６×１６予測モードの場合には、予測モードの種類が４種類ある。したがって、最適モード決定部８２は、ステップＳ４２において算出されたコスト関数値に基づいて、それらの中から、最適イントラ４×４予測モード、最適イントラ８×８予測モード、最適イントラ１６×１６予測モードを決定する。 In step S43, the optimal mode determination unit 82 determines an optimal mode for each of the 4 × 4 pixel, 8 × 8 pixel, and 16 × 16 pixel intra prediction modes. That is, as described above, there are nine types of prediction modes for the intra 4 × 4 and intra 8 × 8 prediction modes, and four types for the intra 16 × 16 prediction mode. Accordingly, based on the cost function values calculated in step S42, the optimal mode determination unit 82 determines from among them the optimal intra 4 × 4 prediction mode, the optimal intra 8 × 8 prediction mode, and the optimal intra 16 × 16 prediction mode.

最適モード決定部８２は、ステップＳ４４において、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対して決定された各最適モードの中から、ステップＳ４２において算出されたコスト関数値に基づく最適イントラ予測モードを選択する。すなわち、４×４画素、８×８画素、および１６×１６画素に対して決定された各最適モードの中から、コスト関数値が最小値であるモードを、最適イントラ予測モードとして選択する。 In step S44, the optimal mode determination unit 82 selects the optimal intra prediction mode, based on the cost function values calculated in step S42, from among the optimal modes determined for the 4 × 4 pixel, 8 × 8 pixel, and 16 × 16 pixel intra prediction modes. That is, from among the optimal modes determined for 4 × 4 pixels, 8 × 8 pixels, and 16 × 16 pixels, the mode with the minimum cost function value is selected as the optimal intra prediction mode.
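The two-stage selection of steps S43 and S44 can be sketched as below, assuming the cost function values of step S42 are already available. The nested-dictionary layout and the function name are assumptions made for illustration.

```python
def select_optimal_intra_mode(costs_by_size):
    """Two-stage selection over precomputed cost function values.

    costs_by_size maps a block-size label (e.g. "4x4") to a dict of
    {prediction mode number: cost function value}. This layout is hypothetical.
    """
    # Step S43: the optimal mode for each block size.
    best_per_size = {
        size: min(costs.items(), key=lambda item: item[1])
        for size, costs in costs_by_size.items()
    }
    # Step S44: among those, the mode with the minimum cost function value.
    best_size = min(best_per_size, key=lambda size: best_per_size[size][1])
    mode, cost = best_per_size[best_size]
    return best_size, mode, cost
```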

決定された予測モードの情報は、モード判別部９１、最適シフト量決定部８３、および予測画像生成部８４に供給される。また、予測画像生成部８４には、予測モードに対応するコスト関数値も供給される。 Information on the determined prediction mode is supplied to the mode determination unit 91, the optimum shift amount determination unit 83, and the predicted image generation unit 84. The cost function value corresponding to the prediction mode is also supplied to the predicted image generation unit 84.

隣接画素内挿部７５および最適シフト量決定部８３は、ステップＳ４５において、隣接内挿処理を実行する。ステップＳ４５における隣接内挿処理の詳細は、図２１を参照して後述するが、この処理により、決定された最適イントラ予測モードに応じたシフト方向に、最適なシフト量が決定される。決定された最適なシフト量に関する情報は、予測画像生成部８４に供給される。 In step S45, the adjacent pixel interpolation unit 75 and the optimum shift amount determination unit 83 perform adjacent interpolation processing. The details of the adjacent interpolation process in step S45 will be described later with reference to FIG. 21. With this process, an optimal shift amount is determined in the shift direction corresponding to the determined optimal intra prediction mode. Information on the determined optimum shift amount is supplied to the predicted image generation unit 84.

ステップＳ４６において、予測画像生成部８４は、最適シフト量で位相がシフトされた隣接画素を用いて、予測画像を生成する。 In step S 46, the predicted image generation unit 84 generates a predicted image using adjacent pixels whose phase is shifted by the optimum shift amount.

すなわち、予測画像生成部８４は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ８１から読み出す。そして、予測画像生成部８４は、６タップのFIRフィルタおよび線形内挿により、予測モードに応じた位相方向に、読みだした隣接画素の位相を最適シフト量でシフトする。予測画像生成部８４は、位相がシフトされた隣接画素を用いて、最適モード決定部８２により決定された予測モードでイントラ予測を行い、対象ブロックの予測画像を生成し、生成された予測画像と、対応するコスト関数値を予測画像選択部７７に供給する。 That is, the predicted image generation unit 84 reads the adjacent pixels corresponding to the target block to be intra predicted from the adjacent image buffer 81. The predicted image generation unit 84 then shifts the phase of the read adjacent pixels by the optimal shift amount in the phase direction corresponding to the prediction mode, using the 6-tap FIR filter and linear interpolation. Using the phase-shifted adjacent pixels, the predicted image generation unit 84 performs intra prediction in the prediction mode determined by the optimal mode determination unit 82 to generate a predicted image of the target block, and supplies the generated predicted image and the corresponding cost function value to the predicted image selection unit 77.

なお、最適シフト量が０の場合には、隣接画像バッファ８１からの隣接画素の画素値が用いられる。 When the optimum shift amount is 0, the pixel value of the adjacent pixel from the adjacent image buffer 81 is used.
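The phase shift of the adjacent pixels can be sketched as follows: half-pixel samples come from the 6-tap FIR filter with the H.264/AVC taps (1, -5, 20, 20, -5, 1)/32, and quarter-pixel samples from linear interpolation between the nearest integer and half-pixel samples. The function names, the clamping of indices at the ends of the pixel row, the 8-bit clipping, and the use of quarter-pixel integer units (`shift_q` = -3..3 for -0.75..+0.75) are assumptions of this sketch.

```python
TAPS = (1, -5, 20, 20, -5, 1)  # 6-tap FIR taps; they sum to 32


def _px(p, i):
    # Assumption: out-of-range indices are clamped to the nearest available pixel.
    return p[min(max(i, 0), len(p) - 1)]


def half_pel(p, i):
    """Sample midway between p[i] and p[i + 1] using the 6-tap FIR filter."""
    acc = sum(t * _px(p, i - 2 + k) for k, t in enumerate(TAPS))
    return min(max((acc + 16) >> 5, 0), 255)  # round, divide by 32, clip to 8 bits


def shifted_sample(p, i, shift_q):
    """Sample at position i + shift_q / 4, for shift_q in -3..3."""
    base, frac = divmod(4 * i + shift_q, 4)  # floor division handles negative shifts
    if frac == 0:
        return _px(p, base)                  # shift amount 0: the integer pixel itself
    h = half_pel(p, base)
    if frac == 2:
        return h
    nearest = _px(p, base) if frac == 1 else _px(p, base + 1)
    return (nearest + h + 1) >> 1            # quarter pixel by linear interpolation


def shift_row(p, shift_q):
    """Shift every pixel of a row of adjacent pixels by the same amount."""
    return [shifted_sample(p, i, shift_q) for i in range(len(p))]
```

With `shift_q == 0` the function returns the stored integer pixel unchanged, which matches the case in which the pixel values from the adjacent image buffer are used directly.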

予測画像選択部７７により最適イントラ予測モードで生成された予測画像が選択された場合、予測画像生成部８４により、これらの最適イントラ予測モードを示す情報とシフト量の情報は、可逆符号化部６６に供給される。そして、可逆符号化部６６において、符号化され、圧縮画像のヘッダ情報に付加される（上述した図７のステップＳ２３）。 When the predicted image selection unit 77 selects the predicted image generated in the optimal intra prediction mode, the predicted image generation unit 84 supplies the information indicating the optimal intra prediction mode and the shift amount information to the lossless encoding unit 66. The lossless encoding unit 66 then encodes this information and adds it to the header information of the compressed image (step S23 in FIG. 7 described above).

なお、このシフト量の情報の符号化としては、決定された対象ブロックのシフト量と、図１３を参照して上述したMostProbableModeを与えるブロックにおけるシフト量との差分が符号化される。 As the encoding of the shift amount information, the difference between the determined shift amount of the target block and the shift amount in the block that gives the MostProbableMode described above with reference to FIG. 13 is encoded.

ただし、例えば、MostProbableModeがモード２（DC予測）であり、対象ブロックの予測モードがモード０（Vertical予測）である場合、MostProbableModeを与えるブロックにおける水平方向のシフト量は存在しないことになる。また、インタースライスにおけるイントラマクロブロックであるという事情によっても、MostProbableModeを与えるブロックにおける水平方向のシフト量は存在しないことになる。 However, for example, when MostProbableMode is mode 2 (DC prediction) and the prediction mode of the target block is mode 0 (Vertical prediction), there is no horizontal shift amount in the block giving MostProbableMode. Also, due to the fact that it is an intra macroblock in an inter slice, there is no horizontal shift amount in a block that gives MostProbableMode.

このような場合には、MostProbableModeを与えるブロックにおける水平方向のシフト量は０であるとして、差分符号化処理が行われる。 In such a case, the differential encoding process is performed assuming that the horizontal shift amount in the block to which MostProbableMode is given is zero.
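The differential encoding of the shift amount, including the fallback to 0 when the block giving MostProbableMode has no shift amount in the relevant direction, can be sketched as below. The function names and the representation of shift amounts in quarter-pixel integer units are assumptions; the entropy coding of the resulting difference is outside this sketch.

```python
def encode_shift_difference(target_shift_q, mpm_shift_q):
    """Difference between the target block's shift and the MostProbableMode block's shift.

    mpm_shift_q is None when that block has no shift amount in this direction
    (for example a DC-predicted block, or an inter-coded neighbour); the
    reference shift is then treated as 0.
    """
    reference = 0 if mpm_shift_q is None else mpm_shift_q
    return target_shift_q - reference


def decode_shift_difference(difference, mpm_shift_q):
    """Inverse operation on the decoding side, using the same fallback rule."""
    reference = 0 if mpm_shift_q is None else mpm_shift_q
    return reference + difference
```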

［隣接画素内挿処理の説明］ [Description of adjacent pixel interpolation processing]
次に、図２１のフローチャートを参照して、図２０のステップＳ４５の隣接画素内挿処理について説明する。図２１の例においては、対象ブロックが４×４の場合について説明される。 Next, the adjacent pixel interpolation processing in step S45 of FIG. 20 will be described with reference to the flowchart of FIG. 21. The example of FIG. 21 is described for the case in which the target block is 4 × 4.

最適モード決定部８２により決定された予測モードの情報は、モード判別部９１に供給される。モード判別部９１は、ステップＳ５１において、最適イントラ予測モードがDCモードであるか否かを判定する。ステップＳ５１において、最適イントラ予測モードがDCモードではないと判定された場合、処理は、ステップＳ５２に進む。 Information on the prediction mode determined by the optimal mode determination unit 82 is supplied to the mode determination unit 91. In step S51, the mode determination unit 91 determines whether or not the optimal intra prediction mode is the DC mode. If it is determined in step S51 that the optimal intra prediction mode is not the DC mode, the process proceeds to step S52.

ステップＳ５２において、モード判別部９１は、最適イントラ予測モードが、Vertical Prediction mode、Diagonal_Down_Left Prediction mode、またはVertical_Left Prediction modeであるか否かを判定する。 In step S52, the mode determination unit 91 determines whether or not the optimal intra prediction mode is Vertical Prediction mode, Diagonal_Down_Left Prediction mode, or Vertical_Left Prediction mode.

ステップＳ５２において、最適イントラ予測モードが、Vertical Prediction mode、Diagonal_Down_Left Prediction mode、またはVertical_Left Prediction modeであると判定された場合、処理は、ステップＳ５３に進む。 In Step S52, when it is determined that the optimal intra prediction mode is Vertical Prediction mode, Diagonal_Down_Left Prediction mode, or Vertical_Left Prediction mode, the process proceeds to Step S53.

ステップＳ５３において、モード判別部９１は、水平方向内挿部９２に制御信号を出力し、水平方向の内挿を行わせる。すなわち、水平方向内挿部９２は、モード判別部９１からの制御信号に応じて、隣接画像バッファ８１より上部隣接画素を読み出し、６タップのFIRフィルタおよび線形内挿により、読み出した上部隣接画素に対して、水平方向の位相をシフトする。水平方向内挿部９２は、内挿された上部隣接画素の情報を、最適シフト量決定部８３に供給する。 In step S53, the mode determination unit 91 outputs a control signal to the horizontal direction interpolation unit 92 to cause it to perform horizontal interpolation. That is, in accordance with the control signal from the mode determination unit 91, the horizontal direction interpolation unit 92 reads the upper adjacent pixels from the adjacent image buffer 81 and shifts the horizontal phase of the read upper adjacent pixels using the 6-tap FIR filter and linear interpolation. The horizontal direction interpolation unit 92 supplies the information on the interpolated upper adjacent pixels to the optimum shift amount determination unit 83.

ステップＳ５４において、最適シフト量決定部８３は、最適モード決定部８２により決定された予測モードについて、-0.75乃至+0.75のうち、上部隣接画素の最適シフト量を決定する。なお、この決定には、イントラ予測する対象ブロックの画像、隣接画像バッファ８１から読み出される上部隣接画素、および内挿された上部隣接画素の情報が用いられる。また、このとき、左部隣接画素についての最適シフト量は０とされる。決定された最適シフト量の情報は、予測画像生成部８４に供給される。 In step S54, the optimal shift amount determination unit 83 determines the optimal shift amount of the upper adjacent pixel from −0.75 to +0.75 for the prediction mode determined by the optimal mode determination unit 82. For this determination, information on the target block image to be intra-predicted, upper adjacent pixels read from the adjacent image buffer 81, and interpolated upper adjacent pixels are used. At this time, the optimum shift amount for the left adjacent pixel is set to zero. Information on the determined optimum shift amount is supplied to the predicted image generation unit 84.

ステップＳ５２において、最適イントラ予測モードが、Vertical Prediction mode、Diagonal_Down_Left Prediction mode、およびVertical_Left Prediction modeではないと判定された場合、処理は、ステップＳ５５に進む。 If it is determined in step S52 that the optimal intra prediction mode is not Vertical Prediction mode, Diagonal_Down_Left Prediction mode, or Vertical_Left Prediction mode, the process proceeds to step S55.

ステップＳ５５において、モード判別部９１は、最適イントラ予測モードがHorizontal Prediction mode、またはHorizontal_Up Prediction modeであるか否かを判定する。ステップＳ５５において、最適イントラ予測モードがHorizontal Prediction mode、またはHorizontal_Up Prediction modeであると判定された場合、処理は、ステップＳ５６に進む。 In step S55, the mode determination unit 91 determines whether or not the optimal intra prediction mode is the Horizontal Prediction mode or the Horizontal_Up Prediction mode. In Step S55, when it is determined that the optimal intra prediction mode is Horizontal Prediction mode or Horizontal_Up Prediction mode, the process proceeds to Step S56.

ステップＳ５６において、モード判別部９１は、垂直方向内挿部９３に制御信号を出力し、垂直方向の内挿を行わせる。すなわち、垂直方向内挿部９３は、モード判別部９１からの制御信号に応じて、隣接画像バッファ８１より左部隣接画素を読み出し、６タップのFIRフィルタおよび線形内挿により、読み出した左部隣接画素に対して垂直方向の位相をシフトする。垂直方向内挿部９３は、内挿された左部隣接画素の情報を、最適シフト量決定部８３に供給する。 In step S56, the mode determination unit 91 outputs a control signal to the vertical direction interpolation unit 93 to cause it to perform vertical interpolation. That is, in accordance with the control signal from the mode determination unit 91, the vertical direction interpolation unit 93 reads the left adjacent pixels from the adjacent image buffer 81 and shifts the vertical phase of the read left adjacent pixels using the 6-tap FIR filter and linear interpolation. The vertical direction interpolation unit 93 supplies the information on the interpolated left adjacent pixels to the optimum shift amount determination unit 83.

ステップＳ５７において、最適シフト量決定部８３は、最適モード決定部８２により決定された予測モードについて、-0.75乃至+0.75のうち、左部隣接画素の最適シフト量を決定する。なお、この決定には、イントラ予測する対象ブロックの画像、隣接画像バッファ８１から読み出される左部隣接画素、および内挿された左部隣接画素の情報が用いられる。また、このとき、上部隣接画素についての最適シフト量は０とされる。決定された最適シフト量の情報は、予測画像生成部８４に供給される。 In step S57, the optimal shift amount determination unit 83 determines the optimal shift amount of the left adjacent pixel from −0.75 to +0.75 for the prediction mode determined by the optimal mode determination unit 82. For this determination, information on the target block image to be intra-predicted, the left adjacent pixel read from the adjacent image buffer 81, and the interpolated left adjacent pixel are used. At this time, the optimum shift amount for the upper adjacent pixel is set to zero. Information on the determined optimum shift amount is supplied to the predicted image generation unit 84.

ステップＳ５５において、最適イントラ予測モードがHorizontal Prediction mode、およびHorizontal_Up Prediction modeではないと判定された場合、処理は、ステップＳ５８に進む。 In Step S55, when it is determined that the optimal intra prediction mode is not the Horizontal Prediction mode and the Horizontal_Up Prediction mode, the process proceeds to Step S58.

ステップＳ５８において、モード判別部９１は、水平方向内挿部９２に制御信号を出力し、水平方向の内挿を行わせ、垂直方向内挿部９３に制御信号を出力し、垂直方向の内挿を行わせる。 In step S58, the mode determination unit 91 outputs a control signal to the horizontal direction interpolation unit 92 to cause it to perform horizontal interpolation, and outputs a control signal to the vertical direction interpolation unit 93 to cause it to perform vertical interpolation.

すなわち、水平方向内挿部９２は、モード判別部９１からの制御信号に応じて、隣接画像バッファ８１より上部隣接画素を読み出し、６タップのFIRフィルタおよび線形内挿により、読み出した上部隣接画素に対して水平方向の位相をシフトする。水平方向内挿部９２は、内挿された上部隣接画素の情報を、最適シフト量決定部８３に供給する。 That is, in accordance with the control signal from the mode determination unit 91, the horizontal direction interpolation unit 92 reads the upper adjacent pixels from the adjacent image buffer 81 and shifts the horizontal phase of the read upper adjacent pixels using the 6-tap FIR filter and linear interpolation. The horizontal direction interpolation unit 92 supplies the information on the interpolated upper adjacent pixels to the optimum shift amount determination unit 83.

また、垂直方向内挿部９３は、モード判別部９１からの制御信号に応じて、隣接画像バッファ８１より左部隣接画素を読み出し、６タップのFIRフィルタおよび線形内挿により、読み出した左部隣接画素に対して垂直方向の位相をシフトする。垂直方向内挿部９３は、内挿された左部隣接画素の情報を、最適シフト量決定部８３に供給する。 Likewise, in accordance with the control signal from the mode determination unit 91, the vertical direction interpolation unit 93 reads the left adjacent pixels from the adjacent image buffer 81 and shifts the vertical phase of the read left adjacent pixels using the 6-tap FIR filter and linear interpolation. The vertical direction interpolation unit 93 supplies the information on the interpolated left adjacent pixels to the optimum shift amount determination unit 83.

ステップＳ５９において、最適シフト量決定部８３は、最適モード決定部８２により決定された予測モードについて、-0.75乃至+0.75のうち、上部および左部隣接画素の最適シフト量を決定する。この決定には、イントラ予測する対象ブロックの画像、隣接画像バッファ８１から読み出される上部および左部隣接画素、並びに内挿された上部および左部隣接画素の情報が用いられる。決定された最適シフト量の情報は、予測画像生成部８４に供給される。 In step S59, the optimal shift amount determination unit 83 determines the optimal shift amounts of the upper and left adjacent pixels from −0.75 to +0.75 for the prediction mode determined by the optimal mode determination unit 82. This determination uses the image of the target block to be intra-predicted, the upper and left adjacent pixels read from the adjacent image buffer 81, and the interpolated upper and left adjacent pixels. Information on the determined optimum shift amount is supplied to the predicted image generation unit 84.

一方、ステップＳ５１において、最適イントラ予測モードがDCモードであると判定された場合、隣接画素内挿処理は終了される。すなわち、水平方向内挿部９２および垂直方向内挿部９３は動作せず、最適シフト量決定部８３においては、シフト量０が最適シフト量に決定される。 On the other hand, when it is determined in step S51 that the optimal intra prediction mode is the DC mode, the adjacent pixel interpolation processing ends. That is, the horizontal direction interpolation unit 92 and the vertical direction interpolation unit 93 do not operate, and the optimum shift amount determination unit 83 determines a shift amount of 0 as the optimal shift amount.
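The determination of the optimal shift amount for the upper adjacent pixels (step S54) can be sketched for the Vertical Prediction mode of a 4 × 4 block as follows. This is a sketch under stated assumptions: the sum of absolute differences is used as the selection criterion, shift amounts are expressed in quarter-pixel integer units, and a plain linear interpolator (`linear_sample`) stands in for the 6-tap FIR filter so that the example stays short. All names are hypothetical.

```python
def linear_sample(p, i, shift_q):
    """Stand-in interpolator: value at position i + shift_q / 4, edges clamped."""
    base, frac = divmod(4 * i + shift_q, 4)
    a = p[min(max(base, 0), len(p) - 1)]
    b = p[min(max(base + 1, 0), len(p) - 1)]
    return (a * (4 - frac) + b * frac + 2) // 4


def best_upper_shift(block, upper, first_col, sample=linear_sample):
    """Search shifts -0.75..+0.75 (shift_q = -3..3) for Vertical prediction.

    block: 4 rows of 4 pixels of the target block.
    upper: row of upper adjacent pixels; upper[first_col] lies above column 0.
    Returns (best shift_q, its sum of absolute differences).
    """
    best = None
    for shift_q in range(-3, 4):
        # Vertical prediction copies the (shifted) upper pixel down each column.
        pred_row = [sample(upper, first_col + x, shift_q) for x in range(4)]
        sad = sum(abs(row[x] - pred_row[x]) for row in block for x in range(4))
        if best is None or sad < best[1]:
            best = (shift_q, sad)
    return best
```

The same search applies to the left adjacent pixels in the vertical direction, and for the DC mode the search is skipped and the shift amount is 0.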

［インター動き予測処理の説明］ [Description of inter motion prediction processing]
次に、図２２のフローチャートを参照して、図８のステップＳ３２のインター動き予測処理について説明する。 Next, the inter motion prediction processing in step S32 of FIG. 8 will be described with reference to the flowchart of FIG. 22.

動き予測・補償部７６は、ステップＳ６１において、１６×１６画素乃至４×４画素からなる８種類の各インター予測モードに対して動きベクトルと参照画像をそれぞれ決定する。すなわち、各インター予測モードの処理対象のブロックについて、動きベクトルと参照画像がそれぞれ決定される。 In step S61, the motion prediction / compensation unit 76 determines a motion vector and a reference image for each of eight types of inter prediction modes including 16 × 16 pixels to 4 × 4 pixels. That is, a motion vector and a reference image are determined for each block to be processed in each inter prediction mode.

動き予測・補償部７６は、ステップＳ６２において、１６×１６画素乃至４×４画素からなる８種類の各インター予測モードについて、ステップＳ６１で決定された動きベクトルに基づいて、参照画像に動き予測と補償処理を行う。この動き予測と補償処理により、各インター予測モードでの予測画像が生成される。 In step S62, the motion prediction / compensation unit 76 performs motion prediction on the reference image based on the motion vector determined in step S61 for each of the eight types of inter prediction modes including 16 × 16 pixels to 4 × 4 pixels. Perform compensation processing. By this motion prediction and compensation processing, a prediction image in each inter prediction mode is generated.

動き予測・補償部７６は、ステップＳ６３において、１６×１６画素乃至４×４画素からなる８種類の各インター予測モードに対して決定された動きベクトルについて、圧縮画像に付加するための動きベクトル情報を生成する。このとき、図５を参照して上述した動きベクトルの生成方法が用いられる。 In step S63, the motion prediction / compensation unit 76 adds motion vector information for adding to the compressed image the motion vectors determined for each of the eight types of inter prediction modes including 16 × 16 pixels to 4 × 4 pixels. Is generated. At this time, the motion vector generation method described above with reference to FIG. 5 is used.

生成された動きベクトル情報は、次のステップＳ６４におけるコスト関数値算出の際にも用いられ、最終的に予測画像選択部７７により対応する予測画像が選択された場合には、予測モード情報および参照フレーム情報とともに、可逆符号化部６６へ出力される。 The generated motion vector information is also used when calculating the cost function values in the next step S64, and when the corresponding predicted image is ultimately selected by the predicted image selection unit 77, it is output to the lossless encoding unit 66 together with the prediction mode information and the reference frame information.

動き予測・補償部７６は、ステップＳ６４において、１６×１６画素乃至４×４画素からなる８種類の各インター予測モードに対して、上述した式（３４）または式（３５）で示されるコスト関数値を算出する。ここで算出されたコスト関数値は、上述した図８のステップＳ３４で最適インター予測モードを決定する際に用いられる。 In step S64, the motion prediction / compensation unit 76 performs the cost function represented by the above-described equation (34) or equation (35) for each of the eight types of inter prediction modes including 16 × 16 pixels to 4 × 4 pixels. Calculate the value. The cost function value calculated here is used when determining the optimal inter prediction mode in step S34 of FIG. 8 described above.

なお、本発明における動作原理は、図１８、または図２０および図２１を参照して上述した動作に限らない。例えば、すべてのイントラ予測モードに対して、候補となるすべてのシフト量の予測値を算出し、その残差を算出して、最適イントラ予測モードおよび最適シフト量を決定するようにすることもできる。この動作を行う場合のイントラ予測部および隣接画素内挿部の構成例は、図２３に示される。 The operating principle of the present invention is not limited to the operations described above with reference to FIG. 18, or FIG. 20 and FIG. 21. For example, prediction values may be calculated for all candidate shift amounts in all intra prediction modes, and their residuals calculated, in order to determine the optimal intra prediction mode and the optimal shift amount. A configuration example of the intra prediction unit and the adjacent pixel interpolation unit for this operation is shown in FIG. 23.

［イントラ予測部および隣接画素内挿部の他の構成例］ [Other configuration example of the intra prediction unit and the adjacent pixel interpolation unit]
図２３は、イントラ予測部および隣接画素内挿部の他の構成例を示すブロック図である。 FIG. 23 is a block diagram illustrating another configuration example of the intra prediction unit and the adjacent pixel interpolation unit.

図２３の例の場合、イントラ予測部７４は、隣接画像バッファ１０１、最適モード／最適シフト量決定部１０２、および予測画像生成部１０３により構成されている。 In the case of the example in FIG. 23, the intra prediction unit 74 includes an adjacent image buffer 101, an optimal mode / optimum shift amount determination unit 102, and a predicted image generation unit 103.

隣接画素内挿部７５は、水平方向内挿部１１１、および垂直方向内挿部１１２により構成されている。 The adjacent pixel interpolation unit 75 includes a horizontal direction interpolation unit 111 and a vertical direction interpolation unit 112.

隣接画像バッファ１０１は、フレームメモリ７２からのイントラ予測の対象ブロックの隣接画素を蓄積する。図２３の場合も、スイッチ７３の図示は省略されているが、実際には、隣接画素は、フレームメモリ７２から、スイッチ７３を介して隣接画像バッファ１０１に供給される。 The adjacent image buffer 101 accumulates adjacent pixels of the target block for intra prediction from the frame memory 72. Also in FIG. 23, the switch 73 is not shown, but actually, the adjacent pixels are supplied from the frame memory 72 to the adjacent image buffer 101 via the switch 73.

最適モード／最適シフト量決定部１０２には、イントラ予測する対象ブロックの画素が画面並べ替えバッファ６２から入力される。最適モード／最適シフト量決定部１０２は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ１０１から読み出す。 The optimal mode / optimum shift amount determination unit 102 receives the pixel of the target block for intra prediction from the screen rearrangement buffer 62. The optimum mode / optimum shift amount determination unit 102 reads adjacent pixels corresponding to the target block to be intra predicted from the adjacent image buffer 101.

最適モード／最適シフト量決定部１０２は、候補のイントラ予測モード（以下、候補モードと称する）の情報を、水平方向内挿部１１１および垂直方向内挿部１１２に供給する。最適モード／最適シフト量決定部１０２には、候補モードに応じて内挿された隣接画素の情報が水平方向内挿部１１１および垂直方向内挿部１１２から入力される。 The optimum mode / optimum shift amount determination unit 102 supplies information on candidate intra prediction modes (hereinafter referred to as candidate modes) to the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112. Information of adjacent pixels interpolated according to the candidate mode is input from the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 to the optimal mode / optimum shift amount determination unit 102.

最適モード／最適シフト量決定部１０２は、イントラ予測する対象ブロックの画素、対応する隣接画素、および内挿された隣接画素の画素値を用いて、すべての候補モードおよびすべての候補シフト量に対して、イントラ予測を行い、予測画像を生成する。そして、最適モード／最適シフト量決定部１０２は、コスト関数値や予測残差などを算出して、すべての候補モードおよびすべてのシフト量の中から、最適なイントラ予測モードおよび最適なシフト量を決定する。決定された予測モードおよびシフト量の情報は、予測画像生成部１０３に供給される。なお、このとき、予測モードに対応するコスト関数値も予測画像生成部１０３に供給される。 The optimum mode / optimum shift amount determination unit 102 performs intra prediction for all candidate modes and all candidate shift amounts, using the pixels of the target block to be intra predicted, the corresponding adjacent pixels, and the pixel values of the interpolated adjacent pixels, and generates predicted images. The optimum mode / optimum shift amount determination unit 102 then calculates cost function values, prediction residuals, and the like, and determines the optimal intra prediction mode and the optimal shift amount from among all candidate modes and all shift amounts. Information on the determined prediction mode and shift amount is supplied to the predicted image generation unit 103. At this time, the cost function value corresponding to the prediction mode is also supplied to the predicted image generation unit 103.

予測画像生成部１０３は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ１０１から読み出し、６タップのFIRフィルタおよび線形内挿により、予測モードに応じた位相方向に、読みだした隣接画素の位相を最適シフト量でシフトする。 The predicted image generation unit 103 reads the adjacent pixels corresponding to the target block to be intra predicted from the adjacent image buffer 101, and shifts the phase of the read adjacent pixels by the optimal shift amount in the phase direction corresponding to the prediction mode, using the 6-tap FIR filter and linear interpolation.

予測画像生成部１０３は、位相がシフトされた隣接画素を用いて、最適モード／最適シフト量決定部１０２により決定された最適イントラ予測モードでイントラ予測を行い、対象ブロックの予測画像を生成する。予測画像生成部１０３は、生成された予測画像と、対応するコスト関数値を予測画像選択部７７に出力する。 The prediction image generation unit 103 performs intra prediction in the optimal intra prediction mode determined by the optimal mode / optimum shift amount determination unit 102 using adjacent pixels whose phases are shifted, and generates a prediction image of the target block. The predicted image generation unit 103 outputs the generated predicted image and the corresponding cost function value to the predicted image selection unit 77.

また、予測画像生成部１０３は、予測画像選択部７７により最適イントラ予測モードで生成された予測画像が選択された場合、最適イントラ予測モードを示す情報とシフト量の情報を、可逆符号化部６６に供給する。 In addition, when the predicted image selection unit 77 selects the predicted image generated in the optimal intra prediction mode, the predicted image generation unit 103 supplies the information indicating the optimal intra prediction mode and the shift amount information to the lossless encoding unit 66.

水平方向内挿部１１１および垂直方向内挿部１１２は、最適モード／最適シフト量決定部１０２からの候補モードに応じて、隣接画像バッファ１０１より隣接画素をそれぞれ読み出す。水平方向内挿部１１１および垂直方向内挿部１１２は、６タップのFIRフィルタおよび線形内挿により、読み出した隣接画素に対して、水平方向および垂直方向に、それぞれ、位相をシフトする。 The horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 read adjacent pixels from the adjacent image buffer 101 according to the candidate mode from the optimal mode / optimum shift amount determination unit 102, respectively. The horizontal direction interpolating unit 111 and the vertical direction interpolating unit 112 shift the phase in the horizontal direction and the vertical direction, respectively, with respect to the read adjacent pixels by a 6-tap FIR filter and linear interpolation.

［イントラ予測処理の他の説明］ [Another description of intra prediction processing]
次に、図２４のフローチャートを参照して、図２３のイントラ予測部７４および隣接画素内挿部７５が行うイントラ予測処理を説明する。なお、このイントラ予測処理は、図８のステップＳ３１におけるイントラ予測処理の他の例である。 Next, the intra prediction processing performed by the intra prediction unit 74 and the adjacent pixel interpolation unit 75 of FIG. 23 will be described with reference to the flowchart of FIG. 24. This intra prediction processing is another example of the intra prediction processing in step S31 of FIG. 8.

最適モード／最適シフト量決定部１０２は、候補のイントラ予測モードの情報を、水平方向内挿部１１１および垂直方向内挿部１１２に供給する。 The optimum mode / optimum shift amount determination unit 102 supplies information on candidate intra prediction modes to the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112.

ステップＳ１０１において、水平方向内挿部１１１および垂直方向内挿部１１２は、すべての候補のイントラ予測モードに対して、隣接画素内挿処理を実行する。すなわち、ステップＳ１０１においては、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対して、隣接画素内挿処理がそれぞれ実行される。 In step S101, the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 perform adjacent pixel interpolation processing on all candidate intra prediction modes. That is, in step S101, adjacent pixel interpolation processing is executed for each intra prediction mode of 4 × 4 pixels, 8 × 8 pixels, and 16 × 16 pixels.

ステップＳ１０１における隣接内挿処理の詳細は、図２５を参照して後述するが、この処理により、各イントラ予測モードに応じたシフト方向に内挿された隣接画素の情報が、最適モード／最適シフト量決定部１０２に供給される。 The details of the adjacent pixel interpolation process in step S101 will be described later with reference to FIG. 25. Through this process, information on the adjacent pixels interpolated in the shift direction corresponding to each intra prediction mode is supplied to the optimum mode / optimum shift amount determination unit 102.

ステップＳ１０２において、最適モード／最適シフト量決定部１０２は、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードと各シフト量に対してイントラ予測を行う。 In step S102, the optimum mode / optimum shift amount determination unit 102 performs intra prediction for each intra prediction mode and each shift amount of 4 × 4 pixels, 8 × 8 pixels, and 16 × 16 pixels.

すなわち、最適モード／最適シフト量決定部１０２は、イントラ予測する対象ブロックの画素、対応する隣接画素、および内挿された隣接画素の画素値を用いて、すべてのイントラ予測モードおよびすべての候補シフト量に対して、イントラ予測を行う。この結果、すべてのイントラ予測モードおよびすべての候補シフト量に対して、予測画像が生成される。 That is, the optimum mode / optimum shift amount determination unit 102 performs intra prediction for all intra prediction modes and all candidate shift amounts, using the pixels of the target block to be intra-predicted, the corresponding adjacent pixels, and the pixel values of the interpolated adjacent pixels. As a result, predicted images are generated for all intra prediction modes and all candidate shift amounts.

ステップＳ１０３において、最適モード／最適シフト量決定部１０２は、予測画像を生成した４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードと各シフト量に対して、上述した式（３４）または式（３５）のコスト関数値を算出する。 In step S103, the optimum mode / optimum shift amount determination unit 102 calculates the cost function value of Expression (34) or Expression (35) described above for each intra prediction mode and each shift amount of 4 × 4 pixels, 8 × 8 pixels, and 16 × 16 pixels for which a predicted image has been generated.

ステップＳ１０４において、最適モード／最適シフト量決定部１０２は、算出したコスト関数値を比較することで、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対して、それぞれ最適モードおよび最適シフト量を決定する。 In step S104, the optimum mode / optimum shift amount determination unit 102 compares the calculated cost function values and thereby determines the optimum mode and the optimum shift amount for each of the 4 × 4 pixel, 8 × 8 pixel, and 16 × 16 pixel intra prediction modes.

ステップＳ１０５において、最適モード／最適シフト量決定部１０２は、ステップＳ１０４において決定された各最適モードと最適シフト量の中から、ステップＳ１０３において算出されたコスト関数値に基づいて、最適イントラ予測モードと最適シフト量を選択する。すなわち、４×４画素、８×８画素、および１６×１６画素の各イントラ予測モードに対して決定された各最適モードと最適シフト量の中から、最適イントラ予測モードと最適シフト量が選択される。選択された予測モードおよびシフト量の情報は、対応するコスト関数値とともに、予測画像生成部１０３に供給される。 In step S105, the optimum mode / optimum shift amount determination unit 102 selects the optimal intra prediction mode and the optimal shift amount from among the optimum modes and optimum shift amounts determined in step S104, based on the cost function values calculated in step S103. That is, the optimal intra prediction mode and the optimal shift amount are selected from among the optimum modes and optimum shift amounts determined for the 4 × 4 pixel, 8 × 8 pixel, and 16 × 16 pixel intra prediction modes. The information on the selected prediction mode and shift amount is supplied to the predicted image generation unit 103 together with the corresponding cost function value.
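
The two-stage selection in steps S104 and S105 can be sketched as follows. This is only an illustration under stated assumptions: the cost values are taken as given inputs (Expressions (34) and (35) are not reproduced here), and the function name choose_optimal and the dictionary layout are hypothetical, not part of the described device.

```python
def choose_optimal(costs):
    """Pick the best (block size, mode, shift) from precomputed cost values.

    costs maps (block_size, mode, shift) -> cost function value.
    The layout of this dictionary is an assumption made for illustration.
    """
    # Stage 1 (step S104): best mode and shift for each block size.
    best_per_size = {}
    for (size, mode, shift), cost in costs.items():
        if size not in best_per_size or cost < best_per_size[size][2]:
            best_per_size[size] = (mode, shift, cost)

    # Stage 2 (step S105): best block size among the per-size winners.
    best_size = min(best_per_size, key=lambda s: best_per_size[s][2])
    mode, shift, cost = best_per_size[best_size]
    return best_size, mode, shift, cost


# Shifts are written in pixels; the cost numbers are made up for the example.
example_costs = {
    ("4x4", 0, 0.0): 120, ("4x4", 0, 0.25): 95, ("4x4", 1, -0.5): 110,
    ("8x8", 0, 0.0): 101, ("8x8", 2, 0.0): 99,
    ("16x16", 1, 0.75): 130,
}
result = choose_optimal(example_costs)
```

With the made-up costs above, the winner is the 4 × 4 block size with mode 0 and a shift of 0.25, because its cost of 95 is the smallest of the three per-size winners.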

ステップＳ１０６において、予測画像生成部１０３は、最適シフト量で位相がシフトされた隣接画素を用いて、予測画像を生成する。 In step S106, the predicted image generation unit 103 generates a predicted image using the adjacent pixels whose phase has been shifted by the optimal shift amount.

すなわち、予測画像生成部１０３は、イントラ予測する対象ブロックに対応する隣接画素を隣接画像バッファ１０１から読み出す。そして、予測画像生成部１０３は、６タップのFIRフィルタおよび線形内挿により、決定された予測モードに応じた位相方向に、読みだした隣接画素の位相を最適シフト量でシフトする。 That is, the predicted image generation unit 103 reads adjacent pixels corresponding to the target block to be intra-predicted from the adjacent image buffer 101. Then, the predicted image generation unit 103 shifts the phase of the read adjacent pixel by the optimal shift amount in the phase direction corresponding to the determined prediction mode by the 6-tap FIR filter and linear interpolation.

予測画像生成部１０３は、位相がシフトされた隣接画素を用いて、最適モード／最適シフト量決定部１０２により決定された予測モードでイントラ予測を行い、対象ブロックの予測画像を生成する。生成された予測画像は、対応するコスト関数値とともに予測画像選択部７７に供給される。 The prediction image generation unit 103 performs intra prediction in the prediction mode determined by the optimal mode / optimum shift amount determination unit 102 using adjacent pixels whose phases are shifted, and generates a prediction image of the target block. The generated predicted image is supplied to the predicted image selection unit 77 together with the corresponding cost function value.

［隣接画素内挿処理の説明］ [Description of Adjacent Pixel Interpolation Processing]
次に、図２５のフローチャートを参照して、図２４のステップＳ１０１の隣接画素内挿処理について説明する。なお、この隣接画素内挿処理は、候補のイントラ予測モード毎に行われる処理である。また、図２５のステップＳ１１１乃至Ｓ１１６は、図２１のステップＳ５１乃至Ｓ５３、Ｓ５５、Ｓ５６、およびＳ５８と同様の処理を行うので、その詳細な説明は適宜省略する。 Next, the adjacent pixel interpolation process in step S101 of FIG. 24 will be described with reference to the flowchart in FIG. 25. This adjacent pixel interpolation process is performed for each candidate intra prediction mode. Steps S111 to S116 in FIG. 25 perform the same processing as steps S51 to S53, S55, S56, and S58 in FIG. 21, so their detailed description is omitted as appropriate.

最適モード／最適シフト量決定部１０２から候補のイントラ予測モードの情報は、水平方向内挿部１１１および垂直方向内挿部１１２に供給される。水平方向内挿部１１１および垂直方向内挿部１１２は、ステップＳ１１１において、候補のイントラ予測モードがDCモードであるか否かを判定する。ステップＳ１１１において、候補のイントラ予測モードがDCモードではないと判定された場合、処理は、ステップＳ１１２に進む。 Information on candidate intra prediction modes from the optimum mode / optimum shift amount determination unit 102 is supplied to the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112. In step S111, the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 determine whether the candidate intra prediction mode is the DC mode. If it is determined in step S111 that the candidate intra prediction mode is not the DC mode, the process proceeds to step S112.

ステップＳ１１２において、水平方向内挿部１１１および垂直方向内挿部１１２は、候補のイントラ予測モードが、Vertical Prediction mode、Diagonal_Down_Left Prediction mode、またはVertical_Left Prediction modeであるか否かを判定する。 In step S112, the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 determine whether the candidate intra prediction mode is Vertical Prediction mode, Diagonal_Down_Left Prediction mode, or Vertical_Left Prediction mode.

ステップＳ１１２において、候補のイントラ予測モードが、Vertical Prediction mode、Diagonal_Down_Left Prediction mode、またはVertical_Left Prediction modeであると判定された場合、処理は、ステップＳ１１３に進む。 If it is determined in step S112 that the candidate intra prediction mode is Vertical Prediction mode, Diagonal_Down_Left Prediction mode, or Vertical_Left Prediction mode, the process proceeds to step S113.

ステップＳ１１３において、水平方向内挿部１１１は、候補のイントラ予測モードに応じて、水平方向の内挿を行う。水平方向内挿部１１１は、内挿された上部隣接画素の情報を、最適モード／最適シフト量決定部１０２に供給する。このとき、垂直方向内挿部１１２は、垂直方向の内挿処理を行わない。 In step S113, the horizontal interpolation unit 111 performs horizontal interpolation according to the candidate intra prediction mode. The horizontal direction interpolation unit 111 supplies the information on the interpolated upper adjacent pixel to the optimum mode / optimum shift amount determination unit 102. At this time, the vertical direction interpolation unit 112 does not perform vertical direction interpolation processing.

ステップＳ１１２において、候補のイントラ予測モードが、Vertical Prediction mode、Diagonal_Down_Left Prediction mode、およびVertical_Left Prediction modeではないと判定された場合、処理は、ステップＳ１１４に進む。 If it is determined in step S112 that the candidate intra prediction mode is not Vertical Prediction mode, Diagonal_Down_Left Prediction mode, or Vertical_Left Prediction mode, the process proceeds to step S114.

ステップＳ１１４において、水平方向内挿部１１１および垂直方向内挿部１１２は、候補のイントラ予測モードがHorizontal Prediction mode、またはHorizontal_Up Prediction modeであるか否かを判定する。ステップＳ１１４において、候補のイントラ予測モードがHorizontal Prediction mode、またはHorizontal_Up Prediction modeであると判定された場合、処理は、ステップＳ１１５に進む。 In step S114, the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 determine whether the candidate intra prediction mode is the Horizontal Prediction mode or the Horizontal_Up Prediction mode. If it is determined in step S114 that the candidate intra prediction mode is the Horizontal Prediction mode or Horizontal_Up Prediction mode, the process proceeds to Step S115.

ステップＳ１１５において、垂直方向内挿部１１２は、候補のイントラ予測モードに応じて、垂直方向の内挿を行う。垂直方向内挿部１１２は、内挿された左部隣接画素の情報を、最適モード／最適シフト量決定部１０２に供給する。このとき、水平方向内挿部１１１は、水平方向の内挿を行なわない。 In step S115, the vertical direction interpolation unit 112 performs vertical direction interpolation in accordance with the candidate intra prediction mode. The vertical direction interpolation unit 112 supplies the interpolated information on the left adjacent pixel to the optimum mode / optimum shift amount determination unit 102. At this time, the horizontal interpolation unit 111 does not perform horizontal interpolation.

ステップＳ１１４において、候補のイントラ予測モードがHorizontal Prediction mode、およびHorizontal_Up Prediction modeではないと判定された場合、処理は、ステップＳ１１６に進む。 If it is determined in step S114 that the candidate intra prediction mode is not the Horizontal Prediction mode and Horizontal_Up Prediction mode, the process proceeds to step S116.

ステップＳ１１６において、水平方向内挿部１１１および垂直方向内挿部１１２は、候補のイントラ予測モードに応じて、水平方向の内挿、および垂直方向の内挿をそれぞれ行う。水平方向内挿部１１１および垂直方向内挿部１１２は、内挿された上部隣接画素および左部隣接画素の情報を、最適モード／最適シフト量決定部１０２にそれぞれ供給する。 In step S116, the horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 perform horizontal direction interpolation and vertical direction interpolation, respectively, according to the candidate intra prediction modes. The horizontal direction interpolation unit 111 and the vertical direction interpolation unit 112 supply information on the interpolated upper adjacent pixel and left adjacent pixel to the optimum mode / optimum shift amount determination unit 102, respectively.
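
The branching of steps S111 to S116 amounts to a mapping from the candidate prediction mode to the directions in which adjacent pixels are interpolated. A minimal sketch follows, assuming the H.264/AVC 4 × 4 intra mode numbering; the names Intra4x4Mode and interpolation_directions are hypothetical, and the DC case is assumed to skip interpolation entirely.

```python
from enum import IntEnum


class Intra4x4Mode(IntEnum):
    """Intra 4x4 prediction modes, numbered as in H.264/AVC."""
    VERTICAL = 0
    HORIZONTAL = 1
    DC = 2
    DIAGONAL_DOWN_LEFT = 3
    DIAGONAL_DOWN_RIGHT = 4
    VERTICAL_RIGHT = 5
    HORIZONTAL_DOWN = 6
    VERTICAL_LEFT = 7
    HORIZONTAL_UP = 8


# Modes that use only the upper adjacent pixels (step S112 -> S113).
HORIZONTAL_ONLY = {
    Intra4x4Mode.VERTICAL,
    Intra4x4Mode.DIAGONAL_DOWN_LEFT,
    Intra4x4Mode.VERTICAL_LEFT,
}
# Modes that use only the left adjacent pixels (step S114 -> S115).
VERTICAL_ONLY = {Intra4x4Mode.HORIZONTAL, Intra4x4Mode.HORIZONTAL_UP}


def interpolation_directions(mode):
    """Return (interpolate_horizontally, interpolate_vertically) for a mode."""
    mode = Intra4x4Mode(mode)
    if mode is Intra4x4Mode.DC:       # step S111: DC mode, no phase shift
        return (False, False)
    if mode in HORIZONTAL_ONLY:       # step S113
        return (True, False)
    if mode in VERTICAL_ONLY:         # step S115
        return (False, True)
    return (True, True)               # step S116: all remaining modes
```

For example, interpolation_directions(7) yields (True, False) for Vertical_Left prediction, while the diagonal mode 4 yields (True, True).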

符号化された圧縮画像は、所定の伝送路を介して伝送され、画像復号装置により復号される。 The encoded compressed image is transmitted via a predetermined transmission path and decoded by the image decoding device.

［画像復号装置の構成例］ [Configuration Example of Image Decoding Device]
図２６は、本発明を適用した画像処理装置としての画像復号装置の一実施の形態の構成を表している。 FIG. 26 shows the configuration of an embodiment of an image decoding device as an image processing apparatus to which the present invention is applied.

画像復号装置１５１は、蓄積バッファ１６１、可逆復号部１６２、逆量子化部１６３、逆直交変換部１６４、演算部１６５、デブロックフィルタ１６６、画面並べ替えバッファ１６７、Ｄ／Ａ変換部１６８、フレームメモリ１６９、スイッチ１７０、イントラ予測部１７１、隣接画素内挿部１７２、動き予測・補償部１７３、およびスイッチ１７４により構成されている。 The image decoding device 151 includes an accumulation buffer 161, a lossless decoding unit 162, an inverse quantization unit 163, an inverse orthogonal transform unit 164, a calculation unit 165, a deblocking filter 166, a screen rearrangement buffer 167, a D / A conversion unit 168, a frame memory 169, a switch 170, an intra prediction unit 171, an adjacent pixel interpolation unit 172, a motion prediction / compensation unit 173, and a switch 174.

蓄積バッファ１６１は伝送されてきた圧縮画像を蓄積する。可逆復号部１６２は、蓄積バッファ１６１より供給された、図２の可逆符号化部６６により符号化された情報を、可逆符号化部６６の符号化方式に対応する方式で復号する。逆量子化部１６３は可逆復号部１６２により復号された画像を、図２の量子化部６５の量子化方式に対応する方式で逆量子化する。逆直交変換部１６４は、図２の直交変換部６４の直交変換方式に対応する方式で逆量子化部１６３の出力を逆直交変換する。 The accumulation buffer 161 accumulates the transmitted compressed image. The lossless decoding unit 162 decodes the information supplied from the accumulation buffer 161 and encoded by the lossless encoding unit 66 in FIG. 2 by a method corresponding to the encoding method of the lossless encoding unit 66. The inverse quantization unit 163 inversely quantizes the image decoded by the lossless decoding unit 162 by a method corresponding to the quantization method of the quantization unit 65 in FIG. The inverse orthogonal transform unit 164 performs inverse orthogonal transform on the output of the inverse quantization unit 163 by a method corresponding to the orthogonal transform method of the orthogonal transform unit 64 in FIG.

逆直交変換された出力は演算部１６５によりスイッチ１７４から供給される予測画像と加算されて復号される。デブロックフィルタ１６６は、復号された画像のブロック歪を除去した後、フレームメモリ１６９に供給し、蓄積させるとともに、画面並べ替えバッファ１６７に出力する。 The inverse orthogonal transformed output is added to the predicted image supplied from the switch 174 by the arithmetic unit 165 and decoded. The deblocking filter 166 removes block distortion of the decoded image, and then supplies the frame to the frame memory 169 to store it, and outputs it to the screen rearrangement buffer 167.

画面並べ替えバッファ１６７は、画像の並べ替えを行う。すなわち、図２の画面並べ替えバッファ６２により符号化の順番のために並べ替えられたフレームの順番が、元の表示の順番に並べ替えられる。Ｄ／Ａ変換部１６８は、画面並べ替えバッファ１６７から供給された画像をＤ／Ａ変換し、図示せぬディスプレイに出力し、表示させる。 The screen rearrangement buffer 167 rearranges images. That is, the order of frames rearranged for the encoding order by the screen rearrangement buffer 62 in FIG. 2 is rearranged in the original display order. The D / A conversion unit 168 D / A converts the image supplied from the screen rearrangement buffer 167, and outputs and displays the image on a display (not shown).

スイッチ１７０は、インター処理される画像と参照される画像をフレームメモリ１６９から読み出し、動き予測・補償部１７３に出力するとともに、イントラ予測に用いられる画像をフレームメモリ１６９から読み出し、イントラ予測部１７１に供給する。 The switch 170 reads the image to be inter-processed and the image to be referenced from the frame memory 169 and outputs them to the motion prediction / compensation unit 173. It also reads the image used for intra prediction from the frame memory 169 and supplies it to the intra prediction unit 171.

イントラ予測部１７１には、ヘッダ情報を復号して得られたイントラ予測モードを示す情報と、隣接画素のシフト量の情報が、可逆復号部１６２から供給される。イントラ予測部１７１は、これらの情報を、隣接画素内挿部１７２にも供給する。 The intra prediction unit 171 is supplied with information indicating the intra prediction mode obtained by decoding the header information and information on the shift amount of adjacent pixels from the lossless decoding unit 162. The intra prediction unit 171 also supplies this information to the adjacent pixel interpolation unit 172.

イントラ予測部１７１は、これらの情報に基づき、必要に応じて、隣接画素内挿部１７２に隣接画素の位相をシフトさせ、隣接画素または位相がシフトされた隣接画素を用いて、予測画像を生成し、生成した予測画像を、スイッチ１７４に出力する。 Based on this information, the intra prediction unit 171 causes the adjacent pixel interpolation unit 172 to shift the phase of the adjacent pixels as necessary, generates a predicted image using the adjacent pixels or the phase-shifted adjacent pixels, and outputs the generated predicted image to the switch 174.

隣接画素内挿部１７２は、イントラ予測部１７１から供給されるイントラ予測モードに応じたシフト方向に、イントラ予測部１７１から供給されるシフト量で、隣接画素の位相をシフトさせる。実際には、隣接画素内挿部１７２は、イントラ予測モードに応じたシフト方向について、隣接画素に対して、６タップのFIRフィルタをかけ、線形内挿することで、隣接画素の位相を、小数画素精度にシフトさせる。隣接画素内挿部１７２は、位相がシフトされた隣接画素を、イントラ予測部１７１に供給する。 The adjacent pixel interpolation unit 172 shifts the phase of the adjacent pixels by the shift amount supplied from the intra prediction unit 171, in the shift direction corresponding to the intra prediction mode supplied from the intra prediction unit 171. Specifically, the adjacent pixel interpolation unit 172 applies a 6-tap FIR filter to the adjacent pixels in the shift direction corresponding to the intra prediction mode and then performs linear interpolation, thereby shifting the phase of the adjacent pixels with fractional pixel precision. The adjacent pixel interpolation unit 172 supplies the phase-shifted adjacent pixels to the intra prediction unit 171.
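
The phase shift performed on a row (or column) of adjacent pixels can be sketched as follows. This is a minimal illustration rather than the device's actual implementation. It assumes the H.264/AVC half-sample coefficients (1, -5, 20, 20, -5, 1) / 32 for the 6-tap FIR filter, a rounded average of the two nearest integer or half-sample values for quarter positions, edge replication where the filter reaches past the row, and 8-bit samples. The function names are hypothetical, and the shift is given in quarter-pixel units (-3 to +3, that is, -0.75 to +0.75 pixel).

```python
TAPS = (1, -5, 20, 20, -5, 1)  # assumed H.264/AVC half-sample filter taps


def _pixel(row, i):
    """Integer-position sample, replicating the edge pixels (assumption)."""
    return row[min(max(i, 0), len(row) - 1)]


def _half(row, i):
    """Half-sample value between row[i] and row[i + 1] via the 6-tap filter."""
    acc = sum(c * _pixel(row, i - 2 + k) for k, c in enumerate(TAPS))
    return min(max((acc + 16) >> 5, 0), 255)  # round, divide by 32, clip


def _sample(row, q):
    """Sample at position q, expressed in quarter-pixel units."""
    i, frac = divmod(q, 4)
    if frac == 0:
        return _pixel(row, i)
    if frac == 2:
        return _half(row, i)
    # Quarter positions: linear interpolation of the two nearest neighbours.
    return (_sample(row, q - 1) + _sample(row, q + 1) + 1) >> 1


def shift_adjacent_pixels(row, shift_q):
    """Return the row resampled at i + shift_q / 4 for every index i."""
    if not -3 <= shift_q <= 3:
        raise ValueError("shift must be between -3 and +3 quarter pixels")
    return [_sample(row, 4 * i + shift_q) for i in range(len(row))]


ramp = [0, 10, 20, 30, 40, 50, 60, 70]
shifted_half = shift_adjacent_pixels(ramp, 2)     # +0.5 pixel
shifted_quarter = shift_adjacent_pixels(ramp, 1)  # +0.25 pixel
```

On the ramp above, the half-sample value between 20 and 30 comes out as 25, and the quarter-sample value next to 20 comes out as 23, which is the rounded average of 20 and 25.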

動き予測・補償部１７３には、ヘッダ情報を復号して得られた情報（予測モード情報、動きベクトル情報、参照フレーム情報）が可逆復号部１６２から供給される。インター予測モードを示す情報が供給された場合、動き予測・補償部１７３は、動きベクトル情報と参照フレーム情報に基づいて画像に動き予測と補償処理を施し、予測画像を生成する。動き予測・補償部１７３は、インター予測モードにより生成された予測画像をスイッチ１７４に出力する。 Information (prediction mode information, motion vector information, reference frame information) obtained by decoding the header information is supplied from the lossless decoding unit 162 to the motion prediction / compensation unit 173. When information indicating the inter prediction mode is supplied, the motion prediction / compensation unit 173 performs motion prediction and compensation processing on the image based on the motion vector information and the reference frame information, and generates a predicted image. The motion prediction / compensation unit 173 outputs the prediction image generated in the inter prediction mode to the switch 174.

スイッチ１７４は、動き予測・補償部１７３またはイントラ予測部１７１により生成された予測画像を選択し、演算部１６５に供給する。 The switch 174 selects the prediction image generated by the motion prediction / compensation unit 173 or the intra prediction unit 171 and supplies the selected prediction image to the calculation unit 165.

［イントラ予測部および隣接画素内挿部の構成例］ [Configuration Example of Intra Prediction Unit and Adjacent Pixel Interpolation Unit]
図２７は、イントラ予測部および隣接画素内挿部の詳細な構成例を示すブロック図である。 FIG. 27 is a block diagram illustrating a detailed configuration example of the intra prediction unit and the adjacent pixel interpolation unit.

図２７の例の場合、イントラ予測部１７１は、予測モード受信部１８１、シフト量受信部１８２、イントラ予測画像生成部１８３により構成されている。隣接画素内挿部１７２は、水平方向内挿部１９１および垂直方向内挿部１９２により構成されている。 In the case of the example in FIG. 27, the intra prediction unit 171 includes a prediction mode reception unit 181, a shift amount reception unit 182, and an intra prediction image generation unit 183. The adjacent pixel interpolation unit 172 includes a horizontal direction interpolation unit 191 and a vertical direction interpolation unit 192.

予測モード受信部１８１は、可逆復号部１６２により復号されたイントラ予測モード情報を受信する。予測モード受信部１８１は、受信したイントラ予測モード情報を、イントラ予測画像生成部１８３、水平方向内挿部１９１、および垂直方向内挿部１９２に供給する。 The prediction mode receiving unit 181 receives the intra prediction mode information decoded by the lossless decoding unit 162. The prediction mode reception unit 181 supplies the received intra prediction mode information to the intra prediction image generation unit 183, the horizontal direction interpolation unit 191, and the vertical direction interpolation unit 192.

シフト量受信部１８２は、可逆復号部１６２により復号されたシフト量（水平方向および垂直方向）の情報を受信する。シフト量受信部１８２は、受信したシフト量のうち、水平方向のシフト量を、水平方向内挿部１９１に供給し、垂直方向のシフト量を、垂直方向内挿部１９２に供給する。 The shift amount receiving unit 182 receives information on the shift amount (horizontal direction and vertical direction) decoded by the lossless decoding unit 162. Of the received shift amounts, the shift amount reception unit 182 supplies the horizontal shift amount to the horizontal interpolation unit 191 and supplies the vertical shift amount to the vertical interpolation unit 192.

イントラ予測画像生成部１８３には、予測モード受信部１８１により受信されたイントラ予測モードの情報が入力される。また、イントラ予測画像生成部１８３には、水平方向内挿部１９１から上部隣接画素または内挿された上部隣接画素の情報と、垂直方向内挿部１９２から左部隣接画素または内挿された左部隣接画素の情報が入力される。 Information on the intra prediction mode received by the prediction mode reception unit 181 is input to the intra predicted image generation unit 183. In addition, information on the upper adjacent pixels or the interpolated upper adjacent pixels is input to the intra predicted image generation unit 183 from the horizontal direction interpolation unit 191, and information on the left adjacent pixels or the interpolated left adjacent pixels is input from the vertical direction interpolation unit 192.

イントラ予測画像生成部１８３は、入力されるイントラ予測モード情報が示す予測モードで、隣接画素または内挿された隣接画素の画素値を用いて、イントラ予測を行い、予測画像を生成し、生成した予測画像を、スイッチ１７４に出力する。 The intra predicted image generation unit 183 performs intra prediction in the prediction mode indicated by the input intra prediction mode information, using the pixel values of the adjacent pixels or the interpolated adjacent pixels, generates a predicted image, and outputs the generated predicted image to the switch 174.

水平方向内挿部１９１は、予測モード受信部１８１からの予測モードに応じて、フレームメモリ１６９より上部隣接画素を読み出す。水平方向内挿部１９１は、６タップのFIRフィルタおよび線形内挿により、読み出した上部隣接画素に対して、シフト量受信部１８２からの水平方向のシフト量で、位相をシフトする。内挿された上部隣接画素または内挿が行われなかった上部隣接画素（すなわち、フレームメモリ１６９からの隣接画素）の情報は、イントラ予測画像生成部１８３に供給される。図２７の場合、スイッチ１７０の図示は省略されているが、隣接画素は、フレームメモリ１６９から、スイッチ１７０を介して読み出される。 The horizontal direction interpolation unit 191 reads the upper adjacent pixel from the frame memory 169 according to the prediction mode from the prediction mode reception unit 181. The horizontal interpolation unit 191 shifts the phase by the horizontal shift amount from the shift amount reception unit 182 with respect to the read upper adjacent pixel by a 6-tap FIR filter and linear interpolation. Information on the interpolated upper adjacent pixel or the upper adjacent pixel that has not been interpolated (that is, the adjacent pixel from the frame memory 169) is supplied to the intra-predicted image generation unit 183. In the case of FIG. 27, the switch 170 is not shown, but adjacent pixels are read out from the frame memory 169 via the switch 170.

垂直方向内挿部１９２は、予測モード受信部１８１からの予測モードに応じて、フレームメモリ１６９より左部隣接画素を読み出す。垂直方向内挿部１９２は、６タップのFIRフィルタおよび線形内挿により、読み出した左部隣接画素に対して、シフト量受信部１８２からの垂直方向のシフト量で、位相をシフトする。線形内挿された左部隣接画素または内挿が行われなかった左部隣接画素（すなわち、フレームメモリ１６９からの隣接画素）の情報は、イントラ予測画像生成部１８３に供給される。 The vertical interpolation unit 192 reads the left adjacent pixel from the frame memory 169 in accordance with the prediction mode from the prediction mode receiving unit 181. The vertical interpolation unit 192 shifts the phase by the shift amount in the vertical direction from the shift amount reception unit 182 with respect to the read left adjacent pixel by a 6-tap FIR filter and linear interpolation. Information on the left adjacent pixel that has been linearly interpolated or the left adjacent pixel that has not been interpolated (that is, the adjacent pixel from the frame memory 169) is supplied to the intra-predicted image generation unit 183.

［画像復号装置の復号処理の説明］ [Description of Decoding Process of Image Decoding Device]
次に、図２８のフローチャートを参照して、画像復号装置１５１が実行する復号処理について説明する。 Next, the decoding process executed by the image decoding device 151 will be described with reference to the flowchart of FIG. 28.

ステップＳ１３１において、蓄積バッファ１６１は伝送されてきた画像を蓄積する。ステップＳ１３２において、可逆復号部１６２は、蓄積バッファ１６１から供給される圧縮画像を復号する。すなわち、図２の可逆符号化部６６により符号化されたＩピクチャ、Ｐピクチャ、並びにＢピクチャが復号される。 In step S131, the accumulation buffer 161 accumulates the transmitted image. In step S132, the lossless decoding unit 162 decodes the compressed image supplied from the accumulation buffer 161. That is, the I picture, P picture, and B picture encoded by the lossless encoding unit 66 in FIG. 2 are decoded.

このとき、動きベクトル情報、参照フレーム情報、予測モード情報（イントラ予測モード、またはインター予測モードを示す情報）、フラグ情報、およびシフト量の情報なども復号される。 At this time, motion vector information, reference frame information, prediction mode information (information indicating an intra prediction mode or an inter prediction mode), flag information, shift amount information, and the like are also decoded.

すなわち、予測モード情報がイントラ予測モード情報である場合、予測モード情報とシフト量の情報は、イントラ予測部１７１に供給される。予測モード情報がインター予測モード情報である場合、予測モード情報と対応する動きベクトル情報および参照フレーム情報は、動き予測・補償部１７３に供給される。 That is, when the prediction mode information is intra prediction mode information, the prediction mode information and the shift amount information are supplied to the intra prediction unit 171. When the prediction mode information is inter prediction mode information, motion vector information and reference frame information corresponding to the prediction mode information are supplied to the motion prediction / compensation unit 173.

ステップＳ１３３において、逆量子化部１６３は可逆復号部１６２により復号された変換係数を、図２の量子化部６５の特性に対応する特性で逆量子化する。ステップＳ１３４において逆直交変換部１６４は逆量子化部１６３により逆量子化された変換係数を、図２の直交変換部６４の特性に対応する特性で逆直交変換する。これにより図２の直交変換部６４の入力（演算部６３の出力）に対応する差分情報が復号されたことになる。 In step S133, the inverse quantization unit 163 inversely quantizes the transform coefficient decoded by the lossless decoding unit 162 with characteristics corresponding to the characteristics of the quantization unit 65 in FIG. In step S134, the inverse orthogonal transform unit 164 performs inverse orthogonal transform on the transform coefficient inversely quantized by the inverse quantization unit 163 with characteristics corresponding to the characteristics of the orthogonal transform unit 64 in FIG. As a result, the difference information corresponding to the input of the orthogonal transform unit 64 of FIG. 2 (output of the calculation unit 63) is decoded.

ステップＳ１３５において、演算部１６５は、後述するステップＳ１４１の処理で選択され、スイッチ１７４を介して入力される予測画像を差分情報と加算する。これにより元の画像が復号される。ステップＳ１３６においてデブロックフィルタ１６６は、演算部１６５より出力された画像をフィルタリングする。これによりブロック歪みが除去される。ステップＳ１３７においてフレームメモリ１６９は、フィルタリングされた画像を記憶する。 In step S135, the calculation unit 165 adds the prediction image selected in the process of step S141 described later and input via the switch 174 to the difference information. As a result, the original image is decoded. In step S136, the deblocking filter 166 filters the image output from the calculation unit 165. Thereby, block distortion is removed. In step S137, the frame memory 169 stores the filtered image.
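
The addition performed by the calculation unit 165 can be illustrated with a short sketch. It assumes that blocks are lists of rows of integers and that the sum is clipped to the valid sample range for 8-bit video; the clipping and the name reconstruct_block are assumptions made for illustration and are not stated in the text.

```python
def reconstruct_block(difference, prediction, bit_depth=8):
    """Add decoded difference information to the predicted image, sample by sample."""
    max_value = (1 << bit_depth) - 1
    return [
        [min(max(d + p, 0), max_value) for d, p in zip(diff_row, pred_row)]
        for diff_row, pred_row in zip(difference, prediction)
    ]


difference = [[-3, 4], [10, -20]]
prediction = [[100, 100], [250, 5]]
decoded = reconstruct_block(difference, prediction)
```

Here the top row becomes 97 and 104, while the bottom row is clipped to 255 and 0.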

ステップＳ１３８において、イントラ予測部１７１、および動き予測・補償部１７３は、可逆復号部１６２から供給される予測モード情報に対応して、それぞれ画像の予測処理を行う。 In step S138, the intra prediction unit 171 and the motion prediction / compensation unit 173 perform image prediction processing corresponding to the prediction mode information supplied from the lossless decoding unit 162, respectively.

すなわち、可逆復号部１６２からイントラ予測モード情報が供給された場合、イントラ予測部１７１は、イントラ予測モードのイントラ予測処理を行う。このとき、イントラ予測部１７１は、イントラ予測モードに応じたシフト方向に、可逆復号部１６２から供給されるシフト量で位相がシフトされた隣接画素を用いて、イントラ予測処理を行う。 That is, when intra prediction mode information is supplied from the lossless decoding unit 162, the intra prediction unit 171 performs an intra prediction process in the intra prediction mode. At this time, the intra prediction unit 171 performs an intra prediction process using adjacent pixels whose phase is shifted by the shift amount supplied from the lossless decoding unit 162 in the shift direction according to the intra prediction mode.

ステップＳ１３８における予測処理の詳細は、図２９を参照して後述するが、この処理により、イントラ予測部１７１により生成された予測画像、または動き予測・補償部１７３により生成された予測画像がスイッチ１７４に供給される。 The details of the prediction process in step S138 will be described later with reference to FIG. 29. Through this process, the predicted image generated by the intra prediction unit 171 or the predicted image generated by the motion prediction / compensation unit 173 is supplied to the switch 174.

ステップＳ１３９において、スイッチ１７４は予測画像を選択する。すなわち、イントラ予測部１７１により生成された予測画像、または動き予測・補償部１７３により生成された予測画像供給される。したがって、供給された予測画像が選択されて演算部１６５に供給され、上述したように、ステップＳ１３４において逆直交変換部１６４の出力と加算される。 In step S139, the switch 174 selects a predicted image. That is, a prediction image generated by the intra prediction unit 171 or a prediction image generated by the motion prediction / compensation unit 173 is supplied. Therefore, the supplied predicted image is selected and supplied to the calculation unit 165, and is added to the output of the inverse orthogonal transform unit 164 in step S134 as described above.

ステップＳ１４０において、画面並べ替えバッファ１６７は並べ替えを行う。すなわち画像符号化装置５１の画面並べ替えバッファ６２により符号化のために並べ替えられたフレームの順序が、元の表示の順序に並べ替えられる。 In step S140, the screen rearrangement buffer 167 performs rearrangement. That is, the order of frames rearranged for encoding by the screen rearrangement buffer 62 of the image encoding device 51 is rearranged to the original display order.
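
The rearrangement in step S140 can be pictured as restoring display order from decoding order. The sketch below assumes each decoded frame carries a display index and that all frames of a group are available at once; a real rearrangement buffer works incrementally, and the name reorder_for_display is hypothetical.

```python
def reorder_for_display(decoded_frames):
    """Sort (display_index, frame) pairs from decoding order into display order."""
    ordered = sorted(decoded_frames, key=lambda item: item[0])
    return [frame for _, frame in ordered]


# Decoding order I, P, B, B corresponds to display order I, B, B, P.
decoding_order = [(0, "I0"), (3, "P3"), (1, "B1"), (2, "B2")]
display_order = reorder_for_display(decoding_order)
```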

ステップＳ１４１において、Ｄ／Ａ変換部１６８は、画面並べ替えバッファ１６７からの画像をＤ／Ａ変換する。この画像が図示せぬディスプレイに出力され、画像が表示される。 In step S141, the D / A converter 168 D / A converts the image from the screen rearrangement buffer 167. This image is output to a display (not shown), and the image is displayed.

［予測処理の説明］ [Description of Prediction Process]
次に、図２９のフローチャートを参照して、図２８のステップＳ１３８の予測処理を説明する。 Next, the prediction process in step S138 of FIG. 28 will be described with reference to the flowchart in FIG. 29.

予測モード受信部１８１は、ステップＳ１７１において、対象ブロックがイントラ符号化されているか否かを判定する。可逆復号部１６２からイントラ予測モード情報が予測モード受信部１８１に供給されると、予測モード受信部１８１は、ステップ１７１において、対象ブロックがイントラ符号化されていると判定し、処理は、ステップＳ１７２に進む。 In step S171, the prediction mode reception unit 181 determines whether the target block is intra-coded. When intra prediction mode information is supplied from the lossless decoding unit 162 to the prediction mode reception unit 181, the prediction mode reception unit 181 determines in step S171 that the target block is intra-coded, and the process proceeds to step S172.

予測モード受信部１８１は、ステップＳ１７２において、可逆復号部１６２からのイントラ予測モード情報を受信、取得する。予測モード受信部１８１は、受信したイントラ予測モード情報を、イントラ予測画像生成部１８３、水平方向内挿部１９１、および垂直方向内挿部１９２に供給する。 In step S172, the prediction mode receiving unit 181 receives and acquires intra prediction mode information from the lossless decoding unit 162. The prediction mode reception unit 181 supplies the received intra prediction mode information to the intra prediction image generation unit 183, the horizontal direction interpolation unit 191, and the vertical direction interpolation unit 192.

シフト量受信部１８２は、ステップＳ１７３において、可逆復号部１６２により復号された隣接画素のシフト量（水平方向および垂直方向）の情報を受信、取得する。シフト量受信部１８２は、受信したシフト量のうち、水平方向のシフト量を、水平方向内挿部１９１に供給し、垂直方向のシフト量を、垂直方向内挿部１９２に供給する。 In step S173, the shift amount reception unit 182 receives and acquires information on the shift amounts (horizontal direction and vertical direction) of the adjacent pixels decoded by the lossless decoding unit 162. Of the received shift amounts, the shift amount reception unit 182 supplies the horizontal shift amount to the horizontal interpolation unit 191 and supplies the vertical shift amount to the vertical interpolation unit 192.

水平方向内挿部１９１および垂直方向内挿部１９２は、フレームメモリ１６９より隣接画素を読み出し、ステップＳ１７４において、隣接画素内挿処理を実行する。ステップＳ１７４における隣接内挿処理の詳細は、図２５を参照して上述した隣接内挿処理と基本的に同様の処理であるため、その説明と図示は省略される。 The horizontal direction interpolation unit 191 and the vertical direction interpolation unit 192 read adjacent pixels from the frame memory 169 and perform adjacent pixel interpolation processing in step S174. The adjacent pixel interpolation processing in step S174 is basically the same as the adjacent pixel interpolation processing described above with reference to FIG. 25, so its description and illustration are omitted.

この処理により、予測モード受信部１８１からのイントラ予測モードに応じたシフト方向に、内挿された隣接画素、あるいは、イントラ予測モードに応じて内挿されなかった隣接画素が、イントラ予測画像生成部１８３に供給される。 Through this process, the adjacent pixels interpolated in the shift direction corresponding to the intra prediction mode from the prediction mode reception unit 181, or the adjacent pixels left uninterpolated because of the intra prediction mode, are supplied to the intra predicted image generation unit 183.

すなわち、イントラ予測モードがモード２（DC 予測）の場合、水平方向内挿部１９１および垂直方向内挿部１９２は、隣接画素の内挿を行わず、フレームメモリ１６９から読み出された上部および左部隣接画素を、イントラ予測画像生成部１８３に供給する。 That is, when the intra prediction mode is mode 2 (DC prediction), the horizontal direction interpolation unit 191 and the vertical direction interpolation unit 192 do not interpolate the adjacent pixels, and supply the upper and left adjacent pixels read from the frame memory 169 to the intra predicted image generation unit 183.

イントラ予測モードがモード０（Vertical 予測）、モード３（Diagonal_Down_Left 予測）、またはモード７（Vertical_Left 予測）の場合、水平方向の内挿のみが行われる。すなわち、水平方向内挿部１９１は、フレームメモリ１６９から読み出された上部隣接画素に対して、シフト量受信部１８２からの水平方向のシフト量で内挿し、内挿された上部隣接画素を、イントラ予測画像生成部１８３に供給する。このとき、垂直方向内挿部１９２は、左部隣接画素の内挿を行わず、フレームメモリ１６９から読み出された左部隣接画素を、イントラ予測画像生成部１８３に供給する。 When the intra prediction mode is mode 0 (Vertical prediction), mode 3 (Diagonal_Down_Left prediction), or mode 7 (Vertical_Left prediction), only horizontal interpolation is performed. That is, the horizontal direction interpolation unit 191 interpolates the upper adjacent pixels read from the frame memory 169 by the horizontal shift amount from the shift amount reception unit 182, and supplies the interpolated upper adjacent pixels to the intra predicted image generation unit 183. At this time, the vertical direction interpolation unit 192 does not interpolate the left adjacent pixels, and supplies the left adjacent pixels read from the frame memory 169 to the intra predicted image generation unit 183.

イントラ予測モードがモード１（Horizontal 予測）、またはモード８（Horizontal_Up 予測）の場合、垂直方向のみの内挿が行われる。すなわち、垂直方向内挿部１９２は、フレームメモリ１６９から読み出された左部隣接画素に対して、シフト量受信部１８２からの垂直方向のシフト量で内挿し、内挿された左部隣接画素を、イントラ予測画像生成部１８３に供給する。このとき、水平方向内挿部１９１は、上部隣接画素の内挿を行わず、フレームメモリ１６９から読み出された上部隣接画素を、イントラ予測画像生成部１８３に供給する。 When the intra prediction mode is mode 1 (Horizontal prediction) or mode 8 (Horizontal_Up prediction), only vertical interpolation is performed. That is, the vertical direction interpolation unit 192 interpolates the left adjacent pixels read from the frame memory 169 by the vertical shift amount from the shift amount reception unit 182, and supplies the interpolated left adjacent pixels to the intra predicted image generation unit 183. At this time, the horizontal direction interpolation unit 191 does not interpolate the upper adjacent pixels, and supplies the upper adjacent pixels read from the frame memory 169 to the intra predicted image generation unit 183.

イントラ予測モードがその他の予測モードの場合、水平方向および垂直方向の内挿が行われる。すなわち、水平方向内挿部１９１は、フレームメモリ１６９から読み出された上部隣接画素に対して、シフト量受信部１８２からの水平方向のシフト量で内挿し、内挿された上部隣接画素を、イントラ予測画像生成部１８３に供給する。垂直方向内挿部１９２は、フレームメモリ１６９から読み出された左部隣接画素に対して、シフト量受信部１８２からの垂直方向のシフト量で内挿し、内挿された左部隣接画素を、イントラ予測画像生成部１８３に供給する。 When the intra prediction mode is any other prediction mode, both horizontal and vertical interpolation are performed. That is, the horizontal direction interpolation unit 191 interpolates the upper adjacent pixels read from the frame memory 169 by the horizontal shift amount from the shift amount reception unit 182, and supplies the interpolated upper adjacent pixels to the intra predicted image generation unit 183. The vertical direction interpolation unit 192 interpolates the left adjacent pixels read from the frame memory 169 by the vertical shift amount from the shift amount reception unit 182, and supplies the interpolated left adjacent pixels to the intra predicted image generation unit 183.

In step S175, the intra predicted image generation unit 183 performs intra prediction in the prediction mode indicated by the input intra prediction mode information, using the pixel values of the adjacent pixels or interpolated adjacent pixels supplied from the horizontal direction interpolation unit 191 and the vertical direction interpolation unit 192. This intra prediction generates a predicted image, which is output to the switch 174.

On the other hand, if it is determined in step S171 that the image is not intra coded, the process proceeds to step S176.

When the image to be processed is an image to be inter processed, the inter prediction mode information, the reference frame information, and the motion vector information are supplied from the lossless decoding unit 162 to the motion prediction/compensation unit 173. In step S176, the motion prediction/compensation unit 173 acquires the inter prediction mode information, the reference frame information, the motion vector information, and the like from the lossless decoding unit 162.

Then, in step S177, the motion prediction/compensation unit 173 performs inter motion prediction. That is, when the image to be processed is an image to be inter predicted, the necessary image is read from the frame memory 169 and supplied to the motion prediction/compensation unit 173 via the switch 170. In step S177, the motion prediction/compensation unit 173 performs motion prediction in the inter prediction mode based on the motion vector acquired in step S176, and generates a predicted image. The generated predicted image is output to the switch 174.

As described above, the image encoding device 51 obtains pixels with fractional pixel precision by means of a 6-tap FIR filter and linear interpolation and determines the optimal shift amount, which increases the choice of pixel values available in the intra prediction mode. This makes optimal intra prediction possible and further improves the encoding efficiency of intra prediction.
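
A minimal sketch of this idea follows, assuming the standard H.264/AVC half-sample taps (1, -5, 20, 20, -5, 1)/32 and quarter-sample values formed by averaging the nearest integer and half samples. The function names, the clamping of indices at the ends of the row, and the use of a sum of absolute differences (SAD) as the selection cost are assumptions made for illustration; this section does not specify the cost function, and an actual encoder may use a different one.

```python
# Illustrative sketch only; names, edge handling, and the SAD cost are assumptions.
TAPS = (1, -5, 20, 20, -5, 1)  # H.264/AVC half-sample filter, normalised by 32


def _clamp_index(pixels, i):
    # Assumption: repeat the end pixels when the filter runs off the row.
    return min(max(i, 0), len(pixels) - 1)


def half_pel(pixels, i):
    """Half-sample value between pixels[i] and pixels[i + 1] (8-bit samples)."""
    acc = sum(c * pixels[_clamp_index(pixels, i - 2 + k)] for k, c in enumerate(TAPS))
    return max(0, min(255, (acc + 16) >> 5))


def sample_at(pixels, pos_q):
    """Value at position pos_q, expressed in quarter-sample units."""
    i, frac = divmod(pos_q, 4)
    if frac == 0:
        return pixels[_clamp_index(pixels, i)]
    half = half_pel(pixels, i)
    if frac == 2:
        return half
    # Quarter sample: linear interpolation of the nearest integer and half samples.
    nearest = pixels[_clamp_index(pixels, i if frac == 1 else i + 1)]
    return (nearest + half + 1) >> 1


def shifted_row(pixels, start, count, shift_q):
    """`count` neighbours starting at index `start`, shifted by shift_q/4 sample."""
    return [sample_at(pixels, 4 * (start + x) + shift_q) for x in range(count)]


def best_horizontal_shift(top, start, block):
    """Pick the shift in -3..+3 quarter samples (-0.75..+0.75) that minimises
    the SAD of a Vertical prediction (shifted top row copied down each row)."""
    def cost(shift_q):
        pred = shifted_row(top, start, len(block[0]), shift_q)
        return sum(abs(v - p) for row in block for v, p in zip(row, pred))
    return min(range(-3, 4), key=lambda s: (cost(s), abs(s)))
```

Here `top` stands for the row of upper adjacent pixels and `block` for the 4×4 target block as a list of rows. Ties are broken in favour of the smaller shift magnitude, which is a design choice of this sketch rather than something stated in the text.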

In addition, the 6-tap FIR filter circuit, which in the H.264/AVC system was used only for the inter motion prediction and compensation described above with reference to FIG. 4, can also be put to effective use in intra prediction. Efficiency can thus be improved without increasing the circuit scale.

Note that, unlike the proposal described in Non-Patent Document 2, the image encoding device 51 uses for intra prediction only the pixels that are adjacent to the target block at the predetermined positions used in H.264/AVC intra prediction. That is, only the adjacent pixels need to be read into the adjacent pixel buffer 81.

This avoids the increase in the number of memory accesses and in processing, that is, the loss of processing efficiency, that the proposal of Non-Patent Document 2 incurs by also using pixels other than those adjacent to the block to be encoded for prediction.

In the above description, the adjacent pixel interpolation processing has been explained using the intra 4×4 prediction mode of the luminance signal as an example, but the present invention can also be applied to the intra 8×8 and intra 16×16 prediction modes. The present invention can also be applied to the intra prediction modes of the color difference signal.

Note that in the intra 8×8 prediction mode, as in the intra 4×4 prediction mode, mode 2 (DC prediction mode) performs mean value processing. A shift would therefore not directly contribute to improving encoding efficiency, so the operation described above is prohibited and is not performed for this mode.

In the intra 16×16 prediction mode and the intra prediction modes of the color difference signal, only the horizontal shift of the upper adjacent pixels is performed for the Vertical Prediction mode. For the Horizontal Prediction mode, only the vertical shift of the left adjacent pixels is performed. For the DC Prediction mode, no shift processing is performed. For the Plane Prediction mode, both the horizontal shift of the upper adjacent pixels and the vertical shift of the left adjacent pixels are performed.
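
One way to picture these rules is as a filter on a signalled pair of shift amounts: any component that does not apply to the mode is dropped. The sketch below is hypothetical. The table keys, the function name, and the convention of expressing shifts in quarter-sample units are assumptions for illustration, not part of the specification.

```python
# Illustrative sketch only; the names and the quarter-sample unit are assumptions.
# Each entry is (shift_upper_horizontally, shift_left_vertically).
SHIFT_USAGE_16X16_CHROMA = {
    "vertical": (True, False),
    "horizontal": (False, True),
    "dc": (False, False),
    "plane": (True, True),
}


def effective_shift(mode: str, dx_q: int, dy_q: int) -> tuple[int, int]:
    """Return the (horizontal, vertical) shift actually applied for `mode`,
    zeroing the component that the mode does not use."""
    use_h, use_v = SHIFT_USAGE_16X16_CHROMA[mode]
    return (dx_q if use_h else 0, dy_q if use_v else 0)
```

For example, a Vertical Prediction block keeps only its horizontal component, while a DC Prediction block always ends up with a zero shift in both directions.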

Furthermore, when interpolation processing with 1/8 pixel precision is performed in motion prediction, as described in Non-Patent Document 1, interpolation processing with 1/8 pixel precision is also performed in the present invention.

Although the H.264/AVC system has been used as the encoding system in the above description, the present invention is not limited to it and can be applied to other encoding and decoding systems that perform intra prediction using adjacent pixels.

The present invention can be applied to image encoding devices and image decoding devices used when image information (a bitstream) compressed by an orthogonal transform such as the discrete cosine transform and by motion compensation, as in MPEG or H.26x, is received via network media such as satellite broadcasting, cable television, the Internet, or mobile phones. The present invention can also be applied to image encoding devices and image decoding devices used when processing such information on storage media such as optical disks, magnetic disks, and flash memory. Furthermore, the present invention can be applied to the motion prediction and compensation devices included in such image encoding devices and image decoding devices.

The series of processes described above can be executed by hardware or by software. When the series of processes is executed by software, a program constituting the software is installed on a computer. Here, the computer includes a computer built into dedicated hardware and, for example, a general-purpose personal computer that can execute various functions when various programs are installed on it.

FIG. 30 is a block diagram showing a configuration example of the hardware of a computer that executes the series of processes described above by means of a program.

In the computer, a CPU (Central Processing Unit) 301, a ROM (Read Only Memory) 302, and a RAM (Random Access Memory) 303 are connected to one another by a bus 304.

An input/output interface 305 is also connected to the bus 304. An input unit 306, an output unit 307, a storage unit 308, a communication unit 309, and a drive 310 are connected to the input/output interface 305.

The input unit 306 includes a keyboard, a mouse, a microphone, and the like. The output unit 307 includes a display, a speaker, and the like. The storage unit 308 includes a hard disk, a nonvolatile memory, and the like. The communication unit 309 includes a network interface and the like. The drive 310 drives a removable medium 311 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory.

In the computer configured as described above, the series of processes described above is performed when the CPU 301, for example, loads a program stored in the storage unit 308 into the RAM 303 via the input/output interface 305 and the bus 304 and executes it.

The program executed by the computer (CPU 301) can be provided by being recorded on the removable medium 311 as, for example, a package medium. The program can also be provided via a wired or wireless transmission medium such as a local area network, the Internet, or digital broadcasting.

In the computer, the program can be installed in the storage unit 308 via the input/output interface 305 by loading the removable medium 311 into the drive 310. The program can also be received by the communication unit 309 via a wired or wireless transmission medium and installed in the storage unit 308. Alternatively, the program can be installed in advance in the ROM 302 or the storage unit 308.

The program executed by the computer may be a program whose processes are performed in time series in the order described in this specification, or a program whose processes are performed in parallel or at the necessary timing, such as when a call is made.

The embodiments of the present invention are not limited to those described above, and various modifications can be made without departing from the gist of the present invention.

51 image encoding device, 66 lossless encoding unit, 74 intra prediction unit, 75 adjacent pixel interpolation unit, 76 motion prediction/compensation unit, 77 predicted image selection unit, 81 adjacent pixel buffer, 82 optimal mode determination unit, 83 optimal shift amount determination unit, 84 predicted image generation unit, 91 mode determination unit, 92 horizontal direction interpolation unit, 93 vertical direction interpolation unit, 151 image decoding device, 162 lossless decoding unit, 171 intra prediction unit, 172 adjacent pixel interpolation unit, 173 motion prediction/compensation unit, 174 switch, 181 prediction mode reception unit, 182 shift amount reception unit, 183 intra predicted image generation unit, 191 horizontal direction interpolation unit, 192 vertical direction interpolation unit

Claims

An image processing apparatus comprising:
a memory that stores adjacent pixels that are referred to when intra prediction is performed on pixels of a target block that is the target of encoding processing in an image to be encoded;
a selection unit that selects, according to the prediction direction and block size used when intra prediction is performed on the pixels of the target block, whether to shift or not to shift the phase of the adjacent pixels read from the memory;
an intra prediction unit that performs intra prediction on the pixels of the target block using the adjacent pixels and generates a predicted image; and
an encoding unit that encodes the image using the predicted image generated by the intra prediction unit.

The image processing apparatus according to claim 1, wherein, when the selection unit selects shifting the phase of the adjacent pixels read from the memory, the intra prediction unit performs intra prediction on the pixels of the target block using the adjacent pixels whose phase has been shifted.

The image processing apparatus according to claim 1, wherein, when the selection unit selects not shifting the phase of the adjacent pixels read from the memory, the intra prediction unit performs intra prediction on the pixels of the target block using the adjacent pixels whose phase has not been shifted.

An image processing method in which an image processing apparatus:
selects, according to the prediction direction and block size used when intra prediction is performed on pixels of a target block that is the target of encoding processing in an image to be encoded, whether to shift or not to shift the phase of adjacent pixels read from a memory that stores the adjacent pixels referred to when intra prediction is performed on the pixels of the target block;
performs intra prediction on the pixels of the target block using the adjacent pixels and generates a predicted image; and
encodes the image using the generated predicted image.

The image processing method according to claim 4, wherein, when shifting the phase of the adjacent pixels read from the memory is selected, intra prediction is performed on the pixels of the target block using the adjacent pixels whose phase has been shifted.

The image processing method according to claim 4, wherein, when not shifting the phase of the adjacent pixels read from the memory is selected, intra prediction is performed on the pixels of the target block using the adjacent pixels whose phase has not been shifted.

A recording medium on which an encoded stream generated by the image processing apparatus according to claim 1 is recorded.