JP5219062B2

JP5219062B2 - Image data generation method

Info

Publication number: JP5219062B2
Application number: JP2006107394A
Authority: JP
Inventors: 彰岡本; 英輝大安
Original assignee: MegaChips Corp
Current assignee: MegaChips Corp
Priority date: 2006-04-10
Filing date: 2006-04-10
Publication date: 2013-06-26
Anticipated expiration: 2026-04-10
Also published as: WO2007116617A1; JP2007282014A

Description

本発明は画像データの生成方法に関し、特に予測モードの決定を簡易に行うイントラ予測を含んだ画像データの生成方法に関する。 The present invention relates to a method for generating image data, and more particularly to a method for generating image data including intra prediction for easily determining a prediction mode.

電気通信に関する国際標準化機関であるＩＴＵ−Ｔ（International Telecommunication Union-Telecommunication Standardization Sector）のＶＣＥＧ（Video Coding Experts Group）とＩＳＯ／ＩＥＣのＭＰＥＧ（Moving Picture Expetrts Group）によって、動画像符号化の最新国際標準としてＨ．２６４が開発され、実用化されている。 The latest international standard for video coding by VCEG (Video Coding Experts Group) of ITU-T (International Telecommunication Union-Telecommunication Standardization Sector) and ISO / IEC Moving Picture Expetrts Group (MPEG) As H. H.264 has been developed and put into practical use.

Ｈ．２６４は、ＭＰＥＧ−２やＭＰＥＧ−４に比べて、同程度の画質で２倍の圧縮を行うことができ、低ビットレートのテレビ会議からＨＤＴＶ（High Definition Television）までの幅広い用途に適用可能である。 H. H.264 can compress twice as much as the same image quality as MPEG-2 and MPEG-4, and can be applied to a wide range of applications from low bit rate video conferencing to HDTV (High Definition Television). is there.

Ｈ．２６４の特徴の１つに、フレーム内予測（イントラ予測）と呼称される方式が挙げられる。これは、画素相関を利用して圧縮率を向上させるための方式であり、サブブロック間でのデータの補間により予測画像を生成する方式であり、符号化対象サブブロックに隣接する符号化済みサブブロックを参照して、画素レベルでの比較により予測を行うものである。 H. One of the characteristics of H.264 is a method called intra-frame prediction (intra prediction). This is a method for improving the compression rate using pixel correlation, and is a method for generating a predicted image by interpolating data between sub-blocks. By referring to the block, prediction is performed by comparison at the pixel level.

イントラ予測は、ＭＰＥＧ−４においても採用されているが、ＭＰＥＧ−４が符号化対象サブブロックの左側、上側および左上側のサブブロックを参照するのに対し、Ｈ．２６４の場合は、左側、上側、左上側および右上側のサブブロックを参照するので、より精度の高い予測画像を得ることができる。 Intra prediction is also employed in MPEG-4, but MPEG-4 refers to the left, upper, and upper left subblocks of the encoding target subblock. In the case of H.264, since the left, upper, upper left, and upper right sub-blocks are referred to, a predicted image with higher accuracy can be obtained.

また、ＭＰＥＧ−４においては８×８画素単位で予測を行うのに対し、Ｈ．２６４では、複雑な画像は４×４画素単位で予測を行い、単純な画像は１６×１６画素単位で予測を行うので、効率の良い予測が可能となる。 In MPEG-4, prediction is performed in units of 8 × 8 pixels. In H.264, complicated images are predicted in units of 4 × 4 pixels, and simple images are predicted in units of 16 × 16 pixels. Therefore, efficient prediction is possible.

このような、Ｈ．２６４におけるイントラ予測においては、１６×１６画素単位での予測では４通りの予測方法（予測モード）を使用して最適な予測モードを決定し、４×４画素単位での予測では９通りの予測モードを使用して最適な予測モードを決定するので、膨大な計算処理を要し、予測結果が得られるまでに時間がかかるという問題がある。 H. In intra prediction in H.264, the optimal prediction mode is determined by using four prediction methods (prediction modes) for prediction in 16 × 16 pixel units, and nine predictions in prediction in 4 × 4 pixel units. Since the optimum prediction mode is determined using the mode, there is a problem that enormous calculation processing is required and it takes time until the prediction result is obtained.

例えば特許文献１においては、イントラ予測を、同時に複数の符号化対象サブブロックに対して行うことで、イントラ予測の時間短縮を達成する技術が開示されている。 For example, Patent Literature 1 discloses a technique for achieving intra prediction time reduction by simultaneously performing intra prediction on a plurality of encoding target sub-blocks.

特開２００５−１３０５０９号公報JP 2005-130509 A

以上説明したように、イントラ予測に費やす時間をできるだけ短縮することがＨ．２６４に求められており、特許文献１にはその解決法の１つが提案されているが、イントラ予測を行うための計算器を複数設けることで総合的な時間を短縮するものであり、個々のサブブロックのイントラ予測に費やす時間を短縮するものはなかった。 As described above, it is necessary to reduce the time spent for intra prediction as much as possible. H.264, and one of the solutions has been proposed in Japanese Patent Application Laid-Open No. H11-260, but the overall time is shortened by providing a plurality of calculators for performing intra prediction. There was nothing to reduce the time spent in intra prediction of sub-blocks.

本発明は上記のような問題点を解消するためになされたもので、個々のサブブロックのイントラ予測を効率化して、イントラ予測に費やす時間を短縮した画像データの生成方法を提供することを目的とする。 The present invention has been made to solve the above-described problems, and an object of the present invention is to provide a method for generating image data in which the intra prediction of each sub-block is made efficient and the time spent for intra prediction is shortened. And

本発明に係る請求項１記載の画像データの生成方法は、符号化済みの複数の第１のサブブロックの画素値を用いて、符号化対象となる第２のサブブロックの画像予測を行うイントラ予測を有した画像データの生成方法であって、前記イントラ予測に先立って、前記第１のサブブロックの符号化が完了していない場合に、前記第１のサブブロックの符号化前の前記画素値を第１の予測値とし、該第１の予測値を用いて前記第２のサブブロックに対して複数の予測モードごとに補間を実行し、前記複数の予測モードごとの補間の結果に基づいて、前記複数の予測モードから少なくとも１つを選択して最適予測モード候補とする簡易イントラ予測を有し、前記イントラ予測は、前記少なくとも１つの最適予測モード候補を用いて前記画像予測を行い、前記イントラ予測は、前記第１のサブブロックの全てについて符号化が完了した後に実行され、符号化が完了した前記第１のサブブロックの符号化後の前記画素値を第２の予測値とし、該第２の予測値を用いて前記第２のサブブロックに対して前記少なくとも１つの最適予測モード候補ごとに補間を実行し、前記少なくとも１つの最適予測モード候補ごとの補間の結果に基づいて、前記少なくとも１つの最適予測モード候補の中から最適予測モードを決定して前記画像予測を行い、前記簡易イントラ予測において、符号化が完了している前記第１のサブブロックについては、前記第１のサブブロックの符号化後の前記画素値または符号化前の前記画素値を前記第１の予測値として用いる。 According to a first aspect of the present invention, there is provided an image data generation method including an intra prediction for predicting an image of a second sub-block to be encoded using pixel values of a plurality of encoded first sub-blocks. A method of generating image data having prediction, wherein the pixel before encoding of the first sub-block when encoding of the first sub-block is not completed prior to the intra prediction Based on the result of the interpolation for each of the plurality of prediction modes, the value is a first prediction value, interpolation is performed for each of the plurality of prediction modes for the second sub-block using the first prediction value. A simple intra prediction that selects at least one of the plurality of prediction modes as an optimal prediction mode candidate, and the intra prediction performs the image prediction using the at least one optimal prediction mode candidate. The intra prediction is performed after encoding is completed for all of the first sub-blocks, and the pixel values after encoding of the first sub-blocks for which encoding is completed are used as second prediction values, Performing interpolation for each of the at least one optimum prediction mode candidate for the second sub-block using the second prediction value, and based on the result of interpolation for each of the at least one optimum prediction mode candidate, There line the image prediction to determine the optimum prediction mode from the at least one optimum prediction mode candidate, the in simplified intra prediction, for the first sub-block coding is completed, the first The pixel value after encoding of the sub-block or the pixel value before encoding is used as the first predicted value .

本発明に係る請求項２記載の画像データの生成方法は、前記簡易イントラ予測においては、前記複数の予測モードごとの補間の結果得られた前記第２のサブブロックの第１の予測画像と、前記第２のサブブロックの補間が実行される前の画素値との差分の絶対値和を、前記複数の予測モードごとに取得し、前記絶対値和の小さいものから順に選択することで、前記少なくとも１つの最適予測モード候補を選択する。 According to a second aspect of the present invention, in the simple intra prediction, the first predicted image of the second sub-block obtained as a result of interpolation for each of the plurality of prediction modes; By acquiring the absolute value sum of the difference from the pixel value before the interpolation of the second sub-block is performed for each of the plurality of prediction modes, and selecting in order from the smallest absolute value sum, At least one optimum prediction mode candidate is selected.

本発明に係る請求項３記載の画像データの生成方法は、前記イントラ予測が、前記少なくとも１つの最適予測モード候補ごとの補間の結果得られた前記第２のサブブロックの第２の予測画像と、前記第２のサブブロックの補間が実行される前の画素値との差分の絶対値和を、前記少なくとも１つの最適予測モード候補ごとに取得し、前記絶対値和の最も小さいものを選択することで、前記最適予測モードを決定する。 According to a third aspect of the present invention, there is provided the method for generating image data according to the second prediction image of the second sub-block obtained as a result of interpolation for each of the at least one optimum prediction mode candidate. The absolute value sum of the difference from the pixel value before the interpolation of the second sub-block is executed is obtained for each of the at least one optimum prediction mode candidate, and the one having the smallest absolute value sum is selected. Thus, the optimum prediction mode is determined.

本発明に係る請求項４記載の画像データの生成方法は、前記簡易イントラ予測および前記イントラ予測がパイプライン処理によって実行され、前記パイプライン処理は、ｎ番のサブブロックに対して前記簡易イントラ予測を実行しているｎ番の処理サイクルにおいて、ｎ−１番の処理サイクルで前記簡易イントラ予測が実行されたｎ−１番のサブブロックに対しての前記イントラ予測が実行されるとともに、前記イントラ予測により決定した前記最適予測モードを使用して、前記ｎ−１番のサブブロックに対してのローカルデコードを実行する。 According to a fourth aspect of the present invention, in the image data generation method, the simple intra prediction and the intra prediction are executed by pipeline processing, and the pipeline processing is performed with respect to the nth subblock. In the n-th processing cycle in which the intra-prediction is performed, the intra-prediction is performed on the n-1-th sub-block for which the simple intra-prediction is performed in the n-1th processing cycle, and the intra Using the optimal prediction mode determined by prediction, local decoding is performed on the n-1st sub-block.

本発明に係る請求項１記載の画像データの生成方法によれば、第１のサブブロックの符号化が完了していない場合に、第１のサブブロックの符号化前の画素値を第１の予測値とし、該第１の予測値を用いて第２のサブブロックに対して複数の予測モードごとに補間を実行し、複数の予測モードごとの補間の結果に基づいて、複数の予測モードから少なくとも１つを選択して最適予測モード候補とする簡易イントラ予測を有するので、参照対象のサブブロックである第１のサブブロックの全てについて符号化が終わっていない段階でも画像予測を始めることができ、個々のサブブロックのイントラ予測を効率化を図ることができる。また、イントラ予測に費やす時間を短縮でき、また、第１のサブブロックにおいて符号化が完了しているものと完了していないものとが存在する場合でも、適切に最適予測モード候補の選択をすることができる。 According to the image data generation method of the first aspect of the present invention, when the encoding of the first sub-block is not completed, the pixel value before the encoding of the first sub-block is set to the first Using the first prediction value as a prediction value, interpolation is performed for each of the plurality of prediction modes for the second sub-block, and based on the result of the interpolation for each of the plurality of prediction modes, from the plurality of prediction modes Since it has simple intra prediction that selects at least one as an optimal prediction mode candidate, it is possible to start image prediction even at a stage where coding has not been completed for all of the first sub-blocks that are reference target sub-blocks. Thus, it is possible to improve the efficiency of intra prediction of individual sub-blocks. In addition, the time required for intra prediction can be shortened , and even when there is a coding that has been completed and a coding that has not been completed in the first sub-block, the optimum prediction mode candidate is appropriately selected. it is Ru can.

本発明に係る請求項２記載の画像データの生成方法によれば、簡易イントラ予測において適切に最適予測モード候補の選択をすることができる。 According to the image data generation method of the second aspect of the present invention, the optimum prediction mode candidate can be appropriately selected in the simple intra prediction.

本発明に係る請求項３記載の画像データの生成方法によれば、イントラ予測において適切に最適予測モードの決定をすることができる。 According to the image data generation method of the third aspect of the present invention, it is possible to appropriately determine the optimum prediction mode in intra prediction.

本発明に係る請求項４記載の画像データの生成方法によれば、簡易イントラ予測およびイントラ予測をパイプライン処理によって実行することで、画像データの符号化処理を効率的に行うことができる。
According to the image data generation method of the fourth aspect of the present invention, it is possible to efficiently perform the encoding process of the image data by executing the simple intra prediction and the intra prediction by pipeline processing.

＜イントラ予測モードについて＞
発明の実施の形態の説明に先立って、Ｈ．２６４におけるイントラ予測において、４×４画素単位での予測に使用される９通りの予測モードについて説明する。 <Intra prediction mode>
Prior to the description of the embodiments of the invention, H.C. In the intra prediction in H.264, nine prediction modes used for prediction in units of 4 × 4 pixels will be described.

図１は、１６×１６画素で構成されるマクロブロックＭＢを模式的に表す図であり、マクロブロックＭＢは１６個のサブブロックＳＢで構成されている。 FIG. 1 is a diagram schematically showing a macro block MB composed of 16 × 16 pixels, and the macro block MB is composed of 16 sub-blocks SB.

各サブブロックＳＢは４×４画素で構成され、サブブロック単位でイントラ予測および符号化が実行される。図１において各サブブロックＳＢに割り当てられた数字は符号化の順番を表し、左上隅のサブブロックを０番とし、その右隣りが１番、０番の下が２番というようにジグザグに割り当てられている。 Each sub-block SB is composed of 4 × 4 pixels, and intra prediction and encoding are executed in units of sub-blocks. In FIG. 1, the numbers assigned to each sub-block SB indicate the encoding order, and the sub-block in the upper left corner is numbered 0, the right next to it is number 1 and the number below 0 is numbered zigzag. It has been.

また、４個のサブブロックＳＢで８×８画素単位のブロックＢＬを構成し、０番〜３番のサブブロックＳＢ、４番〜７番のサブブロックＳＢ、８番〜１１番のサブブロックＳＢおよび１２番〜１５番のサブブロックＳＢで、それぞれ１つのブロックＢＬを構成している。 Further, a block BL of 8 × 8 pixel units is constituted by four sub blocks SB, and the 0th to 3rd sub blocks SB, the 4th to 7th sub blocks SB, and the 8th to 11th sub blocks SB The 12th to 15th sub-blocks SB constitute one block BL.

以下、１２番のサブブロックＳＢに対してイントラ予測を行う場合を想定して、図２〜図１１を用いて、９通りの予測モードについて説明する。 Hereinafter, assuming the case where intra prediction is performed on the twelfth sub-block SB, nine prediction modes will be described with reference to FIGS.

図２は、上述した１２番のサブブロックＳＢを構成する１６個の符号化対象画素ＵＰのそれぞれに、便宜的にａ〜ｐの記号を割り当てた図であり、左上隅の画素をａ番とし、その右隣りをｂ番とし、以後、ｃ番、ｄ番と連続して割り当て、ａ番の下の画素をｅ番とし、以後、同様の規則でｐ番まで割り当てられている。 FIG. 2 is a diagram in which the symbols a to p are assigned to the 16 encoding target pixels UP constituting the 12th sub-block SB for convenience, and the pixel in the upper left corner is designated as a. The right-hand side is b, the c and d are assigned consecutively, the pixels below the a are the e, and the same rules are used up to p.

また、１２番のサブブロックＳＢに隣接する、符号化済みのサブブロック内の符号化済み画素ＥＰのうち、イントラ予測において参照画素として使用される画素には、便宜的にＡ〜Ｍの記号を割り当てている。 In addition, among the encoded pixels EP in the encoded sub-block adjacent to the 12th sub-block SB, pixels used as reference pixels in the intra prediction are denoted by symbols A to M for convenience. Assigned.

すなわち、図１に示す６番のサブブロックＳＢ（上側ブロック）における符号化済み画素ＥＰには、左から順にＡ〜Ｄを割り当て、７番のサブブロックＳＢ（右上側ブロック）における符号化済み画素ＥＰには、左から順にＥ〜Ｈを割り当て、９番のサブブロックＳＢ（左側ブロック）における符号化済み画素ＥＰには、上から順にＩ〜Ｌを割り当て、３番のサブブロックＳＢ（左上側ブロック）における符号化済み画素ＥＰには、Ｍを割り当てている。 That is, A to D are assigned to the encoded pixels EP in the sixth sub-block SB (upper block) shown in FIG. 1 in order from the left, and the encoded pixels in the seventh sub-block SB (upper right block) are assigned. EP is assigned E to H in order from the left, and the encoded pixels EP in the ninth sub-block SB (left block) are assigned I to L in order from the top, and the third sub-block SB (upper left) M is assigned to the encoded pixel EP in the block).

図３は、予測モード０と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ａ〜Ｄ番の符号化済み画素ＥＰのそれぞれの画素値を、Ａ〜Ｄ番の符号化済み画素ＥＰのそれぞれの垂直方向に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値（補間が実行される前の画素値）との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 3 is a diagram schematically showing a prediction method called the prediction mode 0. As indicated by arrows, the pixel values of the A to D encoded pixels EP are expressed as A to D, respectively. A prediction image is obtained by performing interpolation using the prediction value of the encoding target pixel UP existing in the vertical direction of each of the encoded pixels EP. Whether or not the predicted image is appropriate is determined by calculating a difference from the initial pixel value (pixel value before the interpolation is performed) of the encoding target pixel UP and by determining the magnitude of the absolute value sum thereof.

図４は、予測モード１と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ｉ〜Ｌ番の符号化済み画素ＥＰのそれぞれの画素値を、Ｉ〜Ｌ番の符号化済み画素ＥＰのそれぞれの水平方向に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 4 is a diagram schematically illustrating a prediction method referred to as prediction mode 1. As indicated by arrows, the pixel values of I to L encoded pixels EP are expressed as I to L, respectively. The prediction image is obtained by performing interpolation using the prediction value of the encoding target pixel UP existing in the horizontal direction of each of the encoded pixels EP. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

図５は、予測モード２と呼称される予測方法を模式的に表す図であり、Ａ〜Ｄ番およびＩ〜Ｌ番の符号化済み画素ＥＰのそれぞれの画素値の平均値を、全ての符号化対象画素ＵＰの予測値として用いて補間を実行することで、予測画像を得るものであり、ＤＣ（直流）モードとも呼称される。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 5 is a diagram schematically illustrating a prediction method referred to as prediction mode 2, in which the average values of the respective pixel values of the A to D and I to L encoded pixels EP are represented by all codes. A prediction image is obtained by executing interpolation using the prediction value of the pixel to be converted UP, and is also referred to as a DC (direct current) mode. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

図６は、予測モード３と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ｂ〜Ｈ番の符号化済み画素ＥＰのそれぞれの画素値を、Ｂ〜Ｈ番の符号化済み画素ＥＰのそれぞれの対角線左下がり方向の位置に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 6 is a diagram schematically showing a prediction method called prediction mode 3, in which the pixel values of B to H encoded pixels EP are expressed as B to H as indicated by arrows. A predicted image is obtained by performing interpolation using the encoded pixel EP as a predicted value of the encoding target pixel UP existing at a position in the diagonally downward left direction of each encoded pixel EP. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

図７は、予測モード４と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ａ〜Ｃ、Ｍ、ＩおよびＪ番の符号化済み画素ＥＰのそれぞれの画素値を、Ａ〜Ｃ、Ｍ、ＩおよびＪ番の符号化済み画素ＥＰのそれぞれの対角線右下がり方向の位置に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 7 is a diagram schematically showing a prediction method called prediction mode 4, and each pixel value of encoded pixels EP of A to C, M, I, and J as indicated by arrows. Is used as the predicted value of the encoding target pixel UP existing at the position of the diagonally lower right direction of each of the encoded pixels EP of A to C, M, I, and J, and the prediction image is executed. obtain. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

図８は、予測モード５と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ｍ、ＪおよびＡ〜Ｃ番の符号化済み画素ＥＰのそれぞれの画素値を、Ｍ、ＪおよびＡ〜Ｃ番の符号化済み画素ＥＰのそれぞれの垂直方向よりも右側の位置に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 8 is a diagram schematically showing a prediction method called prediction mode 5, and as indicated by arrows, respective pixel values of M, J, and A to C encoded pixels EP are expressed as follows. A predicted image is obtained by performing interpolation using the predicted values of the encoding target pixel UP existing at positions on the right side of the vertical direction of each of the M, J, and A to C encoded pixels EP. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

図９は、予測モード６と呼称される予測方法を模式的に表す図であり、矢印で示されるように、ＭおよびＩ〜Ｋ番の符号化済み画素ＥＰのそれぞれの画素値を、ＭおよびＩ〜Ｋ番の符号化済み画素ＥＰのそれぞれの水平方向よりも下側の位置に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 9 is a diagram schematically showing a prediction method called prediction mode 6, and as indicated by arrows, respective pixel values of M and I to K encoded pixels EP are represented by M and A prediction image is obtained by performing interpolation using the prediction values of the encoding target pixel UP existing at positions lower than the horizontal direction of each of the I to K encoded pixels EP. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

図１０は、予測モード７と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ｂ〜Ｅ番の符号化済み画素ＥＰのそれぞれの画素値を、Ｂ〜Ｅ番の符号化済み画素ＥＰのそれぞれの垂直方向よりも左側の位置に存在する、符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 10 is a diagram schematically showing a prediction method called prediction mode 7, and as indicated by arrows, the respective pixel values of the encoded pixels EP of B to E are expressed as B to E. The prediction image is obtained by performing interpolation using the prediction value of the encoding target pixel UP present at the position on the left side of each of the encoded pixels EP in the vertical direction. Note that the suitability of the predicted image is determined by calculating the difference between the original pixel value of the encoding target pixel UP and the magnitude of the sum of absolute values thereof.

図１１は、予測モード８と呼称される予測方法を模式的に表す図であり、矢印で示されるように、Ｊ〜Ｌ番の符号化済み画素ＥＰのそれぞれの画素値を、Ｊ〜Ｌ番の符号化済み画素ＥＰのそれぞれの水平方向よりも上側の位置に存在する符号化対象画素ＵＰの予測値として用いて補間を実行することで予測画像を得る。なお、当該予測画像の適否については、符号化対象画素ＵＰの当初の画素値との差分を算出し、それらの絶対値和の大小によって判断する。 FIG. 11 is a diagram schematically showing a prediction method called the prediction mode 8, and as indicated by arrows, the pixel values of the encoded pixels EP of J to L are represented by J to L, respectively. The prediction image is obtained by performing interpolation using the prediction value of the encoding target pixel UP existing at a position higher than the horizontal direction of each of the encoded pixels EP. Whether or not the predicted image is appropriate is determined by calculating the difference from the original pixel value of the encoding target pixel UP and determining the magnitude of the sum of absolute values thereof.

Ｈ．２６４におけるイントラ予測においては、上述した９通りの予測モードを用いて予測を行い、各モードで得られた差分の絶対値和の最も小さい予測モードが、最適の予測モードであるものと決定し、当該モードを用いて符号化対象画素ＵＰの符号化を行うことになる。 H. In intra prediction in H.264, prediction is performed using the nine prediction modes described above, and the prediction mode with the smallest sum of absolute values of the differences obtained in each mode is determined to be the optimal prediction mode. The encoding target pixel UP is encoded using this mode.

また、図３〜図１１を用いて説明したように、イントラ予測を行うには、符号化対象サブブロックの左側、上側、左上側および右上側のサブブロックを参照し、それらに含まれる符号化済み画素ＥＰの画素値を予測値として使用するので、これらのサブブロックの全てが符号化されていることが必要である。 In addition, as described with reference to FIGS. 3 to 11, in order to perform intra prediction, reference is made to the left, upper, upper left, and upper right subblocks of the encoding target subblock, and the encoding included in them Since the pixel value of the completed pixel EP is used as a predicted value, all of these sub-blocks need to be encoded.

ところが、符号化にあたっては、後に説明するように、変換、量子化、逆量子化および逆変換という処理を順に施す必要があり、１つのサブブロックＳＢの全て符号化対象画素ＵＰを符号化するには多大な処理時間が必要となる。これは、動画像の品位が高まるにつれて増大し、ＨＤＴＶではさらに長時間化することが予想される。 However, in encoding, as will be described later, it is necessary to sequentially perform processing such as transformation, quantization, inverse quantization, and inverse transformation, and all the encoding target pixels UP of one subblock SB are encoded. Requires a lot of processing time. This increases as the quality of moving images increases, and it is expected that the time will be further extended in HDTV.

また、図１を参照して、例えば１２番のサブブロックＳＢに対してイントラ予測を行う場合には、先に説明したように、３番、６番、７番および９番のサブブロックＳＢを参照するが、これらは１２番のサブブロックＳＢのイントラ予測を行う時点では符号化されており、１２番のサブブロックＳＢのイントラ予測に支障は生じないが、例えば１３番のサブブロックＳＢに対してイントラ予測を行う場合、１２番のサブブロックＳＢの符号化が終わらないと、９通りの予測モードを全て実行することができない。 In addition, referring to FIG. 1, for example, when performing intra prediction on the 12th sub-block SB, the 3rd, 6th, 7th and 9th sub-blocks SB are changed as described above. Although referred to, these are encoded at the time when intra prediction of the 12th subblock SB is performed, and there is no problem in the intra prediction of the 12th subblock SB, but for example, for the 13th subblock SB When intra prediction is performed, all nine prediction modes cannot be executed unless encoding of the 12th sub-block SB is completed.

また、１４番のサブブロックＳＢについては、１３番のサブブロックＳＢの符号化が終わっていないと、当該サブブロックＳＢを参照するモード、すなわち予測モード３および７が実行できないことになる。 For the 14th sub-block SB, unless the 13th sub-block SB has been encoded, the modes referring to the sub-block SB, that is, the prediction modes 3 and 7 cannot be executed.

また、０番および１番のサブブロックＳＢにおいては、マクロブロックＭＢが画面内のどの位置にあるかによって状況が異なる。 In the 0th and 1st sub-blocks SB, the situation differs depending on where the macroblock MB is located in the screen.

すなわち、マクロブロックＭＢが画面の一番左上の部分を構成する場合、０番のサブブロックＳＢでは、一切の参照可能なサブブロックが存在しないので、参照値を例えば１２８として、予測モード２により画像予測を行うことになる。 That is, when the macro block MB constitutes the upper left part of the screen, there is no sub-block that can be referred to in the 0th sub-block SB. Make predictions.

また、１番のサブブロックＳＢでは、上側、左上側および右上側のサブブロックが存在しないので、左側、すなわち０番のサブブロックＳＢのみを参照した予測モード１および８と、平均値（０番のサブブロックＳＢの右端の４つの画素の平均値）を用いる予測モード２だけが使用可能であり、０番のサブブロックＳＢの符号化が終わらないと、何れの予測モードも使用できない。これは、５番のサブブロックＳＢについても同様である。 Further, in the first sub-block SB, there are no upper, upper left and upper right sub-blocks, and therefore, prediction modes 1 and 8 referring only to the left side, that is, the 0th sub-block SB, and an average value (0th Only the prediction mode 2 using the average value of the four pixels at the right end of the sub-block SB) can be used, and any prediction mode cannot be used unless the encoding of the 0th sub-block SB is finished. The same applies to the fifth sub-block SB.

しかし、マクロブロックＭＢが、画面の一番左上の右隣りの部分を構成する場合は、０番のサブブロックＳＢは、画面の一番左上のマクロブロックＭＢを構成する５番のサブブロックＳＢを参照することができるようになる。 However, when the macro block MB constitutes the right adjacent portion at the top left of the screen, the 0th sub block SB is the fifth sub block SB that constitutes the top left macro block MB of the screen. You can refer to it.

また、マクロブロックＭＢが、画面の一番左上の下隣りの部分を構成する場合は、０番のサブブロックＳＢは、画面の一番左上のマクロブロックＭＢを構成する９番、１１番のサブブロックＳＢを参照することができ、１番のサブブロックＳＢは、画面の一番左上のマクロブロックＭＢを構成する９番、１１番および１４番のサブブロックＳＢを参照することができる。 When the macro block MB constitutes the lower left upper part of the screen, the 0th sub-block SB corresponds to the 9th and 11th sub blocks constituting the upper left macro block MB of the screen. The block SB can be referred to, and the 1st sub-block SB can refer to the 9th, 11th and 14th sub-blocks SB constituting the upper left macroblock MB of the screen.

このように、イントラ予測においては、参照対象となるサブブロックＳＢの符号化が終わっていないと、９通りの予測モードを全て実行することができない、あるいは使用可能な予測モードの全てを実行することができないので、参照対象となる全てのサブブロックＳＢの符号化が終わってからでないと、符号化対象となるサブブロックＳＢに対してイントラ予測を始めることができず、効率的でないという問題がある。 Thus, in intra prediction, if encoding of the sub-block SB to be referred to is not completed, all nine prediction modes cannot be executed or all usable prediction modes are executed. Therefore, unless all the sub-blocks SB to be referenced are encoded, intra prediction cannot be started for the sub-block SB to be encoded, which is not efficient. .

そこで、発明者達は、直前のサブブロックＳＢに対する符号化が終わっていない段階でイントラ予測を行うことができれば効率化できるものと考え、最適な予測モードを決定するまでの過程を２段階に分け、第１段階では符号化済みの画素値だけでなく、簡易的に符号化前の画素値（未符号化状態の画素値）も予測値として使用するイントラ予測（以後、簡易イントラ予測と呼称）により予測モードの絞り込みを行い、第２段階では、絞り込まれた予測モードについての通常のイントラ予測を行って、最適な予測モードの決定を行うという技術思想に到達した。 Therefore, the inventors consider that the efficiency can be improved if intra prediction can be performed at the stage where encoding for the immediately preceding sub-block SB is not finished, and the process until the optimum prediction mode is determined is divided into two stages. In the first stage, intra prediction using not only encoded pixel values but also pixel values before encoding (pixel values in an unencoded state) simply as prediction values (hereinafter referred to as simple intra prediction). In the second stage, the technical idea of performing the normal intra prediction for the narrowed prediction mode and determining the optimum prediction mode has been reached.

＜簡易イントラ予測を有した画像データの生成方法＞
以下、本発明に係る実施の形態として、簡易イントラ予測を有した画像データの生成方法について説明する。 <Generation method of image data with simple intra prediction>
Hereinafter, a method for generating image data having simplified intra prediction will be described as an embodiment according to the present invention.

＜エンコーダの構成および動作＞
図１２は、簡易イントラ予測を有した画像データの生成を実行するＨ．２６４のエンコーダ１００の構成を示すブロック図である。 <Configuration and operation of encoder>
FIG. 12 shows an H.264 execution for generating image data having simple intra prediction. 2 is a block diagram illustrating a configuration of an H.264 encoder 100. FIG.

図１２に示すようにエンコーダ１００は、簡易イントラ予測部１、イントラ予測部２、変換部３、量子化部４、エントロピー符号化部５、逆量子化部６、逆変換部７、ループフィルタ８、フレームメモリ９、フレーム間予測部１０および動き予測部１１を備えている。 As illustrated in FIG. 12, the encoder 100 includes a simple intra prediction unit 1, an intra prediction unit 2, a conversion unit 3, a quantization unit 4, an entropy encoding unit 5, an inverse quantization unit 6, an inverse conversion unit 7, and a loop filter 8. A frame memory 9, an inter-frame prediction unit 10, and a motion prediction unit 11.

エンコーダ１００に入力された入力画像信号は、簡易イントラ予測部１および動き予測部１１に与えられるとともに、フレーム間予測部１０から出力される予測画像との差分を取るためにも使用される。 The input image signal input to the encoder 100 is given to the simple intra prediction unit 1 and the motion prediction unit 11 and is also used to take a difference from the predicted image output from the inter-frame prediction unit 10.

簡易イントラ予測部１では、参照対象のサブブロック（第１のサブブロック）の符号化済みの画素値だけでなく、符号化前の画素値も予測値（第１の予測値）として使用して、最大で９通りの予測モードを実行し、各予測モードにおける予測画像（第１の予測画像）を得るとともに、符号化対象のサブブロック（第２のサブブロック）の符号化対象画素の画素値との差分の絶対値和を取得する。そして、それらの絶対値和の中から、値の小さなものから順に幾つかを選択することで、予測モードの絞り込みを行って最適予測モード候補を選択する。 The simplified intra prediction unit 1 uses not only the encoded pixel value of the reference target sub-block (first sub-block) but also the pixel value before encoding as the predicted value (first predicted value). A maximum of nine prediction modes are executed to obtain a prediction image (first prediction image) in each prediction mode, and the pixel value of the encoding target pixel of the encoding target sub-block (second sub-block) Get the absolute value sum of the differences. Then, by selecting some of the absolute value sums in descending order, the prediction mode is narrowed down to select the optimal prediction mode candidate.

なお、使用可能な予測モードが少ない場合には、結果的に１つの予測モードだけを最適予測モード候補として選択することもある。 When there are few usable prediction modes, only one prediction mode may be selected as the optimum prediction mode candidate as a result.

簡易イントラ予測部１での絞り込みの結果は、入力画像信号とともにイントラ予測部１に与えられ、イントラ予測部１においては、絞り込みによって得られた最適予測モード候補について、通常のイントラ予測を実行し、最適予測モードを決定する。 The result of narrowing down by the simple intra prediction unit 1 is given to the intra prediction unit 1 together with the input image signal, and the intra prediction unit 1 executes normal intra prediction for the optimum prediction mode candidate obtained by narrowing down. Determine the optimal prediction mode.

ここで、通常のイントラ予測とは、符号化済みの画素値を予測値（第２の予測値）として用いて予測を行うものであり、この段階では、直前のサブブロックに対する符号化が終わっているので、通常のイントラ予測を行うのに不都合は生じない。なお、簡易イントラ予測およびイントラ予測については、後にさらに説明する。 Here, normal intra prediction is to perform prediction using an encoded pixel value as a prediction value (second prediction value). At this stage, encoding for the immediately preceding sub-block is finished. Therefore, there is no inconvenience in performing normal intra prediction. Note that simple intra prediction and intra prediction will be further described later.

イントラ予測部２において決定された最適予測モードによって得られた、各符号化対象画素と予測画像（第２の予測画像）との差分データは、変換部３に与えられる。なお、変換部３には、上述したフレーム内予測を行うフレーム内モードで得られる差分データと、フレーム間予測を行うフレーム間モードで得られる差分データとが、モードによって切り換えて与えられる構成となっている。 Difference data between each encoding target pixel and the predicted image (second predicted image) obtained by the optimal prediction mode determined in the intra prediction unit 2 is given to the conversion unit 3. Note that the conversion unit 3 is configured to be supplied with the difference data obtained in the above-described intraframe mode in which intraframe prediction is performed and the difference data obtained in the interframe mode in which interframe prediction is performed by switching between modes. ing.

変換部３では、与えられた差分データに対して離散コサイン変換（ＤＣＴ）を施し、変換係数を出力するが、Ｈ．２６４では、浮動小数点計算を整数化した整数変換（Integer Transform）を用いる。なお、変換の単位は、４×４画素単位である。 The transform unit 3 performs discrete cosine transform (DCT) on the given difference data and outputs transform coefficients. In H.264, integer transform (Integer Transform) in which floating point calculation is converted into an integer is used. The unit of conversion is a 4 × 4 pixel unit.

量子化部４では、変換部３から与えられるＤＣＴ変換係数に対して量子化を行い、量子化された変換係数を出力する。 The quantization unit 4 quantizes the DCT transform coefficient given from the transform unit 3 and outputs the quantized transform coefficient.

量子化された変換係数はエントロピー符号化部５に与えられ、エントロピー符号化部５では、ベースライン・プロフアイルおよびＸプロファイルでは、指数ゴロム符号（Exponential Golomb Coding）と、それを応用したＣＡＶＬＣ（Context−based Adaptive Variable Length Coding）を使用し、また、メイン・プロファイルおよびハイ・プロファイルではＣＡＢＡＣ（Context−based Adaptive Binary Arithmetic Coding）という算術符号を用いて、量子化された変換係数に対するエントロピー符号化を行って、圧縮された画像データをビットストリームとして出力する。 The quantized transform coefficient is given to the entropy coding unit 5, and in the entropy coding unit 5, in the baseline profile and the X profile, an exponential Golomb code and a CAVLC (Context -Based Adaptive Variable Length Coding), and in the main profile and high profile, CABAC (Context-based Adaptive Binary Arithmetic Coding) is used to perform entropy coding on the quantized transform coefficients. The compressed image data is output as a bit stream.

なお、量子化された変換係数は逆量子化部６にも与えられ、さらに逆変換部７を経て差分データに戻される。 Note that the quantized transform coefficient is also supplied to the inverse quantization unit 6 and further returned to the difference data through the inverse transform unit 7.

フレーム間モードにおいては、スイッチＳＷ１が入って、フレーム間予測部１０で作成された直前のフレームの画素値に上記差分データがそれぞれ加えられた後に、ループフィルタ８を介してフレームメモリ９に蓄積される。このフレームのデータはデコーダ側で再生される符号化済み画像と同じであり、直前のフレームとして、次のフレームを作成するときに使用される。 In the inter-frame mode, the switch SW1 is turned on and the difference data is added to the pixel value of the immediately preceding frame created by the inter-frame prediction unit 10 and then stored in the frame memory 9 via the loop filter 8. The The data of this frame is the same as the encoded image reproduced on the decoder side, and is used when the next frame is created as the immediately preceding frame.

一方、フレーム内モードの場合は、スイッチＳＷ２が入って、イントラ予測部２において決定された最適の予測モードによって得られた予測画像の画素値に、逆変換後の差分データがそれぞれ加えられて、符号化済み画像データとして、簡易イントラ予測部１およびイントラ予測部２に与えられ、簡易イントラ予測およびイントラ予測に使用される。なお、この符号化済み画像データはデコーダ側で再生される画像データと同じである。 On the other hand, in the case of the intra-frame mode, the switch SW2 is turned on, and the difference data after inverse transformation is added to the pixel value of the predicted image obtained by the optimal prediction mode determined in the intra prediction unit 2, The encoded image data is given to the simple intra prediction unit 1 and the intra prediction unit 2 and is used for simple intra prediction and intra prediction. The encoded image data is the same as the image data reproduced on the decoder side.

ここで、変換部３、量子化部４、逆量子化部６および逆変換部７における、変換処理、量子化処理、逆変換処理および逆量子化処理によって、デコーダ側で再生される画像データと同じ画像データを得る処理をローカルデコードと呼称する。 Here, the image data reproduced on the decoder side by the transformation process, the quantization process, the inverse transformation process and the inverse quantization process in the transformation unit 3, the quantization unit 4, the inverse quantization unit 6 and the inverse transformation unit 7 The process for obtaining the same image data is called local decoding.

ループフィルタ８は、整数変換のブロック境界のみを平滑化してブロックノイズの発生を抑制するデブロッキングフィルタである。 The loop filter 8 is a deblocking filter that smoothes only the block boundary of integer conversion and suppresses the generation of block noise.

フレーム間予測部１０では、フレームメモリ９に蓄積された、直前のフレームの画像データを使用して、画面の動きを考慮して次のフレームの予測画像を作成する動き補償が実行される。ここで作成された予測画像は、フレーム間モードにおいて、入力画像との間で差分を取るために使用される。 The inter-frame prediction unit 10 uses the image data of the immediately previous frame stored in the frame memory 9 to perform motion compensation that creates a predicted image of the next frame in consideration of screen motion. The predicted image created here is used to obtain a difference from the input image in the inter-frame mode.

また、動き予測部１１においては、入力画像信号から動きベクトルを検出して画面の動き予測を行い、その結果をフレーム間予測部１０に与える。 Further, the motion prediction unit 11 detects a motion vector from the input image signal, performs motion prediction of the screen, and gives the result to the inter-frame prediction unit 10.

＜パイプライン処理＞
次に、図１２を参照しつつ、パイプライン処理を模式的に示す図１３を用いて、簡易イントラ予測およびイントラ予測について説明する。 <Pipeline processing>
Next, simple intra prediction and intra prediction will be described with reference to FIG. 12 and FIG. 13 schematically showing pipeline processing.

図１３においては横軸に時間の経過を示し、１連の処理サイクルをブロック処理サイクルとして示しており、図１３では（ｎ−１）、（ｎ）および（ｎ＋１）の３つのブロック処理サイクルを示している。 In FIG. 13, the horizontal axis indicates the passage of time, and a series of processing cycles is shown as a block processing cycle. In FIG. 13, three block processing cycles (n−1), (n), and (n + 1) are shown. Show.

ここで、ｎ番のサブブロックが符号化対象のサブブロックであるとすると、ｎ番のサブブロックに対して、簡易イントラ予測部１において、図３〜図１１を用いて説明した予測モード０〜８の９通りの予測モードを実行する。 Here, assuming that the n-th sub-block is a sub-block to be encoded, the simplified intra prediction unit 1 performs prediction modes 0 to 0 described with reference to FIGS. 3 to 11 for the n-th sub-block. Nine prediction modes of 8 are executed.

簡易イントラ予測においては、参照対象となるサブブロックの符号化が終わっていない場合には、入力画像信号の画素値を予測値として使用する。 In simple intra prediction, when the encoding of a sub-block to be referenced is not finished, the pixel value of the input image signal is used as a prediction value.

例えば、図１に示す１３番のサブブロックＳＢをｎ番のサブブロックとした場合、通常のイントラ予測では１２番のサブブロックＳＢの符号化が終わらないと、９通りの予測モードを全て実行することができないが、簡易イントラ予測では１２番のサブブロックＳＢの未符号化の状態にある画素、すなわち符号化対象画素の画素値を用いることで、９通りの予測モードを全て実行することができる。 For example, if the 13th subblock SB shown in FIG. 1 is the nth subblock, all the nine prediction modes are executed if the encoding of the 12th subblock SB is not completed in normal intra prediction. However, in the simple intra prediction, all the nine prediction modes can be executed by using the unencoded pixel of the 12th sub-block SB, that is, the pixel value of the encoding target pixel. .

また、図１における１番のサブブロックＳＢについては、０番のサブブロックＳＢの符号化対象画素の画素値を用いることで、０番のサブブロックＳＢの符号化が終わらない状態で、予測モード１、２および８を実行することができる。 Further, for the first sub-block SB in FIG. 1, the prediction mode is used in a state in which the encoding of the 0-th sub-block SB is not finished by using the pixel value of the encoding target pixel of the 0-th sub-block SB. 1, 2 and 8 can be performed.

図１３に示すように、ｎ番のサブブロックに対する簡易イントラ予測をブロック処理サイクル（ｎ）において行うが、９通りの予測モードを全て実行するには時間が必要であり、ブロック処理サイクル（ｎ）のほぼ全期間を費やして簡易イントラ予測を実行する。 As shown in FIG. 13, simple intra prediction for the nth sub-block is performed in the block processing cycle (n), but it takes time to execute all nine prediction modes, and the block processing cycle (n) The simple intra prediction is executed over almost the entire period.

そして、各モードにおいて得られた、予測画像と符号化対象画素ＵＰの当初の画素値との差分の絶対値和の中から、値の小さなもの順に幾つかを選択することで、予測モードの絞り込みを行う。 Then, by selecting some of the absolute value sums of the differences between the predicted image and the initial pixel value of the encoding target pixel UP obtained in each mode, the prediction mode is narrowed down by selecting some of them in ascending order. I do.

このように、予測モードを１つに決定するのではなく、幾通りかの最適予測モード候補に絞り込むのは、簡易イントラ予測が、通常のイントラ予測ほど精度の高いものではないからである。 As described above, the reason why the prediction mode is not determined to be one, but is narrowed down to several optimum prediction mode candidates is that the simple intra prediction is not as accurate as the normal intra prediction.

また、ブロック処理サイクル（ｎ）では、簡易イントラ予測と平行して、ｎ−１番のサブブロックに対して、イントラ予測部２において通常のイントラ予測を実行する。 Further, in the block processing cycle (n), in parallel with the simple intra prediction, the intra prediction unit 2 executes normal intra prediction on the (n−1) th subblock.

ｎ−１番のサブブロックに関しては、ブロック処理サイクル（ｎ−１）において簡易イントラ予測を実行して予測モードの絞り込みを行っているので、当該絞り込みの結果を反映させて、ブロック処理サイクル（ｎ）において通常のイントラ予測を実行することで、当該処理サイクル期間内で最適予測モードを決定することができる。 For the (n−1) th sub-block, since the prediction mode is narrowed down by executing the simple intra prediction in the block processing cycle (n−1), the result of the narrowing down is reflected and the block processing cycle (n ), The optimum prediction mode can be determined within the processing cycle period.

すなわち、ここで実行されるイントラ予測では、簡易イントラ予測で幾通りかに絞り込まれた最適予測モード候補についてのみ、再度、予測を実行すれば良く、簡易イントラ予測よりも短い時間で最適予測モードを決定することができる。また、ここで実行されるイントラ予測では、参照対象画素は全て符号化されているので、精度の高い予測が可能となる。 That is, in the intra prediction performed here, only the optimal prediction mode candidates narrowed down by the simple intra prediction need be executed again, and the optimal prediction mode can be set in a shorter time than the simple intra prediction. Can be determined. Moreover, in the intra prediction performed here, since all reference object pixels are encoded, prediction with high accuracy is possible.

なお、ブロック処理サイクル（ｎ−１）においては、ｎ−２番のサブブロックに対するローカルデコードが終わっているので、ｎ−１番のサブブロックに対して、通常のイントラ予測を実行することについて支障は生じない。 In the block processing cycle (n−1), since local decoding for the n−2th subblock has been completed, it is difficult to perform normal intra prediction for the n−1th subblock. Does not occur.

ｎ−１番のサブブロックに対して、通常のイントラ予測を実行して最適予測モードを決定した後は、ブロック処理サイクル（ｎ−１）において、当該最適予測モードを使用して、ｎ−１番のサブブロックのローカルデコードを実行する。 After performing the normal intra prediction and determining the optimal prediction mode for the n-1st sub-block, in the block processing cycle (n-1), the optimal prediction mode is used, and n-1 Perform local decoding of sub-block number.

ローカルデコードによって符号化された、ｎ−１番のサブブロックの画素データはイントラ予測部２内のメモリに記憶され、ｎ番のサブブロックに対する通常のイントラ予測において使用される。 The pixel data of the (n−1) th subblock encoded by local decoding is stored in the memory in the intra prediction unit 2 and used in normal intra prediction for the nth subblock.

同様に、ブロック処理サイクル（ｎ＋１）においては、ｎ＋１番のサブブロックに関しての簡易イントラ予測を実行するとともに、ｎ番のサブブロックに対して、通常のイントラ予測を実行して最適予測モードを決定し、その後は、当該最適予測モードで得られた差分データを使用して、ｎ番のサブブロックのローカルデコードを実行する。 Similarly, in the block processing cycle (n + 1), the simple intra prediction for the n + 1-th subblock is executed, and the normal prediction is executed for the n-th subblock to determine the optimum prediction mode. Thereafter, the local decoding of the nth sub-block is executed using the difference data obtained in the optimum prediction mode.

図１４には、符号化対象サブブロックに対して実行する、簡易イントラ予測、通常のイントラ予測およびローカルデコードの流れをフローチャートで示す。 FIG. 14 is a flowchart showing the flow of simple intra prediction, normal intra prediction, and local decoding to be performed on the encoding target sub-block.

図１４において、エンコーダ１００に符号化対象画像のデータが入力されると、簡易イントラ予測部１において簡易イントラ予測が実行され（ステップＳ１）、その後、各モードにおいて得られた予測画像と、符号化対象画素の当初の画素値との差分の絶対値和の中から、値の小さなもの順に幾つかを選択することで、予測モードが絞り込まれる（ステップＳ２）。なお、イントラ予測部２にも符号化対象画素のデータが与えられる。 In FIG. 14, when data of an encoding target image is input to the encoder 100, simple intra prediction is performed in the simple intra prediction unit 1 (step S1), and then the predicted image obtained in each mode, and the encoding are performed. The prediction mode is narrowed down by selecting several from the sum of absolute values of differences from the original pixel value of the target pixel in ascending order of values (step S2). The intra prediction unit 2 is also provided with the data of the encoding target pixel.

そして、イントラ予測部２において、ステップＳ２で絞り込まれた最適予測モード候補に対して、符号化された画素値を用いてイントラ予測を実行し（ステップＳ３）、その後、各モードにおいて得られた、予測画像（第１の予測画像）と、符号化対象画素の当初の画素値との差分の絶対値和の中から、値の最も小さなものを１つ選択することで、最適予測モードを決定する（ステップＳ４）。 Then, the intra prediction unit 2 performs intra prediction on the optimal prediction mode candidates narrowed down in step S2 using the encoded pixel values (step S3), and then obtained in each mode. The optimum prediction mode is determined by selecting one of the absolute value sums of differences between the prediction image (first prediction image) and the original pixel value of the encoding target pixel to select one having the smallest value. (Step S4).

次に、決定した最適予測モードによって得られた予測画像（第２の予測画像）と各符号化対象画素の当初の画素値との差分データに対して、変換部３において直交変換（例えばＤＣＴ）を施し（ステップＳ５）、続いて、量子化部４における量子化（ステップＳ６）、逆量子化部６における逆量子化（ステップＳ７）、逆変換部７における逆変換を施す（ステップＳ８）。 Next, orthogonal transform (for example, DCT) is performed on the difference data between the predicted image (second predicted image) obtained by the determined optimal prediction mode and the original pixel value of each encoding target pixel. (Step S5), followed by quantization in the quantization unit 4 (step S6), inverse quantization in the inverse quantization unit 6 (step S7), and inverse transformation in the inverse transformation unit 7 (step S8).

そして、最適予測モードによって得られた予測画像に逆変換後のデータ（差分データ）を加えることで、符号化済み画像データを生成し（ステップＳＴ９）、イントラ予測部２にフィードバックする。 Then, encoded image data is generated by adding data (difference data) after inverse transformation to the predicted image obtained in the optimal prediction mode, and fed back to the intra prediction unit 2.

＜効果＞
以上説明したように、エンコーダ１００においては、符号化前の画素値も予測値として使用してイントラ予測を行う簡易イントラ予測部１を設け、直前のサブブロックに対する符号化が終わっていない段階でも画像予測を始めることができ、イントラ予測に費やす時間を短縮して、イントラ予測の効率化を図ることができる。 <Effect>
As described above, in the encoder 100, the simplified intra prediction unit 1 that performs intra prediction using the pixel value before encoding as a prediction value is provided, and the image is processed even when the encoding for the immediately preceding sub-block is not finished. Prediction can be started, the time spent for intra prediction can be shortened, and the efficiency of intra prediction can be improved.

また、同じ符号化対象サブブロックに対する簡易イントラ予測と、通常のイントラ予測とを、パイプライン処理により異なる処理サイクルで処理するものとし、ローカルデコードは通常のイントラ予測の後に行うようにしたので、画像データの符号化処理を効率的に行うことができる。 In addition, simple intra prediction and normal intra prediction for the same encoding target sub-block are processed in different processing cycles by pipeline processing, and local decoding is performed after normal intra prediction. Data encoding processing can be performed efficiently.

１６×１６画素で構成されるマクロブロックを模式的に表す図である。It is a figure which represents typically the macroblock comprised by 16x16 pixel. サブブロックを構成する１６個の符号化対象画素と、イントラ予測において使用される参照画素を模式的に表す図である。It is a figure which represents typically the 16 encoding object pixel which comprises a subblock, and the reference pixel used in intra prediction. 予測モード０を模式的に表す図である。It is a figure which represents prediction mode 0 typically. 予測モード１を模式的に表す図である。It is a figure which represents prediction mode 1 typically. 予測モード２を模式的に表す図である。It is a figure which represents the prediction mode 2 typically. 予測モード３を模式的に表す図である。It is a figure which represents the prediction mode 3 typically. 予測モード４を模式的に表す図である。It is a figure which represents the prediction mode 4 typically. 予測モード５を模式的に表す図である。It is a figure which represents the prediction mode 5 typically. 予測モード６を模式的に表す図である。It is a figure which represents the prediction mode 6 typically. 予測モード７を模式的に表す図である。It is a figure which represents the prediction mode 7 typically. 予測モード８を模式的に表す図である。It is a figure which represents the prediction mode 8 typically. 本発明に係る簡易イントラ予測を有した画像データの生成方法を実行するＨ．２６４のエンコーダの構成を示すブロック図である。A method for generating image data having simplified intra prediction according to the present invention is described. 2 is a block diagram illustrating a configuration of an H.264 encoder. FIG. パイプライン処理を模式的に示す図である。It is a figure which shows a pipeline process typically. 本発明に係る簡易イントラ予測を有した画像データの生成方法を説明するフローチャートである。It is a flowchart explaining the production | generation method of the image data which has the simple intra prediction which concerns on this invention.

Explanation of symbols

ＭＢマクロブロック
ＳＢサブブロック
ＥＰ符号化済み画素
ＵＰ符号化対象画素
ＢＬブロック
MB Macro block SB Sub block EP Encoded pixel UP Encoding target pixel BL block

Claims

A method of generating image data having intra prediction that performs image prediction of a second sub-block to be encoded using pixel values of a plurality of encoded first sub-blocks,
Prior to the intra prediction,
When the encoding of the first sub-block is not completed, the pixel value before the encoding of the first sub-block is set as a first predicted value, and the first predicted value is used to Perform interpolation for each of a plurality of prediction modes for two sub-blocks,
Based on the result of interpolation for each of the plurality of prediction modes, having at least one of the plurality of prediction modes to be a simple intra prediction as an optimal prediction mode candidate,
The intra prediction performs the image prediction using the at least one optimal prediction mode candidate,
The intra prediction is performed after encoding is completed for all of the first sub-blocks;
The pixel value after encoding of the first sub-block that has been encoded is set as a second predicted value, and the at least one optimal value is used for the second sub-block using the second predicted value. Perform interpolation for each prediction mode candidate,
On the basis of the results of at least one optimum prediction mode candidate for each of the interpolation, it has rows the picture prediction to determine the optimum prediction mode from the at least one optimum prediction mode candidates,
In the simple intra prediction ,
For the first sub-block that has been encoded , image data that uses the pixel value after encoding of the first sub-block or the pixel value before encoding as the first predicted value Generation method.

In the simple intra prediction ,
The absolute value sum of the differences between the first predicted image of the second sub-block obtained as a result of the interpolation for each of the plurality of prediction modes and the pixel value before the interpolation of the second sub-block is executed The image data generation method according to claim 1, wherein the at least one optimum prediction mode candidate is selected by acquiring for each of the plurality of prediction modes and selecting in order from the smallest absolute value sum .

The intra prediction is
The difference between the second predicted image of the second sub-block obtained as a result of interpolation for each of the at least one optimum prediction mode candidate and the pixel value before the interpolation of the second sub-block is executed The method of generating image data according to claim 1 , wherein an absolute value sum is obtained for each of the at least one optimum prediction mode candidate, and the optimum prediction mode is determined by selecting the smallest absolute value sum. .

The simple intra prediction and the intra prediction are executed by pipeline processing,
The pipeline processing is
In the nth processing cycle in which the simple intra prediction is executed for the nth subblock, for the n−1th subblock in which the simple intra prediction is executed in the n−1th processing cycle. 2. The image data according to claim 1 , wherein the intra prediction is performed and local decoding is performed on the n−1th sub-block using the optimal prediction mode determined by the intra prediction. Generation method.