JP2013090253A

JP2013090253A - Image processor and program

Info

Publication number: JP2013090253A
Application number: JP2011231113A
Authority: JP
Inventors: Kazuhisa Iguchi; 和久井口; Toshie Misu; 俊枝三須; Yoshiaki Shishikui; 善明鹿喰; Atsuro Ichigaya; 敦郎市ヶ谷
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2011-10-20
Filing date: 2011-10-20
Publication date: 2013-05-13
Anticipated expiration: 2031-10-20
Also published as: JP5789172B2

Abstract

PROBLEM TO BE SOLVED: To improve efficiency of in-screen prediction while suppressing increase in a calculation amount.SOLUTION: An image processor for performing in-screen prediction using plural prediction directions comprises: a sorting unit for sorting plural reference pixels into plural groups on the basis of possibility of being referred to; a filter determination unit for determining different filters for respective sorted groups; a filtering unit which filters the pixel values of the reference pixels included in each group using the filter determined by the filter determination unit; and a generation unit for generating a predictive image of the in-screen prediction using the pixel values filtered by the filtering unit.

Description

本発明は、複数の予測方向を用いて画面内予測を行う画像処理装置及びプログラムに関する。 The present invention relates to an image processing apparatus and a program for performing intra prediction using a plurality of prediction directions.

近年マルチメディア化が進み、動画像を扱う機会が多くなってきている。画像データは情報量が大きく、画像の集合である動画像は、テレビ放送の主流であるハイビジョン（１９２０×１０８０）放送で１Ｇｂｉｔ／ｓｅｃを超える情報量である。 In recent years, with the advance of multimedia, there are many opportunities to handle moving images. Image data has a large amount of information, and a moving image, which is a set of images, has an information amount exceeding 1 Gbit / sec in high-definition (1920 × 1080) broadcasting, which is the mainstream of television broadcasting.

そのため、動画像データは、Ｈ．２６４／ＡＶＣ（Advanced Video Coding）や規格化作業中のＨＥＶＣ（High Efficiency Video Coding）などに代表される符号化標準技術によって、情報量を圧縮して伝送・蓄積が行われている。 Therefore, the moving image data is H.264. H.264 / AVC (Advanced Video Coding) and standardization work HEVC (High Efficiency Video Coding) and the like are used for transmission and storage by compressing the amount of information.

符号化標準技術であるＨ．２６４／ＡＶＣは、直交変換や画面間予測、画面内予測、算術符号化、デブロッキングフィルタなどのツールを利用して、動画像の１／１００程度までの圧縮を実現している。ツールの一つである画面内予測は、処理対象ブロックに隣接する画素値を参照画素として、複数の予測モードから最適な予測モードを決定して画面内予測を行う。 H., an encoding standard technology. H.264 / AVC uses a tool such as orthogonal transform, inter-screen prediction, intra-screen prediction, arithmetic coding, deblocking filter, and the like to achieve compression of about 1/100 of a moving image. Intra prediction, which is one of the tools, performs intra prediction by determining an optimal prediction mode from a plurality of prediction modes using a pixel value adjacent to the processing target block as a reference pixel.

また、Ｈ．２６４／ＡＶＣで、高画質化のために追加された技術に、８×８の画面内予測がある（非特許文献１）。 H. As a technique added to improve image quality in H.264 / AVC, there is 8 × 8 intra-screen prediction (Non-Patent Document 1).

図１は、Ｈ．２６４／ＡＶＣの８×８の画面内予測を説明するための図である。図１に示す画素Ａ〜Ｙは、参照画素を示し、文字が無い白丸の画素は、処理対象ブロックの画素を示す。ここで、画面内予測を行う前に、参照画素Ａ〜Ｙに対して、［１／４，１／２，１／４］のローパスフィルタが適用される。ローパスフィルタは以下、ＬＰＦとも呼ぶ。 FIG. 3 is a diagram for explaining H.264 / AVC 8 × 8 intra-screen prediction. FIG. Pixels A to Y shown in FIG. 1 indicate reference pixels, and white circle pixels without characters indicate pixels of a processing target block. Here, before performing intra prediction, a low-pass filter of [1/4, 1/2, 1/4] is applied to the reference pixels A to Y. Hereinafter, the low-pass filter is also referred to as LPF.

例えば、参照画素Ｃを参照するときは、Ｂ×０．２５＋Ｃ×０．５＋Ｄ×０．２５の画素値が用いられる。両端の参照画素ＡとＹとについては、［１／４，３／４］のフィルタが用いられる。例えば、参照画素Ａを参照するときは、Ｂ×０．２５＋Ａ×０．７５の画素値が用いられる。 For example, when referring to the reference pixel C, a pixel value of B × 0.25 + C × 0.5 + D × 0.25 is used. For the reference pixels A and Y at both ends, [1/4, 3/4] filters are used. For example, when referring to the reference pixel A, a pixel value of B × 0.25 + A × 0.75 is used.

また、規格化作業中であるＨＥＶＣ方式でも、Ｈ．２６４／ＡＶＣの８×８の画面内予測で用いた参照画素のフィルタリングが、ブロックサイズに限らず画面内予測の前に行なわれることで規格化が進められている。 Even in the HEVC system, which is being standardized, H.264 / AVC 8 × 8 intra-screen prediction uses reference pixel filtering not only for block size but before intra-screen prediction, so that standardization is being promoted.

このＨＥＶＣ方式において、着目画素の前後の画素値の差の絶対値が閾値以下の場合はフィルタリングを行い、閾値より大きい場合はフィルタリングを行わない、という参照画素のフィルタリング方法が提案されている（非特許文献２）。 In this HEVC method, a reference pixel filtering method has been proposed in which filtering is performed when the absolute value of the difference between the pixel values before and after the pixel of interest is equal to or smaller than a threshold value, and filtering is not performed when the absolute value is larger than the threshold value (non-null). Patent Document 2).

提案されたフィルタリング方法では、例えば、図１に示すＤの画素を参照する場合、ａｂｓ（Ｃ−Ｅ）の値が閾値以下の場合はフィルタリングを行った画素値を用い、ａｂｓ（Ｃ−Ｅ）の値が閾値を超える場合はＤの値をそのまま用いる。 In the proposed filtering method, for example, when referring to the pixel D shown in FIG. 1, if the value of abs (CE) is equal to or less than the threshold, the filtered pixel value is used, and abs (CE). When the value of exceeds the threshold, the value of D is used as it is.

大久保榮監修，「改訂三版Ｈ．２６４／ＡＶＣ教科書」，インプレスＲ＆Ｄ，ｐ．２６４−２６８，２００９年１月１日Supervised by Satoshi Okubo, “Revised Third Edition H.264 / AVC Textbook”, Impress R & D, p. 264-268, January 1, 2009 Jane Zhao,「On intra prediction(JCTVC-E437)」,Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 5th Meeting: Geneva, CH, 16-23 March, 2011Jane Zhao, `` On intra prediction (JCTVC-E437) '', Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO / IEC JTC1 / SC29 / WG11 5th Meeting: Geneva, CH, 16-23 March, 2011

ここで、画面内予測の参照画素にフィルタリングを行う目的は、参照画素に含まれるノイズを低減することである。フィルタリングを行わない場合、参照画素にノイズが含まれていると、画面内予測の予測方向にそのノイズが引き延ばされてしまう。この場合、処理対象ブロック（予測ブロック）には存在していない縞模様が画面内予測により生成されるため、画面内予測の効率が低下する。 Here, the purpose of filtering the reference pixels for intra prediction is to reduce noise contained in the reference pixels. When filtering is not performed, if noise is included in the reference pixel, the noise is extended in the prediction direction of intra prediction. In this case, since a striped pattern that does not exist in the processing target block (predicted block) is generated by intra prediction, the efficiency of intra prediction is reduced.

また、画面内予測の予測方向にエッジが存在していた場合、ＬＰＦを適用することでエッジの値が変化するため、画面内予測の効率が低下する。これは、本来存在していたエッジとは異なるエッジが、画面内予測により生成されるからである。 Further, when an edge exists in the prediction direction of the intra prediction, the value of the edge is changed by applying the LPF, so that the efficiency of the intra prediction is reduced. This is because an edge different from the edge that originally existed is generated by intra prediction.

このように、ノイズ削減効果と、エッジが存在していた場合に画面内予測の効率が低下することは表裏一体の関係となっている。 Thus, the noise reduction effect and the fact that the efficiency of intra prediction is reduced when there is an edge is a two-sided relationship.

そのため、非特許文献２では、参照画素に隣接する画素値を用いて、ノイズとエッジとの判定を行うことで、画面内予測の効率を向上させていた。 For this reason, in Non-Patent Document 2, the efficiency of intra prediction is improved by determining noise and edge using a pixel value adjacent to a reference pixel.

しかしながら、非特許文献２では、参照画素ごとに、参照画素に隣接する画素値を用いた判定を行うため、計算量が増大してしまうという問題点があった。 However, Non-Patent Document 2 has a problem in that the amount of calculation increases because determination is performed using a pixel value adjacent to the reference pixel for each reference pixel.

そこで、本発明は、上記課題に鑑みてなされたものであり、計算量の増加を抑えつつ、画面内予測の効率を向上させることができる画像処理装置及びプログラムを提供することを目的とする。 Therefore, the present invention has been made in view of the above problems, and an object thereof is to provide an image processing apparatus and a program capable of improving the efficiency of intra prediction while suppressing an increase in calculation amount.

本発明の一態様における画像処理装置は、複数の予測方向を用いて画面内予測を行う画像処理装置であって、参照される可能性に基づいて複数の参照画素を複数のグループに分類する分類部と、分類された各グループに対して異なるフィルタを決定するフィルタ決定部と、前記フィルタ決定部により決定されたフィルタを用いて、各グループに含まれる前記参照画素の画素値にフィルタリングを行うフィルタリング部と、前記フィルタリング部によるフィルタリング後の画素値を用いて、前記画面内予測の予測画像を生成する生成部と、を備える。 An image processing apparatus according to an aspect of the present invention is an image processing apparatus that performs intra prediction using a plurality of prediction directions, and classifies a plurality of reference pixels into a plurality of groups based on a possibility of being referred to. Filtering for filtering the pixel values of the reference pixels included in each group using a filter, a filter determination unit that determines a different filter for each classified group, and a filter determined by the filter determination unit And a generation unit that generates a predicted image of the intra prediction using the pixel value after filtering by the filtering unit.

また、前記分類部は、参照される予測方向数の数に基づいて前記複数の参照画素を分類してもよい。 The classification unit may classify the plurality of reference pixels based on the number of reference prediction directions.

また、前記分類部は、処理対象ブロックに隣接するか否かで前記複数の参照画素を分類してもよい。 Further, the classification unit may classify the plurality of reference pixels depending on whether or not they are adjacent to the processing target block.

また、前記分類部は、前記処理対象ブロックに隣接する参照画素、又は前記処理対象ブロックに隣接しない画素をさらに複数のグループに分類してもよい。 The classification unit may further classify reference pixels adjacent to the processing target block or pixels not adjacent to the processing target block into a plurality of groups.

また、前記フィルタ決定部は、前記グループ内の参照画素が参照される可能性が高いほど、通過帯域の広いローパスフィルタに決定してもよい。 The filter determination unit may determine a low-pass filter having a wider pass band as the reference pixel in the group is more likely to be referred to.

また、前記フィルタ決定部は、参照される可能性が最も高い参照画素を含む第１グループに対して、ノイズ除去フィルタに決定し、前記第１グループ以外のグループに対して、該グループ内の参照画素が参照される可能性が高いほど、通過帯域の広いローパスフィルタに決定してもよい。 The filter determination unit determines a noise removal filter for a first group including a reference pixel that is most likely to be referred to, and for a group other than the first group, a reference within the group. You may determine to a low pass filter with a wide pass band, so that a possibility that a pixel will be referred to is high.

また、本発明の他の態様におけるプログラムは、参照される可能性に基づいて複数の参照画素を複数のグループに分類する分類ステップと、分類された各グループに対して異なるフィルタを決定するフィルタ決定ステップと、決定されたフィルタを用いて、各グループに含まれる前記参照画素の画素値にフィルタリングを行うフィルタリングステップと、フィルタリング後の画素値を用いて、画面内予測の予測画像を生成する生成ステップとをコンピュータに実行させる。 A program according to another aspect of the present invention includes a classification step for classifying a plurality of reference pixels into a plurality of groups based on a possibility of reference, and a filter determination for determining a different filter for each classified group. A filtering step for filtering the pixel values of the reference pixels included in each group using the determined filter, and a generation step for generating a prediction image for intra prediction using the pixel values after filtering And let the computer run.

本発明によれば、計算量の増加を抑えつつ、画面内予測の効率を向上させることができる。 According to the present invention, it is possible to improve the efficiency of in-screen prediction while suppressing an increase in the amount of calculation.

Ｈ．２６４／ＡＶＣの８×８の画面内予測を説明するための図。H. The figure for demonstrating the prediction in H.264 / AVC 8x8. 実施例１における画像符号化装置の概略構成の一例を示すブロック図。1 is a block diagram illustrating an example of a schematic configuration of an image encoding device according to Embodiment 1. FIG. イントラ予測部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of an intra estimation part. ８×８の画面内予測符号化の予測方向を示す図。The figure which shows the prediction direction of 8 * 8 in-screen prediction encoding. Ｈ．２６４／ＡＶＣにおける参照画素Ｙを参照する予測方向を示す図。H. The figure which shows the prediction direction which refers to the reference pixel Y in H.264 / AVC. Ｈ．２６４／ＡＶＣにおける参照画素Ｕを参照する予測方向を示す図。H. The figure which shows the prediction direction which refers the reference pixel U in H.264 / AVC. Ｈ．２６４／ＡＶＣにおける参照画素Ｑを参照する予測方向を示す図。H. The figure which shows the prediction direction which refers the reference pixel Q in H.264 / AVC. Ｈ．２６４／ＡＶＣにおける参照画素Ｍを参照する予測方向を示す図。H. The figure which shows the prediction direction which refers to the reference pixel M in H.264 / AVC. 参照画素を２分類する一例を示す図。The figure which shows an example which classify | categorizes a reference pixel into two. 参照画素を３分類する一例を示す図。The figure which shows an example which classify | categorizes a reference pixel into three. 実験結果を示す図。The figure which shows an experimental result. 実施例１におけるが画面内予測処理の一例を示すフローチャート。The flowchart which shows an example of the prediction process in a screen in Example 1. FIG. 実施例２における画像符号化装置の構成の一例を示すブロック図。FIG. 6 is a block diagram illustrating an example of a configuration of an image encoding device according to a second embodiment.

以下、添付図面を参照しながら各実施例について詳細に説明する。 Hereinafter, embodiments will be described in detail with reference to the accompanying drawings.

［実施例１］
＜構成＞
図２は、実施例１における画像符号化装置１００の概略構成の一例を示すブロック図である。図２に示す例では、画像符号化装置１００は、予測誤差信号生成部１０１、直交変換部１０２、量子化部１０３、エントロピー符号化部１０４、逆量子化部１０５、逆直交変換部１０６、復号画像生成部１０７、デブロッキングフィルタ部１０８、復号画像記憶部１０９、イントラ予測部１１０、インター予測部１１１、動きベクトル計算部１１２、符号化制御及びヘッダ生成部１１３及び予測画像選択部１１４を有する。各部についての概略を以下に説明する。 [Example 1]
<Configuration>
FIG. 2 is a block diagram illustrating an example of a schematic configuration of the image encoding device 100 according to the first embodiment. In the example illustrated in FIG. 2, the image encoding device 100 includes a prediction error signal generation unit 101, an orthogonal transform unit 102, a quantization unit 103, an entropy encoding unit 104, an inverse quantization unit 105, an inverse orthogonal transform unit 106, and a decoding The image generation unit 107, the deblocking filter unit 108, the decoded image storage unit 109, the intra prediction unit 110, the inter prediction unit 111, the motion vector calculation unit 112, the encoding control and header generation unit 113, and the predicted image selection unit 114 are included. An outline of each part will be described below.

予測誤差信号生成部１０１は、入力された動画像データの符号化対象画像が、例えば１６×１６画素のブロックに分割されたブロックデータを取得する。 The prediction error signal generation unit 101 acquires block data obtained by dividing an encoding target image of input moving image data into blocks of 16 × 16 pixels, for example.

予測誤差信号生成部１０１は、そのブロックデータと、予測画像選択部１１４から出力される予測画像のブロックデータとにより、予測誤差信号を生成する。予測誤差信号生成部１０１は、生成された予測誤差信号を直交変換部１０２に出力する。 The prediction error signal generation unit 101 generates a prediction error signal from the block data and the block data of the prediction image output from the prediction image selection unit 114. The prediction error signal generation unit 101 outputs the generated prediction error signal to the orthogonal transformation unit 102.

直交変換部１０２は、入力された予測誤差信号を直交変換処理する。直交変換部１０２は、直交変換処理によって水平及び垂直方向の周波数成分に分離された信号を量子化部１０３に出力する。 The orthogonal transform unit 102 performs orthogonal transform processing on the input prediction error signal. The orthogonal transform unit 102 outputs a signal separated into horizontal and vertical frequency components by the orthogonal transform process to the quantization unit 103.

量子化部１０３は、直交変換部１０２からの出力信号を量子化する。量子化部１０３は、量子化することによって出力信号の符号量を低減し、この出力信号をエントロピー符号化部１０４及び逆量子化部１０５に出力する。 The quantization unit 103 quantizes the output signal from the orthogonal transform unit 102. The quantization unit 103 reduces the code amount of the output signal by quantization, and outputs this output signal to the entropy encoding unit 104 and the inverse quantization unit 105.

エントロピー符号化部１０４は、量子化部１０３からの出力信号をエントロピー符号化して出力する。エントロピー符号化とは、シンボルの出現頻度に応じて可変長の符号を割り当てる方式をいう。 The entropy encoding unit 104 performs entropy encoding on the output signal from the quantization unit 103 and outputs the result. Entropy coding is a method of assigning variable-length codes according to the appearance frequency of symbols.

逆量子化部１０５は、量子化部１０３からの出力信号を逆量子化してから逆直交変換部１０６に出力する。逆直交変換部１０６は、逆量子化部１０５からの出力信号を逆直交変換処理してから復号画像生成部１０７に出力する。これら逆量子化部１０５及び逆直交変換部１０６によって復号処理が行われることにより、符号化前の予測誤差信号と同程度の信号が得られる。 The inverse quantization unit 105 dequantizes the output signal from the quantization unit 103 and then outputs the output signal to the inverse orthogonal transform unit 106. The inverse orthogonal transform unit 106 performs an inverse orthogonal transform process on the output signal from the inverse quantization unit 105 and then outputs the output signal to the decoded image generation unit 107. By performing decoding processing by the inverse quantization unit 105 and the inverse orthogonal transform unit 106, a signal having the same level as the prediction error signal before encoding is obtained.

復号画像生成部１０７は、インター予測部１１１で動き補償された画像のブロックデータと、逆量子化部１０５及び逆直交変換部１０６により復号処理された予測誤差信号とを加算する。復号画像生成部１０７は、加算して生成した復号画像のブロックデータを、デブロッキングフィルタ部１０８に出力する。 The decoded image generation unit 107 adds the block data of the image subjected to motion compensation by the inter prediction unit 111 and the prediction error signal decoded by the inverse quantization unit 105 and the inverse orthogonal transform unit 106. The decoded image generation unit 107 outputs the decoded image block data generated by the addition to the deblocking filter unit 108.

デブロッキングフィルタ部１０８は、復号画像生成部１０７から出力された復号画像に対し、ブロック歪を低減するためのフィルタをかけ、復号画像記憶部１０９に出力する。 The deblocking filter unit 108 applies a filter for reducing block distortion to the decoded image output from the decoded image generation unit 107 and outputs the filtered image to the decoded image storage unit 109.

復号画像記憶部１０９は、入力した復号画像のブロックデータを新たな参照画像のデータとして記憶し、イントラ予測部１１０、インター予測部１１１及び動きベクトル計算部１１２に出力する。 The decoded image storage unit 109 stores the input block data of the decoded image as new reference image data, and outputs the data to the intra prediction unit 110, the inter prediction unit 111, and the motion vector calculation unit 112.

イントラ予測部１１０は、符号化対象画像の処理対象ブロックに対して、すでに符号化された参照画素から予測画像を生成する。イントラ予測部１１０の詳細は、図２を用いて後述する。 The intra prediction unit 110 generates a predicted image from reference pixels that have already been encoded for the processing target block of the encoding target image. Details of the intra prediction unit 110 will be described later with reference to FIG.

インター予測部１１１は、復号画像記憶部１０９から取得した参照画像のデータを動きベクトル計算部１１２から提供される動きベクトルで動き補償する。これにより、動き補償された参照画像としてのブロックデータが生成される。 The inter prediction unit 111 performs motion compensation on the reference image data acquired from the decoded image storage unit 109 with the motion vector provided from the motion vector calculation unit 112. Thereby, block data as a motion-compensated reference image is generated.

動きベクトル計算部１１２は、符号化対象画像におけるブロックデータと、復号画像記憶部１０９から取得する参照画像とを用いて、動きベクトルを求める。動きベクトルとは、ブロック単位で参照画像内から処理対象ブロックに最も類似している位置を探索するブロックマッチング技術を用いて求められるブロック単位の空間的なずれを示す値である。動きベクトル計算部１１２は、求めた動きベクトルをインター予測部１１１に出力する。 The motion vector calculation unit 112 obtains a motion vector using the block data in the encoding target image and the reference image acquired from the decoded image storage unit 109. The motion vector is a value indicating a spatial deviation in units of blocks obtained using a block matching technique for searching for a position most similar to the processing target block from the reference image in units of blocks. The motion vector calculation unit 112 outputs the obtained motion vector to the inter prediction unit 111.

イントラ予測部１１０とインター予測部１１１から出力されたブロックデータは、予測画像選択部１１４に入力される。 The block data output from the intra prediction unit 110 and the inter prediction unit 111 are input to the predicted image selection unit 114.

予測画像選択手段１１４は、イントラ予測部１１０とインター予測部１１１から取得したブロックデータのうち、どちらか一方のブロックデータを予測画像として選択する。選択された予測画像は、予測誤差信号生成部１０１に出力される。 The predicted image selection unit 114 selects one of the block data acquired from the intra prediction unit 110 and the inter prediction unit 111 as a predicted image. The selected prediction image is output to the prediction error signal generation unit 101.

また、符号化制御及びヘッダ生成部１１３について、符号化の全体制御とヘッダ生成を行う。符号化制御及びヘッダ生成部１１３は、イントラ予測部１１０に対して、スライス分割有無の通知、デブロッキングフィルタ部１０８に対して、デブロッキングフィルタ有無の通知、動きベクトル計算部１１２に対して参照画像の制限通知などを行う。 The encoding control and header generation unit 113 performs overall encoding control and header generation. The encoding control and header generation unit 113 notifies the intra prediction unit 110 of presence / absence of slice division, notifies the deblocking filter unit 108 of presence / absence of deblocking filter, and provides a reference image to the motion vector calculation unit 112. Notification of restrictions, etc.

符号化制御及びヘッダ生成部１１３は、その制御結果を用いて、例えばＨ．２６４／ＡＶＣのヘッダ情報を生成する。生成されたヘッダ情報は、エントロピー符号化部１０４に出力され、画像データ、動きベクトルデータとともにストリームとして出力される。 The encoding control and header generation unit 113 uses the control result, for example, H.264. H.264 / AVC header information is generated. The generated header information is output to the entropy encoding unit 104, and is output as a stream together with image data and motion vector data.

＜イントラ予測部の構成＞
次に、イントラ予測部１１０の構成について説明する。図３は、イントラ予測部１１０の構成の一例を示すブロック図である。図３に示すイントラ予測部１１０は、分類部２０１、フィルタ決定部２０２、フィルタリング部２０３、生成部２０４を有する。 <Configuration of intra prediction unit>
Next, the configuration of the intra prediction unit 110 will be described. FIG. 3 is a block diagram illustrating an example of the configuration of the intra prediction unit 110. The intra prediction unit 110 illustrated in FIG. 3 includes a classification unit 201, a filter determination unit 202, a filtering unit 203, and a generation unit 204.

分類部２０１は、画面内予測で用いる複数の参照画素を、その参照画素が参照される可能性に基づいて複数のグループに分類する。分類部２０１は、例えば、参照される予測方向の数や処理対象ブロックに隣接するか否かで、参照画素を分類する。参照画素の分類の詳細については、図４〜１０を用いて後述する。 The classification unit 201 classifies the plurality of reference pixels used in the intra prediction into a plurality of groups based on the possibility that the reference pixels are referred to. The classification unit 201 classifies the reference pixels based on, for example, the number of prediction directions to be referenced and whether or not they are adjacent to the processing target block. Details of the reference pixel classification will be described later with reference to FIGS.

分類部２０１は、複数の参照画素を複数のグループに分類した場合、参照画素がどのグループに属するかを示す分類情報をフィルタ決定部２０２に出力する。 When the plurality of reference pixels are classified into a plurality of groups, the classification unit 201 outputs classification information indicating to which group the reference pixel belongs to the filter determination unit 202.

フィルタ決定部２０２は、分類情報を取得すると、分類された各グループに対して異なるフィルタを決定する。フィルタ決定手段２０２は、例えば、グループに含まれる参照画素が参照される可能性が高いほど、通過帯域の広いＬＰＦに決定する。また、フィルタ決定部２０２は、ノイズ除去フィルタを、参照される可能性が最も高い参照画素を含むグループに決定してもよい。フィルタ決定部２０２の詳細についても後述する。 When the filter determination unit 202 acquires the classification information, the filter determination unit 202 determines a different filter for each classified group. For example, the filter determination unit 202 determines an LPF having a wider passband as the reference pixel included in the group is more likely to be referred to. Further, the filter determination unit 202 may determine the noise removal filter as a group including reference pixels that are most likely to be referred to. Details of the filter determination unit 202 will also be described later.

フィルタ決定部２０２は、どのグループにどのフィルタを決定したかを示すフィルタ情報をフィルタリング部２０３に出力する。 The filter determination unit 202 outputs filter information indicating which filter is determined for which group to the filtering unit 203.

フィルタリング部２０３は、復号画像記憶部１０９から参照画素の画素値を取得する。また、フィルタリング部２０３は、フィルタ決定部２０２からフィルタ情報を取得し、どの参照画素にどのフィルタを用いるかを把握する。 The filtering unit 203 acquires the pixel value of the reference pixel from the decoded image storage unit 109. Further, the filtering unit 203 acquires filter information from the filter determination unit 202 and grasps which filter is used for which reference pixel.

フィルタリング部２０３は、フィルタ決定部２０２により決定されたフィルタを用いて、各グループに含まれる参照画素の画素値をフィルタリングする。フィルタリング部２０３は、フィルタリング後の画素値を生成部２０４に出力する。 The filtering unit 203 filters the pixel values of the reference pixels included in each group using the filter determined by the filter determining unit 202. The filtering unit 203 outputs the filtered pixel value to the generation unit 204.

生成部２０４は、フィルタリング部２０３から取得したフィルタリング後の画素値を用いて、画面内予測の予測画像を生成する。生成部２０４は、例えば、予測画像生成部２０５、予測モード決定部２０６を有する。 The generation unit 204 uses the filtered pixel values acquired from the filtering unit 203 to generate a predicted image for intra-screen prediction. The generation unit 204 includes, for example, a predicted image generation unit 205 and a prediction mode determination unit 206.

予測画像生成部２０５は、フィルタリング後の画素値を予測値として、画面内予測の予測方向に対応する予測モード毎に、予測画像を生成する。 The predicted image generation unit 205 generates a predicted image for each prediction mode corresponding to the prediction direction of intra-screen prediction using the filtered pixel value as a predicted value.

予測モード決定部２０６は、予測モード毎の予測画像と、処理対象ブロックとの差が最も小さい予測モードを決定する。予測モード決定部２０６は、決定された予測モードの予測画像を、予測画像選択部１１４に出力する。 The prediction mode determination unit 206 determines a prediction mode with the smallest difference between the prediction image for each prediction mode and the processing target block. The prediction mode determination unit 206 outputs the prediction image of the determined prediction mode to the prediction image selection unit 114.

＜参照画素の分類＞
次に、分類部２０１による参照画素の分類について説明する。参照画素の例として、Ｈ．２６４／ＡＶＣの８×８のブロック（図１参照）を用いて説明するが、このブロックサイズに限らず、４×４、１６×１６などのブロックサイズでもよい。また、輝度及び色差いずれのブロックに対しても適用できる。また、正方形のブロックに限らず、例えば８×１６などの長方形のブロックに対しても適用できる。 <Classification of reference pixels>
Next, reference pixel classification by the classification unit 201 will be described. As an example of the reference pixel, H.264 is used. A description will be given using an H.264 / AVC 8 × 8 block (see FIG. 1), but the block size is not limited to this, and may be a block size such as 4 × 4 or 16 × 16. Further, the present invention can be applied to both luminance and color difference blocks. Further, the present invention is not limited to a square block, and can be applied to a rectangular block such as 8 × 16.

まず、Ｈ．２６４／ＡＶＣの場合の予測方向について説明する。図４は、８×８の画面内予測符号化の予測方向を示す図である。図４に示す例では、各予測モードに対応する予測方向を示している。図４に示す灰色丸が参照画素を示し、白丸が処理対象画素を示す。矢印が予測方向を示す。 First, H. A prediction direction in the case of H.264 / AVC will be described. FIG. 4 is a diagram illustrating a prediction direction of 8 × 8 intra prediction encoding. In the example shown in FIG. 4, the prediction direction corresponding to each prediction mode is shown. The gray circles shown in FIG. 4 indicate reference pixels, and the white circles indicate processing target pixels. The arrow indicates the prediction direction.

例えば、予測モード０では、参照画素Ｐを、同じ列の画素（Ｎ０、Ｎ１、・・・、Ｎ７）の予測値とすることを示す。 For example, the prediction mode 0 indicates that the reference pixel P is a predicted value of pixels (N0, N1,..., N7) in the same column.

以下では、図４に示す予測方向を用いるが、図４に示す予測方向に限られず、他の予測方向がある場合でも、以下に説明する分類方法を同様に適用することができる。 In the following, although the prediction direction shown in FIG. 4 is used, the classification method described below can be similarly applied even when there are other prediction directions without being limited to the prediction direction shown in FIG.

（予測方向の数に基づく分類）
分類部２０１は、例えば、参照される予測方向の数に基づいて参照画素を分類する。この分類方法は、参照される予測方向の数が多い場合、そのいずれかの方向にエッジが存在するとき、ＬＰＦにより画面内予測の効率が低下するという考えに基づく。 (Classification based on the number of prediction directions)
For example, the classification unit 201 classifies reference pixels based on the number of referenced prediction directions. This classification method is based on the idea that when the number of prediction directions to be referred to is large and there is an edge in any one direction, the efficiency of intra prediction is reduced by LPF.

図５は、Ｈ．２６４／ＡＶＣにおける参照画素Ｙを参照する予測方向を示す図である。図５に示すように、参照画素Ｙは、図４に示す予測モード３の予測方向の場合に参照される。 FIG. It is a figure which shows the prediction direction which refers to the reference pixel Y in H.264 / AVC. As shown in FIG. 5, the reference pixel Y is referred to in the case of the prediction direction of the prediction mode 3 shown in FIG.

図６は、Ｈ．２６４／ＡＶＣにおける参照画素Ｕを参照する予測方向を示す図である。図６に示すように、参照画素Ｕは、図４に示す予測モード３、７の予測方向の場合に参照される。 FIG. It is a figure which shows the prediction direction which refers to the reference pixel U in H.264 / AVC. As shown in FIG. 6, the reference pixel U is referred to in the prediction directions of the prediction modes 3 and 7 shown in FIG. 4.

図７は、Ｈ．２６４／ＡＶＣにおける参照画素Ｑを参照する予測方向を示す図である。図７に示すように、参照画素Ｑは、図４に示す予測モード０、３、７の予測方向の場合に参照される。 FIG. It is a figure which shows the prediction direction which refers the reference pixel Q in H.264 / AVC. As shown in FIG. 7, the reference pixel Q is referred to in the case of the prediction directions of the prediction modes 0, 3, and 7 shown in FIG.

図８は、Ｈ．２６４／ＡＶＣにおける参照画素Ｍを参照する予測方向を示す図である。図８に示すように、参照画素Ｍは、図４に示す予測モード０、３、４、５、６、７の予測方向の場合に参照される。 FIG. It is a figure which shows the prediction direction which refers to the reference pixel M in H.264 / AVC. As shown in FIG. 8, the reference pixel M is referred to in the prediction directions of the prediction modes 0, 3, 4, 5, 6, and 7 shown in FIG. 4.

例えば、参照画素Ｍについては、図８に示す６方向のいずれかにエッジが存在する場合に、ＬＰＦを適用すると画面内予測の効果が低下する要因となる。 For example, with respect to the reference pixel M, when an edge exists in any of the six directions shown in FIG.

また、参照画素Ｕについては、図６に示す２方向のいずれかにエッジが存在する場合に、ＬＰＦを適用すると画面内予測の効果が低下する要因となるが、その他の予測方向では参照されないため、ＬＰＦの有無による影響が小さい。 For the reference pixel U, when an edge exists in one of the two directions shown in FIG. 6, the effect of intra prediction is reduced when LPF is applied, but it is not referenced in other prediction directions. The effect of the presence or absence of LPF is small.

ここで、画像中に存在するエッジの方向がランダムであると仮定すると、画面内予測の予測方向と、エッジの方向とが一致する確率は、参照画素Ｍの方が、参照画素Ｕよりも３（＝６（参照画素Ｍの予測方向数）／２（参照画素Ｕの予測方向数））倍高いことになる。 Here, assuming that the direction of the edge existing in the image is random, the probability that the prediction direction of the intra prediction matches the direction of the edge is 3 for the reference pixel M than for the reference pixel U. (= 6 (number of prediction directions of reference pixel M) / 2 (number of prediction directions of reference pixel U)) times higher.

よって、分類部２０１は、参照される予測方向の数に基づいて、参照画素を以下のように分類することができる。
グループ１：Ｖ，Ｗ，Ｘ，Ｙ
グループ２：Ａ，Ｒ，Ｓ，Ｔ，Ｕ
グループ３：Ｈ，Ｉ，Ｑ
グループ４：Ｂ，Ｃ，Ｄ，Ｆ，Ｊ
グループ５：Ｅ，Ｇ，Ｌ，Ｎ，Ｐ
グループ６：Ｋ，Ｍ，Ｏ
グループ１は、参照される予測方向の数が１であり、グループ２は、参照される予測方向の数が２であり、グループ３は、参照される予測方向の数が３であり、グループ４は、参照される予測方向の数が４であり、グループ５は、参照される予測方向の数が５であり、グループ６は、参照される予測方向の数が６である。 Therefore, the classification unit 201 can classify reference pixels as follows based on the number of referenced prediction directions.
Group 1: V, W, X, Y
Group 2: A, R, S, T, U
Group 3: H, I, Q
Group 4: B, C, D, F, J
Group 5: E, G, L, N, P
Group 6: K, M, O
Group 1 has a number of referenced prediction directions of 1, group 2 has a number of referenced prediction directions of 2, group 3 has a number of referenced prediction directions of 3, group 4 The number of referenced prediction directions is 4, group 5 has 5 referenced prediction directions, and group 6 has 6 referenced prediction directions.

また、分類部２０１は、グループ１〜６のうち、複数のグループをまとめて分類してもよい。例えば、分類部２０１は、グループ１と２とを１つのグループ、グループ３と４とを１つのグループ、グループ５と６とを１つのグループにしてもよい。 Moreover, the classification | category part 201 may classify a some group collectively from the groups 1-6. For example, the classification unit 201 may group 1 and 2 into one group, group 3 and 4 into one group, and groups 5 and 6 into one group.

これにより、予測方向と一致する方向のエッジが存在する確率を考慮して、参照画素の分類を行うことができる。 Thereby, it is possible to classify the reference pixels in consideration of the probability that an edge in the direction matching the prediction direction exists.

なお、符号化標準技術のブロックサイズや予測方向によって、分類されるグループ数が異なるが、ブロックサイズやどんな予測方向があるかが決まれば、分類部２０１は、参照画素の分類を一意に行うことができる。 Note that the number of groups to be classified differs depending on the block size and prediction direction of the encoding standard technique, but if the block size and what kind of prediction direction is determined, the classification unit 201 should uniquely classify the reference pixels. Can do.

（隣接するか否かに基づく分類）
分類部２０１は、例えば、処理対象ブロックに隣接するか否かで参照画素を分類する。この分類方法は、処理対象ブロックに直接隣接する参照画素が、直接隣接しない参照画素よりも、参照される予測方向が多いという考えに基づく。 (Classification based on whether or not they are adjacent)
For example, the classification unit 201 classifies reference pixels based on whether or not they are adjacent to the processing target block. This classification method is based on the idea that reference pixels that are directly adjacent to the processing target block are referred to more prediction directions than reference pixels that are not directly adjacent.

図９は、参照画素を２分類する一例を示す図である。図９に示す例では、分類部２０１は、処理対象ブロックに直接隣接するグループ２、直接隣接しないグループ１に分類する。
グループ１：処理対象ブロックに直接隣接しない参照画素
グループ２：処理対象ブロックに直接隣接する参照画素
図１０は、参照画素を３分類する一例を示す図である。図１０に示す例では、分類部２０１は、処理対象ブロックに隣接するグループ２とグループ３と、処理対象ブロックに直接隣接しないグループ１とに分類する。
グループ１：処理対象ブロックに直接隣接しない参照画素
グループ２：処理対象ブロックの左上付近に直接隣接する参照画素
グループ３：処理対象ブロックに直接隣接し、グループ２以外の参照画素
グループ２は、例えば、図１に示すＨ，Ｉ，Ｊの参照画素とする。分類部２０１は、処理対象ブロックに直接隣接する参照画素及び／又は処理対象ブロックに直接隣接しない参照画素を、さらに複数のグループに分類してもよい。 FIG. 9 is a diagram illustrating an example of classifying the reference pixels into two. In the example illustrated in FIG. 9, the classification unit 201 classifies a group 2 that is directly adjacent to the processing target block and a group 1 that is not directly adjacent.
Group 1: Reference pixels not directly adjacent to the processing target block Group 2: Reference pixels directly adjacent to the processing target block FIG. 10 is a diagram illustrating an example of classifying the reference pixels into three categories. In the example illustrated in FIG. 10, the classification unit 201 classifies groups 2 and 3 adjacent to the processing target block and group 1 that is not directly adjacent to the processing target block.
Group 1: Reference pixel group not directly adjacent to the processing target block 2: Reference pixel group directly adjacent to the vicinity of the upper left of the processing target block 3: Directly adjacent to the processing target block, and the reference pixel group 2 other than the group 2 is, for example, Assume that the reference pixels are H, I, and J shown in FIG. The classification unit 201 may further classify the reference pixels directly adjacent to the processing target block and / or the reference pixels not directly adjacent to the processing target block into a plurality of groups.

これにより、参照される可能性に基づいて、参照画素を複数のグループに簡易的に分類することができる。 Thus, the reference pixels can be easily classified into a plurality of groups based on the possibility of reference.

分類部２０１は、予測方向の数に基づく分類、隣接するか否かに基づく分類のいずれかの分類方法で、参照画素を分類すればよい。また、分類部２０１は、符号化レートなどに基づいて、どちらの分類を行うかを選択して分類することも可能である。 The classification unit 201 may classify the reference pixels by any one of the classification method based on the number of prediction directions and the classification based on whether or not they are adjacent. Further, the classification unit 201 can select and classify which classification is performed based on the encoding rate or the like.

＜フィルタ決定＞
次に、フィルタ決定部２０２によるフィルタ決定について説明する。フィルタ決定部２０２は、グループ内の参照画素が参照される可能性が高いほど、エッジ成分の変化が小さいフィルタに決定する。 <Filter determination>
Next, filter determination by the filter determination unit 202 will be described. The filter determination unit 202 determines a filter having a smaller change in edge component as the reference pixel in the group is more likely to be referenced.

（ＬＰＦ）
フィルタ決定部２０２は、例えば、グループ内の参照画素が参照される可能性が高いほど、通過帯域の広いＬＰＦになるようフィルタを決定する。フィルタ決定部２０２は、例えば、分類部２０１により分類されたグループの番号が小さいほど、通過帯域の狭いフィルタにするよう決定する。 (LPF)
For example, the filter determination unit 202 determines the filter so that the LPF having a wider passband becomes higher as the reference pixel in the group is more likely to be referred to. For example, the filter determining unit 202 determines that the filter has a narrower passband as the group number classified by the classifying unit 201 is smaller.

例えば、分類部２０１は、予測方向の数に基づく分類を行ったとする。この場合、フィルタ決定部２０２は、グループ１と２に対し、［１／４，１／２，１／４］のＬＰＦに決定し、グループ３と４に対し、［１／８，３／４，１／８］のＬＰＦに決定し、グループ５と６に対し、［１／１６，７／８，１／１６］のＬＰＦに決定する。 For example, it is assumed that the classification unit 201 performs classification based on the number of prediction directions. In this case, the filter determination unit 202 determines LPFs of [1/4, 1/2, 1/4] for groups 1 and 2 and [1/8, 3/4] for groups 3 and 4. , 1/8] LPF, and for groups 5 and 6, [1/16, 7/8, 1/16] LPF.

例えば、分類部２０１は、隣接するか否かに基づいて３つのグループに分類を行ったとする。この場合、フィルタ決定部２０２は、グループ１に対し、［１／４，１／２，１／４］のＬＰＦに決定し、グループ２に対し、［１／８，３／４，１／８］のＬＰＦに決定し、グループ３に対し、［１／１６，７／８，１／１６］のＬＰＦに決定する。上記ＬＰＦは、あくまでも一例であり、タップ数や通過帯域については適宜変更してもよい。 For example, it is assumed that the classification unit 201 performs classification into three groups based on whether or not they are adjacent to each other. In this case, the filter determination unit 202 determines [1/4, 1/2, 1/4] LPF for group 1 and [1/8, 3/4, 1/8] for group 2. The LPF of [1/16, 7/8, 1/16] for group 3 is determined. The LPF is merely an example, and the number of taps and the passband may be changed as appropriate.

つまり、参照される可能性が高い参照画素では、通過帯域の広いＬＰＦを用いる。通過帯域の広いＬＰＦは、通過帯域の狭いＬＰＦと比較し、着目画素値の変化が小さい。そのため、参照される可能性が高い参照画素は、エッジが存在した場合でも予測効率の低下を小さくすることができる。 That is, an LPF having a wide passband is used for a reference pixel that is highly likely to be referenced. An LPF with a wide passband has a smaller change in the pixel value of interest than an LPF with a narrow passband. Therefore, a reference pixel that is highly likely to be referenced can reduce a decrease in prediction efficiency even when an edge exists.

一方、参照される可能性が低い参照画素は、適用される予測方向と一致するエッジが存在する確率が低いため、通過帯域の狭いＬＰＦを用いる。これにより、十分なノイズの低減効果を期待することができる。 On the other hand, a reference pixel having a low possibility of being referenced uses an LPF having a narrow passband because the probability that an edge matching the applied prediction direction exists is low. Thereby, a sufficient noise reduction effect can be expected.

（ＬＰＦ＋ノイズ除去フィルタ）
フィルタ決定部２０２は、例えば、参照される可能性が一番高い参照画素を含むグループに対し、ノイズ除去フィルタを決定し、その他のグループに対して、グループ内の参照画素が参照される可能性が高いほど、通過帯域の広いＬＰＦになるようフィルタを決定する。 (LPF + noise removal filter)
For example, the filter determination unit 202 determines a noise removal filter for a group including a reference pixel that is most likely to be referred to, and may reference a reference pixel in the group for other groups. The filter is determined so as to be an LPF with a wider passband as the value of is higher.

フィルタ決定部２０２は、例えば、分類部２０１により分類されたグループの番号が一番大きいグループに対してノイズ除去フィルタに決定し、その他のグループについてはグループの番号が小さいほど、通過帯域の狭いフィルタにするよう決定する。 The filter determination unit 202 determines, for example, a noise removal filter for the group with the largest group number classified by the classification unit 201, and for other groups, a filter with a narrower passband as the group number is smaller. Decide to be.

ノイズ除去フィルタとは、例えば、メディアンフィルタ、移動平均フィルタ、εフィルタ、ＭＴＭ（Modified Trimmed Mean）フィルタなどがある。例えば、ノイズ除去フィルタとして、フィルタのタップ数が少ないメディアンフィルタが計算量の観点から好適である。 Examples of the noise removal filter include a median filter, a moving average filter, an ε filter, and an MTM (Modified Trimmed Mean) filter. For example, a median filter having a small number of filter taps is suitable as a noise removal filter from the viewpoint of calculation amount.

例えば、分類部２０１は、予測方向の数に基づく分類を行ったとする。この場合、フィルタ決定部２０２は、グループ１と２に対し、［１／４，１／２，１／４］のＬＰＦに決定し、グループ３と４に対し、［１／８，３／４，１／８］のＬＰＦに決定し、グループ５と６に対し、３タップのメディアンフィルタに決定する。 For example, it is assumed that the classification unit 201 performs classification based on the number of prediction directions. In this case, the filter determination unit 202 determines LPFs of [1/4, 1/2, 1/4] for groups 1 and 2 and [1/8, 3/4] for groups 3 and 4. , 1/8] LPF and, for groups 5 and 6, a 3-tap median filter.

例えば、分類部２０１は、隣接するか否かに基づいて３つのグループに分類を行ったとする。この場合、フィルタ決定部２０２は、グループ１に対し、［１／４，１／２，１／４］のＬＰＦに決定し、グループ２に対し、［１／８，３／４，１／８］のＬＰＦに決定し、グループ３に対し、３タップのメディアンフィルタに決定する。上記ＬＰＦは、あくまでも一例であり、タップ数や通過帯域については適宜変更してもよい。 For example, it is assumed that the classification unit 201 performs classification into three groups based on whether or not they are adjacent to each other. In this case, the filter determination unit 202 determines [1/4, 1/2, 1/4] LPF for group 1 and [1/8, 3/4, 1/8] for group 2. ], And for group 3, a 3-tap median filter is determined. The LPF is merely an example, and the number of taps and the passband may be changed as appropriate.

一般的にノイズ除去フィルタは、ＬＰＦよりもエッジの値の変化が小さいため、通過帯域を広くしたＬＰＦと同様の効果が得られる。また、ノイズ除去フィルタは、ＬＰＦよりも計算量は多くなる場合があるが、例えば３タップのメディアンフィルタを用いた場合にはＬＰＦからの計算量の増加は少ない。フィルタ決定部２０２は、上述したように、参照される予測方向の数が多いほど通過帯域を広くし、参照される予測方向の数が小さいほど通過帯域を狭くしたフィルタのセットを複数種類用意しておく。そして、予め各種の絵柄に応じたそれぞれのフィルタセットの効果を確かめておくことで、入力動画像の種類が与えられれば、フィルタ決定部２０２は、使用するフィルタセットを決めることができる。また、フィルタ決定部２０２は、入力動画像の絵柄の種類の判別に応じて、適応的にフィルタセットを変えてもよい。 In general, since the noise removal filter has a smaller edge value change than the LPF, the same effect as the LPF having a wide passband can be obtained. In addition, although the noise removal filter may have a larger calculation amount than the LPF, for example, when a 3-tap median filter is used, the increase in the calculation amount from the LPF is small. As described above, the filter determination unit 202 prepares a plurality of sets of filters that increase the passband as the number of referenced prediction directions increases, and narrow the passband as the number of referenced prediction directions decreases. Keep it. The filter determination unit 202 can determine the filter set to be used if the type of the input moving image is given by confirming the effect of each filter set corresponding to various patterns in advance. Further, the filter determination unit 202 may adaptively change the filter set according to the determination of the pattern type of the input moving image.

よって、実施例１では、参照画素毎のエッジ判定を行う必要がなく、参照画素の分類に応じたフィルタリングを行うことで、エッジを維持する効果と、ノイズを低減する効果との組み合わせにより、画面内予測の予測性能を向上させることが可能となる。 Therefore, in the first embodiment, it is not necessary to perform edge determination for each reference pixel, and by performing filtering according to the classification of the reference pixel, a combination of the effect of maintaining the edge and the effect of reducing noise is used. It is possible to improve the prediction performance of intra prediction.

＜実験結果＞
次に、ＨＥＶＣの規格化作業で用いられているソフトウェアエンコーダＨＭ３．０と、ＨＭ３．０に実施例１の方式を実装した提案方式とで行った符号化実験について説明する。 <Experimental result>
Next, a description will be given of an encoding experiment performed by the software encoder HM3.0 used in the HEVC standardization work and the proposed method in which the method of the first embodiment is implemented in HM3.0.

実施例１の方式とは、処理対象ブロックに直接隣接するか否かにより２つのグループに分類し（図９参照）、グループ１に［１／４，１／２，１／４］のＬＰＦ、グループ２に［１／８，３／４，１／８］のＬＰＦを適用する。 The system of the first embodiment is classified into two groups depending on whether or not it is directly adjacent to the processing target block (see FIG. 9), and [1/4, 1/2, 1/4] LPF is assigned to group 1, [1/8, 3/4, 1/8] LPF is applied to group 2.

実験では、画像Ａ〜Ｄに対し、それぞれ符号化を行い、ＨＭ３．０をアンカーとし、提案方式の性能をＢＤ−ＲＡＴＥで表す。
画像Ａ：街の風景
画像Ｂ：室内の人物の映像
画像Ｃ：走る馬
画像Ｄ：屋内の人物の映像
ＢＤ−ＲＡＴＥは、「G. Bjontegaard, "Calculation of average PSNR differences between RD-Curves," ITU-T SG16 Q.6 Document, VCEG-M33,April 2001」に提案された符号化性能を比較する指標である。ＢＤ−ＲＡＴＥの値が０の場合はアンカーと提案方式の性能は等しく、ＢＤ−ＲＡＴＥの値が小さいほどアンカーより提案方式の性能が優れている。 In the experiment, each of the images A to D is encoded, and HM3.0 is used as an anchor, and the performance of the proposed method is represented by BD-RATE.
Image A: City landscape image B: Indoor person image C: Running horse image D: Indoor person image BD-RATE is "G. Bjontegaard," Calculation of average PSNR differences between RD-Curves, "ITU -T SG16 Q.6 Document, VCEG-M33, April 2001 ”is an index for comparing the encoding performance proposed. When the value of BD-RATE is 0, the performance of the proposed method is equal to that of the anchor. The smaller the value of BD-RATE, the better the performance of the proposed method than the anchor.

図１１は、実験結果を示す図である。図１１に示すように、提案方式では、ＨＭ３．０よりも符号化性能が向上していることが分かる。よって、実施例１では、計算量の増加を抑えつつ、画面内予測の予測性能を向上させることが可能となる。 FIG. 11 is a diagram showing experimental results. As shown in FIG. 11, it can be seen that the proposed scheme has improved encoding performance over HM3.0. Therefore, in the first embodiment, it is possible to improve the prediction performance of intra prediction while suppressing an increase in calculation amount.

＜動作＞
次に、実施例１における画像符号化装置１００の動作について説明する。図１２は、実施例１におけるが画面内予測処理の一例を示すフローチャートである。 <Operation>
Next, the operation of the image encoding device 100 according to the first embodiment will be described. FIG. 12 is a flowchart illustrating an example of the intra-screen prediction process in the first embodiment.

図１２に示すステップＳ１０１で、分類部２０１は、参照される可能性に基づいて参照画素を複数のグループに分類する。例えば、分類部２０１は、処理対象ブロックのブロックサイズなどに応じて分類方法を決めておき、この分類方法により分類を行えばよい。分類方法は、前述した通り、予測方向の数に基づく分類か、隣接するか否かに基づく分類かのいずれかを用いればよい。 In step S101 illustrated in FIG. 12, the classification unit 201 classifies reference pixels into a plurality of groups based on the possibility of reference. For example, the classification unit 201 may determine a classification method according to the block size of the processing target block and classify by this classification method. As described above, the classification method may use either a classification based on the number of prediction directions or a classification based on whether or not adjacent to each other.

ステップＳ１０２で、フィルタ決定部２０２は、分類部２０１により決定されたグループに応じて異なるフィルタを決定する。フィルタ決定部２０２は、例えば、グループ内の参照画素が参照される可能性が高いほど、通過帯域の広いＬＰＦに決定する。 In step S102, the filter determination unit 202 determines different filters according to the groups determined by the classification unit 201. For example, the filter determination unit 202 determines an LPF having a wider passband as the reference pixel in the group is more likely to be referenced.

ステップＳ１０３で、フィルタリング部２０３は、フィルタ決定部２０２により決定されたフィルタを用いて、参照画素の画素値をフィルタリングする。 In step S103, the filtering unit 203 filters the pixel value of the reference pixel using the filter determined by the filter determining unit 202.

ステップＳ１０４で、生成部２０４は、複数の予測モードに対応する予測画像を生成し、最適な予測モードを決定する。生成部２０４は、決定された予測モードの予測画像を予測画像選択部１１４に出力する。 In step S104, the generation unit 204 generates a prediction image corresponding to a plurality of prediction modes, and determines an optimal prediction mode. The generation unit 204 outputs the predicted image of the determined prediction mode to the predicted image selection unit 114.

以上、実施例１によれば、計算量の増加を抑えつつ、画面内予測の予測効率を向上させることができる。 As described above, according to the first embodiment, it is possible to improve the prediction efficiency of intra prediction while suppressing an increase in calculation amount.

［実施例２］
図１３は、実施例２における画像符号化装置３００の構成の一例を示すブロック図である。画像符号化装置３００は、上述した実施例１で説明した画像符号化処理をソフトウェアで実装した装置の一例である。 [Example 2]
FIG. 13 is a block diagram illustrating an example of the configuration of the image encoding device 300 according to the second embodiment. The image encoding device 300 is an example of a device in which the image encoding process described in the first embodiment is implemented by software.

図１３に示すように、画像符号化装置３００は、制御部３０１、主記憶部３０２、補助記憶部３０３、ドライブ装置３０４、ネットワークＩ／Ｆ部３０６、入力部３０７、表示部３０８を有する。これら各構成は、バスを介して相互にデータ送受信可能に接続されている。 As illustrated in FIG. 13, the image encoding device 300 includes a control unit 301, a main storage unit 302, an auxiliary storage unit 303, a drive device 304, a network I / F unit 306, an input unit 307, and a display unit 308. These components are connected to each other via a bus so as to be able to transmit and receive data.

制御部３０１は、コンピュータの中で、各装置の制御やデータの演算、加工を行うＣＰＵである。また、制御部３０１は、主記憶部３０２又は補助記憶部３０３に記憶されたフィルタ処理を含む画像符号化処理のプログラムを実行する演算装置である。制御部３０１は、入力部３０７や記憶装置からデータを受け取り、演算、加工した上で、表示部３０８や記憶装置などに出力する。 The control unit 301 is a CPU that controls each device, calculates data, and processes in a computer. The control unit 301 is an arithmetic device that executes a program for image encoding processing including filter processing stored in the main storage unit 302 or the auxiliary storage unit 303. The control unit 301 receives data from the input unit 307 and the storage device, calculates and processes the data, and outputs the data to the display unit 308 and the storage device.

制御部３０１は、フィルタ処理を含む画像符号化処理のプログラムを実行することで、各実施例で説明したフィルタ処理を実現することができる。 The control unit 301 can realize the filter processing described in each embodiment by executing a program of image encoding processing including filter processing.

主記憶部３０２は、ＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）などである。主記憶部３０２は、制御部３０１が実行する基本ソフトウェアであるＯＳ（Operating System）やアプリケーションソフトウェアなどのプログラムやデータを記憶又は一時保存する記憶装置である。 The main storage unit 302 is a ROM (Read Only Memory), a RAM (Random Access Memory), or the like. The main storage unit 302 is a storage device that stores or temporarily stores programs and data such as OS (Operating System) and application software that are basic software executed by the control unit 301.

補助記憶部３０３は、ＨＤＤ（Hard Disk Drive）などであり、アプリケーションソフトウェアなどに関連するデータを記憶する記憶装置である。 The auxiliary storage unit 303 is an HDD (Hard Disk Drive) or the like, and is a storage device that stores data related to application software and the like.

ドライブ装置３０４は、記録媒体３０５、例えばフレキシブルディスクからプログラムを読み出し、記憶装置にインストールする。 The drive device 304 reads the program from the recording medium 305, for example, a flexible disk, and installs it in the storage device.

また、記録媒体３０５に、所定のプログラムを格納し、この記録媒体３０５に格納されたプログラムはドライブ装置３０４を介して画像符号化装置３００にインストールされる。インストールされた所定のプログラムは、画像符号化装置３００により実行可能となる。 A predetermined program is stored in the recording medium 305, and the program stored in the recording medium 305 is installed in the image encoding device 300 via the drive device 304. The installed predetermined program can be executed by the image encoding device 300.

ネットワークＩ／Ｆ部３０６は、有線及び／又は無線回線などのデータ伝送路により構築されたＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）などのネットワークを介して接続された通信機能を有する周辺機器と画像符号化装置３００とのインターフェースである。 The network I / F unit 306 is a peripheral having a communication function connected via a network such as a LAN (Local Area Network) or a WAN (Wide Area Network) constructed by a data transmission path such as a wired and / or wireless line. This is an interface between the device and the image encoding device 300.

入力部３０７は、カーソルキー、数字入力及び各種機能キー等を備えたキーボード、表示部３０８の表示画面上でキーの選択等を行うためのマウスやスライスパット等を有する。また、入力部３０７は、ユーザが制御部３０１に操作指示を与えたり、データを入力したりするためのユーザインターフェースである。 The input unit 307 includes a keyboard having cursor keys, numeric input, various function keys, and the like, a mouse and a slice pad for selecting keys on the display screen of the display unit 308, and the like. The input unit 307 is a user interface for a user to give an operation instruction to the control unit 301 or input data.

表示部３０８は、ＣＲＴ（Cathode Ray Tube）やＬＣＤ（Liquid Crystal Display）等により構成され、制御部３０１から入力される表示データに応じた表示が行われる。 The display unit 308 is configured by a CRT (Cathode Ray Tube), an LCD (Liquid Crystal Display), or the like, and performs display according to display data input from the control unit 301.

なお、図２に示す復号画像記憶部１０９は、例えば主記憶部３０２又は補助記憶部３０３により実現され、図２に示す復号画像記憶部１０９以外の構成は、例えば制御部３０１及びワークメモリとしての主記憶部３０２により実現されうる。 The decoded image storage unit 109 illustrated in FIG. 2 is realized by, for example, the main storage unit 302 or the auxiliary storage unit 303, and the configuration other than the decoded image storage unit 109 illustrated in FIG. This can be realized by the main storage unit 302.

画像符号化装置３００で実行されるプログラムは、実施例１で説明した各部を含むモジュール構成となっている。実際のハードウェアとしては、制御部３０１が補助記憶部３０３からプログラムを読み出して実行することにより上記各部のうち１又は複数の各部が主記憶部３０２上にロードされ、１又は複数の各部が主記憶部３０２上に生成されるようになっている。 The program executed by the image encoding device 300 has a module configuration including each unit described in the first embodiment. As actual hardware, when the control unit 301 reads a program from the auxiliary storage unit 303 and executes it, one or more of the above-described units are loaded onto the main storage unit 302, and one or more of the units are main. It is generated on the storage unit 302.

このように、上述した実施例１で説明した画面内予測処理は、コンピュータに実行させるためのプログラムとして実現されてもよい。このプログラムをサーバ等からインストールしてコンピュータに実行させることで、前述したフィルタ処理を実現することができる。 Thus, the in-screen prediction process described in the first embodiment may be realized as a program for causing a computer to execute. The filter processing described above can be realized by installing this program from a server or the like and causing the computer to execute it.

また、このプログラムを記録媒体３０５に記録し、このプログラムが記録された記録媒体３０５をコンピュータや携帯端末に読み取らせて、前述したフィルタ処理を実現させることも可能である。なお、記録媒体３０５は、ＣＤ−ＲＯＭ、フレキシブルディスク、光磁気ディスク等の様に情報を光学的，電気的或いは磁気的に記録する記録媒体、ＲＯＭ、フラッシュメモリ等の様に情報を電気的に記録する半導体メモリ等、様々なタイプの記録媒体を用いることができる。また、上述した各実施例で説明したフィルタ処理は、１つ又は複数の集積回路に実装してもよい。 It is also possible to record the program in the recording medium 305 and cause the computer or portable terminal to read the recording medium 305 on which the program is recorded, thereby realizing the filter processing described above. The recording medium 305 is a recording medium that records information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk, and information is electrically stored such as a ROM or flash memory. Various types of recording media such as a semiconductor memory for recording can be used. Further, the filtering process described in each of the above embodiments may be implemented in one or a plurality of integrated circuits.

なお、実施例では、画像符号化装置を例にして説明したが、複数の予測モードを用いる画面内予測を行う画像復号装置でも同様に適用することができる。画像符号化装置及び画像復号装置をまとめて画像処理装置と呼ぶ。また、上記実施例では、Ｈ．２６４／ＡＶＣを例に説明したが、参照画素に対して複数の予測方向を用いて画面内予測を行う画像処理技術であれば適用できる。 In the embodiment, the image coding apparatus has been described as an example. However, the present invention can be similarly applied to an image decoding apparatus that performs intra prediction using a plurality of prediction modes. The image encoding device and the image decoding device are collectively referred to as an image processing device. In the above embodiment, H. Although H.264 / AVC has been described as an example, any image processing technique that performs intra prediction using a plurality of prediction directions with respect to a reference pixel can be applied.

以上、各実施例について詳述したが、特定の実施例に限定されるものではなく、特許請求の範囲に記載された範囲内において、上記変形例以外にも種々の変形及び変更が可能である。 Each embodiment has been described in detail above. However, the present invention is not limited to the specific embodiment, and various modifications and changes other than the above-described modification are possible within the scope described in the claims. .

１００、３００画像符号化装置
１０１予測画像生成部
１０２直交変換部
１０３量子化部
１０４エントロピー符号化部
１０５逆量子化部
１０６逆直交変換部
１０７復号画像生成部
１０８デブロッキングフィルタ部
１０９復号画像記憶部
１１０イントラ予測部
１１１インター予測部
１１２動きベクトル計算部
１１３符号化制御及びヘッダ生成部
１１４予測画像選択部
２０１分類部
２０２フィルタ決定部
２０３フィルタリング部
２０４生成部
２０５予測画像生成部
２０６予測モード決定部
３０１制御部
３０２主記憶部
３０３補助記憶部
３０４ドライブ装置
３０６ネットワークＩ／Ｆ部
３０７入力部
３０８表示部 100, 300 Image coding apparatus 101 Predicted image generation unit 102 Orthogonal transformation unit 103 Quantization unit 104 Entropy coding unit 105 Inverse quantization unit 106 Inverse orthogonal transformation unit 107 Decoded image generation unit 108 Deblocking filter unit 109 Decoded image storage unit 110 Intra Prediction Unit 111 Inter Prediction Unit 112 Motion Vector Calculation Unit 113 Coding Control and Header Generation Unit 114 Predictive Image Selection Unit 201 Classification Unit 202 Filter Determination Unit 203 Filtering Unit 204 Generation Unit 205 Predictive Image Generation Unit 206 Prediction Mode Determination Unit 301 Control unit 302 Main storage unit 303 Auxiliary storage unit 304 Drive device 306 Network I / F unit 307 Input unit 308 Display unit

Claims

An image processing apparatus that performs intra-screen prediction using a plurality of prediction directions,
A classification unit that classifies a plurality of reference pixels into a plurality of groups based on a possibility of being referred to; and
A filter determination unit that determines a different filter for each classified group;
A filtering unit that performs filtering on pixel values of the reference pixels included in each group using the filter determined by the filter determination unit;
Using the pixel value after filtering by the filtering unit, a generation unit that generates a prediction image of the intra-screen prediction;
An image processing apparatus comprising:

The classification unit includes:
The image processing apparatus according to claim 1, wherein the plurality of reference pixels are classified based on the number of prediction directions to be referred to.

The classification unit includes:
The image processing apparatus according to claim 1, wherein the plurality of reference pixels are classified according to whether or not they are adjacent to a processing target block.

The classification unit includes:
The image processing apparatus according to claim 3, wherein reference pixels adjacent to the processing target block or pixels not adjacent to the processing target block are further classified into a plurality of groups.

The filter determination unit
5. The image processing apparatus according to claim 1, wherein the image processing apparatus determines a low-pass filter having a wider pass band as the reference pixel in the group is more likely to be referred to.

The filter determination unit
There is a possibility that a noise removal filter is determined for the first group including a reference pixel that is most likely to be referenced, and a reference pixel in the group is referred to for a group other than the first group. The image processing apparatus according to claim 1, wherein the image processing apparatus determines a low-pass filter having a wider pass band as the height is higher.

A classification step of classifying a plurality of reference pixels into a plurality of groups based on a reference possibility;
A filter determination step for determining different filters for each classified group;
A filtering step of filtering the pixel values of the reference pixels included in each group using the determined filter;
A program for causing a computer to execute a generation step of generating a predicted image for intra prediction using pixel values after filtering.