JP6200220B2

JP6200220B2 - Image processing apparatus, encoding apparatus, decoding apparatus, and program

Info

Publication number: JP6200220B2
Application number: JP2013132070A
Authority: JP
Inventors: 康孝松尾; 境田　慎一; 慎一境田; 善明鹿喰
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2013-06-24
Filing date: 2013-06-24
Publication date: 2017-09-20
Anticipated expiration: 2033-06-24
Also published as: JP2015008367A

Description

本発明は、画像の奥行き情報を用いて量子化ステップを調整する画像処理装置、符号化装置、復号装置、及びプログラムに関する。 The present invention relates to an image processing apparatus, an encoding apparatus, a decoding apparatus, and a program that adjust a quantization step using depth information of an image.

近年、奥行き情報を取得できる深度センサを搭載したカメラが盛んに研究開発されている。例えば、ＲＧＢにＤ（奥行き）を加えた画像情報を取得するＲＧＢ―Ｄセンサを搭載したカメラの一例として、Ｍｉｃｒｏｓｏｆｔ社のＫｉｎｅｃｔ（登録商標）がある。このような背景から、２次元の画像信号と、奥行き情報とを有する動画像情報の利用が増加することが予想される。 In recent years, a camera equipped with a depth sensor capable of acquiring depth information has been actively researched and developed. For example, there is Microsoft's Kinect (registered trademark) as an example of a camera equipped with an RGB-D sensor for acquiring image information obtained by adding D (depth) to RGB. From such a background, it is expected that the use of moving image information having a two-dimensional image signal and depth information will increase.

また、動画情報を効率的に処理する規格であるＨ．２６４／ＡＶＣ（Advanced Video Coding）やＨ．２６５／ＨＥＶＣ（High Efficiency Video Coding）の映像の圧縮符号化方式では、映像の各フレームをブロックと呼ばれる矩形領域に分割して符号化が行われる。 In addition, H.264, which is a standard for efficiently processing moving image information. H.264 / AVC (Advanced Video Coding) and H.264. In the H.265 / HEVC (High Efficiency Video Coding) video compression encoding method, each frame of the video is divided into rectangular areas called blocks, and encoding is performed.

この映像の圧縮の手法の１つとして、画像の符号化の際に量子化の処理が行われる。量子化とは、元々持っているデータの細かさを粗く分け直す処理である。すなわち、直交変換した各周波数成分に対して、所定の量子化ステップで除算した結果を整数値に丸める処理を「量子化」と呼んでいる。逆に、この量子化で得られた整数値に量子化ステップを乗算して直交変換した成分に戻す処理を「逆量子化」と呼ぶ。 As one of the video compression methods, quantization processing is performed when an image is encoded. Quantization is a process of roughly re-dividing the fineness of data originally possessed. That is, the process of rounding the result obtained by dividing each frequency component subjected to orthogonal transformation by a predetermined quantization step to an integer value is called “quantization”. Conversely, the process of multiplying the integer value obtained by this quantization by a quantization step and returning it to the orthogonally transformed component is called “inverse quantization”.

量子化によって、直交変換した成分よりも小さな整数値と少ないバリエーションとなった量子化値を符号化することで、直交変換した成分を直接符号化するよりも符号化に必要なビット数を少なくすることができる（例えば非特許文献１、及び非特許文献２参照）。 Quantization reduces the number of bits required for encoding compared to direct encoding of orthogonally transformed components by encoding smaller integer values and smaller variations of quantized values than orthogonally transformed components (For example, see Non-Patent Document 1 and Non-Patent Document 2).

図１は、Ｈ．２６４／ＡＶＣにおける、インターブロック（画面間符号化ブロック）の量子化マトリクスの例を示す図である。図１（ａ）は、４×４の１６個のインターブロックの量子化マトリクス１１０を示している。図１（ｂ）は、８×８の６４個のインターブロックの量子化マトリクス１２０を示している。 FIG. 2 is a diagram illustrating an example of a quantization matrix of an inter block (inter-coded block) in H.264 / AVC. FIG. 1A shows a 4 × 4 16 inter-block quantization matrix 110. FIG. 1B shows an 8 × 8 64 inter-block quantization matrix 120.

例えば、図１（ｂ）における量子化マトリクス１２０のブロック１０１の量子化ステップ値は、「９」を示している。画像をＤＣＴ（離散コサイン変換）した場合、このブロック１０１に対応するＤＣＴ係数は、ＤＣ（直流）成分に該当する。したがって、ＤＣＴの直流成分を９で除算し、余りを切り捨てた商の値が、ＤＣ成分の量子化後の値となる。 For example, the quantization step value of the block 101 of the quantization matrix 120 in FIG. 1B indicates “9”. When the image is DCT (Discrete Cosine Transform), the DCT coefficient corresponding to the block 101 corresponds to a DC (direct current) component. Therefore, the quotient value obtained by dividing the DC component of DCT by 9 and discarding the remainder is the value after quantization of the DC component.

量子化ステップが小さい場合には、ＤＣＴ係数の周波数成分を量子化ステップで割り、切り捨てられる余りは一般に小さくなる。このため、量子化された値を逆量子化して戻した値と、量子化前の値との誤差は小さくなり、量子化による画像の品質の劣化は少ない。しかしながら、商の値は一般に大きくなるとともに商の値がとり得るバリエーションが多くなるため、符号化効率（圧縮率）は低下する。したがって、量子化ステップを変えることで、圧縮率、画像の品質等を調整することが可能である。 When the quantization step is small, the frequency component of the DCT coefficient is divided by the quantization step, and the remainder that is discarded is generally small. For this reason, an error between a value obtained by dequantizing the quantized value and a value before quantization is small, and deterioration in image quality due to quantization is small. However, since the value of the quotient generally increases and the variation that the value of the quotient can take increases, the coding efficiency (compression rate) decreases. Therefore, it is possible to adjust the compression rate, the image quality, and the like by changing the quantization step.

一般に、量子化ステップは、人間の目に認識されやすい低周波成分（図１の量子化マトリクスの左上部分）に対して、小さな値とする。そして、人間の目に対して画像の劣化が認識されにくい高周波成分（図１の量子化マトリクスの右下部分）に対して、大きな値とすることが多い。 In general, the quantization step is set to a small value with respect to a low-frequency component (upper left portion of the quantization matrix in FIG. 1) that is easily recognized by human eyes. In many cases, a large value is set for a high-frequency component (lower right portion of the quantization matrix in FIG. 1) in which image degradation is difficult to be recognized by human eyes.

上述のように、量子化ステップを大きくすれば、画像の符号化効率を高めることができる。しかしながら、空間周波数の低い画像の領域において、等高線のような擬似輪郭が発生する傾向が高くなる。 As described above, if the quantization step is increased, the encoding efficiency of the image can be increased. However, there is a high tendency for pseudo contours such as contour lines to occur in an image region having a low spatial frequency.

逆に、量子化ステップを小さくすれば、画像の圧縮率は低下するが、画像表現及び精細度が向上した画像を再現することができる。しかしながら、例えば被写界深度外のような、一般的に人間によって重要視されない画像領域においても、低圧縮の処理が施されるため、全体的な圧縮率を低下させることになる。 Conversely, if the quantization step is reduced, the compression ratio of the image is reduced, but an image with improved image expression and definition can be reproduced. However, for example, even in an image region that is generally not important by humans, such as outside the depth of field, the low compression process is performed, so that the overall compression rate is reduced.

したがって、互いに相反する画像の品質の担保と圧縮率の確保を得るためには、画像の性質に応じて、量子化ステップの値を適切に選ぶことが重要である、 Therefore, it is important to appropriately select the value of the quantization step according to the nature of the image in order to ensure the quality of the mutually contradictory image and ensure the compression rate.

亀山渉、花村剛著、「ディジタル放送教科書（上）ＭＰＥＧ−１／ＭＰＥＧ−２／ＭＰＥＧ−４」、インプレス、２００３年Wataru Kameyama, Takeshi Hanamura, “Digital Broadcast Textbook (above) MPEG-1 / MPEG-2 / MPEG-4”, Impress, 2003 大久保榮監修，「改訂三版Ｈ．２６４／ＡＶＣ教科書」、インプレスＲ＆Ｄ，ｐ１２４、２００９年１月１日Supervised by Satoshi Okubo, “Revised third edition H.264 / AVC textbook”, Impress R & D, p124, January 1, 2009

上述のように、量子化ステップは、擬似輪郭の発生などの画像品質、及び画像データの圧縮率に密接に関連している。そして、被写界深度内にある画像領域は、注視領域である可能性が高いため、擬似輪郭による視覚的な画像の劣化が目立ちやすい。 As described above, the quantization step is closely related to the image quality such as the generation of pseudo contours and the compression rate of the image data. Since the image area within the depth of field is likely to be a gaze area, visual image degradation due to pseudo contours is easily noticeable.

また、ＣＭＯＳ等の撮像素子で生じるノイズは、被写界深度内／外で一様に含まれ、符号化効率を低下させる。しかしながら、ノイズは画像表現及び精細度向上のために重要であり、被写界深度内／外で一様に除去すればよいというものではない。 In addition, noise generated in an image pickup device such as a CMOS is uniformly included in / out of the depth of field, which reduces the encoding efficiency. However, noise is important for improving the image expression and definition, and it does not have to be removed uniformly within / out of the depth of field.

そこで、本発明は、奥行き情報を用いて、画像の被写界深度内／外の情報を求めると共に、この情報を用いて量子化ステップの値を調整することで、画像劣化を抑制し、或いは符号化効率を向上させることを目的とする。 Therefore, the present invention uses the depth information to obtain information inside / outside the depth of field of the image, and adjusts the value of the quantization step using this information, thereby suppressing image degradation, or The purpose is to improve the coding efficiency.

本発明の一態様における画像処理装置は、画像の奥行きの情報を用いて画像の量子化ステップを調整する画像処理装置であって、前記画像に含まれる複数のブロックの各々の空間周波数のパワーを取得する空間周波数取得部と、前記画像における各画素位置又は各所定領域の奥行きの情報をクラスタリングし、当該クラスタリングの結果に基づいて、前記複数のブロックの各々のクラスタを決定する奥行き情報クラスタリング部と、所定のクラスタに属する１つ以上のブロックの空間周波数のパワーと、前記所定のクラスタ以外に属する複数のブロックの空間周波数のパワーとに基づいて、前記所定のクラスタに属する前記１つ以上のブロックを被写界深度内に属すると判定する、被写界深度判定部と、前記被写界深度判定部の判定結果と、前記画像の圧縮率とに基づいて、前記複数のブロックのうち所定のブロックに対して、直交変換された複数の周波数成分の各々の量子化ステップを含む量子化マトリクスの値を修正する、量子化マトリクス修正部と、を有する。 An image processing apparatus according to an aspect of the present invention is an image processing apparatus that adjusts a quantization step of an image by using information on the depth of the image, and calculates a spatial frequency power of each of a plurality of blocks included in the image. A spatial frequency acquisition unit to acquire, and a depth information clustering unit that clusters information on the depth of each pixel position or each predetermined region in the image and determines each cluster of the plurality of blocks based on the result of the clustering; a power of one or more spatial frequencies of the block belonging to a given cluster, based on the power of the spatial frequency of a plurality of blocks belonging to other than the given cluster, the one or more blocks belonging to the given cluster And a determination result of the depth of field determination unit, the determination result of the depth of field determination unit, A quantization matrix including a quantization step including a quantization step for each of a plurality of frequency components orthogonally transformed with respect to a predetermined block of the plurality of blocks based on a compression rate of the recorded image; A matrix correction unit.

前記画像の圧縮率が所定の第１の圧縮率より高い場合を高圧縮率と判定する圧縮率判定部を有し、前記被写界深度判定部は、前記所定のクラスタに属する１つ以上のブロックの空間周波数のうち所定の空間周波数を越える空間周波数のパワーの割合が、前記所定のクラスタ以外に属する複数のブロックの空間周波数のうち前記所定の空間周波数を越える空間周波数のパワーの割合より高い場合、前記所定のクラスタに属する前記１つ以上のブロックを被写界深度内に属すると判定し、前記量子化マトリクス修正部は、前記圧縮率が、高圧縮率である場合、被写界深度内に属すると判定された前記１つ以上のブロックのうち、空間周波数の低周波成分のパワーの割合が所定の割合より高いブロックに対して、前記量子化マトリクスの所定の領域を、より小さい値に修正してもよい。

A compression rate determination unit that determines that the compression rate of the image is higher than a predetermined first compression rate as a high compression rate, wherein the depth-of-field determination unit includes at least one of the predetermined clusters The proportion of the spatial frequency power exceeding the predetermined spatial frequency among the spatial frequencies of the blocks is higher than the proportion of the power of the spatial frequency exceeding the predetermined spatial frequency among the spatial frequencies of the plurality of blocks belonging to other than the predetermined cluster. The one or more blocks belonging to the predetermined cluster are determined to belong to a depth of field, and the quantization matrix correction unit determines the depth of field when the compression rate is a high compression rate. Among the one or more blocks determined to belong to, a predetermined area of the quantization matrix for a block in which the power ratio of the low frequency component of the spatial frequency is higher than a predetermined ratio , It may be modified to a smaller value.

また、前記量子化マトリクス修正部は、前記圧縮率が、所定の第２の圧縮率より低い場合、被写界深度内に属すると判定された前記１つ以上のブロック以外のブロックに対して、前記量子化マトリクスの値を、より大きい値に修正してもよい。 In addition, the quantization matrix correction unit, for the blocks other than the one or more blocks determined to belong within the depth of field when the compression rate is lower than a predetermined second compression rate, The value of the quantization matrix may be modified to a larger value.

また、前記空間周波数の低周波成分のパワーの割合が所定の割合より高いブロックは、第１の閾値以下の空間周波数のパワーを前記第１の閾値を超える空間周波数のパワーで除した値が、第２の閾値を越えるブロックであってもよい。 Further, in the block in which the ratio of the power of the low frequency component of the spatial frequency is higher than a predetermined ratio, the value obtained by dividing the power of the spatial frequency below the first threshold by the power of the spatial frequency exceeding the first threshold is: The block may exceed the second threshold.

また、本発明の他の態様における符号化装置は、上記画像処理装置を備えてもよい。 An encoding apparatus according to another aspect of the present invention may include the image processing apparatus.

また、本発明の他の態様におけるプログラムは、コンピュータを、上記符号化装置として機能させる。 A program according to another aspect of the present invention causes a computer to function as the encoding device.

本発明によれば、奥行き情報を用いて、画像内の被写界深度内／外判定を行って量子化ステップを調整するので、画像劣化を抑制し、又は符号化効率を向上させることができる。 According to the present invention, since the quantization step is adjusted by performing the inside / outside determination of the depth of field in the image using the depth information, it is possible to suppress the image deterioration or improve the encoding efficiency. .

インターブロック（画面間符号化ブロック）の量子化マトリクスの例を示す図。The figure which shows the example of the quantization matrix of an inter block (inter-screen coding block). 実施例１における画像処理装置の概略ブロック図。1 is a schematic block diagram of an image processing apparatus in Embodiment 1. FIG. 直交変換ブロックの空間周波数における高帯域の領域の例を示す図。The figure which shows the example of the area | region of the high band in the spatial frequency of an orthogonal transformation block. 実施例１の画像処理装置１０の処理の一例のフローチャート。3 is a flowchart illustrating an example of processing performed by the image processing apparatus according to the first exemplary embodiment. 実施例１における被写界深度判定の例を示すフローチャート。6 is a flowchart illustrating an example of depth of field determination in the first embodiment. 実施例１の量子化マトリクスの値を修正する例を示すフローチャート。9 is a flowchart illustrating an example of correcting a quantization matrix value according to the first embodiment. 実施例２における画像処理装置の概略構成の一例を示すブロック図。FIG. 6 is a block diagram illustrating an example of a schematic configuration of an image processing apparatus according to a second embodiment. 実施例３における画像処理装置の概略構成の一例を示すブロック図。FIG. 9 is a block diagram illustrating an example of a schematic configuration of an image processing apparatus according to a third embodiment. 実施例４における画像処理装置の概略構成の一例を示すブロック図。FIG. 10 is a block diagram illustrating an example of a schematic configuration of an image processing apparatus according to a fourth embodiment.

以下、実施例について図面を参照しながら説明する。 Hereinafter, embodiments will be described with reference to the drawings.

［実施例１］
実施例１における画像処理装置１０は、修正された量子化マトリクスを決定する装置である。この画像処理装置は、プログラムが実行されることで機能してもよいし、集積回路などにより実装されてもよい。 [Example 1]
The image processing apparatus 10 according to the first embodiment is an apparatus that determines a modified quantization matrix. This image processing apparatus may function by executing a program, or may be implemented by an integrated circuit or the like.

＜実施例１の概要及び位置づけ＞
実施例１は、修正された量子化マトリクス（又は量子化パラメータ）を決定するために、画像の奥行き情報に基づく被写界深度の情報を用いる。この量子化マトリクス（又は量子化パラメータ）は、既に述べたように、画像の符号化の過程において、情報の圧縮を行うために用いられる。したがって、以下に詳述する実施例１は、例えば、画像符号化装置の中で用いることができる。なお、画像符号化装置については、実施例２において、その具体例を示すこととする。 <Outline and Position of Example 1>
Example 1 uses depth-of-field information based on image depth information to determine a modified quantization matrix (or quantization parameter). As already described, this quantization matrix (or quantization parameter) is used to compress information in the process of image encoding. Therefore, Example 1 described in detail below can be used in, for example, an image encoding device. A specific example of the image encoding device will be described in the second embodiment.

なお、実施例１で用いる被写界深度の情報を得るためには、分割された各ブロックにおける奥行き情報と、空間周波数の情報に関して画像の１フレーム分の情報を取得することが望ましい。このため、以下に説明する実施例１においては、１フレームの画像処理を２パスで行う具体例を示す。 In order to obtain information on the depth of field used in the first embodiment, it is desirable to acquire information for one frame of the image regarding the depth information in each divided block and the spatial frequency information. For this reason, the first embodiment described below shows a specific example in which image processing for one frame is performed in two passes.

すなわち、１パス目において、１フレーム内の各ブロックの被写界深度情報が得られる。そして、２パス目において、この被写界深度情報を用いて、１フレーム内の各ブロックの量子化マトリクス（又は量子化パラメータ）を修正する処理が施される。 That is, in the first pass, depth-of-field information of each block in one frame is obtained. In the second pass, processing for correcting the quantization matrix (or quantization parameter) of each block in one frame is performed using this depth of field information.

したがって、１パス目及び２パス目のいずれのパスにおいても、実施例１を含む画像符号化装置全体が動作することは、当業者であれば理解されるところである。なお、１パス目におけるブロック分割情報を、そのまま２パス目で用いることが望ましい。このため、１パス目におけるブロック分割情報が保存され、２パス目の処理において、その保存されたブロック分割情報が利用される。なお、実施例１は２パスの処理に限定されるものではない。 Therefore, those skilled in the art will understand that the entire image encoding apparatus including the first embodiment operates in both the first pass and the second pass. It is desirable to use the block division information in the first pass as it is in the second pass. For this reason, the block division information in the first pass is saved, and the saved block division information is used in the process of the second pass. The first embodiment is not limited to the two-pass process.

なお、実施例１においては、画像符号化装置における前処理部、エントロピー符号化部、予測誤差信号、ブロック直交変換後情報、量子化後情報等にも言及している。これらの詳細は、画像符号化装置の具体例を示す実施例２において詳述するが、実施例１の理解を助けるために、これらの概略を以下に簡潔に述べることとする。 In the first embodiment, reference is also made to a preprocessing unit, an entropy coding unit, a prediction error signal, post-block orthogonal transform information, post-quantization information, and the like in the image coding apparatus. These details will be described in detail in a second embodiment showing a specific example of an image encoding device. In order to facilitate understanding of the first embodiment, the outline thereof will be briefly described below.

前処理部は、原画像のピクチャタイプに合わせてピクチャを並べ替え、ピクチャタイプ及びフレーム毎のフレーム画像等を順次出力する。また、前処理部は、ブロック分割なども行う。 The preprocessing unit rearranges the pictures in accordance with the picture type of the original image, and sequentially outputs the picture type and the frame image for each frame. The preprocessing unit also performs block division and the like.

エントロピー符号化部は、シンボルの出現頻度に応じて可変長の符号を割り当てるものであり、量子化された信号出力等を符号化する。 The entropy encoding unit assigns a variable-length code according to the appearance frequency of the symbol, and encodes the quantized signal output and the like.

予測誤差信号とは、原画像のブロックのデータと、予測画像のブロックのデータの差分値を意味する。他のフレームの情報又は同一フレーム内の周辺の情報から、画像を予測し、原画像との差分を得て、予測誤差信号を得ることによって、符号化すべき画像の情報量を圧縮することができる。 The prediction error signal means a difference value between block data of the original image and block data of the prediction image. The amount of information of an image to be encoded can be compressed by predicting an image from other frame information or peripheral information in the same frame, obtaining a difference from the original image, and obtaining a prediction error signal. .

ブロック直交変換後情報とは、予測誤差信号を直交変換することによって得られた情報を意味する。直交変換の手法としては、例えばＤＣＴ（離散コサイン変換）が用いられる。予測誤差信号を直交変換することによって、更に情報の圧縮が可能である。 The post-block orthogonal transformation information means information obtained by orthogonal transformation of the prediction error signal. For example, DCT (Discrete Cosine Transform) is used as the orthogonal transform method. Information can be further compressed by orthogonally transforming the prediction error signal.

量子化後情報とは、直交変換後の情報を量子化した情報を意味する。直交変換後の情報を量子化することによって出力信号の符号量を低減することができる。 The post-quantization information means information obtained by quantizing information after orthogonal transformation. By quantizing the information after the orthogonal transformation, the code amount of the output signal can be reduced.

以上の前提に基づいて、以下、実施例１の構成及び動作について詳述する。なお、実施例１は、これらの前提に限定されるものではない。
＜実施例１の構成＞
実施例１における１つのフレームに対する処理は、大きく１パス目と２パス目に分けられる。特定のフレームに対する１パス目の処理が終了すると、その特定のフレームに対する２パス目の処理が実行される。 Based on the above assumptions, the configuration and operation of the first embodiment will be described in detail below. The first embodiment is not limited to these assumptions.
<Configuration of Example 1>
The processing for one frame in the first embodiment is roughly divided into the first pass and the second pass. When the first pass processing for the specific frame is completed, the second pass processing for the specific frame is executed.

図２は、実施例１における画像処理装置１０の概略ブロック図である。図２に示す画像処理装置は、１パス目動作部１４０と、１パス・２パス切替指示部１３０と、量子化マトリクス修正部１６０とを有する。１パス・２パス切替指示部１３０が、１パス目動作指示信号１３２を活性化させ、１パス目動作部１４０を動作させる。また、１パス・２パス切替指示部１３０は、１パス・２パス切替指示１６２を前処理部２００に与え、前処理部２００が１パス目の処理であることを認識させる。なお、前処理部の動作については、後述する。また、１パス・２パス切替指示部１３０は、２パス目動作指示信号１８４を活性化させ、量子化マトリクス修正部１６０を動作させる。また、１パス・２パス切替指示部１３０は、２パス目動作指示信号１８４を活性化させ、後述するエントロピー符号化部を動作させる。 FIG. 2 is a schematic block diagram of the image processing apparatus 10 according to the first embodiment. The image processing apparatus illustrated in FIG. 2 includes a first pass operation unit 140, a 1 pass / 2 pass switching instruction unit 130, and a quantization matrix correction unit 160. The 1-pass / 2-pass switching instruction unit 130 activates the first-pass operation instruction signal 132 and operates the first-pass operation unit 140. Further, the 1-pass / 2-pass switching instruction unit 130 gives the 1-pass / 2-path switching instruction 162 to the pre-processing unit 200, and recognizes that the pre-processing unit 200 is the first pass process. The operation of the preprocessing unit will be described later. The 1-pass / 2-pass switching instruction unit 130 activates the second-pass operation instruction signal 184 and operates the quantization matrix correction unit 160. Further, the 1-pass / 2-pass switching instruction unit 130 activates the second-pass operation instruction signal 184 and operates an entropy encoding unit described later.

１パス目動作部１４０は、ブロック分割情報保存部１１１と、予測誤差信号情報量積算部１１２と、奥行き情報クラスタリング部１１３と、被写界深度判定部１１４と、空間周波数取得部１１５と、量子化後情報量積算部１１６と、圧縮率判定部１１７とを有する。また、被写界深度判定部１１４は被写界深度保存部１１８を含む。 The first-pass operation unit 140 includes a block division information storage unit 111, a prediction error signal information amount accumulation unit 112, a depth information clustering unit 113, a depth of field determination unit 114, a spatial frequency acquisition unit 115, a quantum The information amount integrating unit 116 after conversion and the compression rate determining unit 117 are included. The depth of field determination unit 114 includes a depth of field storage unit 118.

１パス目動作部１４０は、１パス目において動作を行い、２パス目は、必用な情報を他の処理部に提供する能力を有する。 The first pass operation unit 140 operates in the first pass, and the second pass has a capability of providing necessary information to other processing units.

ブロック分割情報保存部１１１は、前処理部２００からのブロック分割情報１６４を取得し、順次保存する。ブロック分割情報保存部１１１は、少なくとも１フレーム分のブロック分割情報を保存することができる。１フレーム分のブロック分割情報は、被写界深度保存部１１８に提供される。また、１フレーム分のブロック分割情報１６４は、前処理部に提供され、前処理部が、２パス目において、１パス目と同じブロック分割を行うことができるようにする。 The block division information storage unit 111 acquires the block division information 164 from the preprocessing unit 200 and sequentially stores it. The block division information storage unit 111 can store block division information for at least one frame. The block division information for one frame is provided to the depth of field storage unit 118. Further, the block division information 164 for one frame is provided to the preprocessing unit, so that the preprocessing unit can perform the same block division in the second pass as in the first pass.

予測誤差信号情報量積算部１１２は、１フレーム分の予測誤差信号１６８の情報量の積算値ａを取得する。 The prediction error signal information amount integration unit 112 acquires an integrated value a of the information amount of the prediction error signal 168 for one frame.

量子化後情報量積算部１１６は、１フレーム分の、量子化後情報１８２を積算し、積算値ｂを取得する。 The post-quantization information amount integration unit 116 integrates post-quantization information 182 for one frame and obtains an integrated value b.

圧縮率判定部１１７は、ｂ／ａ＝ｃを計算することにより、直交変換と量子化により達成される１フレーム分の情報量の圧縮率ｃを取得する。圧縮率ｃが所定の第１の圧縮率より高い場合を「高圧縮率」と判定し、圧縮率ｃが所定の第２の圧縮率より低い場合、「低圧縮率」と判定する。この判定の結果は、２パス目において量子化マトリクス修正部１６０に提供される。 The compression rate determination unit 117 obtains the compression rate c of the information amount for one frame achieved by orthogonal transform and quantization by calculating b / a = c. When the compression rate c is higher than the predetermined first compression rate, it is determined as “high compression rate”, and when the compression rate c is lower than the predetermined second compression rate, it is determined as “low compression rate”. The result of this determination is provided to the quantization matrix correction unit 160 in the second pass.

奥行き情報クラスタリング部１１３は、例えば、１フレーム分の奥行き情報のヒストグラムを求め、このヒストグラムが取る値の範囲をクラスタ数ｎ（例えばｎ＝２）に等分割し、奥行き情報を分類する。奥行き情報クラスタリング部１１３は、各画素位置又は各所定領域の奥行き情報のクラスタリング結果として、奥行き降順に番号（例えば、クラスタ番号＝１、０）を付与して、このクラスタリング結果を得る。クラスタリングの手法としては、Ｋ−ｍｅａｎｓ法を用いることができる。 For example, the depth information clustering unit 113 obtains a histogram of depth information for one frame, equally divides the range of values taken by the histogram into the number of clusters n (for example, n = 2), and classifies the depth information. The depth information clustering unit 113 assigns numbers (for example, cluster numbers = 1, 0) in descending depth order as the clustering result of the depth information of each pixel position or each predetermined region, and obtains this clustering result. As a clustering method, a K-means method can be used.

K-means法は，あらかじめ固定された数（例えば，ｎ個）のクラスタの各々にその代表であるプロトタイプを与え，それぞれのデータを最も近いプロトタイプに割り当てることでクラスタリングを行う手法である。なお、実施例１は、特定のクラスタリング手法に限定されるものではない。 The K-means method is a technique for performing clustering by giving a prototype that is a representative to each of a predetermined number of clusters (for example, n) and assigning each data to the nearest prototype. The first embodiment is not limited to a specific clustering method.

被写界深度判定部１１４は、クラスタリング結果、及び空間周波数のパワーの情報として例えばブロック直交変換後情報１７２を用いる。ブロック直交変換後情報１７２は、空間周波数取得部１１５を介して取得される。 The depth of field determination unit 114 uses, for example, post-block orthogonal transformation information 172 as clustering results and spatial frequency power information. The post-block orthogonal transform information 172 is acquired via the spatial frequency acquisition unit 115.

なお、図２において空間周波数取得部１１５は、ブロック直交変換後情報１７２の代わりに、原画像から空間周波数のパワーを直接求めてもよい。この場合には、空間周波数取得部１１５に、原画像を与える。そして、空間周波数取得部１１５において、該当するブロックについて空間周波数のパワーを計算する。空間周波数のパワーを取得する手法としては、ＤＣＴ（離散コサイン変換）、アダマール変換、離散フーリエ変換などを用いることができる。 In FIG. 2, the spatial frequency acquisition unit 115 may directly obtain the spatial frequency power from the original image instead of the post-block orthogonal transformation information 172. In this case, an original image is given to the spatial frequency acquisition unit 115. Then, the spatial frequency acquisition unit 115 calculates the power of the spatial frequency for the corresponding block. As a method for acquiring the power of the spatial frequency, DCT (Discrete Cosine Transform), Hadamard Transform, Discrete Fourier Transform, or the like can be used.

まず、被写界深度判定部１１４は、同じクラスタにクラスタリングされた直交変換ブロックの空間周波数のうち、高い帯域のパワーの割合を算出する。なお、パワーではなく、空間周波数のレベルの絶対値、空間周波数の成分毎の強度等を用いてもよい。 First, the depth-of-field determination unit 114 calculates a ratio of power in a high band among spatial frequencies of orthogonal transform blocks clustered in the same cluster. Instead of power, the absolute value of the spatial frequency level, the intensity of each spatial frequency component, or the like may be used.

なお、本明細書においては、「空間周波数のパワー」の語を用いるが、この語は、「空間周波数のレベルの絶対値」、「空間周波数の成分毎の強度」等をも意味する語として用いる。 In this specification, the term “spatial frequency power” is used, but this term also means “absolute value of spatial frequency level”, “intensity for each component of spatial frequency”, and the like. Use.

図３は、画像の１つのブロックを直交変換して得られた直交変換係数（空間周波数）をブロック状に並べた例を示している。空間周波数成分３００は、１つの直交変換ブロック（８画素×８ライン）の６４画素をＤＣＴ変換した６４個のＤＣＴ係数を、水平成分及び垂直成分に並べたものである。 FIG. 3 shows an example in which orthogonal transform coefficients (spatial frequencies) obtained by orthogonal transform of one block of an image are arranged in blocks. The spatial frequency component 300 is obtained by arranging 64 DCT coefficients obtained by DCT transforming 64 pixels of one orthogonal transform block (8 pixels × 8 lines) into a horizontal component and a vertical component.

例えば、空間高周波成分の領域は、斜線で示した周波数領域３９０である。周波数領域３９０は、水平方向、及び垂直方向、それぞれの空間最大周波数の１／２を超える領域である。周波数領域３９０の空間周波数のパワーをＨ_１、全体の空間周波数成分３００の空間周波数のパワーをＷ_１とすれば、周波数領域３９０の空間高周波数のパワーの割合α_１は、以下の式で求まる。 For example, the region of the spatial high frequency component is a frequency region 390 indicated by hatching. The frequency region 390 is a region exceeding 1/2 of the maximum spatial frequency in the horizontal direction and the vertical direction. Assuming that the spatial frequency power in the frequency domain 390 is H ₁ and the spatial frequency power of the entire spatial frequency component 300 is W ₁ , the spatial high frequency power ratio α ₁ in the frequency domain 390 can be obtained by the following equation. .

α_１＝Ｈ_１／Ｗ_１
クラスタ数を２個とし、ここで求めたα_１を、クラスタ０に属する複数のブロックの各々に対して求めて、空間高周波数のパワーの割合の平均値αを求める。同様にクラスタ１に属する複数のブロック全体に対して求めた空間高周波数のパワーの割合の平均値をβとする。 α ₁ = H ₁ / W ₁
The number of clusters is set to two, and α ₁ obtained here is obtained for each of a plurality of blocks belonging to cluster 0 to obtain an average value α of the ratio of spatial high frequency power. Similarly, let β be the average value of the ratio of the power of the spatial high frequency obtained for all the blocks belonging to cluster 1.

そして、αとβを比較し、α＞βであれば、αが導かれたクラスタ０に属する複数のブロックが、被写界深度の領域内であると判断することができる。その理由は、被写界深度内の領域は、ぼけのない鮮明な画像が多いため、空間高周波数のパワーの割合が大きくなるという経験則に基づいているからである。以上のようにして、被写界深度内／外の領域を特定することができる。 Then, α and β are compared, and if α> β, it can be determined that a plurality of blocks belonging to cluster 0 from which α is derived are within the depth of field region. The reason is that the region within the depth of field is based on an empirical rule that the ratio of the power of the spatial high frequency is large because there are many clear images without blur. As described above, it is possible to specify a region within / out of the depth of field.

なお、図３に示す空間高周波数成分の領域の取り方は、一例に過ぎず、実施例１は、これに限定されない。また、上記の例では、クラスタ数を２としたが、３以上のクラスタに分けて、空間高周波数のパワーの割合が最も大きくなるクラスタに属する複数のブロックで構成される領域を、被写界深度内の領域と判定してもよい。 In addition, the method of taking the area | region of the spatial high frequency component shown in FIG. 3 is only an example, and Example 1 is not limited to this. In the above example, the number of clusters is 2. However, an area composed of a plurality of blocks belonging to a cluster having the highest spatial high frequency power ratio is divided into three or more clusters. You may determine with the area | region within the depth.

また、上述の例では、図３の空間周波数成分３００の左上（０，０）の位置のＤＣＴ係数の直流（ＤＣ）成分をも、全体の空間周波数のパワーの平均値を求める際に利用した。しかしながら、ＤＣ成分は、ブロックの明るさに関連する係数であるため、空間周波数のパワーの平均値を求める際に、計算から除外してもよい。ＤＣ成分を除外することによって、画像の明るさに左右されずに、空間高周波数のパワーの割合α_１を求めることができる。 In the above example, the direct current (DC) component of the DCT coefficient at the upper left (0, 0) position of the spatial frequency component 300 in FIG. 3 is also used when obtaining the average value of the power of the entire spatial frequency. . However, since the DC component is a coefficient related to the brightness of the block, it may be excluded from the calculation when obtaining the average value of the power of the spatial frequency. By excluding the DC component, it is possible to obtain the spatial high frequency power ratio α ₁ regardless of the brightness of the image.

被写界深度判定部１１４に含まれる被写界深度保存部１１８は、ブロック分割情報保存部１１１からの情報と、被写界深度判定部１１４での情報とを用いて、１フレームに含まれる各ブロックの被写界深度の内／外の情報を保存する。 The depth of field storage unit 118 included in the depth of field determination unit 114 is included in one frame using information from the block division information storage unit 111 and information from the depth of field determination unit 114. Store information inside / outside the depth of field of each block.

以上の処理によって、１パス目動作部１４０は、現在処理しているフレームが、高圧縮率、低圧縮率、あるいはそれ以外かの特定を行う。加えて、１パス目動作部１４０は、１フレームの各ブロックに対して、被写界深度の内／外の特定を行う。 Through the above processing, the first-pass operation unit 140 identifies whether the frame currently being processed is a high compression rate, a low compression rate, or otherwise. In addition, the first-pass operation unit 140 specifies inside / outside of the depth of field for each block of one frame.

１パス・２パス切替指示部１３０は、前処理部２００に対し、２パス目の処理であることを伝達する。前処理部２００は、この伝達に応答して、ブロック分割情報保存部１１１から得られたブロック分割情報に従って、ブロック分割を行い、各ブロックが処理されるよう、ブロックに関する情報を順次出力する。また、１パス・２パス切替指示部１３０は、２パス目動作指示信号１８４を、量子化マトリクス修正部１６０及びエントロピー符号化部２０４に与える。なお、エントロピー符号化部２０４については、図７を用いて後述する。 The 1-pass / 2-pass switching instruction unit 130 notifies the preprocessing unit 200 that the process is the second pass. In response to this transmission, the preprocessing unit 200 performs block division according to the block division information obtained from the block division information storage unit 111, and sequentially outputs information about the blocks so that each block is processed. Further, the 1-pass / 2-pass switching instruction unit 130 provides the second-pass operation instruction signal 184 to the quantization matrix correction unit 160 and the entropy encoding unit 204. The entropy encoding unit 204 will be described later with reference to FIG.

量子化マトリクス修正部１６０は、符号化のために一旦作成された量子化マトリクス又は量子化パラメータであるＱｐ値１７４、被写界深度判定部１１４からの情報、空間周波数取得部１１５からの情報、及び圧縮率判定部１１７からの情報を受け取る。量子化マトリクス修正部１６０は、符号化のために一旦作成された量子化マトリクス又は量子化パラメータであるＱｐ値１７４を修正することができる。 The quantization matrix correction unit 160 includes a Qp value 174 that is a quantization matrix or a quantization parameter once created for encoding, information from the depth-of-field determination unit 114, information from the spatial frequency acquisition unit 115, And the information from the compression rate determination unit 117 is received. The quantization matrix correction unit 160 can correct the Qp value 174 that is a quantization matrix or a quantization parameter once created for encoding.

まず、圧縮率判定部１１７が、高圧縮であると判定した場合を説明する。この場合には、量子化マトリクス修正部１６０は、被写界深度内に属するブロックに対して、擬似輪郭の発生を抑制する処理を施す。 First, a case where the compression rate determination unit 117 determines that the compression is high will be described. In this case, the quantization matrix correction unit 160 performs processing for suppressing the occurrence of pseudo contours on the blocks belonging to the depth of field.

量子化マトリクス修正部１６０は、被写界深度内に属するブロックのうち、空間低周波数のパワーの割合が、例えば所定の閾値より高いブロックを特定する。一般に、被写界深度内のブロックは、鮮明度が高いため、空間低周波数のパワーよりも空間高周波数のパワーの割合が高い場合が多い。しかしながら、オブジェクトのテクスチャ自体が空間低周波数を持つような場合には、被写界深度内においても、空間低周波数のパワーの割合が高いブロックが存在する場合がある。このようなブロックでは、上述のように擬似輪郭が発生する確率が高い。しかも被写界深度内にそのブロックが存在するために、この擬似輪郭が目立ちやすいと判断される。 The quantization matrix correction | amendment part 160 specifies the block in which the ratio of the power of spatial low frequency is higher than a predetermined threshold among the blocks which belong within the depth of field. In general, a block within the depth of field has a high definition, so that the ratio of the power of the spatial high frequency is often higher than the power of the spatial low frequency. However, when the texture of the object itself has a spatial low frequency, there may be a block having a high spatial low frequency power ratio even within the depth of field. In such a block, the probability that a pseudo contour will occur as described above is high. Moreover, since the block exists within the depth of field, it is determined that the pseudo contour is conspicuous.

したがって、量子化マトリクス修正部１６０は、このようなブロックに対して、擬似輪郭の発生を抑制するために、量子化マトリクスの値を、より小さくすることが望ましい。このため、量子化マトリクス修正部１６０は、このようなブロックを特定する。 Therefore, it is desirable that the quantization matrix correction unit 160 reduce the value of the quantization matrix for such a block in order to suppress the occurrence of pseudo contours. Therefore, the quantization matrix correction unit 160 identifies such a block.

量子化マトリクス修正部１６０は、特定されたブロックの量子化ステップを、より小さな値にして修正する。たとえば、量子化マトリクス修正部１６０は、量子化マトリクスの各要素の値を１／２にした値に修正してもよい。或いは、量子化マトリクス修正部１６０は、Ｑｐ値を、例えば６減算してもよい。Ｑｐ値が６減算されると、量子化マトリクスの値の各々は、１／２に設定されることになる。 The quantization matrix correction unit 160 corrects the quantization step of the identified block to a smaller value. For example, the quantization matrix correction unit 160 may correct the value of each element of the quantization matrix to a value halved. Alternatively, the quantization matrix correction unit 160 may subtract 6 from the Qp value, for example. When the Qp value is subtracted by 6, each value of the quantization matrix is set to ½.

或いは、例えば、図３に周波数領域３９０で示した量子化マトリクスの空間高周波成分の領域に対応する量子化マトリクスの値のみを１／２にした値に修正してもよい。あるいは、図３において、行番号が４以上、列番号が４以上の周波数領域に対応する量子化マトリクスの値のみを１／２にした値に修正してもよい。なお、修正のための係数１／２の値は例であり、その他の計数値を掛けても良いことは言うまでもない。また、空間高周波成分としてどの空間周波数の帯域を採用するかは、上記の例に限定されない。 Alternatively, for example, only the value of the quantization matrix corresponding to the spatial high frequency component region of the quantization matrix indicated by the frequency region 390 in FIG. 3 may be corrected to a value halved. Alternatively, in FIG. 3, only the value of the quantization matrix corresponding to the frequency region where the row number is 4 or more and the column number is 4 or more may be corrected to a value halved. It should be noted that the value of the coefficient 1/2 for correction is an example, and it goes without saying that other count values may be multiplied. Further, which spatial frequency band is adopted as the spatial high-frequency component is not limited to the above example.

上述のように、空間高周波成分に対応する量子化マトリクスの値を小さくする理由は、擬似輪郭が縞模様を呈するため、縞模様の部分で、高周波数成分が増加しているためである。したがって、擬似輪郭を抑制するためには、高周波成分の量子化誤差を小さくすることが効果的である場合が多い。 As described above, the reason for reducing the value of the quantization matrix corresponding to the spatial high-frequency component is that the high-frequency component is increased in the striped pattern because the pseudo contour exhibits a striped pattern. Therefore, in order to suppress the pseudo contour, it is often effective to reduce the quantization error of the high frequency component.

或いは、量子化マトリクス修正部１６０は、量子化マトリクスの値の各々を個別に修正し、新たな値を有する量子化マトリクスを生成する。 Alternatively, the quantization matrix correction unit 160 individually corrects each value of the quantization matrix to generate a quantization matrix having a new value.

次に、圧縮率判定部１１７が、低圧縮であると判定した場合を説明する。この場合には、符号化効率を高める処理を行うことができる。量子化マトリクス修正部１６０は、被写界深度判定部１１４で、被写界深度外であると判定された直交変換ブロックに対する量子化マトリクスの値を、より大きな値にして修正する。この理由は、被写界深度外の領域は、注視領域でない場合が多いため、この領域における圧縮率を高めても、画像の劣化が認識されにくいことが経験側として存在するからである。量子化マトリクス修正部１６０は、被写界深度判定部１１４で、被写界深度外であると判定された直交変換ブロックに対する量子化マトリクスの値を、例えば２倍の値に修正してもよい。なお、実施例１は、２倍に限定されるものではない。 Next, a case where the compression rate determination unit 117 determines that the compression is low will be described. In this case, it is possible to perform processing for improving the encoding efficiency. The quantization matrix correction unit 160 corrects the quantization matrix value for the orthogonal transform block determined by the depth of field determination unit 114 to be outside the depth of field to a larger value. This is because an area outside the depth of field is often not a gaze area, and even if the compression rate in this area is increased, it is difficult to recognize image degradation as an experience side. The quantization matrix correction unit 160 may correct the quantization matrix value for the orthogonal transform block determined by the depth of field determination unit 114 to be outside the depth of field, for example, to a double value. . In addition, Example 1 is not limited to 2 times.

或いは、上記の場合には、量子化マトリクス修正部１６０は、Ｑｐ値に、例えば６を加算してもよい。上述のように、Ｑｐ値に６を加算すると、量子化マトリクスの値の各々は、２倍される。或いは、量子化マトリクス修正部１６０は、量子化マトリクスの値の各々を個別に修正し、新たな値を有する修正された量子化マトリクス１８６を生成してもよい。 Alternatively, in the above case, the quantization matrix correction unit 160 may add, for example, 6 to the Qp value. As described above, when 6 is added to the Qp value, each value of the quantization matrix is doubled. Alternatively, the quantization matrix modification unit 160 may individually modify each value of the quantization matrix to generate a modified quantization matrix 186 having a new value.

以上のようにして、量子化マトリクス修正部１６０によって修正された量子化マトリクスの情報は、エントロピー符号化を経て復号側に伝送される（１８６）。また、併せて、被写界深度内／外の情報も復号側に伝送されてもよい。 The quantization matrix information modified by the quantization matrix modification unit 160 as described above is transmitted to the decoding side through entropy coding (186). In addition, information within / outside of the depth of field may also be transmitted to the decoding side.

なお、上述の説明において、Ｑｐ値を修正した場合には、修正されたＱｐ値を復号側に伝送すればよい。例えば、Ｈ．２６４／ＡＶＣの規格では、規格上Ｑｐ値を復号側に送ることになっている。このため、修正されたＱｐ値が復号側に送られれば、復号側は、量子化マトリクスの修正に係る追加的な復号化の機能を新たに設ける必要はない。 In the above description, when the Qp value is corrected, the corrected Qp value may be transmitted to the decoding side. For example, H.M. In the H.264 / AVC standard, the Qp value is to be sent to the decoding side according to the standard. For this reason, if the corrected Qp value is sent to the decoding side, the decoding side does not need to newly provide an additional decoding function related to the correction of the quantization matrix.

＜動作＞
次に、実施例１における画像処理装置１０の動作について説明する。図４は、画像処理装置１０の処理の一例を示す図である。図４に示すステップＳ４１０で、空間周波数取得部１１５は、ブロック直交変換後情報１７２を取得する。なお、上述のように、空間周波数取得部１１５は、原画像のフレーム（画像）に対して、直交変換が行われるブロック（直交変換ブロック）領域毎に、空間周波数解析を独自に行ってもよい。 <Operation>
Next, the operation of the image processing apparatus 10 in the first embodiment will be described. FIG. 4 is a diagram illustrating an example of processing of the image processing apparatus 10. In step S410 illustrated in FIG. 4, the spatial frequency acquisition unit 115 acquires post-block orthogonal transform information 172. Note that, as described above, the spatial frequency acquisition unit 115 may independently perform spatial frequency analysis for each block (orthogonal transform block) region in which orthogonal transform is performed on the frame (image) of the original image. .

ステップＳ４２０で、被写界深度判定部１１４は、画像のブロック毎の被写界深度の内／外判定を行う。判定では、例えば画素毎、又は所定の領域毎に取得された２５６段階の奥行き情報をK-means法などによりクラスタ化を行ってもよい。クラスタ数ｎとしては、例えば、ｎ＝２とする。動作の詳細については、図５で述べる。ブロック毎の被写界深度の情報は、ブロック分割情報保存部１１１のブロック分割の情報に対応付けて、被写界深度保存部１１８に保存される。 In step S420, the depth-of-field determination unit 114 performs inner / outer determination of the depth of field for each block of the image. In the determination, for example, 256-step depth information acquired for each pixel or for each predetermined region may be clustered by the K-means method or the like. As the number of clusters n, for example, n = 2. Details of the operation will be described with reference to FIG. The information on the depth of field for each block is stored in the depth of field storage unit 118 in association with the block division information in the block division information storage unit 111.

ステップＳ４３０で、圧縮率判定部１１７は、画像全体の圧縮率の高低を判定する。圧縮率判定部１１７は、例えば、フレームの圧縮率を、上述のように予測誤差信号情報量積算部１１２の情報と、量子化後情報量積算部１１６の情報との比を計算することによって求めることができる。 In step S430, the compression rate determination unit 117 determines whether the compression rate of the entire image is high or low. For example, the compression rate determination unit 117 calculates the compression rate of the frame by calculating the ratio between the information of the prediction error signal information amount integration unit 112 and the information of the post-quantization information amount integration unit 116 as described above. be able to.

ステップＳ４４０で、量子化マトリクス修正部１６０は、符号化のために一旦作成された量子化マトリクス又はＱｐ値１７４を修正し、修正された量子化マトリクス又はＱｐ値１８６を得る。この動作の詳細については、図６を用いて説明する。 In step S440, the quantization matrix correction unit 160 corrects the quantization matrix or Qp value 174 once created for encoding, and obtains a corrected quantization matrix or Qp value 186. Details of this operation will be described with reference to FIG.

図５は、実施例１における被写界深度の判定の例を示すフローチャートである。 FIG. 5 is a flowchart illustrating an example of determination of the depth of field in the first embodiment.

ステップＳ５１０で、１フレーム分の奥行き情報１７０に基づいて、奥行き情報クラスタリング部１１３は、各ブロックをクラスタ化する。クラスタ化の一例を以下に示す。 In step S510, based on the depth information 170 for one frame, the depth information clustering unit 113 clusters each block. An example of clustering is shown below.

まず、奥行き情報クラスタリング部１１３は、奥行き情報のヒストグラムを求め、このヒストグラムが取る値の範囲をクラスタ数ｎ（例えばｎ＝２）に等分割し、奥行き情報を分類する。 First, the depth information clustering unit 113 obtains a histogram of depth information, equally divides the range of values taken by the histogram into the number of clusters n (for example, n = 2), and classifies the depth information.

奥行き情報クラスタリング部１１３は、各画素位置又は各所定領域の奥行き情報のクラスタリング結果として、奥行き降順に番号（クラスタ番号＝１、０）を付与して、このクラスタリング結果を得る。 The depth information clustering unit 113 assigns numbers (cluster number = 1, 0) in descending depth order as the clustering result of the depth information of each pixel position or each predetermined region, and obtains this clustering result.

奥行き情報クラスタリング部１１３は、ブロック毎に、クラスタリング情報の代表値を取得する。例えば、奥行き情報クラスタリング部１１３は、ブロック毎にクラスタリング情報を平均化し、端数を四捨五入する整数化を行うことで、ブロック毎に１つの奥行き情報（クラスタ番号の代表値＝１又は０）を取得する。例えば、上述の整数化を行った値が１であれば、クラスタ番号の代表値＝１とする。また、整数化を行った値が０であれば、クラスタ番号の代表値＝０とすればよい。なお、奥行き情報クラスタリング部１１３は、ブロック毎に、クラスタリング情報の中央値や最頻値をクラスタ番号の代表値としてもよい。 The depth information clustering unit 113 acquires a representative value of clustering information for each block. For example, the depth information clustering unit 113 obtains one depth information (representative value of cluster number = 1 or 0) for each block by averaging the clustering information for each block and performing rounding to an integer. . For example, if the value obtained by the above integer conversion is 1, the representative value of the cluster number = 1. If the integer value is 0, the representative value of the cluster number may be 0. The depth information clustering unit 113 may use the median value or mode value of the clustering information as a representative value of the cluster number for each block.

ステップＳ５２０で、被写界深度判定部１１４は、奥行き情報クラスタリング部１１３からの情報を用いて、同じクラスタに属するブロック毎に空間高周波数のパワーの割合を算出する。ブロックにおける空間高周波数のパワーの割合の計算の例については、図３を用いて既に説明したので、説明は省略する。 In step S520, the depth-of-field determination unit 114 uses the information from the depth information clustering unit 113 to calculate the ratio of spatial high frequency power for each block belonging to the same cluster. Since the example of the calculation of the ratio of the spatial high frequency power in the block has already been described with reference to FIG.

ステップＳ５３０で、被写界深度判定部１１４は、空間高周波数のパワーの割合が最も高いクラスタを特定する。この特定されたクラスタに属する複数のブロックが被写界深度内に存在する可能性の高いブロックであると推定できる。 In step S530, the depth-of-field determining unit 114 identifies the cluster having the highest spatial high frequency power ratio. It can be estimated that a plurality of blocks belonging to the specified cluster are highly likely to exist within the depth of field.

ステップＳ５４０で、被写界深度判定部１１４は、特定されたクラスタに属するブロックの情報を、被写界深度保存部１１８に送る。被写界深度保存部１１８に保存された情報は、２パス目において、量子化マトリクス修正部１６０において用いられる。 In step S <b> 540, the depth of field determination unit 114 sends information on the blocks belonging to the identified cluster to the depth of field storage unit 118. The information stored in the depth of field storage unit 118 is used in the quantization matrix correction unit 160 in the second pass.

図６は、量子化マトリクス修正部１６０が、量子化マトリクスの値を修正する例を示したフローチャートである。 FIG. 6 is a flowchart illustrating an example in which the quantization matrix correction unit 160 corrects the value of the quantization matrix.

ステップＳ６１０で、量子化マトリクス修正部１６０は、圧縮率判定部１１７からの圧縮率の情報（高圧縮率であるか低圧縮率であるかの情報）を基に、処理の分岐のための判定を行う。判定結果が「はい」であれば、高圧縮であるためステップＳ６２０に進む。判定結果が「いいえ」であれば、ステップＳ６１１に進む。 In step S <b> 610, the quantization matrix correction unit 160 determines the processing branch based on the compression rate information from the compression rate determination unit 117 (information about whether the compression rate is high or low). I do. If the determination result is “Yes”, since the compression is high, the process proceeds to step S620. If the determination result is “No”, the process proceeds to step S611.

ステップＳ６１１で、量子化マトリクス修正部１６０は、圧縮率判定部１１７からの圧縮率の情報（高圧縮率であるか低圧縮率であるかの情報）を基に、処理の分岐のための判定を行う。判定結果が「はい」であれば低圧縮であるためステップＳ６４０に進む。判定結果が「いいえ」であれば、高圧縮でも低圧縮でもないため、終了する。 In step S611, the quantization matrix correction unit 160 determines the processing branch based on the compression rate information from the compression rate determination unit 117 (information about whether the compression rate is high or low). I do. If the determination result is “Yes”, since the compression is low, the process proceeds to step S640. If the determination result is “No”, the processing is terminated because neither high compression nor low compression is performed.

ステップＳ６２０で、量子化マトリクス修正部１６０は、被写界深度内に属するブロックのうち、空間低周波数のパワーの割合が、例えば所定の閾値より高いブロックを特定する。一般に、被写界深度内のブロックは、鮮明度が高いため、空間低周波数のパワーよりも空間高周波数のパワーの割合が高い場合が多い。しかしながら、オブジェクトのテクスチャ自体が空間低周波数を持つような場合には、被写界深度内においても、空間低周波数のパワーの割合が高いブロックが存在する場合がある。このようなブロックでは、上述のように擬似輪郭が発生する確率が高い。しかも被写界深度内にそのブロックが存在するために、この擬似輪郭が目立ちやすいと判断される。このようなブロックは、例えば、ブロック内における第１の閾値以下の空間周波数のパワーを第１の閾値を超える空間周波数のパワーで除した値が、第２の閾値を越えるブロックを見出せばよい。なお、第１の閾値、及び第２の閾値は、本実施例を実装する際に、当業者が適宜設定することができる。 In step S620, the quantization matrix correction unit 160 identifies blocks having a spatial low frequency power ratio higher than, for example, a predetermined threshold among blocks belonging to the depth of field. In general, a block within the depth of field has a high definition, so that the ratio of the power of the spatial high frequency is often higher than the power of the spatial low frequency. However, when the texture of the object itself has a spatial low frequency, there may be a block having a high spatial low frequency power ratio even within the depth of field. In such a block, the probability that a pseudo contour will occur as described above is high. Moreover, since the block exists within the depth of field, it is determined that the pseudo contour is conspicuous. For such a block, for example, a block in which the value obtained by dividing the power of the spatial frequency below the first threshold by the power of the spatial frequency exceeding the first threshold in the block exceeds the second threshold may be found. The first threshold and the second threshold can be appropriately set by those skilled in the art when implementing this embodiment.

ステップＳ６３０で、量子化マトリクス修正部１６０は、特定されたブロックの量子化ステップを、より小さい値に設定する。量子化マトリクスの値の設定の例については、既に量子化マトリクス修正部１６０の説明の部分で詳述したので、ここでは省略する。 In step S630, the quantization matrix correction unit 160 sets the quantization step of the identified block to a smaller value. An example of setting the value of the quantization matrix has already been described in detail in the description of the quantization matrix correction unit 160, and is omitted here.

ステップＳ６４０で、量子化マトリクス修正部１６０は、被写界深度外のブロックを特定する。被写界深度外のブロックは、経験則上、目に止まりにくいと判断される。したがって、被写界深度外のブロックの圧縮率を上げても、画像の劣化は目立ちにくいと予測される。このような理由から、量子化マトリクス修正部１６０は、被写界深度外のブロックの圧縮率を上げるように、被写界深度外のブロックに対応する量子化マトリクスの値を、より大きくする。量子化マトリクスの値の設定の例については、既に量子化マトリクス修正部１６０の説明の部分で詳述したので、ここでは省略する。 In step S640, the quantization matrix correction unit 160 identifies a block outside the depth of field. As a rule of thumb, it is determined that blocks outside the depth of field are not easily caught by the eyes. Therefore, even if the compression rate of the block outside the depth of field is increased, it is predicted that the deterioration of the image is not noticeable. For this reason, the quantization matrix correction unit 160 increases the value of the quantization matrix corresponding to the block outside the depth of field so as to increase the compression rate of the block outside the depth of field. An example of setting the value of the quantization matrix has already been described in detail in the description of the quantization matrix correction unit 160, and is omitted here.

以上、実施例１では、奥行き情報から、被写界深度に関する情報を取得することができる。加えて、この被写界深度の情報を用いて、画質の向上、又は圧縮率の増加を図ることができる。より詳細には、高圧縮時においては、被写界深度内に発生し得る擬似輪郭の発生を充分に抑制することができる。また、低圧縮時においては、被写界深度外の領域の量子化ステップを増加させることで、画像の劣化がなるべく目立たないように、圧縮率を上げることができる。 As described above, in the first embodiment, information on the depth of field can be acquired from the depth information. In addition, it is possible to improve the image quality or increase the compression rate using the information on the depth of field. More specifically, during high compression, it is possible to sufficiently suppress the occurrence of pseudo contours that can occur within the depth of field. Further, at the time of low compression, the compression rate can be increased by increasing the quantization step in the region outside the depth of field so that the deterioration of the image is not as conspicuous as possible.

［実施例２］
実施例２では、実施例１における画像処理装置１０を画像処理部（１）１１に含む画像処理装置（画像符号化装置）について説明する。 [Example 2]
In the second embodiment, an image processing apparatus (image encoding apparatus) including the image processing apparatus 10 in the first embodiment in the image processing unit (1) 11 will be described.

＜構成＞
図７は、実施例２における画像処理装置２０の概略構成の一例を示すブロック図である。図７に示す例では、画像処理装置２０は、前処理部２００と、予測誤差信号生成部２０１と、直交変換部２０２と、量子化部２０３と、エントロピー符号化部２０４と、逆量子化部２０５と、逆直交変換部２０６と、復号画像生成部２０７と、ループフィルタ部２０９と、復号画像記憶部２１０と、イントラ予測部２１１と、インター予測部２１２と、動きベクトル計算部２１３と、予測画像選択部２１５と、画像処理部（１）１１とを有する。各部についての概略を以下に説明する。 <Configuration>
FIG. 7 is a block diagram illustrating an example of a schematic configuration of the image processing apparatus 20 according to the second embodiment. In the example illustrated in FIG. 7, the image processing apparatus 20 includes a preprocessing unit 200, a prediction error signal generation unit 201, an orthogonal transformation unit 202, a quantization unit 203, an entropy coding unit 204, and an inverse quantization unit. 205, an inverse orthogonal transform unit 206, a decoded image generation unit 207, a loop filter unit 209, a decoded image storage unit 210, an intra prediction unit 211, an inter prediction unit 212, a motion vector calculation unit 213, and a prediction An image selection unit 215 and an image processing unit (1) 11 are included. An outline of each part will be described below.

前処理部２００は、ピクチャタイプに合わせてピクチャを並べ替え、ピクチャタイプ及びフレーム毎のフレーム画像等を順次出力する。また、前処理部２００は、ブロック分割なども行い、ブロック分割の境界情報を、画像処理部（１）１１、及びループフィルタ部２０９に出力する。前処理部２００は、画像処理部（１）１１から、１パス・２パス切替指示１６２を取得する。また、前処理部２００は、画像処理部（１）１１との間で、ブロック分割情報を、信号線１６４を介して交換する。前処理部２００は、１パス目で、ブロック分割情報を画像処理部（１）１１に与える。そして、前処理部２００は、２パス目で、ブロック分割情報を画像処理部（１）１１から取得する。このようにすることによって、前処理部２００は、１パス目のブロック分割と同じブロック分割を、２パス目で用いることができる。 The preprocessing unit 200 rearranges the pictures according to the picture type, and sequentially outputs the picture type and the frame image for each frame. The preprocessing unit 200 also performs block division and the like, and outputs block division boundary information to the image processing unit (1) 11 and the loop filter unit 209. The pre-processing unit 200 acquires a 1-pass / 2-pass switching instruction 162 from the image processing unit (1) 11. In addition, the preprocessing unit 200 exchanges block division information with the image processing unit (1) 11 through the signal line 164. The pre-processing unit 200 gives block division information to the image processing unit (1) 11 in the first pass. The preprocessing unit 200 acquires block division information from the image processing unit (1) 11 in the second pass. In this way, the preprocessing unit 200 can use the same block division as the first pass block division in the second pass.

予測誤差信号生成部２０１は、入力された原画像データの符号化対象画像が、例えば３２×３２、１６×１６、８×８画素などのブロックに分割されたブロックデータを取得する。 The prediction error signal generation unit 201 obtains block data obtained by dividing the encoding target image of the input original image data into blocks such as 32 × 32, 16 × 16, and 8 × 8 pixels, for example.

予測誤差信号生成部２０１は、そのブロックデータと、予測画像選択部２１５から出力される予測画像のブロックデータとにより、予測誤差信号を生成する。予測誤差信号生成部２０１は、生成された予測誤差信号を、画像処理部（１）１１、及び直交変換部２０２に出力する。 The prediction error signal generation unit 201 generates a prediction error signal based on the block data and the block data of the prediction image output from the prediction image selection unit 215. The prediction error signal generation unit 201 outputs the generated prediction error signal to the image processing unit (1) 11 and the orthogonal transformation unit 202.

直交変換部２０２は、入力された予測誤差信号を直交変換処理する。直交変換部２０２は、変換された係数値を示す信号を、画像処理部（１）１１、及び量子化部２０３に出力する。 The orthogonal transform unit 202 performs an orthogonal transform process on the input prediction error signal. The orthogonal transform unit 202 outputs a signal indicating the transformed coefficient value to the image processing unit (1) 11 and the quantization unit 203.

量子化部２０３は、直交変換部２０２からの出力信号を量子化する。量子化部２０３は、量子化することによって出力信号の符号量を低減し、この出力信号を画像処理部（１）１１、エントロピー符号化部２０４、及び逆量子化部２０５に出力する。 The quantization unit 203 quantizes the output signal from the orthogonal transform unit 202. The quantization unit 203 reduces the code amount of the output signal by performing quantization, and outputs the output signal to the image processing unit (1) 11, the entropy encoding unit 204, and the inverse quantization unit 205.

画像処理部（１）１１は、実施例１で説明した画像処理装置１０を含み得る。なお、画像処理部（１）１１は、量子化部２０３から信号線１７４により量子化パラメータ等を受け取る。そして、画像処理部（１）１１によって修正された量子化パラメータの情報が、信号線１８６によって、量子化部２０３に返される。量子化部２０３は、この修正された量子化パラメータを使用して、量子化を実行する。なお、画像処理部（１）１１は、量子化パラメータの代わりに、量子化マトリクス自体を修正して、量子化部２０３に与えてもよい。画像処理部（１）１１には、信号線１７０を介して、前処理部２００から奥行き情報が入力されている。なお、前処理部２００は、上述のように原画像を画像処理部（１）１１に提供してもよい。原画像は、図２示す空間周波数取得部１１５で利用され得る。また、奥行き情報は、図２示す奥行き情報クラスタリング部１１３で利用される。また、画像処理部（１）１１には、直交変換部２０２から信号線７１０を介して、直交変換係数が入力される。この直交変換係数は、図２に示す空間周波数取得部１１５が利用する。 The image processing unit (1) 11 can include the image processing apparatus 10 described in the first embodiment. The image processing unit (1) 11 receives the quantization parameter and the like from the quantization unit 203 through the signal line 174. Then, the information of the quantization parameter corrected by the image processing unit (1) 11 is returned to the quantization unit 203 through the signal line 186. The quantization unit 203 uses the modified quantization parameter to perform quantization. Note that the image processing unit (1) 11 may modify the quantization matrix itself instead of the quantization parameter, and give it to the quantization unit 203. Depth information is input to the image processing unit (1) 11 from the preprocessing unit 200 via the signal line 170. Note that the preprocessing unit 200 may provide the original image to the image processing unit (1) 11 as described above. The original image can be used in the spatial frequency acquisition unit 115 shown in FIG. The depth information is used in the depth information clustering unit 113 shown in FIG. Further, an orthogonal transform coefficient is input to the image processing unit (1) 11 from the orthogonal transform unit 202 via the signal line 710. This orthogonal transform coefficient is used by the spatial frequency acquisition unit 115 shown in FIG.

エントロピー符号化部２０４は、画像処理部（１）１１から２パス目動作指示信号１８４を受け取ることにより、２パス目であることを認識し動作する。エントロピー符号化部２０４は、量子化部２０３からの出力信号や、動きベクトル計算部２１３から出力された動きベクトル情報やループフィルタ部２０９からのフィルタ係数などをエントロピー符号化して出力する。 The entropy encoding unit 204 recognizes that it is the second pass by receiving the second pass operation instruction signal 184 from the image processing unit (1) 11, and operates. The entropy encoding unit 204 entropy-encodes and outputs the output signal from the quantization unit 203, the motion vector information output from the motion vector calculation unit 213, the filter coefficient from the loop filter unit 209, and the like.

また、エントロピー符号化部２０４は、イントラ予測部２１１から取得したイントラ予測方向の差分値や、インター予測部２１２から取得した動きベクトルと予測ベクトルの差分値などをエントロピー符号化する。 Also, the entropy encoding unit 204 entropy encodes the difference value of the intra prediction direction acquired from the intra prediction unit 211, the difference value of the motion vector and the prediction vector acquired from the inter prediction unit 212, and the like.

また、エントロピー符号化部２０４は、画像処理部（１）１１で修正された量子化マトリクスの情報を符号化する。エントロピー符号化とは、シンボルの出現頻度に応じて可変長の符号を割り当てる方式をいう。 The entropy encoding unit 204 encodes information of the quantization matrix corrected by the image processing unit (1) 11. Entropy coding is a method of assigning variable-length codes according to the appearance frequency of symbols.

逆量子化部２０５は、量子化部２０３からの出力信号を逆量子化してから逆直交変換部２０６に出力する。逆直交変換部２０６は、逆量子化部２０５からの出力信号を逆直交変換処理してから復号画像生成部２０７に出力する。これら逆量子化部２０５及び逆直交変換部２０６によって復号処理が行われることにより、符号化前の予測誤差信号と同程度の信号が得られる。 The inverse quantization unit 205 performs inverse quantization on the output signal from the quantization unit 203 and outputs the result to the inverse orthogonal transform unit 206. The inverse orthogonal transform unit 206 performs an inverse orthogonal transform process on the output signal from the inverse quantization unit 205 and then outputs the output signal to the decoded image generation unit 207. By performing decoding processing by the inverse quantization unit 205 and the inverse orthogonal transform unit 206, a signal having the same level as the prediction error signal before encoding is obtained.

復号画像生成部２０７は、イントラ予測部２１１で画面内予測された画像或いはインター予測部２１２で動き補償された画像のブロックデータと、逆量子化部２０５及び逆直交変換部２０６により復号処理された予測誤差信号とを加算する。復号画像生成部２０７は、加算して生成した復号画像のブロックデータを、ループフィルタ部２０９に出力する。 The decoded image generation unit 207 is decoded by the block data of the image predicted in the screen by the intra prediction unit 211 or the motion compensated image by the inter prediction unit 212, and the inverse quantization unit 205 and the inverse orthogonal transform unit 206. Add the prediction error signal. The decoded image generation unit 207 outputs the decoded image block data generated by addition to the loop filter unit 209.

ループフィルタ部２０９は、例えばＡＬＦ（Adaptive Loop Filter）やデブロッキングフィルタである。ループフィルタ部２０９は、フィルタ処理結果を復号画像記憶部２１０に出力し、蓄積された１画像分のフィルタ処理結果を参照画像として記憶させる。 The loop filter unit 209 is, for example, an ALF (Adaptive Loop Filter) or a deblocking filter. The loop filter unit 209 outputs the filter processing result to the decoded image storage unit 210, and stores the accumulated filter processing result for one image as a reference image.

復号画像記憶部２１０は、入力した復号画像のブロックデータを新たな参照画像のデータとして記憶し、イントラ予測部２１１、インター予測部２１２及び動きベクトル計算部２１３に出力する。 The decoded image storage unit 210 stores the input block data of the decoded image as new reference image data, and outputs the data to the intra prediction unit 211, the inter prediction unit 212, and the motion vector calculation unit 213.

イントラ予測部２１１は、符号化対象画像の処理対象ブロックに対して、既に符号化された参照画素から予測画像のブロックデータを生成する。イントラ予測部２１１は、複数の予測方向を用いて予測を行い、最適な予測方向を決定する。予測方向については、符号化済みブロックの予測方向との差分値をビットストリームに含めるために、差分値がエントロピー符号化部２０４に出力される。 The intra prediction unit 211 generates block data of the predicted image from the already-encoded reference pixels for the processing target block of the encoding target image. The intra prediction unit 211 performs prediction using a plurality of prediction directions, and determines an optimal prediction direction. With respect to the prediction direction, the difference value is output to the entropy encoding unit 204 in order to include the difference value with the prediction direction of the encoded block in the bitstream.

インター予測部２１２は、復号画像記憶部２１０から取得した参照画像のデータを動きベクトル計算部２１３から提供される動きベクトルで動き補償する。これにより、動き補償された参照画像としてのブロックデータが生成される。動きベクトルについては、符号化済みブロックの動きベクトル（予測ベクトル）との差分値をビットストリームに含めるために、差分値がエントロピー符号化部２０４に出力される。 The inter prediction unit 212 performs motion compensation on the reference image data acquired from the decoded image storage unit 210 with the motion vector provided from the motion vector calculation unit 213. Thereby, block data as a motion-compensated reference image is generated. For the motion vector, the difference value with the motion vector (predicted vector) of the encoded block is output to the entropy encoding unit 204 in order to include the difference value in the bitstream.

動きベクトル計算部２１３は、符号化対象画像におけるブロックデータと、復号画像記憶部２１０から取得する参照画像とを用いて、動きベクトルを求める。 The motion vector calculation unit 213 obtains a motion vector using the block data in the encoding target image and the reference image acquired from the decoded image storage unit 210.

動きベクトル計算部２１３は、求めた動きベクトルをインター予測部２１２に出力し、参照画像を示す情報を含む動きベクトル情報をエントロピー符号化部２０４に出力する。 The motion vector calculation unit 213 outputs the obtained motion vector to the inter prediction unit 212 and outputs motion vector information including information indicating the reference image to the entropy encoding unit 204.

イントラ予測部２１１とインター予測部２１２から出力されたブロックデータは、予測画像選択部２１５に入力される。 The block data output from the intra prediction unit 211 and the inter prediction unit 212 is input to the predicted image selection unit 215.

予測画像選択部２１５は、イントラ予測部２１１とインター予測部２１２から取得したブロックデータのうち、どちらか一方のブロックデータを予測画像として選択する。選択された予測画像は、予測誤差信号生成部２０１に出力される。 The predicted image selection unit 215 selects one of the block data acquired from the intra prediction unit 211 and the inter prediction unit 212 as a predicted image. The selected prediction image is output to the prediction error signal generation unit 201.

なお、図７に示す画像処理装置２０の構成は一例であり、必要に応じて各構成を組み合わせたり、各構成を適宜変更したりしてもよい。 The configuration of the image processing apparatus 20 illustrated in FIG. 7 is an example, and the configurations may be combined or the configurations may be appropriately changed as necessary.

以上、実施例２によれば、画像符号化時に、奥行き情報を得て、得られた奥行き情報に基づいて、高圧縮時には擬似輪郭の抑制又は、低圧縮時には圧縮率の向上を図ることができる。 As described above, according to the second embodiment, depth information is obtained at the time of image coding, and based on the obtained depth information, pseudo contour can be suppressed at the time of high compression or the compression rate can be improved at the time of low compression. .

［実施例３］
実施例３における画像処理装置（画像復号装置）は、実施例２における画像処理装置２０で符号化されたビットストリームを復号する装置である。 [Example 3]
The image processing device (image decoding device) according to the third embodiment is a device that decodes the bitstream encoded by the image processing device 20 according to the second embodiment.

＜構成＞
図８は、実施例３における画像処理装置３０の概略構成の一例を示すブロック図である。図８に示すように、画像処理装置３０は、エントロピー復号部３０１と、逆量子化部３０２と、逆直交変換部３０３と、イントラ予測部３０４と、復号情報記憶部３０５と、インター予測部３０６と、予測画像選択部３０７と、復号画像生成部３０８と、ループフィルタ部３１０と、フレームメモリ３１１とを有する。各部についての概略を以下に説明する。 <Configuration>
FIG. 8 is a block diagram illustrating an example of a schematic configuration of the image processing apparatus 30 according to the third embodiment. As illustrated in FIG. 8, the image processing device 30 includes an entropy decoding unit 301, an inverse quantization unit 302, an inverse orthogonal transform unit 303, an intra prediction unit 304, a decoded information storage unit 305, and an inter prediction unit 306. A predicted image selection unit 307, a decoded image generation unit 308, a loop filter unit 310, and a frame memory 311. An outline of each part will be described below.

エントロピー復号部３０１は、ビットストリームが入力されると、画像処理装置２０のエントロピー符号化に対応するエントロピー復号を行う。エントロピー復号部３０１により復号された予測誤差信号などは逆量子化部３０２に出力される。また、実施例１又は２において利用された量子化マトリクスの修正に関する情報、及びインター予測されている場合の、復号された動きベクトルの差分値などは復号情報記憶部３０５に出力される。 When a bit stream is input, the entropy decoding unit 301 performs entropy decoding corresponding to the entropy encoding of the image processing device 20. The prediction error signal decoded by the entropy decoding unit 301 is output to the inverse quantization unit 302. In addition, information regarding modification of the quantization matrix used in the first or second embodiment, a difference value of a decoded motion vector when inter prediction is performed, and the like are output to the decoded information storage unit 305.

また、エントロピー復号部３０１は、イントラ予測の場合、イントラ予測部３０４にその旨通知する。また、エントロピー復号部３０１は、復号対象画像がインター予測されているか、イントラ予測されているかを予測画像選択部３０７に通知する。 In the case of intra prediction, the entropy decoding unit 301 notifies the intra prediction unit 304 to that effect. In addition, the entropy decoding unit 301 notifies the prediction image selection unit 307 whether the decoding target image is inter predicted or intra predicted.

逆量子化部３０２は、エントロピー復号部３０１からの出力信号に対して、必要に応じて実施例１又は２において利用された量子化マトリクスの修正に関する情報を考慮して、逆量子化処理を行う。逆量子化された出力信号は逆直交変換部３０３に出力される。 The inverse quantization unit 302 performs an inverse quantization process on the output signal from the entropy decoding unit 301 in consideration of information regarding modification of the quantization matrix used in the first or second embodiment as necessary. . The inversely quantized output signal is output to the inverse orthogonal transform unit 303.

逆直交変換部３０３は、逆量子化部３０２からの出力信号の復号ブロックに対して逆直交変換処理を行い、残差信号を生成する。残差信号は復号画像生成部３０８に出力される。 The inverse orthogonal transform unit 303 performs an inverse orthogonal transform process on the decoded block of the output signal from the inverse quantization unit 302 to generate a residual signal. The residual signal is output to the decoded image generation unit 308.

イントラ予測部３０４は、フレームメモリ３１１から取得する復号対象画像の既に復号化された周辺画素から、複数の予測方向を用いて予測画像を生成する。 The intra prediction unit 304 generates a prediction image using a plurality of prediction directions from the peripheral pixels already decoded of the decoding target image acquired from the frame memory 311.

復号情報記憶部３０５は、分割モードなどの復号情報を記憶する。 The decoding information storage unit 305 stores decoding information such as a division mode.

インター予測部３０６は、フレームメモリ３１１から取得した参照画像のデータを復号情報記憶部３０５から動きベクトルの差分値などを取得する。インター予測部３０６は、予測ベクトルを決定し、決定した予測ベクトルと、動きベクトルの差分値とを加算し、動きベクトルを生成する。インター予測部３０６は、生成した動きベクトルを用いて動き補償を行う。これにより、動き補償された参照画像としてのブロックデータが生成される。 The inter prediction unit 306 acquires the reference image data acquired from the frame memory 311 from the decoding information storage unit 305 and the difference value of the motion vector. The inter prediction unit 306 determines a prediction vector, adds the determined prediction vector and the difference value of the motion vector, and generates a motion vector. The inter prediction unit 306 performs motion compensation using the generated motion vector. Thereby, block data as a motion-compensated reference image is generated.

予測画像選択部３０７は、イントラ予測画像、又はインター予測画像のどちらか一方の予測画像を選択する。選択されたブロックデータは、復号画像生成部３０８に出力される。 The predicted image selection unit 307 selects either the intra predicted image or the inter predicted image. The selected block data is output to the decoded image generation unit 308.

復号画像生成部３０８は、予測画像選択部３０７から出力される予測画像と、逆直交変換部３０３から出力される残差信号とを加算し、復号画像を生成する。生成された復号画像はループフィルタ部３１０に出力される。 The decoded image generation unit 308 adds the predicted image output from the predicted image selection unit 307 and the residual signal output from the inverse orthogonal transform unit 303 to generate a decoded image. The generated decoded image is output to the loop filter unit 310.

ループフィルタ部３１０は、復号画像生成部３０８から出力された復号画像に対し、ブロック歪を低減するためのフィルタをかけ、ループフィルタ処理後の復号画像をフレームメモリ３１１に出力する。 The loop filter unit 310 applies a filter for reducing block distortion to the decoded image output from the decoded image generation unit 308, and outputs the decoded image after the loop filter processing to the frame memory 311.

なお、ループフィルタ後の復号画像は表示装置などに出力される。 Note that the decoded image after the loop filter is output to a display device or the like.

フレームメモリ３１１は、参照画像となる復号画像などを記憶する。なお、復号情報記憶部３０５とフレームメモリ３１１は、分けた構成にしているが、同じ記憶部であってもよい。 The frame memory 311 stores a decoded image that serves as a reference image. The decoded information storage unit 305 and the frame memory 311 are configured separately, but they may be the same storage unit.

以上、実施例３によれば、実施例２の符号時に使用された量子化マトリクスに係る修正の情報を用いて、画像処理装置２０で符号化されたビットストリームを、適切に復号することができる。 As described above, according to the third embodiment, it is possible to appropriately decode the bitstream encoded by the image processing device 20 using the correction information related to the quantization matrix used at the time of encoding according to the second embodiment. .

［実施例４］
図９は、実施例４における画像処理装置４０の概略構成の一例を示すブロック図である。図９に示す画像処理装置４０は、上述した実施例１〜３で説明した画像処理をソフトウェアで実装した装置の一例である。 [Example 4]
FIG. 9 is a block diagram illustrating an example of a schematic configuration of the image processing device 40 according to the fourth embodiment. An image processing apparatus 40 illustrated in FIG. 9 is an example of an apparatus in which the image processing described in the first to third embodiments is implemented by software.

図９に示すように、画像処理装置４０は、制御部４０１と、主記憶部４０２と、補助記憶部４０３と、ドライブ装置４０４と、ネットワークＩ／Ｆ部４０６と、入力部４０７と、表示部４０８とを有する。これら各構成は、バスを介して相互にデータ送受信可能に接続されている。 As shown in FIG. 9, the image processing apparatus 40 includes a control unit 401, a main storage unit 402, an auxiliary storage unit 403, a drive device 404, a network I / F unit 406, an input unit 407, and a display unit. 408. These components are connected to each other via a bus so as to be able to transmit and receive data.

制御部４０１は、コンピュータの中で、各装置の制御やデータの演算、加工を行うＣＰＵ（Central Processing Unit）である。また、制御部４０１は、主記憶部４０２又は補助記憶部４０３に記憶された画像処理のプログラムを実行する演算装置である。制御部４０１は、入力部４０７や記憶装置からデータを受け取り、演算、加工した上で、表示部４０８や記憶装置などに出力する。 The control unit 401 is a CPU (Central Processing Unit) that controls each device, calculates data, and processes in a computer. The control unit 401 is an arithmetic device that executes an image processing program stored in the main storage unit 402 or the auxiliary storage unit 403. The control unit 401 receives data from the input unit 407 and the storage device, calculates and processes the data, and then outputs the data to the display unit 408 and the storage device.

また、制御部４０１は、画像処理のプログラムを実行することで、実施例１〜４で説明した処理を実現することができる。 Also, the control unit 401 can realize the processing described in the first to fourth embodiments by executing an image processing program.

主記憶部４０２は、ＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）などである。主記憶部４０２は、制御部４０１が実行する基本ソフトウェアであるＯＳ（Operating System）やアプリケーションソフトウェアなどのプログラムやデータを記憶又は一時保存する記憶装置である。 The main storage unit 402 is a ROM (Read Only Memory), a RAM (Random Access Memory), or the like. The main storage unit 402 is a storage device that stores or temporarily stores programs and data such as OS (Operating System) and application software that are basic software executed by the control unit 401.

補助記憶部４０３は、ＨＤＤ（Hard Disk Drive）などであり、アプリケーションソフトウェアなどに関連するデータを記憶する記憶装置である。 The auxiliary storage unit 403 is an HDD (Hard Disk Drive) or the like, and is a storage device that stores data related to application software or the like.

ドライブ装置４０４は、記録媒体４０５、例えばフレキシブルディスクからプログラムを読み出し、記憶部にインストールする。 The drive device 404 reads the program from the recording medium 405, for example, a flexible disk, and installs it in the storage unit.

また、記録媒体４０５に、所定のプログラムを格納し、この記録媒体４０５に格納されたプログラムはドライブ装置４０４を介して画像処理装置４０にインストールされる。インストールされた所定のプログラムは、画像処理装置４０により実行可能となる。 A predetermined program is stored in the recording medium 405, and the program stored in the recording medium 405 is installed in the image processing apparatus 40 via the drive device 404. The installed predetermined program can be executed by the image processing apparatus 40.

ネットワークＩ／Ｆ部４０６は、有線及び／又は無線回線などのデータ伝送路により構築されたＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）などのネットワークを介して接続された通信機能を有する周辺機器と画像処理装置４０とのインターフェースである。 The network I / F unit 406 is a peripheral having a communication function connected via a network such as a LAN (Local Area Network) or a WAN (Wide Area Network) constructed by a data transmission path such as a wired and / or wireless line. This is an interface between the device and the image processing apparatus 40.

入力部４０７は、カーソルキー、数字入力及び各種機能キー等を備えたキーボード、表示部４０８の表示画面上でキーの選択等を行うためのマウスやスライドパット等を有する。表示部４０８は、ＬＣＤ（Liquid Crystal Display）等により構成され、制御部４０１から入力される表示データに応じた表示が行われる。 The input unit 407 includes a keyboard having cursor keys, numeric input, various function keys, and the like, and a mouse and a slide pad for performing key selection on the display screen of the display unit 408. The display unit 408 is configured by an LCD (Liquid Crystal Display) or the like, and performs display according to display data input from the control unit 401.

なお、図２の画像処理装置１０、図７の画像処理装置２０、及び図８の画像処理装置３０の各部は、例えば制御部４０１及びワークメモリとしての主記憶部４０２により実現されうる。 The units of the image processing apparatus 10 in FIG. 2, the image processing apparatus 20 in FIG. 7, and the image processing apparatus 30 in FIG. 8 can be realized by, for example, the control unit 401 and the main storage unit 402 as a work memory.

また、図７に示す復号画像記憶部２１０は、例えば主記憶部４０２又は補助記憶部４０３により実現され、図７に示す復号画像記憶部２１０以外の構成は、例えば制御部４０１及びワークメモリとしての主記憶部４０２により実現されうる。 Further, the decoded image storage unit 210 illustrated in FIG. 7 is realized by, for example, the main storage unit 402 or the auxiliary storage unit 403, and the configuration other than the decoded image storage unit 210 illustrated in FIG. This can be realized by the main storage unit 402.

また、図８に示す復号情報記憶部３０５及びフレームメモリ３１１は、例えば主記憶部４０２又は補助記憶部４０３により実現されうる。図８に示す復号情報記憶部３０５及びフレームメモリ３１１以外の構成は、例えば制御部４０１及びワークメモリとしての主記憶部４０２により実現されうる。 Further, the decoded information storage unit 305 and the frame memory 311 illustrated in FIG. 8 can be realized by the main storage unit 402 or the auxiliary storage unit 403, for example. A configuration other than the decoded information storage unit 305 and the frame memory 311 illustrated in FIG. 8 can be realized by the control unit 401 and the main storage unit 402 as a work memory, for example.

画像処理装置４０で実行されるプログラムは、実施例１〜３で説明した各部を含むモジュール構成となっている。実際のハードウェアとしては、制御部４０１が補助記憶部４０３からプログラムを読み出して実行することにより上記各部のうち１又は複数の各部が主記憶部４０２上にロードされ、１又は複数の各部が主記憶部４０２上に生成されるようになっている。 The program executed by the image processing apparatus 40 has a module configuration including each unit described in the first to third embodiments. As actual hardware, when the control unit 401 reads out and executes a program from the auxiliary storage unit 403, one or more of the above-described units are loaded onto the main storage unit 402, and one or more of the respective units are main. It is generated on the storage unit 402.

このように、上述した実施例１〜３で説明した画像処理は、コンピュータに実行させるためのプログラムとして実現することができる。このプログラムをサーバ等からインストールしてコンピュータに実行させることで、実施例１〜３で説明した処理を実現することができる。 As described above, the image processing described in the first to third embodiments can be realized as a program for causing a computer to execute the image processing. The processing described in the first to third embodiments can be realized by installing this program from a server or the like and causing the computer to execute the program.

また、このプログラムを記録媒体４０５に記録し、このプログラムが記録された記録媒体４０５をコンピュータや携帯端末などの処理装置に読み取らせて、前述した画像処理を実現させることも可能である。 It is also possible to record the program in the recording medium 405 and cause the processing unit such as a computer or a portable terminal to read the recording medium 405 on which the program is recorded, thereby realizing the above-described image processing.

なお、記録媒体４０５は、ＣＤ−ＲＯＭ、フレキシブルディスク、光磁気ディスク等のように情報を光学的，電気的或いは磁気的に記録する記録媒体、ＲＯＭ、フラッシュメモリ等のように情報を電気的に記録する半導体メモリ等、様々なタイプの記録媒体を用いることができる。 Note that the recording medium 405 is a recording medium that records information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk, and information that is electrically stored such as a ROM or a flash memory. Various types of recording media such as a semiconductor memory for recording can be used.

また、上述した各実施例で説明した画像処理は、１つ又は複数の集積回路に実装され得る。なお、実施例４における画像処理装置４０は、上記の通り、画像処理装置１０、２０、３０の少なくとも１つの装置としての機能を有する。 In addition, the image processing described in each embodiment described above can be implemented in one or a plurality of integrated circuits. Note that the image processing device 40 according to the fourth embodiment has a function as at least one of the image processing devices 10, 20, and 30 as described above.

また、上述した各実施例における画像処理装置は、奥行き情報を用いて被写界深度情報を出力し、または量子化マトリクスを修正する符号化技術に対して適用可能であり、Ｈ．２６４／ＡＶＣやＨ．２６５／ＨＥＶＣだけに限られるものではない。 Further, the image processing apparatus in each of the embodiments described above can be applied to an encoding technique that outputs depth-of-field information using depth information or modifies a quantization matrix. H.264 / AVC and H.264 It is not limited to 265 / HEVC.

以上、各実施例について詳述したが、特定の実施例に限定されるものではなく、特許請求の範囲に記載された範囲内において、上記変形例以外にも種々の変形及び変更が可能である。 Each embodiment has been described in detail above. However, the present invention is not limited to the specific embodiment, and various modifications and changes other than the above-described modification are possible within the scope described in the claims. .

１０、２０、３０、４０画像処理装置
１１画像処理部（１）
１１１ブロック分割情報保存部
１１２予測誤差信号情報量積算部
１１３奥行き情報クラスタリング部
１１４被写界深度判定部
１１５空間周波数取得部
１１６量子化後情報量積算部
１１７圧縮率判定部
１１８被写界深度保存部
１３０１パス・２パス切替指示部
１６０量子化マトリクス修正部 10, 20, 30, 40 Image processing apparatus 11 Image processing unit (1)
111 Block division information storage unit 112 Prediction error signal information amount integration unit 113 Depth information clustering unit 114 Depth of field determination unit 115 Spatial frequency acquisition unit 116 Information amount integration unit after quantization 117 Compression rate determination unit 118 Preservation of depth of field Unit 130 1-pass / 2-pass switching instruction unit 160 quantization matrix correction unit

Claims

An image processing apparatus that adjusts the quantization step of an image using information on the depth of the image,
A spatial frequency acquisition unit that acquires the power of each spatial frequency of the plurality of blocks included in the image;
A depth information clustering unit that clusters information on the depth of each pixel position or each predetermined region in the image, and determines each cluster of the plurality of blocks based on a result of the clustering;
The power of one or more spatial frequencies of the block belonging to a given cluster, based on the power of the spatial frequency of a plurality of blocks belonging to other than the given cluster, the one or more blocks belonging to the given cluster A depth-of-field determination unit that determines to belong to the depth of field;
Based on the determination result of the depth-of-field determination unit and the compression rate of the image, each quantization step of the plurality of frequency components orthogonally transformed with respect to a predetermined block among the plurality of blocks is performed. A quantization matrix correction unit that corrects the value of the quantization matrix including,
An image processing apparatus.

A compression rate determination unit that determines a high compression rate when the compression rate of the image is higher than a predetermined first compression rate;
The depth of field determination unit
The ratio of the power of the spatial frequency exceeding a predetermined spatial frequency among the spatial frequencies of one or more blocks belonging to the predetermined cluster is the predetermined spatial frequency among the spatial frequencies of a plurality of blocks belonging to other than the predetermined cluster. Determining that the one or more blocks belonging to the predetermined cluster belong within a depth of field if the power ratio is higher than a spatial frequency power ratio exceeding
The quantization matrix correction unit, when the compression ratio is a high compression ratio, among the one or more blocks determined to belong to the depth of field, the ratio of the power of the low frequency component of the spatial frequency For a block that is higher than a predetermined percentage, the predetermined region of the quantization matrix is modified to a smaller value.
The image processing apparatus according to claim 1.

The quantization matrix correcting unit applies the quantization to a block other than the one or more blocks determined to belong to the depth of field when the compression rate is lower than a predetermined second compression rate. Modify the value of the quantization matrix to a larger value,
The image processing apparatus according to claim 2.

In the block in which the ratio of the power of the low frequency component of the spatial frequency is higher than a predetermined ratio, a value obtained by dividing the power of the spatial frequency below the first threshold by the power of the spatial frequency exceeding the first threshold is the second. The image processing apparatus according to claim 2, wherein the image processing apparatus is a block exceeding a predetermined threshold value.

An encoding device comprising the image processing device according to any one of claims 1 to 4.

A program for causing a computer to function as the encoding device according to claim 5.