JP2014220624A

JP2014220624A - Image processing apparatus, encoding apparatus, and program

Info

Publication number: JP2014220624A
Application number: JP2013097917A
Authority: JP
Inventors: 市ヶ谷　敦郎; Atsuro Ichigaya; 敦郎市ヶ谷; 境田　慎一; Shinichi Sakaida; 慎一境田
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2013-05-07
Filing date: 2013-05-07
Publication date: 2014-11-20
Anticipated expiration: 2033-05-07
Also published as: JP6159139B2

Abstract

PROBLEM TO BE SOLVED: To provide an image processing apparatus, an encoding apparatus, and a program which improve processing efficiency in selection of proper orthogonal transformation from plural orthogonal transformations.SOLUTION: The image processing apparatus, which selects one of a first orthogonal transformation and one or more other orthogonal transformations and performs processing, comprises: an analyzer which analyzes a first orthogonal transformation coefficient obtained by the first orthogonal transformation of image data, in accordance with a predetermined rule; and a selector which selects one of the first orthogonal transformation and the one or more other orthogonal transformations, on the basis of a result of analysis by the analyzer.

Description

本発明は、画像の符号化における直交変換を適応的に選択する画像処理装置、符号化装置、及びプログラムに関する。 The present invention relates to an image processing apparatus, an encoding apparatus, and a program that adaptively select orthogonal transform in image encoding.

動画などのフレーム画像の画素値を直交変換し、直交変換係数で表すことが行われている。この直交変換係数に対して、必要に応じて量子化及びエントロピー符号化を施すことによって、画像品質をなるべく落とさずに、情報量が圧縮される。特に、この画像符号化技術は、映像符号化の分野で利用されている。 A pixel value of a frame image such as a moving image is orthogonally transformed and represented by an orthogonal transformation coefficient. By performing quantization and entropy coding on the orthogonal transform coefficient as necessary, the amount of information is compressed without reducing image quality as much as possible. In particular, this image encoding technique is used in the field of video encoding.

直交変換を利用する主たる目的は、画像の空間的な相関を排除し、少ない直交変換係数に映像信号の情報（エネルギ−）を集中させることにある。フレーム画像を小領域に分割し、例えば８×８画素のブロックごとに直交変換が施される。この符号化では、ＤＣ成分に近い領域に情報を集約できるＤＣＴ（離散コサイン変換：Discrete Cosine Transform）が最も広く用いられている。 The main purpose of using the orthogonal transform is to eliminate the spatial correlation of images and concentrate the information (energy) of the video signal on a small number of orthogonal transform coefficients. The frame image is divided into small areas, and orthogonal transformation is performed for each block of 8 × 8 pixels, for example. In this encoding, DCT (Discrete Cosine Transform) capable of consolidating information in a region close to a DC component is most widely used.

入力された画像に対する最適な直交変換は、主成分分析によって無相関化された直交基底を設計して実現できる。しかしながら、現実的には入力信号に応じて実時間で直交基底を設計することは困難を伴う。主成分分析を利用した変換として、ＫＬ変換（ＫＬＴ：Karhuen Loeve transform）が知られている。ＫＬ変換は、入力信号に対し最適な変換効率を与える。しかしながら、ハードウェア化が非常に困難であり、現在のところ符号化方式には採用されていない。 Optimal orthogonal transformation for an input image can be realized by designing an orthogonal basis that has been decorrelated by principal component analysis. However, in reality, it is difficult to design an orthogonal basis in real time according to an input signal. As transformation using principal component analysis, KL transformation (KLT: Karhuen Loeve transform) is known. KL conversion provides optimal conversion efficiency for an input signal. However, hardware implementation is very difficult, and currently it is not adopted as an encoding method.

したがって、従来、画像を符号化する際には、事前に知られた入力信号の統計的な性質によって最適とされた直交変換を用いている。このため、画像符号化方式ごとに直交変換が予め定められている。 Therefore, conventionally, when an image is encoded, orthogonal transformation optimized by the statistical properties of an input signal known in advance is used. For this reason, orthogonal transform is predetermined for each image coding method.

画像符号化技術で用いられる代表的な直交変換としては、上述のＤＣＴ以外に、ＤＳＴ（離散サイン変換）、ＤＷＴ（離散ウェーブレット変換）、ＤＦＴ（離散フーリエ変換）、ＤＨＴ（アダマール変換）などが知られている。 Typical orthogonal transforms used in image coding techniques include DST (Discrete Sine Transform), DWT (Discrete Wavelet Transform), DFT (Discrete Fourier Transform), DHT (Hadamard Transform), etc. in addition to the above-mentioned DCT. It has been.

なお、ＤＣＴ及びＤＳＴには、それぞれ、標準的なタイプが８通りずつある。ＤＣＴにおいて、最も一般的なタイプは、ＤＣＴ−ＩＩ（タイプ２）である。ＤＣＴ−ＩＩの逆変換は、ＤＣＴ−ＩＩＩである。ＤＣＴ−ＩとＤＣＴ−ＩＶは、それ自身が逆変換になる。ＤＣＴ及びＤＳＴは、それぞれのタイプによって、異なる性質を持つ。 There are eight standard types of DCT and DST, respectively. In DCT, the most common type is DCT-II (type 2). The inverse transform of DCT-II is DCT-III. DCT-I and DCT-IV themselves are inversely transformed. DCT and DST have different properties depending on their types.

なお、これらの変換は、一般に直交変換と呼ばれている。しかしながら、インプリメントされる際に、整数精度の計算を採用したり、丸め誤差が発生したりするため、厳密な意味での直交性が担保されるわけではない。本明細書では、直交変換の語を用いて説明するが、このように厳密な意味での直交性が担保された変換のみを指す語ではない点に留意すべきである。 Note that these transformations are generally called orthogonal transformations. However, when implemented, integer precision calculation is adopted or rounding error occurs, so the orthogonality in the strict sense is not guaranteed. In the present specification, the term “orthogonal transformation” is used for explanation. However, it should be noted that the term does not mean only a transformation in which orthogonality in a strict sense is ensured.

広く用いられている映像符号化として、ＭＰＥＧが知られている。ＭＰＥＧ−２やＨ．２６４／ＡＶＣ（Advance Video Coding）（例えば、非特許文献１参照）、ＨＥＶＣなど、ほぼ全ての画像符号化技術では、入力信号を周波数領域に変換し、様々な処理を用いて、変換領域における信号容量の圧縮を実現している。 MPEG is known as a widely used video encoding. MPEG-2 and H.264 In almost all image coding techniques such as H.264 / AVC (Advance Video Coding) (for example, see Non-Patent Document 1) and HEVC, an input signal is converted into a frequency domain, and signals in the conversion domain are converted using various processes. Realizes capacity compression.

なお、映像符号化方式のＨＥＶＣにおいては、４×４画素のブロックにおけるイントラ符号化（intra-frame coding）（画面内符号化）では、ＤＳＴ―ＶＩＩ（タイプ７）の直交変換が用いられている。 Note that in the video encoding scheme HEVC, DST-VII (type 7) orthogonal transform is used for intra-frame coding (intra-screen coding) in a block of 4 × 4 pixels. .

したがって、ＨＥＶＣイントラ符号化では、直交変換として、ＤＣＴ−ＩＩとＤＳＴ―ＶＩＩの２つを利用している。一方、インター符号化では、残差信号が、イントラ符号化のような特徴的な傾向がないためＤＣＴ−ＩＩのみを用いた符号化が用いられている。 Therefore, HEVC intra coding uses two DCT-II and DST-VII as orthogonal transforms. On the other hand, in inter coding, since the residual signal does not have a characteristic tendency as in intra coding, coding using only DCT-II is used.

なお、画面内符号化（イントラ符号化）は、画面内の空間的な信号予測を行って画像を符号化する。この技術は、画面内予測（イントラ予測）とも称す。また、画面間予測（インター符号化）は、異なるフレーム間で類似した領域を探索し、予測に用いて画像を符号化する。この技術は、動き補償予測（インター予測）とも称す。 In addition, intra coding (intra coding) encodes an image by performing spatial signal prediction in the screen. This technique is also referred to as intra prediction (intra prediction). Also, inter-screen prediction (inter-coding) searches for similar regions between different frames and encodes an image for prediction. This technique is also referred to as motion compensation prediction (inter prediction).

上述のＨＥＶＣで採用されたＤＣＴ−ＩＩとＤＳＴ−ＶＩＩを選択的に切り換える方式では、事前の解析により外挿を用いたイントラ予測方式による信号の偏りを統計的に解析している。そして、イントラ予測モード毎に何れの直交変換を選択するかが、事前にテーブル上で決定されている（非特許文献３参照）。 In the method of selectively switching between DCT-II and DST-VII adopted in the above-mentioned HEVC, the signal bias by the intra prediction method using extrapolation is statistically analyzed by prior analysis. Then, which orthogonal transform is selected for each intra prediction mode is determined in advance on the table (see Non-Patent Document 3).

また、そのほかの様々な直交変換（ＤＳＴ−ＩＩ、ＤＣＴ−ＩＩＩ、ＤＣＴ−ＩＶ、ＤＣＴ−ＶＩＩなど）を適応的に用いることによって、更なる符号化効率の改善が望めることが報告されている（非特許文献２参照）。 Further, it has been reported that further improvement in coding efficiency can be expected by using various other orthogonal transforms (DST-II, DCT-III, DCT-IV, DCT-VII, etc.) adaptively ( Non-patent document 2).

しかしながら、実際に符号化する入力信号が、統計的に知られている傾向と異なる場合もあるため、予め定められた直交変換が、適切な直交変換ではない場合がある。 However, since the input signal to be actually encoded may differ from the statistically known tendency, the predetermined orthogonal transform may not be an appropriate orthogonal transform.

このような問題を解決するためには、ＲＤ（Rate-Distortion）最適化法などの手法により、例えば、以下に示すように、より適切な直交変換を選択することが可能である。しかしながら、用意されたそれぞれの直交変換を、完全にあるいは簡略化した手法で事前に圧縮信号になるまでの処理を行う必要がある。 In order to solve such a problem, it is possible to select a more appropriate orthogonal transform by a technique such as an RD (Rate-Distortion) optimization method, for example, as shown below. However, it is necessary to perform processing until each prepared orthogonal transform becomes a compressed signal in advance by a completely or simplified method.

図１は、２種類の直交変換を選択する場合の例を示している。入力信号１０２に対して、第１の直交変換部１１２により、第１の直交変換を行う。その後エントロピー符号化部１１４によって、エントロピー符号化を行う。この処理によって圧縮信号１１６が得られる。そして、入力信号１０２に対して、第２の直交変換部１２２は、第２の直交変換を行う。その後エントロピー符号化部１２４が、エントロピー符号化を行う。この処理によって圧縮信号１２６が得られる。 FIG. 1 shows an example in which two types of orthogonal transforms are selected. A first orthogonal transform is performed on the input signal 102 by the first orthogonal transform unit 112. Thereafter, the entropy encoding unit 114 performs entropy encoding. By this processing, a compressed signal 116 is obtained. Then, the second orthogonal transform unit 122 performs second orthogonal transform on the input signal 102. Thereafter, the entropy encoding unit 124 performs entropy encoding. A compressed signal 126 is obtained by this processing.

そして、圧縮信号１１６と圧縮信号１２６は、ＲＤ判定部１３０に入力される。ＲＤ判定部１３０は、情報量（Rate）及び歪み（Distortion）の両方を勘案して最適化する際に用いられるＲＤ最適化法を用いて、何れの直交変換が適切かを判定する。なお、この場合、１種類の直交変換を用いる場合と比してほぼ２倍のリソースを必要とする（非特許文献４参照）。 The compressed signal 116 and the compressed signal 126 are input to the RD determination unit 130. The RD determination unit 130 determines which orthogonal transform is appropriate by using an RD optimization method that is used for optimization in consideration of both information amount (Rate) and distortion (Distortion). In this case, almost twice as many resources are required as compared with the case of using one type of orthogonal transform (see Non-Patent Document 4).

大久保榮監修，「改訂三版Ｈ．２６４／ＡＶＣ教科書」，インプレスＲ＆Ｄ，ｐ１２４，２００９年１月１日Supervised by Satoshi Okubo, “Revised 3rd edition H.264 / AVC textbook”, Impress R & D, p124, January 1, 2009 ＪＣＴＶＣ−Ｅ０２７ＣＥ−７：Summary report of core experiment on alternative transforms）(March 2011)JCTVC-E027 CE-7: Summary report of core experiment on alternative transforms) (March 2011) A. Saxena and F. C. Fernandes, "Mode dependent DCT/DST for intra prediction in block-based image/video coding"18th IEEE International Conference on Image Processing, PP/ 1721-1724, 2011A. Saxena and F. C. Fernandes, "Mode dependent DCT / DST for intra prediction in block-based image / video coding" 18th IEEE International Conference on Image Processing, PP / 1721-1724, 2011 市ヶ谷敦郎，杉藤泰子，境田慎一，"ＤＣＴとＤＳＴを適応的に用いたインター符号化"，映情学技報，vol.36, no.9 ME2012-59,PP.213-218,2012Goro Ichigaya, Yasuko Sugito, Shinichi Sakaida, "Inter-coding using DCT and DST adaptively", Eiji-gakugakuho, vol.36, no.9 ME2012-59, PP.213-218,2012

上述のように、複数の直交変換のうち適切なものを選択する場合に、各々の直交変換を実行することは、処理能力の増大、消費電力の増大を招き、現実的ではない。このため、リアルタイム信号処理における実装を考慮した場合、最適な直交変換を高速に選択する技術が必要とされている。 As described above, when an appropriate one of a plurality of orthogonal transforms is selected, executing each orthogonal transform increases processing performance and power consumption, and is not realistic. For this reason, in consideration of implementation in real-time signal processing, a technique for selecting an optimal orthogonal transform at high speed is required.

そこで、本発明では、複数の直交変換から適切な直交変換を選択する際の処理効率を向上させる画像処理装置、符号化装置、及びプログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide an image processing device, an encoding device, and a program that improve processing efficiency when selecting an appropriate orthogonal transform from a plurality of orthogonal transforms.

本発明の一態様における画像処理装置は、第１の直交変換、及び１以上の他の直交変換のうち、何れか１つを選択して処理を行う画像処理装置であって、画像データを前記第１の直交変換によって変換して得られた第１の直交変換係数を、所定の規則に従って分析する分析部と、前記分析部の分析結果に基づいて、前記第１の直交変換、及び前記１以上の他の直交変換のうち、何れか１つを選択する選択部と、を有する。 An image processing apparatus according to an aspect of the present invention is an image processing apparatus that performs processing by selecting any one of a first orthogonal transform and one or more other orthogonal transforms, An analysis unit that analyzes the first orthogonal transform coefficient obtained by the first orthogonal transform according to a predetermined rule, the first orthogonal transform based on the analysis result of the analysis unit, and the 1 And a selection unit that selects any one of the other orthogonal transforms.

また、前記所定の規則は、前記第１の直交変換係数の水平方向の奇数次と偶数次とのエネルギー比、及び／又は前記第１の直交変換係数の垂直方向の奇数次と偶数次とのエネルギー比を用いてもよい。 Further, the predetermined rule is that an energy ratio between an odd-order and an even-order in the horizontal direction of the first orthogonal transform coefficient and / or an odd-order and an even-order in the vertical direction of the first orthogonal transform coefficient. An energy ratio may be used.

また、前記分析部は、前記１以上の他の直交変換の各々が有する所定の数の複数の基底画像の各々を前記第１の直交変換によって変換して得られた直交変換係数を蓄積した記憶部と、前記第１の直交変換係数と前記記憶部に記憶された前記複数の直交変換係数との類似度を算出する類似度算出部と、を含み、前記選択部は、前記類似度算出部によって得られた類似度のうち、最も高い類似度が、所定の閾値を越える場合、前記最も高い類似度に対応する、前記他の直交変換に、前記画像データを選択的に与えることによって、前記画像データが変換された直交変換係数を、前記画像データの符号化のために用い、前記最も高い類似度が、前記所定の閾値以下の場合、前記第１の直交変換係数を、前記画像データの符号化のために用いてもよい。 In addition, the analysis unit stores the orthogonal transform coefficients obtained by transforming each of a predetermined number of the plurality of base images included in each of the one or more other orthogonal transforms by the first orthogonal transform. A similarity calculation unit that calculates a similarity between the first orthogonal transform coefficient and the plurality of orthogonal transform coefficients stored in the storage unit, and the selection unit includes the similarity calculation unit If the highest similarity among the similarities obtained by the above-mentioned is over a predetermined threshold, the image data is selectively given to the other orthogonal transformation corresponding to the highest similarity, When the orthogonal transform coefficient obtained by transforming the image data is used for encoding the image data, and the highest similarity is equal to or less than the predetermined threshold, the first orthogonal transform coefficient is used as the image data. May be used for encoding

また、上記何れかの画像処理装置を備える符号化装置が提供される。 In addition, an encoding device including any one of the above image processing devices is provided.

また、本発明の他の実施態様におけるプログラムは、コンピュータを上記符号化装置として機能させる。 A program according to another embodiment of the present invention causes a computer to function as the encoding device.

本発明によれば、複数の直交変換から適切な直交変換を選択して、画像処理を行う際の効率を向上させることができる。 According to the present invention, it is possible to improve efficiency when performing image processing by selecting an appropriate orthogonal transform from a plurality of orthogonal transforms.

従来技術における２種類の直交変換を選択する場合の例を示す図である。It is a figure which shows the example in the case of selecting two types of orthogonal transformation in a prior art. ＤＣＴ及びＤＳＴの直交変換係数の例を示すグラフである。It is a graph which shows the example of the orthogonal transformation coefficient of DCT and DST. 実施例１の画像処理装置の構成の例を示すブロック図である。1 is a block diagram illustrating an example of a configuration of an image processing apparatus according to a first embodiment. ＤＣＴ及びＤＳＴの直交変換係数の水平成分の例を示す図である。It is a figure which shows the example of the horizontal component of the orthogonal transformation coefficient of DCT and DST. 実施例１における画像処理の一例を示すフローチャートである。3 is a flowchart illustrating an example of image processing according to the first exemplary embodiment. 実施例１の基本原理の例を説明する図である。2 is a diagram illustrating an example of a basic principle of Embodiment 1. FIG. 実施例２の画像処理装置の構成の例を示すブロック図である。FIG. 10 is a block diagram illustrating an example of a configuration of an image processing apparatus according to a second embodiment. 記憶部に記憶される直交変換係数比較用パターンを説明する図である。It is a figure explaining the pattern for orthogonal transformation coefficient comparison memorize | stored in a memory | storage part. 実施例２における画像処理の一例を示すフローチャートである。10 is a flowchart illustrating an example of image processing in the second embodiment. 実施例３における画像処理装置の概略構成の一例を示すブロック図である。10 is a block diagram illustrating an example of a schematic configuration of an image processing apparatus according to Embodiment 3. FIG. 実施例４における画像処理装置の概略構成の一例を示すブロック図であるFIG. 10 is a block diagram illustrating an example of a schematic configuration of an image processing device according to a fourth embodiment.

まず、ＤＣＴとＤＳＴを例にとって、異なる直交変換の関係の例を示す。ＤＣＴ領域におけるエネルギー分布は、式（１）として求められる。例えば、ＤＣＴ基底に直交する正弦波信号を入力とする場合を想定する、簡単のために入力信号を連続信号とし、正規化係数も省略する。また、一次元の信号を取り上げる。 First, taking DCT and DST as an example, an example of the relationship between different orthogonal transforms is shown. The energy distribution in the DCT region is obtained as Equation (1). For example, assuming that a sine wave signal orthogonal to the DCT base is input, the input signal is a continuous signal for simplicity, and the normalization coefficient is also omitted. In addition, a one-dimensional signal is taken up.

入力信号を式（２）とすると、

If the input signal is expressed by equation (2),

式（１）は、式（３）のように表せる。

Expression (1) can be expressed as Expression (3).

この式（３）を用いて、以下の２つの場合についてＤＣＴを計算する。
ｉ）u_i+1＝uの場合

Using this equation (3), DCT is calculated for the following two cases.
i) When u _i + 1 = u

ｉｉ）u_i+1≠uの場合
ア）u_i+u＝2n−1の場合

ii) When u _i + 1 ≠ u a) When u _i + u = 2n−1

イ）u_i+u＝2nの場合

B) When u _i + u = 2n

したがって、式（４）、式（５）、式（６）より、正弦波信号の次数u_iとＤＣＴ基底の次数uの和が偶数となる次数に、ＤＣＴ係数のエネルギーが広く分布することが分かる。

Therefore, from Equations (4), (5), and (6), the energy of the DCT coefficient is widely distributed in the order in which the sum of the order u _i of the sine wave signal and the order u of the DCT base is an even number. I understand.

図２は、正弦波信号（u_i＝1）を入力とした場合の、ＤＣＴ係数とＤＳＴ係数とを示している。図２（Ａ）は、ＤＣＴ係数を示しており、u_i＝1との和が偶数になるuの係数（u=1,3,5,7）に、絶対値の大きいスペクトルが分布している。なお、ここで、スペクトルとは、直交変換係数のことを意味する。 FIG. 2 shows DCT coefficients and DST coefficients when a sine wave signal (u _i = 1) is input. FIG. 2 (A) shows the DCT coefficient, and a spectrum having a large absolute value is distributed on the coefficient of u (u = 1, 3, 5, 7) that makes the sum of u _i = 1 even. Yes. Here, the spectrum means an orthogonal transform coefficient.

図２（Ｂ）は、ＤＳＴのスペクトルを示している。この場合には、自明ながらu_i＝1だけにスペクトルが現れる。したがって、ＤＳＴ係数の３次以降の係数を除去したとしても、入力信号を再構成できる。 FIG. 2B shows a spectrum of DST. In this case, the spectrum appears only when u _i = 1. Therefore, the input signal can be reconstructed even if the third and subsequent coefficients of the DST coefficient are removed.

図２（Ａ）に戻ると、この場合には、図２（Ｂ）とは異なり、ＤＣＴ係数のスペクトル集中は低く、スペクトルは広く分布する。ＤＣＴは、ＤＳＴと相補的な性質を持つことが分かる。 Returning to FIG. 2A, in this case, unlike FIG. 2B, the spectrum concentration of DCT coefficients is low and the spectrum is widely distributed. It can be seen that DCT has a complementary property to DST.

以上の例からも分かるように、正弦波信号（u_i＝1）に対しては、ＤＳＴのほうが、ＤＣＴよりも、より少ないスペクトル成分に情報を集約でき、符号化効率の観点からは、より適切である。 As can be seen from the above example, for a sine wave signal (u _i = 1), DST can collect information in fewer spectral components than DCT, and from the viewpoint of encoding efficiency, Is appropriate.

［実施例１］
実施例１では、上述のＤＣＴとＤＳＴの２つの直交変換の性質を利用する。そして、何れの直交変換が適切な直交変換であるかを、簡便な方式で特定することができる。 [Example 1]
In the first embodiment, the characteristics of the two orthogonal transforms of DCT and DST described above are used. Then, which orthogonal transformation is an appropriate orthogonal transformation can be specified by a simple method.

＜構成＞
図３は、実施例１における画像処理装置３００の一例を示すブロック図である。図３に示す例では、画像処理装置３００は、第１の直交変換部３２０、分析部３２２、選択部３３０、第２の直交変換部３４０、第１のスイッチ３２８、及び第２のスイッチ３２９を有する。各部についての概略を以下に説明する。 <Configuration>
FIG. 3 is a block diagram illustrating an example of the image processing apparatus 300 according to the first embodiment. In the example illustrated in FIG. 3, the image processing apparatus 300 includes a first orthogonal transform unit 320, an analysis unit 322, a selection unit 330, a second orthogonal transform unit 340, a first switch 328, and a second switch 329. Have. An outline of each part will be described below.

第１の直交変換部３２０は、入力画像３４１を、第１の直交変換により変換する。なお、入力画像３４１は、原画像から予測された画像を差し引いた予測誤差信号であってもよい。変換する単位は、入力画像の一部分である３２×３２、１６×１６、８×８画素などのブロックに分割された単位である。第１の直交変換部３２０は、たとえば、ＤＣＴを実行する機能を有してもよい。 The first orthogonal transform unit 320 transforms the input image 341 by the first orthogonal transform. Note that the input image 341 may be a prediction error signal obtained by subtracting the predicted image from the original image. The unit to be converted is a unit divided into blocks such as 32 × 32, 16 × 16, and 8 × 8 pixels that are a part of the input image. The first orthogonal transform unit 320 may have a function of executing DCT, for example.

分析部３２２は、第１の直交変換部３２０で変換された第１の直交変換係数３２１を受け取る。分析部３２２の具体的処理については、図４を用いて後述する。 The analysis unit 322 receives the first orthogonal transform coefficient 321 transformed by the first orthogonal transform unit 320. Specific processing of the analysis unit 322 will be described later with reference to FIG.

選択部３３０は、分析部３２２の分析結果３２１を受け取る。選択部３３０は、分析結果３２１に基づいて、第１のスイッチ３２８及び第２のスイッチ３２９に、選択指示信号３２７を与える。 The selection unit 330 receives the analysis result 321 of the analysis unit 322. The selection unit 330 provides a selection instruction signal 327 to the first switch 328 and the second switch 329 based on the analysis result 321.

選択指示信号３２７が、第１のスイッチ３２８に対して、入力画像３４１を第２の直交変換部３４０に与える信号である場合、第２の直交変換部３４０が、入力画像３４１に対して直交変換処理を実行する。そして、第２の直交変換部３４０は、第２の直交変換係数３３１を第２のスイッチ３２９に出力する。 When the selection instruction signal 327 is a signal that gives the input image 341 to the second orthogonal transform unit 340 to the first switch 328, the second orthogonal transform unit 340 performs orthogonal transform on the input image 341. Execute the process. Then, the second orthogonal transformation unit 340 outputs the second orthogonal transformation coefficient 331 to the second switch 329.

以上のように、選択指示信号３２７によって、出力３５１には、第１の直交変換係数３２１又は第２の直交変換係数３３１が選択的に出力される。この場合、第１の直交変換係数が出力３５１に出力される場合には、第２の直交変換部３４０の処理が省略され得る。この場合には、リソースの節約がなされる。 As described above, the first orthogonal transform coefficient 321 or the second orthogonal transform coefficient 331 is selectively output to the output 351 by the selection instruction signal 327. In this case, when the first orthogonal transform coefficient is output to the output 351, the processing of the second orthogonal transform unit 340 can be omitted. In this case, resources are saved.

＜分析部３２２の説明＞
次に、分析部３２２について説明する。例示として、第１の直交変換部３２０にＤＣＴ−ＩＩ（以下、単にＤＣＴと称する場合がある）を用い、第２の直交変換部３４０にＤＳＴ−ＩＩ（以下、単にＤＳＴと称する場合がある）を用いた場合を想定する。なお、ＤＳＴは実装上の容易さからタイプ２としたが、他のタイプであってもよい。 <Description of Analysis Unit 322>
Next, the analysis unit 322 will be described. As an example, DCT-II (hereinafter may be simply referred to as DCT) is used for the first orthogonal transform unit 320, and DST-II (hereinafter simply referred to as DST) is used for the second orthogonal transform unit 340. The case where is used is assumed. Although DST is type 2 for ease of mounting, other types may be used.

ＤＣＴとＤＳＴには、上述の式（４）、式（５）、式（６）の関係が存在する。すなわち、ＤＳＴの基底画像の１つをＤＣＴで変換した直交変換係数は、偶数次、又は奇数次の一方だけにパワースペクトル生じ、他方のパワースペクトルが０になる。同様にＤＣＴの基底画像の１つをＤＳＴで変換した直交変換係数も、偶数次、又は奇数次の一方だけにスペクトルが生じ、他方のスペクトルが０になる。 DCT and DST have the relationship of the above-mentioned formula (4), formula (5), and formula (6). That is, an orthogonal transform coefficient obtained by transforming one of the DST base images by DCT has a power spectrum only in one of the even order and the odd order, and the other power spectrum becomes zero. Similarly, an orthogonal transform coefficient obtained by transforming one of the DCT base images with the DST has a spectrum only in one of the even order and the odd order, and the other spectrum becomes zero.

図４は、ＤＳＴとＤＣＴとの関係の例を示している。図４は、二次元の直交変換係数の水平方向の一列を取り出した図である。図４（Ａ）と図４（Ｂ）は、それぞれ同じ画像をＤＣＴとＤＳＴで変換した場合のスペクトルの水平成分の１つを示した図である。色の濃さは、スペクトルの絶対値の大きさを表している。 FIG. 4 shows an example of the relationship between DST and DCT. FIG. 4 is a diagram showing a horizontal row of two-dimensional orthogonal transform coefficients. 4A and 4B are diagrams showing one of the horizontal components of the spectrum when the same image is converted by DCT and DST, respectively. The color intensity represents the magnitude of the absolute value of the spectrum.

図４（Ｃ）と図４（Ｄ）は、他の条件で、それぞれ同じ画像をＤＣＴとＤＳＴで変換した場合のスペクトルの水平成分の１つを示した図である。 FIG. 4C and FIG. 4D are diagrams showing one of the horizontal components of the spectrum when the same image is converted by DCT and DST, respectively, under other conditions.

信号の圧縮符号化方式では、直交変換において一般に低域にスペクトルが集中することを想定している。したがって、図４（Ａ）又は図４（Ｃ）のような、スペクトルの拡散が発生する場合は、対象画像に対して、その直交変換（すなわちＤＣＴ）が適切に機能しないことを意味する。表現を変えて説明すれば、この場合には、ＤＣＴを利用しても、スペクトルが拡散しているために、その後の情報圧縮が十分に望めないことを意味する。 In the signal compression coding system, it is assumed that the spectrum is generally concentrated in a low band in the orthogonal transform. Therefore, when spectrum spread occurs as shown in FIG. 4A or FIG. 4C, it means that the orthogonal transform (that is, DCT) does not function properly for the target image. In other words, in this case, even if DCT is used, it means that the subsequent information compression cannot be sufficiently expected because the spectrum is spread.

これに対して、図４（Ｂ）又は図４（Ｄ）のようなＤＳＴによってスペクトル集中が得られる画像の場合には、ＤＳＴを用いることが適切であることを示している。 On the other hand, in the case of an image in which spectrum concentration is obtained by DST as shown in FIG. 4B or FIG. 4D, it is shown that it is appropriate to use DST.

このような特徴から、ＤＣＴを用いた第１の直交変換部３２０において、図４（Ａ）又は図４（Ｃ）に類似したスペクトルパターンが観察された場合には、ＤＳＴによる第２の直交変換部３４０を利用した方が、ＤＣＴよりもスペクトルの集中が望めることが容易に予想できる。このことは、ＤＣＴのスペクトルの特徴を調べることによって、ＤＳＴを試すことなく、ＤＣＴ又はＤＳＴの何れの直交変換が適切かを判定することができることを意味する。 Due to these characteristics, when a spectrum pattern similar to FIG. 4A or 4C is observed in the first orthogonal transform unit 320 using DCT, the second orthogonal transform by DST is performed. It can be easily predicted that the concentration of the spectrum can be expected by using the unit 340 rather than the DCT. This means that by examining the characteristics of the DCT spectrum, it is possible to determine which DCT or DST orthogonal transform is appropriate without trying DST.

＜動作＞
次に、実施例１における画像処理装置３００の動作について説明する。図５は、実施例１における画像処理の一例を示すフローチャートである。 <Operation>
Next, the operation of the image processing apparatus 300 according to the first embodiment will be described. FIG. 5 is a flowchart illustrating an example of image processing according to the first embodiment.

ステップＳ５０２で、第１の直交変換部３２０は、入力画像を第１の直交変換であるＤＣＴ−ＩＩによって変換し、第１の直交変換係数３２１を得る。 In step S502, the first orthogonal transform unit 320 transforms the input image by DCT-II, which is the first orthogonal transform, and obtains a first orthogonal transform coefficient 321.

ステップＳ５０４で、分析部３２２は、水平方向のパワースペクトル（又はスペクトルの絶対値）を、偶数次、及び奇数次毎に積算してもよい。 In step S504, the analysis unit 322 may integrate the horizontal power spectrum (or the absolute value of the spectrum) for each even-order and odd-order.

ステップＳ５０６で、分析部３２２は、ステップＳ５０４で得られた、水平方向の偶数次の積算と、水平方向の奇数次の積算との比を計算する。この場合の比の計算では、大きい値を分母に、小さい値を分子として計算し、比が、常に１以下の値として得られるようにする。 In step S506, the analysis unit 322 calculates the ratio between the even-order integration in the horizontal direction and the odd-order integration in the horizontal direction obtained in step S504. In the calculation of the ratio in this case, the large value is calculated as the denominator and the small value is calculated as the numerator so that the ratio is always obtained as a value of 1 or less.

図６を用いて、ステップ４の計算の例を説明する。図６は、８×８画素のブロックをＤＣＴ−ＩＩで直交変換した直交変換係数から得たパワースペクトル６００を示している。説明を分かりやすくするために、パワースペクトル６００は、列１〜列８、及び行Ａ〜行Ｈの符号を用いて、各パワースペクトルの値を表現する。例えば、列１行Ａの要素Ａ１は、ＤＣＴ−ＩＩのＤＣ成分のパワースペクトル値である。他の要素についても、それぞれの周波数領域のパワースペクトル値を表している。 An example of the calculation in step 4 will be described with reference to FIG. FIG. 6 shows a power spectrum 600 obtained from orthogonal transform coefficients obtained by orthogonally transforming an 8 × 8 pixel block by DCT-II. For ease of explanation, the power spectrum 600 expresses the value of each power spectrum using the codes of columns 1 to 8 and rows A to H. For example, element A1 in column 1 row A is the power spectrum value of the DC component of DCT-II. The other elements also represent the power spectrum values in the respective frequency regions.

上述のステップＳ５０４で、水平方向の偶数次のパワースペクトルの合計値をH_evenとすれば、H_evenは、具体的に以下のように計算される。 If the total value of the even-order power spectra in the horizontal direction is set to H _even in step S504, H _even is specifically calculated as follows.

H_even=A2+A4+A6+A8+B2+……..+H6+H8
また、ステップＳ５０４で、水平方向の奇数次のパワースペクトルの合計値をH_oddとすれば、H_oddは、具体的に以下のように計算される。 H _even = A2 + A4 + A6 + A8 + B2 + …… .. + H6 + H8
Further, in step S504, if the total value of the odd-order power spectra in the horizontal direction is H _odd , H _odd is specifically calculated as follows.

H_odd=A1+A3+A5+A7+B1+……..+H5+H7
そして、H_evenとH_oddのうち、大きい値を分母として、比H_rateを以下のように計算する。 H _odd = A1 + A3 + A5 + A7 + B1 + …… .. + H5 + H7
Then, the ratio H _rate is calculated as follows using the larger value of H _even and H _odd as the denominator.

H_rate＝H_even／H_odd (H_even<H_oddの場合)
図５に戻る。 H _rate = H _even / H _odd (when H _even <H _odd )
Returning to FIG.

ステップＳ５０８で、分析部３２２は、垂直方向のパワースペクトル（又はスペクトルの絶対値）を、偶数次、及び奇数次毎に積算する。 In step S508, the analysis unit 322 integrates the power spectrum in the vertical direction (or the absolute value of the spectrum) for each even-order and odd-order.

ステップＳ５１０で、分析部３２２は、ステップＳ５０８で得られた、垂直方向の偶数次の積算と、垂直方向の奇数次の積算との比を計算する。この場合の比の計算では、大きい値を分母に、小さい値を分子として計算し、比が、常に１以下の値として得られるようにする。 In step S510, the analysis unit 322 calculates the ratio of the even-order integration in the vertical direction and the odd-order integration in the vertical direction obtained in step S508. In the calculation of the ratio in this case, the large value is calculated as the denominator and the small value is calculated as the numerator so that the ratio is always obtained as a value of 1 or less.

ステップＳ５０８及びステップＳ５１０を、図６を用いて同様に計算すると、以下のようになる。 If step S508 and step S510 are similarly calculated using FIG. 6, it will be as follows.

垂直方向の偶数次のパワースペクトルの合計値をV_evenとすれば、V_evenは、具体的に以下のように計算される。 If the total value of the even-order power spectra in the vertical direction is V _even , V _even is specifically calculated as follows.

V_even=B1+D1+F1+H1+B2+……..+F8+H8
また、ステップＳ５０８で、垂直方向の奇数次のパワースペクトルの合計値をV_oddとすれば、V_oddは、具体的に以下のように計算される。 V _even = B1 + D1 + F1 + H1 + B2 + …… .. + F8 + H8
Further, in step S508, if the total value of the odd-order power spectra in the vertical direction is V _odd , V _odd is specifically calculated as follows.

V_odd=A1+C1+E1+G1+A2+……..+E8+G8
そして、V_evenとV_oddのうち、大きい値を分母として、比V_rateを以下のように計算する。 V _odd = A1 + C1 + E1 + G1 + A2 + …… .. + E8 + G8
Then, the ratio V _rate is calculated as follows using the larger value of V _even and V _odd as the denominator.

V_rate＝V_even／V_odd (H_even<H_oddの場合)
図５に戻る。 V _rate = V _even / V _odd (when H _even <H _odd )
Returning to FIG.

ステップＳ５１２において、水平方向の比H_rateと垂直方向の比V_rateの平均値AVE_rateを計算してもよい。比の平均値AVE_rateは、以下のように計算できる。 In step S512, an average value AVE _rate of the horizontal ratio H _rate and the vertical ratio V _rate may be calculated. The average ratio AVE _rate can be calculated as follows.

AVE_rate=（H_rate＋V_rate）／2
ステップＳ５１４で、選択部３３０は、比の平均値AVE_rateを所定の閾値と比較する。例えば、閾値として、０．５を採用した場合は、以下の判断を行ってもよい。 AVE _rate = (H _rate + V _rate ) / 2
In step S514, the selection unit 330 compares the average ratio value AVE _rate with a predetermined threshold value. For example, when 0.5 is adopted as the threshold value, the following determination may be made.

AVE_rate＞0.5
AVE_rateは、大きい値であるほど、第１の直交変換のパワースペクトルの偶数次と奇数次の差が小さいことを意味する。この場合、ＤＣＴの方がＤＳＴよりもより適切な直交変換である可能性が高くなることを意味する。 AVE _rate > 0.5
The larger the AVE _rate , the smaller the difference between the even and odd orders of the power spectrum of the first orthogonal transform. In this case, DCT is more likely to be more appropriate orthogonal transform than DST.

したがって、AVE_rate＞0.5が成り立てば、ＤＣＴを採用することが適切であると判断してもよい。そして、AVE_rate＞0.5が成り立たなければ、ＤＳＴを採用することが適切であると判断してもよい。 Therefore, if AVE _rate > 0.5 holds, it may be determined that DCT is appropriate. If AVE _rate > 0.5 does not hold, it may be determined that DST is appropriate.

なお、ここで採用した閾値0.5は、例示に過ぎない。他の値であってもよい。また、上記の例では水平方向の比H_rateと素直垂直方向の比V_rateの両方を用いたが、これはいずれか一方を用いることでも構わない。
また、本実施例では簡単のために二次元直交変換を例に示すが、水平、垂直で異なる一次元直交変換を適用する符号化の場合はH_rateとV_rateそれぞれが閾値以上であるかによってそれぞれの直交変換を決定してもよい。 Note that the threshold value 0.5 adopted here is merely an example. Other values may be used. In the above example, both the horizontal ratio H _rate and the straight vertical ratio V _rate are used, but either one of them may be used.
Although in the example the two-dimensional orthogonal transformation for the sake of simplicity, in this embodiment, the horizontal, in the case of coding the application of different one-dimensional orthogonal transform in the vertical depending respectively H _rate and V _rate is equal to or greater than a threshold value Each orthogonal transform may be determined.

ステップＳ５１４で、判断結果が「はい」であれば、ステップＳ５１６に移る。判断結果が「いいえ」であれば、ステップＳ５１８に移る。 If the determination result is “Yes” in step S514, the process proceeds to step S516. If the determination result is “No”, the process proceeds to step S518.

ステップＳ５１６で、選択部３３０は、第１の直交変換部３２０のＤＣＴによって得られた直交変換係数を出力３５１に出力する。 In step S516, the selection unit 330 outputs the orthogonal transform coefficient obtained by the DCT of the first orthogonal transform unit 320 to the output 351.

ステップＳ５１８で、選択部３３０は、第２の直交変換部３４０に、入力画像３４１を与える。そして、第２の直交変換部３４０がＤＳＴを用いて入力画像３４１を変換する。 In step S518, the selection unit 330 provides the input image 341 to the second orthogonal transform unit 340. Then, the second orthogonal transform unit 340 transforms the input image 341 using DST.

ステップＳ５２０で、選択部３３０は、第２の直交変換部３４０で得られた直交変換係数を出力３５１に出力する。 In step S520, the selection unit 330 outputs the orthogonal transform coefficient obtained by the second orthogonal transform unit 340 to the output 351.

以上、実施例１によれば、計算量を抑えつつ、ＤＣＴとＤＳＴの何れの直交変換を行うことが好ましいかを判断できる。そして、計算量を抑えつつ、ＤＣＴとＤＳＴのうち、適切な直交変換を利用して、直交変換係数を取得することができる。ＤＣＴが適切であると判断された場合には、ＤＳＴの計算を省略することができる。 As described above, according to the first embodiment, it is possible to determine which orthogonal transform of DCT or DST is preferable while reducing the amount of calculation. Then, while reducing the amount of calculation, an orthogonal transform coefficient can be acquired by using an appropriate orthogonal transform between DCT and DST. If it is determined that DCT is appropriate, the calculation of DST can be omitted.

［実施例２］
実施例２は、実施例１を、より一般化した例である。 [Example 2]
The second embodiment is an example in which the first embodiment is more generalized.

＜構成＞
図７は、実施例２の画像処理装置７００の構成のブロック図を示している。図３と同様の機能を有する構成要素については、同じ符号が付されている。重複を避けるため、同じ符号に係る構成要素の説明は省略する。 <Configuration>
FIG. 7 is a block diagram illustrating a configuration of the image processing apparatus 700 according to the second embodiment. Constituent elements having the same functions as those in FIG. In order to avoid duplication, description of the constituent elements related to the same reference numerals is omitted.

図７において、分析部３２２は、類似度算出部７２４と、記憶部７２６とを含んでもよい。また、図７の例では、第１の直交変換部３２０及び第２の直交変換部３４０に加えて、第３の直交変換部７４３を有している。このため、第３の直交変換部７４３が選択できるよう、第１のスイッチ３２８が変形されている。 In FIG. 7, the analysis unit 322 may include a similarity calculation unit 724 and a storage unit 726. In addition, in the example of FIG. 7, in addition to the first orthogonal transform unit 320 and the second orthogonal transform unit 340, a third orthogonal transform unit 743 is provided. For this reason, the first switch 328 is modified so that the third orthogonal transform unit 743 can be selected.

第１の直交変換部３２０、第２の直交変換部３４０、及び第３の直交変換部７４３は、それぞれ、例えばＤＣＴ−ＩＩ、ＤＳＴ−ＩＩ、及びＤＳＴ−ＶＩＩを採用していると仮定して以下の説明を行う。なお、実施例２は、これに限られるものではない。したがって、４個以上の直交変換部が存在していてもよい。 Assume that the first orthogonal transform unit 320, the second orthogonal transform unit 340, and the third orthogonal transform unit 743 employ, for example, DCT-II, DST-II, and DST-VII, respectively. The following description will be given. In addition, Example 2 is not restricted to this. Therefore, four or more orthogonal transform units may exist.

図８は、記憶部に記憶される直交変換係数比較用パターンを説明する図である。図８を用いて、実施例２の基本原理を説明する。 FIG. 8 is a diagram for explaining orthogonal transform coefficient comparison patterns stored in the storage unit. The basic principle of Example 2 is demonstrated using FIG.

図８の第２の直交変換の基底画像８２０を取り上げて、記憶部に記憶される複数の直交変換係数比較用パターンを説明する。基底画像８２０は、合計６４個の基底画像を持ち、それぞれの基底画像は、８×８画素とする。 Taking the base image 820 of the second orthogonal transform in FIG. 8, a plurality of orthogonal transform coefficient comparison patterns stored in the storage unit will be described. The base image 820 has a total of 64 base images, and each base image has 8 × 8 pixels.

この例では、第２の直交変換（ＤＳＴ−ＩＩ）の４つの低域成分の基底画像を用いる。まず、基底画像８２１に対して第１の直交変換部３２０により、第１の直交変換（ＤＣＴ−ＩＩ）を施し、直交変換係数８７２を取得する。この直交変換係数８７２は、記憶部７２６に記憶されてもよい。同様に、他の３つの基底画像８２２、８２３、８２４についても、第１の直交変換を施して、それぞれ、直交変換係数８７１、８７４、８７３を取得し、記憶部７２６に記憶される。 In this example, four low-frequency component base images of the second orthogonal transform (DST-II) are used. First, a first orthogonal transform (DCT-II) is performed on the base image 821 by the first orthogonal transform unit 320 to obtain an orthogonal transform coefficient 872. The orthogonal transform coefficient 872 may be stored in the storage unit 726. Similarly, the other three base images 822, 823, and 824 are also subjected to the first orthogonal transform to obtain orthogonal transform coefficients 871, 874, and 873, and are stored in the storage unit 726.

この操作によって得られた、直交変換係数８７１〜８７４は、直交変換係数比較用パターンとして用いることができる。 The orthogonal transform coefficients 871 to 874 obtained by this operation can be used as an orthogonal transform coefficient comparison pattern.

同様に、第３の直交変換の基底の低域成分の基底画像８３１〜８３４にも、第１の直交変換が施される。その結果、直交変換係数８８１〜８８４が得られ、記憶部７２６に記憶される。直交変換係数８８１〜８８４は、直交変換係数比較用パターンとして用いることができる。 Similarly, the low-frequency component base images 831 to 834 of the base of the third orthogonal transform are also subjected to the first orthogonal transform. As a result, orthogonal transform coefficients 881 to 884 are obtained and stored in the storage unit 726. The orthogonal transform coefficients 881 to 884 can be used as orthogonal transform coefficient comparison patterns.

以上の処理は、実際の画像処理を行う前に、事前に実行されているものとする。したがって、記憶部７２６には、複数の直交変換係数比較用パターンが記憶されているものとする。 It is assumed that the above processing is performed in advance before performing actual image processing. Therefore, it is assumed that the storage unit 726 stores a plurality of orthogonal transform coefficient comparison patterns.

次に、直交変換係数比較用パターンの性質について説明する。例えば、第１の直交変換によって、入力画像が変換され、直交変換係数８９０が得られたとする。この直交変換係数８９０が、記憶部７２６に記憶された直交変換係数比較用パターン８７１と極めて類似したパターンである場合を想定する。類似度の計算には、相関値の計算を利用してもよい。 Next, the properties of the orthogonal transform coefficient comparison pattern will be described. For example, it is assumed that the input image is transformed by the first orthogonal transformation and the orthogonal transformation coefficient 890 is obtained. Assume that the orthogonal transform coefficient 890 is a pattern that is very similar to the orthogonal transform coefficient comparison pattern 871 stored in the storage unit 726. For calculating the similarity, a correlation value may be calculated.

この場合には、入力画像３４１を第２の直交変換部３４０（ＤＳＴ−ＩＩ）で変換した場合には、基底画像８２２に対応する直交変換係数の絶対値が高いスペクトルパターンが得られることとなる。この場合、第２の直交変換は、第１の直交変換よりも、スペクトルを集約できる可能性が高いことを示している。したがって、この場合には、第１の直交変換よりも第２の直交変換を用いることが望ましい。これによって、圧縮効率の向上が期待できる。 In this case, when the input image 341 is converted by the second orthogonal transform unit 340 (DST-II), a spectrum pattern having a high absolute value of the orthogonal transform coefficient corresponding to the base image 822 is obtained. . In this case, it is shown that the second orthogonal transform has a higher possibility of collecting the spectrum than the first orthogonal transform. Therefore, in this case, it is desirable to use the second orthogonal transform rather than the first orthogonal transform. Thereby, improvement in compression efficiency can be expected.

なお、この類似度の比較は、たとえば、直交変換係数比較用パターンと、実際の画像を第１の直交変換部３２０で得られた直交変換係数との相関値を計算してもよい。そして計算された相関値のうち一番相関が高い直交変換係数比較用パターンを探索してもよい。探索された直交変換係数比較用パターンに対応する相関値と所定の閾値を比較した結果を基に、第１の直交変換を用いるか、探索された直交変換比較用パターンに対応する直交変換を用いるか否かを決定してもよい。すなわち、一番高い相関値が所定の閾値を超える場合には、探索された直交変換比較用パターンに対応する直交変換を用いる。その相関値が所定の閾値以下の場合には、第１の直交変換を用いる。 Note that this similarity comparison may be performed, for example, by calculating a correlation value between the orthogonal transform coefficient comparison pattern and the orthogonal transform coefficient obtained from the actual image by the first orthogonal transform unit 320. Then, an orthogonal transform coefficient comparison pattern having the highest correlation among the calculated correlation values may be searched. Based on the result of comparing the correlation value corresponding to the searched orthogonal transformation coefficient comparison pattern with a predetermined threshold, the first orthogonal transformation is used, or the orthogonal transformation corresponding to the searched orthogonal transformation comparison pattern is used. It may be determined whether or not. That is, when the highest correlation value exceeds a predetermined threshold, the orthogonal transformation corresponding to the searched orthogonal transformation comparison pattern is used. If the correlation value is less than or equal to a predetermined threshold, the first orthogonal transform is used.

また類似度の計算は、既存のＳＡＤ（Sum of absolute difference）、ＳＳＤ（Sum of Squared difference）などの手法の何れを用いても良い。 The similarity may be calculated using any of existing methods such as SAD (Sum of absolute difference) and SSD (Sum of Squared difference).

なお、記憶部７２６にどの程度の数の直交変換係数比較用パターンを記憶させておくかは、用意する直交変換の数、類似度計算に要するリソースの量等を勘案して決定すればよい。 The number of orthogonal transform coefficient comparison patterns stored in the storage unit 726 may be determined in consideration of the number of orthogonal transforms to be prepared, the amount of resources required for similarity calculation, and the like.

また、図７では、第２の直交変換の基底として、低域成分の基底を選択した。これは、低域に、パワースペクトルが集中するように直交変換が設計されている場合が多いからである。したがって、直交変換の性質に応じて、どの基底を基にして、直交変換係数比較用パターンを作成するかは、採用する直交変換の性質又は設計理念を勘案することが望ましい。 In FIG. 7, the base of the low frequency component is selected as the base of the second orthogonal transformation. This is because the orthogonal transform is often designed so that the power spectrum is concentrated in the low band. Therefore, it is desirable to consider the nature or design philosophy of the orthogonal transformation to be used as the basis for creating the orthogonal transformation coefficient comparison pattern based on the nature of the orthogonal transformation.

また、実施例２では、第１の直交変換部３２０を使用して、１つの基底画像から１つの直交変換係数比較用パターンを作成した。しかしながら、実施例２は、これに限られるものではなく、複数の基底画像を合成した画像を作り、第１の直交変換部３２０を使用して、１つの直交変換係数比較用パターンを作成してもよい。 In the second embodiment, the first orthogonal transform unit 320 is used to create one orthogonal transform coefficient comparison pattern from one base image. However, the second embodiment is not limited to this. An image obtained by combining a plurality of base images is created, and one orthogonal transform coefficient comparison pattern is created using the first orthogonal transform unit 320. Also good.

＜動作＞
図９は、実施例２における画像処理の一例を示すフローチャートを示している。以下に、実施例２における画像処理装置の動作について説明する。 <Operation>
FIG. 9 is a flowchart illustrating an example of image processing according to the second embodiment. The operation of the image processing apparatus according to the second embodiment will be described below.

ステップＳ９０２で、第１の直交変換部３２０は、入力画像３４１に対して、第１の直交変換（ＤＣＴ−ＩＩ）を実行し、第１の直交変換係数３２１を取得する。 In step S <b> 902, the first orthogonal transform unit 320 performs a first orthogonal transform (DCT-II) on the input image 341 and obtains a first orthogonal transform coefficient 321.

ステップＳ９０４で、類似度算出部７２４は、記憶部７２６から、直交変換比較用パターンのうちから１つを取り出す。 In step S904, the similarity calculation unit 724 extracts one of the orthogonal transform comparison patterns from the storage unit 726.

ステップＳ９０６で、類似度算出部７２４は、第１の直交変換係数３２１と読み出された直交変換比較用パターンとの類似度（例えば相関値）を計算する。 In step S906, the similarity calculation unit 724 calculates the similarity (eg, correlation value) between the first orthogonal transform coefficient 321 and the read orthogonal transform comparison pattern.

ステップＳ９０８で、記憶部７２６に記憶されたパターンの数だけ、ステップＳ９０４及びステップＳ９０６を繰り返す。 In step S908, steps S904 and S906 are repeated for the number of patterns stored in the storage unit 726.

ステップＳ９１０で、選択部３３０は、最大の類似度（例えば相関値）を持つ直交変換比較用パターンを特定する。 In step S910, the selection unit 330 specifies an orthogonal transform comparison pattern having the maximum similarity (for example, correlation value).

ステップＳ９１２で、選択部３３０は、最大の類似度が、所定の閾値を越えるか否かを判断する。その判断が「いいえ」であれば、ステップＳ９１４に移る。その判断が「はい」であれば、ステップＳ９１６に移る。 In step S912, the selection unit 330 determines whether the maximum similarity exceeds a predetermined threshold. If the determination is “No”, the process moves to step S914. If the determination is “Yes”, the process proceeds to step S916.

ステップＳ９１４で、選択部は、第１の直交変換部で得られた直交変換係数を出力３５１に出力する。 In step S914, the selection unit outputs the orthogonal transformation coefficient obtained by the first orthogonal transformation unit to the output 351.

ステップＳ９１６で、選択部は、選択された直交変換比較用パターンに対応する直交変換部（第２の直交変換部３４０、又は第３の直交変換部７４３のいずれか）を特定する。 In step S916, the selection unit specifies an orthogonal transformation unit (either the second orthogonal transformation unit 340 or the third orthogonal transformation unit 743) corresponding to the selected orthogonal transformation comparison pattern.

選択部３３０は、第１のスイッチ３２８及び第２のスイッチ３２９を操作して、入力画像３４１を、特定された直交変換部に与える。特定された直交変換部は、入力画像３４１に特定された直交変換部の直交変換を施し、直交変換係数３３１を取得する。選択部３３０は、第２のスイッチ３２９に指示し、取得された直交変換係数３３１を出力３５１に出力させる。 The selection unit 330 operates the first switch 328 and the second switch 329 to provide the input image 341 to the specified orthogonal transform unit. The identified orthogonal transform unit performs the orthogonal transform of the identified orthogonal transform unit on the input image 341, and acquires the orthogonal transform coefficient 331. The selection unit 330 instructs the second switch 329 to output the obtained orthogonal transform coefficient 331 to the output 351.

以上、実施例２によって、計算量を抑えつつ、複数の直交変換をのうちから適切な直交変換により、直交変換係数を得ることができる。利用候補とされる直交変換が３つ以上であっても、最大で２つの直交変換を実行するだけでよい。第１の直交変換が適切と判断された場合には、直交変換は、１回だけ実行すればよい。 As described above, according to the second embodiment, it is possible to obtain an orthogonal transform coefficient by performing an appropriate orthogonal transform among a plurality of orthogonal transforms while suppressing a calculation amount. Even if there are three or more orthogonal transforms that are candidates for use, it is only necessary to perform two orthogonal transforms at maximum. If it is determined that the first orthogonal transformation is appropriate, the orthogonal transformation only needs to be executed once.

［実施例３］
実施例３では、実施例１における画像処理装置３００，又は実施例２における画像処理装置７００を直交変換部２０２に含む画像処理装置（画像符号化装置）について説明する。 [Example 3]
In the third embodiment, an image processing device (image encoding device) including the image processing device 300 in the first embodiment or the image processing device 700 in the second embodiment in the orthogonal transform unit 202 will be described.

＜構成＞
図１０は、実施例３における画像処理装置２０の概略構成の一例を示すブロック図である。図１０に示す例では、画像処理装置２０は、前処理部２００と、予測誤差信号生成部２０１と、直交変換部２０２と、量子化部２０３と、エントロピー符号化部２０４と、逆量子化部２０５と、逆直交変換部２０６と、復号画像生成部２０７と、ループフィルタ部２０９と、復号画像記憶部２１０と、イントラ予測部２１１と、インター予測部２１２と、動きベクトル計算部２１３と、予測画像選択部２１５とを有する。各部についての概略を以下に説明する。 <Configuration>
FIG. 10 is a block diagram illustrating an example of a schematic configuration of the image processing apparatus 20 according to the third embodiment. In the example illustrated in FIG. 10, the image processing apparatus 20 includes a preprocessing unit 200, a prediction error signal generation unit 201, an orthogonal transformation unit 202, a quantization unit 203, an entropy encoding unit 204, and an inverse quantization unit. 205, an inverse orthogonal transform unit 206, a decoded image generation unit 207, a loop filter unit 209, a decoded image storage unit 210, an intra prediction unit 211, an inter prediction unit 212, a motion vector calculation unit 213, and a prediction And an image selection unit 215. An outline of each part will be described below.

前処理部２００は、ピクチャタイプに合わせてピクチャを並べ替え、ピクチャタイプ及びフレーム毎のフレーム画像等を順次出力する。また、前処理部２００は、ブロック分割なども行い、ブロック分割の境界情報をループフィルタ部２０９に出力してもよい。 The preprocessing unit 200 rearranges the pictures according to the picture type, and sequentially outputs the picture type and the frame image for each frame. The preprocessing unit 200 may also perform block division and the like, and output block division boundary information to the loop filter unit 209.

予測誤差信号生成部２０１は、入力された動画像データの符号化対象画像が、例えば３２×３２、１６×１６、８×８画素などのブロックに分割されたブロックデータを取得する。 The prediction error signal generation unit 201 obtains block data obtained by dividing the encoding target image of the input moving image data into blocks such as 32 × 32, 16 × 16, and 8 × 8 pixels, for example.

予測誤差信号生成部２０１は、そのブロックデータと、予測画像選択部２１５から出力される予測画像のブロックデータとにより、予測誤差信号を生成する。予測誤差信号生成部２０１は、生成された予測誤差信号を直交変換部２０２に出力する。 The prediction error signal generation unit 201 generates a prediction error signal based on the block data and the block data of the prediction image output from the prediction image selection unit 215. The prediction error signal generation unit 201 outputs the generated prediction error signal to the orthogonal transformation unit 202.

直交変換部２０２は、実施例１における画像処理装置３００、又は実施例２における画像処理装置７００が含まれ得る。直交変換部２０２は、入力された予測誤差信号を直交変換処理する。直交変換部２０２は、変換された係数値を示す信号を量子化部２０３に出力する。加えて、選択された直交変換を示す選択信号をエントロピー符号化部及び逆直交変換部に送出する。 The orthogonal transform unit 202 can include the image processing device 300 in the first embodiment or the image processing device 700 in the second embodiment. The orthogonal transform unit 202 performs an orthogonal transform process on the input prediction error signal. The orthogonal transform unit 202 outputs a signal indicating the transformed coefficient value to the quantization unit 203. In addition, a selection signal indicating the selected orthogonal transform is sent to the entropy coding unit and the inverse orthogonal transform unit.

量子化部２０３は、直交変換部２０２からの出力信号を量子化する。量子化部２０３は、量子化することによって出力信号の符号量を低減し、この出力信号をエントロピー符号化部２０４及び逆量子化部２０５に出力する。量子化部２０３は、量子化パラメータのＱＰ値をループフィルタ部２０９に出力してもよい。 The quantization unit 203 quantizes the output signal from the orthogonal transform unit 202. The quantization unit 203 reduces the code amount of the output signal by quantization, and outputs this output signal to the entropy encoding unit 204 and the inverse quantization unit 205. The quantization unit 203 may output the QP value of the quantization parameter to the loop filter unit 209.

エントロピー符号化部２０４は、量子化部２０３からの出力信号や、動きベクトル計算部２１３から出力された動きベクトル情報やループフィルタ部２０９からのフィルタ係数などをエントロピー符号化して出力する。 The entropy encoding unit 204 entropy-encodes and outputs the output signal from the quantization unit 203, the motion vector information output from the motion vector calculation unit 213, the filter coefficient from the loop filter unit 209, and the like.

また、エントロピー符号化部２０４は、イントラ予測部２１１から取得したイントラ予測方向の差分値や、インター予測部２１２から取得した動きベクトルと予測ベクトルの差分値などをエントロピー符号化してもよい。 Further, the entropy encoding unit 204 may entropy encode the difference value of the intra prediction direction acquired from the intra prediction unit 211, the difference value of the motion vector and the prediction vector acquired from the inter prediction unit 212, and the like.

また、エントロピー符号化部２０４は、直交変換部２０２において使用された直交変換の情報２５０（直交変換の選択信号）を符号化して、選択した直交変換のインデックスとして復号装置側へ伝送する。エントロピー符号化とは、シンボルの出現頻度に応じて可変長の符号を割り当てる方式をいう。 The entropy encoding unit 204 encodes the orthogonal transformation information 250 (orthogonal transformation selection signal) used in the orthogonal transformation unit 202, and transmits the coded information to the decoding device side as an index of the selected orthogonal transformation. Entropy coding is a method of assigning variable-length codes according to the appearance frequency of symbols.

逆量子化部２０５は、量子化部２０３からの出力信号を逆量子化してから逆直交変換部２０６に出力する。逆直交変換部２０６は、逆量子化部２０５からの出力信号を逆直交変換処理してから復号画像生成部２０７に出力する。逆直交変換部２０６は、入力された直交変換の選択信号を用いて逆直交変換の方法を選択する。これら逆量子化部２０５及び逆直交変換部２０６によって復号処理が行われることにより、符号化前の予測誤差信号と同程度の信号が得られる。 The inverse quantization unit 205 performs inverse quantization on the output signal from the quantization unit 203 and outputs the result to the inverse orthogonal transform unit 206. The inverse orthogonal transform unit 206 performs an inverse orthogonal transform process on the output signal from the inverse quantization unit 205 and then outputs the output signal to the decoded image generation unit 207. The inverse orthogonal transform unit 206 selects an inverse orthogonal transform method using the input orthogonal transform selection signal. By performing decoding processing by the inverse quantization unit 205 and the inverse orthogonal transform unit 206, a signal having the same level as the prediction error signal before encoding is obtained.

復号画像生成部２０７は、イントラ予測部２１１で画面内予測された画像或いはインター予測部２１２で動き補償された画像のブロックデータと、逆量子化部２０５及び逆直交変換部２０６により復号処理された予測誤差信号とを加算する。復号画像生成部２０７は、加算して生成した復号画像のブロックデータを、ループフィルタ部２０９に出力する。 The decoded image generation unit 207 is decoded by the block data of the image predicted in the screen by the intra prediction unit 211 or the motion compensated image by the inter prediction unit 212, and the inverse quantization unit 205 and the inverse orthogonal transform unit 206. Add the prediction error signal. The decoded image generation unit 207 outputs the decoded image block data generated by addition to the loop filter unit 209.

ループフィルタ部２０９は、例えばＡＬＦ（Adaptive Loop Filter）やデブロッキングフィルタである。ループフィルタ部２０９は、フィルタ処理結果を復号画像記憶部２１０に出力し、蓄積された１画像分のフィルタ処理結果を参照画像として記憶させる。 The loop filter unit 209 is, for example, an ALF (Adaptive Loop Filter) or a deblocking filter. The loop filter unit 209 outputs the filter processing result to the decoded image storage unit 210, and stores the accumulated filter processing result for one image as a reference image.

復号画像記憶部２１０は、入力した復号画像のブロックデータを新たな参照画像のデータとして記憶し、イントラ予測部２１１、インター予測部２１２及び動きベクトル計算部２１３に出力する。 The decoded image storage unit 210 stores the input block data of the decoded image as new reference image data, and outputs the data to the intra prediction unit 211, the inter prediction unit 212, and the motion vector calculation unit 213.

イントラ予測部２１１は、符号化対象画像の処理対象ブロックに対して、既に符号化された参照画素から予測画像のブロックデータを生成する。イントラ予測部２１１は、複数の予測方向を用いて予測を行い、最適な予測方向を決定する。予測方向については、符号化済みブロックの予測方向との差分値をビットストリームに含めるために、差分値がエントロピー符号化部２０４に出力される。 The intra prediction unit 211 generates block data of the predicted image from the already-encoded reference pixels for the processing target block of the encoding target image. The intra prediction unit 211 performs prediction using a plurality of prediction directions, and determines an optimal prediction direction. With respect to the prediction direction, the difference value is output to the entropy encoding unit 204 in order to include the difference value with the prediction direction of the encoded block in the bitstream.

インター予測部２１２は、復号画像記憶部２１０から取得した参照画像のデータを動きベクトル計算部２１３から提供される動きベクトルで動き補償する。これにより、動き補償された参照画像としてのブロックデータが生成される。動きベクトルについては、符号化済みブロックの動きベクトル（予測ベクトル）との差分値をビットストリームに含めるために、差分値がエントロピー符号化部２０４に出力される。 The inter prediction unit 212 performs motion compensation on the reference image data acquired from the decoded image storage unit 210 with the motion vector provided from the motion vector calculation unit 213. Thereby, block data as a motion-compensated reference image is generated. For the motion vector, the difference value with the motion vector (predicted vector) of the encoded block is output to the entropy encoding unit 204 in order to include the difference value in the bitstream.

動きベクトル計算部２１３は、符号化対象画像におけるブロックデータと、復号画像記憶部２１０から取得する参照画像とを用いて、動きベクトルを求める。 The motion vector calculation unit 213 obtains a motion vector using the block data in the encoding target image and the reference image acquired from the decoded image storage unit 210.

動きベクトル計算部２１３は、求めた動きベクトルをインター予測部２１２に出力し、参照画像を示す情報を含む動きベクトル情報をエントロピー符号化部２０４に出力する。 The motion vector calculation unit 213 outputs the obtained motion vector to the inter prediction unit 212 and outputs motion vector information including information indicating the reference image to the entropy encoding unit 204.

イントラ予測部２１１とインター予測部２１２から出力されたブロックデータは、予測画像選択部２１５に入力される。 The block data output from the intra prediction unit 211 and the inter prediction unit 212 is input to the predicted image selection unit 215.

予測画像選択部２１５は、イントラ予測部２１１とインター予測部２１２から取得したブロックデータのうち、どちらか一方のブロックデータを予測画像として選択する。選択された予測画像は、予測誤差信号生成部２０１に出力される。 The predicted image selection unit 215 selects one of the block data acquired from the intra prediction unit 211 and the inter prediction unit 212 as a predicted image. The selected prediction image is output to the prediction error signal generation unit 201.

なお、イントラ予測部２１１とインター予測部２１２のいずれが使用されたかに依存して、直交変換部２０２において利用される複数の直交変換の候補を異ならせてもよい。 Depending on which of the intra prediction unit 211 and the inter prediction unit 212 is used, a plurality of orthogonal transform candidates used in the orthogonal transform unit 202 may be different.

なお、図１０に示す画像処理装置２０の構成は一例であり、必要に応じて各構成を組み合わせたり、各構成を適宜変更したりしてもよい。 Note that the configuration of the image processing apparatus 20 illustrated in FIG. 10 is an example, and the configurations may be combined or the configurations may be appropriately changed as necessary.

以上、実施例３によれば、画像符号化時に、直交変換の計算量を抑えつつ、複数の直交変換のうちから適切な直交変換により、直交変換係数を得ることができる。 As described above, according to the third embodiment, orthogonal transform coefficients can be obtained by appropriate orthogonal transform from among a plurality of orthogonal transforms while suppressing the amount of computation of orthogonal transform during image coding.

［実施例４］
図１１は、実施例４における画像処理装置４０の概略構成の一例を示すブロック図である。図１１に示す画像処理装置４０は、上述した実施例１〜３で説明した画像処理をソフトウェアで実装した装置の一例である。 [Example 4]
FIG. 11 is a block diagram illustrating an example of a schematic configuration of the image processing device 40 according to the fourth embodiment. An image processing apparatus 40 shown in FIG. 11 is an example of an apparatus in which the image processing described in the first to third embodiments is implemented by software.

図１１に示すように、画像処理装置４０は、制御部４０１と、主記憶部４０２と、補助記憶部４０３と、ドライブ装置４０４と、ネットワークＩ／Ｆ部４０６と、入力部４０７と、表示部４０８とを有する。これら各構成は、バスを介して相互にデータ送受信可能に接続されている。 As shown in FIG. 11, the image processing apparatus 40 includes a control unit 401, a main storage unit 402, an auxiliary storage unit 403, a drive device 404, a network I / F unit 406, an input unit 407, and a display unit. 408. These components are connected to each other via a bus so as to be able to transmit and receive data.

制御部４０１は、コンピュータの中で、各装置の制御やデータの演算、加工を行うＣＰＵ（Central Processing Unit）である。また、制御部４０１は、主記憶部４０２又は補助記憶部４０３に記憶された画像処理のプログラムを実行する演算装置である。制御部４０１は、入力部４０７や記憶装置からデータを受け取り、演算、加工した上で、表示部４０８や記憶装置などに出力する。 The control unit 401 is a CPU (Central Processing Unit) that controls each device, calculates data, and processes in a computer. The control unit 401 is an arithmetic device that executes an image processing program stored in the main storage unit 402 or the auxiliary storage unit 403. The control unit 401 receives data from the input unit 407 and the storage device, calculates and processes the data, and then outputs the data to the display unit 408 and the storage device.

また、制御部４０１は、画像処理のプログラムを実行することで、実施例１〜３で説明した処理を実現することができる。 Also, the control unit 401 can realize the processing described in the first to third embodiments by executing an image processing program.

主記憶部４０２は、ＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）などである。主記憶部４０２は、制御部４０１が実行する基本ソフトウェアであるＯＳ（Operating System）やアプリケーションソフトウェアなどのプログラムやデータを記憶又は一時保存する記憶装置である。 The main storage unit 402 is a ROM (Read Only Memory), a RAM (Random Access Memory), or the like. The main storage unit 402 is a storage device that stores or temporarily stores programs and data such as OS (Operating System) and application software that are basic software executed by the control unit 401.

補助記憶部４０３は、ＨＤＤ（Hard Disk Drive）などであり、アプリケーションソフトウェアなどに関連するデータを記憶する記憶装置である。 The auxiliary storage unit 403 is an HDD (Hard Disk Drive) or the like, and is a storage device that stores data related to application software or the like.

ドライブ装置４０４は、記録媒体４０５、例えばフレキシブルディスクからプログラムを読み出し、記憶部にインストールする。 The drive device 404 reads the program from the recording medium 405, for example, a flexible disk, and installs it in the storage unit.

また、記録媒体４０５に、所定のプログラムを格納し、この記録媒体４０５に格納されたプログラムはドライブ装置４０４を介して画像処理装置４０にインストールされる。インストールされた所定のプログラムは、画像処理装置４０により実行可能となる。 A predetermined program is stored in the recording medium 405, and the program stored in the recording medium 405 is installed in the image processing apparatus 40 via the drive device 404. The installed predetermined program can be executed by the image processing apparatus 40.

ネットワークＩ／Ｆ部４０６は、有線及び／又は無線回線などのデータ伝送路により構築されたＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）などのネットワークを介して接続された通信機能を有する周辺機器と画像処理装置４０とのインターフェースである。 The network I / F unit 406 is a peripheral having a communication function connected via a network such as a LAN (Local Area Network) or a WAN (Wide Area Network) constructed by a data transmission path such as a wired and / or wireless line. This is an interface between the device and the image processing apparatus 40.

入力部４０７は、カーソルキー、数字入力及び各種機能キー等を備えたキーボード、表示部４０８の表示画面上でキーの選択等を行うためのマウスやスライドパット等を有する。表示部４０８は、ＬＣＤ（Liquid Crystal Display）等により構成され、制御部４０１から入力される表示データに応じた表示が行われる。 The input unit 407 includes a keyboard having cursor keys, numeric input, various function keys, and the like, and a mouse and a slide pad for performing key selection on the display screen of the display unit 408. The display unit 408 is configured by an LCD (Liquid Crystal Display) or the like, and performs display according to display data input from the control unit 401.

なお、図３の画像処理装置３００及び図７の画像処理装置７００の各部は、例えば制御部４０１及びワークメモリとしての主記憶部４０２により実現されうる。 Each unit of the image processing apparatus 300 in FIG. 3 and the image processing apparatus 700 in FIG. 7 can be realized by, for example, the control unit 401 and the main storage unit 402 as a work memory.

また、図１０に示す復号画像記憶部２１０は、例えば主記憶部４０２又は補助記憶部４０３により実現され、図１０に示す復号画像記憶部２１０以外の構成は、例えば制御部４０１及びワークメモリとしての主記憶部４０２により実現されうる。 Also, the decoded image storage unit 210 illustrated in FIG. 10 is realized by, for example, the main storage unit 402 or the auxiliary storage unit 403, and the configuration other than the decoded image storage unit 210 illustrated in FIG. This can be realized by the main storage unit 402.

画像処理装置４０で実行されるプログラムは、実施例１〜３で説明した各部を含むモジュール構成となっている。実際のハードウェアとしては、制御部４０１が補助記憶部４０３からプログラムを読み出して実行することにより上記各部のうち１又は複数の各部が主記憶部４０２上にロードされ、１又は複数の各部が主記憶部４０２上に生成されるようになっている。 The program executed by the image processing apparatus 40 has a module configuration including each unit described in the first to third embodiments. As actual hardware, when the control unit 401 reads out and executes a program from the auxiliary storage unit 403, one or more of the above-described units are loaded onto the main storage unit 402, and one or more of the respective units are main. It is generated on the storage unit 402.

このように、上述した実施例１〜３で説明した画像処理は、コンピュータに実行させるためのプログラムとして実現されてもよい。このプログラムをサーバ等からインストールしてコンピュータに実行させることで、実施例１〜３で説明した処理を実現することができる。 As described above, the image processing described in the first to third embodiments may be realized as a program for causing a computer to execute the image processing. The processing described in the first to third embodiments can be realized by installing this program from a server or the like and causing the computer to execute the program.

また、このプログラムを記録媒体４０５に記録し、このプログラムが記録された記録媒体４０５をコンピュータや携帯端末などの処理装置に読み取らせて、前述した画像処理を実現させることも可能である。 It is also possible to record the program in the recording medium 405 and cause the processing unit such as a computer or a portable terminal to read the recording medium 405 on which the program is recorded, thereby realizing the above-described image processing.

なお、記録媒体４０５は、ＣＤ−ＲＯＭ、フレキシブルディスク、光磁気ディスク等のように情報を光学的，電気的或いは磁気的に記録する記録媒体、ＲＯＭ、フラッシュメモリ等のように情報を電気的に記録する半導体メモリ等、様々なタイプの記録媒体を用いることができる。 Note that the recording medium 405 is a recording medium that records information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk, and information that is electrically stored such as a ROM or a flash memory. Various types of recording media such as a semiconductor memory for recording can be used.

また、上述した各実施例で説明した画像処理は、１つ又は複数の集積回路に実装されてもよい。なお、実施例４における画像処理装置４０は、上記の通り、画像処理装置３００、７００、２０の少なくとも１つの装置としての機能を有してもよい。 Further, the image processing described in each of the above embodiments may be implemented in one or a plurality of integrated circuits. Note that, as described above, the image processing apparatus 40 according to the fourth embodiment may have a function as at least one of the image processing apparatuses 300, 700, and 20.

また、上述した各実施例における画像処理装置は、直交変換を選択して使用する符号化技術に対して適用可能であり、Ｈ．２６４／ＡＶＣやＨＥＶＣだけに限られるものではない。 In addition, the image processing apparatus in each embodiment described above can be applied to an encoding technique that selects and uses orthogonal transform. It is not limited to H.264 / AVC or HEVC.

以上、各実施例について詳述したが、特定の実施例に限定されるものではなく、特許請求の範囲に記載された範囲内において、上記変形例以外にも種々の変形及び変更が可能である。 Each embodiment has been described in detail above. However, the present invention is not limited to the specific embodiment, and various modifications and changes other than the above-described modification are possible within the scope described in the claims. .

３２０第１の直交変換部
３２１分析結果
３２２分析部
３２７選択指示信号
３２８第１のスイッチ
３２９第２のスイッチ
３３０選択部
３４０第２の直交変換部
３４１入力画像
３５１出力
７２４類似度算出部
７２６記憶部
７４３第３の直交変換部 320 First orthogonal transform unit 321 Analysis result 322 Analysis unit 327 Selection instruction signal 328 First switch 329 Second switch 330 Selection unit 340 Second orthogonal transform unit 341 Input image 351 Output 724 Similarity calculation unit 726 Storage unit 743 Third orthogonal transform unit

Claims

An image processing apparatus that performs processing by selecting any one of a first orthogonal transformation and one or more other orthogonal transformations,
An analysis unit that analyzes a first orthogonal transform coefficient obtained by transforming image data by the first orthogonal transform according to a predetermined rule;
A selection unit that selects any one of the first orthogonal transform and the one or more other orthogonal transforms based on the analysis result of the analysis unit;
An image processing apparatus.

The predetermined rule is: an energy ratio between an odd-order and an even-order in the horizontal direction of the first orthogonal transform coefficient and / or an energy ratio between an odd-order and an even-order in the vertical direction of the first orthogonal transform coefficient. Use
The image processing apparatus according to claim 1.

The analysis unit
A storage unit that stores orthogonal transform coefficients obtained by transforming each of a predetermined number of the plurality of base images of each of the one or more other orthogonal transforms by the first orthogonal transform;
A similarity calculation unit that calculates the similarity between the first orthogonal transform coefficient and the plurality of orthogonal transform coefficients stored in the storage unit;
Including
The selection unit, when the highest similarity among the similarities obtained by the similarity calculation unit exceeds a predetermined threshold, the image is subjected to the other orthogonal transformation corresponding to the highest similarity. By selectively providing data, an orthogonal transform coefficient obtained by transforming the image data is used for encoding the image data,
If the highest similarity is less than or equal to the predetermined threshold, the first orthogonal transform coefficient is used for encoding the image data;
The image processing apparatus according to claim 1.

An encoding device comprising the image processing device according to any one of claims 1 to 3.

A program for causing a computer to function as the encoding device according to claim 4.