JP2013182416A

JP2013182416A - Feature amount extraction device, feature amount extraction method, and feature amount extraction program

Info

Publication number: JP2013182416A
Application number: JP2012045780A
Authority: JP
Inventors: Toshiaki Inoue; 俊明井上
Original assignee: Pioneer Electronic Corp
Current assignee: Pioneer Corp
Priority date: 2012-03-01
Filing date: 2012-03-01
Publication date: 2013-09-12

Abstract

PROBLEM TO BE SOLVED: To accurately extract a feature amount of an input image with a small operation amount.SOLUTION: A feature amount extraction device smooths an input image input from outside to generate a smoothed image group. The smoothed image group is spatially differentiated to generate a differential image group. Then, a feature amount of the input image is extracted on the basis of the differential image group, and is output to the outside. A differential image is generated by spatial differentiation, thereby enabling accurate feature amount extraction with a small operation amount.

Description

本発明は、画像の回転や拡大縮小などの変化に対して不変な局所特徴量を抽出する方法であるＳＩＦＴ（Ｓｃａｌｅ−ＩｎｖａｒｉａｎｔＦｅａｔｕｒｅＴｒａｎｓｆｏｒｍ）に関する。 The present invention relates to SIFT (Scale-Invariant Feature Transform), which is a method for extracting local feature amounts that are invariant to changes such as image rotation and enlargement / reduction.

非特許文献１は、ＳＩＦＴによる特徴量抽出方法について記載している。具体的に、非特許文献１の方法は、入力画像を平滑化し、平滑化画像群から差分手段により微分画像群を生成し、微分画像群から特徴量を算出している。 Non-Patent Document 1 describes a feature quantity extraction method by SIFT. Specifically, in the method of Non-Patent Document 1, the input image is smoothed, a differential image group is generated from the smoothed image group by the difference means, and the feature amount is calculated from the differential image group.

特許文献１は、複数のオブジェクトを含むオブジェクト画像から、検出対象となるモデルを抽出する画像認識装置およびその方法、並びにそのような画像認識機能を搭載したロボット装置に関する発明を記載している。 Patent Literature 1 describes an invention relating to an image recognition apparatus and method for extracting a model to be detected from an object image including a plurality of objects, and a robot apparatus equipped with such an image recognition function.

八木、斎藤編、「コンピュータビジョン最先端ガイド２」、アドコムメディア社、pp.5-29、2010年6月19日発行Yagi, Saito, “Computer Vision Advanced Guide 2”, Adcom Media, pp.5-29, published on June 19, 2010

特許第４４９２０３６号公報Japanese Patent No. 4492036

非特許文献１は、微分画像群の生成に、ＬｏＧ（ＬａｐｌａｃｉａｎｏｆＧａｕｓｓｉａｎ）フィルタの近似であるＤｏＧ（ＤｉｆｆｅｒｅｎｃｅｏｆＧａｕｓｓｉａｎ）フィルタを用いているため、微分画像の画素値（微分係数）の精度がＬｏＧフィルタによる結果に比べて劣化するという問題がある。すなわち微分画像の極値（極大値または極小値）として検出されるキーポイント候補の精度、さらには抽出される特徴量の精度も劣化するという問題がある。 Since Non-Patent Document 1 uses a DoG (Difference of Gaussian) filter that is an approximation of a LoG (Laplacian of Gaussian) filter to generate a differential image group, the accuracy of the pixel value (differential coefficient) of the differential image is LoG. There is a problem that the result is deteriorated as compared with the result of the filter. That is, there is a problem that the accuracy of the key point candidate detected as the extreme value (maximum value or minimum value) of the differential image, and further, the accuracy of the extracted feature value deteriorates.

一方、特許文献１では、平滑化画像からの微分画像の生成にＤｏＧフィルタを用いれば先行技術１と同様の問題が生じる。また入力からの微分画像の生成にＬｏＧフィルタを用いれば、平滑化画像の生成とは別に入力画像とＬｏＧフィルタとの畳み込み演算を行う必要があり、演算量が増大するという問題がある。 On the other hand, in Patent Document 1, if a DoG filter is used to generate a differential image from a smoothed image, the same problem as in Prior Art 1 occurs. If a LoG filter is used to generate a differential image from an input, it is necessary to perform a convolution operation between the input image and the LoG filter separately from the generation of the smoothed image, which increases the amount of calculation.

本発明の解決しようとする課題としては、上記のものが一例として挙げられる。本発明は、入力画像の特徴量を少ない演算量で高精度に抽出することを目的とする。 Examples of the problem to be solved by the present invention include the above. An object of the present invention is to extract a feature amount of an input image with high accuracy with a small amount of calculation.

請求項１に記載の発明は、特徴量抽出装置であって、入力画像を平滑化して平滑化画像群を生成する平滑化手段と、前記平滑化画像群を空間微分して微分画像群を生成する微分手段と、前記微分画像群に基づいて、前記入力画像の特徴量を抽出し出力する特徴量抽出手段と、を備えることを特徴とする。 The invention according to claim 1 is a feature quantity extraction device, wherein a smoothing unit that smoothes an input image to generate a smoothed image group, and a differential image group is generated by spatially differentiating the smoothed image group. And a feature quantity extraction means for extracting and outputting a feature quantity of the input image based on the differential image group.

請求項８に記載の発明は、特徴量抽出装置により実行される特徴量抽出方法であって、入力画像を平滑化して平滑化画像群を生成する平滑化工程と、前記平滑化画像群を空間微分して微分画像群を生成する微分工程と、前記微分画像群に基づいて、前記入力画像の特徴量を抽出し出力する特徴量抽出工程と、を有することを特徴とする。 The invention according to claim 8 is a feature quantity extraction method executed by a feature quantity extraction device, wherein a smoothing step of smoothing an input image to generate a smoothed image group, and the smoothed image group as a space A differentiation step of differentiating to generate a differential image group; and a feature amount extraction step of extracting and outputting a feature amount of the input image based on the differential image group.

請求項９に記載の発明は、コンピュータにより実行される特徴量抽出プログラムであって、入力画像を平滑化して平滑化画像群を生成する平滑化手段、前記平滑化画像群を空間微分して微分画像群を生成する微分手段、前記微分画像群に基づいて、前記入力画像の特徴量を抽出し出力する特徴量抽出手段、として前記コンピュータを機能させる。 The invention according to claim 9 is a feature amount extraction program executed by a computer, smoothing means for smoothing an input image to generate a smoothed image group, and spatially differentiating the derivative of the smoothed image group The computer is caused to function as a differentiation unit that generates an image group, and a feature amount extraction unit that extracts and outputs a feature amount of the input image based on the differential image group.

第１実施例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on 1st Example. 第２実施例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on 2nd Example. 第３実施例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on 3rd Example. 非特許文献による特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus by a nonpatent literature. 第１の変形例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on a 1st modification. 第２の変形例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on a 2nd modification. 第３の変形例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on a 3rd modification. 第４の変形例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on a 4th modification. 第５の変形例に係る特徴量抽出装置の構成を示すブロック図である。It is a block diagram which shows the structure of the feature-value extraction apparatus which concerns on a 5th modification.

本発明の好適な実施形態では、特徴量抽出装置は、入力画像を平滑化して平滑化画像群を生成する平滑化手段と、前記平滑化画像群を空間微分して微分画像群を生成する微分手段と、前記微分画像群に基づいて、前記入力画像の特徴量を抽出し出力する特徴量抽出手段と、を備える。 In a preferred embodiment of the present invention, the feature quantity extraction device includes a smoothing unit that smoothes an input image to generate a smoothed image group, and a differential that generates a differential image group by spatially differentiating the smoothed image group. And a feature quantity extraction means for extracting and outputting the feature quantity of the input image based on the differential image group.

上記の特徴量抽出装置は、外部から入力された入力画像を平滑化し、平滑化画像群を生成する。平滑化画像群は空間微分され、微分画像群が生成される。そして、微分画像群に基づいて入力画像の特徴量が抽出され、外部へ出力される。空間微分で微分画像を生成することにより、少ない演算量で高精度な特徴量抽出が可能となる。好適な例では、前記微分手段は、ラプラシアンフィルタ手段である。 The feature amount extraction apparatus smoothes an input image input from the outside, and generates a smoothed image group. The smoothed image group is spatially differentiated to generate a differentiated image group. And the feature-value of an input image is extracted based on a differential image group, and is output outside. By generating a differential image by spatial differentiation, highly accurate feature extraction can be performed with a small amount of computation. In a preferred example, the differentiating means is a Laplacian filter means.

上記の特徴量抽出装置の一態様は、前記微分手段から出力された微分画像の画素値を増幅して前記特徴量抽出手段に入力する増幅手段を備える。微分画像の画素値、即ち微分係数を増幅することにより、多くの特徴量を抽出することが可能となる。 One aspect of the above feature quantity extraction device includes amplification means for amplifying the pixel value of the differential image output from the differentiation means and inputting it to the feature quantity extraction means. By amplifying the pixel value of the differential image, that is, the differential coefficient, it is possible to extract many feature amounts.

上記の特徴量抽出装置の他の一態様は、前記入力画像をグレースケールに変換し、前記平滑化手段に入力するグレースケール変換手段を備える。この態様では、複数の色成分を有する入力画像を、色成分毎に別々に処理する必要がないため、処理量を減少させることができる。 Another aspect of the feature amount extraction apparatus includes a gray scale conversion unit that converts the input image into a gray scale and inputs the converted image to the smoothing unit. In this aspect, since it is not necessary to separately process an input image having a plurality of color components for each color component, the processing amount can be reduced.

上記の特徴量抽出装置の他の一態様は、前記入力画像の色空間を変換し、色空間変換後の入力画像を前記平滑化手段に入力する色空間変換手段を備える。この態様では、有効な特徴量の抽出を行うのに適した色空間を選択することができる。 Another aspect of the feature amount extraction apparatus includes a color space conversion unit that converts a color space of the input image and inputs the input image after the color space conversion to the smoothing unit. In this aspect, it is possible to select a color space suitable for extracting effective feature amounts.

上記の特徴量抽出装置の他の一態様は、前記入力画像の色空間を変換する色空間変換手段と、色空間変換後の入力画像から１つの色成分を抽出して前記平滑化手段に入力する色成分抽出手段と、を備える。この態様では、複数の色成分を有する色空間の入力画像から、有効な特徴量の抽出を行うのに適した色成分を選択して特徴量の抽出を行うことができる。 Another aspect of the feature amount extraction apparatus is a color space conversion unit that converts a color space of the input image, and extracts one color component from the input image after the color space conversion and inputs the color component to the smoothing unit. Color component extraction means. In this aspect, it is possible to extract a feature amount by selecting a color component suitable for extracting an effective feature amount from an input image in a color space having a plurality of color components.

上記の特徴量抽出装置の他の一態様は、前記入力画像の色空間を変換する色空間変換手段と、色空間変換後の入力画像から複数の色成分を抽出する色成分抽出手段と、抽出された複数の色成分について任意の演算を行い前記平滑化手段に入力する演算手段と、を備える。好適な例では、演算手段は、任意の演算として平均を算出する。この態様では、複数の色成分を有する色空間の入力画像から、有効な特徴量の抽出を行うのに適した複数の色成分を選択して特徴量の抽出を行うことができる。 Another aspect of the feature amount extraction apparatus includes a color space conversion unit that converts a color space of the input image, a color component extraction unit that extracts a plurality of color components from the input image after the color space conversion, and an extraction Arithmetic means for performing an arbitrary calculation on the plurality of color components and inputting the result to the smoothing means. In a preferred example, the calculation means calculates an average as an arbitrary calculation. In this aspect, it is possible to extract a feature quantity by selecting a plurality of color components suitable for extracting an effective feature quantity from an input image in a color space having a plurality of color components.

本発明の他の好適な実施形態は、特徴量抽出装置により実行される特徴量抽出方法であって、入力画像を平滑化して平滑化画像群を生成する平滑化工程と、前記平滑化画像群を空間微分して微分画像群を生成する微分工程と、前記微分画像群に基づいて、前記入力画像の特徴量を抽出し出力する特徴量抽出工程と、を有する。 Another preferred embodiment of the present invention is a feature amount extraction method executed by a feature amount extraction apparatus, which smoothes an input image to generate a smoothed image group, and the smoothed image group A differential process for generating a differential image group by spatial differentiation, and a feature quantity extraction process for extracting and outputting a feature quantity of the input image based on the differential image group.

上記の特徴量抽出方法では、外部から入力された入力画像が平滑化され、平滑化画像群が生成される。平滑化画像群は空間微分され、微分画像群が生成される。そして、微分画像群に基づいて、入力画像の特徴量が抽出され、外部へ出力される。空間微分で微分画像を生成することにより、少ない演算量で高精度な特徴量抽出が可能となる。 In the above feature quantity extraction method, an input image input from the outside is smoothed, and a smoothed image group is generated. The smoothed image group is spatially differentiated to generate a differentiated image group. Based on the differential image group, the feature amount of the input image is extracted and output to the outside. By generating a differential image by spatial differentiation, highly accurate feature extraction can be performed with a small amount of computation.

本発明の他の好適な実施形態は、コンピュータにより実行される特徴量抽出プログラムであって、入力画像を平滑化して平滑化画像群を生成する平滑化手段、前記平滑化画像群を空間微分して微分画像群を生成する微分手段、前記微分画像群に基づいて、前記入力画像の特徴量を抽出し出力する特徴量抽出手段、として前記コンピュータを機能させる。 Another preferred embodiment of the present invention is a feature amount extraction program executed by a computer, comprising: smoothing means for smoothing an input image to generate a smoothed image group; and spatially differentiating the smoothed image group. The computer is caused to function as differentiation means for generating a differential image group, and feature quantity extraction means for extracting and outputting the feature quantity of the input image based on the differential image group.

上記の特徴量抽出プログラムをコンピュータ上で実行することにより、外部から入力された入力画像が平滑化され、平滑化画像群が生成される。平滑化画像群は空間微分され、微分画像群が生成される。そして、微分画像群に基づいて、入力画像の特徴量が抽出され、外部へ出力される。空間微分で微分画像を生成することにより、少ない演算量で高精度な特徴量抽出が可能となる。好適な例では、上記のプログラムは記憶媒体に記憶される。 By executing the above feature quantity extraction program on a computer, an input image input from the outside is smoothed, and a smoothed image group is generated. The smoothed image group is spatially differentiated to generate a differentiated image group. Based on the differential image group, the feature amount of the input image is extracted and output to the outside. By generating a differential image by spatial differentiation, highly accurate feature extraction can be performed with a small amount of computation. In a preferred example, the above program is stored in a storage medium.

以下、図面を参照して本発明の好適な実施例について説明する。なお、以下に説明する実施例及び変形例に係る特徴量抽出装置において、各手段は集積回路などによるハードウェアにより実現することができ、また、コンピュータ上で実行されるソフトウェア（プログラム）により実現することもできる。 Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings. In the feature quantity extraction apparatus according to the embodiments and modifications described below, each unit can be realized by hardware such as an integrated circuit, and also realized by software (program) executed on a computer. You can also

［第１実施例］
図１は、第１実施例に係る特徴量抽出装置の構成を示す。第１実施例に係る特徴量抽出装置は、画像入力手段１１と、平滑化手段１２と、空間微分手段１３と、キーポイント検出・処理・記述手段１４と、特徴量出力手段１５と、を備える。 [First embodiment]
FIG. 1 shows the configuration of a feature quantity extraction apparatus according to the first embodiment. The feature quantity extraction apparatus according to the first embodiment includes an image input means 11, a smoothing means 12, a spatial differentiation means 13, a key point detection / processing / description means 14, and a feature quantity output means 15. .

画像入力手段１１には、カメラなどの撮像デバイスで撮影された画像やハードディスクなどの記録デバイスに格納された画像が入力される。画像入力手段１１は、その入力画像を受け取り、平滑化手段１２に出力する。平滑化手段１２は、画像入力手段１２から入力された画像から複数の平滑化画像を生成し、それら複数の平滑化画像からなる平滑化画像群を空間微分手段１３に出力する。 The image input means 11 receives an image taken by an imaging device such as a camera or an image stored in a recording device such as a hard disk. The image input means 11 receives the input image and outputs it to the smoothing means 12. The smoothing unit 12 generates a plurality of smoothed images from the image input from the image input unit 12 and outputs a smoothed image group including the plurality of smoothed images to the spatial differentiating unit 13.

空間微分手段１３は、平滑化画像群を受け取り、複数の微分画像からなる微分画像群を生成してキーポイント検出・処理・記述手段１４に出力する。キーポイント検出・処理・記述手段１４は、例えば多次元ベクトルで表現される特徴量を抽出し、特徴量出力手段１５に出力する。特徴量出力手段１５は、キーポイント検出・処理・記述手段１４から入力された特徴量を、記録デバイスやディスプレイ、又は、ほかの処理を行う処理部へ出力する。 The spatial differentiation means 13 receives the smoothed image group, generates a differential image group composed of a plurality of differential images, and outputs it to the key point detection / processing / description means 14. The key point detection / processing / description unit 14 extracts a feature amount expressed by, for example, a multidimensional vector and outputs the feature amount to the feature amount output unit 15. The feature quantity output unit 15 outputs the feature quantity input from the keypoint detection / processing / description unit 14 to a recording device, a display, or a processing unit that performs other processing.

次に、第１実施例による特徴量抽出処理について詳しく説明する。 Next, the feature amount extraction process according to the first embodiment will be described in detail.

カメラなどの撮像デバイスで撮影された画像やハードディスクなどの記録デバイスに格納された入力画像は、画像入力手段１１によって取り込まれ、取り込まれた画像から平滑化手段１２によって平滑化度合いの異なる複数の画像である平滑化画像群が生成される。平滑化画像の生成は、例えば式（１）に示すように入力画像とガウス関数との畳み込み演算で行われる。 An image captured by an imaging device such as a camera or an input image stored in a recording device such as a hard disk is captured by the image input unit 11, and a plurality of images having different smoothing levels by the smoothing unit 12 from the captured image. A smoothed image group is generated. The generation of the smoothed image is performed by a convolution operation between the input image and a Gaussian function, for example, as shown in Expression (1).

ここでＬは平滑化画像、Ｇはガウス関数、Ｉは入力画像、（ｘ，ｙ）と（ｕ，ｖ）は２次元座標、σは平滑化度合いである。平滑化画像群は、１枚の入力画像から式（１）のσの値を変えながら生成された平滑化度合いの異なる複数の画像からなる。 Here, L is a smoothed image, G is a Gaussian function, I is an input image, (x, y) and (u, v) are two-dimensional coordinates, and σ is a smoothing degree. The smoothed image group is composed of a plurality of images with different degrees of smoothing generated while changing the value of σ in Expression (1) from one input image.

次に、空間微分手段１３によって、平滑化画像群のそれぞれの画像に対して空間微分が施され、微分画像群が生成される。ここで空間微分手段１３はどのようなものでもよいが、例えば式（２）に示すようにラプラシアン演算（空間に関する２次微分）を平滑化画像に施してもよい。 Next, the spatial differentiation means 13 performs spatial differentiation on each image of the smoothed image group to generate a differential image group. Here, any spatial differentiation means 13 may be used, but for example, Laplacian calculation (secondary differentiation with respect to space) may be performed on the smoothed image as shown in Expression (2).

ここで、Ｄは微分画像、σはガウス関数の平滑化度合い、ｋは任意の定数、Ｌは平滑化画像、（ｘ、ｙ）は２次元座標である。微分画像群は、平滑化画像群のそれぞれの画像に対して空間微分を施して得られた画像からなる。 Here, D is a differential image, σ is a smoothing degree of a Gaussian function, k is an arbitrary constant, L is a smoothed image, and (x, y) is a two-dimensional coordinate. The differential image group consists of images obtained by performing spatial differentiation on each image of the smoothed image group.

次に、キーポイント検出・処理・記述手段１４によって、微分画像群からキーポイント候補（特徴量抽出の対象となる画像上の位置の候補）が検出される。さらに、適当な手法によって特徴量として不適切なキーポイント候補が削除され、有効なキーポイントに対して例えば多次元ベクトルからなる特徴量が記述されて特徴量出力手段１５へ出力される。 Next, the key point detection / processing / description means 14 detects key point candidates (candidates of positions on the image to be subjected to feature extraction) from the differential image group. Further, an inappropriate key point candidate as a feature quantity is deleted by an appropriate method, and a feature quantity composed of, for example, a multidimensional vector is described for the valid key point and output to the feature quantity output means 15.

特徴量出力手段１５は、キーポイント検出・処理・記述手段１４から出力された特徴量を、例えば記録デバイスへ蓄積するか、ディスプレイへ表示するか、あるいは他の処理へ出力する。 The feature quantity output means 15 stores the feature quantity output from the key point detection / processing / description means 14 in, for example, a recording device, displays it on a display, or outputs it to other processing.

キーポイント検出・処理・記述手段１４による処理には、例えば非特許文献１：「コンピュータビジョン最先端ガイド２」（アドコムメディア社、２０１０年６月１９日発行）の１０ページから１７ページに詳細に説明されている手法を適用することができる。以下、その概要を説明する。 For the processing by the key point detection / processing / description means 14, for example, Non-Patent Document 1: “Computer Vision State-of-the-Art Guide 2” (Adcom Media, published on June 19, 2010), page 10 to page 17, will be described in detail. The described technique can be applied. The outline will be described below.

キーポイント検出・処理・記述手段１４は、（ａ）キーポイント候補検出処理、（ｂ）キーポイントローカライズ処理とオリエンテーション算出処理、（ｃ）特徴量記述処理、という３つのステップからなる。 The key point detection / processing / description means 14 includes three steps: (a) key point candidate detection processing, (b) key point localization processing and orientation calculation processing, and (c) feature amount description processing.

（ａ）キーポイント候補検出処理
キーポイント候補検出処理は、空間微分手段１３により生成された微分画像群からキーポイントの候補を検出する処理である。すなわち平滑化度合いσ_ｉで式１により平滑化した画像（平滑化画像σ_ｉとする）に対応する微分画像（微分画像σ_ｉとする）上の注目画素ｐに対し、ｐおよびｐに隣接する８個の画素を含む領域（ｐを中心とした９画素の矩形領域）をとる。さらに微分画像σ_ｉと隣接する２つの微分画像σ_ｉ±１のそれぞれについても同様に上記矩形領域に対応する９画素をとり、ｐの値（微分係数）と合計２６個の隣接画素の値を比較し、ｐの値が最大または最小である場合に注目画素ｐを平滑化画像σ_ｉの１つのキーポイント候補とする。この処理を微分画像群の全てに対して行うことで全てのキーポイント候補を決定する。自然画像では通常キーポイント候補は多数検出される。 (A) Key Point Candidate Detection Process The key point candidate detection process is a process for detecting key point candidates from the differential image group generated by the spatial differentiator 13. That is, adjacent to p and p with respect to the pixel of interest p on the differential image (referred to as differential image σ _i ) corresponding to the image smoothed by Equation 1 with the smoothing degree σ _i (referred to as smoothed image σ _i ). An area including 8 pixels (a rectangular area of 9 pixels centered on p) is taken. Further takes two 9 pixels corresponding to the same manner the rectangular region for each of the differential image sigma _{i ± 1} adjacent to the differential image sigma _i, the values of (derivative) and a total of 26 pieces of adjacent pixels p In comparison, when the value of p is maximum or minimum, the target pixel p is set as one key point candidate of the smoothed image σ _i . All key point candidates are determined by performing this process on all the differential images. Many natural keypoint candidates are detected in a natural image.

（ｂ）キーポイントローカライズ処理とオリエンテーション算出処理
キーポイントローカライズ処理は、検出されたキーポイント候補から特徴量として適切でないものを削除し有効なキーポイント候補のみを選択する処理である。すなわち、（ｉ）ヘッセ行列を用いたエッジ上のキーポイント候補の削除、（ｉｉ）パラボラフィッティングによるキーポイント位置のサブピクセル推定、（ｉｉｉ）低コントラストのキーポイント削除、により有効なキーポイント候補を選択する。 (B) Key Point Localization Process and Orientation Calculation Process The key point localization process is a process for selecting only valid key point candidates by deleting those that are not suitable as feature quantities from the detected key point candidates. That is, (i) deletion of key point candidates on the edge using the Hessian matrix, (ii) sub-pixel estimation of the key point position by parabola fitting, and (iii) deletion of key points with low contrast. select.

オリエンテーション算出処理は、キーポイントローカライズ処理で選択された有効なキーポイント候補に対してそれぞれ特有のオリエンテーション（方向）を算出し、属性として付与することで画像の回転に不変な特徴量の抽出を可能にする処理である。すなわち、平滑化画像上のキーポイントの位置を中心として適当な矩形領域をとり、矩形領域内の全画素について輝度勾配を算出し、輝度勾配に関するヒストグラムを算出し、最頻値をとる勾配方向（３６方向に量子化された角度）をそのキーポイントのオリエンテーションとする。 The orientation calculation process calculates the unique orientation (direction) for each valid keypoint candidate selected in the keypoint localization process, and assigns it as an attribute to extract features that are invariant to image rotation It is a process to make. That is, an appropriate rectangular area is taken around the position of the key point on the smoothed image, a luminance gradient is calculated for all the pixels in the rectangular area, a histogram relating to the luminance gradient is calculated, and the gradient direction ( (An angle quantized in 36 directions) is set as the orientation of the key point.

（ｃ）特徴量記述処理
特徴量記述処理は、選択された有効なキーポイントとそのオリエンテーションを用いて特徴量を記述する処理である。すなわち、特徴量を記述する平滑化画像σ_ｉ上のキーポイントを中心として平滑化度合いσ_ｉを半径とする円が内接する矩形領域をとり、矩形領域をまずキーポイントのオリエンテーション方向に回転する。さらに矩形領域を４×４ブロックに分割し、ブロック毎にオリエンテーションの算出と同様の方法で輝度勾配を算出し、８方向に量子化された勾配方向ヒストグラムを作成する。特徴量は４×４＝１６ブロックについて８方向の勾配方向ヒストグラムを作成することで、１２８次元のベクトルとして記述される。この特徴量は画像に回転や拡大縮小があっても不変のベクトル量であり、また各特徴量のベクトルの長さを正規化することにより、画像の明るさの変化に関しても影響の少ない特徴量となる。 (C) Feature Quantity Description Process The feature quantity description process is a process for describing a feature quantity using a selected effective key point and its orientation. That is, a rectangular area inscribed by a circle having a radius of the smoothing degree σ _i around the key point on the smoothed image σ _i describing the feature amount is taken, and the rectangular area is first rotated in the orientation direction of the key point. Further, the rectangular area is divided into 4 × 4 blocks, the luminance gradient is calculated for each block in the same manner as the orientation calculation, and a gradient direction histogram quantized in 8 directions is created. The feature amount is described as a 128-dimensional vector by creating a gradient histogram in 8 directions for 4 × 4 = 16 blocks. This feature quantity is an invariant vector quantity even if the image is rotated or scaled. By normalizing the length of each feature quantity vector, the feature quantity has little effect on changes in image brightness. It becomes.

[第２実施例]
図２は、第２実施例に係る特徴量抽出装置の構成を示す。図２に示すように、第２実施例に係る特徴量抽出装置は、第１実施例に係る特徴量抽出装置において、空間微分手段１３をラプラシアンフィルタ手段１３ａとして構成したものである。 [Second Embodiment]
FIG. 2 shows the configuration of the feature quantity extraction apparatus according to the second embodiment. As shown in FIG. 2, the feature amount extraction apparatus according to the second embodiment is the same as the feature amount extraction apparatus according to the first embodiment except that the spatial differentiation means 13 is configured as a Laplacian filter means 13a.

第２実施例では、画像入力手段１１から平滑化手段１２まで、およびキーポイント検出・処理・記述手段１４から特徴量出力手段１５までは第１実施例と同様の処理であるため、第２実施例の特徴部分であるラプラシアンフィルタ手段１３ａの処理についてのみ説明する。 In the second embodiment, since the processing from the image input means 11 to the smoothing means 12 and the processing from the key point detection / processing / description means 14 to the feature amount output means 15 are the same as those in the first embodiment, the second embodiment Only the processing of the Laplacian filter means 13a, which is a characteristic part of the example, will be described.

ラプラシアンフィルタ手段１３ａは、式（１）に示す平滑化手段１２によって生成された平滑化画像を受け取ると、式（３）に示す演算によって微分画像を生成し、キーポイント検出・処理・記述手段１４に出力する。 When the Laplacian filter means 13a receives the smoothed image generated by the smoothing means 12 shown in Expression (1), the Laplacian filter means 13a generates a differential image by the operation shown in Expression (3), and detects the key point detection / processing / description means 14. Output to.

ここで、ＬｉおよびＤｉはそれぞれ平滑化度合いσ_ｉに対応する平滑化画像および微分画像、（ｘ、ｙ）は２次元座標であり、式（３）により平滑化画像群（Ｌｉ,ｉ≧１）に対応する微分画像群（Ｄｉ，ｉ≧１）が生成される。 Here, Li and Di are respectively a smoothed image and a differential image corresponding to the smoothing degree σ _i , and (x, y) are two-dimensional coordinates, and a smoothed image group (Li, i ≧ 1) is obtained by Expression (3). ) Is generated corresponding to the differential image group (Di, i ≧ 1).

[第３実施例]
図３は、第３実施例に係る特徴量抽出装置の構成を示す。図３に示すように、第３実施例に係る特徴量抽出装置は、第１実施例又は第２実施例と同様の構成に加えて、キーポイント検出・処理・記述手段１４の前段に増幅手段１６を備える。 [Third embodiment]
FIG. 3 shows the configuration of the feature quantity extraction apparatus according to the third embodiment. As shown in FIG. 3, the feature quantity extraction apparatus according to the third embodiment has the same configuration as that of the first embodiment or the second embodiment, and an amplification unit in front of the keypoint detection / processing / description unit 14. 16.

第３実施例では、画像入力手段１１から微分手段１２まで、およびキーポイント検出・処理・記述手段１４から特徴量出力手段１５までは、第１実施例及び第２実施例と同様の処理であるため、第３実施例の特徴部分である増幅手段１６の処理についてのみ説明する。 In the third embodiment, the processing from the image input means 11 to the differentiation means 12 and the key point detection / processing / description means 14 to the feature quantity output means 15 are the same as those in the first and second embodiments. Therefore, only the processing of the amplification means 16 that is a characteristic part of the third embodiment will be described.

増幅手段１６は、空間微分手段１３によって生成された式（２）に示す微分画像、または、ラプラシアンフィルタ手段１３ａによって生成された式（３）に示す微分画像を入力とし、微分画像の各画素の値（微分係数）を適当な演算により増幅することによって増幅画像を生成し、キーポイント検出・処理・記述手段１４に出力する。増幅手段１６は例えば式（４）のようなものでもよい。 The amplifying means 16 receives as input the differential image shown in Expression (2) generated by the spatial differentiating means 13 or the differential image shown in Expression (3) generated by the Laplacian filter means 13a. An amplified image is generated by amplifying the value (differential coefficient) by an appropriate calculation, and is output to the keypoint detection / processing / description means 14. For example, the amplifying means 16 may be as shown in Equation (4).

ここで、Ｄ’，Ｄ_ｉ’は増幅画像、αは増幅率（任意の定数）、Ｄは微分画像、Ｌ、Ｌ_ｉは平滑化画像、ｋは任意の定数（ｋ＞１）であり、（Ｄ’，Ｄ，Ｌ）および（Ｄ_ｉ’，Ｌ_ｉ）はそれぞれ平滑化度合いσおよびσ_ｉに対応している。 Here, D ′ and D _i ′ are amplified images, α is an amplification factor (arbitrary constant), D is a differential image, L and L _i are smoothed images, k is an arbitrary constant (k> 1), (D ′, D, L) and (D _i ′, L _i ) correspond to the smoothing degrees σ and σ _i , respectively.

微分係数を増幅することにより、キーポイント検出処理においてより多くのキーポイント候補を検出することができ、これに伴いより多くの有効なキーポイント候補も得られる。その結果、より多くの特徴量を抽出することが可能になる。 By amplifying the differential coefficient, more key point candidates can be detected in the key point detection process, and more effective key point candidates can be obtained accordingly. As a result, it is possible to extract more feature amounts.

[先行技術との比較]
非特許文献１は、ＳＩＦＴによる特徴量抽出方法について記載したものであり、図４に示すように、（図示されない）外部から画像を入力して複数の平滑化画像からなる平滑化画像群を出力する平滑化手段２２と、平滑化画像群を入力して微分画像群を出力する差分手段２３と、微分画像群を入力して特徴量を算出するキーポイント検出・処理・記述手段２４とで構成される。 [Comparison with prior art]
Non-Patent Document 1 describes a feature amount extraction method using SIFT. As shown in FIG. 4, an image is input from the outside (not shown) and a smoothed image group including a plurality of smoothed images is output. Smoothing means 22 for inputting, a difference means 23 for inputting a smoothed image group and outputting a differential image group, and a key point detection / processing / description means 24 for inputting a differential image group and calculating a feature quantity. Is done.

ここで、本発明に関連する部分は、平滑化手段によって生成された平滑化画像群から差分手段によって微分画像群を生成し、キーポイント検出・処理・記述手段に出力する部分であり、式（５）に示すように平滑化度合いの異なる平滑化画像の差分により簡単に微分画像を生成している。 Here, the part related to the present invention is a part for generating a differential image group by the difference means from the smoothed image group generated by the smoothing means and outputting it to the key point detection / processing / description means, As shown in 5), the differential image is easily generated based on the difference between the smoothed images having different smoothing degrees.

ここで、Ｄは微分画像、Ｌは平滑化画像、Ｇはガウス関数、（ｘ、ｙ）および（ｕ，ｖ）は２次元座標、σは平滑化度合い、ｋは定数である。特に式（５）において、[ ]内はＤｏＧ（ＤｉｆｆｅｒｅｎｃｅｏｆＧａｕｓｓｉａｎ）フィルタと呼ばれる（非特許文献のｐ．７〜８を参照）。 Here, D is a differential image, L is a smoothed image, G is a Gaussian function, (x, y) and (u, v) are two-dimensional coordinates, σ is a smoothing degree, and k is a constant. In particular, in Formula (5), the inside of [] is called a DoG (Difference of Gaussian) filter (refer p.7-8 of a nonpatent literature).

特許文献１は、複数のオブジェクトを含むオブジェクト画像から、検出対象となるモデルを抽出する画像認識装置およびその方法、並びにそのような画像認識機能を搭載したロボット装置に関する発明である。具体的に、オブジェクト画像およびモデル画像について、それぞれの特徴点を抽出する特徴点抽出部と、抽出された特徴点から特徴量を記述する特徴量保持部と、それぞれの特徴量から類似度の高い（オブジェクト画像とモデル画像で対応する可能性の高い特徴点ペア）を生成する特徴量比較部で構成される（特許文献１の図１およびパラグラフ００２６を参照）。 Patent Document 1 is an invention related to an image recognition apparatus and method for extracting a model to be detected from an object image including a plurality of objects, and a robot apparatus equipped with such an image recognition function. Specifically, for an object image and a model image, a feature point extraction unit that extracts each feature point, a feature amount holding unit that describes a feature amount from the extracted feature points, and a high similarity from each feature amount A feature amount comparison unit that generates (a feature point pair that is highly likely to correspond between an object image and a model image) is configured (see FIG. 1 of Patent Document 1 and paragraph 0026).

ここで、本発明に関連する部分は、特徴点抽出部においてガウス関数を用いてオブジェクト画像およびモデル画像の多重解像度ピラミッド（平滑化画像群）を生成し、ＤｏＧフィルタまたはＬｏＧ（ＬａｐｌａｃｉａｎｏｆＧａｕｓｓｉａｎ）フィルタを用いて微分画像群を生成するところである（段落００２９〜００３２を参照）。ＤｏＧフィルタは式（５）に相当するものであり、ＬｏＧフィルタは式（６）が用いられる。 Here, a part related to the present invention is to generate a multi-resolution pyramid (smoothed image group) of an object image and a model image using a Gaussian function in a feature point extraction unit, and perform a DoG filter or a LoG (Laplacian of Gaussian) filter. Is used to generate a differential image group (see paragraphs 0029 to 0032). The DoG filter corresponds to Equation (5), and Equation (6) is used for the LoG filter.

ここで、ｆはＬｏＧフィルタ、Ｄは微分画像、Ｉは入力画像、（ｘ、ｙ）および（ｕ，ｖ）は２次元座標、σは平滑化度合いである。 Here, f is a LoG filter, D is a differential image, I is an input image, (x, y) and (u, v) are two-dimensional coordinates, and σ is a smoothing degree.

非特許文献１は、平滑化手段によって生成された平滑化画像群に対して、式（５）に示す差分手段により微分画像群を生成し、キーポイント検出・処理・記述手段に与る構成である。これに対し、第１〜第３実施例は、差分手段の代わりに式（２）に示す空間微分手段１３または式（３）に示すラプラシアンフィルタ手段１３ａにより微分画像群を生成し、キーポイント検出・処理・記述手段１４に与える構成となっている。 Non-Patent Document 1 has a configuration in which a differential image group is generated by the difference means shown in Equation (5) for the smoothed image group generated by the smoothing means, and applied to the key point detection / processing / description means. is there. On the other hand, in the first to third embodiments, a differential image group is generated by the spatial differentiating means 13 shown in the expression (2) or the Laplacian filter means 13a shown in the expression (3) instead of the difference means, and the key point is detected. The configuration is given to the processing / description means 14.

また、特許文献１では、多重解像度ピラミッド（平滑化画像群）に対してＤｏＧフィルタを用いるか、あるいは入力画像に対してＬｏＧフィルタを用いて微分画像群を生成している。ＤｏＧフィルタは式（５）に相当するものであり、ＬｏＧフィルタは式（６）に相当するものである。これに対し、第１〜第３実施例では、平滑化画像に対して式（２）に示す空間微分手段１３または式（３）に示すラプラシアンフィルタ手段１３ａによって微分画像群を生成し、キーポイント検出・処理・記述手段１４に与える構成となっている。 In Patent Document 1, a DoG filter is used for a multi-resolution pyramid (smoothed image group) or a differential image group is generated using a LoG filter for an input image. The DoG filter corresponds to Equation (5), and the LoG filter corresponds to Equation (6). On the other hand, in the first to third embodiments, a differential image group is generated for the smoothed image by the spatial differentiating means 13 shown in the equation (2) or the Laplacian filter means 13a shown in the equation (3), and the key points are generated. The detection / processing / description means 14 is provided.

次に、先行技術の問題点と本発明の実施例で得られる効果を説明するために、ＤｏＧフィルタとＬｏＧフィルタの関係について詳細に説明する。 Next, in order to explain the problems of the prior art and the effects obtained by the embodiment of the present invention, the relationship between the DoG filter and the LoG filter will be described in detail.

ＤｏＧフィルタやＬｏＧフィルタに用いられるガウス関数（式（１））は、式（７）に示す拡散方程式を満たす。 The Gaussian function (formula (1)) used for the DoG filter or LoG filter satisfies the diffusion equation shown in formula (7).

ここで、Ｇはガウス関数、σは平滑化度合い、（ｘ，ｙ）は２次元座標である。式（７）の左辺をσについて離散化すると式（８）の近似関係が得られる（kは任意の定数）。 Here, G is a Gaussian function, σ is a smoothing degree, and (x, y) is a two-dimensional coordinate. When the left side of Equation (7) is discretized with respect to σ, the approximate relationship of Equation (8) is obtained (k is an arbitrary constant).

ここで、式（８）の左辺は式（５）に相当するＤｏＧフィルタ、右辺の∇^２Ｇは式（６）に相当するＬｏＧフィルタであり、式（８）はＤｏＧフィルタがＬｏＧフィルタの近似になっていることを意味する。式（８）の両辺に対して入力画像Ｉとの畳み込み演算を行うことにより、式（９）の近似関係が得られる。 Here, the left side of Expression (8) is a DoG filter corresponding to Expression (5), ∇ ² G on the right side is a LoG filter corresponding to Expression (6), and Expression (8) is an approximation of the DoG filter as a LoG filter. Means that By performing a convolution operation with the input image I on both sides of Equation (8), the approximate relationship of Equation (9) is obtained.

ここで、式（９）の左辺の各項は平滑化画像、左辺および右辺は微分画像に相当する。 Here, each term on the left side of Equation (9) corresponds to a smoothed image, and the left and right sides correspond to a differential image.

非特許文献１の問題点は、微分画像群の生成に、ＬｏＧフィルタの近似であるＤｏＧフィルタ（式（９）の左辺または式（５））を用いているため、微分画像の画素値（微分係数）の精度がＬｏＧフィルタによる結果（式（９）の右辺）に比べて劣化することである。すなわち微分画像の極値（極大値または極小値）として検出されるキーポイント候補の精度、さらには抽出される特徴量の精度も劣化するという問題がある。 The problem of Non-Patent Document 1 is that the DoG filter (the left side of Equation (9) or Equation (5)) that is an approximation of the LoG filter is used to generate the differential image group. The accuracy of the coefficient is deteriorated as compared with the result (the right side of the equation (9)) by the LoG filter. That is, there is a problem that the accuracy of the key point candidate detected as the extreme value (maximum value or minimum value) of the differential image, and further, the accuracy of the extracted feature value deteriorates.

これに対して第１〜第３実施例では、ＬｏＧフィルタに相当する結果を用いるため、微分画像をより高精度に生成することができ、結果的に特徴量の抽出精度も向上する。 On the other hand, in the first to third embodiments, since the result corresponding to the LoG filter is used, the differential image can be generated with higher accuracy, and the extraction accuracy of the feature amount is improved as a result.

特許文献１の問題点は、平滑化画像からの微分画像の生成にＤｏＧフィルタ（式（９）の左辺または式（５））を用いれば先行技術１と同様の問題が生じ、入力からの微分画像の生成にＬｏＧフィルタ（式９の右辺または式６）を用いれば、平滑化画像の生成とは別に入力画像とＬｏＧフィルタとの畳み込み演算を行う必要があり、演算量が増大するということである。 The problem of Patent Document 1 is that if a DoG filter (the left side of Expression (9) or Expression (5)) is used to generate a differential image from a smoothed image, the same problem as in Prior Art 1 occurs, and the differential from the input If the LoG filter (the right side of Equation 9 or Equation 6) is used for image generation, it is necessary to perform a convolution operation between the input image and the LoG filter separately from the generation of the smoothed image, which increases the amount of calculation. is there.

これに対して第１〜第３実施例では、式（９）の右辺について、 On the other hand, in the first to third embodiments, the right side of Expression (9) is

の関係を見出すことにより、直接的にＬｏＧフィルタとの畳み込み演算を行わずに、平滑化画像どうしの加減算のみで微分画像を生成することができるため、演算量の増加を抑えられるという利点がある。 By finding the relationship, it is possible to generate a differential image only by adding and subtracting between smoothed images without directly performing a convolution operation with the LoG filter, so that an increase in the amount of calculation can be suppressed. .

すなわち、第１〜第３実施例は、非特許文献１よりも高精度で、かつ、特許文献１よりも演算量の少ない特徴量抽出方法を与えるものである。 That is, the first to third embodiments provide a feature amount extraction method that is more accurate than Non-Patent Document 1 and has a smaller amount of calculation than Patent Document 1.

[変形例]
以下、本発明による特徴量抽出装置の変形例について説明する。 [Modification]
Hereinafter, modifications of the feature quantity extraction device according to the present invention will be described.

（第１変形例）
図５は第１変形例の構成を示す。第１変形例では、画像入力手段１１と平滑化手段１２の間に、画像入力手段１１の出力を入力とするグレースケール変換手段１７を設け、グレースケール変換手段１７の出力を平滑化手段１２に与えることを特徴とする。グレースケール変換手段１７は、例えば入力画像がＲＧＢ色空間（３チャンネルの画像）で表現されている場合には、画素毎に式（１１）の演算を施すものでもよい。 (First modification)
FIG. 5 shows a configuration of the first modification. In the first modification, a gray scale conversion unit 17 that receives the output of the image input unit 11 as an input is provided between the image input unit 11 and the smoothing unit 12, and the output of the gray scale conversion unit 17 is supplied to the smoothing unit 12. It is characterized by giving. For example, when the input image is expressed in an RGB color space (3-channel image), the grayscale conversion unit 17 may perform the calculation of Expression (11) for each pixel.

ここで、ｉはグレースケール画像の画素値、（ｒ、ｇ、ｂ）はそれぞれ入力画像の画素値である。 Here, i is the pixel value of the grayscale image, and (r, g, b) are the pixel values of the input image.

第１変形例では、画像入力手段１１と平滑化手段１２の間に、グレースケール変換手段１７を設けている。グレースケール変換手段１７を用いることにより、複数チャンネル（カラー）からなる任意の色空間の入力画像について色成分を考慮した１チャンネルの階調値を持つ画像に変換することができ、複数チャンネルの入力画像を別々に処理する必要がないために処理量の増大を抑えることができる。 In the first modification, a gray scale conversion unit 17 is provided between the image input unit 11 and the smoothing unit 12. By using the gray scale conversion means 17, an input image in an arbitrary color space composed of a plurality of channels (colors) can be converted into an image having a gradation value of one channel in consideration of color components. Since it is not necessary to process images separately, an increase in processing amount can be suppressed.

（第２変形例）
図６は第２変形例の構成を示す。第２変形例では、画像入力手段１１と平滑化手段１２の間に、画像入力手段１１の出力を入力とする色空間変換手段３１と、色空間変換手段３の出力を入力とする色成分抽出手段３２と、色成分抽出手段３２の出力を入力とする色成分混合手段３３を設け、色成分混合手段３３の出力を平滑化手段１２に与えることを特徴とする。 (Second modification)
FIG. 6 shows the configuration of the second modification. In the second modification, between the image input means 11 and the smoothing means 12, a color space conversion means 31 that receives the output of the image input means 11 and a color component extraction that uses the output of the color space conversion means 3 as an input. Means 32 and a color component mixing means 33 that receives the output of the color component extraction means 32 are provided, and the output of the color component mixing means 33 is given to the smoothing means 12.

第２変形例では、画像入力手段１１と平滑化手段１２の間に、色空間変換手段３１、色成分抽出手段３２、および色成分混合手段３３を設けている。色空間変換手段３１により入力画像の色空間から任意の色空間（通常複数チャンネルをもつ）に変換し、色成分抽出手段３３によって任意の色成分を抽出できるようにすることで、その後の処理（平滑化手段１２からキーポイント検出・処理・記述手段１４）において、有効な特徴量抽出を行うのに適した色空間および色成分画像を選択することができ、さらに色成分混合手段３３によって第１変形例と同様の効果が得られる。 In the second modification, a color space conversion unit 31, a color component extraction unit 32, and a color component mixing unit 33 are provided between the image input unit 11 and the smoothing unit 12. The color space conversion unit 31 converts the color space of the input image into an arbitrary color space (usually having a plurality of channels), and the color component extraction unit 33 can extract an arbitrary color component. In the key point detection / processing / description means 14) from the smoothing means 12, a color space and color component image suitable for performing effective feature extraction can be selected. The same effect as the modified example can be obtained.

（第３変形例）
図７は第３変形例の構成を示す。第３変形例では、画像入力手段１１と平滑化手段１２の間に、画像入力手段の出力を入力とするＨＳＶ色空間変換手段４１と、ＨＳＶ色空間変換手段４１の出力を入力とするＶ成分抽出手段４２を設け、Ｖ成分抽出手段４２の出力を平滑化手段１２に与えることを特徴とする。例えばＲＧＢ色空間からＨＳＶ色空間への変換は、
http://opencv.jp/opencv-1.1.0/document/opencvref_cv_filters.html#decl_cvCvtColor
に記載の方法を用いてもよい。 (Third Modification)
FIG. 7 shows the configuration of the third modification. In the third modification, between the image input unit 11 and the smoothing unit 12, the HSV color space conversion unit 41 that receives the output of the image input unit and the V component that receives the output of the HSV color space conversion unit 41 are input. An extraction means 42 is provided, and the output of the V component extraction means 42 is given to the smoothing means 12. For example, conversion from RGB color space to HSV color space is
http://opencv.jp/opencv-1.1.0/document/opencvref_cv_filters.html#decl_cvCvtColor
May be used.

第３変形例では、画像入力手段１１と平滑化手段１２の間に、ＨＳＶ色空間変換手段４１およびＶ成分抽出手段４２を設けている。平滑化画像をＶ成分（明度）の画像から生成することにより、第１変形例と同様の効果が得られる。 In the third modification, an HSV color space conversion unit 41 and a V component extraction unit 42 are provided between the image input unit 11 and the smoothing unit 12. By generating the smoothed image from the image of the V component (lightness), the same effect as in the first modification can be obtained.

なお、図７の例では、ＨＳＶの成分のうちのＶ成分を用いているが、その代わりにＨ成分又はＳ成分を用いてもよい。また、色空間としては、ＨＳＶ色空間以外の色空間を利用してもよい。 In the example of FIG. 7, the V component of the HSV components is used, but an H component or an S component may be used instead. Further, a color space other than the HSV color space may be used as the color space.

（第４変形例）
図８は第４変形例の構成を示す。第４変形例では、画像入力手段１１と平滑化手段１２の間に、画像入力手段の出力を入力とするＨＳＶ色空間変換手段５１と、ＨＳＶ色空間変換手段５１の出力を入力とする（Ｓ，Ｖ）成分抽出手段５２と、（Ｓ，Ｖ）成分抽出手段５２の出力を入力とする（Ｓ，Ｖ）成分単純平均手段５３を設け、（Ｓ，Ｖ）単純平均手段５３の出力を平滑化手段１２に与えることを特徴とする。（Ｓ，Ｖ）成分単純平均手段５３は、例えば画素毎に式（１２）を施すものでもよい。 (Fourth modification)
FIG. 8 shows the configuration of the fourth modification. In the fourth modified example, between the image input unit 11 and the smoothing unit 12, the output of the HSV color space conversion unit 51 that receives the output of the image input unit and the output of the HSV color space conversion unit 51 are input (S , V) component extraction means 52 and (S, V) component simple average means 53 which receives the output of (S, V) component extraction means 52 as inputs, and smoothes the output of (S, V) simple average means 53. It is characterized by being provided to the conversion means 12. For example, the (S, V) component simple average means 53 may apply the equation (12) for each pixel.

ここで、ｉは単純平均された画像の画素値、ｓおよびｖはそれぞれＨＳＶ空間における画素のｓ成分およびｖ成分の画素値である。 Here, i is a pixel value of a simple averaged image, and s and v are pixel values of the s component and v component of the pixel in the HSV space, respectively.

第４変形例では、画像入力手段１１と平滑化手段１２の間に、ＨＳＶ色空間変換手段５１、（Ｓ，Ｖ）成分抽出手段５２、および（Ｓ，Ｖ）成分単純平均手段５３を設けている。第２変形例における色空間変換手段３２を入力画像の色空間からＨＳＶ色空間へ変換する処理とし、色成分抽出手段３２を（Ｈ，Ｓ，Ｖ）成分から（Ｓ，Ｖ）成分画像を選択する処理とし、色成分混合手段３３を式（１２）によりＳ成分画像とＶ成分画像の単純平均を行う処理に置き換えることで、第２変形例と同様の効果が得られる。 In the fourth modification, an HSV color space conversion unit 51, an (S, V) component extraction unit 52, and an (S, V) component simple average unit 53 are provided between the image input unit 11 and the smoothing unit 12. Yes. The color space conversion means 32 in the second modification is a process for converting from the color space of the input image to the HSV color space, and the color component extraction means 32 selects the (S, V) component image from the (H, S, V) component. By replacing the color component mixing unit 33 with a process for performing a simple average of the S component image and the V component image using the equation (12), the same effect as in the second modification can be obtained.

（第５変形例）
図９は、第５変形例の構成を示す。第５変形例では、画像入力手段１１と平滑化手段１２の間に、画像入力手段の出力を入力とするＨＳＶ色空間変換手段５１と、ＨＳＶ色空間変換手段５１の出力を入力とする（Ｓ，Ｖ）成分抽出手段５２と、（Ｓ，Ｖ）成分抽出手段５２の出力を入力とする（Ｓ，Ｖ）成分２乗平均手段５４を設け、（Ｓ，Ｖ）成分２乗平均手段５４の出力を平滑化手段１２に与えることを特徴とする。（Ｓ，Ｖ）成分２乗平均手段５４は、例えば画素毎に式（１３）を施すものでもよい。なお、式（１３）において、ｉ，ｓ，ｖは式（１２）と同様である。 (5th modification)
FIG. 9 shows the configuration of the fifth modification. In the fifth modification, between the image input means 11 and the smoothing means 12, the output of the HSV color space conversion means 51 that receives the output of the image input means and the output of the HSV color space conversion means 51 are input (S , V) component extraction means 52 and (S, V) component mean square means 54, which receives the output of the (S, V) component extraction means 52 as inputs, are provided. The output is supplied to the smoothing means 12. The (S, V) component mean square means 54 may, for example, apply the equation (13) for each pixel. In equation (13), i, s, and v are the same as in equation (12).

第５変形例では、画像入力手段１１と平滑化手段１２の間に、ＨＳＶ色空間変換手段５１、（Ｓ，Ｖ）成分抽出手段５２、および（Ｓ，Ｖ）成分２乗平均手段５４を設けている。第４変形例における（Ｓ，Ｖ）成分単純平均手段５３を、式（１３）によりＳ成分とＶ成分の２乗平均を行う処理とすることで、第４変形例と同様の効果が得られる。 In the fifth modification, an HSV color space conversion unit 51, an (S, V) component extraction unit 52, and an (S, V) component mean square unit 54 are provided between the image input unit 11 and the smoothing unit 12. ing. The (S, V) component simple average means 53 in the fourth modified example is a process for performing the root mean square of the S component and the V component according to the equation (13), thereby obtaining the same effect as in the fourth modified example. .

本発明は、特徴量を用いた画像認識技術、画像監視技術、および画像編集技術とこれらの技術を組み込んだソフトウェアやハードウェア、たとえばコンピュータシステムや車載機器、オーディオビジュアル機器、ゲーム機器、およびディジタルサイネージ機器などに利用可能である。 The present invention relates to an image recognition technique, an image monitoring technique, an image editing technique, and software and hardware incorporating these techniques, such as a computer system, an in-vehicle device, an audio-visual device, a game device, and a digital signage. It can be used for equipment.

１１画像入力手段
１２平滑化手段
１３空間微分手段
１３ａラプラシアンフィルタ手段
１４キーポイント検出・処理・記述手段
１５特徴量出力手段 DESCRIPTION OF SYMBOLS 11 Image input means 12 Smoothing means 13 Spatial differentiation means 13a Laplacian filter means 14 Keypoint detection / processing / description means 15 Feature quantity output means

Claims

Smoothing means for smoothing an input image to generate a smoothed image group;
Differential means for spatially differentiating the smoothed image group to generate a differential image group;
Feature quantity extraction means for extracting and outputting the feature quantity of the input image based on the differential image group;
A feature quantity extraction device comprising:

The feature amount extraction apparatus according to claim 1, wherein the differentiating unit is a Laplacian filter unit.

The feature amount extraction apparatus according to claim 1, further comprising an amplifying unit that amplifies the pixel value of the differential image output from the differentiating unit and inputs the amplified pixel value to the feature amount extracting unit.

4. The feature quantity extraction device according to claim 1, further comprising a gray scale conversion unit that converts the input image into a gray scale and inputs the converted image to the smoothing unit. 5.

The color space conversion means for converting the color space of the input image and inputting the input image after the color space conversion to the smoothing means, The characteristic according to any one of claims 1 to 3, Quantity extraction device.

Color space conversion means for converting the color space of the input image;
Color component extraction means for extracting one color component from the input image after color space conversion and inputting it to the smoothing means;
The feature quantity extraction device according to any one of claims 1 to 3, further comprising:

Color space conversion means for converting the color space of the input image;
Color component extraction means for extracting a plurality of color components from the input image after color space conversion;
Arithmetic means for performing an arbitrary calculation on the plurality of extracted color components and inputting to the smoothing means;
The feature quantity extraction device according to any one of claims 1 to 3, further comprising:

A feature amount extraction method executed by a feature amount extraction apparatus,
A smoothing step of smoothing the input image to generate a smoothed image group;
A differentiation step of spatially differentiating the smoothed image group to generate a differential image group;
A feature amount extraction step of extracting and outputting a feature amount of the input image based on the differential image group;
A feature quantity extraction method characterized by comprising:

A feature extraction program executed by a computer,
Smoothing means for smoothing an input image to generate a smoothed image group;
Differentiating means for generating a differential image group by spatially differentiating the smoothed image group,
A feature quantity extraction program for causing the computer to function as feature quantity extraction means for extracting and outputting a feature quantity of the input image based on the differential image group.

A storage medium storing the feature quantity extraction program according to claim 9.