JPH06348901A

JPH06348901A - Multiple resolution image feature extracting device

Info

Publication number: JPH06348901A
Application number: JP5136357A
Authority: JP
Inventors: Takatsugu Yamada; 敬嗣山田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1993-06-08
Filing date: 1993-06-08
Publication date: 1994-12-22

Abstract

PURPOSE:To provide the feature extracting device of multiple resolution which employs the convolution of templates capable of absorbing a position shift without causing an increase in device scale. CONSTITUTION:An image feature extracting device with 1st multiple resolution generates an enlarged or reduced image by an image formation part 101 according to a scale signal which is generated by a scale control part 106 and corresponds to an enlargement/-reduction rate. A template convolution part 102 convolutes plural templates for the image and an addition part 103 adds a generated convolution signal to generate an addition image. Further, a re- sampling part 104 performs a thinning-out process according to the scale signal and outputs image features. An image feature extracting device with 2nd multiple resolution after enlarging or reducing the templates by a template convolution part 102 according to a scale signal convolutes the output image of an image formation part 101.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、画像特徴抽出装置に関
し、特に多重解像度の画像特徴抽出装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image feature extraction device, and more particularly to a multi-resolution image feature extraction device.

【０００２】[0002]

【従来の技術】従来、この種の画像特徴抽出装置は、画
像認識、特に文字認識において、認識の鍵となる図形特
徴を抽出して、手ぶれや、雑音重畳による図形歪を吸収
して、変動に影響されない特徴を抽出して、その特徴デ
ータを認識部に引き渡すことを目的として用いられてい
る。例えば、「１９７８年、デジタル画像処理、近代科
学社、４４３ページ〜４４５ページ」には、文字のスト
ロークなどをテンプレートとして、それを畳み込み演算
により、対象画像に対して畳込みを行うことにより、対
象画像中のテンプレートの形状と類似する部分を検出す
る技術が記載されている。2. Description of the Related Art Conventionally, this type of image feature extraction apparatus extracts a graphic feature which is a key to recognition in image recognition, particularly character recognition, absorbs a hand shake or a graphic distortion due to noise superposition, and changes It is used for the purpose of extracting the features that are not affected by and passing the feature data to the recognition unit. For example, in "1978, Digital Image Processing, Modern Science Co., Ltd., pages 443 to 445", a character stroke or the like is used as a template, and by performing a convolution operation on the target image, the target image is convolved. A technique for detecting a portion similar to the shape of the template in the image is described.

【０００３】図１０は、畳込みを用いた従来の画像特徴
抽出装置の動作の一例を示す図である。入力された画像
１００１に対して、テンプレート１００２を畳み込み部
１００３において、数１に従った計算を実行する。FIG. 10 is a diagram showing an example of the operation of a conventional image feature extraction apparatus using convolution. The convolution unit 1003 of the template 1002 performs the calculation according to the equation 1 on the input image 1001.

【０００４】[0004]

【数１】 [Equation 1]

【０００５】ここで、入力された画像１００１のサイズ
をＮ×Ｎ画素とし、座標（ｘ，ｙ）での値をｈ（ｘ，
ｙ）とし、テンプレート１００２の座標（ａ，ｂ）での
値をｆ（ａ，ｂ）とし、畳み込み後の画像１００３の座
標（ｘ，ｙ）での値をＨ（ｆ）とする。これにより、Ｈ
（ｆ）の値が、入力画像１００１中の座標（ｘ，ｙ）に
テンプレート１００２が存在する程度が求められる。こ
の値が大きいほどテンプレート１００２に似た局所パタ
ーンが存在することを示す。入力画像１００１の全ての
座標（ｘ，ｙ）に対して同様の処理を行い、入力画像の
全ての場所でのテンプレート１００２の存在の程度を検
出する。Here, the size of the input image 1001 is N × N pixels, and the value at coordinates (x, y) is h (x,
y), the value at the coordinates (a, b) of the template 1002 is f (a, b), and the value at the coordinates (x, y) of the image 1003 after convolution is H (f). This makes H
The value of (f) is obtained to the extent that the template 1002 exists at the coordinates (x, y) in the input image 1001. The larger this value is, the more local pattern similar to the template 1002 exists. The same process is performed on all the coordinates (x, y) of the input image 1001 to detect the degree of existence of the template 1002 at all the positions of the input image.

【０００６】[0006]

【発明が解決しようとする課題】この従来の画像特徴抽
出装置では、局所的な特徴をテンプレートで表現し、入
力画像中での局所特徴の検出のために、畳み込み演算を
実行している。少し位置のずれた局所特徴の検出は、全
ての位置ズレの可能性のある入力画像中の座標（ｘ，
ｙ）で畳込みを実行することにより求めているため、実
現するための計算量や記憶容量が増大して、実現が困難
になる。局所特徴が拡大したり、縮小している場合にも
この従来の画像特徴抽出装置では検出できず、検出した
い局所特徴を拡大したり、縮小したパターンをテンプレ
ートとして、多数種類備えておく必要がある。In this conventional image feature extraction apparatus, a local feature is represented by a template, and a convolution operation is executed to detect the local feature in the input image. The detection of local features that are slightly misaligned can be performed by using the coordinates (x,
Since it is obtained by executing the convolution in y), the amount of calculation and the storage capacity for the realization increase, and the realization becomes difficult. Even if the local feature is expanded or reduced, this conventional image feature extraction device cannot detect it, and it is necessary to prepare a large number of types of the expanded or reduced local feature to be detected as a template. .

【０００７】本発明の目的は、装置規模の増大を招くこ
となく、位置ズレ吸収可能なテンプレートの畳み込みに
よる多重解像度の特徴抽出を可能にする多重解像度画像
特徴抽出装置を提供することにある。It is an object of the present invention to provide a multi-resolution image feature extraction apparatus capable of performing multi-resolution feature extraction by convoluting a template capable of absorbing positional deviation without increasing the scale of the apparatus.

【０００８】[0008]

【課題を解決するための手段】第１の発明は、画像の局
所特徴をテンプレートとして備え、入力画像にそのテン
プレートを畳み込むことによって画像の特徴を抽出する
画像特徴抽出装置において、拡大縮小比率に対応するス
ケール信号を生成するスケール制御部と、前記スケール
制御部からのスケール信号に従って像を生成する像形成
部と、複数のテンプレートを畳み込むテンプレート畳み
込み部と、前記テンプレート畳み込み部において生成さ
れた畳み込み信号を加算し記憶する加算部と、前記加算
部から出力された畳み込み信号を前記スケール制御部か
らのスケール信号に従って間引きして特徴を出力する再
サンプリング部と、を備えることを特徴とする。According to a first aspect of the present invention, there is provided an image feature extraction apparatus which includes a local feature of an image as a template and which convolves the template with an input image to extract the feature of the image, and which corresponds to a scaling ratio. A scale control unit that generates a scale signal, an image forming unit that generates an image according to the scale signal from the scale control unit, a template convolution unit that convolves a plurality of templates, and a convolution signal generated in the template convolution unit. It is characterized by comprising an adding section for adding and storing, and a resampling section for thinning out the convolutional signal output from the adding section according to the scale signal from the scale control section and outputting the characteristic.

【０００９】第２の発明は、第１の発明において、前記
テンプレート畳み込み部が、複数のテンプレートを前記
スケール制御部からのスケール信号に従って拡大縮小し
た後に畳み込むことを特徴とする。A second invention is characterized in that, in the first invention, the template convolution unit convolves a plurality of templates after enlarging / reducing a plurality of templates according to a scale signal from the scale control unit.

【００１０】[0010]

【実施例】次に、本発明について図面を参照して説明す
る。図１は、第１の発明の一実施例を示すブロック図で
ある。図１を参照すると、本実施例は、画像から抽出す
る特徴の大きさを制御するスケール制御部１０６と、像
形成部１０１、テンプレート畳み込み部１０２、加算部
１０３、再サンプリング部１０４、出力端子１０５から
構成される。スケール制御部１０６は、抽出する局所特
徴の大きさに従ってスケール信号を生成する。例えば、
テンプレート畳み込み部１０２で用いる局所特徴のテン
プレートの大きさに対して、２^1/2倍，２倍，２×２
^1/2倍，２^-1/2倍、１／２倍の６種類の大きさの局所特
徴の存否を特徴として抽出する場合に、その倍率を示す
スケール信号ｓを像形成部１０１と再サンプリング部１
０４に送出する。像形成部１０１の動作を図３を用いて
説明する。なお図３は、像形成部における画像の拡大縮
小部分の説明図である。特徴抽出の対象となる画像を、
端子３０１から入力して像記憶部３０２に記憶する。ス
ケール制御部１０６から送出されたスケール信号ｓは端
子３０５から入力され拡大縮小部３０４が起動される。
スケール信号ｓは、画像をｓ倍に拡大することを示す。
像記憶部３０２に記憶された入力画像の（ｘ，ｙ）座標
での画像の濃淡値をｈ（ｘ，ｙ）で表現した時に、ｓ倍
に拡大した画像の濃淡値ｈ′（ｘ′，ｙ′）は、数２で
示される式に従って計算される。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the first invention. Referring to FIG. 1, in the present exemplary embodiment, a scale control unit 106 that controls the size of a feature extracted from an image, an image forming unit 101, a template convolution unit 102, an addition unit 103, a resampling unit 104, and an output terminal 105. Composed of. The scale control unit 106 generates a scale signal according to the size of the extracted local feature. For example,
2 ^1/2 times, 2 times, 2 × 2 times the size of the template of the local feature used in the template convolution unit 102.
^When the presence / absence of six types of local features of ^1/2 times, 2 ^−1/2 times, and ^1/2 times is extracted as a feature, a scale signal s indicating the magnification is resampled with the image forming unit 101. Part 1
Send to 04. The operation of the image forming unit 101 will be described with reference to FIG. Note that FIG. 3 is an explanatory diagram of an image enlargement / reduction portion in the image forming unit. The target image for feature extraction is
It is input from the terminal 301 and stored in the image storage unit 302. The scale signal s sent from the scale control unit 106 is input from the terminal 305 and the scaling unit 304 is activated.
The scale signal s indicates that the image is magnified s times.
When the grayscale value of the image at the (x, y) coordinates of the input image stored in the image storage unit 302 is represented by h (x, y), the grayscale value h ′ (x ′, y ′) is calculated according to the equation shown in Equation 2.

【数２】ｈ′（ｘ′，ｙ′）＝（ｙ′／ｓ−Ｆ（ｙ′／ｓ））（（ｘ′／ｓ−Ｆ（ｘ′／ｓ））ｈ（Ｃ（ｘ′／ｓ），Ｃ（ｙ′／ｓ））＋（Ｃ（ｘ′／ｓ）−ｘ′／ｓ）ｈ（Ｆ（ｘ′／ｓ），Ｃ（ｙ′／ｓ）））＋（Ｃ（ｙ′／ｓ）−ｙ′／ｓ）（（ｘ′／ｓ−Ｆ（ｘ′／ｓ））ｈ（Ｃ（ｘ′／ｓ），Ｆ（ｙ′／ｓ））＋（Ｃ（ｘ′／ｓ）−ｘ′／ｓ）ｈ（Ｆ（ｘ′／ｓ），Ｆ（ｙ′／ｓ）））ここで、Ｃ（Ｒ）は、実数値Ｒ以上で最小の整数値を表
し、Ｆ（Ｒ）は、実数値Ｒより小さい中で最大の整数値
を表す。また、元の画像の大きさを６４画素×６４画素
とすると、ｘ，ｙの値は１から６４の間の整数値をと
る。また、ｓ倍された画像ｈ′の座標値ｘ′，ｙ′は１
から６４ｓの間の整数値をとる。以上のようにｓ倍に拡
大された画像ｈ′（ｘ′，ｙ′）は、像記憶部３０２に
記憶され、端子３０３からテンプレート畳み込み部１０
２に送出される。## EQU00002 ## h '(x', y ') = (y' / s-F (y '/ s)) ((x' / s-F (x '/ s)) h (C (x' / s), C (y '/ s)) + (C (x' / s) -x '/ s) h (F (x' / s), C (y '/ s))) + (C (y '/ S) -y' / s) ((x '/ s-F (x' / s)) h (C (x '/ s), F (y' / s)) + (C (x '/ s) -x '/ s) h (F (x' / s), F (y '/ s))) where C (R) represents the smallest integer value equal to or greater than the real value R, and F ( R) represents the largest integer value smaller than the real value R. Further, if the size of the original image is 64 pixels × 64 pixels, the values of x and y take integer values between 1 and 64. In addition, the coordinate values x ′ and y ′ of the image h ′ multiplied by s are 1
To 64 s. The image h ′ (x ′, y ′) magnified s times as described above is stored in the image storage unit 302, and is input from the terminal 303 to the template convolution unit 10.
2 is sent.

【００１１】次に図１のテンプレート畳み込み部１０２
について説明する。テンプレート畳み込み部１０２は、
予め生成した画像の局所特徴を表すテンプレートを複数
持つとともに、１つの局所特徴を表すテンプレートｆに
対して、その局所特徴が少々の位置ズレを起こした場合
に相当するテンプレートｆ′を合わせて持つ。そのテン
プレートの生成方法の例を説明する。例えば、第ｉ番目
の局所特徴に相当するテンプレートｆ（ｉ，ａ，ｂ）を
数３で表す。Next, the template convolution unit 102 shown in FIG.
Will be described. The template convolution unit 102 is
In addition to having a plurality of templates representing the local features of the image generated in advance, a template f'corresponding to the case where the local features cause a slight positional deviation is also provided for the template f representing one local feature. An example of the template generation method will be described. For example, the template f (i, a, b) corresponding to the i-th local feature is represented by Equation 3.

【数３】ｆ（ｉ，ａ，ｂ）＝（１／ｎ）ｃｏｓ（（２π／Ｌ）（Ａ（ｉ）ａ＋Ｂ（ｉ）ｂ）） ×ｅｘｐ（−２（ａ²＋ｂ²）／Ｌ²）ここで、ｉは０から７までの整数値を表し、ｎは１、Ｌ
の値は８とするが、ｎ，Ｌはこれ以外の値をとっても問
題はない。これにより、πｉ／８ラジアン分回転してお
り、Ｌ画素おきに平行に存在する線分からなる局所特徴
を表すテンプレートを構成する。また、Ａ（ｉ），Ｂ
（ｉ）は第ｉ番目の局所特徴に依存する値をとり、下の
数４，数５で与えることができるが、これは本質的な問
題ではなく、他の値もとり得る。F (i, a, b) = (1 / n) cos ((2π / L) (A (i) a + B (i) b)) × exp (−2 (a ² + b ² ) / L ² ) Here, i represents an integer value from 0 to 7, and n is 1, L
Although the value of is set to 8, there is no problem even if n and L take other values. As a result, the template is rotated by πi / 8 radians and forms a template representing a local feature that is composed of line segments that exist in parallel at every L pixel. Also, A (i), B
(I) takes a value depending on the i-th local feature, and can be given by the following equations 4 and 5, but this is not an essential problem, and other values can be employed.

【数４】Ａ（ｉ）＝ｃｏｓ（πｉ／８）## EQU00004 ## A (i) = cos (.pi.i / 8)

【数５】Ｂ（ｉ）＝ｓｉｎ（πｉ／８）この場合に、ｉ
は０から７までの整数値とすると、８種類の局所特徴を
表すテンプレートを用いることになる。数３で定められ
たテンプレートを用いる場合に、そのテンプレートを少
し位置ずれさせたテンプレートｆ′（ｉ，ａ，ｂ）は数
６で与える。## EQU5 ## B (i) = sin (πi / 8) In this case, i
Is an integer value from 0 to 7, a template representing eight types of local features will be used. When the template defined by the equation 3 is used, the template f ′ (i, a, b) obtained by slightly shifting the template is given by the equation 6.

【数６】ｆ′（ｉ，ａ，ｂ）＝（１／ｎ）ｓｉｎ（（２π／Ｌ）（Ａ（ｉ）ａ＋Ｂ（ｉ）ｂ）） ×ｅｘｐ（−２（ａ²＋ｂ²）／Ｌ²）これにより定められたテンプレートｆ′は、元のテンプ
レートｆに対して約Ｌ／４画素、つまり約２画素分の位
置ズレを起こしたものとなっている。以上により、８種
類の方向からなる平行線分を表すテンプレートとそれが
少し位置ズレした８種類のテンプレートが生成できる。
このように生成されたテンプレートは予めテンプレート
畳み込み部１０２に記憶されており、テンプレート畳み
込み部１０２では数３，数６で示したｉ番目の２つのテ
ンプレートを数１と同様の畳み込み演算を実行すること
により、２種類の結果画像Ｈ（ｉ，ｆ）（ｘ，ｙ）とＨ
（ｉ，ｆ′）（ｘ，ｙ）を生成し、加算部１０３に送出
する。F ′ (i, a, b) = (1 / n) sin ((2π / L) (A (i) a + B (i) b)) × exp (−2 (a ² + b ² ) / L ² ) The template f ′ defined in this way has a position shift of about L / 4 pixels, that is, about 2 pixels, from the original template f. As described above, it is possible to generate a template representing a parallel line segment composed of eight types of directions and eight types of templates with a slight positional deviation.
The template generated in this way is stored in advance in the template convolution unit 102, and the template convolution unit 102 performs the same convolution operation on the i-th two templates shown in Formulas 3 and 6 as in Formula 1. Two types of result images H (i, f) (x, y) and H
(I, f ') (x, y) is generated and sent to the adder 103.

【００１２】加算部１０３では、受け取った（ｘ，ｙ）
座標の２種類の画像データＨ（ｉ，ｆ）とＨ（ｉ，
ｆ′）から、数７に従って加算を行う。The adder 103 receives (x, y)
Two types of coordinate image data H (i, f) and H (i, f)
From f '), addition is performed according to the equation 7.

【数７】Ｇ（ｉ，ｆ）＝（Ｈ（ｉ，ｆ）²＋Ｈ（ｉ，ｆ′）²）^1/2 この計算を実行して得られた（ｘ，ｙ）座標での値Ｇ
（ｉ，ｆ）（ｘ，ｙ）を、再サンプリング部１０４に転
送する。ここでは、位置ズレテンプレート数を２つにし
て、説明したが、３以上の場合も同様である。## EQU7 ## G (i, f) = (H (i, f) ² + H (i, f ') ² ) ^1/2 The value G at the (x, y) coordinates obtained by executing this calculation
(I, f) (x, y) is transferred to the resampling unit 104. Here, the description has been given assuming that the number of positional deviation templates is two, but the same applies to the case of three or more.

【００１３】再サンプリング部１０４では、ｘ方向，ｙ
方向ともにｋ画素毎にサンプリングを行い、サンプリン
グされた画素のＧ（ｉ，ｆ）については端子１０５から
特徴として送出し、それ以外の画素の値Ｇ（ｉ，ｆ）は
消去する。ここでパラメータｋは便宜上１とするが、そ
れ以外の値でも本質的な問題ではない。スケール信号ｓ
を順次変更することにより、同一画像から異なる解像度
に相当する同一特徴を抽出することができる。In the resampling unit 104, x direction, y
Sampling is performed every k pixels in both directions, and G (i, f) of the sampled pixels is sent out as a feature from the terminal 105, and the values G (i, f) of the other pixels are erased. Here, the parameter k is set to 1 for convenience, but other values are not an essential problem. Scale signal s
It is possible to extract the same feature corresponding to different resolutions from the same image by sequentially changing.

【００１４】図４，図５は、第１の発明の他の実施例で
ある。特徴抽出の対象となる入力画像ｈ（ｘ，ｙ）は、
入力端子４１０から入力され、スケール制御部１０６か
ら転送されたスケール信号ｓに従って、２次元レーザア
レイ４０１の各画素のスイッチングを行うものとする。
２次元レーザアレイの各画素には図５に示すレーザコン
ポーネントを配列する。（ｘ，ｙ）座標に配置されたレ
ーザコンポーネントの動作を図５を用いて３つのスケー
ルで特徴抽出する場合を説明する。ここでは、３スケー
ルの場合を説明するが、スケール数は３には限らない。
画素値入力端子５０１からは、入力画像の（ｘ，ｙ）座
標の値ｆ（ｘ，ｙ）が入力される。スケール制御部１０
６から転送されたスケール信号ｓは端子５１２から入力
されてセレクタ５０６を起動し、第ｓ番目のスケールに
対応する信号線を選択し、３つのアンプ５０２の内の１
つを駆動する。それにより、異なる周波数のコヒーレン
ト光を照射する３つのレーザ素子５０３，５０４，５０
５の内の１つからレーザ光が照射される。２次元レーザ
アレイ４０１のｆ（ｘ，ｙ）が定められたしきい値より
も大きい値を持つ座標（ｘ，ｙ）にあるレーザコンポー
ネント全てから、選択された１つのコヒーレント光が照
射される。ここで選択されたコヒーレント光の波長をＰ
とする。２次元レーザアレイ４０１から照射された光
は、ビームスプリッタ４０２で２方向の光に分けられ
る。１つはレンズ４０４を透過して空間変調器４０５に
至る。もう１つはミラー４０３で反射して、レンズ４０
４を透過して空間変調器４０６に至る。空間変調器４０
５は、数３で定めたテンプレートをフーリエ変換して得
られる画像を濃淡にして透明フィルムに焼き付けること
により実現できる。また、空間変調器４０６は、同様に
数６で定めたテンプレートをフーリエ変換して得られる
画像を濃淡にして透明フィルムに焼き付けることにより
実現できる。２次元レーザアレイ４０１からレンズ４０
４に至る光路長と、レンズ４０４から空間変調器４０
５、空間変調器４０６に至る光路長を、レンズ４０４の
焦点距離に一致させることにより空間変調器４０５と空
間変調器４０６の面上に、入射コヒーレント光の波長Ｐ
に反比例した大きさで端子４１０から入力された画像の
フーリエ変換像に相当する画像が投影される。そこで投
影光が空間変調器４０５を透過することにより、入力画
像に対して数３で与えたテンプレートを数１に従って畳
み込み演算を実行したものをフーリエ変換したものと同
等の結果が得られる。同様に空間変調器４０６を透過し
た光は、数６で定めたテンプレートを数１に従って畳み
込み演算を実行したものをフーリエ変換したものと同等
のものを表す。さらに、空間変調器４０５，４０６を透
過した光はレンズ４０９を透過して、プリズム４０７を
透過し、２次元光電変換素子アレイ４０８に至る。ここ
でも、空間変調器４０５，４０６からレンズ４０９に至
る光路長と、レンズ４０９から光電変換素子４０８に至
る光路長をレンズ４０９の焦点距離に一致させる。これ
により光電変換素子アレイ４０８上には空間変調器４０
５，４０６を透過した画像の逆フーリエ変換像が重ね合
わせて投影され、入力画像に数３と数６で定めた２つの
テンプレートを畳み込んで、数７に従ってその２つの畳
み込み画像を加算した結果のＧ（ｉ，ｆ）²（ｘ′，
ｙ′）が得られる。この結果を再サンプリング部１０４
に転送し、スケール制御部１０６から転送されたスケー
ル信号に従って、ｋ画素おきに間引き、そのデータを端
子１０５から画像特徴として抽出する。ここでは、ｋを
波長Ｐに反比例した値に定めたが、これは本質的な問題
ではない。スケール信号ｓを順次変更することにより、
同一画像から異なる解像度に相当する同一特徴を抽出す
ることができる。4 and 5 show another embodiment of the first invention. The input image h (x, y) that is the target of feature extraction is
It is assumed that each pixel of the two-dimensional laser array 401 is switched according to the scale signal s input from the input terminal 410 and transferred from the scale control unit 106.
The laser component shown in FIG. 5 is arranged in each pixel of the two-dimensional laser array. A case where the operation of the laser component arranged at the (x, y) coordinates is feature-extracted on three scales will be described with reference to FIG. Here, a case of 3 scales will be described, but the number of scales is not limited to 3.
From the pixel value input terminal 501, the value f (x, y) of the (x, y) coordinates of the input image is input. Scale control unit 10
The scale signal s transferred from No. 6 is input from the terminal 512, activates the selector 506, selects the signal line corresponding to the sth scale, and selects one of the three amplifiers 502.
Drive one. As a result, the three laser elements 503, 504, 50 that emit coherent light beams of different frequencies
Laser light is emitted from one of the five. The selected one coherent light is emitted from all the laser components at the coordinates (x, y) where f (x, y) of the two-dimensional laser array 401 has a value larger than a predetermined threshold value. Let P be the wavelength of the coherent light selected here.
And The light emitted from the two-dimensional laser array 401 is split into light in two directions by the beam splitter 402. One passes through the lens 404 and reaches the spatial modulator 405. The other is reflected by the mirror 403 and the lens 40
4 to reach the spatial modulator 406. Spatial modulator 40
5 can be realized by making an image obtained by Fourier transforming the template defined by the expression 3 dark and light and printing it on a transparent film. Further, the spatial modulator 406 can be realized by similarly printing the image obtained by Fourier transforming the template defined by the equation 6 on a transparent film by making the image dark and light. Two-dimensional laser array 401 to lens 40
4, the optical path length from the lens 404 to the spatial modulator 40
5. By matching the optical path length to the spatial modulator 406 with the focal length of the lens 404, the wavelength P of the incident coherent light is projected on the surfaces of the spatial modulator 405 and the spatial modulator 406.
An image corresponding to the Fourier transform image of the image input from the terminal 410 is projected in a size inversely proportional to. Therefore, the projection light is transmitted through the spatial modulator 405, and the result equivalent to the result obtained by performing the Fourier transform on the template obtained by performing the convolution operation on the input image according to the equation 1 is obtained. Similarly, the light transmitted through the spatial modulator 406 represents light equivalent to the light obtained by performing the Fourier transform on the template defined in Expression 6 and performing the convolution operation according to Expression 1. Further, the light transmitted through the spatial modulators 405 and 406 passes through the lens 409, the prism 407, and reaches the two-dimensional photoelectric conversion element array 408. Also in this case, the optical path length from the spatial modulators 405 and 406 to the lens 409 and the optical path length from the lens 409 to the photoelectric conversion element 408 are matched with the focal length of the lens 409. As a result, the spatial modulator 40 is provided on the photoelectric conversion element array 408.
The result obtained by superimposing the inverse Fourier transform images of the images transmitted through 5, 406, convolving the two templates defined by equations 3 and 6 on the input image, and adding the two convolution images according to equation 7. G (i, f) ² (x ′,
y ') is obtained. The result is resampled by the resampling unit 104.
, And thinning out every k pixels in accordance with the scale signal transferred from the scale control unit 106, and extracting the data from the terminal 105 as an image feature. Here, k is set to a value inversely proportional to the wavelength P, but this is not an essential problem. By changing the scale signal s sequentially,
The same feature corresponding to different resolutions can be extracted from the same image.

【００１５】図６を用いて第１の発明のさらに他の実施
例を説明する。スケール制御部１０６から転送されたス
ケール信号は、レーザ照射部６０１に転送される。レー
ザ照射部６０１は、例えば図５に示されるようなレーザ
コンポーネントを１つ以上並置したもので、スケール信
号ｓに従って波長Ｐのコヒーレント光を対象画像６０２
に照射する。反射光はレンズ４０４、空間変調器４０
５，４０６、レンズ４０９、プリズム４０７を透過し
て、光電変換素子アレイ４０８に照射される。この光電
変換素子アレイ上には、図４に示した実施例と同様の動
作により、入力画像に数３と数６で定めた２つのテンプ
レートを畳み込んで、数７に従ってその２つの畳み込み
画像を加算した結果のＧ（ｉ，ｆ）²（ｘ′，ｙ′）が
得られる。この結果を再サンプリング部１０４に転送
し、スケール制御部１０６から転送されたスケール信号
に従って、ｋ画素おきに間引き、そのデータを端子１０
５から画像特徴として送出する。ここでは、ｋを波長Ｐ
に反比例した値に定めたが、これは本質的な問題ではな
い。スケール信号ｓを順次変更することにより、同一画
像から異なる解像度に相当する同一特徴を抽出すること
ができる。Still another embodiment of the first invention will be described with reference to FIG. The scale signal transferred from the scale control unit 106 is transferred to the laser irradiation unit 601. The laser irradiation unit 601 is one in which one or more laser components as shown in FIG. 5, for example, are arranged side by side, and the coherent light of the wavelength P according to the scale signal s is used as the target image 602.
To irradiate. The reflected light is reflected by the lens 404 and the spatial modulator 40.
5, 406, the lens 409, and the prism 407, and the photoelectric conversion element array 408 is irradiated. On this photoelectric conversion element array, by the same operation as that of the embodiment shown in FIG. 4, the two templates defined by the formulas 3 and 6 are convoluted to the input image, and the two convoluted images are calculated according to the formula 7. As a result of addition, G (i, f) ² (x ′, y ′) is obtained. The result is transferred to the resampling unit 104, thinned out every k pixels in accordance with the scale signal transferred from the scale control unit 106, and the data is output to the terminal 10.
5 is sent as an image feature. Here, k is the wavelength P
The value is inversely proportional to, but this is not an essential problem. By sequentially changing the scale signal s, the same feature corresponding to different resolutions can be extracted from the same image.

【００１６】次に図２と図７を用いて第２の発明の一実
施例を説明する。図２に示す多重解像度画像特徴抽出装
置の実施例は、像形成部１０１、テンプレートを拡大縮
小可能なテンプレート畳み込み部２０１、加算部１０
３、再サンプリング部１０４、スケール制御部１０６、
出力端子１０５からなる。像形成部１０１、スケール制
御部１０６、加算部１０３は図１の実施例と同一のた
め、テンプレート畳み込み部２０１を図７を用いて説明
する。数３または数６に従った計算により生成されたテ
ンプレートｆ（ａ，ｂ）は端子７０７から入力され、テ
ンプレート記憶部７０４に記憶される。拡大縮小部７０
５では、スケール制御部１０６から転送されたスケール
信号ｓを端子７０６から入力し、その値ｓに従ってテン
プレート記憶部７０４に記憶されたテンプレートｆ
（ａ，ｂ）を下記の数８に従った計算を実行して拡大縮
小してテンプレートｆ′（ａ′，ｂ′）を得る。Next, an embodiment of the second invention will be described with reference to FIGS. The embodiment of the multi-resolution image feature extraction apparatus shown in FIG. 2 includes an image forming unit 101, a template convolution unit 201 capable of scaling a template, and an adding unit 10.
3, the resampling unit 104, the scale control unit 106,
The output terminal 105. Since the image forming unit 101, the scale control unit 106, and the addition unit 103 are the same as those in the embodiment of FIG. 1, the template convolution unit 201 will be described with reference to FIG. The template f (a, b) generated by the calculation according to Formula 3 or Formula 6 is input from the terminal 707 and stored in the template storage unit 704. Enlarging / reducing unit 70
5, the scale signal s transferred from the scale control unit 106 is input from the terminal 706, and the template f stored in the template storage unit 704 according to the value s.
(A, b) is subjected to calculation according to the following formula 8 to scale it to obtain a template f '(a', b ').

【数８】ｆ′（ａ′，ｂ′）＝（ａ′ｓ−Ｆ（ｂ′ｓ））（（ａ′ｓ−Ｆ（ａ′ｓ））ｆ（Ｃ（ａ′ｓ），Ｃ（ｂ′ｓ））＋（Ｃ（ａ′ｓ）−ａ′ｓ）ｆ（Ｆ（ａ′ｓ），Ｃ（ｂ′ｓ）））＋（Ｃ（ｂ′ｓ）−ｂ′ｓ）（（ａ′ｓ−Ｆ（ａ′ｓ））ｆ（Ｃ（ａ′ｓ），Ｆ（ｂ′ｓ））＋（Ｃ（ａ′ｓ）−ａ′ｓ）ｆ（Ｆ（ａ′ｓ），Ｆ（ｂ′ｓ）））得られたテンプレートｆ′（ａ′，ｂ′）は再びテンプ
レート記憶部７０４に記憶される。畳み込み計算部７０
２で端子７０１を通して像生成部１０１から転送された
画像ｈ（ｘ，ｙ）に対して、数３，数６に従って得た２
つのテンプレートを拡大縮小したものを用い、数１に従
った計算を実行することにより、２つの畳み込み画像Ｈ
（ｉ，ｆ′，ｘ，ｙ）を求め端子７０３を通して加算部
１０３に転送する。数７に従った計算により、加算部１
０３で生成された画像Ｇ（ｉ，ｆ′）（ｘ，ｙ）は、再
サンプリング制御部１０４に転送され、スケール制御部
１０６から転送されたスケール信号ｓに従って、ｋ画素
毎に間引きを行い、端子１０５から画像特徴として出力
する。ここではｋをｓに反比例する値として定めたが、
これは本質的な問題ではない。この実施例では、像形成
部１０１にて画像の拡大縮小を行わない場合にも、ｓ倍
の拡大を行ったのと同様の効果が得られる。さらに、像
形成部１０１にてｓ倍の拡大を行った場合、ｓ²倍に入
力画像を拡大した場合と同様の効果が得られる。スケー
ル信号ｓを順次変更することにより、同一画像から異な
る解像度に相当する同一特徴を抽出することができる。F ′ (a ′, b ′) = (a′s−F (b ′s)) ((a′s−F (a ′s)) f (C (a ′s), C ( b (s)) + (C (a's) -a's) f (F (a's), C (b's))) + (C (b's) -b's) (( a's-F (a's)) f (C (a's), F (b's)) + (C (a's) -a's) f (F (a's), F (B's))) The obtained template f '(a', b ') is stored again in the template storage unit 704. Convolution calculation unit 70
2 obtained according to Equations 3 and 6 for the image h (x, y) transferred from the image generation unit 101 through the terminal 701 in 2
The two convolutional images H are
(I, f ', x, y) is obtained and transferred to the adder 103 through the terminal 703. By the calculation according to the equation 7, the addition unit 1
The image G (i, f ′) (x, y) generated in 03 is transferred to the resampling control unit 104 and thinned out for every k pixels according to the scale signal s transferred from the scale control unit 106. It is output from the terminal 105 as an image feature. Here, k is defined as a value inversely proportional to s,
This is not an essential issue. In this embodiment, even when the image forming unit 101 does not perform image enlargement / reduction, the same effect as that obtained by performing s-fold enlargement can be obtained. Furthermore, when the image forming unit 101 enlarges the image by s times, the same effect as when the input image is enlarged by s ² times is obtained. By sequentially changing the scale signal s, the same feature corresponding to different resolutions can be extracted from the same image.

【００１７】次に、第２の発明の他の実施例を図８を用
いて説明する。入力端子４１０、２次元レーザアレイ４
０１、ビームスプリッタ４０２、ミラー４０３、レンズ
４０４、レンズ４０９、プリズム４０７、光電変換素子
アレイ４０８、再サンプリング部１０４、端子１０５は
図４の実施例と同一であり、テンプレート記憶部７０
４、拡大縮小部７０５、端子７０６，７０７は、図７の
実施例と同一であるため、可変空間変調器８０５につい
て説明する。図４での実施例では、空間変調器４０５，
４０６は固定でテンプレートをフィルムに焼き付けたも
のを用いたが、ここでは数３及び数６に従って生成さ
れ、ｓ倍に拡大されたテンプレートを、それぞれ液晶表
示装置で構成された空間変調器８０５，８０６に表示す
る。これにより、２次元レーザアレイ４０１で、レーザ
光の波長を切り替えることなく、スケール信号ｓを順次
変更することにより、同一画像から異なる解像度に相当
する同一特徴を抽出することができる。また、２次元レ
ーザアレイ４０１で、レーザ光の波長を切り替えた場合
には、より多種類の解像度の特徴を抽出することができ
る。Next, another embodiment of the second invention will be described with reference to FIG. Input terminal 410, two-dimensional laser array 4
01, the beam splitter 402, the mirror 403, the lens 404, the lens 409, the prism 407, the photoelectric conversion element array 408, the resampling unit 104, and the terminal 105 are the same as those in the embodiment of FIG.
4, the enlargement / reduction unit 705 and the terminals 706 and 707 are the same as those in the embodiment of FIG. 7, and therefore the variable spatial modulator 805 will be described. In the embodiment shown in FIG. 4, the spatial modulators 405,
406 is a fixed template printed on a film, but here, the templates generated according to Equations 3 and 6 and magnified s times are used as spatial modulators 805 and 806, respectively, which are configured by liquid crystal display devices. To display. As a result, the two-dimensional laser array 401 can sequentially change the scale signal s without switching the wavelength of the laser light to extract the same feature corresponding to different resolutions from the same image. Further, when the wavelength of the laser light is switched by the two-dimensional laser array 401, it is possible to extract the characteristics of a wider variety of resolutions.

【００１８】次に、図９を用いて第２の発明のさらに他
の実施例を説明する。図９の実施例では、対象画像６０
２、レーザ照射部６０１、レンズ４０４、レンズ４０
９、プリズム４０７、光電変換素子アレイ４０８、再サ
ンプリング部１０４、出力端子１０５、スケール制御部
１０６は、図６の実施例と同じであり、拡大縮小部７０
５、テンプレート記憶部７０４、端子７０６，７０７
は、実施例の図７と同じである。また、可変空間変調器
８０５，８０６も図８で説明したものと同じものであ
る。ｓ倍に拡大されたテンプレートを可変空間変調器８
０５，８０６に表示することにより、スケール信号ｓ倍
のスケールの特徴を抽出できる。スケール信号ｓを順次
変更して、画像特徴を出力することにより、同一画像か
ら異なる解像度に相当する異なる大きさの同一形状特徴
を抽出することができる。Next, still another embodiment of the second invention will be described with reference to FIG. In the example of FIG. 9, the target image 60
2, laser irradiation unit 601, lens 404, lens 40
9, the prism 407, the photoelectric conversion element array 408, the resampling unit 104, the output terminal 105, and the scale control unit 106 are the same as those in the embodiment of FIG.
5, template storage unit 704, terminals 706, 707
Is the same as FIG. 7 of the embodiment. The variable spatial modulators 805 and 806 are also the same as those described in FIG. The variable spatial modulator 8 uses the template magnified s times.
By displaying it on 05 and 806, it is possible to extract the characteristic of the scale of the scale signal s times. By sequentially changing the scale signal s and outputting the image features, it is possible to extract the same shape features of different sizes corresponding to different resolutions from the same image.

【００１９】[0019]

【発明の効果】以上に説明した多重解像度特徴抽出装置
は、位置ズレに対応する複数のテンプレートを備えて、
それぞれを用いて畳み込まれた画像を加算することによ
り、中間的な位置ズレに影響されずにテンプレートで表
した特徴を抽出できる。The multi-resolution feature extraction apparatus described above is provided with a plurality of templates corresponding to positional deviation,
By adding the convoluted images using each of them, the feature represented by the template can be extracted without being affected by the intermediate positional deviation.

【００２０】また、拡大縮小されたテンプレートを予め
備えることなしに、画像自体を拡大縮小するのと同等の
効果を実現することやテンプレートを拡大縮小すること
により、１種類の局所特徴に対応するテンプレートは１
つだけ備えればよく、簡易に多重特徴抽出装置を実現で
き、多重特徴の分のテンプレート数を減らし、効率よく
多数種類の局所特徴に対応するテンプレートを備えるこ
とができる。In addition, the template corresponding to one kind of local feature can be realized by providing the same effect as scaling the image itself without scaling the template in advance and scaling the template. Is 1
It is sufficient to provide only one, and a multiple feature extraction device can be easily realized, the number of templates corresponding to multiple features can be reduced, and templates corresponding to many types of local features can be efficiently provided.

[Brief description of drawings]

【図１】第１の発明の多重解像度抽出装置の実施例を示
す構成図である。FIG. 1 is a configuration diagram showing an embodiment of a multi-resolution extraction device of the first invention.

【図２】第２の発明の多重解像度抽出装置の実施例を示
す構成図である。FIG. 2 is a configuration diagram showing an embodiment of a multi-resolution extraction device of the second invention.

【図３】像形成部における画像の拡大縮小部分の説明図
である。FIG. 3 is an explanatory diagram of an enlargement / reduction portion of an image in the image forming unit.

【図４】第１の発明の多重解像度抽出装置の実施例を示
す構成図である。FIG. 4 is a configuration diagram showing an embodiment of a multi-resolution extraction device of the first invention.

【図５】２次元レーザアレイのレーザコンポーネントの
構成を示す図である。FIG. 5 is a diagram showing a configuration of a laser component of a two-dimensional laser array.

【図６】第１の発明の多重解像度抽出装置の実施例を示
す構成図である。FIG. 6 is a configuration diagram showing an embodiment of a multi-resolution extraction device of the first invention.

【図７】テンプレート畳み込み部のテンプレートの拡大
縮小部分の実施例の説明である。FIG. 7 is an illustration of an embodiment of a scaling portion of a template of a template convolution unit.

【図８】第２の発明の多重解像度抽出装置の実施例を示
す構成図である。FIG. 8 is a configuration diagram showing an embodiment of a multi-resolution extraction device of the second invention.

【図９】第２の発明の多重解像度抽出装置の実施例を示
す構成図である。FIG. 9 is a configuration diagram showing an embodiment of a multi-resolution extraction device of the second invention.

【図１０】従来の畳み込みを用いた特徴抽出の説明図で
ある。FIG. 10 is an explanatory diagram of feature extraction using conventional convolution.

[Explanation of symbols]

１０１像形成部１０２テンプレート畳み込み部１０３加算部１０４再サンプリング部１０５出力端子１０６スケール制御部２０１テンプレート畳み込み部３０１画像入力端子３０２像記憶部３０３像出力端子３０４，７０５拡大縮小部３０５スケール入力端子４０１２次元レーザアレイ４０２ビームスプリッタ４０３ミラー４０４，４０９レンズ４０５，４０６空間変調器４０７プリズム４０８光電変換素子アレイ５０１画素値入力端子５０２アンプ５０３，５０４，５０５レーザ素子５０６セレクタ５１２スケール入力端子６０１レーザ照射部６０２対象画像７０１，７０３，７０６，７０７端子７０２畳み込み計算部７０４テンプレート記憶部８０５，８０６可変空間変調器１００１入力画像１００２テンプレート１００３畳み込み画像１００４畳み込み処理 101 image forming unit 102 template convolution unit 103 adder unit 104 resampling unit 105 output terminal 106 scale control unit 201 template convolution unit 301 image input terminal 302 image storage unit 303 image output terminal 304, 705 scaling unit 305 scale input terminal 401 2 Dimensional laser array 402 Beam splitter 403 Mirror 404,409 Lens 405,406 Spatial modulator 407 Prism 408 Photoelectric conversion element array 501 Pixel value input terminal 502 Amplifier 503,504,505 Laser element 506 Selector 512 Scale input terminal 601 Laser irradiation part 602 Target image 701, 703, 706, 707 Terminal 702 Convolution calculation unit 704 Template storage unit 805, 806 Variable spatial modulator 1001 Input image 1002 Template 1003 Convolutional image 1004 Convolutional processing

Claims

[Claims]

1. An image feature extraction apparatus that includes local features of an image as a template, and extracts the features of the image by convolving the template with an input image, and a scale control unit that generates a scale signal corresponding to a scaling ratio. An image forming unit that generates an image according to the scale signal from the scale control unit, a template convolution unit that convolves a plurality of templates, an addition unit that adds and stores the convolution signals generated in the template convolution unit, and the addition And a resampling unit that thins out the convolutional signal output from the unit according to the scale signal from the scale control unit and outputs the feature.

2. The multi-resolution image feature extraction apparatus according to claim 1, wherein the template convolution unit performs convolution after enlarging / reducing a plurality of templates according to a scale signal from the scale control unit.