JPH0628476A

JPH0628476A - Processor for image signal

Info

Publication number: JPH0628476A
Application number: JP4181403A
Authority: JP
Inventors: Kenji Okajima; 健治岡島
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1992-07-09
Filing date: 1992-07-09
Publication date: 1994-02-04

Abstract

PURPOSE:To provide the image processing method for recognizing a pattern, which is strong against enlargement/reduction, rotation and misalignment, and also, strong against superposition of a background noise. CONSTITUTION:After a pre-processing is executed to an input image, such a transformation as a Fourier/Melan-transformation, etc., is performed thereto by a converting part 14, and an image is converted to a pattern being invariable against enlargement/reduction, rotation and misalignment. A matching part 15 compares the pattern to which such a conversion is performed, with a standard pattern, and determines a candidate pattern. A reverse converting part 16 executes a reverse conversion processing of the conversion by the converting part to the candidate pattern, and calculates a feedback image. An image memory part 13 eliminates a background noise from an input image, based on the feedback image. By repeating these processings, the pattern is decided.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、パタン認識のための画
像処理を行う画像信号の処理装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image signal processing apparatus for performing image processing for pattern recognition.

【０００２】[0002]

【従来の技術】従来のＯＣＲ等のパタン認識のための画
像処理を行う装置の認識方式は、大別してアナログ的マ
ッチング法（テンプレートマッチング法）と構造解析法
とに分けることができる。前者は標準パタンと未知パタ
ンとの重ね合わせによってパタンを認識する方式であ
り、汎用性にすぐれ、特に、活字の読み取りには有利で
あるとされている。2. Description of the Related Art Conventional recognition methods for an apparatus for performing image processing for pattern recognition such as OCR can be roughly classified into an analog matching method (template matching method) and a structure analysis method. The former is a method for recognizing a pattern by superimposing a standard pattern and an unknown pattern, which is excellent in versatility and is particularly advantageous for reading a printed character.

【０００３】[0003]

【発明が解決しようとする課題】しかしこの従来の方式
には、パターンに拡大縮小や回転などの変形があった
り、位置ずれがあったりすると標準パタンとの重なりあ
いが少なくなり、正しく認識されないという欠点があっ
た。“ＡｐｐｌｉｅｄＯｐｔｉｃｓ，１９７６，ｖｏ
ｌ．１５”に掲載のＤ．Ｃａｓａｓｅｎｔらによる文献
“Ｐｏｓｉｔｉｏｎ，ｒｏｔａｔｉｏｎ，ａｎｄｓｃ
ａｌｅｉｎｖａｒｉａｎｔｏｐｔｉｃａｌｃｏｒ
ｒｅｌａｔｉｏｎ”には、光学的な方法によりパタンの
フーリエ・メラン変換（Ｆｏｕｒｉｅｒ−Ｍｅｌｌｉｎ
ｔｒａｎｓｆｏｒｍ）を行った上でパタンマッチング
を行う方法が記載されている。この方法は、パタンのフ
ーリエ変換の振幅がもとのパタンの位置をずらしても不
変であること、また、バタンの拡張されたメラン変換の
振幅は、もとのパタンを拡大縮小あるいは回転しても不
変であることを利用したもので、もとのパタンを、位置
ずれ、拡大縮小、回転などに対して不変なパタンに変換
しておいてから、標準パタンとのマッチングを行うもの
であり、変形や位置ずれに対して強い方式である。しか
しながら、この方式では、掲示されるパタンに背景ノイ
ズが重畳している場合、マッチングが著しく悪化すると
いう難点があった。However, in this conventional method, if the pattern is deformed by scaling, rotation, or misaligned, the overlap with the standard pattern is reduced and the pattern cannot be recognized correctly. There was a flaw. “Applied Optics, 1976, vo
l. 15 "by D. Casasent et al.," Position, rotation, and sc. "
ale invariant optical cor
“Relation” includes a Fourier-Melan transform of a pattern by an optical method.
The method of performing pattern matching after performing the (transform) is described. In this method, the amplitude of the Fourier transform of the pattern is invariant even if the position of the original pattern is shifted, and the amplitude of the expanded Melan transform of the pattern is obtained by scaling or rotating the original pattern. Is also invariant, and the original pattern is converted into a pattern that is invariant to misalignment, scaling, rotation, etc., and then matched with the standard pattern. This method is strong against deformation and displacement. However, this method has a drawback that matching is significantly deteriorated when background noise is superimposed on the posted pattern.

【０００４】本発明の目的は、このような従来の欠点を
取り除き、拡大縮小、回転、位置ずれに対して強く、ま
た同時に背景ノイズの重畳に対しても強いパタン認識の
ための画像信号の処理装置を提供することにある。An object of the present invention is to eliminate such drawbacks of the prior art and process an image signal for pattern recognition, which is strong against enlargement / reduction, rotation, displacement and at the same time strong against background noise superposition. To provide a device.

【０００５】[0005]

【課題を解決するための手段】第１の発明は、入力画像
に対してフーリエ・メラン変換を施し、像を位置ずれ、
拡大縮小、回転操作に対して不変なパタンに変換する第
１の変換部と、第１の変換部より得られたパタンを標準
パタンと比較して候補パタンを決めるマッチング部と、
候補パタンに対して第１の変換部による変換の逆変換を
施してフィードバック像を計算する第１の逆変換部と、
フィードバック像に基づいて入力像からパタンに無関係
の背景ノイズを除去するイメージメモリとを備え、ルー
プ処理の繰り返しによりパタン判定を行うことを特徴と
している。According to a first aspect of the present invention, a Fourier-Melan transform is applied to an input image to shift the position of the image,
A first conversion unit that converts into a pattern that is invariant to scaling and rotation operations; a matching unit that compares the pattern obtained from the first conversion unit with a standard pattern to determine a candidate pattern;
A first inverse transform unit that performs an inverse transform of the transform performed by the first transform unit on the candidate pattern to calculate a feedback image;
An image memory that removes background noise irrelevant to the pattern from the input image based on the feedback image is provided, and pattern determination is performed by repeating loop processing.

【０００６】また、第２の発明は、入力画像を小領域に
分割して多段の階層的なフーリエ変換を施し、像のフー
リエ変換の振幅を求める処理によって像を位置ずれに対
して不変なパタンに変換する第２の変換部と、第２の変
換部より得られたパタンに対して拡張されたメラン変換
を施して拡大縮小、回転操作に対しても不変なパタンに
変換する第３の変換部と、第３の変換部より得られたパ
タンを標準パタンと比較して候補パタンを決めるマッチ
ング部と、マッチング部で決まった候補パタンおよび第
３の変換部からのデータにより第３の変換部の逆変換を
計算する第２逆変換部と、第２の逆変換部からのデータ
と第２の変換部からのデータによりフィードバック像を
計算する第３の逆変換部と、フィードバック像に基づい
て入力像からパタンに無関係の背景ノイズを除去するイ
メージメモリとを備え、ループ処理の繰り返しによりパ
タン判定を行うことを特徴としている。According to the second aspect of the invention, the input image is divided into small regions, multi-stage hierarchical Fourier transform is performed, and the image is invariant with respect to the positional deviation by a process of obtaining the amplitude of the Fourier transform of the image. A second conversion unit for converting to a second conversion unit, and a third conversion for converting the pattern obtained from the second conversion unit into a pattern that is invariant to scaling and rotation operations by performing extended Melan conversion. Section, a matching section that determines a candidate pattern by comparing the pattern obtained from the third converting section with a standard pattern, and the third converting section based on the candidate pattern determined by the matching section and the data from the third converting section. Based on the feedback image; and a second inverse transform unit that calculates the inverse transform of the second inverse transform unit, a third inverse transform unit that calculates a feedback image from the data from the second inverse transform unit and the data from the second transform unit Pattern from the input image And a image memory to remove extraneous background noise is characterized by performing a pattern determined by iteration of the loop processing.

【０００７】さらに、第３の発明は、入力画像に複素対
数変換で表される座標変換を施す第４の変換部と、第４
の変換部より得られたパタンを小領域に分割して多段の
階層的なフーリエ変換を施し、像のフーリエ変換の振幅
を求める処理によって像を小さい位置ずれ、拡大縮小、
回転操作に対して不変なパタンに変換する第５の変換部
と、第５の変換部より得られたパタンを標準パタンと比
較して候補パタンを決めるマッチング部と、候補パタン
に対して前記多段の階層的なフーリエ変換とフーリエ変
換の振幅計算の逆変換を施してフィードバック像を計算
する第４の逆変換部と、フィードバック像に基づいて入
力像からパタンに無関係の背景ノイズを除去するイメー
ジメモリとを備え、ループ処理の繰り返しによりパタン
判定を行うことを特徴としている。Further, a third aspect of the present invention is a fourth transformation part for subjecting an input image to coordinate transformation represented by complex logarithmic transformation, and a fourth transformation part.
The pattern obtained from the transformation unit of is divided into small regions and subjected to a multi-stage hierarchical Fourier transform, and the image is subjected to small displacement, scaling,
A fifth conversion unit for converting into a pattern invariant to a rotation operation, a matching unit for comparing a pattern obtained from the fifth conversion unit with a standard pattern to determine a candidate pattern, and the multi-stage for the candidate pattern. A fourth inverse transform unit for performing a feedback image by performing the inverse Fourier transform and the inverse transform of the amplitude calculation of the Fourier transform, and an image memory for removing background noise irrelevant to the pattern from the input image based on the feedback image And is characterized by performing pattern determination by repeating loop processing.

【０００８】[0008]

【作用】第１の発明では、入力画像に対してフーリエ変
換が施され、その振幅が計算される。よく知られている
ように、像のフーリエ変換の振幅は像の位置をずらして
も不変であるから、これにより位置不変なパタンが得ら
れる。この時、もとの像をａ倍拡大するとそのフーリエ
変換の振幅のパタンは１／ａに縮小し、もとの像を回転
するとフーリエ変換のパタンも同じ角度だけ回転するこ
ともよく知られている。そこで、次にこのパタンを新座
標で表わし、縦軸にその偏角、横軸に対数目盛りで動径
をとって表示する。このようにして表したパタンは、も
とのパタンを回転させると縦軸方向に平行移動するし、
また、もとのパタンを拡大あるいは縮小すると横軸方向
に平行移動することがわかる。そこで、このパタンに対
して再度フーリエ変換を行い、その振幅を計算すると拡
張されたメラン変換が得られ、Ｄ．Ｃａｓａｓｅｎｔら
が指摘したように、それはもとのパタンの回転あるいは
拡大縮小に対しても不変なものになる。In the first invention, the Fourier transform is applied to the input image and the amplitude thereof is calculated. As is well known, the amplitude of the Fourier transform of an image is invariant even if the position of the image is shifted, so that a position invariant pattern is obtained. At this time, it is well known that when the original image is enlarged a times, the Fourier transform amplitude pattern is reduced to 1 / a, and when the original image is rotated, the Fourier transform pattern is also rotated by the same angle. There is. Therefore, next, this pattern is represented by new coordinates, the vertical axis represents the deviation angle, and the horizontal axis represents the radius vector on a logarithmic scale. The pattern expressed in this way translates in the vertical direction when the original pattern is rotated,
Further, it is understood that when the original pattern is enlarged or reduced, the original pattern is translated in the horizontal axis direction. Then, the Fourier transform is performed again on this pattern and the amplitude thereof is calculated to obtain the extended Melan transform. As pointed out by Casasent et al., It is invariant to rotation or scaling of the original pattern.

【０００９】次に、ここまでの処理と同様の処理をあら
かじめ施しておいた標準パタンとのテンプレートマッチ
ングを行う。この処理により、提示されたパタンが標準
パタンに対し位置がずれていたり、拡大縮小あるいは回
転していても正しく認識することはできるが、一般的に
は提示パタンには対象としているパタン以外にも背景ノ
イズが重畳しているので、これだけの処理で精度のよい
認識をすることは困難である。そこで、上述のテンプレ
ートマッチングにより得られる類似度があらかじめ設定
しておいたしきい値よりも小さい場合には判定を保留
し、類似度の比較的高い標準パタンを候補パタンとす
る。この候補パタンに基づいて上述の処理を逆にたどっ
ていき、フィードバック像を計算して入力像をこのフィ
ードバック像に基づいて修正する。候補パタンが正しい
パタンであった場合には、フィードバック像により、対
象とするべきパタンの部分と、パタンとは関係のない背
景ノイズの部分とを決めることができるから、これによ
りもともとの入力像中の背景ノイズの部分を抑制するこ
とができる。このような修正により背景ノイズを抑制し
た上で、再び順方向の変換処理とテンプレートマッチン
グを繰り返す。このようにして、候補パタンが正しいパ
タンであった場合には、フィードバック処理により背景
ノイズが抑制されるので、２回目のテンプレートマッチ
ングの結果は大幅に改善し、以上のようなループ処理を
数回繰り返せばマッチングはほぼ完ぺきになって、背景
ノイズがある場合にも精度の高いパタン判定が可能にな
る。Next, template matching is performed with a standard pattern which has been previously subjected to the same processing as the processing so far. By this process, the presented pattern can be correctly recognized even if it is displaced from the standard pattern, scaled, or rotated. However, in general, the presented pattern is not limited to the target pattern. Since background noise is superimposed, it is difficult to perform accurate recognition with such processing. Therefore, when the similarity obtained by the template matching described above is smaller than a preset threshold value, the determination is suspended, and the standard pattern having a relatively high similarity is set as the candidate pattern. Based on this candidate pattern, the above processing is followed in reverse, a feedback image is calculated, and the input image is corrected based on this feedback image. If the candidate pattern is the correct pattern, the feedback image can be used to determine the part of the pattern to be targeted and the part of the background noise that is not related to the pattern. The background noise part of can be suppressed. The background noise is suppressed by such correction, and the forward conversion process and template matching are repeated again. In this way, when the candidate pattern is a correct pattern, the background noise is suppressed by the feedback processing, so the result of the second template matching is greatly improved, and the above loop processing is repeated several times. If it is repeated, the matching is almost perfect, and it is possible to perform highly accurate pattern determination even in the presence of background noise.

【００１０】次に、第２の発明では、第１の発明におけ
る最初の処理であるフーリエ変換の振幅計算処理を次の
ように多段に分割された処理に置き換える。すなわち、
入力像全体のフーリエ変換を一度に計算するかわりに、
入力像を多数の小領域に分割し、まず、各小領域ごとに
像のフーリエ変換とその振幅を計算する。これにより像
中の多数の位置において、ローカルなフーリエ変換の振
幅が得られるが、次にこれを新たにパタンと見なして、
再び位置変数に関するフーリエ変換を行い、その振幅を
計算する。このような処理によって得られるパタンは近
似的に、もとの像についてのフーリエ変換の振幅と同じ
性質を持つので、この処理に第１の発明における残りの
処理をつけ加えることにより、位置ずれ、拡大縮小、回
転に対して強く、同時に背景ノイズの重畳に対しても強
いパタン認識処理が可能になる。Next, in the second invention, the amplitude calculation process of the Fourier transform, which is the first process in the first invention, is replaced with a process divided into multiple stages as follows. That is,
Instead of computing the Fourier transform of the entire input image at once,
The input image is divided into a number of small areas, and the Fourier transform of the image and its amplitude are calculated for each small area. This gives the amplitude of the local Fourier transform at many positions in the image, which is then considered a new pattern,
The Fourier transform of the position variable is performed again, and its amplitude is calculated. The pattern obtained by such a process has approximately the same property as the amplitude of the Fourier transform for the original image, and therefore, by adding the remaining process of the first invention to this process, misalignment and enlargement can be achieved. It is possible to perform pattern recognition processing that is strong against reduction and rotation and at the same time strong against superposition of background noise.

【００１１】次に、第３の発明では、まず、入力像を複
素対数変換を施した座標軸で表現する。よく知られてい
るように、この処理により得られるパタンは、もとの像
を拡大縮小あるいは回転すると近似的に平行移動する。
次に、このパタンを多数の小領域に分割し、まず、各小
領域ごとに像のフーリエ変換とその振幅を計算する。こ
こで、もとの像の位置をずらすと、各小領域の中の像も
移動するが、もしも各小領域での位置ずれの大きさが小
領域の大きさよりも小さければ、フーリエ変換の振幅の
位置ずれに対する不変性により、このような処理により
得られるパタンは近似的に不変である。次に、このよう
にして得られた、多数の位置におけるローカルなフーリ
エ変換の振幅を新たにパタンとみなして、再び位置変数
に関するフーリエ変換を行い、その振幅を計算する。こ
のような処理によって得られるパタンは近似的に、複素
対数変換を施した座標軸で表現された入力像についての
フーリエ変換の振幅と同じ性質を持つので、この座標軸
上でパタンを平行移動しても、すなわちもとの入力像を
拡大縮小、あるいは回転させても近似的に不変である。
従って以上の処理により、小さな位置ずれ、および拡大
縮小変換、回転に対してほぼ不変のパタンが得られる。Next, in the third aspect of the invention, first, the input image is expressed by the coordinate axes subjected to complex logarithmic transformation. As is well known, the pattern obtained by this processing approximately translates when the original image is scaled or rotated.
Next, this pattern is divided into a large number of small areas, and the Fourier transform of the image and its amplitude are calculated for each small area. Here, if the position of the original image is shifted, the image in each small area also moves, but if the magnitude of the displacement in each small area is smaller than the size of the small area, the amplitude of the Fourier transform is reduced. Due to the invariance with respect to the positional deviation of, the pattern obtained by such processing is approximately invariant. Next, the amplitudes of the local Fourier transforms at a large number of positions obtained in this way are regarded as new patterns, and the Fourier transform for the position variable is performed again, and the amplitudes are calculated. The pattern obtained by such a process has approximately the same property as the amplitude of the Fourier transform of the input image represented by the coordinate axis subjected to the complex logarithmic transformation, and therefore, even if the pattern is translated on this coordinate axis, That is, even if the original input image is scaled or rotated, it is approximately unchanged.
Therefore, by the above processing, a pattern that is substantially invariable with respect to a small displacement, scaling conversion, and rotation can be obtained.

【００１２】次に、ここまでの処理と同様の処理をあら
かじめ施しておいた標準パタンとのテンプレートマッチ
ングを行う。第１の発明と同様に、掲示パタンに背景ノ
イズが重畳している場合には、これだけの処理では精度
のよい認識をすることは困難である。そこで、第１の発
明と同様に、類似度があらかじめ設定しておいたしきい
値よりも小さい場合には、類似度の比較的高い標準パタ
ンを候補パタンとしてフィードバックし、この候補パタ
ンに基づいて上述の処理を逆にたどっていき、フィード
バック像を計算して入力像をこのフィードバック像に基
づいて修正する。このような修正を施した上で、再び順
方向の変換処理とテンプレートマッチングを繰り返す。
第１の発明と同様に、以上のようなループ処理を数回繰
り返せばマッチングはほぼ完ぺきになって、背景ノイズ
がある場合にも精度の高い、また位置ずれ、拡大縮小、
回転に対しても強いパタン判定が可能になる。Next, template matching is performed with a standard pattern which has been previously subjected to the same processing as the processing so far. Similar to the first invention, when background noise is superimposed on the bulletin pattern, it is difficult to perform accurate recognition by such processing. Therefore, similar to the first invention, when the similarity is smaller than a preset threshold value, a standard pattern having a relatively high similarity is fed back as a candidate pattern, and based on this candidate pattern, the above-mentioned In the reverse order, the feedback image is calculated and the input image is corrected based on this feedback image. After such correction, the forward conversion process and template matching are repeated again.
Similar to the first invention, if the above loop processing is repeated several times, the matching is almost completed, and even if there is background noise, the accuracy is high, and the position shift, enlargement / reduction,
A strong pattern judgment is possible even for rotation.

【００１３】[0013]

【実施例】次に、本発明の実施例について図面を参照し
て説明する。Embodiments of the present invention will now be described with reference to the drawings.

【００１４】図１は、第１の発明の一実施例を示す構成
図である。図１に示す実施例は、イメージスキャナーも
しくはテレビカメラなどを利用する画像入力部１１と、
データをディジタル化したりエッジ強調などの必要な前
処理をする前処理部１２と、ディジタルデータを格納す
るイメージメモリ１３と、イメージメモリ１３から読み
だした画像データに対してフーリエ変換、拡張されたメ
ラン変換を行って位置ずれ、回転、拡大縮小変化に対し
て不変なパタンに入力を変換する変換部１４と、変換部
１４の出力と標準パタンとのテンプレートマッチングを
行ってパタン判定、候補パタンの決定を行うマッチング
部１５と、マッチング部１５で決まった候補パタンおよ
び変換部１４のデータからフィードバック像を計算する
逆変換部１６とを備えている。FIG. 1 is a block diagram showing an embodiment of the first invention. The embodiment shown in FIG. 1 includes an image input unit 11 using an image scanner or a television camera,
A pre-processing unit 12 for performing necessary pre-processing such as digitizing data and edge enhancement, an image memory 13 for storing digital data, and Fourier transform for image data read from the image memory 13 and extended Melan. A conversion unit 14 that performs conversion to convert an input into a pattern that is invariable with respect to displacement, rotation, and enlargement / reduction change, and template matching between the output of the conversion unit 14 and a standard pattern is performed to determine a pattern and determine a candidate pattern. The matching unit 15 for performing the above and the inverse transform unit 16 for calculating a feedback image from the candidate pattern determined by the matching unit 15 and the data of the transform unit 14 are provided.

【００１５】次に、第１の発明の実施例の動作について
説明する。画像入力部１１から入力された画像情報は、
前処理部１２でＡ／Ｄ変換処理を受け、ディジタル化さ
れ、メモリに格納される。後の認識処理に便利なよう
に、必要に応じてこの部分において像中のエッジ部の強
調などの前処理を行っておいてもよい。次に、この処理
結果はイメージメモリ１３に送られ、変換部１４がイメ
ージメモリ１３中のパタンに対して、そのフーリエ変
換、Ｆ（Ｋｘ，Ｋｙ）とその振幅Ｐ（Ｋｘ，Ｋｙ）を計
算する。以上の処理をかりに変換Ａと呼ぶことにする。
こうして得られる振幅は、よく知られているように入力
像を平行移動しても不変である。計算の都合によっては
振幅の代わりにその２乗であるパワーを計算してもかま
わない。フーリエ変換の位相は後の処理で使うので、変
換部１４内のメモリに格納しておく。Next, the operation of the embodiment of the first invention will be described. The image information input from the image input unit 11 is
The pre-processing unit 12 receives the A / D conversion processing, digitizes it, and stores it in the memory. If necessary, preprocessing such as enhancement of the edge portion in the image may be performed in this portion so as to be convenient for the subsequent recognition processing. Next, this processing result is sent to the image memory 13, and the transform unit 14 calculates the Fourier transform F (Kx, Ky) and its amplitude P (Kx, Ky) for the pattern in the image memory 13. . The above processing will be referred to as conversion A.
The amplitude thus obtained does not change when the input image is translated, as is well known. Depending on the convenience of calculation, the squared power may be calculated instead of the amplitude. Since the phase of the Fourier transform is used in the subsequent processing, it is stored in the memory in the conversion unit 14.

【００１６】次に、変換部１４はフーリエ変換の振幅を
極座標表示に変換し、縦軸にその偏角、横軸に対数目盛
りで動径をとって表示する。数式であらわすとNext, the transforming unit 14 transforms the amplitude of the Fourier transform into a polar coordinate display, displays the deviation angle on the vertical axis, and the radius vector on a logarithmic scale on the horizontal axis. Expressed as a mathematical formula

【数１】ｒｃｏｓθ＝Ｋｘｓ＝ｌｏｇ（１＋ｒ／ｒ
ｃ）ｒｓｉｎθ＝Ｋｙで表される座標変換（Ｋｘ，Ｋｙ）→（ｓ，θ）を計算
し、ｓ−θ軸のパタンへとフーリエ変換の振幅のパタン
を変換する。ここでｒｃは、ｒ＝０で対数関数が発散す
ることを防ぐための正の定数である。## EQU00001 ## rcos .theta. = Kx s = log (1 + r / r
c) The coordinate transformation (Kx, Ky) → (s, θ) represented by rsin θ = Ky is calculated, and the amplitude pattern of the Fourier transform is converted to the s-θ axis pattern. Here, rc is a positive constant for preventing the logarithmic function from diverging at r = 0.

【００１７】こうして得られたパタンＰ（ｓ，θ）は、
もとのパタンを回転させるとθ軸方向に平行移動する
し、また、もとのパタンを拡大あるいは縮小すると近似
的にｓ軸方向に平行移動することがわかる。そこで、こ
のパタンに対して再度フーリエ変換を行い、その振幅を
計算すると、すでに知られているように、それはもとの
パタンの回転あるいは拡大縮小に対しても不変なものに
なる。The pattern P (s, θ) thus obtained is
It can be seen that when the original pattern is rotated, it is translated in the θ-axis direction, and when the original pattern is enlarged or reduced, it is approximately translated in the s-axis direction. Therefore, when the Fourier transform is performed again on this pattern and the amplitude thereof is calculated, it becomes invariant to the rotation or enlargement / reduction of the original pattern, as is already known.

【００１８】変換Ａ以降ここまでの処理を変換Ｂと呼ぶ
ことにする。変換Ｂで得られた振幅はマッチング部１５
に送られる。また、変換Ｂで得られたフーリエ変換の位
相は、やはり後の処理で使うためメモリに格納してお
く。The processing from the conversion A to the processing up to this point will be called conversion B. The amplitude obtained by the conversion B is the matching unit 15
Sent to. Also, the phase of the Fourier transform obtained by the conversion B is stored in the memory for use in the subsequent processing.

【００１９】マッチング部１５は、あらかじめ上に述べ
た変換を施しておいた標準パタンを格納しており、これ
らの標準パタンと変換部１４からおくられてくるパタン
とのテンプレートマッチングを行い、パタン判定を行
う。変換部１４での処理の結果、提示されたパタンが標
準パタンに対し位置がずれていたり、拡大縮小あるいは
回転していてもマッチング部は正しく認識することがで
きるが、一般的には展示パタンには対象としているパタ
ン以外にも背景ノイズが重畳しているので、これだけの
処理で精度のよい認識をすることは困難である。そこ
で、マッチング部１５は、上述のテンプレートマッチン
グにより得られる類似度が、あらかじめ設定しておいた
しきい値よりもよい場合には、それを判定結果として処
理を終了するが、そうでない場合には、類似度の高い標
準パタンを候補パタンとして逆変換部１６にフィードバ
ックする。The matching unit 15 stores the standard patterns which have been subjected to the above-mentioned conversion in advance, and performs template matching between these standard patterns and the patterns sent from the conversion unit 14 to determine the pattern. I do. As a result of the processing in the conversion unit 14, even if the presented pattern is displaced from the standard pattern, scaled or rotated, the matching unit can correctly recognize it, but in general, it is displayed in the display pattern. Since background noise is superimposed on patterns other than the target pattern, it is difficult to perform accurate recognition with such processing. Therefore, if the similarity obtained by the template matching described above is better than a preset threshold value, the matching unit 15 ends the process with the result as a determination result. The standard pattern having a high degree of similarity is fed back to the inverse conversion unit 16 as a candidate pattern.

【００２０】逆変換部１６は、変換部１４での処理を逆
にたどり、フィードバック像を計算する。すなわち、ま
ずはじめにマッチング部１５からフィードバックされて
きた候補パタンの振幅と、変換部１４において変換Ｂで
計算された位相とから逆フーリエ変換を計算し、ｓ−θ
座標軸上のフィードバックパタンを計算する。さらに、
それに対して数１で表される座標変換の逆変換を施し、
パタンＰ′（Ｋｘ，Ｋｙ）を計算する。さらに、こうし
て得られたフィードバック振幅Ｐ′と変換部１４におけ
る変換Ａで計算された位相とから再び逆フーリエ変換を
行い、フィードバック像Ｉ′を求める。以上の逆変換処
理において、本来、正であるべき振幅や明暗度が負にな
ったりあるいは大きくなりすぎることを防ぐために、各
逆フーリエ変換処理のあとにしきい値処理をつけ加えて
もよい。The inverse transformation unit 16 reverses the processing of the transformation unit 14 to calculate a feedback image. That is, first, an inverse Fourier transform is calculated from the amplitude of the candidate pattern fed back from the matching unit 15 and the phase calculated by the transform B in the transform unit 14, and s−θ
Calculate the feedback pattern on the coordinate axes. further,
On the other hand, the inverse transformation of the coordinate transformation expressed by Equation 1
The pattern P '(Kx, Ky) is calculated. Further, an inverse Fourier transform is performed again from the feedback amplitude P ′ thus obtained and the phase calculated by the conversion A in the conversion unit 14 to obtain the feedback image I ′. In the above inverse transform processing, threshold processing may be added after each inverse Fourier transform processing in order to prevent the amplitude or the intensity which should originally be positive from becoming negative or becoming too large.

【００２１】イメージメモリ部１３は、逆変換部１６か
ら送られてきたフィードバック像Ｉ′にもとづいて、例
えば、次式によって入力像Ｉ₀を修正し、修正された像
Ｉを再び変換部１４へと送る。The image memory unit 13 corrects the input image I ₀ on the basis of the feedback image I ′ sent from the inverse conversion unit 16, for example, by the following equation, and the corrected image I to the conversion unit 14 again. And send.

【数２】Ｉ（ｘ，ｙ）＝（Ｉ₀（ｘ，ｙ）ａ＋（１−ａ））・（Ｉ′（ｘ，ｙ）ｂ＋（１−ｂ））ｘ，ｙは画素の位置を指定するパラメータである。[Number 2] I (x, y) = ( I 0 (x, y) a + (1-a)) · (I '(x, y) b + (1-b)) x, y is the position of the pixel It is a specified parameter.

【００２２】この変換の後に、値が大きくなりすぎるこ
とを防ぐための適当なしきい値処理をつけ加えてもよ
い。数２において、ａおよびｂはそれぞれ入力像および
フィードバック像の重みを決めるパラメータであり、０
から１の間に設定する。いま、かりに入力像が標準パタ
ンの中のどれかを平行移動、拡大縮小、回転したもので
あり、背景ノイズはまったくないとする。この場合には
最初のマッチング処理で判定結果がでるので逆変換処理
は行われないが、かりに逆変換処理が行われたと仮定す
ると、この場合、逆変換部１６で行われる処理は、正確
に変換部１４での処理の逆変換になり、フィードバック
像は前処理を受けた入力像と正確に一致する。After this conversion, appropriate thresholding may be added to prevent the values from becoming too large. In Equation 2, a and b are parameters that determine the weights of the input image and the feedback image, respectively, and 0
Set between 1 and 1. Now, it is assumed that the input image is a translation, scaling, or rotation of any of the standard patterns, and there is no background noise. In this case, since the determination result is obtained in the first matching process, the inverse conversion process is not performed. However, assuming that the inverse conversion process is performed, in this case, the process performed by the inverse conversion unit 16 is accurately converted. This is the inverse conversion of the processing in the unit 14, and the feedback image exactly matches the input image subjected to the preprocessing.

【００２３】入力像にノイズが重畳している場合にも、
近似的にフィードバック像は入力像中からノイズ部分を
除いたものになる。従って、例えば、数２において、ａ
およびｂを１に設定すると、修正された像からはノイズ
が大幅に抑制されることになる。これは数２によって、
入力像ではゼロでない値を持つ画素であっても、その画
素がノイズ部分のものであればフィードバック信号Ｉ′
がゼロであるためゼロにされてしまうからである。ａお
よびｂを１に設定する場合には、数２の後に平方根をと
る処理を加える。この結果、入力像とフィードバック像
とが等しい場合には、修正された像もこれに等しくな
る。Even when noise is superimposed on the input image,
Approximately, the feedback image is the input image with the noise part removed. Therefore, for example, in Equation 2, a
Setting b and b to 1 results in significant suppression of noise from the modified image. This is the number 2
Even if the pixel has a non-zero value in the input image, if the pixel is in the noise portion, the feedback signal I '
This is because it is zero because is zero. When a and b are set to 1, a process for taking a square root is added after the formula 2. As a result, if the input image and the feedback image are equal, the modified image will be equal.

【００２４】このようにして修正された像Ｉに対して再
び変換部１４の変換およびマッチング部１５のマッチン
グ処理が繰り返される。このループはマッチングによる
類似度があらかじめ定めておいたしきい値を越えるまで
くりかえされるが、あらかじめ定めておいた繰り返し回
数に達しても十分な類似度が得られない場合には、最初
の候補パタンは正しくなかったものとし、次順位のもの
へ候補パタンを変えて同様の操作を続ける。The conversion of the conversion unit 14 and the matching process of the matching unit 15 are repeated for the image I thus modified. This loop is repeated until the similarity by matching exceeds a predetermined threshold value, but if sufficient similarity is not obtained even after reaching the predetermined number of iterations, the first candidate pattern is If it is not correct, change the candidate pattern to the next one and continue the same operation.

【００２５】図２は、第２の発明の一実施例を示す構成
図である。図２に示す実施例は、イメージスキャナーも
しくはテレビカメラなどを利用する画像入力部２１と、
データをディジタル化したりエッジ強調などの必要な前
処理をする前処理部２２と、ディジタルデータを格納す
るイメージメモリ２３と、多段の階層的なフーリエ変換
を行い、位置ずれに対して不変なパタンに入力を変換す
る変換部Ａ２４と、拡張されたメラン変換を行って回
転、拡大縮小変化に対して不変なパタンに入力を変換す
る変換部Ｂ２５と、変換部Ｂ２５の出力と標準パタンと
のテンプレートマッチングを行ってパタン判定、候補パ
タンの決定を行うマッチング部２６と、マッチング部２
６で決まった候補パタンおよび変換部Ｂ２５のデータか
ら変換部Ｂ２５の逆変換を計算する逆変換部Ｂ２７と、
逆変換部Ｂ２７からのデータと変換部Ａ２４からのデー
タによりフィードバック像を計算する逆変換部Ａ２８と
を備えている。FIG. 2 is a block diagram showing an embodiment of the second invention. The embodiment shown in FIG. 2 includes an image input unit 21 using an image scanner or a television camera,
A pre-processing unit 22 that digitizes data and performs necessary pre-processing such as edge enhancement, an image memory 23 that stores digital data, and a multi-stage hierarchical Fourier transform to form a pattern that is invariable with respect to positional deviation. A conversion unit A24 that converts an input, a conversion unit B25 that performs an extended Melan transform to convert an input into a pattern that is invariant to rotation and scaling, and template matching between an output of the conversion unit B25 and a standard pattern. And the matching unit 26 that performs the pattern determination and the candidate pattern determination.
An inverse transform unit B27 that calculates the inverse transform of the transform unit B25 from the candidate pattern determined in 6 and the data of the transform unit B25;
An inverse transform unit A28 that calculates a feedback image based on the data from the inverse transform unit B27 and the data from the transform unit A24 is provided.

【００２６】次に、第２の発明の実施例の動作について
説明する。第２の発明の実施例の動作は第１の発明の動
作とほぼ同様であるが、第１の発明の実施例における最
初のフーリエ変換の振幅計算処理を次のように多段に分
割された処理に置き換える点が異なっている。また、こ
れに対応して逆変換部も多段に分割されている。Next, the operation of the second embodiment of the invention will be described. The operation of the embodiment of the second invention is almost the same as the operation of the first invention, but the amplitude calculation processing of the first Fourier transform in the embodiment of the first invention is divided into the following multi-stage processing. The difference is that it is replaced with. In addition, the inverse conversion unit is also divided into multiple stages corresponding to this.

【００２７】第１の発明の実施例と同様に、画像入力部
２１から入力された画像情報は、前処理部２２で必要な
前処理を受けてメモリに格納される。第２の発明の実施
例では、第１の発明の実施例での変換部における処理の
うち変換Ａの部分が多段に分割された階層的なフーリエ
変換に置き換えられている。すなわち、まず変換部Ａに
おいては前処理を受けた像を多数の小領域に分割し、各
小領域ごとに像のフーリエ変換とその振幅を計算する。
これらの値は後の処理で用いるため変換部Ａ内のメモリ
に保存しておく。この処理を数式で表すと以下のように
なる。Similar to the first embodiment of the present invention, the image information input from the image input section 21 is subjected to necessary preprocessing by the preprocessing section 22 and stored in the memory. In the second embodiment of the invention, the conversion A portion of the processing in the conversion unit in the first invention is replaced with a hierarchical Fourier transform divided into multiple stages. That is, first, in the conversion unit A, the preprocessed image is divided into a large number of small regions, and the Fourier transform of the image and its amplitude are calculated for each small region.
These values are saved in the memory in the conversion unit A for use in the subsequent processing. This processing is expressed in the following formula.

【００２８】今、入力画像の画素数をＮ×Ｎとする。ま
ず、この入力画像を、画素数ｎ₁×ｎ₁からなる、ｎ₂
×ｎ₂個の小領域に分割する。ここで、互いにオーバー
ラップがないように画像を分割する場合には、Ｎ＝ｎ₁
×ｎ₂であるが、互いにオーバーラップがあるように分
割してもかまわない。Now, let the number of pixels of the input image be N × N. First, this input image is represented by n ₂ consisting of the number of pixels n ₁ × n _1.
It is divided into × n ₂ small areas. Here, when the images are divided so that they do not overlap each other, N = n ₁
Although it is × n ₂ , they may be divided so that they overlap each other.

【００２９】次に、これらの小領域に分割された各画像
に対して、次のように、そのフーリエ変換およびその振
幅を求める処理を施す。なお、以後、Ｘ^*はＸのベクト
ルを表し、ｘ^*はｘのベクトルを表し、ｋ^*はｋのベク
トルを表すものとする。Next, each image divided into these small regions is subjected to the Fourier transform and its amplitude determination process as follows. Note that, hereinafter, X ^* represents an X vector, x ^* represents an x vector, and k ^* represents a k vector.

【００３０】ここで、各小領域を添え字Ｘ^*＝（Ｘ，
Ｙ）（Ｘ，Ｙ＝１〜ｎ₂）で表し、各小領域中での画素
の位置を添え字ｘ^*＝（ｘ，ｙ）（ｘ，ｙ＝１〜ｎ₁）
で表す。また、Ｘ^*で表される小領域中のｘ^*の位置に
ある画素の値をＩ（ｘ^*；Ｘ^*）で表す。Here, each sub-region is assigned a subscript X ^* = (X,
Y) (X, Y = 1 to n ₂ ) and the position of the pixel in each small area is subscript x ^* = (x, y) (x, y = _{1 to} n ₁ ).
It is represented by. Further, the value of the pixel at the position of x ^{* in} the small area represented by X ^* is represented by I (x ^* ; X ^* ).

【００３１】[0031]

【数３】 [Equation 3]

【００３２】[0032]

【数４】 [Equation 4]

【００３３】Ｗは窓関数である。Ｗ＝１であれば数３
は、Ｉ（ｘ^*；Ｘ^*）の添え字ｘ^*に関する２次元の離
散値フーリエ変換を表す。小領域の境界での不連続を消
去するために、境界でゼロになるような窓関数、例え
ば、ガウシアンあるいはハミング窓をＷとして用いても
よい。W is a window function. If W = 1, number 3
It is, I; represents a (x ^* X ^*) of the subscript x ^* regarding two-dimensional discrete values Fourier transform. In order to eliminate the discontinuity at the boundary of the small area, a window function that becomes zero at the boundary, for example, Gaussian or Hamming window may be used as W.

【００３４】数３の計算に関しては、高速フーリエ変換
のアルゴリズムを利用することが可能である。数４のＰ
１（ｋ^*１，Ｘ^*）がフーリエ変換の振幅である。計算
の都合によって振幅の代わりにその２乗であるパワーを
計算しても同じである。入力像の位置をずらしても位置
ずれの大きさが小領域の大きさ程度以下であれば、以上
の処理で得られるパタンＰ１（ｋ^*１，Ｘ^*）は、近似
的に不変である。ここで計算されたフーリエ変換の位相
は後の処理で使うためメモリに格納しておく。For the calculation of the equation 3, it is possible to use the fast Fourier transform algorithm. Number 4 P
1 (k ^* 1, X ^* ) is the amplitude of the Fourier transform. It is the same even if the power which is the square of the amplitude is calculated instead of the amplitude for convenience of calculation. The pattern P1 (k ^* 1, X ^* ) obtained by the above process is approximately unchanged if the magnitude of the positional deviation is equal to or smaller than the size of the small area even if the position of the input image is shifted. The phase of the Fourier transform calculated here is stored in the memory for use in the subsequent processing.

【００３５】次に、変換部Ａ２４は、Ｐ１（ｋ^*１，Ｘ
^*）に対して再び位置変数Ｘ^*に関するフーリエ変換を
行い、その振幅を計算する。Next, the conversion unit A24 outputs P1 (k ^* 1, X
^* ) Is again subjected to Fourier transform with respect to the position variable X ^* , and its amplitude is calculated.

【００３６】[0036]

【数５】 [Equation 5]

【００３７】[0037]

【数６】 [Equation 6]

【００３８】ここで得られるフーリエ変換の位相もやは
り後の処理で使うためメモリに格納しておく。ここまで
に述べた処理は２段階の処理であるが、必要に応じて３
段階以上の多段の処理にすることも可能である。変換部
Ａ２４は、さらに次のような変数変換を行う。The phase of the Fourier transform obtained here is also stored in the memory for use in the subsequent processing. The processing described so far is a two-step processing, but if necessary, 3
It is also possible to carry out multi-stage processing of more than steps. The conversion unit A24 further performs the following variable conversion.

【００３９】[0039]

【数７】 [Equation 7]

【００４０】こうして得られるパタンＰ（ｋ^*）は、第
１の発明の実施例において計算されている像全体のフー
リエ変換の振幅と類似の性質を持ち、入力像を平行移動
しても不変である。The pattern P (k ^* ) thus obtained has a property similar to the amplitude of the Fourier transform of the entire image calculated in the embodiment of the first invention, and does not change even when the input image is translated. is there.

【００４１】次に、Ｐは変換部Ｂ２５へと送られ、第１
の発明の実施例における変換Ｂの処理がなされ、パタン
の回転あるいは拡大縮小に対しても不変なパタンへと変
換される。マッチング部２６の働きは第１の発明の実施
例と同様である。逆変換部は変換部を多段に分割したこ
とに対応してやはり多段に分割される。まず、逆変換部
２７は、第１の発明の実施例と同様に逆フーリエ変換、
逆座標変換を行い、パタンＰ′（ｋ^*）を計算する。次
に、逆変換部Ａ２８は、数７の変換の逆変換を行い、Ｐ
２′（ｋ^*１，ｋ^*２）を計算した上で、このＰ２′と
変換部Ａ２４で計算された位相とを用いて逆フーリエ変
換を行い、Ｐ１′（ｋ^*１，Ｘ^*）を計算する。さら
に、逆変換部Ａ２８は、このＰ１′と変換部Ａ２４での
最初のローカルなフーリエ変換で計算された位相とを用
いて逆変換を行い、フィードバック像Ｉ′を計算する。Next, P is sent to the conversion unit B25, where the first
The process of the conversion B in the embodiment of the invention is performed, and the pattern is converted into a pattern that is invariable even when the pattern is rotated or enlarged or reduced. The function of the matching unit 26 is similar to that of the first embodiment of the invention. The inverse conversion unit is also divided into multiple stages corresponding to the division of the conversion unit into multiple stages. First, the inverse transform unit 27 performs the inverse Fourier transform as in the first embodiment of the invention.
The inverse coordinate transformation is performed to calculate the pattern P '(k ^* ). Next, the inverse conversion unit A28 performs the inverse conversion of the conversion of the expression 7 to obtain P
After calculating 2 ′ (k ^* 1, k ^* 2), an inverse Fourier transform is performed using this P2 ′ and the phase calculated by the conversion unit A24 to obtain P1 ′ (k ^* 1, X ^* ). calculate. Further, the inverse transform unit A28 performs an inverse transform using this P1 'and the phase calculated by the first local Fourier transform in the transform unit A24 to calculate the feedback image I'.

【００４２】イメージメモリ部２６は、第１の発明の実
施例と同様にこのフィードバック像に基づいて入力像を
修正し、これを再び変換部Ａ２４へと送る。このループ
は、マッチングによる類似度があらかじめ定めておいた
しきい値を越えるまで繰り返され、また、あらかじめ定
めておいた繰り返し回数に達しても十分な類似度が得ら
れない場合には、次順位のものへ候補パタンを変えて同
様の操作を続ける点も第１の発明の実施例と同様であ
る。The image memory unit 26 corrects the input image based on this feedback image as in the first embodiment of the present invention, and sends it again to the conversion unit A24. This loop is repeated until the similarity by matching exceeds a predetermined threshold value, and if sufficient similarity is not obtained even after reaching the predetermined number of repetitions, the next rank The point that the candidate pattern is changed to one and the same operation is continued is also the same as the first embodiment of the present invention.

【００４３】この実施例では、フィードバック像Ｉ′ま
での逆変換を行い、入力の修正処理を行っているが、逆
変換部Ａ２８におけるＰ１′（ｋ^*１，Ｘ^*）までの逆
変換までしか行わず、これを用いて、入力像から計算さ
れたＰ１（ｋ^*１，Ｘ^*）に修正処理を行い、マッチン
グ処理、修正処理のループを回すことも可能である。さ
らに、Ｐ１までのループを何回か回してＰ１′を安定さ
せてから、フィードバック像Ｉ′までのループを回すと
いう組合せ処理を行うバリエーションも可能である。In this embodiment, the inverse transformation up to the feedback image I'is performed to correct the input. However, only the inverse transformation up to P1 '(k ^* 1, X ^* ) in the inverse transformation unit A28 is performed. It is also possible to perform the correction processing on P1 (k ^* 1, X ^* ) calculated from the input image without using it and to loop the matching processing and the correction processing. Further, a variation is possible in which the loop up to P1 is rotated several times to stabilize P1 ', and then the loop up to the feedback image I'is rotated.

【００４４】第１の発明の実施例においては、最初に像
全体のフーリエ変換を行うため、背景ノイズが対象とす
るパタンと同じ空間周波数成分を持つような場合にはノ
イズ抑制が難しくなる場合もある。第２の発明の実施例
では、変換の中間表現として場所ごとの空間周波数の分
布であるＰ１（ｋ^*１，Ｘ^*）が計算されるため、たと
えノイズがパタンと同じ空間周波数成分を持っていたと
しても、それがパタンとは離れた位置にあれば抑制する
ことが可能になる。In the first embodiment of the present invention, since the Fourier transform of the entire image is first performed, noise suppression may be difficult when background noise has the same spatial frequency component as the target pattern. is there. In the second embodiment of the present invention, since the spatial frequency distribution P1 (k ^* 1, X ^* ) for each location is calculated as an intermediate representation of the transformation, even if noise has the same spatial frequency component as the pattern. Even if it is, it can be suppressed if it is located away from the pattern.

【００４５】図３は、第３の発明の一実施例を示す構成
図である。図３に示す実施例は、イメージスキャナーも
しくはテレビカメラなどを利用する画像入力部３１と、
データをディジタル化してエッジ強調などの必要な前処
理をする前処理部３２と、入力像を複素対数変換を施し
た座標軸上でのパタンへと変換する変換部Ｃ３３と、変
換部Ｃ３３の出力パタンを格納するイメージメモリ３４
と、このパタンに対して多段の階層的なフーリエ変換を
行い、小さな位置ずれおよび回転、拡大縮小変化に対し
て近似的に不変なパタンに入力を変換する変換部Ｄ３５
と、変換部Ｄ３５の出力と標準パタンとのテンプレート
マッチングを行ってパタン判定、候補パタンの決定を行
うマッチング部３６と、マッチング部３６で決まった候
補パタンおよび変換部Ｄ３５のデータから変換部Ｄ３５
の逆変換を行う逆変換部Ｄ３７とを備えている。FIG. 3 is a block diagram showing an embodiment of the third invention. The embodiment shown in FIG. 3 includes an image input unit 31 using an image scanner or a television camera,
A preprocessing unit 32 that digitizes the data and performs necessary preprocessing such as edge enhancement, a conversion unit C33 that converts the input image into a pattern on the coordinate axis that has undergone complex logarithmic conversion, and an output pattern of the conversion unit C33. Image memory 34 for storing
And a conversion unit D35 that performs a multi-stage hierarchical Fourier transform on this pattern and converts the input into a pattern that is approximately invariant to small displacements, rotations, and scaling changes.
And a matching unit 36 that performs pattern determination and candidate pattern determination by performing template matching between the output of the conversion unit D35 and the standard pattern, and the conversion unit D35 from the candidate pattern determined by the matching unit 36 and the data of the conversion unit D35.
And an inverse conversion unit D37 for performing the inverse conversion of

【００４６】次に、第３の発明の実施例の動作について
説明する。第１の発明の実施例と同様に、画像入力部３
１から入力された画像情報は、前処理部３２で必要な前
処理を受ける。次に、変換部Ｃ３３は前処理部からの送
られてきたパタンＩ（ｘ，ｙ）に対して、ｗ＝Ｌｏｇ（ｚ＋ａ）ｘ＞０の場合ｗ＝Ｌｏｇ（ｚ−ａ）ｘ＜０の場合；ｚ＝ｘ＋ｉｙ，ｗ＝ｕ＋ｉｖ，ａ：正の定数，
ｉ＝√−１の複素対数変換で表される座標変換（ｘ，ｙ）→（ｕ，
ｖ）を行い、ｕ−ｖ軸上のパタンへと変更する。ここで
ａは原点で対数関数が発散することを防ぐための正の定
数である。対数変換のよく知られた性質により、この処
理によって得られるパタンは、もとの像を拡大縮小ある
いは回転すると近似的に平行移動する。変換部Ｃ３３で
処理を受けたパタンはイメージメモリ３４に送られ格納
される。Next, the operation of the embodiment of the third invention will be described. Similar to the first embodiment of the invention, the image input unit 3
The image information input from 1 undergoes necessary preprocessing in the preprocessing unit 32. Next, the conversion unit C33, with respect to the pattern I (x, y) sent from the preprocessing unit, in the case of w = Log (z + a) x> 0 w = Log (z-a) x <0 Case; z = x + iy, w = u + iv, a: positive constant,
Coordinate transformation represented by complex logarithmic transformation of i = √−1 (x, y) → (u,
v), and change to a pattern on the uv axis. Here, a is a positive constant for preventing the logarithmic function from diverging at the origin. Due to the well-known nature of logarithmic transformation, the pattern obtained by this process will approximately translate when the original image is scaled or rotated. The pattern processed by the conversion unit C33 is sent to and stored in the image memory 34.

【００４７】次に、変換部Ｄ３５がこのパタンを多数の
小領域に分割し、まず、各小領域ごとに像のフーリエ変
換とその振幅を計算する。ここで、もとの像の位置をず
らすと、各小領域の中の像も移動するが、もしも各小領
域での位置ずれの大きさが小領域の大きさよりも小さけ
れば、このような処理により得られるパタンは近似的に
不変である。Next, the conversion unit D35 divides this pattern into a large number of small areas, and first calculates the Fourier transform of the image and its amplitude for each small area. Here, if the position of the original image is shifted, the image in each small area also moves, but if the size of the positional deviation in each small area is smaller than the size of the small area, such processing is performed. The pattern obtained by is approximately invariant.

【００４８】さらに、変換部Ｄ３５では、このようにし
て得られた、像中の多数の位置における、ローカルなフ
ーリエ変換の振幅を新たにパタンとみなして、再び位置
変数に関するフーリエ変換を行い、その振幅を計算す
る。この処理は複素対数変換を受けたパタンに対して行
われることを除けば、第２の発明の実施例における数３
〜数６の処理とまったく同様である。Further, in the transforming section D35, the amplitudes of the local Fourier transforms thus obtained at a large number of positions in the image are newly regarded as a pattern, and the Fourier transform on the position variable is performed again. Calculate the amplitude. Mathematical Expression 3 in the embodiment of the second invention except that this processing is performed on the pattern subjected to the complex logarithmic transformation.
~ It is exactly the same as the processing of Expression 6.

【００４９】このような処理によって得られるパタンは
近似的に、複素対数変換を施した座標軸で表現された入
力像についてのフーリエ変換の振幅と同じ性質を持つの
で、この座標軸上でパタンを平行移動しても、すなわち
もとの入力像を拡大縮小、あるいは回転させても近似的
に不変である。従って以上の処理により、小さな位置ず
れおよび拡大縮小変換、回転に対してほぼ不変のパタン
が得られる。The pattern obtained by such processing has approximately the same property as the amplitude of the Fourier transform of the input image represented by the coordinate axis subjected to the complex logarithmic transformation, and therefore the pattern is translated on this coordinate axis. Even if the original input image is enlarged or reduced or rotated, it is approximately unchanged. Therefore, by the above processing, a pattern that is almost invariable with respect to a small displacement, scaling conversion, and rotation can be obtained.

【００５０】次に、マッチング部３６においては、第１
の発明と同様に、ここまでの処理と同様の処理をあらか
じめ施しておいた標準パタンとのテンプレートマッチン
グを行う。逆変換部Ｄ３７は、やはりマッチング部３６
で決定された候補パタンをもとにフィードバック像の計
算を行う。逆変換部Ｄ３７の処理は、第２の発明の実施
例における逆変換部Ａ２８においてＰ２′からフィード
バック像を計算する処理とまったく同じである。Next, in the matching section 36, the first
Similar to the invention described above, the template matching is performed with the standard pattern which has been subjected to the same processing as the processing so far. The inverse conversion unit D37 is also the matching unit 36.
The feedback image is calculated based on the candidate pattern determined in. The process of the inverse transform unit D37 is exactly the same as the process of calculating the feedback image from P2 'in the inverse transform unit A28 in the embodiment of the second invention.

【００５１】イメージメモリ部３４は、第１の発明の実
施例と同様にこのフィードバック像に基づいて入力像を
修正し、これを再び変換部Ｄ３５へと送る。このループ
は、マッチングによる類似度があらかじめ定めておいた
しきい値を越えるまでくりかえされ、また、あらかじめ
定めておいた繰り返し回数に達しても十分な類似度が得
られない場合には、次順位のものへ候補パタンを変えて
同様の操作を続ける点は第１の発明の実施例と同様であ
る。The image memory unit 34 corrects the input image based on this feedback image as in the first embodiment of the invention, and sends it again to the conversion unit D35. This loop is repeated until the similarity due to matching exceeds a predetermined threshold value, and if sufficient similarity is not obtained even after reaching the predetermined number of repetitions, the next rank The point that the candidate pattern is changed to one and the same operation is continued is the same as the embodiment of the first invention.

【００５２】また、第２の発明の実施例のバリエーショ
ンで述べたように、多段のフーリエ変換すべての逆変換
を行わず、途中までの逆変換処理の結果を用いて修正処
理を行い、マッチング処理、修正処理のループを回すこ
とも可能である。さらに、これにフィードバック像まで
戻っての修正処理のループを組み合わせる処理が可能な
点も第２の発明の実施例と同様である。As described in the variation of the second embodiment of the present invention, the inverse transform of all the multi-stage Fourier transforms is not performed, but the correction process is performed by using the result of the inverse transform process up to the middle, and the matching process is performed. It is also possible to rotate the correction processing loop. Further, the point that the process of combining the loop of the correction process by returning to the feedback image can be performed is also the same as the embodiment of the second invention.

【００５３】[0053]

【発明の効果】以上説明したように本発明は、パタンの
拡大縮小、回転、位置ずれに対して強いパタン認識処理
を行うことができ、また、提示されるパタンに背景ノイ
ズが重畳している場合にも標準パタンとのマッチングが
大幅に改善され、精度の高いパタン認識処理を行うこと
ができるという効果を有する。As described above, according to the present invention, it is possible to perform pattern recognition processing that is strong against enlargement / reduction of patterns, rotation, and positional displacement, and background noise is superimposed on the presented pattern. Also in this case, matching with the standard pattern is significantly improved, and it is possible to perform highly accurate pattern recognition processing.

[Brief description of drawings]

【図１】第１の発明の一実施例を示す構成図である。FIG. 1 is a configuration diagram showing an embodiment of a first invention.

【図２】第２の発明の一実施例を示す構成図である。FIG. 2 is a configuration diagram showing an embodiment of a second invention.

【図３】第３の発明の一実施例を示す構成図である。FIG. 3 is a configuration diagram showing an embodiment of a third invention.

[Explanation of symbols]

１１，２１，３１画像入力部１２，２２，３２前処理部１３，２３，３４イメージメモリ１４変換部１５，２６，３６マッチング部１６逆変換部２４変換部Ａ２５変換部Ｂ２７逆変換部Ｂ２８逆変換部Ａ３３変換部Ｃ３５変換部Ｄ３７逆変換部Ｄ 11, 21, 31 image input unit 12, 22, 32 pre-processing unit 13, 23, 34 image memory 14 conversion unit 15, 26, 36 matching unit 16 inverse conversion unit 24 conversion unit A 25 conversion unit B 27 inverse conversion unit B 28 Inverse Transform Unit A 33 Transform Unit C 35 Transform Unit D 37 Inverse Transform Unit D

Claims

[Claims]

1. A first conversion unit for performing a Fourier-Melan transform on an input image to convert the image into a pattern invariant to displacement, enlargement / reduction, and rotation operations; and a first conversion unit. A matching unit that determines a candidate pattern by comparing the obtained pattern with a standard pattern; a first inverse transform unit that performs an inverse transform of the transform performed by the first transform unit on the candidate pattern; An image signal processing apparatus, comprising: an image memory for removing background noise unrelated to a pattern from an input image based on the image, and performing pattern determination by repeating loop processing.

2. A second method in which an input image is divided into small regions, multi-stage hierarchical Fourier transform is performed, and the image is converted into a pattern invariant to displacement by a process of obtaining the amplitude of the Fourier transform of the image. A conversion unit; a third conversion unit that performs extended Melan transform on the pattern obtained by the second conversion unit to convert it into a pattern that is invariant to scaling and rotation operations; The inverse conversion of the third conversion unit is calculated from the matching unit that determines the candidate pattern by comparing the pattern obtained from the conversion unit with the standard pattern, and the candidate pattern determined by the matching unit and the data from the third conversion unit. A second inverse transform unit, a third inverse transform unit that calculates a feedback image from the data from the second inverse transform unit and the data from the second transform unit, and regardless of the pattern from the input image based on the feedback image Background And a image memory to remove noise, the processor of the image signal and performs pattern judgment by repeating the loop process.

3. A fourth transforming unit for subjecting an input image to coordinate transformation represented by complex logarithmic transform, and a pattern obtained by the fourth transforming unit divided into small regions to form a multi-stage hierarchical Fourier transform. And a pattern obtained by the fifth transforming unit, which transforms the image into a pattern that is invariant to small displacement, enlargement / reduction, and rotation operation by the process of obtaining the amplitude of the Fourier transform of the image. And a matching unit for determining a candidate pattern by comparing with a standard pattern, and a fourth inverse transform for computing a feedback image by subjecting the candidate pattern to the inverse transform of the multistage hierarchical Fourier transform and the amplitude calculation of the Fourier transform. And an image memory that removes background noise irrelevant to the pattern from the input image based on the feedback image, and an image signal characterized by performing pattern determination by repeating loop processing. Processing apparatus.