JPH06333051A

JPH06333051A - Image processor

Info

Publication number: JPH06333051A
Application number: JP5116790A
Authority: JP
Inventors: Ikutoshi Fukushima; 福島郁俊
Original assignee: Olympus Optical Co Ltd
Current assignee: Olympus Corp
Priority date: 1993-05-19
Filing date: 1993-05-19
Publication date: 1994-12-02

Abstract

PURPOSE:To provide an image processor which has the satisfactory recognizing capability to such an image that is rotated, magnified or reduced, etc. CONSTITUTION:An image processor is provided with an edge detecting means which detects the edges of the input images, an image input means 305 which irradiates a coherent beam 300 and inputs an edge image detected by the edge detecting means to a system, a computer hologram 306 which superposes the phase information on the information supplied to the system, a Fourier transformation optical system 307 which applies the Fourier transformation on the phase information, and a detecting means 309 which detects the information supplied to a logarithmic pole coordinate transformation surface 308, i.e., a Fourier transformation surface. Then the edge of an input image is detected and this image is defined as an input image of a logarithmic pole coordinate transformation optical system. Thus it is possible to surely detect the magnification/reduction value, to improve the recognizing capability of the rotated and magnified/reduced images to a reference image, and to improve the detecting capability of the magnification/reduction value respectively.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、画像処理装置に関し、
特に、回転・拡大縮小等の変形を受けた画像を認識可能
な画像処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus,
In particular, the present invention relates to an image processing device capable of recognizing an image that has been deformed by rotation, enlargement / reduction, or the like.

【０００２】[0002]

【従来の技術】産業用ロボットの視覚装置やオートメー
ションラインの製品検査装置等を始めとして、画像の認
識装置へのニーズは強く、従来から、光学的なマッチド
フィルタを用いた相関器による認識装置の開発等が精力
的に行われてきた。さらに、最近では、ニューラルネッ
トワークを利用した認識装置の開発等も盛んである。し
かし、これらの認識装置は、何れも、シフト・回転・拡
大縮小等の変形を受けた画像をも認識できる能力、つま
り、汎化能力が低く、この汎化能力の向上が実用化の上
での最大の課題となっていた。2. Description of the Related Art There is a strong need for an image recognition device such as a visual device for an industrial robot or a product inspection device for an automation line. Conventionally, there has been a strong demand for a recognition device using a correlator using an optical matched filter. Development has been done vigorously. Further, recently, development of a recognition device using a neural network has been actively conducted. However, all of these recognition devices have a low ability to recognize an image that has been deformed by shifting, rotating, scaling, or the like, that is, the generalization ability is low, and improvement of this generalization ability is not practical. Had been the biggest challenge of.

【０００３】この課題に一つの答えを出したのが、カー
ネギーメロン大学のDavid Casasent氏等である(David C
asasent et.al."Real-time deformation invariant opt
icalpattern recognition using coordinate tranforma
tions", Appl. Opt., Vol.26, pp.938-942 (1987)参
照）。彼らの相関器を用いる方法は以下のようなもので
ある。まず、図９に示すように、入力画像１０１をコヒ
ーレント光１０２で照射し、さらにこれに計算機ホログ
ラム（ＣＧＨ：Computer-generated hologram ）１０３
により座標変換のための位相情報を重畳した上で、フー
リエ変換レンズＬ₁１０４によりフーリエ変換を行う
と、入力画像に所望の座標変換を施した情報が座標変換
面１０５上に得られる。この所望の座標変換とは、例え
ば上記文献の場合は、拡大縮小と回転の変形が入力画像
に起こっても、それらの変形がシフト量に変換される対
数極座標変換である。ここで、対数極座標変換は、入力
画像ｆ（ｘ，ｙ）をＸ＝ｌｎ（ｘ²＋ｙ²）^1/2，Ｙ＝−ａｒｃｔａｎ（ｙ
／ｘ）の距離情報と角度情報の２つの情報に変換した画像を与
える。One answer to this problem is David Casasent of Carnegie Mellon University (David C
asasent et.al. "Real-time deformation invariant opt
icalpattern recognition using coordinate tranforma
tions ", Appl. Opt., Vol.26, pp.938-942 (1987)). The method of using their correlators is as follows. First, as shown in FIG. 101 is irradiated with coherent light 102, and a computer-generated hologram (CGH) 103 is irradiated on the light 101.
When the Fourier transform lens L ₁ 104 performs the Fourier transform after superimposing the phase information for the coordinate transform by, the information obtained by performing the desired coordinate transform on the input image is obtained on the coordinate transform surface 105. In the case of the above-mentioned document, for example, the desired coordinate transformation is a logarithmic polar coordinate transformation in which even if transformations of scaling and rotation occur in the input image, those transformations are transformed into shift amounts. Here, in the logarithmic polar coordinate transformation, the input image f (x, y) is converted into X = ln (x ² + y ² ) ^1/2 , Y = −arctan (y
/ X) gives an image converted into two pieces of information of distance information and angle information.

【０００４】次に、図１０に示すように、この対数極座
標変換後の情報を液晶テレビ（ＬＣＴＶ：liquid crys
tal television）１０６上に乗せ、２つのフーリエ変換
レンズＬ₂１０７、Ｌ₃１０８からなる二重回折系と、
基準画像に同様の対数極座標変換を施した上で作られた
マッチドフィルタ（ＭＳＦ：Matched spatial filter）
１０９によるフィルタリングとを組み合わせたシフトイ
ンバリアントな相関光学系にコヒーレント光１１０で照
射することにより入射され、カメラ１１１により相関面
Ｐでの基準画像と被検画像の相関ピークを検出し、それ
らの認識を行うというものである。実際、Casasent氏等
は、上記の文献の中で、上記の方法を用いて、回転と拡
大縮小の変形に対して良好な照合結果が得られる実例を
示している。Next, as shown in FIG. 10, the information after the logarithmic polar coordinate conversion is applied to a liquid crystal television (LCTV).
a double diffraction system consisting of two Fourier transform lenses L ₂ 107 and L ₃ 108,
Matched spatial filter (MSF) created by applying the same logarithmic polar coordinate transformation to the reference image
The coherent light 110 is incident on the shift invariant correlation optical system in which the filtering by 109 is combined, and is incident. The camera 111 detects the correlation peaks of the reference image and the test image on the correlation plane P, and recognizes them. Is to do. In fact, Casasent et al. In the above-referenced document show an example in which good matching results are obtained for rotation and scaling deformations using the above method.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、Casase
nt氏等の方法では、角度方向の変換はきちんと行われて
いるが、ｌｏｇ_eｒ方向の変換は十分でない。例えば、
図３（Ａ）に示すような正方形の入力画像をシミュレー
ションにより対数極座標変換すると、図３（Ｂ）に示す
画像となるが、これは、ｒをｌｏｇ_eｒに変換するため
に、入力画像に比べてそのｌｏｇ_eｒ方向の情報が極端
に狭い領域に変換されており、この方向の特徴量が検出
される画像の中で取り出し難くなってしまっている。さ
らに、図７（Ａ）の円形の入力画像の対数極座標変換は
図７（Ｂ）のようになるが、図３（Ｂ）と比べてもその
違いは顕著でなく、特に、彼等の示す実際の光学系にお
いては、ＣＧＨの製作上の誤差や回析の影響等で、さら
にその違いは区別し難くなる。したがって、前述したよ
うなＭＳＦによるフィルタリングを用いた相関光学系を
組み合わせても、実際には認識率が低くなってしまう。[Problems to be Solved by the Invention] However, Casase
In the method of nt Mr. like, the conversion of the angular direction is performed properly, the conversion of log _e r direction is not sufficient. For example,
When a logarithmic polar coordinate transformation is performed on a square input image as shown in FIG. 3A by simulation, the image becomes as shown in FIG. 3B. This is because the input image is converted to r to log _er. compared the log _e r direction information are converted into extremely narrow area, the feature quantity in this direction would be longer difficult extraction in the image to be detected. Further, the logarithmic polar coordinate transformation of the circular input image of FIG. 7A is as shown in FIG. 7B, but the difference is not so remarkable as compared with FIG. 3B, and especially, they show. In an actual optical system, the difference becomes more difficult to distinguish due to an error in manufacturing CGH, influence of diffraction, and the like. Therefore, even if a correlation optical system using MSF filtering as described above is combined, the recognition rate actually becomes low.

【０００６】さらに、入力画像に図３（Ａ）の正方形を
拡大した図４（Ａ）の画像を用いた場合、その対数極座
標変換の画像は図４（Ｂ）となる。図３（Ｂ）での一点
鎖線近傍の強度分布を図３（Ｃ）に、図４（Ｂ）での一
点鎖線近傍の強度分布を図４（Ｃ）に示す。ここで、拡
大の変形を受けた図４（Ａ）の画像が図３（Ａ）の画像
と同一物体であると認識できるためには、図４（Ｂ）の
画像が、図４（Ａ）の画像の拡大量に合わせた分だけ図
３（Ｂ）の画像を位置ずれさせた画像と等しくなってい
なければならない。しかし、それぞれの強度分布図４
（Ｃ）、図３（Ｃ）を比較しても分かるように、図４
（Ｃ）の方が光量がはるかに多く、同じ閾値を設定して
その画像を２値化して比較した場合、両者のｌｏｇ_eｒ
方向の幅が大きく異なっており、条件を満たしていない
ことは明らかである。この場合、この条件を満足させる
ためには、入力画像の大きさに応じて閾値を変更する
か、もしくは、入力画像のｒ方向に応じて強度を調整す
る等の処理をすればよいことは容易に思いつくが、複雑
な機構となる上に、光量の大幅な損失を生じるので実用
的ではない。Further, when the image of FIG. 4 (A) obtained by enlarging the square of FIG. 3 (A) is used as the input image, the image of the logarithmic polar coordinate conversion is as shown in FIG. 4 (B). The intensity distribution in the vicinity of the alternate long and short dash line in FIG. 3 (B) is shown in FIG. 3 (C), and the intensity distribution in the vicinity of the alternate long and short dash line in FIG. 4 (B) is shown in FIG. 4 (C). Here, in order to recognize that the image of FIG. 4A that has undergone the enlargement deformation is the same object as the image of FIG. 3A, the image of FIG. 3B must be equal to the image obtained by shifting the position of the image of FIG. 3B. However, each intensity distribution map 4
As can be seen by comparing (C) and FIG. 3 (C), FIG.
In the case of (C), the light intensity is much higher, and when the same threshold value is set and the images are binarized and compared, the log _e r of both images is compared.
It is clear that the widths of the directions are so different that the conditions are not satisfied. In this case, in order to satisfy this condition, it is easy to change the threshold value according to the size of the input image or adjust the intensity according to the r direction of the input image. However, it is not practical because it has a complicated mechanism and causes a large loss of light quantity.

【０００７】以上述べてきたように、入力画像のｌｏｇ
_eｒ方向の情報を顕著に表し、さらに、画像の拡大縮小
量を位置ずれ量とし、拡大縮小の変形を受けた画像をも
正確に認識するためには、Casasent氏等の方法のままで
は、実際上困難である。As described above, the log of the input image
_{In order} to express the information in the er direction conspicuously, and to use the amount of image scaling as the amount of positional deviation and to accurately recognize images that have undergone scaling deformation, the method of Casasent et al. Practically difficult.

【０００８】本発明はこのような実情に鑑みてなされた
ものであり、その目的は、回転・拡大縮小等の変形を受
けた画像に対して十分な認識能力を有する画像処理装置
を提供することである。The present invention has been made in view of the above circumstances, and an object thereof is to provide an image processing apparatus having sufficient recognition ability for an image that has been deformed such as rotated or enlarged or reduced. Is.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決し上記目
的を達成するために、本発明の画像処理装置は、入力画
像のエッジを検出するエッジ検出手段と、略平行なコヒ
ーレント光と、このコヒーレント光を照射して前記エッ
ジ検出手段により検出されたエッジ画像を系内に入力す
るための画像入力手段と、この画像入力手段により系内
に入力された画像情報に対数極座標変換のための位相情
報を重畳するための計算機ホログラムと、位相情報が重
畳された画像情報をフーリエ変換するためのフーリエ変
換光学系と、このフーリエ変換光学系のフーリエ変換面
である対数極座標変換面に出力された対数極座標変換情
報を検出するための検出手段とを有することを特徴とす
るものである。In order to solve the above problems and achieve the above objects, an image processing apparatus according to the present invention comprises edge detection means for detecting the edges of an input image, and substantially parallel coherent light. Image input means for irradiating coherent light to input the edge image detected by the edge detection means into the system, and phase for logarithmic polar coordinate conversion to image information input into the system by this image input means A computer generated hologram for superimposing information, a Fourier transform optical system for Fourier transforming image information on which phase information is superimposed, and a logarithm output on a logarithmic polar coordinate conversion surface that is the Fourier transform surface of this Fourier transform optical system. And a detection unit for detecting polar coordinate conversion information.

【００１０】[0010]

【作用】本発明においては、例えば、図３（Ａ）の正方
形の入力画像のエッジを取れば図５（Ａ）の画像とな
る。この画像に対して対数極座標変換を行えば、図５
（Ｂ）の画像となる。一方、同様に、図７（Ａ）の円形
の入力画像のエッジ画像は図８（Ａ）のようになり、こ
れを対数極座標変換すれば、図８（Ｂ）の画像となる。
図５（Ｂ）と図８（Ｂ）における画像の違いは、図３
（Ｂ）と図７（Ｂ）の違いよりも顕著である。さら、光
学系の収差、回析等の影響を受けても、その違いははっ
きりと認識できる。したがって、その特徴量が実際には
検出不可能であった場合にも検出でき、前述の相関光学
系で認識させる場合に、その認識率は向上する。さら
に、図３（Ａ）の正方形を拡大した図４（Ａ）のエッジ
を検出した画像は図６（Ａ）となり、これを対数極座標
変換した画像は図６（Ｂ）となる。この図６（Ｂ）は、
図５（Ｂ）の画像がその拡大量に対応した量だけ位置ず
れした画像とほぼ等しくなっていることが分かる。この
ように、入力画像のエッジを取ってから対数極座標変換
をすることによって、拡大縮小の変形を受けた画像が、
基準画像に対してその拡大縮小量だけ正確に位置ずれし
た画像として変換されることができるようになり、前述
の相関光学系にこの変換像を入力すれば、従来例に比べ
て、正確な認識ができることは明らかである。さらに、
検出される相関ピークの位置が入力画像の拡大縮小の変
化量に正確に対応しているので、基準画像に対して、ど
の程度拡大しているか、又は、縮小しているかもより正
確に瞬時に分かる。In the present invention, for example, if the edges of the square input image of FIG. 3 (A) are taken, the image of FIG. 5 (A) is obtained. If logarithmic polar coordinate conversion is performed on this image, the result of FIG.
It becomes the image of (B). On the other hand, similarly, the edge image of the circular input image of FIG. 7A is as shown in FIG. 8A, and if this is subjected to logarithmic polar coordinate conversion, it becomes the image of FIG. 8B.
The difference between the images in FIG. 5B and FIG. 8B is that FIG.
This is more remarkable than the difference between (B) and FIG. 7 (B). Furthermore, even if the optical system is affected by aberration, diffraction, etc., the difference can be clearly recognized. Therefore, the feature amount can be detected even if it cannot be actually detected, and the recognition rate is improved when the feature amount is recognized by the above-mentioned correlation optical system. Further, an image in which the edges in FIG. 4A in which the square in FIG. 3A is enlarged is detected is shown in FIG. 6A, and an image obtained by logarithmic polar coordinate conversion of this is shown in FIG. 6B. This FIG. 6 (B) is
It can be seen that the image in FIG. 5B is almost equal to the image displaced by the amount corresponding to the enlargement amount. In this way, by taking the edge of the input image and then performing logarithmic polar coordinate transformation, the image that has undergone scaling deformation is
It becomes possible to convert the reference image as an image that is accurately displaced by the amount of enlargement / reduction, and if this converted image is input to the above-mentioned correlation optical system, more accurate recognition is possible than in the conventional example. It is clear that further,
Since the position of the detected correlation peak accurately corresponds to the amount of change in the scaling of the input image, it is possible to more accurately and instantly determine how much the scaling is expanding or shrinking with respect to the reference image. I understand.

【００１１】[0011]

【実施例】以下、本発明の画像処理装置の実施例１、２
について、図面を参照にして説明する。〔実施例１〕本発明の実施例１は、図１に示す光学系に
よりリアルタイムに入力画像の認識を行う画像処理装置
である。この装置は、大きく、エッジ検出光学系、対数
極座標変換光学系、相関光学系の３つの光学系で構成さ
れる。EXAMPLES Examples 1 and 2 of the image processing apparatus of the present invention will be described below.
Will be described with reference to the drawings. [Embodiment 1] Embodiment 1 of the present invention is an image processing apparatus for recognizing an input image in real time by the optical system shown in FIG. This device is roughly composed of three optical systems: an edge detection optical system, a logarithmic polar coordinate conversion optical system, and a correlation optical system.

【００１２】エッジ検出光学系では、認識対象の画像を
表示装置２０１、例えば液晶表示装置等で表示する。こ
の表示された画像にコヒーレント光３００を照射し、そ
の透過光で読み出し、さらに、空間光変調素子２０２に
光学的に書き込む。この空間光変調素子２０２におい
て、印加電圧をパルス電圧とし、その印加電圧を制御す
ることによって、書き込まれた画像のエッジ画像を読み
出し光側に表示させることができる。そこで、この表示
されているエッジ画像をビームスプリッター２０３を介
した読み出し光２０４で照射し、その反射光でエッジ画
像を読み出す。そのエッジ画像を像面２０５を介して画
像読み取り装置３０４で検出する。In the edge detection optical system, the image to be recognized is displayed on the display device 201, such as a liquid crystal display device. The displayed image is irradiated with coherent light 300, read by the transmitted light, and further optically written in the spatial light modulator 202. In the spatial light modulator 202, the applied voltage is a pulse voltage, and the applied voltage is controlled, whereby the edge image of the written image can be displayed on the reading light side. Therefore, the displayed edge image is irradiated with the reading light 204 through the beam splitter 203, and the edge image is read by the reflected light. The edge image is detected by the image reading device 304 via the image plane 205.

【００１３】次に、前述の画像読み取り装置３０４で検
出されたこのエッジ画像を液晶等の画像入力手段３０５
に表示して、対数極座標変換光学系の入力画像とする。
この光学系の構成は、図９の従来例の対数極座標光学系
と同じで、表示された画像にコヒーレント光３００を照
射し、対数極座標変換を施すＣＧＨ３０６、さらに、焦
点距離ｆ₁のフーリエ変換レンズＬ₁３０７を経ること
で、フーリエ変換レンズＬ₁３０７の後側焦点面に位置
する座標変換面３０８にｌｏｇ_eｒ−θ変換された画像
を得る。この際、得られた画像は、上述したように、入
力画像のｌｏｇ_eｒ方向の特徴を顕著に表示した画像で
あり、また、ある画像の拡大縮小の画像であれば、その
拡大縮小量の分だけ位置ずれした画像である。この画像
をＣＣＤ等の検出手段３０９で検出する。Next, the edge image detected by the image reading device 304 is used as an image input means 305 such as liquid crystal.
And the input image of the log-polar coordinate conversion optical system.
The configuration of the optical system is the same as that in the conventional example log-polar optical system of FIG. 9, is irradiated with coherent light 300 on the displayed image is subjected to log-polar transformation CGH306, further Fourier transform lens having a focal length f ₁ by going through the L ₁ 307, obtain the log _e r-theta-converted image to the coordinate conversion surface 308 located on the back focal plane of the Fourier transform lens L ₁ 307. At this time, the resulting image is, as described above, is remarkably displayed image the log _e r direction of features of the input image, also, when the image scaling of an image, the scaling amount It is an image that is misaligned by an amount. This image is detected by detection means 309 such as a CCD.

【００１４】この検出手段３０９で検出された画像を、
さらに、液晶等の表示装置３１０に表示し、相関光学系
の入力画像とする。相関光学系も、ほぼ図１０の従来例
の相関光学系と同じで、表示装置３１０と、略平行なコ
ヒーレント光３００と、これらによりこの系内に入力さ
れた座標変換情報をフーリエ変換するための焦点距離ｆ
₂のフーリエ変換レンズＬ₂３１１と、このフーリエ変
換レンズＬ₂３１１の後側焦点に置かれ認識のための基
準画像情報を重畳するためのＭＳＦ３１２と、さらに、
この重畳された情報をフーリエ変換するためのもう１枚
の焦点距離ｆ₃のフーリエ変換レンズＬ₃３１３と、こ
のもう１枚のフーリエ変換レンズＬ₃３１３の後側焦点
に置かれた相関面３１４に出力された相関ピークを検出
するための画像読み取り装置３１５で構成される。The image detected by the detecting means 309 is
Further, it is displayed on a display device 310 such as a liquid crystal and used as an input image of the correlation optical system. The correlation optical system is substantially the same as the correlation optical system of the conventional example shown in FIG. 10. The display device 310, the substantially parallel coherent light 300, and the coordinate conversion information input into this system by these are Fourier-transformed. Focal length f
₂ Fourier transform lens L ₂ 311 and MSF 312 for superimposing reference image information for recognition placed on the rear focus of this Fourier transform lens L ₂ 311;
Another Fourier transform lens L ₃ 313 having a focal length f ₃ for Fourier transforming the superimposed information, and a correlation surface 314 placed at the rear focus of the other Fourier transform lens L ₃ 313. The image reading device 315 for detecting the correlation peak output to the above.

【００１５】まず、認識の基準となる画像のＭＳＦ３１
２の作製方法について述べる。この基準となる画像は、
エッジ検出光学系の表示装置２０１に表示する。この表
示された画像は、エッジ検出光学系、対数極座標変換光
学系を経ることによって、エッジ検出、対数極座標変換
の画像処理がなされ、その画像が相関光学系に入力され
る。この相関光学系に入力された画像は、さらにフーリ
エ変換レンズＬ₂３１１でフーリエ変換され、ＭＳＦ３
１２の位置にそのフーリエ変換像を作るが、この位置に
ホログラム乾板を置き、この情報を記録してＭＳＦ３１
２とする。なお、このＭＳＦ３１２は、計算により計算
機ホログラムとして作ってももちろんよい。First, the MSF 31 of the image serving as a reference for recognition.
The manufacturing method of No. 2 will be described. This reference image is
It is displayed on the display device 201 of the edge detection optical system. The displayed image is subjected to image processing of edge detection and logarithmic polar coordinate conversion by passing through an edge detection optical system and a logarithmic polar coordinate conversion optical system, and the image is input to the correlation optical system. The image input to this correlation optical system is further Fourier-transformed by the Fourier transform lens L ₂ 311 to obtain the MSF 3
The Fourier transform image is made at the position of 12, and the hologram dry plate is placed at this position, this information is recorded, and MSF31 is recorded.
Set to 2. The MSF 312 may of course be formed as a computer hologram by calculation.

【００１６】次に、認識過程について述べる。まず、認
識対象となる画像を表示装置２０１に表示する。相関光
学系に入力された画像は、フーリエ変換レンズＬ₂３１
１でフーリエ変換され、さらに、この情報がＭＳＦ３１
２上の情報と重畳され、最後に、もう一度フーリエ変換
レンズＬ₃３１３でフーリエ変換されて、相関面３１４
に基準となる画像と認識される画像との相関がピークと
なって出力される。この相関ピークの高さは、これらの
画像の一致度を示すので、このピークの高いものを同じ
物と判断すればよい。この相関光学系では、系内のＭＳ
Ｆ３１２の位置が丁度この相関光学系への入力画像のフ
ーリエ変換面となっていることより、相関光学系への入
力画像がシフトしても、このＭＳＦ３１２の位置では同
一の画像情報となっている。したがって、前述の対数極
座標変換光学系への入力画像が、基準画像の拡大縮小し
た画像であっても、その拡大縮小量に応じた分だけ位置
ずれした画像を得ることができ、角度方向では従来と同
様の精度であるので、回転・拡大縮小の変形が入力画像
にあっても、この相関光学系の相関ピークは変形のない
基準画像同士の相関ピークと同一になることが分かる。Next, the recognition process will be described. First, the image to be recognized is displayed on the display device 201. The image input to the correlation optical system is the Fourier transform lens L ₂ 31.
Fourier transform by 1, and this information is
2 is superposed on the information, and finally, the Fourier transform is performed again by the Fourier transform lens L ₃ 313 to obtain the correlation surface 314.
The peak of the correlation between the reference image and the recognized image is output. Since the height of this correlation peak indicates the degree of coincidence of these images, the ones having a high peak may be judged to be the same. In this correlation optical system, the MS in the system
Since the position of F312 is exactly the Fourier transform plane of the input image to this correlation optical system, even if the input image to the correlation optical system is shifted, the same image information is obtained at this MSF312 position. . Therefore, even if the input image to the logarithmic polar coordinate conversion optical system is an enlarged or reduced image of the reference image, an image displaced by an amount corresponding to the amount of enlargement or reduction can be obtained. Since the accuracy is similar to, even if the rotation / enlargement / reduction deformation is present in the input image, the correlation peak of this correlation optical system is the same as the correlation peak between the reference images without deformation.

【００１７】以上から、本実施例の画像処理装置は、エ
ッジ検出光学系、対数極座標変換光学系及び相関光学系
によって、回転・拡大縮小が許容される認識となってい
ることが分かるが、前述のように、入力画像のエッジ画
像を対数極座標変換光学系の入力画像とすることによ
り、相関光学系への入力画像は認識対象の画像のｌｏｇ
_eｒ方向の特徴量を顕著に表し、その画像の拡大縮小量
に対応した量だけ位置ずれした画像となっている。した
がって、相関光学系で得られる相関ピークによる認識
は、認識率の低下を招くことなく、相関ピークの位置ず
れ量が認識対象の画像の拡大縮小量を確実に表すことが
できる。From the above, it can be seen that the image processing apparatus of the present embodiment recognizes that rotation / enlargement / reduction is permitted by the edge detection optical system, the logarithmic polar coordinate conversion optical system, and the correlation optical system. As described above, by using the edge image of the input image as the input image of the logarithmic polar coordinate conversion optical system, the input image to the correlation optical system is the log of the recognition target image.
It represents a feature value of _e r direction significantly, which is an image in which misaligned by an amount corresponding to the scale of the image. Therefore, the recognition by the correlation peak obtained by the correlation optical system can surely represent the enlargement / reduction amount of the image to be recognized by the positional shift amount of the correlation peak without lowering the recognition rate.

【００１８】ここでは、画像入力手段は液晶表示装置と
しているが、他の空間変調素子を用いてももちろん実現
可能である。Although the image input means is a liquid crystal display device here, it can be realized by using other spatial modulation elements.

【００１９】また、本実施例では、エッジ検出手段とし
て、空間光変調器の電圧を制御する手法を示したが、上
述した相関光学系と同様の光学系を用いて、ＭＳＦの代
わりに、空間周波数領域で０次光の部分をカットするフ
ィルターを用いてフィルタリングすることで、入力画像
のエッジ画像を得ることができる。さらに、エッジ検出
手段として、高速性を多少犠牲にすれば、入力画像をＣ
ＣＤ等の撮像素子で撮影した画像をフレームメモリ等で
コンピータに取り込み、コンピータ上で画像のラプラシ
アンを取ることでエッジを検出する手段を用いても実現
できる等、その他にも色々な変形が考えられる。Further, in this embodiment, as the edge detecting means, the method of controlling the voltage of the spatial light modulator has been shown. However, an optical system similar to the above-mentioned correlation optical system is used, and the space is used instead of the MSF. An edge image of the input image can be obtained by performing filtering using a filter that cuts the 0th-order light portion in the frequency domain. Further, as an edge detecting means, if the high speed is sacrificed to some extent, the input image is C
Various other modifications are conceivable. For example, it can be realized by using a means for detecting an edge by capturing an image captured by an image sensor such as a CD into a computer with a frame memory or the like and taking a Laplacian of the image on the computer. .

【００２０】〔実施例２〕本実施例は、図２に示す光学
系によりリアルタイムに入力画像の認識（パターン分
類）を行う装置である。図中のエッジ検出光学系、対数
極座標変換光学系は実施例１と全く同一要素で構成して
いる。したがって、エッジ検出・対数極座標変換された
画像をさらにフーリエ変換した画像をＣＣＤ等の画像読
み取り装置４１３で検出する。ここで、検出手段４１３
で検出される画像は、前述したものと同一であり、表示
装置２０１に入力された画像が基準画像の回転・位置ず
れ・拡大縮小されたものであっても、表示装置４１０に
より入力された画像は、焦点距離ｆ₄のフーリエ変換レ
ンズＬ₄４１１によりフーリエ変換を施すことで、フリ
ーエ変換面４１２上では基準画像を入力したときと同じ
画像が結像され、画像読み取り装置４１３で検出され
る。この画像をニューラルネットワーク４１４に入力す
る。[Embodiment 2] This embodiment is an apparatus for recognizing an input image (pattern classification) in real time by the optical system shown in FIG. The edge detection optical system and the logarithmic polar coordinate conversion optical system in the figure are composed of exactly the same elements as in the first embodiment. Therefore, an image obtained by further Fourier transforming the image subjected to edge detection / logarithmic polar coordinate conversion is detected by the image reading device 413 such as a CCD. Here, the detection means 413
The image detected by 1. is the same as that described above. Even if the image input to the display device 201 is the reference image rotated / displaced / enlarged / reduced, the image input by the display device 410 Is subjected to Fourier transform by the Fourier transform lens L ₄ 411 having the focal length f ₄ , so that the same image as that when the reference image is input is formed on the free Fourier transform surface 412 and detected by the image reading device 413. This image is input to the neural network 414.

【００２１】本実施例では、このニューラルネットワー
ク４１４としてコンピュータ上にバックプロパゲーショ
ンモデルをソフトウェアで構築した。このバックプロパ
ゲーションモデルでは、まず、学習によりニューラルネ
ットワーク４１４の各層間のシナプス荷重を決定するこ
とが必要となる。これは、認識（パターン分類）すべき
画像を表示装置２０１により順次入力することにより、
最終的にニューラルネットワーク４１４の入力層のニュ
ーロンに、それらの画像のフーリエ変換像をさらに対数
極座標変換し、それをフーリエ変換した画像情報とし
て、順次入力するが、この際、ニューラルネットワーク
４１４の出力層のニューロンの発火が認識（パターン分
類）すべき画像に対応して選択的に行われるように、逆
伝播学習則を用いて各層間のシナプス荷重を決定すれば
よい。認識（パターン分類）の際には、この各層間のシ
ナプス荷重を固定し、認識（パターン分類）すべき画像
を表示装置２０１により入力面に提示し、出力層のニュ
ーロンの発火を検出すればよい。In this embodiment, as the neural network 414, a backpropagation model is constructed by software on a computer. In this back-propagation model, it is necessary to first determine the synaptic weight between the layers of the neural network 414 by learning. This is performed by sequentially inputting images to be recognized (pattern classification) on the display device 201.
Finally, the Fourier transform images of those images are further subjected to logarithmic polar coordinate transformation to the neurons of the input layer of the neural network 414, and the resulting information is sequentially input as image information obtained by Fourier transform. At this time, the output layer of the neural network 414 is output. The back-propagation learning rule may be used to determine the synapse weights between the layers so that the firing of the neurons is selectively performed corresponding to the image to be recognized (pattern classification). At the time of recognition (pattern classification), the synapse load between the layers may be fixed, an image to be recognized (pattern classification) may be presented on the input surface by the display device 201, and firing of neurons in the output layer may be detected. .

【００２２】以上の構成により、通常のニューラルネッ
トワークのパターン分類能力に加えて、入力画像が回転
や拡大縮小の変形を受けても、これらの光学系を用いる
ことで画像読み取り装置４１３では同じ画像が得られる
ため、基準画像と同一の物体であると認識できる汎化能
力の向上が実現できることは明らかである。そして、前
実施例と同様に、認識能力はエッジ画像を用いることで
格段に向上していることは言うまでもない。また、上記
の実施例では、ニューラルネットワークとしてバックプ
ロパゲーションモデルを考えたが、ホップフィールドモ
デル等の他のモデルを用いた場合でも、同様に汎化能力
の向上が実現できることも言うまでもない。With the above-described structure, in addition to the pattern classification ability of a normal neural network, even if the input image is deformed by rotation or enlargement / reduction, by using these optical systems, the image reading device 413 can obtain the same image. Since it is obtained, it is obvious that the improvement of the generalization ability that can be recognized as the same object as the reference image can be realized. Needless to say, the recognition ability is remarkably improved by using the edge image, as in the previous embodiment. Further, in the above embodiment, the back propagation model was considered as the neural network, but it goes without saying that the generalization ability can be similarly improved even when another model such as the Hopfield model is used.

【００２３】なお、認識対象の画像を表示装置２０１に
表示しているが、認識対象画像にフーリエ変換を施し、
その画像を表示装置２０１に表示させれば、認識対象の
画像が位置ずれを生じていても、表示装置２０１に表示
される画像は同じとなる。よって、認識対象の画像が位
置ずれを生じていても、認識可能となるシステムが実現
できる。Although the image to be recognized is displayed on the display device 201, the image to be recognized is subjected to Fourier transform,
If the image is displayed on the display device 201, the image displayed on the display device 201 will be the same even if the image to be recognized is displaced. Therefore, it is possible to realize a system that enables recognition even if the image to be recognized is displaced.

【００２４】[0024]

【発明の効果】以上の説明から明らかなように、本発明
の画像処理装置は、入力画像のエッジを検出し、その画
像を次の対数極座標変換光学系の入力画像とすること
で、その出力画像においてｌｏｇ_eｒ方向の特徴量を顕
著にし、拡大縮小量を確実に検出できるので、基準画像
に対して回転・拡大縮小した画像の認識能力、及び、拡
大縮小量の検出能力が向上する。As is apparent from the above description, the image processing apparatus of the present invention detects the edge of an input image and uses that image as the input image of the next logarithmic polar coordinate conversion optical system to output the output. was significantly characteristic of log _e r direction in the image, since the scaling amount can be reliably detected, the recognition capability of the image obtained by reducing rotation and enlarged relative to the reference image, and scaling the amount of detection capability is improved.

[Brief description of drawings]

【図１】本発明の実施例１の画像処理装置の光学系を示
す図である。FIG. 1 is a diagram showing an optical system of an image processing apparatus according to a first embodiment of the present invention.

【図２】本発明の実施例２の画像処理装置の光学系を示
す図である。FIG. 2 is a diagram showing an optical system of an image processing apparatus according to a second embodiment of the present invention.

【図３】入力画像とその変換像の１例を示す図である。FIG. 3 is a diagram showing an example of an input image and a converted image thereof.

【図４】他の入力画像とその変換像を示す図である。FIG. 4 is a diagram showing another input image and its converted image.

【図５】図３の画像をエッジ検出した入力画像とその変
換像を示す図である。5 is a diagram showing an input image obtained by edge detection of the image of FIG. 3 and a converted image thereof.

【図６】図４の画像をエッジ検出した入力画像とその変
換像を示す図である。6 is a diagram showing an input image obtained by edge detection of the image of FIG. 4 and a converted image thereof.

【図７】別の入力画像とその変換像を示す図である。FIG. 7 is a diagram showing another input image and its converted image.

【図８】図７の画像をエッジ検出した入力画像とその変
換像を示す図である。8 is a diagram showing an input image obtained by edge detection of the image of FIG. 7 and a converted image thereof.

【図９】従来の対数極座標変換光学系を示す図である。FIG. 9 is a diagram showing a conventional logarithmic polar coordinate conversion optical system.

【図１０】従来の相関光学系を示す図である。FIG. 10 is a diagram showing a conventional correlation optical system.

[Explanation of symbols]

２０１…表示装置２０２…空間光変調素子２０３…ビームスプリッター２０４…読み出し光２０５…像面３００…コヒーレント光３０４…画像読み取り装置３０５…画像入力手段３０６…ＣＧＨ（計算機ホログラム）３０７…フーリエ変換レンズＬ₁ ３０８…座標変換面３０９…検出手段３１０…表示装置３１１…フーリエ変換レンズＬ₂ ３１２…ＭＳＦ（マッチドフィルタ）３１３…フーリエ変換レンズＬ₃ ３１４…相関面３１５…画像読み取り装置４１０…表示装置４１１…フーリエ変換レンズＬ₄ ４１２…フリーエ変換面４１３…画像読み取り装置４１４…ニューラルネットワーク201 ... Display device 202 ... Spatial light modulation element 203 ... Beam splitter 204 ... Readout light 205 ... Image plane 300 ... Coherent light 304 ... Image reading device 305 ... Image input means 306 ... CGH (Computer hologram) 307 ... Fourier transform lens L ₁ 308 ... coordinate transform plane 309 ... detecting means 310 ... display device 311 ... Fourier transform lens L ₂ 312 ... MSF (matched filter) 313 ... Fourier transform lens L ₃ 314 ... correlation surface 315 ... image reading apparatus 410 ... display device 411 ... Fourier Conversion lens L ₄ 412 ... Frier conversion surface 413 ... Image reading device 414 ... Neural network

Claims

[Claims]

1. An edge detecting means for detecting an edge of an input image, a substantially parallel coherent light, and an image for irradiating the coherent light and inputting the edge image detected by the edge detecting means into a system. Input means, a computer generated hologram for superimposing phase information for logarithmic polar coordinate conversion on the image information input into the system by the image input means, and a Fourier transform for Fourier transforming the image information on which the phase information is superimposed. An image processing apparatus comprising: a conversion optical system; and a detection unit for detecting logarithmic polar coordinate conversion information output to a logarithmic polar coordinate conversion surface which is a Fourier transform surface of the Fourier transform optical system.