JPH07239939A - Image recognition device

Info

Publication number: JPH07239939A
Application number: JP6034012A
Authority: JP
Inventors: Makoto Niwakawa; 誠庭川; Masakatsu Nomura; 昌克野村
Original assignee: Meidensha Corp; Meidensha Electric Manufacturing Co Ltd
Current assignee: Meidensha Corp; Meidensha Electric Manufacturing Co Ltd
Priority date: 1993-03-09
Filing date: 1994-03-04
Publication date: 1995-09-12
Anticipated expiration: 2018-11-25
Also published as: JP3470375B2

Abstract

PURPOSE: To recognize an image even when the image to be recognized differs greatly in size from the teacher image or contains much noise, while speeding up the determination process. CONSTITUTION: In the learning mode, a characteristic part is extracted from teacher image information and given to a cerebellar model computer (CMAC), which learns it. In the determination mode, determination image information is extracted and given to the CMAC, and the image information most similar to the learned teacher information is determined and output. In the learning mode, image data 11 are given to a fast Fourier transform (FFT) unit 12 to obtain prescribed frequency components, a filter 13 extracts specific components, and these are given to a CMAC unit 15, which learns them together with a code number assigned to the image. In the determination mode, the image data are given to an FFT unit 17 to obtain prescribed frequency components, which are passed to the CMAC unit 15 via a scaling unit 18 that expands or contracts the image, and a determination unit 19 makes the determination. The code number of the most similar image among those learned in the learning mode is output.

Description

Detailed Description of the Invention

[0001]

[Industrial Field of Application] The present invention relates to an image recognition device using a cerebellar model computer (CMAC).

[0002]

[Prior Art] In recent years, means have appeared for performing image recognition with neural networks (hereinafter, NN). Because image recognition devices using an NN are known to require very long learning times, processing is usually carried out either by using binary images as image preprocessing or by using a particularly fast CPU.

[0003]

[Problems to Be Solved by the Invention] The NN-based image recognition device described above begins learning by treating known images as unknown, so the learning time becomes extremely long and recognition cannot be performed at high speed.

[0004] The present invention was made in view of the above circumstances. Its object is to provide an image recognition device that speeds up the determination process and can still recognize an image even when the image to be recognized deviates greatly from the teacher image or contains much noise.

[0005]

[Means for Solving the Problems] To achieve the above object, the first invention has a learning mode, in which teacher image information is input and learned, and a determination mode, in which determination image information to be compared against the image information learned in the learning mode is input. In the learning mode, a characteristic part of the image is extracted from the teacher image information, an arbitrary code number is assigned to the image, and the code number and the extracted image are input to a cerebellar model computer so that it learns the teacher image information. In the determination mode, the input determination image information is extracted and input to the cerebellar model computer, and the image information most similar to the learned teacher image information is determined and output.

[0006] The second invention is characterized in that a fast Fourier transform is applied to the teacher image information and the determination image information so that Cartesian-coordinate and polar-coordinate information is obtained as output.

[0007] The third invention is characterized in that the Cartesian-coordinate information obtained from the determination image information is input to the cerebellar model computer via an image scaling unit.

[0008] In the fourth invention, in addition to the teacher image information itself, teacher image information rotated little by little is extracted and input to the cerebellar model computer together with code numbers, so that the teacher image information is learned.

[0009] The fifth invention is characterized in that, in the determination mode, the determination image information is input to the cerebellar model computer via an image rotation unit.

[0010] The sixth invention adds an image scaling unit to the determination mode of the fourth invention.

[0011] The seventh invention is characterized in that the determination image information is transformed by fast Fourier transform and then input to the image rotation unit.

[0012] The eighth invention is characterized in that the teacher image information is transformed by fast Fourier transform, a power spectrum is obtained from the transformed information, the frequency at which the power spectrum is maximal is normalized by a normalization unit, and the characteristic part of the image is extracted from the normalized information.

[0013] The ninth invention is characterized in that the determination image information is transformed by fast Fourier transform, a power spectrum is obtained from the transformed information, and the frequency at which the power spectrum is maximal is normalized by a normalization unit and extracted.

[0014]

[Operation] In the learning mode, a characteristic part of the image is first extracted from the teacher image information. A code number is assigned to the extracted image, and the code number and the extracted image are input to the cerebellar model computer, which learns the teacher image information. Then, in the determination mode, it is determined how closely the image information under determination resembles the learned image information. In the second invention, polar-coordinate information is obtained because the image may be rotated. In the third invention, the image is enlarged or reduced. In the fourth invention, images rotated little by little are learned beforehand as teacher image information, so that the rotation state of an image is detected without using polar-coordinate information. In the fifth, sixth, and seventh inventions, the image rotation unit rotates the determination image information by ±90° and 180° in the determination mode. In the eighth and ninth inventions, normalizing the information allows recognition even if the position of the target relative to the camera shifts or the illumination changes.

[0015]

[Embodiments] Embodiments of the invention are described below with reference to the drawings. First, the cerebellar model computer (hereinafter, CMAC) is described. CMAC stands for Cerebellar Model Arithmetic Computer; it is a formalization of a mathematical model of the information-processing mechanism of the many neurons in the cerebellar cortex. A CMAC is defined by the following series of mappings.

[0016] $S \to M \to A \to P$, where $S$ is the input vector, $M$ is the set of mossy fibers used to encode $S$, $A$ is the set of granule cells to which $M$ connects, and $P$ is the output value.

[0017] Fig. 9 is a conceptual diagram of the CMAC. Reference numeral 1 denotes the set of input signals, to which control target values and feedback signals are input. The set of input signals 1 issues a weight selection signal, which is applied to predetermined positions in the weight table 2. The outputs of the weight table 2 are summed by the summation unit 3 and output. The deviation detector 4 takes the deviation between the output of the summation unit 3 and the target output value, which serves as the teacher signal, and the weights are adjusted in the summation unit 3 according to that deviation. Fig. 9 also indicates where each of the mappings above takes place.
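The table-lookup structure of Fig. 9 can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation disclosed here: the class name `TinyCMAC`, the one-dimensional input in [0, 1), the number of tilings and bins, and the learning rate are all hypothetical choices made for brevity. Each input selects one weight per tiling (the selection signal into the weight table 2), the selected weights are summed (summation unit 3), and the deviation from the target (deviation detector 4) is spread evenly over the selected weights.

```python
class TinyCMAC:
    """Table-lookup CMAC for a 1-D input in [0, 1); all sizes are illustrative."""

    def __init__(self, n_tilings=8, n_bins=16, rate=0.5):
        self.n_tilings = n_tilings
        self.n_bins = n_bins
        self.rate = rate
        # One row of weights per tiling; one extra bin because offsets shift the index.
        self.weights = [[0.0] * (n_bins + 1) for _ in range(n_tilings)]

    def _active_cells(self, s):
        # Each tiling quantizes the input with a slightly different offset,
        # so nearby inputs share most of their selected weights.
        cells = []
        for k in range(self.n_tilings):
            offset = k / (self.n_tilings * self.n_bins)
            cells.append((k, int((s + offset) * self.n_bins)))
        return cells

    def predict(self, s):
        # Output is the sum of the selected weights.
        return sum(self.weights[k][i] for k, i in self._active_cells(s))

    def train(self, s, target):
        # Spread a fraction of the deviation evenly over the selected weights.
        error = target - self.predict(s)
        step = self.rate * error / self.n_tilings
        for k, i in self._active_cells(s):
            self.weights[k][i] += step
        return error
```

Repeatedly calling `train(0.3, 60.0)` drives the output at 0.3 toward 60 while leaving the output at a distant input such as 0.9 unchanged, because the two inputs select disjoint weights.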

[0018] The CMAC exploits the property that a higher-order vector can generally be expressed as a mapping onto a lower-order space. In other words, a higher-order vector can be represented by several lower-order vectors, and only low-level functionality is required of those lower-order vectors. In general, a function can be regarded as a mapping from the set of states defined by the independent variables to the set of states of the dependent variables, written $f: C \to E$. This is read as "f is a relation that maps set C into set E". Fig. 10 illustrates the concept. As Fig. 10 shows, for any state in set C, one state in set E can be found through the relation f. Several points in set C may also be mapped to a single point in set E.

[0019] Now let $h$ be the operator, regarded as a function, that maps the input $S = (s_1, s_2, s_3, \ldots, s_n)$ to the output $P$. Then $h$ can be expressed as follows.

[0020] $P = h(S)$ or $P = h(s_1, s_2, s_3, \ldots, s_n)$. Fig. 11 depicts this schematically. When the output is a vector $P$, it is expressed as an operator, or as a set of operators $h$, as in Fig. 12.

[0021] As described above, because an operator maps an input to an output through some function, it can describe the behavior of a single neuron as well as that of a group of neurons. For example, to describe a single neuron, taking the input to the neuron as a vector and the output as a scalar gives $P = h(S)$. For a group of neurons, taking the input as a vector gives $P = H(S)$.

[0022] Fig. 1 shows a first embodiment in which the CMAC described above is applied to an image recognition device. In Fig. 1, 11 denotes image data. The image data 11 are fed to an image recognition device that has two modes, a learning mode and a determination mode, and the image is first learned in the learning mode. In the learning mode, a fast Fourier transform unit (FFT unit) 12 and a filter 13 preprocess the image data. The preprocessed image data are fed to a CMAC unit 15, which forms part of a controller 14. A deviation detector 16 detects the deviation between the output of the CMAC unit 15 and the arbitrary code number assigned to the image, and the CMAC unit 15 is made to learn the image.

[0023] Next, in the determination mode, it is determined how closely the image under determination resembles the learned images. In the determination mode, the image is preprocessed by an FFT unit 17 and a scaling unit 18 and then fed to the CMAC unit 15. Fig. 1 draws the CMAC unit separately for the learning mode and the determination mode, but this is only for convenience of explanation; both are the same CMAC unit. The CMAC unit 15 processes how closely the image under determination resembles the learned images, and a determination unit 19 evaluates the result and outputs the code number of the most similar image.

[0024] The operation of the embodiment configured as above is now described, starting with the learning mode. A power spectrum (hereinafter, PS) is obtained by the FFT unit 12 from a teacher image, for example one captured on video. The filter 13 passes only those PS components that are characteristic of the image. An arbitrary code number is then assigned to the image, and the CMAC unit 15 learns the image using the PS and the code number.
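As a rough illustration of the power spectrum step, the sketch below computes $|F(u, v)|^2$ for a small image with a direct two-dimensional discrete Fourier transform. The function name `power_spectrum` and the list-of-rows image format are assumptions, and the direct transform stands in for an FFT purely to keep the example dependency-free; it produces the same values as an FFT but is far slower on real images.

```python
import cmath


def power_spectrum(image):
    """Return |F(u, v)|^2 for a small 2-D image given as a list of rows."""
    rows, cols = len(image), len(image[0])
    spectrum = []
    for u in range(rows):
        out_row = []
        for v in range(cols):
            acc = 0j
            # Direct DFT sum over every pixel (an FFT gives the same result faster).
            for y in range(rows):
                for x in range(cols):
                    angle = -2 * cmath.pi * (u * y / rows + v * x / cols)
                    acc += image[y][x] * cmath.exp(1j * angle)
            out_row.append(abs(acc) ** 2)
        spectrum.append(out_row)
    return spectrum
```

One reason a power spectrum is a convenient input is that it discards phase, so a circularly shifted copy of an image yields the same spectrum as the original.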

[0025] Next, in the determination mode, the PS is obtained from the determination image by the FFT unit 17. When the image is to be scaled, the scaling unit 18 expands or contracts the PS before it is fed to the CMAC unit 15. From the resulting output, the determination unit 19 selects the image most similar to the learned images and outputs its code number. The controller 14 switches between the learning mode and the determination mode, and increases or decreases the number of CMAC units 15 according to the kinds of teacher images.

[0026] Next, the FFT units 12a, 12b and 17a, 17b, the filters 13a, 13b, the scaling unit 18, the CMAC unit 15, and the determination unit 19 are described in detail with reference to Fig. 2. In Fig. 2, the FFT units 12a, 12b and 17a, 17b obtain the PS components of the image, namely the Cartesian component $P(\omega_x, \omega_y)$ and the polar component $P(\omega_r, \omega_\theta)$. The filters 13a, 13b pass the PS components that satisfy $|P_i(\omega_x, \omega_y) - P_j(\omega_x, \omega_y)| >$ quantization interval for $i = 1, 2, \ldots, z-1$ and $j = i+1, \ldots, z$. Here $z$ is the number of teacher images, $P_1(\omega_x, \omega_y)$ to $P_z(\omega_x, \omega_y)$ are the spectra of images 1 to $z$, and the quantization interval is a CMAC parameter.
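The filter condition can be sketched as a boolean mask over frequency components. The sketch reads the condition as "every pair of teacher spectra must differ by more than the quantization interval at that component"; that all-pairs reading, the function name `discriminative_mask`, and the list-of-rows spectrum format are assumptions made for illustration.

```python
from itertools import combinations


def discriminative_mask(spectra, quantization_interval):
    """spectra: list of z equally sized 2-D power spectra (lists of rows).

    Returns a mask that is True where all pairs of spectra differ by more
    than the quantization interval, i.e. where the CMAC can tell them apart.
    """
    rows, cols = len(spectra[0]), len(spectra[0][0])
    mask = [[True] * cols for _ in range(rows)]
    for p_i, p_j in combinations(spectra, 2):  # all pairs with i < j
        for y in range(rows):
            for x in range(cols):
                if abs(p_i[y][x] - p_j[y][x]) <= quantization_interval:
                    mask[y][x] = False
    return mask
```

With two one-row spectra `[[10.0, 5.0]]` and `[[1.0, 5.2]]` and a quantization interval of 1.0, only the first component passes, since the second differs by less than the interval.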

[0027] The scaling unit 18 scales $P(\omega_x, \omega_y)$ to $P(\omega_a, \omega_b)$ according to the following relations and inputs $P(\omega_a, \omega_b)$ to the CMAC: $\omega_a = t \cdot \omega_x$, $\omega_b = t \cdot \omega_y$, and $P(\omega_a, \omega_b) = P(\omega_x, \omega_y)$, where $t$ is the scaling coefficient.
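A discrete version of this relation might look like the sketch below. It assumes the spectrum is stored as a grid of non-negative frequency indices with the origin at index 0, uses nearest-neighbour lookup, and fills components whose source falls outside the grid with zero; none of these details, nor the name `scale_spectrum`, come from the description above.

```python
def scale_spectrum(spectrum, t):
    """Resample so that scaled[t*wy][t*wx] is approximately spectrum[wy][wx]."""
    rows, cols = len(spectrum), len(spectrum[0])
    scaled = [[0.0] * cols for _ in range(rows)]
    for b in range(rows):
        for a in range(cols):
            # Invert w_a = t * w_x and w_b = t * w_y, rounding to the nearest index.
            src_y = int(b / t + 0.5)
            src_x = int(a / t + 0.5)
            if src_y < rows and src_x < cols:
                scaled[b][a] = spectrum[src_y][src_x]
    return scaled
```

With `t = 2`, the value stored at index 1 of the input appears at index 2 of the output; with `t = 1` the spectrum is returned unchanged.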

[0028] In the learning mode, each CMAC unit has a three-dimensional configuration with three inputs and one output. Processing proceeds as follows. The $P(\omega_x, \omega_y)$ and $P(\omega_r, \omega_\theta)$ obtained by the filters 13a, 13b are input to CMAC units 1 to $n$, giving outputs $O_1(x, y)$ to $O_n(x, y)$ and $O_1(r, \theta)$ to $O_n(r, \theta)$. The absolute value of the difference between this output and the code number is then learned.

[0029] Next, in the determination mode, the CMAC units receive $P(\omega_x, \omega_y)$ and $P(\omega_r, \omega_\theta)$ obtained by the FFT units 17a, 17b and $P(\omega_a, \omega_b)$ obtained by the scaling unit 18, and produce outputs $O_n(x, y)$, $O_n(r, \theta)$, and $O_n(a, b)$.

[0030] The determination unit 19 finds, from the outputs $O_n(x, y)$, $O_n(r, \theta)$, and $O_n(a, b)$, the code number closest to a learned image. Specifically, a frequency distribution is computed for the output $O_1(x, y)$ of the CMAC unit shown at the bottom of Fig. 2, and its mode $O_{1\max}$ and the standard deviation $\sigma_1$ about $O_{1\max}$ are obtained. In the same way, $O_{2\max}(x, y)$ to $O_{n\max}(x, y)$ and $\sigma_2(x, y)$ to $\sigma_n(x, y)$ are obtained for the remaining outputs $O_2(x, y)$ to $O_n(x, y)$.

[0031] Likewise, for $O_n(r, \theta)$ and $O_n(a, b)$, the values $O_{1\max}(r, \theta)$ to $O_{n\max}(r, \theta)$ and $\sigma_1(r, \theta)$ to $\sigma_n(r, \theta)$, and $O_{1\max}(a, b)$ to $O_{n\max}(a, b)$ and $\sigma_1(a, b)$ to $\sigma_n(a, b)$, are obtained. Finally, among $\sigma_i(x, y)$, $\sigma_i(r, \theta)$, and $\sigma_i(a, b)$ for $i = 1, 2, \ldots, n$, the mode $O_i$ belonging to the smallest standard deviation is taken as the recognition result.
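The selection rule of the determination unit can be sketched as follows. The text does not give a formula for the standard deviation about the mode, so the sketch assumes the root-mean-square deviation of the outputs from the mode; rounding outputs to integers before counting, and the names `mode_and_spread` and `recognize`, are likewise assumptions.

```python
from collections import Counter
from math import sqrt


def mode_and_spread(outputs):
    """Return (mode, RMS deviation of the outputs about that mode)."""
    rounded = [round(o) for o in outputs]  # bin outputs to integer code numbers
    mode = Counter(rounded).most_common(1)[0][0]
    spread = sqrt(sum((o - mode) ** 2 for o in outputs) / len(outputs))
    return mode, spread


def recognize(outputs_per_unit):
    """Pick the (mode, spread) pair with the smallest spread over all CMAC units."""
    return min((mode_and_spread(o) for o in outputs_per_unit), key=lambda ms: ms[1])
```

For example, a unit whose outputs cluster tightly around 60 wins over a unit whose outputs scatter widely around 120, so the recognized code number is 60.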

[0032] The case in which the above embodiment is used to recognize the glove images shown in Fig. 3A is described next.

[0033] (1) Learning mode: The teacher images of the gloves in Fig. 3A are learned by the CMAC unit with code numbers 60 and 120, respectively.

[0034] (2) Determination mode: The determination image of the glove in Fig. 3B is fed to the CMAC unit, and the frequency distribution of the resulting output is the solid line in Fig. 4. The dotted line in Fig. 4 is the corresponding frequency distribution for an image that was not learned. Comparing the two, the solid line has a peak at code number 60, the number used in learning, whereas the dotted line has a less pronounced peak than the solid line. The image can therefore be determined easily from the standard deviation about the mode 60 of the solid line, and code number 60 is obtained as the recognition result.

[0035] Fig. 5 is a block diagram of a second embodiment of the invention; parts identical to those in Figs. 1 and 2 carry the same reference numerals. The second embodiment of Fig. 5 omits the means for obtaining the polar component; instead, in the learning mode a single image is rotated little by little and each rotated version is learned together with a code number. Image data 11 are learned by CMAC unit 15a in the same way as in the preceding embodiment. Image data 21 are image data 11 rotated by θ degrees, and their PS is likewise obtained through the FFT unit 12a and the filter 13a. This PS and a code number (for example, 70) are learned by CMAC unit 15b. The same processing is repeated for image data 21 rotated successively in steps of θ degrees, with the PS obtained for each and learned by CMAC units 15c to 15n. For example, the processing is performed n = 90°/θ times. With a rotation step of 10 degrees, 90/10 = 9 rotated images are learned per image, each by one of nine CMAC units.

[0036] Once one kind of image data 11, 21 has been learned in this way, image data 21 rotated successively by θ degrees are learned by the CMAC units together with a new code number. The filter 13a has the same characteristics for every kind of image and every rotation.

[0037] The determination mode is the same as in the preceding embodiment except that no polar component is obtained: the PS of image data 11 is scaled by the scaling unit 18 and input to the CMAC units, and the image is determined from the resulting outputs $O_{(n,z)}$. First, as in the learning mode, the image is transformed by FFT to obtain the PS. The same PS is input to all of the CMAC units 15a to 15n, giving outputs $O_{(1,1)}$ to $O_{(n,1)}$. This corresponds to obtaining the underlined part of the following matrix expression.

[0038]

[Equation 1]

[0039] The parts of expression (1) other than the underlined part are obtained in the same way, by inputting the PS scaled by a factor of z to the CMAC units 15a to 15n.

[0040] Fig. 6 illustrates how the scaling unit 18 scales $P$ to $P_z$ at scaling ratio $z$. The relation is expressed as follows.

[0041] $P_z(z \cdot \omega_x, z \cdot \omega_y) = P(\omega_x, \omega_y)$, where the scaling ratio $z$ ranges from 0.5 to 2.

[0042] The determination unit 19 outputs the most reliable code number from the matrix of expression (1), and outputs zero when no code number is reliable.

[0043] First, from the output $O_{(1,1)}$, the mode $m_{(1,1)}$ of its histogram (hereinafter, the support code) and the risk value $\sigma_{(1,1)}$ are obtained. That is, the histogram of the output $O_{(1,1)}$ shown in the determination unit 19 of Fig. 5 is built from the output, and its support code $m_{(1,1)}$ is found. The risk value $\sigma_{(1,1)}$ for $m_{(1,1)}$ is then obtained from the following expression.

[0044]

[Equation 2]

[0045] After the support codes $m_{(n,1)}$ and risk values $\sigma_{(n,1)}$ have been obtained from the outputs $O_{(n,1)}$ in this way, the support codes $m_{(n,z)}$ and risk values $\sigma_{(n,z)}$ are obtained from the outputs $O_{(n,z)}$ in the same manner. Next, the matrix of risk values $\sigma_{(n,z)}$ is formed as in the following expression, and the smallest risk value $\sigma_{\min}$ is found.

[0046]

[Equation 3]

[0047] Depending on the smallest risk value $\sigma_{\min}$, the determination result output is either the support code $m_{(n,z)}$ or zero.

[0048] When $\sigma_{\min} \le S$, the risk value is small, and the determination result is the corresponding support code $m_{(n,z)}$.

[0049] When $\sigma_{\min} > S$, the risk value is large, and the determination result is zero. $S$ is a threshold for deciding whether a determination can be made, and is found experimentally.
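The threshold rule above can be sketched as a search over the matrix of risk values followed by a comparison with $S$. The function name `decide`, the nested-list layout indexed by CMAC unit and scaling ratio, and the numbers in the usage note are illustrative assumptions.

```python
def decide(support_codes, risk_values, threshold):
    """support_codes and risk_values: equally shaped 2-D lists indexed [unit][scale].

    Returns (result, sigma_min), where result is the support code with the
    smallest risk value, or 0 when that risk value exceeds the threshold.
    """
    best_code, sigma_min = 0, float("inf")
    for codes_row, risks_row in zip(support_codes, risk_values):
        for code, risk in zip(codes_row, risks_row):
            if risk < sigma_min:
                best_code, sigma_min = code, risk
    return (best_code if sigma_min <= threshold else 0), sigma_min
```

With support codes `[[60, 70], [80, 90]]` and risk values `[[12.0, 9.5], [4.2, 15.0]]`, a threshold of 5.0 yields code 80, while a threshold of 3.0 yields zero because even the smallest risk value, 4.2, is too large.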

[0050] Using the embodiment configured as above, images of the empty cans in Figs. 7A, 7B, and 7C were learned, and determination was then performed on images similar to Figs. 7A, 7B, and 7C; Fig. 8 shows the histogram for each risk value $\sigma_{\min}$. In Fig. 8, the risk value is $\sigma_{\min} = 30.5$ for can A (Fig. 7A), $\sigma_{\min} = 31.0$ for can B (Fig. 7B), and $\sigma_{\min} = 28.7$ for can C (Fig. 7C). The histograms of cans B and C are drawn with their classes shifted by +10 each, in order starting from can B.

[0051] In the above embodiment, no FFT for obtaining the polar component is needed, so image determination is faster; and because determination results of low reliability are treated as undeterminable, the likelihood of misjudgment is reduced. In addition, the reliability of a determination result can be expressed numerically, which makes processing easier.

[0052] Fig. 13 is a block diagram of the overall configuration of a third embodiment of the invention, and Fig. 14 is an explanatory diagram of its detailed configuration; parts identical to those in Figs. 1, 2, and 5 carry the same reference numerals. In the embodiment of Figs. 13 and 14, an image rotation unit 22 is provided in the determination mode. Like the preceding embodiments, the third embodiment has two modes, a learning mode and a determination mode: images are learned beforehand in the learning mode, and in the determination mode it is determined how closely the image under determination resembles the learned images.

[0053] In the learning mode, as in the preceding embodiments, a teacher image captured on video is transformed by FFT to obtain the PS. A filter then passes only the PS components characteristic of the image. An arbitrary code number is assigned to the image, and the CMAC unit 15 learns from the PS and this code number.

[0054] In the determination mode, the determination image is transformed by FFT to obtain the PS. The PS is then rotated and fed to the CMAC unit 15, and the determination unit 19 evaluates the output. The determination result is the code number of the most similar image among the learned images. The PS is rotated by ±90° and 180°.

[0055] As for the functional blocks, the controller 14, as in the preceding embodiments, switches between the learning mode and the determination mode and increases or decreases the number n of CMAC units according to the kinds of teacher images. The FFT units 12a, 12b, and 17a operate as in the embodiment of Fig. 2.

[0056] The image rotation unit 22 obtains the PS for rotations of 0°, 90°, −90°, and 180° as follows.

[0057]

$P_0(\omega_x, \omega_y) = P(\omega_x, \omega_y)$
$P_{90}(\omega_x, \omega_y) = P(-\omega_x, \omega_y)$
$P_{-90}(\omega_x, \omega_y) = P(\omega_x, \omega_y)$
$P_{180}(\omega_x, \omega_y) = P(\omega_x, -\omega_y)$

These four PS are used as the inputs to the CMAC units 15a, 15b, and so on.

[0058] In the learning mode, each CMAC unit has a three-dimensional configuration with three inputs and one output. In the processing flow, $P(\omega_x, \omega_y)$ obtained by the filters 13a, 13b is input in turn to CMAC units 1 to $n$, giving outputs $O_1(x, y)$ to $O_n(x, y)$. The absolute value of the difference between this output and the code number is then learned.

[0059] In the determination mode, $P(\omega_x, \omega_y)$ in its four forms [$P_0$, $P_{90}$, $P_{-90}$, $P_{180}$] is input in turn to the CMAC units, giving outputs $O_{0n}$, $O_{90n}$, $O_{-90n}$, and $O_{180n}$, where $n$ is the number of CMAC units. The determination unit 19 finds, from the outputs $O_{0n}$, $O_{90n}$, $O_{-90n}$, and $O_{180n}$, the code number closest to a learned image. As shown in Fig. 14, the determination unit 19 computes a frequency distribution for the CMAC output $O_{01}$ and obtains its mode $O_{1\max}$ and the standard deviation $\sigma_1$ about this mode. In the same way, the modes $O_{2\max}(x, y)$ to $O_{n\max}(x, y)$ and the standard deviations $\sigma_1(x, y)$ to $\sigma_n(x, y)$ are obtained for the remaining outputs $O_{02}$ to $O_{0n}$. The modes and standard deviations of the outputs $O_{90n}$, $O_{-90n}$, and $O_{180n}$ are obtained likewise. Finally, the mode $O_i$ with the smallest standard deviation is taken as the recognition result.

[0060] The embodiment of FIGS. 13 and 14 differs from the second embodiment of FIG. 5 as follows. In the second embodiment shown in FIG. 5, the rotation of an object in the image is determined from the polar-coordinate FFT of the image, so the FFT must be performed twice per image. The second embodiment of FIG. 5 is also designed to handle any rotation, whereas the embodiment shown in FIGS. 13 and 14 limits the changes in posture of the target object to the cases that occur in practice: the target lying on its side (90° rotation) or upside down (180° rotation). In this embodiment, therefore, recognition is possible with one learning pass and one FFT, without a polar-coordinate FFT and without learning repeatedly from rotated teacher images. Because polar coordinates are not needed, the image determination processing becomes faster.

[0061] FIG. 15 is a block diagram showing the overall configuration of a fourth embodiment of the present invention, and FIG. 16 is an explanatory diagram showing the detailed configuration of the fourth embodiment. Parts identical to those in FIGS. 1, 2, and 5 are given the same reference numerals. In the embodiment shown in FIGS. 15 and 16, normalization units 23, 23a, 23b, and 24 are provided to normalize the outputs of the FFT units 12, 17 and 12a, 12b, 17a after they transform the teacher image and determination image information. Like the preceding embodiments, the fourth embodiment has two modes, a learning mode and a determination mode: images are learned in advance in the learning mode, and in the determination mode the device judges how similar the image under determination is to the learned images.

[0062] In the learning mode, as in the preceding embodiments, the teacher image information captured on video is subjected to FFT to obtain the power spectrum PS. The frequency at which this PS is maximum is then normalized by the normalization units 23, 23a, 23b and output. This output is passed through the filters 13, 13a, 13b, which pass only the PS characteristic of the image. An arbitrary code number is assigned to the image, and the CMAC unit 15 learns from the PS and this code number. In the determination mode, the determination image is subjected to FFT to obtain the PS. Then, as above, the frequency at which the PS is maximum is normalized by the normalization unit 24 and output. The output PS is input to the CMAC unit 15, and its output is judged by the determination unit 19. The determination result is the code number of the most similar image among the learned images.
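The first step of both modes, obtaining the power spectrum PS from an image, can be sketched in plain Python. This is an illustration only: the function name `power_spectrum` is hypothetical, and a naive discrete Fourier transform is used in place of an FFT (it gives the same values but is far slower), so it is suitable only for very small images.

```python
import cmath


def power_spectrum(image):
    """Return {(wx, wy): |F(wx, wy)|^2} for a small 2-D image (list of rows)."""
    rows, cols = len(image), len(image[0])
    ps = {}
    for u in range(rows):            # vertical frequency index (wy)
        for v in range(cols):        # horizontal frequency index (wx)
            total = 0j
            for y in range(rows):
                for x in range(cols):
                    angle = -2 * cmath.pi * (u * y / rows + v * x / cols)
                    total += image[y][x] * cmath.exp(1j * angle)
            ps[(v, u)] = abs(total) ** 2
    return ps


# A 2x2 image with vertical stripes: energy appears only at wy = 0.
stripes = power_spectrum([[1, 0], [1, 0]])
```

In practice an FFT routine from a numerical library would replace the four nested loops; the dictionary layout keyed by (ωx, ωy) is simply a convenient assumption for the sketch.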

[0063] In each functional block, as in the preceding embodiments, the controller 14 switches between the learning mode and the determination mode and increases or decreases the number n of CMAC units according to the types of teacher images. After the FFT units 12, 17 and 12a, 12b, 17a obtain P(ωx, ωy), the PS component of the image, the normalization units 23, 23a, 23b, 24 normalize with respect to the frequency (ωxmax, ωymax) at which the PS is maximum, as follows.

[0064]
Ωx = ωx / ωxmax
Ωy = ωy / ωymax
Pn(Ωx, Ωy) = P(ωx, ωy) / P(ωxmax, ωymax)
The output normalized by the normalization unit is input to the filters 13, 13a, 13b, which pass only the PS satisfying the following expression. The PS that passes is input to the CMAC units 15a, 15b, ...
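The normalization above can be sketched as a short Python function. The name `normalize_spectrum` and the dictionary representation of the spectrum are assumptions for illustration; the sketch also assumes the caller has already excluded the DC term, since a peak on either axis would make the division undefined.

```python
def normalize_spectrum(ps):
    """Map {(wx, wy): P} to {(Wx, Wy): Pn} using the peak frequency and power."""
    # Locate the frequency (wxmax, wymax) at which the power spectrum is largest.
    (wx_max, wy_max), p_max = max(ps.items(), key=lambda item: item[1])
    if wx_max == 0 or wy_max == 0 or p_max == 0:
        raise ValueError("peak lies on an axis or has zero power")
    return {(wx / wx_max, wy / wy_max): p / p_max
            for (wx, wy), p in ps.items()}


# Hypothetical spectra of the same object: 'far' is the reference view,
# 'near' has every frequency doubled and every power tripled.
far = {(1, 1): 2.0, (2, 4): 8.0, (4, 8): 4.0}
near = {(2, 2): 6.0, (4, 8): 24.0, (8, 16): 12.0}
same = normalize_spectrum(far) == normalize_spectrum(near)
```

Dividing the frequencies by the peak frequency cancels a uniform frequency scaling, and dividing the power by the peak power cancels a uniform brightness scaling, so both hypothetical spectra normalize to the same result.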

[0065]
|Pni(Ωx, Ωy) − Pnj(Ωx, Ωy)| > quantization interval (i = 1, 2, ..., z−1; j = i+1, ..., z)
where z is the number of teacher images, Pn1(Ωx, Ωy) to Pnz(Ωx, Ωy) are the spectra of images 1 to z, and the quantization interval is a CMAC parameter.
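The filter condition can be sketched as a pairwise comparison over the teacher spectra. The function name is hypothetical, and the sketch assumes the condition must hold for every pair (i, j) of teacher images at a given frequency; the text does not state whether every pair or only some pair is required.

```python
from itertools import combinations


def discriminative_frequencies(spectra, quantization_interval):
    """Return the frequencies at which every pair of teacher spectra differs
    by more than the quantization interval.

    spectra: list of dicts {(Wx, Wy): Pn}, one per teacher image.
    """
    common = set(spectra[0]).intersection(*spectra[1:])
    keep = []
    for freq in sorted(common):
        values = [s[freq] for s in spectra]
        if all(abs(a - b) > quantization_interval
               for a, b in combinations(values, 2)):
            keep.append(freq)
    return keep


# Two hypothetical normalized teacher spectra.
teacher_1 = {(1.0, 1.0): 1.0, (0.5, 0.5): 0.2}
teacher_2 = {(1.0, 1.0): 1.0, (0.5, 0.5): 0.6}
passed = discriminative_frequencies([teacher_1, teacher_2], 0.1)
```

Frequencies at which the teacher spectra differ by less than one quantization step would fall into the same CMAC cell and could not separate the images, which is presumably why the threshold is tied to the CMAC parameter.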

[0066] The CMAC units have the same configuration as in the preceding embodiments; as shown in FIG. 16, in both the learning mode and the determination mode each has a three-dimensional configuration with three inputs and one output. In the learning mode shown in FIG. 16, the Pn(Ωx, Ωy) obtained by the filters 13a and 13b is input sequentially to CMAC units No. 1 through No. n, yielding the outputs O₁(x, y) to Oₙ(x, y). The absolute value of the difference between each output and the code number is then learned. In the determination mode, Pn(Ωx, Ωy) is input sequentially to the CMAC units, yielding the outputs O_no (where n is the number of CMAC units).

[0067] As in the third embodiment, the determination unit 19 obtains from the outputs Oₙ the code number closest to the learned image. Specifically, it proceeds as follows. For the output O₀₁ of the CMAC units 15a, 15b, ... in the determination mode of FIG. 16, the frequency distribution is computed, and the mode O₁ₘₐₓ(x, y) and the standard deviation σ₁ about this mode are obtained. Likewise, for the remaining outputs O₀₂ to O₀ₙ, the modes O₂ₘₐₓ(x, y) to Oₙₘₐₓ(x, y) and the standard deviations σ₁(x, y) to σₙ(x, y) are obtained. Finally, the mode Oᵢ with the smallest standard deviation is taken as the recognition result.

[0068] As described above, providing the normalization units in the fourth embodiment has the advantage that the target can be recognized even if its position relative to the camera shifts and even if the illumination changes.

[0069]

[Effects of the Invention] As described above, according to the present invention, the first to third inventions perform image determination faster than a neural network (NN) would, by virtue of the prior learning, and make the determination simple; they also allow recognition even when the image to be recognized deviates greatly from the teacher image or contains much noise. The fourth to seventh inventions do not use polar-coordinate components, so image determination can be performed even faster and the possibility of erroneous determination can be reduced. The eighth and ninth inventions normalize the information, so recognition is possible even if the position of the target relative to the camera shifts or the illumination changes.

[Brief description of drawings]

FIG. 1 is a block diagram showing a first embodiment of the present invention.

FIG. 2 is an explanatory diagram showing the detailed configuration of the first embodiment.

FIG. 3A is an explanatory diagram showing a teacher image, and FIG. 3B is an explanatory diagram showing a determination image.

FIG. 4 is a frequency distribution characteristic diagram of the output O₁(x, y).

FIG. 5 is an explanatory diagram showing the configuration of a second embodiment of the present invention.

FIG. 6 is a diagram for explaining the expansion/contraction unit.

FIGS. 7A, 7B, and 7C are explanatory diagrams showing images of empty cans.

FIG. 8 is a frequency distribution chart for the empty cans of FIGS. 7A, 7B, and 7C.

FIG. 9 is a conceptual diagram of the CMAC.

FIG. 10 is a conceptual diagram showing the mapping relationship.

FIG. 11 is an explanatory diagram showing an operator that maps to a scalar output P.

FIG. 12 is an explanatory diagram showing an operator that maps to a vector output P.

FIG. 13 is a block diagram showing a third embodiment of the present invention.

FIG. 14 is an explanatory diagram showing the detailed configuration of the third embodiment.

FIG. 15 is a block diagram showing a fourth embodiment of the present invention.

FIG. 16 is an explanatory diagram showing the detailed configuration of the fourth embodiment.

[Explanation of symbols]

11, 21 ... Image data
12, 12a, 12b, 17, 17a, 17b ... Fast Fourier transform (FFT) unit
13, 13a, 13b ... Filter
14 ... Controller
15, 15a to 15n ... CMAC unit
16 ... Deviation detection unit
18 ... Expansion/contraction unit
19 ... Determination unit
22 ... Image rotation unit
23, 23a, 23b, 24 ... Normalization unit

Claims

[Claims]

1. An image recognition device having a learning mode, in which teacher image information is input and learned, and a determination mode, in which determination image information to be compared with the image information learned in the learning mode is input, wherein in the learning mode a characteristic portion of the image is extracted from the teacher image information, an arbitrary code number is assigned to the image, and the code number and the extracted image are input to a cerebellum model computer so that the teacher image information is learned, and in the determination mode the input determination image information is extracted and input to the cerebellum model computer, and the image information most similar to the learned teacher image information is determined and output.

2. The image recognition apparatus according to claim 1, wherein Cartesian coordinate and polar coordinate information is obtained as an output from the teacher image information and the determination image information by using a fast Fourier transform.

3. The image recognition apparatus according to claim 1, wherein Cartesian coordinate information obtained from the determination image information is input to the cerebellum model computer via the image expansion / contraction unit.

4. The image recognition device according to claim 1, wherein in the learning mode, in addition to the teacher image information, teacher image information rotated little by little is extracted and input to the cerebellum model computer together with a code number so that the teacher image information is learned.

5. The image recognition apparatus according to claim 1, wherein in the determination mode, the determination image information is input to the cerebellum model computer via the image rotation unit.

6. The image recognition apparatus according to claim 4, wherein the determination image information is input to the cerebellum model computer via the image expansion / contraction unit.

7. The image recognition apparatus according to claim 5, wherein the image information for judgment is subjected to fast Fourier transform and then inputted to the image rotation unit.

8. The image recognition device according to claim 1, 2 or 4, wherein the teacher image information is subjected to a fast Fourier transform, a power spectrum is obtained from the transformed information, the frequency at which this power spectrum is maximum is normalized by a normalization unit, and the characteristic portion of the image is extracted from the normalized information.

9. The image recognition apparatus according to claim 1, wherein the determination image information is subjected to a fast Fourier transform, a power spectrum is obtained from the transformed information, and the frequency at which this power spectrum is maximum is normalized by the normalization unit and extracted.