JPH04153891A

JPH04153891A - Pattern recognition system using rough code expression

Info

Publication number: JPH04153891A
Application number: JP2280519A
Authority: JP
Inventors: Yukio Hayashi; 幸雄林
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1990-10-18
Filing date: 1990-10-18
Publication date: 1992-05-27
Anticipated expiration: 2015-03-13
Also published as: JP3018465B2

Abstract

PURPOSE:To efficiently perform pattern recognition with a little misrecognition by realizing a general inverse matrix which is high in associating ability approximately by a local and easy process and handling even deformed pattern by a threshold value process and a feedback circuit. CONSTITUTION:In a three-layered associative storage model, an input pattern is represented in output layers I, (i), and N and fixed mapping to be fed back is realized approximately. So the model is tolerant to a noise input. Further, the outputs of intermediate layers I, K, and M in a mutual suppression coupling state are in rough code expression wherein only one element showing the recognition category of the input pattern fires. The associative process of the model is approximately equivalent to the calculation of the general inverse matrix, so the associating ability is high. Consequently, the pattern recognition with a little misrecognition is efficiently realized.

Description

[Detailed description of the invention] [Industrial application field]

本発明は文字（画像）や音声等のパターンを動的に認識
するとともに、認識辞書を学習によって修正するパター
ン認識方式に関するものである。The present invention relates to a pattern recognition method that dynamically recognizes patterns of characters (images), sounds, etc., and modifies a recognition dictionary through learning.

[Conventional technology]

従来の自己（相互）想起連想記憶モデルでは、入出力層
間の結合重み値を、入出力パターンの相関行列で設定す
る相関型連想記憶モデルより、般逆行列で設定した直交
射影型連想記憶モデルの方が高い連想能力を持つことが
知られている。松岡氏は、相互抑制結合の中間層を持つ３層モデルを用
いて、一般逆行列の計算を効率的な局所演算で実現する
方法（第３図参照）を既に提案している（検閲（九工大
）：“直交射影型連想記憶回路の種々の構造について”
、電子情報通信学会論文誌、　Ｖｏｌ、Ｊ７３−Ｄｌｌ
、Ｎｏ、４．Ａｐｒｉ１’９０．ｐｐ８４１−８４７）
さらに松岡氏は、フィードバック回路による自己想起連
想記憶モデルにおける一般逆行列の計算方法（第４図）
も提案した（前記電子情報通信学会論文誌参照）。これらのモデルは、複雑な一般逆行列の計算を簡単なネ
ット構造の回路で実現するのみならず、認識辞書である
結合重み値の学習もプロトタイプパターンの付加によっ
て簡単に実現できるので、ＬＳＩ化にも適している。In conventional self-(mutual) recall associative memory models, the orthogonal projection type associative memory model sets the connection weight values between input and output layers using a general inverse matrix, rather than the correlation type associative memory model where the connection weight values between the input and output layers are set using a correlation matrix of input and output patterns. They are known to have higher associative abilities. Mr. Matsuoka has already proposed a method (see Figure 3) that uses a three-layer model with an intermediate layer of mutually inhibiting connections to realize the calculation of a general inverse matrix by efficient local operations (see Figure 3). University of Technology): “About various structures of orthogonal projection type associative memory circuits”
, Journal of the Institute of Electronics, Information and Communication Engineers, Vol. J73-Dll
, No, 4. Apri1'90. pp841-847)
Furthermore, Mr. Matsuoka describes a method for calculating a general inverse matrix in a self-recall associative memory model using a feedback circuit (Figure 4).
was also proposed (see the above-mentioned journal of the Institute of Electronics, Information and Communication Engineers). These models not only realize the calculation of complex general inverse matrices using a circuit with a simple network structure, but also easily realize the learning of connection weight values, which are recognition dictionaries, by adding prototype patterns, so they are suitable for LSI implementation. is also suitable.

[Problem to be solved by the invention]

前述の従来の連想記憶モデルを実際にパターン認識廉に
応用するには問題があった。すなわち、この連想記憶モ
デルは、原理的なプロトタイプパターンの記憶拳想起を
示すもので、現実的な変形パターンの記憶−想起には十
分とは言えないという欠点があった。本発明は、この欠点を除去することを目的とするもので
ある。すなわち、本発明は変形パターンに対する認識能
力の高いパターン認識を行うことを目的とするものであ
る。There were problems in actually applying the conventional associative memory model described above to pattern recognition. That is, this associative memory model shows the memory-retrieval of a basic prototype pattern, but has the drawback that it cannot be said to be sufficient for the memorization and retrieval of realistic deformed patterns. The present invention aims to eliminate this drawback. That is, an object of the present invention is to perform pattern recognition with high recognition ability for deformed patterns.

[Means to solve the problem]

本発明は、上記目的を達成するために、フィードバック
回路によって入力パターンを認１ｌｉｌｆｌｉカバター
ンに動的に変換するとともに、認識辞書を学習によって
獲得する連想記憶モデルにおいて、疎な符号表現を用い
た中間層を持ち、認識辞書である結合重みに記憶すべき
情報を分散的に記憶するパターン認識方式である。また、本発明の一態様によれば、上記構成のパターン認
識方式において、各認識カテゴリの分布の許容範囲を決
定する補助ネットを設けたことを特徴とする。In order to achieve the above object, the present invention dynamically converts an input pattern into a recognized cover pattern using a feedback circuit, and uses an intermediate layer using a sparse code representation in an associative memory model that acquires a recognition dictionary through learning. This is a pattern recognition method that stores information to be stored in a distributed manner in connection weights, which is a recognition dictionary. Further, according to one aspect of the present invention, the pattern recognition method having the above configuration is characterized in that an auxiliary net is provided for determining the permissible range of the distribution of each recognition category.

[Effect]

本発明は３層連想記憶モデルでは、入力パターンが出力
層に再現されてフィードバックする恒等写像が近似的に
実現されるので、ノイズ入力に強い。また、相互抑制結
合の中間層出力は、その入力パターンの認識カテゴリを
表す１個の素子のみが発火する疎な符号表現となる（第
８図（Ｃ）参照）。このモデルの連想処理は、一般逆行
列の計算と近似的に等価となるので、連想能力が高い。ゆえζこ、ＷＡＩＩｉの少ないパターン認識が効率良く
実現できる。また、本発明の補助ネットを設ける態様においては、連
想記憶モデルのｖｉ識結果が、上記３層連想モデルによ
る認識結果と、補助ネットによる他カテゴリ方向の分布
の許容範囲を統合した結果に従って決定される。故に、
−層Ｗ４ｍの少ないパターン認識が効率良く実現できる
。In the three-layer associative memory model of the present invention, an identity mapping in which the input pattern is reproduced and fed back to the output layer is approximately realized, so it is resistant to noise input. Further, the intermediate layer output of the mutually inhibiting combination becomes a sparse code representation in which only one element representing the recognition category of the input pattern fires (see FIG. 8(C)). The associative processing of this model is approximately equivalent to the calculation of a general inverse matrix, so it has a high associative ability. Therefore, pattern recognition with less WAIIi can be efficiently realized. Furthermore, in the aspect of the present invention in which an auxiliary net is provided, the vi recognition result of the associative memory model is determined according to the result of integrating the recognition result by the three-layer associative model and the permissible range of distribution in other category directions by the auxiliary net. Ru. Therefore,
- Pattern recognition with a small number of layers W4m can be efficiently realized.

【第１の実施例〕第１図は本発明の直交射影型フィードバック連想記憶モ
デルを示す図で、第２図は本発明のシステム構成の実施
例を示すものである。第２図に示すように、このシステムは、入力パターンま
たはフィードバック出力を保持する入力レジスタ２１と
、入力レジスタ２１の内容を次段に供給するタイミング
を制御するクロック発振器２２と、後述する０式のＷ２
”ｘ″を演算する積和演算部２３と、積和演算部２３お
よび２６の出力をもとに０式のｄｙ／ｄｔを演算すると
ともに変化分により更新された中間層の出力ｙ　（ｔ）
を得る微分演算部２４と、微分演算部２４の出力ｙ　（
ｔ）に関数演算を施しｈ　（ｙ（ｔ））を出力する関数
演算部２５と、関数演算部２５の出力にしきい値処理を
する非線形しきい処理部２７と、非線形しきい処理部２
７の出力を認識出力ｆ　（ｙ）として保持する出力レジ
スタ２８と、しきい処理部２７の出力にＸを乗じて０式
の演算をする積和演算部２９からなっている。この学習
ｅ認識処理を行う第２ｒＥＪの装置の各演算部は、連続
時間を離散化することで汎用の計算機上でのプログラム
や、専用のアナログ（またはディジタル）演算回路で実
現できる。各層の素子の状態は、それぞれのレジスタに
記憶され、結合重み値は積和演算部のメモリに記憶され
る。第６図と７図は、本発明の処理手順を示すフローチャー
トで、それぞれ認識と学習の処理手順を示す。以下では
、まず認識辞書である結合重み値Ｗｌ、Ｗ２が既に設定
されている時の認識処理の例を第６図により説明し、そ
の後に、その認識辞書の学習方法の例を第７図により説
明する。本発明の認識処理は、第６図のフローチャートに示すよ
うに、まず、初期パターンの設定（後記の０式の設定）
を行う（ＳｔｅｐＯ）。すなわち、サンプル時間間隔を
Ｔ、とするとき、０≦ｔ＜Ｔ、の間にカテゴリｍに属す
る入力パターンｘ　Ｉ　ｍ　＋を入力層に与える。第２
図の入力レジスタ２１に入力パターンｘ′＝ｘ（１は保
持される。その間、ｔ＝ｋＴ、までに中間送出力を■式により更新
する（Ｓｔｅｐ　１　）。 τｄ　ｙ（ｔ）／　ｄ　ｔ　＝　Ｗｌ　ｈ（ｙ（ｔ））
　＋　Ｗ２”　ｘ　’・・Φここで、Ｗ１＝−Ｘ”Ｘ、
Ｗ２＝Ｘ、　プロトタイプ行列：Ｘ＝［ｘ口＋、　ｘ（
２１，、、、Ｘ（Ｍｌ］丁ｘｆｌ＋：　（χ＋１１．．
　、　、　、　、　　ｘ（１１Ｎ）　とする。ま“た、
τ＜Ｔ、とする。 ■式によって中間層出力ｈ　（ｙ（ｔ）　）を計算する
。この計算は第２図の装置により行う。積和演算部２３に
よって、Ｗ２’　ｘ　’を演算し、積和演算部２６によ
ってＷｌｈ　（ｙ（ｔ））の演算をする。この中間層出力ｈ　（ｙ（ｔ）　）に非線形しきい処理
部２７によりｆｌｏでしきい値処理して認識比カバター
ンｆ　、　（ｈ（ｙ（ｔ））　）を得る。認識比カバタ
ーンの全ての要素が、誤差εの範囲でＯか１であるかを
判定する（５ｔｅｐ２）。すべて０か１になっていれば、処理を終了する。Ｏまたは１になっていない要素がある場合には、フィー
ドバック信号の更新を行い、ｋ＝に＋１とし５ｔｅｐ　
１へ行（（Ｓｔｅｐ３　）。すなわち、■式によってフ
ィードバック信号を計算する。この計算は第２図の積和演算部２９によって行う。この
フィードバック信号Ｘ“（１）を次のサンプル時間（ｋ
＋１）Ｔｓまで入力層に与える。すなわち、入力レジスタ２１にｋＴ、≦Ｔ＜　（ｋ＋１
）　Ｔ、の開信号を保持する。その間に■式に従って同
様の処理を行う。この処理を、認識出力の全ての要素が
Ｏか１となるまで繰り返す。なお、０≦ｔ＜Ｔｓにおいては、とする。 ■式のようにｋＴ、≦ｔ＜（ｋ＋１）Ｔ、の間に、中間層の相互抑制結合Ｗ１によって一般逆行列が近似的
に計算され、フィードバック信号で恒等写像が近似され
るのでノイズ入力に強い。また認識辞書（Ｗ２）に記憶
したプロトタイプパターンを入力した時の中間層出力は
、■式のようにその認識カテゴリを表す１個の素子のみ
が発火する疎な符号表現となる。ここで、Ｔ：転置、＋
ニ一般逆行列を表す。ｙ　（（ｋ＋１）Ｔ、）！”ｔ　（ＸＴＸ）”Ｘ↑Ｘ′
＝　　（Ｘ”Ｘ）”Ｘ”Ｘｆ＋　　（ｙ　　（ｋＴ、）
）＝　ｒ　＋　＜　ｙ　（ｋＴ−）　）　　　　　・・
・■ｙ　（Ｔ、）　Ｌ：ｉ（Ｘ”Ｘ）”Ｘ”Ｘ’＝　（
Ｘ”Ｘ）”Ｘ”ｘ’・ｌ＝　（０，・・・、０，１゜０
、・・・０）　　　　　　　　・・・０次に第７図のフ
ローチャートに従うて、本発明の認識辞書である結合重
み値の学習処理を説明する。まず、各認識カテゴリ内の複数の学習パターンの平均値
のようなプロトタイプパターンで、結合直み値ＷｔとＷ
２を初期設定をする（ＳｔｅｐＯ）。Ｗ＋＝−ＸＴＸＷ２＝Ｘ次に各学習パターンについて前述の認識処理を行う（Ｓ
ｔｅｐ　１　）。提示した学習パターンが正しく認識できれば、この学習
パターンに結合重み値（辞書）を少し近づける（５ｔｅ
ｐ　２　）。Ｗ１＝−ａ　ｅ”’ｘ”ｘ　ｅ（ｋ”＋　（１−ａ）Ｗ
ｌ　　−■Ｗ２＝ａｅ（ｋ’＋　（１−Ｂ）Ｗ２　　　
　　−■もし提示した学習パターンをＷＡ認するなら、
誤認カテゴリのプロトタイプパターンに関する安定点の
引き込み領域を縮小するように、ｆ、（ｏ）＝Ｏ，ｆｌ
（１）＝１．ｆ、（θｌ）＝θ１の条件の下で０１を太
き（して、しきい値開数ｆ１０を調整する（Ｓｔｅｐ２
’　）。学習パターンが全て誤認しなくなるまで、再度提示して
この操作を繰り返す（Ｓｔｅｐ３）。学習後は、学習パターン以外に対しても、誤認の少ない
認識処理が実行できる。各パラメータの例を次に示す。入力次元数：Ｎ＝２５８、カテゴリ数：Ｍ＝７１、減衰の時定数：τ＝０．１、サンプリング時間：　Ｔ、＝　１０゜学習定数：α＝０．１、不安定パラメータ：θ、＝０．５、 θ、の変動！：δ＝０．０１、しきい値開数：とする。なお、しきい値開数を図に示すと、第５図のようになる
。第８図は本実施例の効果を示すパターンの認識の実験例
を示すもので、第９図は比較のためにボす従来例による
パターンの認識の実験例を示すものである。両図におい
て、（ａ）は入力層に与えられたパターンｒＥＪを示し
、このパターン「Ｅ」は１５％のランダムノイズを含ん
でいる。（ｂ）は出力層に得られるパターンを示し、（
Ｃ）は中間層の状態を示す。第９図（ｂ）に示すように、従来例の場合、出力層に得
られるパターンはかなりノイズを含んでいるが、第８図
（ｂ）に示す本発明の実施例の場合、ノイズが除去され
たきれいなパターンが得られてている。また、中間層に
ついては、第９図（Ｃ）に示すように、従来技術によれ
ば比較的大きな中間値の出力が１つの素子に表れ、比較
的小さな中間値の出力が３つの素子に表れている。これ
に対し、本発明の実施例では第８図（Ｃ）に示すように
、一つの素子のみが１００％の発火素子となっている。このように実験例により従来技術と比べて、本発明はパ
ターンの認識の能力が格段に優れていることが分る。【第２の実施例コ第１０図に、大分類と詳細分類を用いた本発明の変形の
１実施例（第２の実施例）を示す。これは、本発明の連
想記憶モデルを複数のモジュールとして用い、類似カテ
ゴリからなるパターンを大まかに識別する大分類ネット
１０１と、その最大出力値で決定される類似カテゴリ選
択信号を出力する選択部１０２と、類似カテゴリ選択信
号により選択される、最終的な認識カテゴリを識別する
詳細分類ネッ）１０３．　　・・ｅ、１０４と、詳細分
類ネットの出力を統合する統合部１０５で構成される。この実施例によれば、大分類ネット１０１と選択部１０
２により、類似カテゴリの詳細分類ネッ）１０３．　　
・舎・　１０４を選択し、最終的な認識カテゴリを識別
するので、構成が簡単となり、また認識率が高くなる。なお、大分類部での選択で類似カテゴリを限定せずに、
各詳細分類ネットの認識出力を統合して、最終的な認識
カテゴリを識別しても良い。さらに、大分類部、または、詳細分類部の一方のみを、
既存の識別手法を用いて構成しても良い。【第３の実施例】第１１図は本発明の第３の実施例を示すブロック図であ
る。この実施例は第１の実施例に示した疎な中間層を実現す
るための相互抑制結合を持った３層ネット構造のフィー
ドバック連想記憶モデルを基本ネットとし、さらに同じ
構成を育し各カテゴリの分布の許容範囲を決定する補助
ネットを付加して、変形に強いパターン認識と、認識辞
書の学習を効率的に行うものである。この第３の実施例の装置は、第１の実施例において説明
した第１図および第２図に示すような構成を有する基本
ネット（基本認識ネットワーク）１１１と、基本ネット
１１１と同じ構成を有するカテゴリ（１）〜（ｍ）の補
助ネット１１２．１１３と、入力パターンｘ　（Ｏ）と
辞書との差分な求める差分パターン抽出部１１４と、正
弦方向の許容変動判定部１１５と、認識結果判定部１１
６とからなりでいる。第１２図により本実施例の認識処理を説明する。基本ネットの中間層の各認識素子は、各カテゴリプロト
タイプパターン方向の変動には敏感に反応するが、他カ
テゴリのプロトタイブノずターン方向の変動には鈍感で
ある。そこで、入力初期パターンｘ　（０）を設定しく５ｔｅ
ｐ　Ｏ）　、基本ネット１１１から式■〜■の認識処理
を１回だけ実行する。この時の各中間層出力ｙ１は入力
初期パターンと各カテゴリの辞書パターンとの類似度Ｃ
０５ｅ１を表すものとなる（Ｓｔｅｐｌ）。次に入力初期パターンＸ　（０）から得られたフィード
バック信号を用いて、基本ネットによる式■〜■の処理
を収束するまで繰り返すことにより認識出力ｆ　（ｈ（
ｙ））の計算をする（Ｓｔｅｐ２）。入力初期パターンＸ（０）と各辞書パターンＸ（１＋の
差分を差分パターン抽出部１１４により求め、この差分
パターンを各カテゴリ（１）〜（ｍ）の補助ネット１１
２〜１１３に入力する。各カテゴリの補助ネットは基本ネットと同様な構成で、
そのしきい値のみが異なるものである。各補助ネット１１２〜１１３では、入力パターンが各カ
テゴリの辞書パターンからどの程度変動しているかを、
差分パターンの各カテゴリの辞書パターン方向への射影
成分（ＣＯＳθ、）から求める。もし、これらの値が許容値以上ならば、あるカテゴリに
許容範囲外のパターンが入力されたことを示すように、
その許容外方向に相当する中間素子が発火する（オン状
態なる）。他のカテゴリの補助ネットについても同様に、許容範囲
外のパターンが入力されたことを示す発火素子があるか
どうかを調べる（Ｓｔｅｐ　４　）。また、正弦方向の変動に対しては、５ｔｅｐ　１で求め
た各カテゴリへの類似度ＣＯＳθ１を用いて、ｓｉｎθ
、＝　　　−ＣＯ５，を計算し、各カテゴリについてこ
の値が許容値以下かどうか確かめられる。もし、許容値を越えていれば、該当するカテゴリの認識
結果を受理しないように、補助ネットと同様な発火信号
を認識結果判定部１１６に送る。補助ネット１１２〜１１３と正弦方向の許容変動判定器
１１５からの発火素子があった場合には、そのカテゴリ
に対する認識出力を認識結果判定部１１６で零（ｎｏｎ
ａｃｔｉｖｅ　）にする（Ｓｔｅｐ　５　）　、すなわ
ち、基本ネット１１１の認識出力が発火していようと認
識結果判定部１１６による統合結果としては、そのカテ
ゴリの認識出力は否定される。最後に基本ネット１１１のカテゴリｉの認識出力のみが
１となっているかを判定する（Ｓｔｅｐ　６　）。その判定の結果、カテゴリｉの認識出力のみが１となっ
ている場合は、カテゴリｉを認識結果とする（Ｓｔｅｐ
　７　）。５ｔｅｐ　６の判定において、カテゴリｉの認識出力の
みが１となっていなかった場合は、誤認なので、結果を
リジェクトする（Ｓｔｅｐ８）。第１４図に２次元２カテゴリの場合の認識境界の様子を
示す。例えば、カテゴリＡの認識領域は、基本ネットで
下側の境界が定まり、カテゴリＡの補助ネットのカテゴ
リＢ　（ｃｏｓθ８）方向に関する素子で右側の境界が
、カテゴリＡ（ｃｏｓθＡ）方向に関する素子で上側の
境界がそれぞれ定まり、また、正弦方向（ｓｉｎθＡ）
の境界は正弦方向の許容変動゛判定器によって定まる。カテゴリＢの認識境界についても同様である。次に、第１３図に従って、本実施例の補助ネットにおけ
る他カテゴリ方向の許容範囲の調整方法を説明する。まず、各カテゴリのプロトタイプパターンがその入力パ
ターンには反応する（他カテゴリの素子が反応してもよ
い）ように記憶され、その許容変動範囲を示すしきい値
が調整されたとする（ＳｔｅｐＯ）。次に、この基本ネットをコピーして各カテゴリの補助ネ
ットを作り、入力パターンを順に提示する（Ｓｔｅｐ　
１　）。提示したパターンカテゴリ以外の他カテゴリの認識出力
が受理されるかどうかを判定する（Ｓｔｅｐ２）。判定の結果、他カテゴリの認識出力がなかったときは、
すべてのサンプルパターンに誤認がないかを調べ（Ｓｔ
ｅｐ　３　）　Ｎ　まだ誤認のあるサンプルパターンが
あるときは次のサンプルパターンを提示する。誤認のあ
るサンプルパターンがなくなったら処理を終了する。５ｔｅｐ　２の判定の結果、基本ネットの中間素子が誤
って他カテゴリに反応していたら、補助ネットを動作さ
せる（Ｓｔｅｐ４）。そして、基本ネットで誤認したカテゴリが補助ネットで
否定されるかどうかを調べる（ｓｔｅｐ５）。補助ネットで否定された場合には、認識処理時にはその
カテゴリは基本ネットが誤判定しても補助ネットを参照
することにより正しい判定ができるので、補助ネットの
調整は行わず、次のパターンを提示し調整を続行する。５ｔｅｐ　５の判定で、基本ネットで誤認したカテゴリ
が補助ネットで否定されなかった場合には、その入力カ
テゴリの補助ネットの中間素子（誤反応するカテゴリに
相当）のしきい値を微小量Δだけ小さくして、他カテゴ
リの変動許容範囲をきつくする（Ｓｔｅｐ　６　）。学習用の入力パターンが誤認しな（なるまで、再度提示
してこの操作を繰り返す。これによって学習後は、学習
パターン以外に対しても、誤認の少ない認識処理が実行
できる。【発明の効果］本発明は、連想能力の高い一般逆行列を局所的で簡単な
処理で近似的に実現し、しきい値処理とフィードバック
回路によって変形パターンにも対応できるようにしたの
で、誤認が少ないパターン認識が効率的に実行でき、か
つ、簡単な演算で認識辞書も学習できるという効果を奏
する。また、本発明は補助ネットを設けた態様によれば、各カ
テゴリの分布の許容範囲を決定し、認識を行うので、−
層誤認の少ないパターン認識が効率良く実現できる。[First Embodiment] FIG. 1 is a diagram showing an orthogonal projection type feedback associative memory model of the present invention, and FIG. 2 is a diagram showing an embodiment of the system configuration of the present invention. As shown in FIG. 2, this system includes an input register 21 that holds an input pattern or feedback output, a clock oscillator 22 that controls the timing of supplying the contents of the input register 21 to the next stage, and a W2
The sum-of-products calculation section 23 calculates "x", and the dy/dt of equation 0 is calculated based on the outputs of the sum-of-products calculation sections 23 and 26, and the output y (t) of the intermediate layer is updated by the change.
The differential calculation unit 24 obtains the output y (
t) and outputs h(y(t)); a nonlinear threshold processing unit 27 that performs threshold processing on the output of the function calculation unit 25; and a nonlinear threshold processing unit 2.
It consists of an output register 28 that holds the output of 7 as a recognition output f (y), and a product-sum operation section 29 that multiplies the output of the threshold processing section 27 by X and performs the operation of the 0 formula. Each calculation unit of the second rEJ device that performs this learning e-recognition process can be realized by a program on a general-purpose computer or a dedicated analog (or digital) calculation circuit by discretizing continuous time. The states of the elements in each layer are stored in their respective registers, and the connection weight values are stored in the memory of the product-sum operation section. FIGS. 6 and 7 are flowcharts showing the processing procedure of the present invention, and show the processing procedure of recognition and learning, respectively. In the following, an example of recognition processing when the combination weight values Wl and W2, which are recognition dictionaries, have already been set will be explained with reference to FIG. 6, and then an example of a learning method for the recognition dictionary will be explained with reference to FIG. 7. explain. In the recognition process of the present invention, as shown in the flowchart of FIG.
(Step O). That is, when the sample time interval is T, an input pattern x I m + belonging to category m is applied to the input layer during 0≦t<T. Second
The input pattern x'=x (1 is held in the input register 21 in the figure. In the meantime, by t=kT, the intermediate transmission output is updated by formula (Step 1). τd y(t)/d t = Wl h(y(t))
+ W2"x'...ΦHere, W1=-X"X,
W2=X, prototype matrix: X=[xmouth+, x(
21,,,,X(Ml]dxfl+: (χ+11...
, , , , x(11N). Also,
Let τ<T. (2) Calculate the intermediate layer output h (y(t)) using the formula. This calculation is performed using the apparatus shown in FIG. The product-sum calculation unit 23 calculates W2′ x ′, and the product-sum calculation unit 26 calculates Wlh (y(t)). This intermediate layer output h (y(t) ) is subjected to threshold processing using flo by the nonlinear threshold processing unit 27 to obtain a recognition ratio cover turn f , (h(y(t)) ). It is determined whether all elements of the recognition ratio cover turn are O or 1 within the range of error ε (5tep2). If all are 0 or 1, the process ends. If there is an element that is not O or 1, update the feedback signal and set k to +1 and repeat 5 steps.
1 ((Step 3). That is, the feedback signal is calculated using the equation
+1) Provide up to Ts to the input layer. That is, the input register 21 has kT, ≦T< (k+1
) T, holds the open signal. In the meantime, perform similar processing according to formula (2). This process is repeated until all elements of the recognition output become O or 1. Note that when 0≦t<Ts, the following is true. ■As shown in the formula, during kT, ≦t<(k+1)T, the general inverse matrix is approximately calculated by the mutually inhibiting coupling W1 in the intermediate layer, and the identity mapping is approximated by the feedback signal, so noise input Strong against Further, when the prototype pattern stored in the recognition dictionary (W2) is input, the output of the intermediate layer is a sparse code expression in which only one element representing the recognition category fires, as shown in equation (2). Here, T: transpose, +
D represents the general inverse matrix. y ((k+1)T,)! "t (XTX)"X↑X'
= (X”X)”X”Xf+ (y (kT,)
)=r+<y(kT-))...
・■y (T,) L:i(X”X)”X”X'= (
X”X)”X”x'・l= (0,...,0,1゜0
, . . 0) . . . 0 Next, the learning process of the connection weight value, which is the recognition dictionary of the present invention, will be explained according to the flowchart of FIG. First, with a prototype pattern such as the average value of multiple learning patterns in each recognition category, the combined directivity Wt and W
2 is initialized (Step O). W+=-XTX W2=X Next, perform the above-mentioned recognition process for each learning pattern (S
step 1). If the presented learning pattern can be recognized correctly, the connection weight value (dictionary) is made a little closer to this learning pattern (5te
p2). W1=-a e”'x”x e(k”+ (1-a)W
l −■W2=ae(k'+ (1-B)W2
-■If the presented learning pattern is approved by WA,
f,(o)=O,fl so as to reduce the area of attraction of the stable point for the prototype pattern of the misidentified category.
(1)=1. Under the condition of f, (θl) = θ1, 01 is thickened (and the threshold value numerical aperture f10 is adjusted (Step 2)
). The learning pattern is presented again and this operation is repeated until all learning patterns are no longer misrecognized (Step 3). After learning, recognition processing with fewer misidentifications can be performed on patterns other than the learned patterns. Examples of each parameter are shown below. Number of input dimensions: N = 258, number of categories: M = 71, time constant of decay: τ = 0.1, sampling time: T, = 10° learning constant: α = 0.1, instability parameter: θ, = Fluctuation of 0.5, θ! : δ=0.01, Threshold numerical aperture: Incidentally, when the threshold numerical aperture is shown in a diagram, it is as shown in FIG. 5. FIG. 8 shows an experimental example of pattern recognition showing the effects of this embodiment, and FIG. 9 shows an experimental example of pattern recognition using a conventional example for comparison. In both figures, (a) shows the pattern rEJ given to the input layer, this pattern "E" containing 15% random noise. (b) shows the pattern obtained in the output layer, (
C) shows the state of the intermediate layer. As shown in FIG. 9(b), in the case of the conventional example, the pattern obtained in the output layer contains considerable noise, but in the case of the embodiment of the present invention shown in FIG. 8(b), noise is removed. A beautiful pattern is obtained. Regarding the intermediate layer, as shown in FIG. 9(C), according to the prior art, a relatively large intermediate value output appears in one element, and relatively small intermediate value outputs appear in three elements. ing. In contrast, in the embodiment of the present invention, only one element is a 100% firing element, as shown in FIG. 8(C). As described above, it can be seen from the experimental examples that the present invention has a much superior pattern recognition ability compared to the prior art. [Second Embodiment] FIG. 10 shows a modified embodiment (second embodiment) of the present invention using major classification and detailed classification. This uses the associative memory model of the present invention as a plurality of modules, and includes a large classification net 101 that roughly identifies patterns consisting of similar categories, and a selection unit 102 that outputs a similar category selection signal determined by its maximum output value. and a detailed classification network that identifies the final recognition category selected by the similar category selection signal) 103. ...e, 104, and an integrating unit 105 that integrates the outputs of the detailed classification net. According to this embodiment, the major classification net 101 and the selection unit 10
2, detailed classification of similar categories) 103.
・Since 104 is selected and the final recognition category is identified, the configuration is simple and the recognition rate is high. In addition, without limiting the similar categories by selection in the major classification section,
A final recognition category may be identified by integrating the recognition outputs of each detailed classification net. Furthermore, only one of the major classification section or the detailed classification section,
It may be configured using an existing identification method. [Third Embodiment] FIG. 11 is a block diagram showing a third embodiment of the present invention. This example uses the feedback associative memory model of the three-layer net structure with mutually inhibiting connections to realize the sparse intermediate layer shown in the first example as the basic net, and further develops the same configuration to By adding an auxiliary net that determines the permissible range of the distribution, pattern recognition that is resistant to deformation and recognition dictionary learning can be performed efficiently. The device of this third embodiment has a basic net (basic recognition network) 111 having the configuration shown in FIGS. 1 and 2 described in the first embodiment, and a configuration same as that of the basic net 111. Auxiliary nets 112 and 113 of categories (1) to (m), a differential pattern extraction unit 114 that calculates the difference between the input pattern x (O) and the dictionary, an allowable variation determination unit 115 in the sine direction, and a recognition result determination unit 11
It consists of 6. The recognition processing of this embodiment will be explained with reference to FIG. Each recognition element in the intermediate layer of the basic net responds sensitively to changes in the direction of each category prototype pattern, but is insensitive to changes in the turn direction of prototype patterns in other categories. Therefore, let's set the input initial pattern x (0).
p O), the recognition processing of equations (1) to (2) is executed only once from the basic net 111. At this time, each intermediate layer output y1 is the similarity C between the input initial pattern and the dictionary pattern of each category.
05e1 (Step). Next, using the feedback signal obtained from the input initial pattern X (0), the recognition output f (h(
y)) is calculated (Step 2). The difference between the input initial pattern X(0) and each dictionary pattern
Enter numbers 2 to 113. The auxiliary net for each category has the same configuration as the basic net,
Only the thresholds are different. Each auxiliary net 112 to 113 measures how much the input pattern varies from the dictionary pattern of each category.
It is determined from the projected component (COS θ,) of each category of the difference pattern in the direction of the dictionary pattern. If these values are greater than or equal to the allowable values, it indicates that a pattern outside the allowable range has been entered in a certain category.
The intermediate element corresponding to the non-permissible direction fires (turns on). Similarly, for the auxiliary nets of other categories, it is checked whether there is a firing element that indicates that a pattern outside the permissible range has been input (Step 4). In addition, for fluctuations in the sinusoidal direction, using the degree of similarity COSθ1 to each category obtained in step 1, sinθ
, = -CO5, and check whether this value is below the allowable value for each category. If the allowable value is exceeded, a firing signal similar to that of the auxiliary net is sent to the recognition result determination unit 116 so as not to accept the recognition result of the corresponding category. If there is a firing element from the auxiliary nets 112 to 113 and the permissible variation determination unit 115 in the sinusoidal direction, the recognition output for that category is set to zero (non) by the recognition result determination unit 116.
active) (Step 5), that is, even if the recognition output of the basic net 111 is fired, the recognition output of that category is denied as an integrated result by the recognition result determination unit 116. Finally, it is determined whether only the recognition output of category i of the basic net 111 is 1 (Step 6). As a result of the determination, if only the recognition output of category i is 1, category i is set as the recognition result (Step
7). 5 In the determination of Step 6, if only the recognition output of category i is not 1, it is a misrecognition and the result is rejected (Step 8). FIG. 14 shows the state of the recognition boundary in the case of two dimensions and two categories. For example, in the recognition area of category A, the lower boundary is determined by the basic net, the right boundary is determined by the element in the category B (cos θ8) direction of the auxiliary net of category A, and the upper boundary is determined by the element in the category A (cos θA) direction of the auxiliary net of category A. The boundaries are determined, and the sine direction (sinθA)
The boundary of is determined by the permissible variation in the sinusoidal direction. The same applies to the recognition boundary of category B. Next, with reference to FIG. 13, a method of adjusting the allowable range in the other category direction in the auxiliary net of this embodiment will be explained. First, it is assumed that a prototype pattern of each category is stored so as to react to its input pattern (elements of other categories may also react), and a threshold value indicating its permissible variation range is adjusted (Step O). Next, copy this basic net to create auxiliary nets for each category, and present the input patterns in order (Step
1). It is determined whether recognition outputs of categories other than the presented pattern category are accepted (Step 2). As a result of the judgment, if there is no recognition output for other categories,
Check all sample patterns for misidentification (St
ep 3) N If there are still sample patterns that are misidentified, present the next sample pattern. The process ends when there are no more misidentified sample patterns. 5. If the intermediate element of the basic net is erroneously reacting to another category as a result of the determination in Step 2, the auxiliary net is operated (Step 4). Then, it is checked whether the category misidentified by the basic net is denied by the auxiliary net (step 5). If the auxiliary net is negative, the next pattern is presented without adjusting the auxiliary net, because during recognition processing, even if the basic net incorrectly judges that category, the correct judgment can be made by referring to the auxiliary net. and continue adjusting. 5tep In the judgment in step 5, if the category misidentified by the basic net is not denied by the auxiliary net, the threshold of the intermediate element (corresponding to the category that responds incorrectly) of the auxiliary net for that input category is set by a small amount Δ. and tighten the allowable range of variation for other categories (Step 6). The input pattern for learning is presented again and this operation is repeated until there is no misrecognition. By this, after learning, recognition processing with fewer misrecognitions can be performed even for patterns other than the learning pattern. [Effects of the invention] The present invention approximately realizes a general inverse matrix with high associative ability through local and simple processing, and can also handle deformed patterns using threshold processing and a feedback circuit, allowing pattern recognition with fewer misidentifications. This has the effect of being able to be executed efficiently and learning a recognition dictionary with simple calculations.Furthermore, according to an aspect of the present invention in which an auxiliary net is provided, the permissible range of the distribution of each category is determined, and the recognition Since −
Pattern recognition with less layer misidentification can be efficiently realized.

[Brief explanation of drawings]

第１図は、本発明の直交射影型フィードフォワード連想
記憶モデルを示す図である。ただし、図中には注目した
中間層素子への結合のみが示されている。第２図は、本発明の第１の実施例のシステム構成を示す
図である。第３図は、従来の直交射影型フィードフォワード連想記
憶モデルを示す図である。第４図は、従来の直交射影型フィードフォワード連想記
憶モデルを示す図である。第５図は、中間層出力にしきい値処理を施すための非線
形しきい関数の一例を示す図である。第６図は、本発明の認識処理の概略を示すフォローチャ
ートである。第７図は、本発明の学習処理の概略を示すフローチャー
トである。第８図（ａ）〜（ｃ）は、本実施例の効果を示すパター
ンの認識の実験例を示す図である。第９図（ａ）〜（Ｃ）は比較のために示す従来例による
パターンの認識の実験例を示すものである。第１０図は、大分類と詳細分類を用いた本発明の第２の
実施例を示す図である。第１１図は、本発明の第３の実施例のシステム構成を示
す図である。第１２図は、第３の実施例の認識処理の手順を示すフロ
ーチャートである。第１３図は、第３の実施例の学習処理の手順を示すフロ
ーチャートである。第１４図は第３の実施例における２次元パターンの２カ
テゴリの識別面の形成例を示した図である。２１・・・入力レジスタ、２２・・・クロック発振器、
２３．２６．２９・・・積和演算部、２４・・・微分演
算部、２５・・・関数演算部、２７・・・非線形しきい
処理部、２８・・・出力レジスタ。FIG. 1 is a diagram showing an orthogonal projection type feedforward associative memory model of the present invention. However, in the figure, only the coupling to the interlayer element of interest is shown. FIG. 2 is a diagram showing the system configuration of the first embodiment of the present invention. FIG. 3 is a diagram showing a conventional orthogonal projection type feedforward associative memory model. FIG. 4 is a diagram showing a conventional orthogonal projection type feedforward associative memory model. FIG. 5 is a diagram showing an example of a nonlinear threshold function for applying threshold processing to the intermediate layer output. FIG. 6 is a follow chart showing an outline of the recognition process of the present invention. FIG. 7 is a flowchart showing an outline of the learning process of the present invention. FIGS. 8(a) to 8(c) are diagrams showing experimental examples of pattern recognition showing the effects of this embodiment. FIGS. 9(a) to 9(C) show experimental examples of pattern recognition according to a conventional example for comparison. FIG. 10 is a diagram showing a second embodiment of the present invention using major classification and detailed classification. FIG. 11 is a diagram showing the system configuration of the third embodiment of the present invention. FIG. 12 is a flowchart showing the procedure of recognition processing in the third embodiment. FIG. 13 is a flowchart showing the procedure of learning processing in the third embodiment. FIG. 14 is a diagram showing an example of forming two categories of identification surfaces of a two-dimensional pattern in the third embodiment. 21... Input register, 22... Clock oscillator,
23.26.29... Product sum calculation section, 24... Differential calculation section, 25... Function calculation section, 27... Nonlinear threshold processing section, 28... Output register.

Claims

[Claims]

(1) In an associative memory model that dynamically converts an input pattern into a recognition output pattern using a feedback circuit and acquires a recognition dictionary through learning, it has an intermediate layer using sparse code representation, and is a recognition dictionary. A pattern recognition method characterized by storing information to be stored in connection weights in a distributed manner.

(2) An associative memory model that dynamically converts an input pattern into a recognition output pattern using a feedback circuit and acquires a recognition dictionary through learning. 1. A pattern recognition method in which information to be stored in connection weights is stored in a distributed manner, characterized in that an auxiliary net is provided to determine the permissible range of distribution of each recognition category.