JPH0329064A

JPH0329064A - Pattern recognizing device

Info

Publication number: JPH0329064A
Application number: JP1164847A
Authority: JP
Inventors: Yoshihisa Otsuka; 喜久大塚; Akira Nunoki; 布木　明
Original assignee: Kobe Steel Ltd
Current assignee: Kobe Steel Ltd
Priority date: 1989-06-27
Filing date: 1989-06-27
Publication date: 1991-02-07

Abstract

PURPOSE:To facilitate the weight correcting calculation by inputting the simplified data to a neural network and approximating the simplified data to the teacher data in accordance with an error between the output data of the neural network and the teacher data. CONSTITUTION:In a teaching step, the final target teacher data is computed into the simple data for calculation of the simplified data. This simplified data is inputted to a neural network 2 to obtain the output data. An arithmetic control part 4 functions to approximate the simplified data to the final target teacher data in accordance with an error between the obtained output data and the final target teacher data. A memory 3 stores the teacher data as well as the weight given to each coupling part 6 which connects the neurons 5 when the coincidence is obtained between the final target teacher data and the simplified data. Then an input port 4c transfers the pattern data between the part 4 and the outside of a pattern recognizing device. In such a constitution, the weight correcting calculation is facilitated and no extension of the network 2 is needed.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、パターン入力データと装置内に格納された複
数の教師データとを比較参照し、パターン入力データを
何れかの教師データに特定するパターン認識装置に関す
る．〔従来技術〕人の脳は約１４０ｒ！！個のニューロン（脳細胞）から
構成され、各ニューロンがシナプス結合によりネットワ
ークを横或していることはよく知られている．更に、脳
は前記ネットワークの結合状態を変化させることにより
記憶・学習を行っていると考えられている．特にパター
ン認識に関して人の脳は非常に優れ、その情報処理形態
は闇値演算による並列処理に基づくものであり、その記
憶形態は連想形式であるとされている．これらのネソト
ワークを数学的にモデル化したものに衆知のバーセプト
ロンが挙げられる。更に近年において、ハード面の性能
向上による超多重並列処理の実現及び記憶容量の大幅な
増大化等を背景に、多数のニューロンよりなるネノトワ
ーク（以下、ニューラルネットワークという）のモデル
によるパターン認識処理を計算機で実現しようとする試
みがなされつつある．第８図にニューロンの生物疑似モデルを示し、第９図に
その数学モデルを示す．５．はニューロンのモデル、６
１は軸索のモデル、１０Ｌはシナブス結合のモデルであ
る．ニューロン５の数学モデルは、第９図に示すように
、前記生物疑似モデルのシナスブ結合１０，に相当する
連結部６のニューロン５との結合部にて重み付け処理Ｒ
のなされた、各パターン入力データＸがニューロン５内
に入力される．そして、前記重み付け後のバターン入力
データは総和処理Ｓがなされた後に闇値処理Ｔが施され
る。前記闇値処理Ｔでは通常ングモイド関数等のＳ字関
数が用いられる。そして、闇値処理された値ｙがニュー
ロンの出力値として後続のニューロン５に向けて出力側
の連結部６へ出力される。[Detailed Description of the Invention] [Industrial Application Field] The present invention compares and refers pattern input data with a plurality of teacher data stored in a device, and specifies the pattern input data as one of the teacher data. Regarding pattern recognition devices. [Prior art] The human brain is approximately 140 r! ! It is well known that the brain is composed of individual neurons (brain cells), and each neuron forms a network through synaptic connections. Furthermore, the brain is thought to perform memory and learning by changing the connection state of the network. The human brain is particularly good at pattern recognition, and its information processing form is based on parallel processing using dark value calculations, and its memory form is said to be associative. The well-known berseptron is a mathematical model of these networks. Furthermore, in recent years, against the backdrop of the realization of massively multiplexed parallel processing due to improvements in hardware performance and the significant increase in memory capacity, computer-based pattern recognition processing using a model of a neural network (hereinafter referred to as a neural network) consisting of a large number of neurons has been developed. Attempts are being made to realize this. Figure 8 shows a biological model of a neuron, and Figure 9 shows its mathematical model. 5. is a neuron model, 6
1 is a model of an axon, and 10L is a model of synaptic connections. As shown in FIG. 9, the mathematical model of the neuron 5 is subjected to a weighting process R at the connection part of the connection part 6 with the neuron 5, which corresponds to the synaptic connection 10 of the biological pseudo model.
Each pattern input data X is input into the neuron 5. The weighted pattern input data is then subjected to a summation process S and then subjected to a dark value process T. In the dark value processing T, an S-shaped function such as a ngmoid function is usually used. Then, the value y subjected to the dark value processing is outputted to the output side connecting section 6 toward the subsequent neuron 5 as the output value of the neuron.

このようなニューラル不ソｌ・ワークのモデルとしては
、ネノトワーク横造の違い或いはパターン記憶方式の違
いから自動連想型のものとバクーン連想型のものとに大
別される．前記自動連想型のネ，トワークは、ここでは図示しない
が、入力ニューロンと出力ニューロンとが共通でネント
ワーク内の全てのニューｎンが相互に接続されると共に
複数のパターンデータをネットワーク内に格納し入力パ
ターンに最も近似したパターンデータを出力するネノト
ワークであって、パターンデークを教示させるステノブ
では似通った入力パターンを識別するようにニューロン
間の結合の強さを変えるようにしている．このようなネ
ソトワークは組合せ問題のＲ通化や曖昧情？の連想に優
れているとされる．一方、パターン連想型のニューラルネントワーク２■は
、第１０図に示すように、複数のニューロン５が入力層
．中間層，入力層に階層化され、入力層から出力層に向
けて連結部６を介して層間のニューロン５が接続されて
いる．更に、同一層内のニューロン５同志は接続されず
、入力層のニューロン５と出力層のニエーロン５とは独
立しているといった特徴がある。These models of neural insolence work can be broadly divided into automatic association type and Bakun association type based on the difference in the horizontal construction of neural work or the difference in pattern memory method. Although not shown here, the automatic associative network has a common input neuron and an output neuron, all neurons in the network are interconnected, and multiple pattern data are stored in the network. It is a neurotower that outputs pattern data that is most similar to the input pattern, and the stenowork that teaches the pattern data changes the strength of the connections between neurons in order to distinguish between similar input patterns. Is this kind of network work an R-commandation of combinatorial problems or ambiguity? It is said to be excellent at associating things. On the other hand, in the pattern-associative neural network 2■, as shown in FIG. 10, a plurality of neurons 5 are connected to the input layer. It is hierarchically divided into an intermediate layer and an input layer, and neurons 5 between the layers are connected via a connecting part 6 from the input layer to the output layer. Furthermore, the neurons 5 in the same layer are not connected, and the neurons 5 in the input layer and the neurons 5 in the output layer are independent.

前記パターン連想型のニューラルネットワーク２ａにお
ける学習ではパックブロパゲーション（逆伝搬法）アル
ゴリズムを適用したものがよく知られている．以下、こ
のバンクブロパゲーシうンアルゴリズムを用いたニュー
ラルネットワークに付き説明する．前記バックブロパゲーションアルゴリズムは１９８６年
に公開されたもので、システムの運用に先立って、先ず
パターンの教示ステップを必要とし、入力層のニューロ
ン５に教示データを与えなければならない，即ち、入力
層の各ニューロン５に学旨用のパターン入力データＸを
与えると、この信号は前記各ニューロン５で変換される
．続いて、中間層の各二エーロン５に伝達され、最後に
出力層の各ニューロン５からパターン出力データｙとし
て出力される（矢印Ｆ）。In learning in the pattern associative neural network 2a, it is well known that a pack propagation algorithm is applied. Below, we will explain the neural network using this bank propagation algorithm. The backbropage algorithm was published in 1986, and prior to system operation, it first requires a pattern teaching step, and teaching data must be given to neurons 5 in the input layer. When each neuron 5 is given a pattern input data X for academic purposes, this signal is converted by each neuron 5. Subsequently, it is transmitted to each neuron 5 in the intermediate layer, and finally output as pattern output data y from each neuron 5 in the output layer (arrow F).

各層のそれぞれのニューロン５への入力に際して連結部
６で各信号に瑣みが付加されるのは先述した通りである
。そして、前記出力層からの出力値と予め設定された望
ましい出力値とが比較され、その差を減少するように、
出力層と中間層との間の結合の強さ（重み）が変更され
る。この徂みの変更は、ニューラルネソトワーク２，の
信号伝搬方向（矢印Ｆ）を遡る方向に（矢印Ｍ）収束す
るまで順次繰り返される。As described above, the coupling section 6 adds a dwarf to each signal upon input to each neuron 5 of each layer. Then, the output value from the output layer and a preset desired output value are compared, and the difference is reduced.
The strength (weight) of the coupling between the output layer and the hidden layer is changed. This range change is sequentially repeated until the signal propagation direction (arrow F) of the neural network 2 converges in the backward direction (arrow M).

このように、前記バンクブロパゲーンゴンアルゴリズム
によるニューラルネノＩ・ワーク２ａは、得られた出力
ｙと望ましい出力Ａとの差がＯに近づくまでニューロン
５間の結合の強さを繰り返し変えるものであって、この
手法によれば、分類したいデータを抽出してパターンデ
ータとし、更に？師データとして分類結果の代表値を設
定することによりデータ群の任意のデータがどの代表値
に近い値を出力するかを調べることによっ゛ζデータの
分類をなすことができる。In this way, the neural node I work 2a based on the bank-propagating algorithm repeatedly changes the strength of the connections between the neurons 5 until the difference between the obtained output y and the desired output A approaches O. According to this method, the data to be classified is extracted and made into pattern data, and then... By setting the representative value of the classification result as master data, it is possible to classify the ζ data by checking which representative value any data in the data group outputs a value close to.

又、前記ニューラルネントワーク２１に対し、その入力
特徴を変えたものとして、第Ｈ図に示すようなニューラ
ルネノトワーク２ｂが挙げられる．即ち、入力特徴をテ
ンソルや三角関数を用いて拡大すること、例えば図中入
力値Ｘ１に対し、χ．を変数とするＩｔ数の三角関数も
同時に入力することにより、学習効率を大幅に向上させ
ようとするものである．このように、バンクプロバゲーションアルゴリズムによ
るパターン連想型のニューラルネットワーク２■，２ｂ
は、ネットワーク内にパターン認識機能や知識処理機能
を教え込むことが可能で、バーセブトロンが不得手とす
る線形分離処理を威すことのできないパターン入力デー
タをも確実に処理することができるといった点で有用な
ものである．［発明が解決しようとする課題］ところが、士記したようなニューラル不，トワーク２１
を具備してなるパターンＬ２識装置では、教師データに
よる対象パターンが複雑であると、教示のステノプにお
いて、前記重み修正のためのｆ４′ＪＸ繰り返し回数が
多く必要であったり、又重みの修正を繰り返しても、例
えば重みに対する誤差のポテンシャルが多峰性であるよ
うな場合に局所最適値に陥って、前記教示データに近い
出力をｉ：￥ることかできないといった問題がある．即
ち、データによっては容易に収束する場合と収束しない
場合がある．又、ニューラルネットワーク２Ｉ，を具備してなるもの
であれば、ネットワーク自体が複雑になることから、演
算量が増加するとともにハードの部品点数を多く必要と
するものであった。Further, as a version of the neural network 21 with different input characteristics, there is a neural network 2b shown in FIG. That is, expanding the input feature using a tensor or trigonometric function, for example, for the input value X1 in the figure, χ. By simultaneously inputting the trigonometric function of the It number as a variable, the learning efficiency is greatly improved. In this way, pattern associative neural networks 2■, 2b using bank propagation algorithms
It is possible to inculcate pattern recognition functions and knowledge processing functions into the network, and it is possible to reliably process pattern input data that cannot be affected by linear separation processing, which is a weak point of the Versebutron. It is useful. [Problems to be solved by the invention] However, neural dysfunction and twerking21 as described above
In the pattern L2 recognition device equipped with the above, if the target pattern according to the teacher data is complex, it may be necessary to repeat the f4'JX a large number of times for the weight correction in the teaching step, or it may be necessary to repeat the weight correction. Even if it is repeated, there is a problem that, for example, if the error potential for the weight is multimodal, it will fall into a local optimum value and will be unable to produce an output close to the teaching data. In other words, depending on the data, it may converge easily or it may not. Further, if the neural network 2I is provided, the network itself becomes complicated, which increases the amount of calculations and requires a large number of hardware components.

したがって、本発明の目的とするところは、対象パター
ンが複雑であっても、ニューラルネントワーク中のニュ
ーロン数を増加させることなく、重み修正計算の収束を
可能にしたり、あるいは高速化することのできるパター
ン認識装置を提供することにある．〔課題を解決するための手段〕上記目的を達或するために、本発明が採用する主たる手
段は、データの入力部，演算部および出力部を有する１
又は複数の情報処理部、ならびに該情報処理部の出力部
と入力部を連結するとともに該出力部よりの出力情報に
重みを付加して前記入力部に伝達するための連結部を具
備してなるパターン認識装置において、予め設定された
目標教師データを記億するメモリ手段と、入力データを
前記パターン認識装置に入力し演算することにより、そ
の出力データが予め設定された初期目標教示データに近
接もしくは一致するまで重みを変えつつ学習して、近接
もしくは一致した際の前記連結部の各々の重みを決定す
る第１重み決定手段と、前記初期目標教師データを少な
くとも１回以上段階的に変更して最終目標教師データに
近づけるとともに各段階における出力データが各段階に
おける目標教師データと近接もしくは一致するまで前記
重みを変えつつ学習することによって前記出力データが
最終目標教示データに近接もしくは一致した際の最終重
みを決定する第２重み決定手段とを具備してなる点を要
旨とするパターン認識装置である．〔作用〕本発明に係るパターン認識装置では、該パターン認識装
置にパターンを教示する際に、先ず予め設定されメモリ
手段に格納された最終目標教示データが、演算しやすく
簡素化された初期目標教師データに変更して設定される
．そして、前記パターン認識装置に入力データが入力さ
れ、得られた出力データが前記初期目標教示データと比
較参照され、それぞれが近接若しくは一致するまで前記
重みを繰り返し変更して学習され、前記出力データが初
期目標教師データに近接若しくは一致した際の前記連結
部の各々の重みが第１重み決定手段により決定される。Therefore, an object of the present invention is to enable convergence or speed up weight correction calculations without increasing the number of neurons in the neural network even if the target pattern is complex. The objective is to provide a pattern recognition device. [Means for Solving the Problems] In order to achieve the above object, the main means adopted by the present invention is a system having a data input section, a calculation section, and an output section.
or a connecting section for connecting a plurality of information processing sections and an output section and an input section of the information processing sections, and for adding weight to output information from the output section and transmitting the weighted information to the input section. The pattern recognition device includes a memory means for storing preset target teaching data, and input data is input to the pattern recognition device and is operated so that the output data is close to or close to the preset initial target teaching data. a first weight determining means that performs learning while changing the weight until a match is made, and determines the weight of each of the connected parts when they are close or match; and stepwise changing the initial target teacher data at least once or more. By learning while changing the weights until the output data at each stage approaches or matches the target teaching data at each stage, the final result when the output data approaches or matches the final target teaching data This pattern recognition device is characterized in that it includes a second weight determining means for determining weights. [Operation] In the pattern recognition device according to the present invention, when teaching a pattern to the pattern recognition device, first, the final goal teaching data set in advance and stored in the memory means is converted into an initial target teacher that is simplified and easy to calculate. The data is changed and set. Then, the input data is input to the pattern recognition device, the obtained output data is compared with the initial target teaching data, and learning is performed by repeatedly changing the weights until they are close to or coincide with each other, and the output data is The first weight determining means determines the weight of each of the connecting portions when the connecting portion is close to or coincides with the initial target teacher data.

次に、前記初期目標教師データが少なくとも１回以上段
階的に前記最終目標教師データに近似して設定され、前
記それぞれの段階毎の目標教師データに近接若しくはー
・致するように上記した処理が同様になされ、最終的に
前記出力データが最終目標教師データと近接若しくは一
致したときに前記連結部の各々の最終的な重みが第２重
み決定手段により決定される．それにより、重み修正計
算が容易になり、ネットワークを増加させる必要もない
．〔実施例〕以下、添付した図面を参照しつつ本発明を具体化した実
施例につき説明し、本発明の理解に供する．ここに第１
図は本発明の一実施例に係るパターン認！Ｉｉ装置を示
すブロック構威図、第２図は同パターン認識装置に具備
されたニエーラルネッｌ・ワークを示す概念図、第３図
は同ニューラルネットワークをｆｌ或する情報処理と連
結部を示す概略構造図、第４図は同二エーラルネッ１・
ワークへの入力データと該入力データに対応する教師デ
ータとの関係を示すグラフ、第５図は同パターン認識装
置による重み修正計算が収束する状態を説明するために
用いられる誤差と重みとの関係を示すグラフ、第６図は
第４図に破線で表した簡略化パターンデータを実線で表
した最終目標教師データに近付ける時のパターン上の各
点の変化を示す図表、第７図は第６図における各ステン
ブ毎の計算の収束に要したループ回数を示すグラフであ
る．尚、下記する実施例は、本発明の技術的範囲を限定
する性格のものではない．本実施例に係るパターン認識装置１は、第１図に示すよ
うに、バノクブ口パゲーションアルゴリズムを用いてパ
ターンＥｌｍを行うパターン連想型のニヱーラルネソト
ワーク２と、常時は該ニューラルネットワーク２ヘパタ
ーン入力データを与えると共に、教示ステップ時には最
終目標教師データを簡素なデータに演算して簡素化デー
タを算出し、該簡素化データを前記ニューラルネットワ
ーク２へ入力し得られた出力データと前記最終目標教師
データとの誤差の値に応じて前記簡素化データを最終目
標教師データに近づける演算制御部４と、前記簡素化デ
ータと最終目標教師データとを一致させた時のニューロ
ン５間を連結する各連結部６に付与された重みと前記最
終目標教師データとを記憶するメモリ３と、前記演算制
御部４と装置外部との間でパターンデータのやりとりを
行う入力ボート４ｃとから構戒されている．前記ニュー
ラルネットワーク２は、第２図に示すように、入力層が
２個のニューロン５から、中間層が１２個のニューロン
５から、出力層が１個のニューロン５からそれぞれなり
、それぞれの層間は連結部６により連結されている．そして、前記ニューロン５（情報処理部）は、第３図に
示すように、連結部６に接続され、重み付けされた連結
部６からのパターン入力データを入力させる入力部７と
、該入力部７に入力された重み付け後のそれぞれのパタ
ーン入力データを総和させ、得られた総和値に閾値処理
を施して、前記総和値がニューロン５毎に予め設定され
た閾値を超えたときに興奮（出力）するようになした演
Ｘ部８と、該演算部８が興奮したときにデータ処理方向
（矢印Ｆ）下流側の層にあるニューロン５にパターン出
力データｙを出力する出力部９とからなっている．そし
て、前記演算部８は、前記教示ステンブ時において一重
みを変更して決定し、重みの学習を前記データ処理方向
の上流側の層毎に順次遡って行うことは言うまでもない
。Next, the initial target teacher data is set to approximate the final target teacher data in at least one step or more, and the above-described process is performed so that the initial target teacher data approaches or approaches the target teacher data at each stage. Similarly, when the output data is close to or coincides with the final target teacher data, the final weight of each of the connected parts is determined by the second weight determining means. As a result, weight correction calculation becomes easy and there is no need to increase the number of networks. [Examples] Examples embodying the present invention will be described below with reference to the attached drawings to provide an understanding of the present invention. here the first
The figure shows pattern recognition according to an embodiment of the present invention! 2 is a conceptual diagram showing the neural network included in the pattern recognition device, and FIG. 3 is a schematic structure showing the information processing and connection parts of the neural network. Fig. 4 shows the same two eral net 1.
A graph showing the relationship between the input data to the workpiece and the teacher data corresponding to the input data, and FIG. 5 is the relationship between the error and the weight used to explain the state in which the weight correction calculation by the pattern recognition device converges. Figure 6 is a graph showing the changes in each point on the pattern when the simplified pattern data shown by the broken line in Figure 4 approaches the final target teacher data shown by the solid line. This is a graph showing the number of loops required to converge the calculation for each stem in the figure. The examples described below are not intended to limit the technical scope of the present invention. As shown in FIG. 1, the pattern recognition device 1 according to the present embodiment includes a pattern associative neural network 2 that performs pattern Elm using the Banokubu pagination algorithm, and a neural network 2 that is always connected to the neural network 2. At the same time, in the teaching step, the final target teacher data is calculated into simplified data to calculate simplified data, and the simplified data is inputted to the neural network 2, resulting in output data and the final target. A calculation control unit 4 that brings the simplified data closer to the final target teacher data according to the error value with respect to the teacher data, and a neuron 5 that connects the simplified data and the final target teacher data when the simplified data matches the final target teacher data. It is monitored by a memory 3 that stores the weights assigned to the connection unit 6 and the final target teacher data, and an input port 4c that exchanges pattern data between the arithmetic and control unit 4 and the outside of the device. ．． As shown in FIG. 2, the neural network 2 has an input layer consisting of two neurons 5, an intermediate layer consisting of 12 neurons 5, and an output layer consisting of one neuron 5. They are connected by a connecting part 6. As shown in FIG. 3, the neuron 5 (information processing section) includes an input section 7 that is connected to the connection section 6 and receives weighted pattern input data from the connection section 6; The weighted pattern input data input to the neuron 5 are summed up, the resulting summation value is subjected to threshold processing, and when the summation value exceeds a threshold set in advance for each neuron 5, excitement (output) is generated. It consists of an operation section 8 configured to do the following: and an output section 9 that outputs pattern output data y to the neuron 5 in the downstream layer in the data processing direction (arrow F) when the operation section 8 is excited. There is. It goes without saying that the arithmetic unit 8 changes and determines one weight during the teaching process, and performs weight learning sequentially for each layer on the upstream side in the data processing direction.

上記したように構威されるパターン認識装置Ｉにおいて
、第４図中に点ａ　−　ｔを結んだ実線で示すパターン
データＡを認識させる場合には、先ずパターンデータＡ
（最終目標教師データ）に近似する簡略化パターンデー
タＢ（初期目標教師データ）が前記演算制御部４によっ
て設定される。かかる簡略化パターンデータＢは予めメ
モリ３に格納されていてもよい．前記簡略化パターンデ
ータＢは後記する学習における収束が能率良く行われる
ように適当に定めればよく、この場合には、図中の点ａ
　　−ｉ’を結んだ破線で示すものが設定されている．
ここで、ニューラルネットワーク２中の入力層における
ニューロン５の内のーっ５，には、便宜上変化しない値
として常に１の値が入力される．無論、必要に応して適
当な規則に従った値を入力してもよい，そして、前記ニ
ューラルネットワーク２へのパターン入力データはそれ
ぞれのパターンａｚｉに対応する値であって、ここでは
０．１〜０．９の値で示される．次に、前記ニューラルネットワーク２による学習の第１
ステップとして、前記パターン入力データＸを前記パタ
ーン認識装ｉｔ１に入力し、出力データｙを演算する．
この時、前記出力データｙが予め設定された前記初期目
標教師データＢに近づけるように前記重みを修正する学
習（演Ｘ）を行う．この学習を全ての入力データに対し
、繰り返し行い、全ての出力データと対応する初期教師
データが、近接若しくは一致した際の前記連結部６にて
付与された重みを初ｌｌＩＪ！重みとして採用する．即
ち、前記演算制御部４と前記演算制御部８とにより初期
重みが決定されるまでのｍ能を実現するのが第１重み決
定手段である．例えば、入力層のニューロン５にパターン入力データＸ
として０．ｌが入力されると、前記パターン入力データ
ｘ（０．１）は演算部８にて所定の演算式で演算された
後にその演算値が連結部６を介してそれぞれ中間層のそ
れぞれのニューロン５に伝達される．この際、連結部６
において、前記演：Ｉ｛ａに或る重みが付加される．そして、中間層のニューａン５の入力部７を経て、重み
の付加された演算値が演算部８に入力され、前記総和処
理及び前記閾値処理によって演算される．この中間層の
それぞれの；ユーロン５で算出された演Ｘ値は、出力部
９に接続された連結部６を介して出力層のニューロン５
に伝達され、前記入力層のニューロン５から中間層のニ
ューロン５に伝達された際と同様に、重み付け・総和・
閾値処理がなされ、その演算値が出力データｙとして出
力層の二二一・ロン５の出力部９から出ノｊされる．そして、前記出力データｙとパターン入力データＸ（０
．１）に対応する初期目標教師データＢ（０．８）とが
比較され、アンマンチの場合には、先ず出力層と中間層
の間の連結部６における重みが変更され、次に中間層と
入力層の間の連結６における重みが変更される．そして
、再度前記手順に基づく処理が行われ、パターン入力デ
ータｘ（０．ｌ）に対応する出力データｙが初期目標教
示データＢ（０．８）に近づくように各連結部６の重み
を変える．続いて、次のパターン入力データｘ（０．２）について
、パターン入力データｘ（０．１．）においてなされた
と同様の処理が行われる．このとき、パターン入力デー
タｘ（０．１）において得られた各連結部６の重みが当
初用いられる．以下、同様にパターン入力データＸが０
．３，　０．４，　・，　０．８，０．９の場合におい
ても、前記した処理が実行される．このようなパターン
入力データＸがＯ．ｌ〜０．９までなされる繰り返し処
理は、最終的に全てのパターン入力データＸについて、
それぞれに対応する出力データｙが初期目標教師データ
Ｂに近接もくしは一致するまで実行される．このような学習は、全ての出力データｙが対応する初期
目標教師データＢと近接若しくは一致した時点で終了し
、第６図の表中に示すステップ１が完了し、そのときの
重みが演算部８により決定される．又、この場合に要し
たループ回数Ｎは、第７図の図中に３１で示すように、
誤差Ｆ．　＜　０．　０１を収束判定条件として約５万
回弱であった．以上が第１ステップの処理であり、その
ときのニューラルネットワーク２のニューロン５間の結
合の強さを示す各連結部６の重みが演算部８により決定
され、メモリ３に格納される．次に、初回の初期目標教師データＢを最終目標教師デー
タＡに任意量近づけた２段階目の初期目標教師データが
設定される．ここでは、ａ点とａ′点，Ｃ点とＣ′点．ｉ点とｉ′点
の値がそれぞれ異なっているので、ａ点，ｃ′点　ｉ／
点の初期教師データＢを最終目標教師データＡ（ａ点０
．２．ｃ点０．２，ｉ点０．８）にそれぞれ０．　１ず
つ近づけた値と、前回の初期目標教師データＢの残りの
値とでもって今回の初期目標教師データが設定される．尚、前記任意量が大きすぎる場合には、重み修正計算が
収束しないことがあり、逆に小さすぎるとニエーラルネ
ットワーク２全体として演算量が膨大なものになるので
初期目標教師データＢと最終目標教示データＡとの差に
応じて適当に決定すればよい．又、ここでは、入力デー
タｘ（０．１．０．３　　０．９）に対応するそれぞれ
の初期目標教師データＢのみを変更したが、これに限ら
ず他の入力データＸに対応する初期目標教師データＢを
適当量加減してもよく、初期目標教師データＢが全体と
して最終目標教師データＡに近づくように調整させれば
よい．今回の初期目標教師データが設定されると、それぞれの
入力データｘ（０．１〜０．９）について前回のステッ
プ１と同様の処理がなされ、出力データｙが今回の初期
目標教師データに近接若しくは一致するまで各重みを変
更して学習される（第６図ステップ２）．尚、今回のル
ープ回数Ｎは約３．５万回であった（第７図３２）．以下、上記したステップｌ及びステップ２と同様の処理
が段階的に実行される．本実施例では．ステンブ７まで
の処理がなされた（第６図のステップ３〜ステップ７及
び第７図の５３〜Ｓ７）．ここでステンブ７では、目標
としていた最終目標教師データＡについて前記したよう
な学習がなされ、全部のパターン入力データｘ　（０．
　１−０．　９　）に対応したそれぞれの出力データｙ
が最終ｒ１＋＝教師データＡに近接若しくは一致すれば
、全ての教示ステンプが完了する．従って、教示のステ
ップ２〜ステップ７において達威される機能を実現する
のが第２重み決定手段である．上記した一連の学習によって求められた各連結部６の重
みがパターンデータＡを認識するための最終的な「重み
」とＬ７てメモリ３に格納されて用いられる．１２Ｉｔ
ち、前記重みが連結部６に設定されたニューラルネット
ワーク２は、全個数若しくは任意の個数のパターン入力
データＸが入力層のニューロン５に入力され、得られた
出力データｙが最終目標教師データＡに一致すれば、パ
ターンＡを認識する．更に、各種・多数のパターンデー
タについて、前記重みを設定し、ニューラルネントワー
ク２の結合の強さを変えておけば、或るパターン入力デ
ータに対する出力データと教師データとが比較され、何
れのパターンであるかをｌｍすることができる．上記したように、本実施例に係るパターン認識装置Ｉは
、パターンデータが収束しにくい複雑なものであっても
、第７図に示す曲線Ｈのようにステップ毎にパターンデ
ータを収束させることができ、そのときのループ回数Ｎ
も多くを必要としない。因みに、バックプロバゲーシタ
ンアルゴリズムのみを用いた従来パターン認識装置では
、曲線Ｇで示すように、大きな誤差Ｅのまま収束判定条
件Ｅ＜０．０１に到達することなく収束しなかった。In the pattern recognition device I configured as described above, when recognizing the pattern data A shown by the solid line connecting points a-t in FIG.
Simplified pattern data B (initial target teacher data) that approximates (final target teacher data) is set by the arithmetic control section 4. Such simplified pattern data B may be stored in the memory 3 in advance. The simplified pattern data B may be appropriately determined so that convergence in learning to be described later is performed efficiently.In this case, point a in the figure
-i' is set as shown by the broken line.
Here, for convenience, a value of 1 is always input to 5 of the neurons 5 in the input layer of the neural network 2 as a value that does not change. Of course, values according to appropriate rules may be input as necessary, and the pattern input data to the neural network 2 is a value corresponding to each pattern azi, which is 0.1 here. It is indicated by a value of ~0.9. Next, the first learning by the neural network 2
As a step, the pattern input data X is input to the pattern recognition device it1, and output data y is calculated.
At this time, learning (operation X) is performed to modify the weights so that the output data y approaches the preset initial target teacher data B. This learning is repeated for all input data, and the weights given by the connection unit 6 when all the output data and the corresponding initial teacher data are close to each other or coincide with each other are calculated as the first llIJ! Adopt as weight. That is, the first weight determining means realizes m functions until the initial weights are determined by the arithmetic control section 4 and the arithmetic control section 8. For example, pattern input data X to neuron 5 of the input layer
as 0. When l is input, the pattern input data x (0.1) is calculated by a predetermined calculation formula in the calculation unit 8, and the calculated value is sent to each neuron 5 of the intermediate layer via the connection unit 6. It is transmitted to At this time, the connecting part 6
In , a certain weight is added to the expression I{a. Then, the weighted calculation value is inputted to the calculation unit 8 via the input unit 7 of the intermediate layer New A 5, and is calculated by the summation processing and the threshold processing. The operation X value calculated in each of the intermediate layers;
The weighting, summation, and
Threshold processing is performed, and the calculated value is output from the output section 9 of the 221-ron 5 in the output layer as output data y. Then, the output data y and the pattern input data X (0
．． 1) is compared with the initial target training data B (0.8), and in the case of unmanifold, first the weight in the connection part 6 between the output layer and the hidden layer is changed, and then the weight in the connection part 6 between the output layer and the input layer is changed. The weights in connections 6 between layers are changed. Then, the process based on the above procedure is performed again, and the weight of each connecting part 6 is changed so that the output data y corresponding to the pattern input data x (0.l) approaches the initial target teaching data B (0.8). ．． Subsequently, the same processing as that performed on the pattern input data x (0.1.) is performed on the next pattern input data x (0.2). At this time, the weight of each connection part 6 obtained in the pattern input data x (0.1) is initially used. Similarly, pattern input data X is 0 below.
．． 3, 0.4, . . , 0.8, 0.9, the above process is also executed. If such pattern input data X is O. The iterative processing performed from l to 0.9 finally yields for all pattern input data X,
The process is executed until the corresponding output data y approaches or matches the initial target teacher data B. Such learning ends when all the output data y are close to or coincide with the corresponding initial target teacher data B, and step 1 shown in the table of FIG. 6 is completed, and the weights at that time are Determined by 8. Also, the number of loops N required in this case is as shown by 31 in FIG.
Error F. <0. Using 01 as the convergence judgment condition, it took just under 50,000 times. The above is the process of the first step, and the weight of each connection section 6 indicating the strength of the connection between the neurons 5 of the neural network 2 at that time is determined by the calculation section 8 and stored in the memory 3. Next, a second stage of initial target teacher data is set in which the initial target teacher data B is brought closer to the final target teacher data A by an arbitrary amount. Here, point a and point a', point C and point C'. Since the values of point i and point i' are different, point a, point c' i/
The initial teacher data B at point is converted to the final target teacher data A (point a 0
．． 2. 0.2 at point c and 0.8 at point i, respectively. The current initial target teacher data is set using the value that is brought closer by 1 and the remaining value of the previous initial target teacher data B. Note that if the arbitrary amount is too large, the weight correction calculation may not converge, and if it is too small, the calculation amount for the neural network 2 as a whole will be enormous, so the initial target teacher data B and the final It may be determined appropriately depending on the difference from the target teaching data A. In addition, here, only the initial target teacher data B corresponding to the input data x (0.1.0.3 0.9) is changed, but this is not limited to changing the initial target corresponding to other input data X. The teacher data B may be adjusted by an appropriate amount, and the initial target teacher data B may be adjusted so as to approach the final target teacher data A as a whole. Once this initial target teacher data is set, the same processing as in the previous step 1 is performed for each input data x (0.1 to 0.9), and the output data y is close to the current initial target teacher data. Otherwise, the weights are learned by changing each weight until they match (Step 2 in Figure 6). Note that the number of loops N this time was approximately 35,000 times (Figure 7, 32). Thereafter, the same processes as Step 1 and Step 2 described above are executed step by step. In this example. Processing up to stencil 7 has been completed (steps 3 to 7 in FIG. 6 and 53 to S7 in FIG. 7). Here, in Step 7, the above-described learning is performed on the final target teacher data A, and all pattern input data x (0.
1-0. 9) Each output data y corresponding to
If it is close to or coincides with the final r1+=teacher data A, all teaching steps are completed. Therefore, the second weight determining means realizes the functions achieved in steps 2 to 7 of teaching. The weight of each connection part 6 obtained through the above-described series of learning is stored in the memory 3 and used as the final "weight" L7 for recognizing the pattern data A. 12It
In the neural network 2 in which the weights are set in the connection unit 6, all or any number of pattern input data X are input to the neurons 5 of the input layer, and the obtained output data y is the final target teacher data A. If it matches, pattern A is recognized. Furthermore, by setting the weights and changing the strength of the connections in the neural network 2 for various and large numbers of pattern data, the output data for a certain pattern input data and the teacher data can be compared, and which pattern You can lm whether it is. As described above, the pattern recognition device I according to this embodiment can converge the pattern data step by step as shown by the curve H shown in FIG. 7 even if the pattern data is complex and difficult to converge. possible, and the number of loops at that time N
You don't need much either. Incidentally, in the conventional pattern recognition device using only the backpropagation algorithm, as shown by the curve G, the convergence criterion condition E<0.01 was not reached with a large error E and the convergence was not achieved.

この発明では、前記パターン入力データＸと初期目標教
師データＢが用いられ、行き威り最終目標教師データが
用いられることはない．それは、複雑なデータ、例えば
多峰性のデータＣを当初より用いれば、第５図に示すよ
うに、誤差Ｅを最小限にするときの重みＷの解を得るた
めに、演算開始点ｍから真の最適点Ｐに到達するように
演算が繰り返されても、局所最適点ｎに至った時にあた
かも収束したかのような判断がなされ、局所最適点ｎ以
下の誤差で収束することのない場合があるからである．そこで、データＣに近似するデータＤを設定し、データ
Ｃの演算開始点ｍに対応するデータＤの演算開始点ｑか
ら演算を繰り返し、データＤの最適解である点ｒを求め
る．そして、前記点ｒに対応するデータＣの点０に戻り
、再度演算を繰り返すと、データＣにおける最適解であ
る点Ｐを求めることができる．以上は、本発明の概要の
一部を抽象的に説明したものである．上記した実施例では、実質的にパターンデータＸを入力
できる入力層のニューロン５はひとつであったが、所定
の数、例えばパターン入力データの個数分（上記実施例
では９個）配設してもよい。In this invention, the pattern input data X and the initial target teacher data B are used, and the final target teacher data is not used. If complex data, for example multimodal data C, is used from the beginning, as shown in Fig. Even if the calculation is repeated to reach the true optimal point P, when the local optimal point n is reached, it is judged as if it had converged, and the process does not converge with an error less than the local optimal point n. This is because there is. Therefore, data D that approximates data C is set, and the calculation is repeated from the calculation start point q of data D that corresponds to the calculation start point m of data C, to find the point r that is the optimal solution for data D. Then, by returning to point 0 of data C corresponding to the point r and repeating the calculation again, point P, which is the optimal solution for data C, can be found. The above is an abstract explanation of a part of the outline of the present invention. In the above-mentioned embodiment, there is one neuron 5 in the input layer that can substantially input the pattern data Good too.

又、中間層のニューロン５はなくてもよく、逆に中間層
を複数層としても構わない。Furthermore, the neurons 5 in the intermediate layer may be omitted, and conversely, the intermediate layer may have a plurality of layers.

更に、上記実施例のニエ五ラルネントワーク２では、連
結部６が上下層のニューロン５を連結している、入力層
と出力層のそれぞれのニューロン５を直接連結したもの
であってもよく、１のニュ一ロン５の入力部７と出力部
９とを自己連結したもの（第３図２点鎖線６′）も採用
することができる．そして、本実施例において、重み修正計算の収束判定条
件として誤差Ｅ＜０．０１の条件を用いたが、それに限
定されるものではなく、任意の収束判定条件でも適用で
きることを付記しておく．〔発明の効果〕本発明によれば、データの入力部，演算部および出力部
を有する１又は複数の情報処理部、ならびに該情報処理
部の出力部と入力部を連結するとともに該出力部よりの
出力情報に重みを付加して前記入力部に伝達するための
連結部を具備してなるパターン認識装置において、予め
設定された目標教師データを記憧するメモリ手段と、入
力データを前記パターン認識装置に入力し演算すること
により、その出力データが予め設定された初期目標教示
データに近接もしくは一致するまで重みを変えつつ学習
して、近接もしくは一致した際の前記連結部の各々の重
みを決定する第１重み決定手段と、前記初期目標教師デ
ータを少なくとも１回以上段階的に変更して最終目標教
師データに近づけるとともに各段階における出力データ
が各段階における目標教師データと近接もしくは一致す
るまで前記重みを変えつつ学習することによって前記出
力データが最終目標教示データに近接もしくは一致した
際の最終重みを決定する第２重み決定手段とを具備して
なることを特徴とするパターン認識装置であるから、必
要以上にニューラルネットワークを増大させることなく
、重み修正計算を確実且つ高速に収束させることができ
る。Furthermore, in the neural network 2 of the above embodiment, the connecting part 6 connects the neurons 5 of the upper and lower layers, and may be one in which the neurons 5 of the input layer and the output layer are directly connected, It is also possible to adopt a structure in which the input section 7 and output section 9 of the neuron 5 of No. 1 are self-connected (as shown by the two-dot chain line 6' in Fig. 3). In this embodiment, the condition of error E<0.01 was used as the convergence judgment condition for the weight correction calculation, but it should be noted that the condition is not limited to this and any convergence judgment condition can be applied. [Effects of the Invention] According to the present invention, one or more information processing sections each having a data input section, a calculation section, and an output section, and an output section and an input section of the information processing section are connected, and a A pattern recognition device comprising a connection section for adding weight to output information of and transmitting it to the input section, comprising a memory means for storing preset target teacher data, and a memory means for storing the input data to the pattern recognition section. By inputting the data into the device and performing calculations, learning is performed while changing the weight until the output data approaches or matches the preset initial target teaching data, and determines the weight of each of the connected parts when the output data approaches or matches the initial target teaching data set in advance. a first weight determining means for changing the initial target teacher data stepwise at least once or more to bring it closer to the final target teacher data, and changing the initial target teacher data at least once until the output data at each stage is close to or coincides with the target teacher data at each stage. This is a pattern recognition device characterized by comprising a second weight determining means that determines a final weight when the output data is close to or coincides with the final target teaching data by learning while changing the weight. , it is possible to reliably and quickly converge weight correction calculations without increasing the neural network more than necessary.

[Brief explanation of drawings]

第１図は本発明の一実施例に係るパターン認識装置を示
すブロック構成図、第２図は同パターン認識装置に具備
されたニューラルネットワークを示す概念図、第３図は
同ニューラルネットワークを横威する情報処理と連結部
を示す概略構造図、第４図は同ニューラルネットワーク
への入力データと該入力データに対応する教師データと
の関係を示すグラフ、第５図は同パターン認識装置によ
る重み修正計算が収束する状態を説明するために用いら
れる誤差と重みとの関係を示すグラフ、第６図は第４図
に破線で表した簡略化パターンデータを実線で表した最
終目標教師データに近付ける時のパターン上の各点の変
化を示す図表、第７図は第６図における各ステンブ毎の
計算の収束に要したループ回数を示すグラフ、第８図は
ニューロンと連結部との結合状態を示す概念図、第９図
は前記ニューロン及び連結部の数学モデルを示す説明図
、第１０図はバックブロバゲーシゴンアルゴリズムによ
るニューラルネットワークの演算処理概要を示す状態説
明図、第１ｌ図は別のニューラルネットワークの例を示
すネットワーク構戒図である．〔符号の説明〕ｌ・・・パターン認識装置２２ａ，２ｂ・・・ニューラルネットワーク３・・・メ
モリ４・・・演算制Ｉ１部（第１及び第２重み設定手段）５
．５，・・・ニエーロン（情報処理部）６．６′・・・
連結部７・・・入力部８・・・演算部（第１及び第２重み決定手段）９・・・
出力部FIG. 1 is a block configuration diagram showing a pattern recognition device according to an embodiment of the present invention, FIG. 2 is a conceptual diagram showing a neural network included in the pattern recognition device, and FIG. 3 is a block diagram showing the neural network installed in the pattern recognition device. Figure 4 is a graph showing the relationship between the input data to the neural network and the teacher data corresponding to the input data, and Figure 5 is the weight correction by the pattern recognition device. A graph showing the relationship between error and weight used to explain the state in which calculation converges. Figure 6 shows when the simplified pattern data shown by the broken line in Figure 4 approaches the final target teacher data shown by the solid line. Figure 7 is a graph showing the number of loops required for convergence of calculation for each stem in Figure 6, Figure 8 shows the connection state between neurons and connections. Conceptual diagram, Figure 9 is an explanatory diagram showing the mathematical model of the neuron and the connection part, Figure 10 is a state diagram showing an outline of the calculation processing of the neural network based on the backblowing algorithm, and Figure 1l is an explanatory diagram showing the mathematical model of the neuron and the connection part. This is a network configuration diagram showing an example of a network. [Explanation of symbols] l...Pattern recognition devices 22a, 2b...Neural network 3...Memory 4...Arithmetic system I1 section (first and second weight setting means) 5
．． 5,... Nieron (Information Processing Department) 6.6'...
Connecting unit 7...Input unit 8...Calculating unit (first and second weight determining means) 9...
Output section

Claims

[Claims]

(1) 1 having a data input section, a calculation section, and an output section
or a connecting section for connecting a plurality of information processing sections and an output section and an input section of the information processing sections, and for adding weight to output information from the output section and transmitting the weighted information to the input section. A pattern recognition device includes a memory means for storing preset target teaching data, and input data is inputted to the pattern recognition device and calculated so that the output data is close to or coincides with the preset initial target teaching data. a first weight determining means that learns while changing the weight until the first weight is determined, and determines the weight of each of the connected parts when they are close or coincident; The final weight when the output data approaches or matches the final target teaching data by learning while changing the weights until the output data at each stage approaches or matches the target teaching data at each stage. A pattern recognition device comprising second weight determining means for determining.