JPH0680505B2

JPH0680505B2 - Learning method for multilayer neural network

Info

Publication number: JPH0680505B2
Application number: JP63250915A
Authority: JP
Inventors: 信也細木
Original assignee: Agency of Industrial Science and Technology
Current assignee: National Institute of Advanced Industrial Science and Technology AIST
Priority date: 1988-10-06
Filing date: 1988-10-06
Publication date: 1994-10-12
Anticipated expiration: 2009-10-12
Also published as: JPH0298770A

Description

【発明の詳細な説明】〔概要〕多層神経回路網の学習を行う学習方式に関し、中間層に興奮性荷重および抑制性荷重を設け、第１層に
入力パターンを入力して自己学習させることにより、入
力パターンに対して特異的に反応する要素を自動形成さ
せ、冗長性、汎化能力を向上させることを目的とし、入力パターンを非線型処理する第１層と、この第１層か
らのパターンV⁽¹⁾の全てに対して演算するための学習し
得る興奮性荷重、およびモニタセルからのパターンに対
して演算するための学習し得る抑制性荷重、上記興奮性
荷重と上記抑制性荷重とについてそれぞれ演算したパタ
ーンの総和を求める和回路、この和回路によって求めた
パターンを非線型処理する第２層と、この第２層からの
パターンV⁽²⁾に所定の荷重を演算し、これら演算したパ
ターンの総和を求める和回路からなる第３層と、上記第
１層からのパターンV⁽¹⁾および上記第２層からのパター
ンV⁽²⁾の和を求めて非線型処理を行い、その結果のパタ
ーンを上記第２層（中間層）の抑制性荷重と演算させる
モニタセルとを備え、第１層からのパターンV⁽¹⁾および
第２層のパターンV⁽²⁾に基づいて該当する興奮性荷重の
値を更新（増分・減分）すると共に、第１層からのパタ
ーンV⁽¹⁾の総和および第２層のパターンV⁽²⁾の総和に基
づいて該当する抑制性荷重（４）の値を更新（増分・減
分）する自己学習を行うようち構成する。また、第３層
の荷重に対して誤差分に対応するパターンによって、当
該荷重を更新（増分・減分）して誤差修正学習し得るよ
うに構成する。DETAILED DESCRIPTION OF THE INVENTION [Outline] A learning method for learning a multi-layer neural network, in which an excitatory load and an inhibitory load are provided in an intermediate layer, and an input pattern is input to the first layer to perform self-learning. , A first layer for nonlinearly processing an input pattern and a pattern from this first layer for the purpose of automatically forming elements that specifically react to the input pattern and improving redundancy and generalization ability. Regarding the excitable load that can be learned for all V ⁽¹⁾ , and the restrainable load that can be learned for computing the pattern from the monitor cell, the excitatory load and the inhibitory load OR circuit for obtaining the sum of each operation pattern, computed and a second layer of non-linear processing a pattern determined by the OR circuit, a predetermined load pattern V ⁽²⁾ from the second layer, and these calculation A third layer consisting of OR circuit for obtaining the sum of the pattern, performs non-linear processing calculates the sum of the pattern V ⁽²⁾ from the pattern V ⁽¹⁾ and the second layer from the first layer, as a result It said second layer a pattern of a monitor cell for calculating the inhibitory load (intermediate layer), excitatory appropriate based on the pattern V ⁽¹⁾ and pattern V of the second layer ⁽²⁾ from the first layer The value of the load is updated (increment / decrement ⁾ , and based on the sum of the pattern V ⁽¹⁾ from the first layer and the sum of the pattern V ⁽²⁾ of the second layer, the applicable inhibitory load (4) is It is configured to perform self-learning to update (increment / decrement) the value. In addition, the load is updated (incremented / decremented) by a pattern corresponding to the error with respect to the load of the third layer, and error correction learning can be performed.

[Industrial application field]

本発明は、多層神経回路網の学習を行う学習方式、特に
中間層を設けて自己学習を行うと共に誤差学習を行い得
るように構成した多層神経回路網の学習方式に関するも
のである。The present invention relates to a learning method for learning a multi-layered neural network, and more particularly to a learning method for a multi-layered neural network configured to provide an intermediate layer for self-learning and error learning.

[Problems to be Solved by Conventional Techniques and Inventions]

近年、小脳を模式した多層神経回路網を適用したロボッ
トマニュピュレータの学習方式として、バックプロパゲ
ーション法、誤差修正法などが提案されている。バック
プロパゲーション法は、中間層も最終層と同じアルゴリ
ズムで学習可能である点で優れているが、記憶可能な入
力パターン数、冗長性（細胞の一部に故障が発生しても
学習により正常な動作が可能となる性質）、汎化能力
（学習された入力パターンとは異なるが、それに近い入
力パターンがきた場合に、近い出力が得られる能力）な
どの点で未だ十分とは言えない問題がある。また、誤差
修正法は、最終層のみについて学習が行われ、中間層の
構成は先見的な知識に基づいていわば固定的であり、自
己学習し得ないという問題がある。In recent years, a back propagation method, an error correction method, etc. have been proposed as a learning method for a robot manipulator to which a multilayered neural network that models the cerebellum is applied. The backpropagation method is excellent in that the middle layer can be learned with the same algorithm as the last layer, but the number of memorable input patterns and redundancy (normal even if a part of a cell fails due to learning) Is not yet sufficient in terms of generalization ability (the ability to perform various actions) and generalization ability (the ability to obtain a similar output when an input pattern that is different from the learned input pattern, but similar to the learned input pattern). There is. In addition, the error correction method has a problem in that learning is performed only in the final layer, and the structure of the intermediate layer is fixed based on the a priori knowledge, and cannot be self-learned.

本発明は、中間層に興奮性荷重および抑制性荷重を設
け、第１層に入力パターンを入力して自己学習させるこ
とにより、入力パターンに対して特異的に反応する要素
を自動形成させ、冗長性、汎化能力を向上させることを
目的としている。According to the present invention, an excitatory load and an inhibitory load are provided in the middle layer, and an input pattern is input to the first layer for self-learning, whereby elements that specifically react to the input pattern are automatically formed and redundant. Its purpose is to improve sex and generalization ability.

[Means for solving the problem]

第１図を参照して課題を解決する手段を説明する。 Means for solving the problems will be described with reference to FIG.

第１図において、第１層は、入力パターンを非線型処理
１するものである。In FIG. 1, the first layer is for performing non-linear processing 1 on the input pattern.

第２層（中間層）は、第１層からのパターンV⁽¹⁾の全て
に対して演算するための学習し得る興奮性荷重２、およ
びモニタセル６からのパターンに対して演算するための
学習し得る抑制性荷重４、興奮性荷重２と抑制性荷重４
とについてそれぞれ演算したパターンの総和を求める和
回路３、この和回路３によって求めたパターンを非線型
処理５するものである。The second layer (intermediate layer) has a learnable excitatory load 2 for computing all patterns V ⁽¹⁾ from the first layer, and a learning for computing patterns from the monitor cell 6. Possible inhibitory load 4, excitatory load 2 and inhibitory load 4
A summing circuit 3 for calculating the sum of the patterns calculated for and, and a non-linear processing 5 for the patterns calculated by the summing circuit 3.

第３層は、第２層からのパターンV⁽²⁾に所定の荷重７を
演算し、これら演算したパターンの総和を求める和回路
８からなるものである。The third layer is composed of a summing circuit 8 which calculates a predetermined load 7 on the pattern V ⁽²⁾ from the second layer and calculates the sum of these calculated patterns.

モニタセル６は、第１層からのパターンV⁽¹⁾および第２
層からのパターンV⁽²⁾の和を求めて非線型処理を行い、
その結果のパターンを第２層（中間層）の抑制性荷重４
と演算させるものである。The monitor cell 6 includes the pattern V ⁽¹⁾ and the second pattern from the first layer.
Non-linear processing is performed by summing the patterns V ⁽²⁾ from the layers,
The resulting pattern is the restraining load 4 of the second layer (intermediate layer).
Is calculated.

[Action]

本発明は、第１図に示すように、第１層が入力パターン
について非線型処理を行ってパターンV⁽¹⁾を送出し、第
２層がこれらのパターンV⁽¹⁾に興奮性荷重２を演算した
結果と、モニタセル６からのパターンに抑制性荷重４を
演算した結果との和を求め、更に非線形処理５を行って
パターンV⁽²⁾を送し、第３層がこのパターンV⁽²⁾に荷重
７を演算して和を求めて出力パターン（例えば第７図T
n）を出力するようにしている。この際、第２層へのパ
ターンV⁽¹⁾および第２層からのパターンV⁽²⁾に基づい
て、第２層の要素の興奮性荷重２の値を更新（増分・減
分）し、また、モニタセル６が、第１層からのパターン
V⁽¹⁾の和および第２層からのパターンV⁽²⁾に基づいて、
第２層の要素の抑制性荷重４の値を更新（増分・減分）
し、発火要素数を抑制することにより、自己学習（入力
パターンに対し特異的に反応する要素を形成）するよう
にしている。更に、第３層の荷重７について、誤差を修
正するように更新（増分・減分）する学習を行うように
している（第７図、第８図参照）。According to the present invention, as shown in FIG. 1, the first layer performs non-linear processing on an input pattern and sends out a pattern V ⁽¹⁾ , and the second layer sends an excitable load 2 to these patterns V ^(1). and results of calculation, and calculates the sum of the calculation result of the inhibitory load 4 to the pattern of the monitor cell 6, to further feed the pattern V ⁽²⁾ performs a nonlinear process 5, the third layer is the pattern V ^{( 2) The} load 7 is calculated and the sum is calculated to output the output pattern (for example, FIG. 7 T
n) is output. At this time, based on the pattern V ^{(1) to} the second layer and the pattern V ⁽²⁾ from the second layer, the value of the excitatory load 2 of the element of the second layer is updated (increment / decrement), In addition, the monitor cell 6 is a pattern from the first layer.
Based on the sum of V ⁽¹⁾ and the pattern V ⁽²⁾ from the second layer,
Update the value of the restraining load 4 of the element of the 2nd layer (increment / decrement)
However, by suppressing the number of firing elements, self-learning (forming elements that react specifically to the input pattern) is performed. Further, with respect to the load 7 of the third layer, learning for updating (increment / decrement) so as to correct the error is performed (see FIGS. 7 and 8).

従って、入力パターンを第１層に入力して中間層に設け
た興奮性荷重２および抑制性荷重４について自己学習さ
せることにより、入力に適応した結線が自動的に行われ
て入力パターンに対して特異的に反応する要素（細胞）
を形成することが可能となると共に、冗長性、汎可能力
を得ることが可能となる。更に、最終層である第３層の
荷重７に対して誤差修正学習を行わせる。Therefore, by inputting the input pattern to the first layer and self-learning the excitatory load 2 and the inhibitory load 4 provided in the intermediate layer, the wiring adapted to the input is automatically performed, and the input pattern is connected to the input pattern. Elements (cells) that react specifically
Can be formed, and at the same time, redundancy and universal power can be obtained. Further, the error correction learning is performed on the load 7 of the third layer which is the final layer.

〔実施例〕次に、第１図から第８図を用いて本発明の１実施例の構
成および動作を順次詳細に説明する。[Embodiment] Next, the configuration and operation of one embodiment of the present invention will be sequentially described in detail with reference to FIGS. 1 to 8.

第１図において、第１層は、入力パターンを非線型処理
１する要素から構成されている。この非線型処理１は、
例えば下式で表される非線型処理を行う。In FIG. 1, the first layer is composed of elements that perform nonlinear processing 1 on the input pattern. This non-linear processing 1
For example, non-linear processing represented by the following formula is performed.

第２層（中間層）は、興奮性荷重２、和回路３、抑制性
荷重４、非線型処理５を持つ複数の要素（例えば500
個）から構成されている。ここで、和回路３は、第１層
からのパターンV⁽¹⁾の全てについて興奮性荷重２をそれ
ぞれ演算した値と、モニタセル６からのパターンについ
て抑制性荷重４を演算した値との和を基めるものであ
る。 The second layer (intermediate layer) is a plurality of elements having excitatory load 2, sum circuit 3, inhibitory load 4, and nonlinear processing 5 (for example, 500).
Individual)). Here, the sum circuit 3 calculates the sum of the value obtained by calculating the excitatory load 2 for all the patterns V ⁽¹⁾ from the first layer and the value obtained by calculating the inhibitory load 4 for the pattern from the monitor cell 6. It is the basis.

第３層は、第２層からのパターンV⁽²⁾に所定の荷重（誤
差修正学習した荷重）７を演算し、これら演算したパタ
ーンの総和を求める和回路８から構成されている。The third layer is composed of a summing circuit 8 that calculates a predetermined load (a load that has been subjected to error correction learning) 7 on the pattern V ⁽²⁾ from the second layer and calculates the sum of these calculated patterns.

モニタセル６は、第１層からのパターンV⁽¹⁾および第２
層からのパターンV⁽²⁾の総和を求めて非線型処理を行う
ものである。The monitor cell 6 includes the pattern V ⁽¹⁾ and the second pattern from the first layer.
Non-linear processing is performed by obtaining the sum of patterns V ⁽²⁾ from the layers.

第２図は、第１図第２層の１つの要素（細胞）構成例を
示す。ここで、Vj⁽¹⁾は第１層からのパターン、Vi⁽²⁾は
第２層から出力されるパターン、Wijは興奮性荷重、Wig
は抑制性荷重、Σは和回路、θｉは閾値、∫は非線型処
理を表す。FIG. 2 shows an example of the constitution of one element (cell) in the second layer of FIG. Here, Vj ⁽¹⁾ is the pattern from the first layer, Vi ⁽²⁾ is the pattern output from the second layer, Wij is the excitatory load, Wig
Represents an inhibitory load, Σ represents a sum circuit, θi represents a threshold, and ∫ represents nonlinear processing.

次に、第３図を用いて、本発明の全体概念を説明する。Next, the overall concept of the present invention will be described with reference to FIG.

第３図において、signalは後述する第７図に示すような
平面２関節マニュピレータの理想軌道についての入力
（関節角とその速度）である。In FIG. 3, signal is an input (joint angle and its velocity) about an ideal trajectory of a plane two-joint manipulator as shown in FIG. 7 described later.

Gaussian filterは、ガウスフィルタであって、第４図
を用いて後述するように、入力パターンから第４図
（ハ）上段に示すような位相幅Δａ（＝40゜）、間隔10
゜からなる信号成分を抽出するものである。The Gaussian filter is a Gaussian filter, and as will be described later with reference to FIG. 4, a phase width Δa (= 40 °) and an interval of 10 from the input pattern as shown in the upper part of FIG.
This is to extract a signal component consisting of °.

第１層は、第１図非線型処理から構成されている。The first layer comprises the non-linear processing of FIG.

第２層（中間層）は、第１図第２層に示す構成を持ち、
self-organization（自己学習）を実行する層である。The second layer (intermediate layer) has the structure shown in FIG.
It is a layer that executes self-organization.

第３層は、第１図第３層に示す構成を持ち、error-corr
ection（誤差修正学習）を実行する層である。The third layer has the structure shown in FIG. 1 and the third layer, and error-corr
This is the layer that executes ection (error correction learning).

モニタセル６は、第１層からのパターンV⁽¹⁾および第２
層からのパターンV⁽²⁾に基づいて、第２層（中間層）の
抑制を行うものである。The monitor cell 6 includes the pattern V ⁽¹⁾ and the second pattern from the first layer.
The second layer (intermediate layer) is suppressed based on the pattern V ⁽²⁾ from the layer.

第４図を用いて、第１層へ入力する入力パターンの生成
例について説明する。An example of generating an input pattern to be input to the first layer will be described with reference to FIG.

第４図（イ）はガウスフィルタの動作説明を示す。これ
は、第７図平面２関節マニピュレータのものであって、
第４図（ハ）上段の関節角θに示すように、 −60゜ないし＋60゜（関節角の範囲） Δａ＝40゜（半値幅）フィルタ数＝16 とした場合、第４図（ロ）に示すように指示された関節
角θを入力として、16個のガウスフィルタ（半値幅Δａ
＝40゜、間隔＝10゜の特性を持つ合計16個のガウスフィ
ルタ）によって、合計16個の要素を持つ入力パターンを
生成する。また、第４図（ハ）速度についても同様に、
24個のガウスフィルタ（半値幅Δｖ＝240度、間隔50度
／秒の特性を持つ合計24個のガウスフィルタ）によっ
て、合計24個の要素を持つ入力パターンを生成する。そ
して、第７図平面２関節マニュピレータの場合には、第
５図関節角および速度が２組となるから、合計（16＋24）×２＝80 の入力パターンを第３図第１層（第１図第１層）に入力
する。これら入力した80個の要素の入力パターンについ
てそれぞれ既述した非線型処理を行ったパターンV⁽¹⁾を
第２層に入力する。FIG. 4A shows the operation of the Gaussian filter. This is that of the plane 2 joint manipulator of FIG.
As shown in the joint angle θ in the upper part of Fig. 4 (c), -60 ° to + 60 ° (range of joint angle) Δa = 40 ° (half width) When the number of filters = 16, Fig. 4 (b) Input the joint angle θ instructed as shown in, and enter 16 Gaussian filters (half-value width Δa
= 40 °, 16 = a total of 16 Gaussian filters with intervals = 10 °) to generate an input pattern with a total of 16 elements. In addition, similarly for the speed shown in FIG.
An input pattern having a total of 24 elements is generated by the 24 Gauss filters (a total of 24 Gauss filters having a half width Δv = 240 degrees and an interval of 50 degrees / second). In the case of the plane two-joint manipulator shown in FIG. 7, since there are two sets of joint angles and velocities shown in FIG. 5, a total of (16 + 24) × 2 = 80 input patterns are shown in FIG. Input in the 1st layer). The pattern V ⁽¹⁾ that has been subjected to the nonlinear processing described above for the input patterns of these 80 input elements is input to the second layer.

次に、第５図式（１）から（４）を用いて、第２層（中
間層）の自己学習について詳細に説明する。Next, the self-learning of the second layer (intermediate layer) will be described in detail by using the fifth equations (1) to (4).

第５図において、式（１）の左辺のτdwij⁽²⁾/dtは興奮
性荷重の増分を示し、式（２）の右辺のτdwig/dtは抑
制性荷重の増分を示し、式（３）の左辺のVgはモニタセ
ル６からの抑制パターンを示し、式（４）のVi⁽²⁾は第
２層からのパターンを示す。また、τは時定数、c₁、c₂
は学習の速度や収束値を決める学習パラメータを示す。
以下自己学習の手順を説明する。In FIG. 5, τdwij ⁽²⁾ / dt on the left side of equation (1) indicates the increment of excitatory load, τdwig / dt on the right side of equation (2) indicates the increment of inhibitory load, and equation (3) Vg on the left-hand side of FIG. 4 shows the suppression pattern from the monitor cell 6, and Vi ^{(2) in} equation (4) shows the pattern from the second layer. Also, τ is the time constant, c ₁ , c ₂
Indicates a learning parameter that determines the learning speed and the convergence value.
The procedure of self-learning will be described below.

第１に、式（１）の興奮性荷重wij⁽²⁾および式（２）の
抑制性荷重wigは、当初ランダム（例えば一様乱数）に
与える。First, the excitatory load wij ⁽²⁾ of the equation (1) and the inhibitory load wig of the equation (2) are initially given randomly (for example, uniform random numbers).

第２に、第４図ガウスフィルタを通ってきた入力パター
ン（例えば平面２関節マニュピレータの関節角（θ_１、
θ_２）、速度（V₁、V₂）から生成した80個の要素を持つ
入力パターン）が、第１層の非線形処理１によって非線
型の変換を受け、第２層へのパターンV⁽¹⁾となる。Second, the input pattern that has passed through the Gaussian filter shown in FIG. 4 (for example, the joint angle (θ ₁ , the joint angle of the plane two-joint manipulator,
θ ₂ ), velocity (V ₁ , V ₂ ) generated input pattern with 80 elements) undergoes non-linear conversion by the non-linear processing 1 of the first layer, and the pattern V ^{(1 )} .

第３に、この第２層へのパターンV⁽¹⁾について、当初ラ
ンダムに設定し、その後学習する興奮性荷重wij⁽²⁾およ
び抑制性荷重wigの値に基づいて、式（４）に従って第
２層から出力されるパターンV⁽²⁾を生成する。ここで、
θｉは閾値である。Thirdly, based on the values of the excitatory load wij ⁽²⁾ and the inhibitory load wig, which are initially set at random for the pattern V ⁽¹⁾ for the second layer, the second pattern is calculated according to the equation (4). Generate the pattern V ⁽²⁾ output from the second layer. here,
θi is a threshold.

第４に、式（１）によって興奮性荷重の増分“Δwi
j⁽²⁾"を行う。この式（１）によって表される増分“Δw
ij⁽²⁾"は（シナップス値の上昇は）、第２層のある要素
（細胞、Vi⁽²⁾）が発火し、かつ第１層のある要素（細
胞、Vj⁽¹⁾）が発火した時に、その積に比例する。一
方、第２層のある要素が発火していない時や第１層から
の入力がない時には、減少する。当初ランダムな興奮性
荷重wij⁽²⁾に設定しても、種々の特異的な入力パターン
を入力して繰り返し学習を行って更新することにより、
式（１）の右辺の括弧の中が零となるような安定平衡状
態に収束するように自動的に第２層の結線が形成され
る。Fourthly, according to the equation (1), the increment of excitatory load “Δwi
j ⁽²⁾ ". The increment" Δw represented by this equation (1)
"ij ⁽²⁾ " (increased synapse value) fired an element in the second layer (cell, Vi ⁽²⁾ ) and an element in the first layer (cell, Vj ⁽¹⁾ ). Sometimes it is proportional to the product, on the other hand, it decreases when some element of the second layer is not firing or when there is no input from the first layer. Initially set to random excitatory load wij ⁽²⁾ Also, by inputting various specific input patterns and performing repeated learning and updating,
The connection of the second layer is automatically formed so as to converge to a stable equilibrium state where the value in the parenthesis on the right side of Expression (1) becomes zero.

第５に、式（２）によって抑制性荷重の増分“Δwig"を
行う。この式（２）によって表される増分“Δwig"は、
第２層のある要素（細胞、Vi⁽²⁾が発火し、かつ第１層
からの出力パターンVi⁽¹⁾の総和および第２層からのパ
ターンVj⁽²⁾の総和が所定閾値θ_０よりも大きい時に、
その積に比例する。一方、第２層のある要素が発火して
いない時や、総和が小さい時には、減少する。当初ラン
ダムな抑制性荷重wigに設定しても、種々の特異的な入
力パターンを入力して学習を行って更新することによ
り、式（２）の右辺の括弧の中が零となるような安定平
衡状態に収束するように自動的に第２層の抑制が制御さ
れる。即ち、第２層（中間層）の要素（細胞）が発火し
すぎると、式（３）のVgが大となり、式（２）によって
抑制性荷重が大きくなって第２層の総発火数を抑えるよ
うに制御する。一方、総発火数が少ない時には、多くな
るように制御する。更に、第２層からのパターンV⁽²⁾の
値が小さくなってくると、式（１）、式（２）の右辺に
かかっているV⁽²⁾によって、興奮性荷重および抑制性荷
重の増分が小さくなるように制御される。Fifth, the inhibitory load increment “Δwig” is performed by the equation (2). The increment “Δwig” represented by this equation (2) is
A certain element of the second layer (cell, Vi ⁽²⁾ is fired, and the sum of output patterns Vi ⁽¹⁾ from the first layer and the sum of patterns Vj ⁽²⁾ from the second layer are more than a predetermined threshold θ ₀ . Is also large,
Proportional to the product. On the other hand, it decreases when a certain element of the second layer is not firing or when the sum is small. Even if initially set to random suppressive load wig, by inputting various peculiar input patterns and performing learning and updating, stability such that the value in the parenthesis on the right side of equation (2) becomes zero The suppression of the second layer is automatically controlled so as to converge to the equilibrium state. That is, when the elements (cells) of the second layer (intermediate layer) are overly ignited, Vg of the equation (3) becomes large, and the inhibitory load becomes large according to the equation (2), and the total number of firings of the second layer is increased. Control to suppress. On the other hand, when the total number of ignitions is small, it is controlled to increase. Further, when the value of the pattern V ⁽²⁾ from the second layer becomes smaller, V ⁽²⁾ applied to the right side of the equations (1) and (2) reduces the excitatory load and the inhibitory load. The increment is controlled to be small.

次に、第６図は、中間層（第２層）の応答例のシミュレ
ーション結果を模式的に表したものである。横軸に入力
パターンを示し、縦軸は細胞（要素）の種類を示す。図
中の横棒が発火した細胞、即ち入力パターンに対して特
異的に反応する細胞を示す。この応答例から入力パター
ンに対応してほぼ一様に当該入力パターンの検出細胞
（要素）が形成されたことが判明する（90％以上）。ま
た、検出細胞（第２層の要素）は、１個の入力パターン
に反応するものから、類似する数個の入力パターンに反
応するものまで分布している様子が判る。Next, FIG. 6 schematically shows a simulation result of a response example of the intermediate layer (second layer). The horizontal axis represents the input pattern, and the vertical axis represents the type of cell (element). The horizontal bar in the figure indicates a fired cell, that is, a cell that specifically reacts to the input pattern. From this response example, it is found that the detected cells (elements) of the input pattern were formed substantially uniformly (90% or more) corresponding to the input pattern. Further, it can be seen that the detected cells (elements of the second layer) are distributed from those that respond to one input pattern to those that respond to several similar input patterns.

次に、第７図および第８図を用いて、本発明を平面２関
節マニュピレータに適用した場合の構成およびシミュレ
ーション結果を説明する。Next, a configuration and a simulation result when the present invention is applied to a planar two-joint manipulator will be described with reference to FIGS. 7 and 8.

第７図は、本発明を平面２関節マニュピレータに適用し
た場合の応用例を示す。この制御対象の力学的モデル
は、下式で表せる。FIG. 7 shows an application example when the present invention is applied to a plane two-joint manipulator. The mechanical model of this controlled object can be expressed by the following equation.

Ａ（θ）＋Ｂ（θ、）＋Ｃ（θ）＝Ｔここで、左辺の各項は慣性項、求心およびコリオリ力の
項、重力項を表す。また、右辺のＴはトルクである。シ
ミュレーションでは、アームの質量M₁＝M₂＝5.0kg、長
さL₁＝L₂＝0.3mとした。理想軌道として、１秒間続く４
種類を採用し、シミュレーションの時間間隔は0.02秒と
した。従って、１個の軌道につき、50パターンが第１図
構成に提示されることとなる。フィードバックの利得Kp
＝1.0、速度Kv＝3.0とした。第１図の第１層の各要素
（細胞）には、角度と角速度とを既述した第４図（ロ）
ガウスフィルタを通して80個に離散化した入力パターン
を与えた。各関節につき、40個（角度:16個、各速度:24
個）である。A (θ) + B (θ,) + C (θ) = T Here, each term on the left side represents an inertial term, a centripetal and Coriolis force term, and a gravity term. Further, T on the right side is torque. In the simulation, the mass of the arm M ₁ = M ₂ = 5.0 kg and the length L ₁ = L ₂ = 0.3 m. As an ideal trajectory, it lasts 1 second 4
Different types were adopted, and the simulation time interval was 0.02 seconds. Therefore, 50 patterns are presented in the configuration of FIG. 1 for one trajectory. Feedback gain Kp
= 1.0 and speed Kv = 3.0. In each element (cell) of the first layer in FIG. 1, the angle and the angular velocity are already described in FIG. 4 (b).
80 discrete patterns were given through a Gaussian filter. For each joint, 40 pieces (angle: 16 pieces, each speed: 24
Individual).

第８図（イ）、（ロ）は、アーム角θ_１、アーム角θ_２
について、単一軌道について学習を行わせた結果を表
す。横軸は学習回数を表し、縦軸は規格化したRMSE（平
均自乗誤差の平方）を表す。尚、第８図（ハ）は60回の
学習後のRMSEを示す。ここで、N⁽²⁾は中間層（第２の
層）の細胞数である。8A and 8B show the arm angle θ ₁ and the arm angle θ _2.
For, the result of learning about a single trajectory is shown. The horizontal axis represents the number of times of learning, and the vertical axis represents the normalized RMSE (square of mean square error). Note that FIG. 8C shows the RMSE after 60 learnings. Here, N ⁽²⁾ is the number of cells in the intermediate layer (second layer).

〔The invention's effect〕

以上説明したように、本発明によれば、入力パターンを
第１層に入力して中間層（第２層）に設けた興奮性荷重
２および抑制性荷重４について自己学習させ、入力に適
応した結線を自動的に行う構成を採用しているため、入
力パターンに対して特異的に反応する要素（細胞）を自
己学習的に形成することができると共に、冗長性、汎化
能力を持たせることができる。更に、最終層である第３
層の荷重７の学習によって誤差修正学習を行わせること
ができる。これら中間層の自己学習および最終層の誤差
修正学習を行う多層神経回路網をフィードバック系に適
用、例えば第７図に示すように適用することにより、フ
ィードバック系から学習に従い、本発明に係わる多層神
経回路網によるフィードフォワード系に移行する。As described above, according to the present invention, the input pattern is input to the first layer, the excitatory load 2 and the inhibitory load 4 provided in the intermediate layer (second layer) are self-learned, and applied to the input. Since the configuration that automatically connects the wires is adopted, it is possible to form elements (cells) that specifically react to the input pattern in a self-learning manner and to have redundancy and generalization ability. You can Furthermore, the third layer, which is the final layer
The error correction learning can be performed by learning the layer weight 7. By applying a multilayer neural network that performs self-learning of the intermediate layer and error correction learning of the final layer to a feedback system, for example, as shown in FIG. 7, the multilayer neural network according to the present invention is learned according to the learning from the feedback system. Transition to feedforward system by network.

[Brief description of drawings]

第１図は本発明の原理ブロック図、第２図は第２層の要
素構成例、第３図は本発明の全体説明図、第４図はガウ
スフィルタ説明図、第５図は中間層の学習動作説明図、
第６図は中間層の応答例、第７図は本発明の応用例説明
図、第８図は本発明の応用例の学習説明図を示す。図中、１、５は非線形処理、２は興奮性荷重、３、８は
和回路、４は抑制性荷重、６はモニタセル、７は荷重を
表す。FIG. 1 is a block diagram of the principle of the present invention, FIG. 2 is an example of the element structure of the second layer, FIG. 3 is an overall explanatory diagram of the present invention, FIG. 4 is a Gaussian filter explanatory diagram, and FIG. Learning operation explanatory diagram,
FIG. 6 is a response example of the intermediate layer, FIG. 7 is an explanatory diagram of an application example of the present invention, and FIG. 8 is a learning explanatory diagram of an application example of the present invention. In the figure, 1 and 5 are nonlinear processing, 2 is excitatory load, 3 and 8 are sum circuits, 4 is inhibitory load, 6 is monitor cell, and 7 is load.

Claims

[Claims]

1. A learning method for learning a multi-layer neural network, comprising first means for nonlinearly processing an input pattern.
A layer, a learnable excitatory load means (2) for computing all patterns V ⁽¹⁾ from the first layer, and a learning for computing patterns from the monitor cell (6) Restrainable load means (4) and the excitable load means (2)
And a second circuit including a summing circuit (3) for calculating the sum of the patterns calculated by the suppressing load means (4) and a means (5) for nonlinearly processing the patterns calculated by the summing circuit (3). A third layer including a load means (7) for calculating a predetermined load on the pattern V ⁽²⁾ from the second layer, and a summing circuit (8) for obtaining the sum of the calculated patterns; Non-linear processing is performed by obtaining the sum of the pattern V ⁽¹⁾ from the first layer and the pattern V ⁽²⁾ from the second layer, and the resulting pattern is the restraining load means of the second layer (intermediate layer). The pattern V ⁽¹⁾ from the first layer and the pattern V ⁽²⁾ from the second layer are provided with the monitor cell (6) for calculating (4 ⁾
Pattern V ⁽² values and updates (down increment-min) of the corresponding excitatory load means in proportion to the product (2), the sum of the pattern V ⁽¹⁾ from the first layer from the second layer ^{) And} the pattern V from the second layer ⁽²⁾
Inhibitory load means (4) applicable in proportion to the integrated value of
A learning method for a multilayer neural network characterized by being configured to perform self-learning for updating (incrementing / decrementing) the value of.

2. The weight means (7) of the third layer is configured to update (increment / decrement) the weight means (7) by a pattern corresponding to the error amount so that the error learning can be performed. A learning method for a multilayer neural network according to item (1).