JPH0696046A

JPH0696046A - Learning processor of neural network

Info

Publication number: JPH0696046A
Application number: JP4244467A
Authority: JP
Inventors: Masato Kobayashi; 正人小林; Takashi Yamaguchi; 高司山口
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1992-09-14
Filing date: 1992-09-14
Publication date: 1994-04-08

Abstract

PURPOSE: To provide a learning processor for a neural network that increases the efficiency of learning as a whole and makes learning fast and accurate. CONSTITUTION: The structure of the neural network is changed between pattern conversion and learning. During pattern conversion, a sigmoid function is applied to the units of the output layer, as in the conventional method. During learning, by contrast, no nonlinear conversion is applied to the units of the output layer 6; instead, an inverse sigmoid function 8 is applied to the teacher signal 9, and the difference between the two is taken as the error signal 11. A weight calculation circuit 13 learns the synaptic weights 5 with a least-squares algorithm, and a weight calculation circuit 14 learns the synaptic weights 3 with the error backpropagation method.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a high-speed learning processing device for hierarchical neural networks used for memory, inference, judgment, prediction, pattern recognition, control, optimization, and the like.

[0002]

[Prior Art] A neural network is a general term for networks that realize signal processing and information processing by connecting, in layers, a large number of multi-input, single-output artificial neural elements (units) that imitate the function of biological neurons.

[0003] FIG. 2 shows an example configuration of a three-layer hierarchical neural network comprising an input layer 2 with i units, an intermediate layer 4 with j units, and an output layer 7 with k units. A single intermediate layer 4 is shown here, but there may be several. In FIG. 2, signals are transmitted as follows.

[0004] Let x_i be the network input signal 1, w_ji the synaptic weight 3 between the input layer and the intermediate layer, and θ_j the offset. The internal state signal u_j of each unit of the intermediate layer 4 is then expressed by the following equation.

[0005]

[Equation 1]

[0006] To simplify the notation, the above equation is rewritten as follows.

[0007]

[Equation 2]

[0008] The output h_j of unit j of the intermediate layer 4 is then expressed by the following equation.

[0009]

[Equation 3] h_j = f(u_j) ... (Equation 3)

Here, a sigmoid function such as the following is commonly used for f(·).

[0010]

[Equation 4]

[0011] Similarly, let v_kj be the synaptic weight 5 between the intermediate layer and the output layer, and φ_k the offset. The internal state signal s_k of each unit of the output layer 7 is expressed by the following equation.

[0012]

[Equation 5]

[0013] To simplify the notation, the above equation is rewritten as follows.

[0014]

[Equation 6]

[0015] The output y_k of unit k of the output layer 7 is then expressed by the following equation.

[0016]

[Equation 7] y_k = f(s_k) ... (Equation 7)

As described above, in the hierarchical neural network each unit processes the input data 1 supplied to the input layer 2 and passes the result to the next layer, so that output data 10 corresponding to the input data is obtained from the output layer 7.
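The signal flow of Equations 1 to 7 can be sketched in Python as follows. This is a minimal sketch, not the patent's implementation: the equation images are not reproduced in this text, so the weighted-sum form u_j = Σ_i w_ji x_i + θ_j (and likewise for s_k) and the logistic form f(u) = 1 / (1 + exp(-u)) are assumptions based on the surrounding definitions, and the names `sigmoid` and `forward` are illustrative only.

```python
import math


def sigmoid(u):
    """Logistic sigmoid; assumed form of f(.) in Equation 4."""
    return 1.0 / (1.0 + math.exp(-u))


def forward(x, w, theta, v, phi):
    """Forward pass of a three-layer network.

    x        : input signals x_i
    w[j][i]  : input-to-intermediate weights w_ji
    theta[j] : intermediate-layer offsets
    v[k][j]  : intermediate-to-output weights v_kj
    phi[k]   : output-layer offsets
    Returns (h, s, y): intermediate outputs, output internal states, outputs.
    """
    # Internal state and output of each intermediate unit (assumed Equations 1 and 3).
    u = [sum(w[j][i] * x[i] for i in range(len(x))) + theta[j] for j in range(len(w))]
    h = [sigmoid(uj) for uj in u]
    # Internal state and output of each output unit (assumed Equations 5 and 7).
    s = [sum(v[k][j] * h[j] for j in range(len(h))) + phi[k] for k in range(len(v))]
    y = [sigmoid(sk) for sk in s]
    return h, s, y
```

With all weights and offsets set to zero, every unit outputs 0.5 regardless of the input, which makes a convenient sanity check.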

[0017] The error backpropagation method has long been widely used to learn the synaptic weights of hierarchical neural networks (Rumelhart, Hinton, and Williams: "Learning Internal Representations by Error Back Propagation", in Parallel Distributed Processing, Vol. 1, pp. 318-362, MIT Press (1986)).

[0018] FIG. 3 shows an example configuration in which the error backpropagation method is applied to the hierarchical neural network of FIG. 2. The error backpropagation method is described below with reference to FIG. 3.

[0019] When input data 1 of a pattern P is supplied to the input layer 2, let the teacher signal y_mk 9 be the output data desired from unit k of the output layer 7. If the error 12 between the teacher signal 9 and the actual output data 10 is defined as

[0020]

[Equation 8] e_k = y_mk - y_k ... (Equation 8)

then the squared-error evaluation function E_P for a single pattern P is expressed by the following equation.

[0021]

[Equation 9]

[0022] First, the weight calculation circuit 15 is designed. By the steepest descent method, the change in the synaptic weight v_kj is as follows.

[0023]

[Equation 10]

[0024] Next, the weight calculation circuit 16 is designed. By the steepest descent method, the change in the synaptic weight w_ji is as follows.

[0025]

[Equation 11]
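For reference, the steepest descent corrections of the weight calculation circuits 15 and 16 take the following textbook form. This is a hedged reconstruction rather than a copy of Equations 9 to 11, whose images are not reproduced in this text: it assumes E_P = ½ Σ_k e_k², the forward relations y_k = f(s_k) and h_j = f(u_j), and a learning rate η (the symbol appears later in the text; its placement here is an assumption).

```latex
\begin{aligned}
E_P &= \tfrac{1}{2}\sum_k e_k^2, \qquad e_k = y_{mk} - y_k,\\
\Delta v_{kj} &= -\eta\,\frac{\partial E_P}{\partial v_{kj}} = \eta\, e_k\, f'(s_k)\, h_j,\\
\Delta w_{ji} &= -\eta\,\frac{\partial E_P}{\partial w_{ji}} = \eta\, f'(u_j)\, x_i \sum_k e_k\, f'(s_k)\, v_{kj}.
\end{aligned}
```

If f is the logistic sigmoid, f'(s) = f(s)(1 - f(s)), so both corrections can be computed from the unit outputs alone.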

[0026] When there are four or more layers, the synaptic weights between all layers can be determined in the same way, by repeatedly converting the error into the error at the preceding layer.

[0027] To speed up the error backpropagation method of Equations 9 and 10, a learning method that takes the previous correction into account is known. Let Δv(n-1) and Δw(n-1) be the corrections at the previous step (n-1), and Δv(n) and Δw(n) the corrections at the current step (n). The following equations are then obtained.

[0028]

[Equation 12]

[0029]

[Equation 13]

[0030] Adding the previous correction gives the changes in the synaptic weights a kind of inertia, which has the effect of ignoring fine irregularities in the error surface.
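The inertia effect can be illustrated with a short Python sketch. It assumes the common momentum form Δ(n) = -η ∂E/∂(weight) + α Δ(n-1); Equations 12 and 13 are not reproduced in this text, so this form, and the use of α as the momentum coefficient, are assumptions. The function name `momentum_step` is hypothetical.

```python
def momentum_step(grad, prev_delta, eta=0.5, alpha=0.9):
    """Return the current correction for a list of weights.

    grad       : gradient of the error with respect to each weight
    prev_delta : correction applied at the previous step (n-1)
    eta        : learning rate (assumed role of the parameter eta)
    alpha      : momentum coefficient (assumed role of the parameter alpha)
    """
    return [-eta * g + alpha * d for g, d in zip(grad, prev_delta)]


# Two steps with the same gradient: the second correction is larger
# because the previous correction is carried over.
d1 = momentum_step([1.0, -2.0], [0.0, 0.0])
d2 = momentum_step([1.0, -2.0], d1)
```

When the gradient keeps the same sign, successive corrections grow; when it oscillates, they partly cancel, which is the smoothing effect described above.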

[0031] The learning described above minimizes the error E_P for the input/output pair of a single pattern P and is called sequential correction learning. To minimize the following error E_T over the input/output pairs of all patterns, on the other hand, the synaptic weight corrections obtained by sequential correction learning must be accumulated, and the weights corrected with the corrections summed over all patterns. This is called batch correction learning.

[0032]

[Equation 14]
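The difference between the two modes comes down to when the corrections are applied. The sketch below shows only the accumulation step of batch correction learning; it assumes that the per-pattern corrections have already been computed and that E_T is the sum of E_P over all patterns (Equation 14 is not reproduced here). The name `batch_correction` is hypothetical.

```python
def batch_correction(per_pattern_deltas):
    """Sum the corrections obtained for each pattern, weight by weight.

    per_pattern_deltas: one list of weight corrections per pattern,
    all lists having the same length.
    """
    return [sum(column) for column in zip(*per_pattern_deltas)]


# Two patterns, two weights: the second weight's corrections cancel out.
total = batch_correction([[1.0, 2.0], [3.0, -2.0]])
```

In sequential correction learning each inner list would be applied to the weights immediately; in batch correction learning only `total` is applied, once per pass over the patterns.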

[0033] A technique for speeding up learning with the error backpropagation method is described in Japanese Unexamined Patent Publication No. 3-252887. It performs learning by the error backpropagation method using the difference between the internal signal of the output (s_k: Equation 6) and a teacher internal signal obtained by applying the inverse sigmoid transformation to the teacher signal (y_mk).

[0034]

[Problems to Be Solved by the Invention] However, both the conventional error backpropagation method and the method described in Japanese Unexamined Patent Publication No. 3-252887 determine the synaptic weights w_ji between the input layer and the intermediate layer and the synaptic weights v_kj between the intermediate layer and the output layer by the steepest descent method. As a result, the number of learning iterations needed to make the sum of squared errors E_P sufficiently small and finish learning becomes enormous, and efficient learning is not possible.

[0035] In more detail, when the conventional error backpropagation procedure updates the synaptic weights w_ji between the input layer and the intermediate layer, it learns as though the synaptic weights v_kj between the intermediate layer and the output layer were correct, as Equation 11 shows. Likewise, when it updates v_kj, it needs the intermediate-layer outputs h_j, as Equation 10 shows, and learns as though the weights w_ji were correct. In other words, although the conventional method learns the updates of w_ji and v_kj independently of each other, it learns both sets of weights by steepest descent, which determines them from the gradient of the error surface and is generally said to converge slowly. The learning time therefore becomes enormous.

[0036] In view of these problems, an object of the present invention is to provide a learning processing device for a neural network that raises the efficiency of learning as a whole and achieves fast, accurate learning by making the learning of the synaptic weights between the intermediate layer and the output layer of a hierarchical neural network faster and more accurate.

[0037]

[Means for Solving the Problems] To achieve the above object, the present invention provides a learning processing device for a neural network comprising: a signal processing unit having an input layer, an intermediate layer, and an output layer, each composed of a plurality of units that contain a sigmoid-like nonlinear function and perform signal processing corresponding to an artificial neural element; and a learning processing unit that, based on the error signal between the teacher signal and the output value of the output layer for an input signal pattern supplied to the input layer, repeatedly calculates the coupling strength coefficients between the units in order from the output layer side toward the input layer side. The device is provided with a first learning processing unit that learns the coupling strength coefficients between the intermediate layer and the output layer, and a second learning processing unit, different from the first, that learns the remaining coupling strength coefficients.

[0038] In the present invention, the first learning processing unit and the second learning processing unit determine the error signal using the value obtained by passing the teacher signal through the inverse of the sigmoid-like nonlinear function, and calculate the coupling strength coefficients from it.

[0039] Further, in the present invention, the first learning processing unit uses a least-squares method that calculates the coupling strength coefficients toward the minimum of the error surface obtained from the error signal, and the second learning processing unit calculates the coupling strength coefficients in the steepest descent direction of that error surface.

[0040]

[Operation] The learning method of the present invention makes learning of a hierarchical neural network faster and more accurate by making the learning of the synaptic weights between the intermediate layer and the output layer faster and more accurate. Specifically, the synaptic weights v_kj are learned quickly and accurately with a least-squares algorithm, and the other synaptic weights are learned with the conventional error backpropagation method. The learning of the synaptic weights between the intermediate layer and the output layer can thereby converge to the minimum without becoming trapped in a local minimum.

[0041]

[Embodiment] An embodiment of the present invention is described in detail below with reference to the drawings.

[0042] FIG. 1 shows the structure of a three-layer neural network during learning with the learning method of the present invention. In the conventional error backpropagation method, the structure of the neural network is the same during pattern conversion (FIG. 2) and during learning (FIG. 3). Here, pattern conversion refers to the period after learning has finished, in which the synaptic weights are fixed, patterns entering the input layer are converted, and the output layer delivers the solution of the neural network. Learning refers to the period in which the synaptic weights are being learned according to some evaluation function.

[0043] In the learning method of the present invention, by contrast, the structure of the neural network differs between pattern conversion (FIG. 2) and learning (FIG. 1). During pattern conversion, the sigmoid function of Equation 4 is applied to the units of the output layer 7, as in the error backpropagation method. During learning, no nonlinear conversion is applied to the units of the output layer 6; instead, the teacher signal 9 is converted by the inverse sigmoid function 8 of the following equation.

[0044]

[Equation 15]
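A minimal Python sketch of the inverse sigmoid transformation follows. It assumes that f is the logistic sigmoid, in which case the inverse is the logit f⁻¹(y) = ln(y / (1 - y)); Equation 15 itself is not reproduced in this text. The clamping parameter `eps` is an addition of this sketch, not something the source describes: it keeps the result finite when a teacher signal is exactly 0 or 1.

```python
import math


def sigmoid(s):
    """Logistic sigmoid; assumed form of f(.)."""
    return 1.0 / (1.0 + math.exp(-s))


def inverse_sigmoid(y, eps=1e-6):
    """Logit of y, i.e. the s for which sigmoid(s) == y.

    y is clamped to [eps, 1 - eps] (an assumption of this sketch)
    so that teacher signals of exactly 0 or 1 stay finite.
    """
    y = min(max(y, eps), 1.0 - eps)
    return math.log(y / (1.0 - y))


# The transformed teacher signal is compared with the linear output s_k.
target_internal = inverse_sigmoid(0.9)
```

Because the output layer is linear during learning, comparing its output with `target_internal` turns the output-layer fit into a linear least-squares problem.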

[0045] The output s_kP of unit k of the output layer 6 during learning is then given by the following equation.

[0046]

[Equation 16]

[0047] where

[0048]

[Equation 17]

[0049]

[Equation 18]

[0050] Here, the subscript P denotes a signal for pattern P. The inverse sigmoid transformation of the teacher signal y_mkP 9 for pattern P is written f⁻¹(y_mkP) 8.

[0051] The error signal e_kP 11 for pattern P is defined as

[0052]

[Equation 19] e_kP = f⁻¹(y_mkP) - s_kP ... (Equation 19)

and the squared-error evaluation function for each output-layer unit over all patterns is defined by the following equation.

[0053]

[Equation 20]

[0054] First, the weight calculation circuit 13 is designed. Consider minimizing the error E_kA with respect to the synaptic weights V_k (Equation 17). The effect on E_kA of a small change in V_k can be decomposed as follows, and because a minimum exists, the expression is set to zero.

[0055]

[Equation 21]

[0056] Hence, if the first term on the right-hand side of the above equation is nonsingular, then, writing V_k as V_kP, it suffices to determine V_kP by

[0057]

[Equation 22]

[0058] Rewriting this in recursive form gives the following equations.

[0059]

[Equation 23]

[0060]

[Equation 24]

[0061] where

[0062]

[Equation 25]

[0063]

[Equation 26]

[0064] For example, setting λ_1P = 1 and λ_2P = 1 turns the above equations into the recursive least-squares algorithm.

[0065] Alternatively, setting

[0066]

[Equation 27]

[0067]

[Equation 28]

[0068] keeps the trace of Γ_P constant.
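The recursive update of the output-layer weights can be sketched in Python as follows. Equations 23 to 28 are not reproduced in this text, so this is a standard recursive least-squares step with two weighting factors, not the patent's exact formulas: `lam1` and `lam2` stand in for λ_1P and λ_2P (where they enter the update is an assumption), `G` stands in for the gain matrix Γ, and `h` is the vector of intermediate-layer outputs. The name `rls_update` and the scaled-identity initial gain are likewise assumptions.

```python
def rls_update(V, G, h, target, lam1=1.0, lam2=1.0):
    """One recursive least-squares step for a single linear output unit.

    V      : current weight vector of the unit
    G      : current gain matrix (symmetric, list of lists)
    h      : intermediate-layer outputs for the current pattern
    target : inverse-sigmoid-transformed teacher signal
    With lam1 == lam2 == 1 this is ordinary recursive least squares.
    """
    n = len(V)
    Gh = [sum(G[r][c] * h[c] for c in range(n)) for r in range(n)]
    hGh = sum(h[r] * Gh[r] for r in range(n))
    err = target - sum(V[r] * h[r] for r in range(n))  # error before the update
    gain = [g / (lam1 + lam2 * hGh) for g in Gh]
    V_new = [V[r] + gain[r] * err for r in range(n)]
    # Rank-one downdate of the gain matrix (uses the symmetry of G).
    G_new = [[(G[r][c] - Gh[r] * Gh[c] / (lam1 / lam2 + hGh)) / lam1
              for c in range(n)] for r in range(n)]
    return V_new, G_new


# One step from zero weights with an assumed initial gain of 10 * identity.
V, G = [0.0, 0.0], [[10.0, 0.0], [0.0, 10.0]]
V, G = rls_update(V, G, h=[1.0, 0.0], target=2.0)
```

A larger initial gain moves the weights further toward the target in the first step; here the first weight reaches 20/11 of the way from 0 toward 2 after a single pattern.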

[0069] Next, the weight calculation circuit 14 is designed. The synaptic weights w_ji 3 between the input layer 2 and the intermediate layer 4 are learned by the conventional error backpropagation method applied to the error signal 11 (Equation 19). First, the squared error for pattern P is defined.

[0070]

[Equation 29]

[0071] By the steepest descent method, the change in the synaptic weight w_ji is determined as follows.

[0072]

[Equation 30]

[0073] Learning can also be accelerated by taking the previous correction into account in the above equation, as in Equation 13.
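The steepest descent correction for the input-to-intermediate weights can be sketched as below. Equations 29 and 30 are not reproduced in this text; the sketch assumes E_P = ½ Σ_k e_kP², a logistic sigmoid in the intermediate layer (so f'(u_j) = h_j (1 - h_j)), and a linear output layer during learning, which removes the output-side derivative from the usual backpropagation formula. The name `hidden_weight_deltas` is hypothetical.

```python
def hidden_weight_deltas(x, h, v, e, eta=0.01):
    """Corrections for the weights w_ji between input and intermediate layer.

    x       : input signals x_i
    h       : intermediate-layer outputs h_j
    v[k][j] : intermediate-to-output weights v_kj
    e       : error signals e_kP measured on the linear output
    eta     : learning rate
    Returns deltas[j][i] = eta * (sum_k e[k] * v[k][j]) * h[j] * (1 - h[j]) * x[i].
    """
    deltas = []
    for j in range(len(h)):
        back = sum(e[k] * v[k][j] for k in range(len(e)))  # error propagated to unit j
        slope = h[j] * (1.0 - h[j])                        # derivative of the logistic sigmoid
        deltas.append([eta * back * slope * xi for xi in x])
    return deltas


# One intermediate unit, two inputs, one output.
deltas = hidden_weight_deltas(x=[1.0, 0.0], h=[0.5], v=[[2.0]], e=[1.0], eta=1.0)
```

An input of zero produces a zero correction for its weight, as in ordinary backpropagation.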

[0074] FIG. 4 shows the execution procedure of one embodiment of the present invention. First, learning of the synaptic weights (step 401) with the configuration of FIG. 1 is repeated until the absolute error given by the following equation falls to or below a set value E_R (step 402).

[0075]

[Equation 31]

[0076] Next, the synaptic weights are fixed and pattern conversion is performed with the configuration of FIG. 2 (step 403). In this case, the learning of FIG. 1 can be implemented with read-only memory (ROM) and random-access memory (RAM), and the pattern conversion of FIG. 2 with ROM.
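The procedure of FIG. 4 reduces to a loop with a stopping threshold, sketched below. The callables `learn_step` and `abs_error` are placeholders for the learning of FIG. 1 and the absolute error of Equation 31 (whose exact form is not reproduced in this text), and `max_steps` is a safety limit added in this sketch that the source does not mention.

```python
def learn_until(learn_step, abs_error, e_r, max_steps=100000):
    """Repeat learn_step() until abs_error() <= e_r (steps 401 and 402).

    Returns the number of learning steps taken.
    """
    steps = 0
    while abs_error() > e_r and steps < max_steps:
        learn_step()
        steps += 1
    return steps


# Toy stand-in for a network: each learning step halves the error.
state = {"err": 1.0}


def halve_error():
    state["err"] *= 0.5


steps = learn_until(halve_error, lambda: state["err"], e_r=0.1)
# After the loop the weights would be frozen and used for pattern conversion (step 403).
```

In the toy run the error goes 1.0, 0.5, 0.25, 0.125, 0.0625, so the loop stops after four steps.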

[0077] FIG. 5 shows the execution procedure of an embodiment of the present invention. First, as described for FIG. 4, learning of the synaptic weights (step 501) with the configuration of FIG. 1 is repeated until the absolute error of Equation 31 falls to or below a set value E_R (step 502).

[0078] Next, pattern conversion is performed with the configuration of FIG. 2 (step 503). The absolute error of Equation 31 is monitored at every pattern conversion (step 504). If it is at or below a set value E_S, the synaptic weights are kept fixed and pattern conversion is repeated (step 506); if it is above the set value, the network is switched to the configuration of FIG. 1 and the synaptic weights are learned (step 505). In this case, the network is implemented with read-only memory (ROM) and random-access memory (RAM).
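The monitoring loop of FIG. 5 can be sketched as follows. The callables `convert`, `abs_error`, and `relearn` are placeholders for the pattern conversion of FIG. 2, the absolute error of Equation 31, and the learning of FIG. 1; their names and signatures are assumptions of this sketch.

```python
def convert_with_monitoring(patterns, convert, abs_error, relearn, e_s):
    """Convert each pattern and relearn whenever the monitored error exceeds e_s.

    Returns the converted outputs and the number of times relearning ran.
    """
    outputs, relearn_count = [], 0
    for x in patterns:
        outputs.append(convert(x))   # step 503: pattern conversion
        if abs_error() > e_s:        # step 504: monitor the absolute error
            relearn()                # step 505: switch to FIG. 1 and learn
            relearn_count += 1
        # otherwise the weights stay fixed (step 506)
    return outputs, relearn_count


# Toy stand-in: the error drifts upward by 0.04 per conversion, relearning resets it.
state = {"err": 0.0}


def toy_convert(x):
    state["err"] += 0.04
    return x


def toy_relearn():
    state["err"] = 0.0


outputs, relearn_count = convert_with_monitoring(
    range(6), toy_convert, lambda: state["err"], toy_relearn, e_s=0.1)
```

In the toy run the error passes the threshold on every third conversion, so relearning is triggered twice over six patterns.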

[0079] To confirm the effectiveness of the learning processing device of the present invention, learning results for the exclusive OR (XOR) problem are shown below. Various kinds of pattern recognition can be considered as applications of this problem.

[0080] Table 1 below shows the input/output relationship of the exclusive OR (XOR).

[0081]

[Table 1]

[0082] For a neural network to acquire this relationship, the intermediate layer must be trained. Learning was performed with a three-layer neural network having 2 input-layer units, 2 intermediate-layer units, and 1 output-layer unit. All synaptic weights were initialized with random numbers in the range ±1, and all offsets with random numbers in the range 0 to +1.
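The experimental setup can be sketched in Python as follows. Table 1 is not reproduced in this text, so the 0/1 coding of the XOR patterns is an assumption, as are the uniform distribution, the fixed seed, and the names `XOR_PATTERNS` and `init_network`; only the layer sizes and the initialization ranges come from the description.

```python
import random

# XOR input/output pairs; the 0/1 coding is assumed.
XOR_PATTERNS = [
    ((0.0, 0.0), 0.0),
    ((0.0, 1.0), 1.0),
    ((1.0, 0.0), 1.0),
    ((1.0, 1.0), 0.0),
]


def init_network(n_in=2, n_hid=2, n_out=1, seed=0):
    """Random initial weights in [-1, 1] and offsets in [0, 1]."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    w = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_hid)]
    theta = [rng.uniform(0.0, 1.0) for _ in range(n_hid)]
    v = [[rng.uniform(-1.0, 1.0) for _ in range(n_hid)] for _ in range(n_out)]
    phi = [rng.uniform(0.0, 1.0) for _ in range(n_out)]
    return w, theta, v, phi


w, theta, v, phi = init_network()
```

With teacher signals of exactly 0 and 1, the inverse sigmoid is unbounded, so a practical run would need targets slightly inside the interval (0, 1) or a clamped inverse; the source does not say which coding was used.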

[0083] FIG. 6 shows the learning results of the present invention as a function of the learning parameters γ₀ and η, and FIG. 7 shows the learning results of the conventional error backpropagation method as a function of the learning parameters η and α. The vertical axis is the number of learning steps required for the absolute error defined by Equation 31 to fall to 0.1 or below.

[0084] In the learning method of the present invention, σ in Equation 27 is set to 1, and w_ji is learned with Equation 30.

[0085] The conventional error backpropagation method, on the other hand, learns with Equations 12 and 13.

[0086] The learning method of the present invention finishes learning in as few as 27 steps (for γ₀ = 10.0, η = 0.01), whereas the error backpropagation method needs at least 153 steps (for η = 1.0, α = 0.9). The figures also show that with the error backpropagation method the number of learning steps decreases linearly with α, whereas with the learning method of the present invention it decreases exponentially with γ₀. Overall, the learning method of the present invention is 5 to 10 times faster than the error backpropagation method.

[0087] The embodiment above learns the synaptic weights between the intermediate layer and the output layer with a least-squares algorithm, but the present invention is not limited to least-squares algorithms. For example, the conjugate gradient method, which is capable of fast learning, or various other optimization algorithms may be used.

[0088] The embodiment above was described for a three-layer neural network, but the present invention does not limit the number of layers.

[0089]

[Effects of the Invention] The learning processing device and learning method of the present invention can learn the synaptic weights between the intermediate layer and the output layer quickly and accurately, and can therefore improve the overall learning speed and learning accuracy.

[Brief description of drawings]

FIG. 1 is an explanatory diagram of the neural network structure during learning in an embodiment of the present invention.

FIG. 2 is an explanatory diagram of the neural network structure during pattern conversion.

FIG. 3 is an explanatory diagram of the neural network structure during learning by the conventional error backpropagation method.

FIG. 4 is a flowchart of the execution procedure of an embodiment of the present invention.

FIG. 5 is a flowchart of the execution procedure of an embodiment of the present invention.

FIG. 6 is an explanatory diagram of execution results according to an embodiment of the present invention.

FIG. 7 is an explanatory diagram of learning results by the conventional error backpropagation method.

[Explanation of symbols]

1 ... input data, 2 ... input layer, 3 ... synaptic weight, 4 ... intermediate layer, 5 ... synaptic weight, 6 ... output layer, 8 ... inverse sigmoid function, 9 ... teacher signal, 11 ... error signal, 13 ... weight calculation circuit, 14 ... weight calculation circuit.

─────────────────────────────────────────────────────

[Procedure Amendment]

[Submission date] March 22, 1993

[Procedure Amendment 1]

[Document to be amended] Specification

[Item to be amended] Brief description of drawings

[Method of amendment] Change

[Content of amendment]

[Brief description of drawings]

FIG. 1 is an explanatory diagram of the neural network structure during learning in an embodiment of the present invention.

FIG. 2 is an explanatory diagram of the neural network structure during pattern conversion.

FIG. 3 is an explanatory diagram of the neural network structure during learning by the conventional error backpropagation method.

FIG. 4 is a flowchart of the execution procedure of an embodiment of the present invention.

FIG. 5 is a flowchart of the execution procedure of an embodiment of the present invention.

[Explanation of symbols] 1 ... input data, 2 ... input layer, 3 ... synaptic weight, 4 ... intermediate layer, 5 ... synaptic weight, 6 ... output layer, 8 ... inverse sigmoid function, 9 ... teacher signal, 11 ... error signal, 13 ... weight calculation circuit, 14 ... weight calculation circuit.

[Procedure Amendment 2]

[Document to be amended] Drawings

[Item to be amended] All drawings

[Method of amendment] Change

[Content of amendment]

[FIG. 1]

[FIG. 2]

[FIG. 4]

[FIG. 3]

[FIG. 5]

[FIG. 6]

[FIG. 7]

Claims

[Claims]

1. A learning processing device for a neural network, comprising: a signal processing unit including an input layer, an intermediate layer, and an output layer, each composed of a plurality of units that contain a sigmoid-like nonlinear function and perform signal processing corresponding to an artificial neural element; and a learning processing unit that, based on an error signal between a teacher signal and the output value of the output layer for an input signal pattern supplied to the input layer, repeatedly calculates the coupling strength coefficients between the units in order from the output layer side toward the input layer side, wherein the device is provided with a first learning processing unit that learns the coupling strength coefficients between the intermediate layer and the output layer, and a second learning processing unit that learns the remaining coupling strength coefficients.