JPH06149767A

JPH06149767A - Neural network

Info

Publication number: JPH06149767A
Application number: JP4295775A
Authority: JP
Inventors: Isao Horiba; 勇夫堀場; Kazuo Iketani; 和夫池谷; Kenji Suzuki; 賢治鈴木; Koji Ueda; 浩次上田; Muneo Yamada; 宗男山田
Original assignee: Nagoya Electric Works Co Ltd
Current assignee: Nagoya Electric Works Co Ltd
Priority date: 1992-11-05
Filing date: 1992-11-05
Publication date: 1994-05-31

Abstract

PURPOSE:To provide the neural network which precisely outputs an analog output and also outputs its extremal value speedily. CONSTITUTION:The neural network consists of an input layer I consisting of neurons I1-Ii, an intermediate layer H consisting of neurons H1-Hj, and an output layer L consisting of a neuron L1, and a linear function is used as the response characteristics of the output layer L.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、学習機能を有するニュ
ーラルネットワーク（神経回路網）に関するものであ
り、殊に、アナログ出力が精度よく得られると共に、ニ
ューロンの発火特性にみられるような極値１，０を速や
かに収束させ得る階層型のニューラルネットワークに係
るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a neural network (neural network) having a learning function, and in particular, an analog output can be obtained with high accuracy and an extreme value such as that found in the firing characteristics of neurons. The present invention relates to a hierarchical neural network that can quickly converge 1,0.

【０００２】[0002]

【従来の技術】従来のニューラルネットワークについ
て、図５に基づいて説明する。同図は、階層型のニュー
ラルネットワークを示しており、Ｉは入力層、Ｈは中間
層、Ｏは出力層である。入力層Ｉ，中間層Ｈ，出力層Ｏ
は、それぞれＩ₁〜Ｉ _i，Ｈ₁〜Ｈ_j，Ｏ₁〜Ｏ_kから
なるニューロンによって構成される。各々のニューロン
は、非線形的なシグモイド応答関数によって情報を伝達
して出力Ｏ_K1〜Ｏ_Knを得ている。2. Description of the Related Art Conventional neural networks
Then, it demonstrates based on FIG. This figure shows a hierarchical new
It shows a Ral network, where I is the input layer and H is the middle.
Layer, O is an output layer. Input layer I, middle layer H, output layer O
Respectively I₁~ I _i, H₁~ H_j, O₁~ O_kFrom
Is composed of neurons. Each neuron
Conveys information by a nonlinear sigmoid response function
And output O_K1~ O_KnIs getting

【０００３】一方、出力層Ｏの各ニューロンＯ₁〜Ｏ_k
には、教師信号（正解事象）Ｔ₁〜Ｔ_kが入力され、バ
ックプロパゲーション法（逆伝搬学習法）により、矢印
Ａに示すように信号の伝搬とは逆方向に進行する。中間
層Ｈから出力層Ｏへの結合重み係数Ｖ_jkと入力層Ｉから
中間層Ｈへの結合重み係数Ｗ_ijを、教師信号Ｔ₁〜Ｔ _k
が入力される毎に、出力Ｏ_k1〜Ｏ_knと、それに対応する
各教師信号Ｔ₁〜Ｔ_kとの誤差が最小となるように修正
される。On the other hand, each neuron O in the output layer O₁~ O_k
Is a teacher signal (correct event) T₁~ T_kIs entered,
By the back propagation method (back propagation learning method)
As indicated by A, the signal travels in the opposite direction to the signal propagation. Middle
Coupling weight coefficient V from layer H to output layer O_jkAnd from input layer I
Coupling weight coefficient W to the intermediate layer H_ijIs the teacher signal T₁~ T _k
Output O each time is input_k1~ O_knAnd corresponding
Each teacher signal T₁~ T_kCorrected to minimize the error between
To be done.

【０００４】周知のように、このシグモイド応答関数
は、人間の神経応答を模擬的に表現した非線形な応答関
数である。この応答関数を用いたニューラルネットワー
クの動作は、入力層から中間層への結合によって、被認
識対象の特徴抽出が行われ、更に、中間層から出力層へ
の結合によって特徴表現がなされる。As is well known, the sigmoid response function is a non-linear response function that simulates the human nerve response. In the operation of the neural network using this response function, the feature of the object to be recognized is extracted by the connection from the input layer to the intermediate layer, and the feature is expressed by the connection from the intermediate layer to the output layer.

【０００５】このシグモイド応答関数は、図１（ｂ）に
示された特性を有するものであり、次式のように表され
る。ｆ（Ｘ）＝１／｛１＋ｅｘｐ（−２Ｘ／ｕ₀）｝…………（１）（１）式のｕ₀は、シグモイド関数の傾きを決定する正
のパラメータである。以下、本発明のニューラルネット
ワークの理解を容易とする為に、ニューラルネットワー
クの学習機能について説明する。（１）式をＸについて
微分すると、次式が得られる。ｆ′（Ｘ）＝２・ｆ（Ｘ）・｛１−ｆ（Ｘ）｝／ｕ₀………（２）各層の出力Ｉ_in，Ｈ_jn，０_knは、以下のように表され
る。This sigmoid response function has the characteristics shown in FIG. 1 (b) and is expressed by the following equation. f (X) = 1 / {1 + exp (−2X / u ₀ )} (1) u ₀ in the equation (1) is a positive parameter that determines the slope of the sigmoid function. Hereinafter, in order to facilitate understanding of the neural network of the present invention, the learning function of the neural network will be described. Differentiating the expression (1) with respect to X, the following expression is obtained. f ′ (X) = 2 · f (X) · {1-f (X)} / u ₀ (2) The outputs I _in , H _jn , and 0 _kn of each layer are expressed as follows. .

【０００６】[0006]

【数１】 [Equation 1]

【０００７】（但し、θ_jは中間層Ｈのニューロンのオ
フセット，γ_kは出力層Ｏのニューロンのオフセット）(Where θ _j is the offset of neurons in the intermediate layer H and γ _k is the offset of neurons in the output layer O)

【０００８】又、出力層Ｏの出力Ｏ_Knにおける教師信号
Ｔ_Kとの誤差をｅ_K（＝Ｔ_K−Ｏ_Kn）とすると、この平
均２乗誤差Ｅ_Kは、次式のように表される。Ｅ_K＝１／２・（Ｔ_K−Ｏ_Kn）² ………………（５）ニューラルネットワークの平均２乗誤差Ｅ_kが最小とな
る状態が最適学習状態であり、繰り返し学習することに
よって平均２乗誤差Ｅ_kを結合重み係数Ｖ_jkに関して最
小化する。この平均２乗誤差Ｅ_kが最小となる様に結合
重み係数，オフセットを修正することがニューラルネッ
トワークにおける学習行為である。If the error between the output O _Kn of the output layer O and the teacher signal T _K is e _K (= T _K −O _Kn ), this mean square error E _K is expressed by the following equation. It E _K = 1 / _2 (T _K −O _Kn ) ² ……………… (5) The state in which the mean square error E _{k of the} neural network is the minimum is the optimal learning state. The mean squared error E _k is minimized with respect to the combination weighting factor V _jk . It is a learning action in the neural network to correct the coupling weight coefficient and the offset so that the mean square error E _k is minimized.

【０００９】一方、平均２乗誤差の関数は出力層Ｏの出
力Ｏ_Knの関数であり、出力Ｏ_Knは中間層Ｈの出力Ｈ_jnと
の関数であるため、次式の関係が成り立つ。 ∂Ｅ_K／∂Ｖ_jk＝∂Ｅ_K／∂Ｏ_Kn・∂Ｏ_Kn／∂Ｖ_jk ………（６） ∂Ｅ_K／∂Ｏ_Kn＝−（Ｔ_K−Ｏ_Kn） ………………（７）ここで、（４）式を以下のように書き表す。On the other hand, the function of the mean square error is a function of the output O _Kn of the output layer O, and the output O _Kn is a function of the output H _jn of the intermediate layer H. _Therefore , the following relation holds. ∂E _K / ∂V _jk = ∂E _K / ∂O _Kn・ ∂O _Kn / ∂V _jk ……… (6) ∂E _K / ∂O _Kn ＝－ (T _K −O _Kn ) ……………… (7) Here, the equation (4) is written as follows.

【００１０】[0010]

【数２】 [Equation 2]

【００１１】Ｓ_KをＶ_jkについて微分すると、 ∂Ｓ_K／∂Ｖ_jk＝Ｈ_jn ………………（９）と表される。When S _K is differentiated with respect to V _jk, it is expressed as ∂S _K / ∂V _jk = H _jn ............ (9).

【００１２】結合重み係数Ｖ_jkの微少変化に対する出力
Ｏ_Knの変化は、以下のように表される。 ∂Ｏ_Kn／∂Ｖ_jk＝∂Ｏ_Kn／∂Ｓ_K・∂Ｓ_K／∂Ｖ_jk ＝ｆ′（Ｓ_K）・Ｈ_jn ＝２／ｕ₀・Ｏ_Kn・（１─Ｏ_Kn）・Ｈ_jn……（１０）A change in the output O _Kn with respect to a slight change in the coupling weight coefficient V _jk is expressed as follows. ∂O _Kn / ∂V _jk = ∂O _Kn / ∂S _K・ ∂S _K / ∂V _jk = f '(S _K ) ・ H _jn = 2 / u ₀・ O _Kn・ (1─O _Kn ) ・ H _jn …… (10)

【００１３】従って、結合重み係数Ｖ_jkの微小変化に対
する平均２乗誤差Ｅ_Kの変化量は（７）式と（１０）式
から次式のように表される。 ∂Ｅ_K／∂Ｖ_jk＝∂Ｅ_K／∂Ｏ_Kn・∂Ｏ_Kn／∂Ｖ_jk ＝−Ｈ_jn・２／ｕ₀・（Ｔ_K−Ｏ_Kn）・Ｏ_Kn・（１─Ｏ_Kn）＝−δ_K・Ｈ_jn ………………………………（１１）（但し、δ_K＝２／ｕ₀・（Ｔ_K−Ｏ_Kn）・Ｏ_Kn・（１─Ｏ_Kn）とする。）Therefore, the change amount of the mean square error E _K with respect to the minute change of the coupling weight coefficient V _jk is expressed by the following formula from the formulas (7) and (10). ∂E _K / ∂V _jk = ∂E _K / ∂O _Kn・ ∂O _Kn / ∂V _jk = −H _jn・ 2 / u ₀・ (T _K −O _Kn ) ・ O _Kn・ (1─O _Kn ) = -Δ _K · H _jn ………………………… (11) (where δ _K = 2 / u ₀ · (T _K −O _Kn ) · O _Kn · (1 − O _Kn ). And)

【００１４】平均２乗誤差Ｅ_Kが減少する方向に結合重
み係数Ｖ_jkを修正する学習を繰り返すことによって、平
均２乗誤差Ｅ_Kは、極小値に達する。その修正量ΔＶ_jk
は次式のように表される。 ΔＶ_jk＝−η₁（∂Ｅ_K／∂Ｖ_jk ）＝η₁・δ_K・Ｈ_jn ………………………（１２）（但し、η₁は定数）次に、同様な手法によって修正量ΔＷ_ijを求める。ここ
で、（３）式を以下のように書き表す。The mean squared error E _K reaches a minimum value by repeating the learning for modifying the coupling weight coefficient V _jk in the direction in which the mean squared error E _K decreases. The correction amount ΔV _jk
Is expressed by the following equation. ΔV _jk = −η ₁ (∂E _K / ∂V _jk ) = η ₁ · δ _K · H _jn …………………… (12) (where η ₁ is a constant) Next, the same method The correction amount ΔW _ij is calculated by Here, the formula (3) is written as follows.

【００１５】[0015]

【数３】 [Equation 3]

【００１６】結合重み係数Ｗ_ijの微小変化に対する平均
２乗誤差Ｅ_Kの変化量は、以下のようになる。The change amount of the mean square error E _K with respect to the minute change of the coupling weight coefficient W _ij is as follows.

【００１７】[0017]

【数４】 [Equation 4]

【００１８】従って、結合重み係数Ｗ_ijの修正量ΔＷ_ij
は、次式のように表される。[0018] Therefore, the correction amount ΔW _ij of the coupling weight coefficient W _ij
Is expressed by the following equation.

【００１９】[0019]

【数５】 [Equation 5]

【００２０】同様な手法により修正量Δγ_k，Δθ_jを
求めると、次式のように表される。 ∂Ｅ_K／∂γ_k＝∂Ｅ_K／∂Ｏ_Kn・∂Ｏ_Kn／∂γ_k ＝２／ｕ₀・（Ｔ_k−Ｏ_kn）・Ｏ_Kn・（１−Ｏ_Kn）＝−δ_K 従って、修正量Δγ_kは、 Δγ_K＝−η₃・（∂Ｅ_K／∂γ_K）＝η₃・δ_K …………………（１６）（但し、η₃は定数）When the correction amounts Δγ _k and Δθ _j are _obtained by a similar method, they are expressed by the following equation. ∂E _K / ∂γ _k = ∂E _K / ∂O _Kn · ∂O _Kn / ∂γ _k = 2 / u ₀ · (T _k −O _kn ) · O _Kn · (1-O _Kn ) = − δ _K Therefore, the correction amount Δγ _k is Δγ _K = −η ₃ · (∂E _K / ∂γ _K ) = η ₃ · δ _K …………………… (16) (where η ₃ is a constant)

【００２１】[0021]

【数６】 [Equation 6]

【００２２】上記のη₁〜η₄は、修正量の大きさを制
御する学習係数である。The above η ₁ to η ₄ are learning coefficients for controlling the magnitude of the correction amount.

【００２３】[0023]

【発明が解決しようとする課題】上述の如きニューラル
ネットワークは、入力層Ｉ、中間層Ｈ及び出力層Ｏのニ
ューロンの応答関数として、（１）式に示すようなシグ
モイド関数が用いられている。シグモイド関数は指数関
数による応答関数であり、その出力ｆ（Ｘ）を０又は１
とするためには、Ｘ＝−∞，＋∞としない限り出力され
ない。従って、出力ｆ（Ｘ）が０，１に達するまでのそ
の演算回数は極めて多く、収束時間が多く掛かり、容易
に収束しないという欠点がある。即ち、各層のニューロ
ンの応答関数としてシグモイド関数を用いた場合、ニュ
ーロンの発火特性で重要である極値０，１の値が出力さ
れ難いという問題点がある。このことは、極値１，０を
学習させることが出来ないことを意味している。In the neural network as described above, the sigmoid function as shown in the equation (1) is used as the response function of the neurons of the input layer I, the intermediate layer H and the output layer O. The sigmoid function is a response function based on an exponential function, and its output f (X) is 0 or 1
In order to satisfy the above, no output is made unless X = −∞ and + ∞. Therefore, there are disadvantages that the number of times the calculation is performed until the output f (X) reaches 0, 1 is extremely long, the convergence time is long, and the convergence is not easy. That is, when the sigmoid function is used as the response function of the neuron in each layer, there is a problem that it is difficult to output the extreme values 0 and 1 which are important in the firing characteristics of the neuron. This means that the extreme values 1 and 0 cannot be learned.

【００２４】従来のニューラルネットワークにサイン関
数を学習させ、その加重加算された出力結果が図４
（ａ）に示されている。図４（ａ）の実線（イ）が期待
すべきサイン関数曲線を示しており、点線で示した曲線
（ロ）が５０００回の学習に基づく出力結果を示してい
る。この出力結果から明らかなように、５０００回の学
習にもかかわらず、曲線（ロ）で示した出力結果では、
極値１，０に収束していない。しかし、ニューラルネッ
トワークを用いたアナログ判断を必要とする情報処理装
置或いは制御装置では、この極値１，０を情報結果或い
は出力結果として頻繁に必要とする場合があり、従っ
て、極値１，０を速やかに収束させ得るニューラルネッ
トワークの開発が望まれる。The sine function is learned by the conventional neural network, and the weighted addition output result is shown in FIG.
It is shown in (a). The solid line (a) in FIG. 4A shows the expected sine function curve, and the dotted curve (b) shows the output result based on 5000 times of learning. As is clear from this output result, despite the 5000 learnings, in the output result shown by the curve (b),
It has not converged to the extreme values 1 and 0. However, an information processing device or a control device that requires analog judgment using a neural network may frequently need the extreme values 1 and 0 as an information result or an output result. It is desired to develop a neural network that can quickly converge.

【００２５】又、シグモイド関数の応答は非線形である
ので、図４（ａ）の曲線（ロ）から明らかなように
［０，１］の範囲のアナログ出力に対して偏りを持った
応答特性を示しており、これはその出力結果になんらか
の“くせ”を生じることを意味している。これらの問題
点によって、従来のシグモイド応答関数のみを用いたニ
ューラルネットワークは、特にアナログ的判断を必要と
する場合においては汎用性が低下する欠点がある。Further, since the response of the sigmoid function is non-linear, it is clear from the curve (b) of FIG. 4 (a) that a response characteristic having a bias with respect to the analog output in the range of [0, 1] is obtained. It is shown, which means that there is some "habit" in the output result. Due to these problems, the conventional neural network using only the sigmoid response function has a drawback that the versatility is lowered particularly when analog judgment is required.

【００２６】本発明は、上述のような問題点に鑑みなさ
れたものであって、アナログ出力が精度よく得られると
共に、その極値１，０が速やかに出力され得るニューラ
ルネットワークを提供することを目的とするものであ
る。The present invention has been made in view of the above-mentioned problems, and it is an object of the present invention to provide a neural network in which an analog output can be accurately obtained and the extreme values 1 and 0 thereof can be quickly output. It is intended.

【００２７】[0027]

【課題を解決するための手段】上記のような課題を解決
する為に本発明のニューラルネットワークは、学習機能
を有すると共に、そのニューラルネットワークの出力層
の応答関数としてリニア関数を用いたものである。In order to solve the above problems, the neural network of the present invention has a learning function and uses a linear function as the response function of the output layer of the neural network. .

【００２８】[0028]

【作用】本発明のニューラルネットワークでは、入力層
と中間層にはシグモイド応答関数によるニューロンを用
いて学習機能を維持すると共に、出力層のニューロンに
は応答関数としてリニア関数を用いることにより、偏り
のないアナログ出力が得られ、極値である１，０に速や
かに収束し得るようにして、汎用性を高めると共に、出
力層が単純な一次関数であるので学習時の演算速度を短
縮できるものである。In the neural network of the present invention, the learning function is maintained by using the neurons of the sigmoid response function in the input layer and the intermediate layer, and the linear function is used as the response function in the neurons of the output layer. A non-analog output can be obtained and quickly converged to the extreme value of 1,0 to enhance versatility, and the output layer is a simple linear function, which can reduce the calculation speed during learning. is there.

【００２９】[0029]

【実施例】以下、本発明のニューラルネットワークにつ
いて図に基づいて説明する。図１（ａ）は、本発明のニ
ューラルネットワークの一実施例を示しており、同図に
於いて、ニューラルネットワークは、ニューロンＩ₁〜
Ｉ_iからなる入力層Ｉ、ニューロンＨ₁〜Ｈ_jからなる
中間層Ｈ、及びニューロンＬ₁からなる出力層Ｌから構
成される。入力層ＩのニューロンＩ₁〜Ｉ_iには、情報
信号ＩＮ₁〜ＩＮ_nが入力され、出力層Ｌから出力Ｌ_kn
が出力される。又、学習時は、出力層Ｌに教師信号（ｔ
₁〜ｔ₁₃）が入力される。DESCRIPTION OF THE PREFERRED EMBODIMENTS A neural network of the present invention will be described below with reference to the drawings. FIG. 1A shows an embodiment of the neural network of the present invention. In FIG. 1A, the neural network includes neurons I ₁ ...
The input layer I is composed of I _{i, the} intermediate layer H is composed of neurons H _{1 to} H _j , and the output layer L is composed of neurons L ₁ . Information signals IN _{1 to} IN _n are input to the neurons I _{1 to} I _i of the input layer I, and output L _kn from the output layer L.
Is output. Further, at the time of learning, the teacher signal (t
_{1 to} t ₁₃ ) is input.

【００３０】入力層ＩのニューロンＩ₁〜Ｉ_i及び中間
層ＨのニューロンＨ₁〜Ｈ_jは、図１（ｂ）に示した
（１）式のシグモイド応答関数が用いられ、出力層Ｌの
ニューロンＬ₁には、図１（ｃ）に示したｇ（Ｘ）＝
１，０の範囲で線形応答特性を有するリニア関数が用い
られる。ｇ（Ｘ）＝ａＸ＋ｂ ………………（１８）（１８）式のａはリニア関数の傾きを決定するパラメー
タであり、図１（ｃ）に示すようにパラメータａの大小
によってその勾配が変化する。尚、ｂは０．５の値をと
る。The neurons I _{1 to} I _i of the input layer I and the neurons H _{1 to} H _j of the intermediate layer H use the sigmoid response function of the equation (1) shown in FIG. In the neuron L ₁ , g (X) = shown in FIG.
A linear function having a linear response characteristic in the range of 1,0 is used. g (X) = aX + b (18) a in the equation (18) is a parameter that determines the slope of the linear function, and as shown in FIG. 1 (c), the slope depends on the magnitude of the parameter a. Change. In addition, b takes a value of 0.5.

【００３１】又、図２は、本発明のニューラルネットワ
ークの他の実施例を示しており、中間層Ｈが複数の層か
らなるニューラルネットワークである。この実施例に於
いても、出力層ＬのニューロンＬ₁〜Ｌ_Kは、（１８）
式に示した線形応答特性を有するリニア関数が用いら
れ、入力層ＩのニューロンＩ₁〜Ｉ_i及び中間層Ｈのニ
ューロンＨ_j1〜Ｈ_jnには、（１）式に示したシグモイド
応答関数が用いられる。Ｔ₁〜Ｔ_nは教師信号である。FIG. 2 shows another embodiment of the neural network of the present invention, in which the intermediate layer H is a neural network composed of a plurality of layers. Also in this embodiment, the neurons L _{1 to} L _K of the output layer L are (18)
The linear function having the linear response characteristic shown in the equation is used, and the neurons I _{1 to} I _i of the input layer I and the neurons H _{j1 to} H _jn of the intermediate layer H have the sigmoid response function shown in the equation (1). Used. T _{1 to} T _n are teacher signals.

【００３２】以下、図１（ａ）の実施例にサイン関数を
学習させ、その学習機能とその出力結果ついて説明す
る。（Ａ）学習機能ニューラルネットワークに於いて、その入力層Ｉのニュ
ーロンＩ₁〜Ｉ_nに角度θ等の情報信号ＩＮ₁〜ＩＮ_n
が入力され、出力層ＬのニューロンＬ₁には、教師信号
としての各角度θに対応したサイン関数値である教師デ
ータ（ｔ₁〜ｔ ₁₃）が入力される。教師データ（ｔ₁〜
ｔ₁₃）に基づいて、バックプロパゲーション法により、
図３（ａ）に示したＸ印の各点で図４（ａ）の出力結果
（ロ）に対比される５０００回の学習を行う。又、教師
データ（ｔ₁〜ｔ₁₃）は、図３（ｂ）に示すように、角
度θ（0 ，30，50，…… 330，360 ）に対応するサイン
関数値が出力層ＬのニューロンＬ₁に入力される。即
ち、角度θが０度のときは、サイン関数値ｔ₁として
〔０．５０００〕が教師データとして入力され、角度θ
が３０度のときはサイン関数値ｔ₂として〔０．７５０
０〕が入力されることになる。Below, the sine function is applied to the embodiment of FIG.
Learn and explain its learning function and its output result
It (A) Learning function In the neural network, the input layer I
Ron I₁~ I_nInformation signal IN such as angle θ₁~ IN_n
Is input, the neuron L of the output layer L₁The teacher signal
As the sine function value corresponding to each angle θ as
Data (t₁~ T ₁₃) Is entered. Teacher data (t₁~
t₁₃) Based on the backpropagation method,
The output result of FIG. 4A at each point of the X mark shown in FIG.
The learning is performed 5000 times as compared with (b). Also the teacher
Data (t₁~ T₁₃) Is a corner as shown in FIG.
Sign corresponding to the degree θ (0, 30, 50, ... 330, 360)
Neuron L whose function value is output layer L₁Entered in. Immediately
When the angle θ is 0 degree, the sine function value t₁As
[0.5000] is input as teacher data, and the angle θ
Is 30 degrees, the sine function value t₂As [0.750
0] will be input.

【００３３】教師データ（ｔ₁〜ｔ₁₃）の各データは、
矢印８で示すように信号方向とは逆方向に出力層Ｌから
中間層Ｈそして入力層Ｉへと伝搬され、その出力Ｌ_kに
対応する教師データ（ｔ₁〜ｔ₁₃）との誤差が最小とな
るように各結合重み係数Ｗ_ij，Ｕ_jkが自動的に修正され
る。尚、Ｗ_ijは入力層Ｉから中間層Ｈへの結合重み係数
であり、Ｕ_jkは、中間層Ｈから出力層Ｌへの結合重み係
数である。Each data of the teacher data (t _{1 to} t ₁₃ ) is
As indicated by the arrow 8, the signal is propagated in the direction opposite to the signal direction from the output layer L to the intermediate layer H and then to the input layer I, and the error from the teacher data (t _{1 to} t ₁₃ ) corresponding to the output L _k is minimal The respective connection weight coefficients W _ij and U _jk are automatically corrected so that Note that W _ij is a coupling weight coefficient from the input layer I to the intermediate layer H, and U _jk is a coupling weight coefficient from the intermediate layer H to the output layer L.

【００３４】以下、結合重み係数Ｗ_ij、Ｕ_jkと、中間層
Ｈと出力層のニューロンのオフセットθ_j0，γ_k0の修正
量について説明する。各層の出力Ｈ_jn，Ｌ_knは、次式の
ように表される。The correction amounts of the coupling weight coefficients W _ij and U _jk and the offsets θ _j0 and γ _k0 of the neurons in the intermediate layer H and the output layer will be described below. The outputs H _jn and L _kn of each layer are expressed by the following equation.

【００３５】[0035]

【数７】 [Equation 7]

【００３６】又、出力層Ｌの出力Ｌ_Knにおける教師信号
Ｔとの誤差をδ_K0（＝Ｔ−Ｌ_Kn）とすると、この平均２
乗誤差Ｅ_Kは、次式のように表される。Ｅ_K＝１／２・（Ｔ−Ｌ_Kn）² ……………（２１）If the error between the output L _Kn of the output layer L and the teacher signal T is δ _K0 (= T−L _Kn ), this average 2
The multiplication error E _K is expressed by the following equation. E _K = 1/2 ・ (T-L _Kn ) ² ……………… (21)

【００３７】一方、平均２乗誤差Ｅ_kの関数は出力層Ｌ
の出力Ｌ_Kの関数であり、出力Ｌ_Kは中間層Ｈの出力Ｈ
_jnとの関数であるため、次式の関係が成り立つ。 ∂Ｅ_K／∂Ｕ_jk＝∂Ｅ_K／∂Ｌ_Kn・∂Ｌ_Kn／∂Ｕ_jk ………（２２） ∂Ｅ_K／∂Ｌ_Kn＝−（Ｔ−Ｌ_Kn） ………………（２３）ここで、（２０）式を以下のように書き表す。On the other hand, the function of the mean square error E _k is the output layer L
Is a function of the output L _K, the output L _K output H of the intermediate layer H
_Since it is a function with _jn , the following relation holds. ∂E _K / ∂U _jk = ∂E _K / ∂L _Kn · ∂L _Kn / ∂U _jk ……… (22) ∂E _K / ∂L _Kn ＝－ (T−L _Kn ) ……………… (23) Here, the equation (20) is written as follows.

【００３８】[0038]

【数８】 [Equation 8]

【００３９】結合重み係数Ｕ_jkの微少変化に対する出力
Ｌ_Kへの影響は、以下のように表される。 ∂Ｌ_Kn／∂Ｕ_jk＝∂Ｌ_Kn／∂Ｓ_K・∂Ｓ_K／∂Ｕ_jk ＝ｇ′（Ｓ_K）・Ｈ_jn ＝ａ・Ｈ_jn ………………………（２６）The influence on the output L _K with respect to the minute change of the coupling weight coefficient U _jk is expressed as follows. ∂L _Kn / ∂U _jk = ∂L _Kn / ∂S _K・ ∂S _K / ∂U _jk = g '(S _K ) ・ H _jn = a ・ H _jn …………………… (26)

【００４０】従って、結合重み係数Ｕ_jkの微小変化に対
する平均２乗誤差Ｅ_Kの変化量は、（２３）式と（２
６）式から次式のように表される。 ∂Ｅ_K／∂Ｕ_jk＝∂Ｅ_K／∂Ｌ_Kn・∂Ｌ_Kn／∂Ｕ_jk ＝−ａ・（Ｔ−Ｌ_Kn）・Ｈ_jn ＝−δ_k′・Ｈ_jn ………………………（２７）（但し、δ_k′＝ａ・（Ｔ−Ｌ_Kn））平均２乗誤差Ｅ_Kが減少する方向に結合重み係数Ｕ_jkを
修正する学習を繰り返すことによって、平均２乗誤差Ｅ
_Kは極小値に達する。その修正量ΔＵ_jkは次式のように
表される。 ΔＵ_jk＝−η₁′（∂Ｅ_K／∂Ｕ_jk ）＝η₁′・δ_k′・Ｈ_jn …………………（２８）（但し、η₁′は定数）Therefore, the change amount of the mean square error E _K with respect to the minute change of the coupling weight coefficient U _jk is expressed by the equation (23) and (2)
From the equation (6), the following equation is obtained. _{_{_{∂E K / ∂U jk = ∂E K}}} / ∂L Kn · ∂L Kn / ∂U jk = -a · (T-L Kn) · H jn = -δ k '· H jn .................. (27) (where δ _k ′ = a · (T−L _Kn )) Mean square error is repeated by repeating learning to correct the coupling weight coefficient U _jk in the direction of decreasing the mean square error E _K. Error E
_K reaches a local minimum. The correction amount ΔU _jk is expressed by the following equation. ΔU _jk = −η ₁ ′ (∂E _K / ∂U _jk ) = η ₁ ′ · δ _k ′ · H _jn …………………… (28) (However, η ₁ ′ is a constant)

【００４１】次に、同様な手法によって、修正量Δ
Ｗ_ij，Δγ_k0，Δθ_j0を求めると、次式のように表され
る。尚、γ_k0，θ_j0は出力層Ｌ及び中間層Ｈの各ニュー
ロンのオフセットを示している。以下、修正量ΔＷ_ijを
求める。Next, the correction amount Δ
When W _ij , _Δγ _k0 , and Δθ _j0 are obtained, they are expressed by the following equation. Note that γ _k0 and θ _j0 indicate the offsets of the neurons in the output layer L and the intermediate layer H. Hereinafter, the correction amount ΔW _ij will be obtained.

【００４２】[0042]

【数９】 [Equation 9]

【００４３】次に、修正量Δγ_k0を求める。 ∂Ｅ_K／∂γ_K0＝∂Ｅ_K／∂Ｌ_Kn・∂Ｌ_Kn／∂γ_K0 ＝−ａ（Ｔ−Ｌ_Kn）＝−δ_K′ Δγ_K0＝−η₃′（∂Ｅ_K／∂γ_K0）＝η₃′・δ_K′ …………………（３０）（但し、η₃′は定数である。）Next, the correction amount Δγ _k0 is obtained. ∂E _K / ∂γ _K0 = ∂E _K / ∂L _Kn・ ∂L _Kn / ∂γ _K0 = −a (T−L _Kn ) = − δ _K ′ Δγ _K0 = −η ₃ ′ (∂E _K / ∂ γ _K0 ) = η ₃ ′ · δ _K ′ ……………… (30) (However, η ₃ ′ is a constant.)

【００４４】[0044]

【数１０】 [Equation 10]

【００４５】但し、η₁′〜η₄′は、修正量の大きさ
を制御する学習係数である。上述の結果から明らかなよ
うに、ニューラルネットワークの出力層Ｌの応答関数と
してリニア関数を用いたとしても学習機能には全く支障
がないことを示している。However, η ₁ ′ to η ₄ ′ are learning coefficients that control the magnitude of the correction amount. As is clear from the above results, even if a linear function is used as the response function of the output layer L of the neural network, there is no problem in the learning function.

【００４６】（Ｂ）出力結果図４（ｂ）は、本発明のニューラルネットワークの出力
結果を示しており、実線で示したサイン関数曲線（イ）
は期待すべきサイン関数曲線を示し、点線で示した曲線
（ロ）が実施例の出力結果に基づくサイン関数曲線を示
している。入力層Ｉの各ニューロンＩ₁〜Ｉ_iには、角
度θ等の情報信号ＩＮ₁〜ＩＮ_nが入力され、出力層Ｌ
のニューロンＬ_{1 kn}からは、上記の学習によって得られ
た結合重み係数Ｗ_ij，Ｕ_jkに基づいて加重加算された出
力Ｌ_knが、図４（ｂ）の曲線（ロ）に示すようなサイン
関数曲線として出力される。(B) Output Result FIG. 4 (b) shows the output result of the neural network of the present invention, which is the sine function curve (a) shown by the solid line.
Indicates a sine function curve to be expected, and the curve (b) indicated by a dotted line indicates a sine function curve based on the output result of the embodiment. Information signals IN _{1 to} IN _n such as the angle θ are input to the neurons I _{1 to} I _i of the input layer I, and the output layer L
From the neuron L _{1 kn of} FIG. 4, the output L _kn weighted and added based on the connection weighting factors W _ij and U _jk obtained by the above learning is the sine sign as shown in the curve (b) of FIG. It is output as a function curve.

【００４７】図４（ｂ）から明らかなように、結合重み
係数Ｗ_ij，Ｕ_jkに基づく出力結果は、実線（イ）のサイ
ン関数曲線と略一致したものとなっている。又、極値
１，０においても、図４（ａ）の点線（ロ）に比べてよ
く一致している。更に、［０，１］の範囲のアナログ出
力も図４（ａ）にみられたような偏りは発生していな
い。As is apparent from FIG. 4 (b), the output result based on the coupling weight coefficients W _ij and U _jk is substantially in agreement with the sine function curve of the solid line (a). Further, even at the extreme values 1 and 0, the agreement is better than that of the dotted line (b) in FIG. Further, the analog output in the range of [0, 1] does not have the bias as seen in FIG.

【００４８】この出力結果から、本発明のニューラルネ
ットワークは、出力層Ｌにリニア応答関数を用いること
で、極値１，０に収束することが実証された。又、シグ
モイド応答関数にみられるアナログ出力の偏りも解消さ
れ、従来のシグモイド関数のみを用いるニューラルネッ
トワークに比べて極値１，０に対する収束性が極めて高
く、種々の情報処理装置や制御装置への汎用性が高いこ
とが実証された。From this output result, it was proved that the neural network of the present invention converges to the extreme values 1, 0 by using the linear response function in the output layer L. In addition, the bias of analog output seen in the sigmoid response function is eliminated, and the convergence to extreme values 1 and 0 is extremely high as compared with the conventional neural network using only the sigmoid function, and it can be applied to various information processing devices and control devices. It was proved to be highly versatile.

【００４９】無論、本発明のニューラルネットワーク
は、コンピータによる演算処理によって等価的に実施し
てもよく、又、ニューロチップで実施してもよいことは
明らかであり、後者の場合、極めて小型で高性能な情報
処理装置や制御装置を提供できるものである。Of course, it is obvious that the neural network of the present invention may be equivalently implemented by arithmetic processing by a computer, or may be implemented by a neurochip. In the latter case, it is extremely small and high. It is possible to provide a high-performance information processing device and control device.

【００５０】[0050]

【発明の効果】本発明のニューラルネットワークによれ
ば、ニューラルネットワークの出力層のニューロンの応
答関数にリニア関数を用いることによって、ニューラル
ネットワークの出力として、極値１，０を無理なく出力
し得ると共に、［０，１］の範囲のアナログ出力も偏り
がなく高精度に出力し得る利点を有している。According to the neural network of the present invention, by using a linear function as the response function of the neuron in the output layer of the neural network, the extreme values 1 and 0 can be reasonably output as the output of the neural network. , [0, 1] range also has the advantage that it can be output with high accuracy without bias.

【００５１】又、出力層にリニア関数を用いることで、
収束性が高い為に演算回数を低減することができる利点
を有し、計算効率が図られる為に高速で処理結果を得る
ことができる効果を有する。更に、本発明のニューラル
ネットワークによれば、出力の極値１，０を、情報処理
装置や制御装置に利用することができ、ニューラルネッ
トワークの汎用性を高めることができる利点を有してい
る。By using a linear function in the output layer,
Since the convergence is high, the number of calculations can be reduced, and the calculation efficiency is improved, so that the processing result can be obtained at high speed. Further, according to the neural network of the present invention, the extreme values 1 and 0 of the output can be used for the information processing device and the control device, and there is an advantage that the versatility of the neural network can be improved.

【００５２】無論、学習過程においても各層間の結合重
み係数を修正する処理時間も短縮できる利点を有すると
共に、極値１，０を比較的高速に学習させることができ
るので学習回数も低減できる極めて効果的なものであ
る。Of course, in the learning process as well, there is an advantage that the processing time for correcting the coupling weight coefficient between each layer can be shortened, and since the extreme values 1 and 0 can be learned relatively quickly, the number of times of learning can be extremely reduced. It is effective.

[Brief description of drawings]

【図１】（ａ）は本発明のニューラルネットワークの一
実施例を示す図、（ｂ）はシグモイド関数を示す図、
（ｃ）はリニア関数を示す図である。1A is a diagram showing an embodiment of a neural network of the present invention, FIG. 1B is a diagram showing a sigmoid function,
(C) is a figure which shows a linear function.

【図２】本発明のニューラルネットワークの他の実施例
を示す図である。FIG. 2 is a diagram showing another embodiment of the neural network of the present invention.

【図３】（ａ）は学習機能を説明する為の図、（ｂ）は
サイン関数の教師データを表に示した図である。3A is a diagram for explaining a learning function, and FIG. 3B is a diagram showing teacher data of a sine function in a table.

【図４】（ａ）は従来のニューラルネットワークの出力
結果を示す図、（ｂ）は本発明のニューラルネットワー
クの出力結果を示す図である。4A is a diagram showing an output result of a conventional neural network, and FIG. 4B is a diagram showing an output result of the neural network of the present invention.

【図５】従来のニューラルネットワークの一例を示す図
である。FIG. 5 is a diagram showing an example of a conventional neural network.

[Explanation of symbols]

Ｉ入力層Ｈ中間層Ｌ出力層Ｉ₁〜Ｉ_i 入力層ＩのニューロンＨ₁〜Ｈ_j 中間層ＨのニューロンＬ₁〜Ｌ_k 出力層ＬのニューロンI input layer H intermediate layer L output layer I _{1 to} I _i input layer I neuron H _{1 to} H _j intermediate layer H neuron L _{1 to} L _k output layer L neuron

───────────────────────────────────────────────────── フロントページの続き (72)発明者上田浩次愛知県海部郡美和町大字篠田字面徳29−１名古屋電機工業株式会社美和工場内 (72)発明者山田宗男愛知県海部郡美和町大字篠田字面徳29−１名古屋電機工業株式会社美和工場内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Koji Ueda 29-1, Mita, Miwa-cho, Kaifu-gun, Aichi Prefecture Mita Plant, Nagoya Electric Industry Co., Ltd. (72) Muneo Yamada, Shinoda, Miwa-cho, Kaifu-gun, Aichi Prefecture 29-1 Nagoya Electric Industry Co., Ltd. Miwa Plant

Claims

[Claims]

1. A neural network, wherein the neural network has a learning function, and a linear function is used as a response function of its output layer.