JP3367295B2

JP3367295B2 - Multi-valued neural network learning method

Info

Publication number: JP3367295B2
Application number: JP22701495A
Authority: JP
Inventors: 陽太郎八塚
Original assignee: ケイディーディーアイ株式会社
Priority date: 1995-08-14
Filing date: 1995-08-14
Publication date: 2003-01-14
Anticipated expiration: 2015-08-14
Also published as: JPH0954763A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、大規模論理回路、
パターン認識、連想メモリ、データ変換及び画像処理な
どの分野に適用可能なニューラルネットワークにおい
て、ローカルミニマム捕獲状態を検出することにより高
速かつ安定に所望の出力信号を得ることができる多値ニ
ューラルネットワークの学習方式に関するものである。
本発明は特に特願平７−７７１６８（平成７年３月９日
出願）の改良に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a large scale logic circuit,
Learning of a multi-valued neural network that can obtain a desired output signal at high speed and stably by detecting a local minimum capture state in a neural network applicable to fields such as pattern recognition, associative memory, data conversion and image processing. It is related to the method.
The present invention particularly relates to an improvement of Japanese Patent Application No. 7-77168 (filed on Mar. 9, 1995).

【０００２】[0002]

【従来の技術】ニューラルネットワークの１つとして、
多層（階層）ニューラルネットワークがあるが、このニ
ューラルネットワークの教師信号を用いた学習方法とし
て、バック・プロパゲーション・アルゴリズムが幅広く
使用されている。本アルゴリズムを用いた学習過程で
は、重み係数を初期設定した後、予め用意された教師信
号Ｔ（教師信号エレメント、Ｔ_１、Ｔ_２、...、Ｔ_Ｍ）
から、入力層に入力された学習用入力信号に対する出力
層からの出力ユニット信号を減算器を介して差し引き誤
差信号を求め、各層の出力ユニット信号と誤差信号とを
基に誤差信号の電力を最小にするように各層間の重み係
数の更新を行い学習を行うものである。この重み係数適
応制御からなる学習をすべての学習用入力信号に対して
実行し、収束するまで繰り返す。2. Description of the Related Art As one of neural networks,
Although there is a multilayer neural network, a back propagation algorithm is widely used as a learning method using a teacher signal of this neural network. In the learning process using this algorithm, after the weighting coefficient is initialized, the teacher signal T (teacher signal element, T ₁ , T ₂ , ..., T _M ) prepared in advance is set.
Then, the output unit signal from the output layer for the learning input signal input to the input layer is subtracted through a subtractor to obtain the error signal, and the error signal power is minimized based on the output unit signal and error signal of each layer. As described above, the weighting coefficient between layers is updated and learning is performed. The learning consisting of the weighting coefficient adaptive control is executed for all learning input signals and repeated until convergence.

【０００３】この学習過程に於て誤差電力が最小となる
と完全に収束し、学習用入力信号に対する、出力ユニッ
ト信号を多値識別スレショルドにより多値化した多値出
力ユニット信号が多値教師信号と一致する。しかしなが
ら、誤差電力が局部極小（ローカルミニマム）となる所
に一旦落ち込むと、それが非常に安定な場合はそれ以降
は学習が進まず最小とならないこと、また、最小の状態
になるためには学習回数が著しく増加することや、収束
特性に重み係数の初期値依存性があるなどの問題があ
る。In the learning process, when the error power is minimized, the error power completely converges, and the multi-valued output unit signal obtained by multi-valued the output unit signal with respect to the learning input signal by the multi-valued discrimination threshold becomes the multi-valued teacher signal. Match. However, once the error power drops to the local minimum (local minimum), if it is very stable, the learning does not proceed thereafter and does not become the minimum. There are problems that the number of times increases remarkably and that the convergence characteristic depends on the initial value of the weighting coefficient.

【０００４】一方、０および１の２値教師信号Ｔに対し
て学習速度を改善するために"Parallel Distributed Pr
ocessing" D.E. Rumelhart, MIT Press.では、図１に示
す従来方式の学習を用いた多層ニューラルネットワーク
において、それぞれのエレメントに対して０．１及び
０. ９の新たな教師信号Ｔ’を設定し、動作モード制御
器９からの制御信号により夫々初期設定を行った後、学
習を開始する。端子３からの教師信号Ｔ’から、端子２
からの学習用入力信号に対する多層ニューラルネットワ
ーク１の出力ユニット信号を減算器４を介して差し引き
誤差信号を求め、これを重み係数制御器５に入力し、バ
ックプロパゲーションアルゴリズムにより重み係数更新
を行い、多層ニューラルネットワーク１に再度設定する
処理を繰り返し学習する。２値識別スレショルドを用い
て２値化する２値スレショルド回路６を介して出力ユニ
ット信号から２値出力ユニット信号を得、また、２値識
別スレショルドを用いて２値化する２値スレショルド回
路７を介して教師信号Ｔ’から２値教師信号Ｔを得、一
致検出器８にてこれらが完全に一致する状態を検出する
ことにより、多層ニューラルネットワーク１が収束した
として動作モード制御器９を介して学習が終了する。On the other hand, in order to improve the learning speed for the binary teacher signals T of 0 and 1, "Parallel Distributed Pr" is used.
In ocessing "DE Rumelhart, MIT Press., a new teacher signal T'of 0.1 and 0.9 is set for each element in the multilayer neural network using the conventional learning shown in FIG. Learning is started after initialization is performed by the control signal from the operation mode controller 9. From the teacher signal T ′ from the terminal 3, the terminal 2 is started.
The output unit signal of the multilayer neural network 1 with respect to the learning input signal is obtained through the subtractor 4 to obtain a subtraction error signal, which is input to the weighting factor controller 5, and the weighting factor is updated by the back propagation algorithm, The process of setting again in the multilayer neural network 1 is repeatedly learned. A binary output unit signal is obtained from the output unit signal through the binary threshold circuit 6 that performs binarization using the binary discrimination threshold, and a binary threshold circuit 7 that performs binarization using the binary discrimination threshold is provided. A binary teacher signal T is obtained from the teacher signal T'through the detection signal, and the coincidence detector 8 detects a state in which they are completely coincident with each other. Learning ends.

【０００５】このように教師信号を０．１及び０. ９に
設定することにより、０及び１の２値教師信号の場合に
比較して収束するための学習回数が削減されることが述
べられている。これは、シグモイド関数において、入力
が０あるいは１に近づくと、勾配が非常に小さくなるこ
とにより、重み係数の更新速度が小さくなることの理由
による。このため、教師信号を０．１及び０. ９に設定
し、勾配を大きくすることにより、学習速度を改善して
いる。しかしながら、同様に安定したローカルミニマム
に落ち込むと学習が進展しない。By setting the teacher signals to 0.1 and 0.9 in this way, it is stated that the number of learnings for convergence is reduced as compared with the case of binary teacher signals of 0 and 1. ing. This is because, in the sigmoid function, when the input approaches 0 or 1, the gradient becomes very small and the update rate of the weighting coefficient becomes small. Therefore, the learning speed is improved by setting the teacher signal to 0.1 and 0.9 and increasing the gradient. However, learning will not progress if it falls into a stable local minimum as well.

【０００６】これらの従来の学習方式では、上記の説明
のごとくローカルミニマムの状態に落ち込みやすく、ま
た、なかなか抜け出せないことから、完全な収束状態が
簡単には到達できないなどの欠点がある。特に、入力ユ
ニット数の多い３層あるいは多層ニューラルネットワー
クにおいて、多値教師信号に対して確実に収束させる設
計手法は明らかになっておらず、初期値依存性の為に重
み係数の初期値を変更したり、隠れユニット（中間ユニ
ット）数を増やすなどの試行錯誤を行っている。また、
安定なローカルミニマムへの捕獲状態を簡単に検出する
手段はこれまでない。[0006] These conventional learning methods have the drawback that the state of the local minimum can be easily dropped as described above, and that the state of complete convergence cannot be easily reached because it is difficult to get out. In particular, in a three-layer or multi-layer neural network with a large number of input units, a design method for surely converging the multivalued teacher signal has not been clarified, and the initial value of the weighting factor is changed due to the initial value dependence. And increasing the number of hidden units (intermediate units) by trial and error. Also,
Until now, there has been no easy way to detect the capture state in the stable local minimum.

【０００７】[0007]

【発明が解決しようとする課題】上記の説明のごとく、
多値教師信号を用いた多層ニューラルネットワークの従
来の学習処理において、多層ニューラルネットワークが
学習用入力信号に対応した所望の多値出力ユニット信号
を送出するよう重み係数を更新する際に、所望の多値出
力ユニット信号を送出する収束した状態になるまでの学
習回数即ち学習繰り返し回数が非常に多くなる場合や、
学習しても収束しない状態即ち非常に安定したローカル
ミニマムの状態に落ち込み所望の多値出力ユニット信号
が送出されないなど重み係数の初期値依存性があるなど
の欠点を有している。特に、入力層のユニット数が多
く、出力ユニット数が少なく、而も分布型の表現形式を
持った教師信号の場合には、学習による収束が非常に困
難となり、所望の多値出力ユニット信号を送出する多層
ニューラルネットワークを自由に設計する方法が確立さ
れていない。また、収束しやすくするために中間ユニッ
ト数を大幅に増やす方法があるが、汎化能力が劣化する
と共に当然のことながらそれぞれの演算量が増し、非常
に大きいハードウェア能力あるいは演算能力が要求され
る。DISCLOSURE OF THE INVENTION Problems to be Solved by the Invention As described above,
In the conventional learning process of the multi-layer neural network using the multi-valued teaching signal, when the weighting coefficient is updated so that the multi-layer neural network sends out the desired multi-valued output unit signal corresponding to the learning input signal, When the number of learning, that is, the number of learning iterations until the converged state in which the value output unit signal is transmitted becomes very large,
However, it has a drawback that the weighting coefficient has an initial value dependency such that it does not converge even if it is learned, that is, it falls into a very stable local minimum state and a desired multi-value output unit signal is not transmitted. In particular, in the case of a teacher signal having a large number of units in the input layer and a small number of output units and a distributed expression format, convergence due to learning becomes very difficult, and a desired multi-valued output unit signal can be obtained. A method for freely designing a multi-layer neural network for transmission has not been established. In addition, there is a method to significantly increase the number of intermediate units to make it easier to converge, but as the generalization capacity deteriorates, the amount of calculation of each increases naturally, and a very large hardware capacity or calculation capacity is required. It

【０００８】本発明の目的は、上記の問題を解決し、多
値ニューラルネットワークの従来の学習方式などに比べ
て、非常に少ない学習回数で収束し、１０倍から１００
倍の高速度で安定に学習を完了させることが出来、学習
用入力信号に対して所望の多値出力ユニット信号を容易
に得ることができるローカルミニマム捕獲状態検出機能
を有した多値ニューラルネットワークの新たな学習方式
を提供することにある。An object of the present invention is to solve the above-mentioned problems, to converge with an extremely small number of learning times as compared with the conventional learning method of a multivalued neural network, and to increase from 10 times to 100 times.
A multi-valued neural network with a local minimum capture state detection function that can stably complete learning at double speed and can easily obtain a desired multi-valued output unit signal for the learning input signal. To provide a new learning method.

【０００９】[0009]

【課題を解決するための手段】上記の問題点を解決する
ために、多値教師信号を用いて学習させる多値ニューラ
ルネットワークにおいて、該多値教師信号から正解の出
力ユニットの出力ユニット信号を差し引いて得た誤差信
号の絶対値が第１のスレショルド以下であると、該誤差
信号と逆極性を持ち該多値教師信号から離れるに従い振
幅が小さくなる重み係数更新誤差信号を、該誤差信号の
絶対値が該第１のスレショルドより大きいと、該誤差信
号以下の振幅を持った同極性の重み係数更新誤差信号
を、不正解の出力ユニットでは該誤差信号以下の振幅を
持った同極性の重み係数更新誤差信号を生成し、該重み
係数更新誤差信号を用いて重み係数を更新し、不正解の
出力ユニットにおける該誤差信号の絶対値の最小値を少
なくとも用いてローカルミニマム捕獲状態を検出し、該
重み係数更新誤差信号の振幅あるいは更に該第１のスレ
ショルドを調整し重み係数を更新することを特徴とした
ニューラルネットワーク学習方式を構成する。In order to solve the above problems, in a multi-valued neural network for learning using a multi-valued teacher signal, the output unit signal of the correct output unit is subtracted from the multi-valued teacher signal. If the absolute value of the obtained error signal is less than or equal to the first threshold, a weighting coefficient update error signal that has a polarity opposite to that of the error signal and whose amplitude decreases as the distance from the multilevel teacher signal increases If the value is larger than the first threshold, the weighting coefficient update error signal having the same amplitude as the error signal or less and the same weighting coefficient having the amplitude less than the error signal is output in the incorrect output unit. An update error signal is generated, the weight coefficient is updated by using the weight coefficient update error signal, and the weight coefficient is updated by using at least the minimum absolute value of the error signal in the incorrect output unit. Detecting a Ruminimamu captivity, constituting the neural network learning method which is characterized by updating the amplitude or further adjusted weighting coefficient threshold of the first heavy viewed coefficient updating error signal.

【００１０】[0010]

【作用】本発明の多値ニューラルネットワークの学習方
式においては、多値教師信号から大きくずれた不正解の
多値出力ユニット信号を与えている出力ユニットに対し
ては、多値教師信号から差し引いて得られる誤差信号以
下の振幅を持った重み係数更新誤差信号を用い、一方、
誤差信号が多値教師信号に非常に近く第１のスレショル
ド以下で正解の多値出力ユニット信号を与えている出力
ユニットにおいては、誤差信号とは逆極性を持った該教
師信号から離れるに従い振幅が小さくなる重み係数更新
誤差信号を用い、更に誤差信号が第１のスレショルドよ
り大きく多値教師信号から比較的近い正解の多値出力ユ
ニット信号を与えている出力ユニットにおいては、誤差
信号と同一極性の若干小さめの振幅を持った重み係数更
新誤差信号を用いて、それぞれ重み係数の修正を行う。
この際、不正解の出力ユニットの内、誤差信号の絶対値
の最小値である出力ユニット最小誤差を検出し、与えら
れたスレショルドとの比較結果から安定したローカルミ
ニマム捕獲状態を簡単に検出でき、検出結果に従って重
み係数更新誤差信号の振幅あるいは更に第１のスレショ
ルドを制御し、学習させることにより非常に安定したロ
ーカルミニマムからも抜け出し易く、少ない学習回数で
最適状態に素早く近付ける働きを持っている。このよう
な極性と振幅を制御された重み係数更新誤差信号を用い
ることにより、重み係数の初期値依存性もなく学習回数
を著しく短縮することが出来、学習回数や中間ユニット
（隠れユニット）数あるいは中間層数を大幅に削減でき
る。In the learning method of the multi-valued neural network of the present invention, an output unit giving an incorrect multi-valued output unit signal which is largely deviated from the multi-valued teacher signal is subtracted from the multi-valued teacher signal. Using the weighting factor update error signal with amplitude less than the obtained error signal, while
In the output unit in which the error signal is very close to the multi-valued teacher signal and gives the correct multi-valued output unit signal below the first threshold, the amplitude changes as the distance from the teacher signal having the opposite polarity to the error signal increases. In the output unit which uses the smaller weight coefficient update error signal and further provides the correct multi-valued output unit signal whose error signal is larger than the first threshold and is relatively close to the multi-valued teacher signal, the same polarity as the error signal is used. The weighting coefficient update error signal having a slightly smaller amplitude is used to modify the weighting coefficient.
At this time, among the incorrect output units, the output unit minimum error which is the minimum value of the absolute value of the error signal is detected, and the stable local minimum capture state can be easily detected from the comparison result with the given threshold, By controlling and learning the amplitude of the weighting coefficient update error signal or further the first threshold according to the detection result, it is possible to easily get out of a very stable local minimum and to quickly approach the optimum state with a small number of learnings. By using the weight coefficient update error signal whose polarity and amplitude are controlled as described above, it is possible to significantly reduce the number of learning without the dependency of the weight coefficient on the initial value, and the number of learning or the number of intermediate units (hidden units) or The number of intermediate layers can be significantly reduced.

【００１１】上記説明のごとく本発明の学習方式は、非
常に安定したローカルミニマム捕獲状態を簡単に検出で
きることから多値ニューラルネットワークのローカルミ
ニマムから抜け出させ易く、従来方式に比べて重み係数
の初期値依存性もなく非常に高速に且つ確実に収束さ
せ、所望の多値出力ユニット信号を容易に得ることがで
きることから、大規模な多値ニューラルネットワークを
自由に設計できる。また、実時間で学習をやり直す必要
のある学習機能を持った論理システムや、非常に多くの
入力信号エレメント数を持った入力信号やユニット数の
多い多層ニューラルネットワークを用いた多値論理シス
テムを実現できる。更に、従来方式では安定で高速に収
束させることができず所望の出力信号を得ることが困難
であったパターン認識や画像処理など、更には、データ
変換なども容易に設計し実現することが可能となる。As described above, since the learning method of the present invention can easily detect a very stable local minimum capture state, it is easy to get out of the local minimum of a multi-valued neural network, and the initial value of the weighting coefficient is higher than that of the conventional method. Since a desired multi-valued output unit signal can be easily obtained without any dependence and very quickly and surely, a large-scale multi-valued neural network can be freely designed. In addition, we have realized a logic system with a learning function that requires re-learning in real time, and a multi-valued logic system using an input signal with an extremely large number of input signal elements and a multilayer neural network with a large number of units. it can. Furthermore, it is possible to easily design and realize pattern recognition, image processing, etc., which were difficult to obtain a desired output signal because the conventional method could not converge stably and at high speed, and also data conversion etc. Becomes

【００１２】[0012]

【発明の実施の形態】以下に本発明の学習方式を用いた
多値ニューラルネットワークの実施例をあげ、その構成
及びその動作について、２値教師信号を用いた場合につ
いて詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of a multi-valued neural network using the learning method of the present invention will be described below, and its configuration and operation will be described in detail when a binary teacher signal is used.

【００１３】本発明の学習方式を用いた多値ニューラル
ネットワークの学習過程の１実施例を図２に示す。端子
２から入力信号が入力層の入力ユニットに入力され、出
力層の出力ユニットから出力ユニット信号を送出する多
層ニューラルネットワーク１、２値教師信号と出力ユニ
ット信号と出力正誤判定器１１からの正誤判定信号とを
入力とし、重み係数更新誤差信号を出力する重み係数更
新誤差信号生成器１０、出力ユニット信号を２値識別ス
レショルドを用いて２値化し２値出力ユニット信号を得
る２値スレショルド回路６、入力された重み係数更新誤
差信号を用いて多層ニューラルネットワーク１の重み係
数を更新し設定する重み係数制御器５、２値教師信号と
２値出力ユニット信号とを入力とし、２値教師信号と２
値出力ユニット信号とを比較し出力ユニット内の正解、
不正解を検出し正誤判定信号を送出する出力正誤判定器
１１、出力正誤判定器１１からの正誤判定信号と２値教
師信号と出力ユニット信号とを入力し、不正解の出力ユ
ニットにおいて出力ユニット信号と２値教師信号との差
の絶対値の最小値、即ち不正解出力ユニット最小誤差を
検出する不正解出力ユニット最小誤差検出器１３と、正
解の出力ユニットにおいて出力ユニット信号と２値スレ
ショルド回路６の２値識別スレショルドとの差の絶対値
の最小値、即ち正解出力ユニット最小余裕を検出する正
解出力ユニット最小余裕検出器１４と、不正解出力ユニ
ット最小誤差及び正解出力ユニット最小余裕とを用いて
ローカルミニマム捕獲状態を検出するローカルミニマム
捕獲状態検出器１５、多層ニューラルネットワーク１と
重み係数制御器５と出力正誤判定器１１、不正解出力ユ
ニット最小誤差検出器１３、正解出力ユニット最小余裕
検出器１４及びローカルミニマム捕獲状態検出器１５の
各初期設定、更に学習の開始及び終了を制御する動作モ
ード制御器１２とから構成される。FIG. 2 shows an embodiment of the learning process of a multi-valued neural network using the learning method of the present invention. The input signal from the terminal 2 is input to the input unit of the input layer, and the output unit signal is sent from the output unit of the output layer. Multilayer neural network 1, binary teacher signal, output unit signal, and output A weighting factor update error signal generator 10 which receives a signal as an input and outputs a weighting factor update error signal, a binary threshold circuit 6 which binarizes the output unit signal using a binary discrimination threshold to obtain a binary output unit signal, The weight coefficient controller 5 for updating and setting the weight coefficient of the multilayer neural network 1 by using the input weight coefficient update error signal, receives the binary teacher signal and the binary output unit signal, and inputs the binary teacher signal and the binary teacher signal.
Correct value in the output unit by comparing with the value output unit signal,
An output correct / wrong judgment device 11 for detecting an incorrect answer and sending a correct / wrong judgment signal, a correct / wrong judgment signal from the output correct / wrong judgment device 11, a binary teacher signal and an output unit signal are inputted, and an output unit signal is outputted at an incorrect output unit. And the binary teacher signal, the minimum absolute value of the difference, that is, the incorrect output unit minimum error detector 13 for detecting the incorrect output unit minimum error, and the output unit signal and the binary threshold circuit 6 in the correct output unit. Using the minimum absolute value of the difference from the binary discrimination threshold of, that is, the correct output unit minimum margin detector 14 for detecting the correct output unit minimum margin, and the incorrect output unit minimum error and the correct output unit minimum margin. Local minimum capture state detector 15, multi-layer neural network 1 and weighting factor controller 5 for detecting the local minimum capture state Each of the output correctness determination unit 11, the incorrect output unit minimum error detector 13, the correct output unit minimum margin detector 14, and the local minimum capture state detector 15 is initialized, and the operation mode controller controls the start and end of learning. 12 and.

【００１４】次に学習過程における動作を説明する。多
層ニューラルネットワーク１は端子２からの学習用入力
信号と端子３からの２値教師信号とを用いて学習を行
う。重み係数更新誤差信号生成器１０において、出力正
誤判定器１１からの正誤判定信号により不正解の２値出
力ユニット信号が検出された出力ユニットに対しては、
２値教師信号からその出力ユニット信号を差し引いて得
られた誤差信号を式（３）のごとく振幅をＤｍ３程小さ
くして重み係数更新誤差信号として出力させ、一方、正
解の出力ユニット信号を与えている出力ユニットにおい
ては、その誤差信号の絶対値が与えられた第１のスレシ
ョルド値ｄｍ以下であると、逆極性で該教師信号から離
れるに従い振幅が定数Ｄｍ１（但し≧０）より小さくな
る重み係数更新誤差信号を生成し出力させ、誤差信号の
絶対値が前記スレショルドより大きいと、同一極性でＤ
ｍ２程小さい振幅を持った重み係数更新誤差信号を生成
し出力させる。Next, the operation in the learning process will be described. The multilayer neural network 1 performs learning using a learning input signal from the terminal 2 and a binary teacher signal from the terminal 3. In the weighting coefficient update error signal generator 10, for the output unit in which the incorrect binary output unit signal is detected by the correctness determination signal from the output correctness determination unit 11,
The error signal obtained by subtracting the output unit signal from the binary teacher signal is output as the weighting coefficient update error signal with the amplitude reduced by about Dm3 as shown in equation (3), while the correct output unit signal is given. In the output unit, when the absolute value of the error signal is less than or equal to the given first threshold value dm, the weighting coefficient whose amplitude becomes smaller than the constant Dm1 (however ≧ 0) as the distance from the teacher signal is increased with the opposite polarity. An updated error signal is generated and output, and if the absolute value of the error signal is larger than the threshold, D with the same polarity
A weight coefficient update error signal having an amplitude as small as m2 is generated and output.

【００１５】ここで、２値あるいは多値教師信号を用い
た場合の重み係数更新誤差信号の１生成方法を式を用い
て以下に示す。Here, a method for generating one of the weighting coefficient update error signals in the case of using a binary or multi-valued teacher signal will be shown below by using an equation.

【００１６】出力ユニットの多値出力ユニット信号が正解の場合、｜Ｔｍ−Ｙｍ｜≦ｄｍの時、Ｅｍ＝Ｔｍ−Ｙｍ−Ｄｍ１・ｓｇｎ（Ｔｍ−Ｙｍ）（１）但しＤｍ１＞ｄｍ｜Ｔｍ−Ｙｍ｜＞ｄｍの時、Ｅｍ＝Ｔｍ−Ｙｍ−Ｄｍ２・ｓｇｎ（Ｔｍ−Ｙｍ）（２）但しｄｍ≧Ｄｍ２出力ユニットの多値出力ユニット信号が不正解の場合、Ｅｍ＝Ｔｍ−Ｙｍ−Ｄｍ３・ｓｇｎ（Ｔｍ−Ｙｍ）（３）ここで、ｓｇｎ（ｘ）＝１：ｘ≧０＝−１：ｘ＜０、（４）[0016] If the multi-valued output unit signal of the output unit is correct, When | Tm−Ym | ≦ dm, Em = Tm-Ym-Dm1 · sgn (Tm-Ym) (1) However, Dm1> dm When | Tm-Ym |> dm, Em = Tm-Ym-Dm2 · sgn (Tm-Ym) (2) However, dm ≧ Dm2 If the multilevel output unit signal of the output unit is incorrect, Em = Tm-Ym-Dm3 · sgn (Tm-Ym) (3) here, sgn (x) = 1: x ≧ 0 = -1, x <0, (4)

【００１７】ｍは出力ユニット番号（１≦ｍ≦Ｍ）、Ｅ
ｍは出力ユニットｍの重み係数更新誤差信号、Ｔｍは出
力ユニットｍの多値教師信号（２値の場合は、０または
１）、Ｙｍは出力ユニットｍの出力ユニット信号、ｄｍ
は出力ユニットｍに対する正の定数、Ｄｍ１は出力ユニ
ットｍに対する０または正の定数、Ｄｍ２は出力ユニッ
トｍに対する正の定数、Ｄｍ３は出力ユニットｍに対す
る正の定数である。M is an output unit number (1≤m≤M), E
m is a weight coefficient update error signal of the output unit m, Tm is a multivalued teacher signal (0 or 1 in the case of binary) of the output unit m, Ym is an output unit signal of the output unit m, dm
Is a positive constant for the output unit m, Dm1 is 0 or a positive constant for the output unit m, Dm2 is a positive constant for the output unit m, and Dm3 is a positive constant for the output unit m.

【００１８】ここで、ｄｍを第１のスレショルドとし、
第１のスレショルドを収束領域と定めれば、連続量の教
師信号を扱うニューラルネットワークにも同様に適用出
来る。重み係数制御器５では、この重み係数更新誤差信
号を用いてバックプロパゲーションアルゴリズムにより
重み係数修正を行い、誤差信号の電力が最小となるよう
学習を繰り返す。Where dm is the first threshold,
If the first threshold is defined as the convergence region, it can be similarly applied to a neural network that handles a continuous amount of teacher signals. The weighting factor controller 5 uses the weighting factor update error signal to correct the weighting factor by the back propagation algorithm, and repeats learning so that the power of the error signal is minimized.

【００１９】また、学習を進める過程で、ローカルミニ
マム捕獲状態検出器１５において、不正解出力ユニット
最小誤差検出器１３からの出力である不正解出力ユニッ
ト最小誤差が与えられた第２のスレショルドより大きく
なると、非常に安定したローカルミニマムと見做しロー
カルミニマム捕獲状態として出力端子１６に検出出力を
送出する。この時、検出出力に従って重み係数更新誤差
信号生成器１０内のＤｍ１及びＤｍ３を、あるいは更に
Ｄｍ２やｄｍを大きくすれば、非常に安定したローカル
ミニマムからも容易に逃れられる。Further, in the course of learning, in the local minimum capture state detector 15, the incorrect output unit minimum error output from the incorrect output unit minimum error detector 13 becomes larger than the given second threshold. Then, it is regarded as a very stable local minimum, and the detection output is sent to the output terminal 16 as the local minimum capture state. At this time, if Dm1 and Dm3 in the weighting factor update error signal generator 10 or Dm2 and dm are further increased according to the detection output, it is possible to easily escape from a very stable local minimum.

【００２０】また、ローカルミニマム捕獲状態検出器１
５において、不正解出力ユニット最小誤差検出器１３か
らの出力である不正解出力ユニット最小誤差あるいは正
解出力ユニット最小余裕検出器１４からの出力である正
解出力ユニット最小余裕が、与えられた第２のスレショ
ルドを越えた状態から一旦スレショルド以下となり、再
びスレショルドを急激に越えた場合には、異なる安定な
ローカルミニマム捕獲状態に移ったと見做す。Further, the local minimum capture state detector 1
5, the incorrect output unit minimum error which is the output from the incorrect output unit minimum error detector 13 or the correct output unit minimum allowance which is the output from the correct output unit minimum margin detector 14 is given by the second If the threshold is exceeded and then falls below the threshold, and the threshold is rapidly exceeded again, it is considered that a different stable local minimum capture state has been entered.

【００２１】更に、安定なローカルミニマム捕獲状態と
なった後、不正解出力ユニット最小誤差検出器１３から
の出力である不正解出力ユニット最小誤差が、学習回数
と共に第３のスレショルドを越えた状態から徐々に小さ
くなり、この第３のスレショルド以下となり、多値化の
ための多値識別スレショルドに近づくと、安定したロー
カルミニマム捕獲状態から放れ、収束過程にあるとみな
し、ｄｍ、Ｄｍ１、Ｄｍ２及びＤｍ３の値を小さくして
も良い。Further, after the stable local minimum capture state is reached, the incorrect output unit minimum error output from the incorrect output unit minimum error detector 13 exceeds the third threshold together with the learning count. It gradually becomes smaller than this third threshold, and when it approaches the multilevel discrimination threshold for multileveling, it is considered that the local minimum capture state is released and it is considered to be in the convergence process, and dm, Dm1, Dm2 and Dm3. The value of may be reduced.

【００２２】出力正誤判定器１１内において、誤りがな
くなり多値教師信号と全ての多値出力ユニット信号とが
それぞれ一致した収束状態を一旦達成した後は、重み係
数更新誤差信号生成器１０において誤差信号をそのまま
重み係数更新誤差信号として用いて、重み係数制御器５
にて重み係数を更新してもよい。また、正解出力ユニッ
ト最小余裕検出器１４の出力信号が予め与えられた第４
のスレショルドを越えると、動作モード制御器１２から
学習終了信号を送出し、多層ニューラルネットワーク１
の学習を終了させてもよい。これにより、不必要な学習
を避けるとともに汎化性を大きく出来る。After the error is eliminated and the multivalued teacher signal and all the multivalued output unit signals have reached the converged state in the output correctness / incorrectness determiner 11, the error is generated in the weighting coefficient update error signal generator 10. The signal is used as it is as the weighting factor update error signal, and the weighting factor controller 5
The weighting factor may be updated at. In addition, the output signal of the correct output unit minimum margin detector 14 is given in advance by the fourth signal.
When the threshold of is exceeded, the learning end signal is sent from the operation mode controller 12, and the multilayer neural network 1
The learning of may be ended. This can avoid unnecessary learning and increase generalization.

【００２３】尚、本発明の学習方式は、多値ニューラル
ネットワークの他に連続量の教師信号を扱うニューラル
ネットワークにおいても、教師信号に収束し正解とみな
す領域を設けることにより、上記と同様な方式を適用で
き一般の教師信号を用いたニューラルネットワークの学
習に対しても幅広く適用できる。The learning method according to the present invention is similar to the above method in the neural network which handles a continuous amount of teacher signals in addition to the multi-valued neural network, by providing a region which is regarded as a correct answer by converging to the teacher signals. Can be widely applied to the learning of neural networks using general teacher signals.

【００２４】以上の実施例において、多層ニューラルネ
ットワークを前提に説明したが、教師信号を利用して学
習させるニューラルネットワークであれば、上記以外の
ニューラルネットワークを用いても良い。また、ここで
は、２値ニューラルネットワークを例にあげ説明した
が、説明は省略するが連続量の教師信号を持ったニュー
ラルネットワークでも本学習方式は適用できる。In the above embodiments, the explanation has been made on the premise of the multilayer neural network, but a neural network other than the above may be used as long as it is a neural network which performs learning by using a teacher signal. Further, although a binary neural network has been described as an example here, although the description is omitted, the present learning method can also be applied to a neural network having a continuous amount of teacher signals.

【００２５】[0025]

【発明の効果】以上述べたように、本発明のニューラル
ネットワークの学習方式において、学習を進めても多値
教師信号と多値出力ユニット信号との間の誤り個数が変
化しない非常に安定したローカルミニマムに陥っている
場合には、不正解出力ユニット最小誤差検出あるいは更
に正解出力ユニット最小余裕検出とを用いたローカルミ
ニマム捕獲状態検出器１５を介してこれを検出し、正解
を与える出力ユニットにおける重み係数更新誤差信号の
振幅あるいは更に極性を調整すること、更に不正解の出
力ユニットにおける重み係数更新誤差信号の振幅を調整
することにより、簡単にこのような安定したローカルミ
ニマムからも出ることが出来る。従って、従来方式より
学習回数が１０倍から１００倍程度著しく短縮される。
また、必要最小限の中間ユニット数あるいは中間層数を
用いて安定に最適状態に収束させることが出来ることか
ら、ハードウェア規模や演算量を少なく出来る。As described above, in the learning method of the neural network of the present invention, the number of errors between the multi-valued teacher signal and the multi-valued output unit signal does not change even if the learning is advanced, and it is a very stable local. In the case of falling into the minimum, this is detected through the local minimum capture state detector 15 using the incorrect output unit minimum error detection or the correct output unit minimum margin detection, and the weight in the output unit giving the correct answer. By adjusting the amplitude or further polarity of the coefficient update error signal, and further adjusting the amplitude of the weighting coefficient update error signal at the incorrect output unit, one can easily get out of such a stable local minimum. Therefore, the number of times of learning is remarkably reduced by 10 to 100 times as compared with the conventional method.
Further, since it is possible to stably converge to the optimum state by using the minimum required number of intermediate units or the number of intermediate layers, the hardware scale and the amount of calculation can be reduced.

【００２６】本発明の学習方式を用いた多値ニューラル
ネットワークは、従来方式に比べて少ない中間層ユニッ
ト数あるいは中間層を用い、初期依存性もなく且つ低い
演算精度を持った重み係数で重み係数が高速かつ安定に
収束し、所望の多値出力信号を送出することができる。
このことから、従来技術では実現が困難な大規模な多値
論理回路などを本発明を用いた多値ニューラルネットワ
ークにより短時間で自由に設計し実現することや、これ
まで迅速な学習が必要で、且つ完全な収束が要求される
人口知能システムや検索システム、パターン認識、デー
タ変換、データ圧縮、多値画像処理さらには通信システ
ムなどへの幅広い応用ができるなどの非常に幅広い効果
を有している。The multi-valued neural network using the learning method of the present invention uses a smaller number of intermediate layer units or intermediate layers as compared with the conventional method, has no initial dependence, and has a low calculation accuracy. Can converge at high speed and stably, and a desired multilevel output signal can be transmitted.
From this, it is necessary to freely design and realize a large-scale multi-valued logic circuit or the like, which is difficult to be realized by the conventional technology, in a short time by the multi-valued neural network using the present invention, and rapid learning so far. In addition, it has a very wide range of effects such as a wide range of applications such as artificial intelligence systems and search systems, pattern recognition, data conversion, data compression, multi-valued image processing, and communication systems that require complete convergence. There is.

[Brief description of drawings]

【図１】従来の学習方式による多層ニューラルネットワ
ークにおける学習処理の１構成例である。FIG. 1 is a configuration example of a learning process in a multilayer neural network according to a conventional learning method.

【図２】実施例における本発明の学習方式を用いた多層
ニューラルネットワークの学習処理の１構成例である。FIG. 2 is a configuration example of a learning process of a multilayer neural network using the learning method of the present invention in the embodiment.

[Explanation of symbols]

１多層ニューラルネットワーク２入力信号入力端子２_１、２_２、２_Ｎ入力信号ユニット入力端子３教師信号入力端子３_１、３_２、３_Ｍ教師信号ユニット入力端子４、４_１、４_２、４_Ｍ減算器５重み係数制御器６、６_１、６_２、６_Ｍ、７、７_１、７_２、７_Ｍ２値ス
レショルド回路８一致検出器９動作モード制御器１０、１０_１、１０_２、１０_Ｍ重み係数更新誤差信号
生成器１１学習判定器１２動作モード制御器１３不正解出力ユニット最小誤差検出器１４正解出力ユニット最小余裕検出器１５ローカルミニマム捕獲状態検出器１６ローカルミニマム捕獲状態検出器出力端子1 Multi-layer neural network 2 Input signal input terminal 2 ₁ , 2 ₂ , 2 _N Input signal unit input terminal 3 Teacher signal input terminal 3 ₁ , 3 ₂ , 3 _M Teacher signal unit input terminal 4, 4 ₁ , 4 ₂ , 4 _M Subtractor 5 Weighting factor controller 6, 6 ₁ , 6 ₂ , 6 _M , 7, 7 ₁ , 7 ₂ , 7 _M Binary threshold circuit 8 Matching detector 9 Operation mode controller 10, 10 ₁ , 10 ₂ , 10 _M weight coefficient update error signal generator 11 learning determiner 12 operation mode controller 13 incorrect output unit minimum error detector 14 correct output unit minimum margin detector 15 local minimum capture state detector 16 local minimum capture state detector output terminal

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平４−15860（ＪＰ，Ａ) 特開平４−251392（ＪＰ，Ａ) 特開平３−218559（ＪＰ，Ａ) 八塚陽太郎，「誤差摂動による２値３層ニューラルネットワークの高速収束化」，電子情報通信学会1995年総合大会講演論文集，日本，社団法人電子情報通信学会，1995年３月10日，第１分冊, ｐｐ．24 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06N 1/00 - 7/08 G06G 7/60 ＪＳＴファイル（ＪＯＩＳ) ＣＳＤＢ（日本国特許庁)─────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP-A-4-15860 (JP, A) JP-A-4-251392 (JP, A) JP-A-3-218559 (JP, A) Yotaro Yatsuka, “Error Fast Convergence of Binary 3-Layer Neural Network by Perturbation ", Proc. Of IEICE General Conference 1995, Japan, The Institute of Electronics, Information and Communication Engineers, March 10, 1995, 1st volume, pp. 24 (58) Fields investigated (Int.Cl. ⁷ , DB name) G06N 1/00-7/08 G06G 7/60 JST file (JOIS) CSDB (Japan Patent Office)

Claims

(57) [Claims]

1. A multi-valued neural network for learning using a multi-valued teacher signal, wherein:
In the output unit in which the multi-valued output unit signal of the multi-valued neural network matches the multi-valued teacher signal and is correct, the absolute value of the error signal obtained by subtracting the output unit signal from the multi-valued teacher signal is less than or equal to the first threshold. Then,
Generating a weighting coefficient update error signal having a polarity opposite to that of the error signal and having an amplitude that decreases as the distance from the multivalued teacher signal increases,
When the absolute value of the error signal is larger than the first threshold, a weighting coefficient update error signal of the same polarity having an amplitude equal to or smaller than the error signal is generated, and the multi-valued output unit signal becomes the multi-valued teacher signal. For output units that are incorrect and do not match,
A weight coefficient update error signal of the same polarity having an amplitude equal to or smaller than the error signal obtained by subtracting the output unit signal from the multi-valued teacher signal is generated, and the weight coefficient is updated using the weight coefficient update error signal. The minimum value of the absolute value of the error signal in the correct output unit is set as the incorrect output unit minimum error, and when it exceeds the second threshold, it is regarded as the local minimum capture state, and when this is detected, the weight coefficient update error signal is detected. A multi-valued neural network learning method characterized in that after the amplitude or the first threshold is increased, a weighting coefficient update error signal is obtained and the weighting coefficient is updated.

2. A multi-valued neural network for learning using a multi-valued teacher signal,
In the output unit in which the multi-valued output unit signal of the multi-valued neural network matches the multi-valued teacher signal and is correct, the absolute value of the error signal obtained by subtracting the output unit signal from the multi-valued teacher signal is less than or equal to the first threshold. Then,
Generating a weighting coefficient update error signal having a polarity opposite to that of the error signal and having an amplitude that decreases as the distance from the multivalued teacher signal increases,
When the absolute value of the error signal is larger than the first threshold, a weighting coefficient update error signal of the same polarity having an amplitude equal to or smaller than the error signal is generated, and the multi-valued output unit signal becomes the multi-valued teacher signal. For output units that are incorrect and do not match,
A weight coefficient update error signal of the same polarity having an amplitude equal to or smaller than the error signal is generated, the weight coefficient is updated using the weight coefficient update error signal, and the minimum absolute value of the error signal in the incorrect output unit Is the incorrect error output unit minimum error,
At the same time when this exceeds the second threshold, the minimum absolute value of the difference obtained by subtracting the output unit signal from the multilevel discrimination threshold for the correct output unit is defined as the correct output unit minimum margin, and the second threshold is set. When it exceeds, it is regarded as a local minimum capture state, this is detected, and the amplitude of the weighting factor update error signal or further the first
Multi-valued neural network learning method characterized by updating the weighting coefficient by obtaining the weighting coefficient update error signal after increasing the threshold of.

3. The learning of the multivalued neural network according to claim 1, wherein when the minimum error output unit minimum error becomes smaller than a third threshold after a local minimum capture state is entered, the weight is reduced. A multi-valued neural network learning method characterized in that the amplitude of the coefficient update error signal or further the first threshold is reduced to obtain a weight coefficient update error signal and the weight coefficient is updated.

4. The learning in the multi-valued neural network according to claim 1, 2 or 3, wherein after all of the multi-valued output unit signals match the corresponding multi-valued teacher signals, the correct output unit is selected. When the correct output unit minimum margin, which is the minimum absolute value of the difference obtained by subtracting the output unit signal from the multilevel discrimination threshold, becomes larger than the fourth threshold, the updating of the weighting coefficient in learning is terminated. Multi-valued neural network learning method.