JPH04305785A

JPH04305785A - Method for evaluating neural network

Info

Publication number: JPH04305785A
Application number: JP3070004A
Authority: JP
Inventors: Masahide Nomura; 野村　正英; Hisanori Miyagaki; 宮垣　久典; Eiji Toyama; 栄二遠山
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1991-04-02
Filing date: 1991-04-02
Publication date: 1992-10-28
Anticipated expiration: 2016-02-05
Also published as: JP3132027B2

Abstract

PURPOSE:To obtain a method for evaluating cause and effect relation between the input and output of a neural network. CONSTITUTION:The I/O functions of units in the neural network constituted of the hierarchical connection of units using respective neurons as models are developed by series and the network is expressed by a non-linear regression expression based upon the developed expression to attain the evaluation. Thus the cause and effect relation between the I/O of the neural network can be cleared by expressing the network by the non-linear regression expression.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は、ニューラル・ネットワ
ークの評価方法に係り、特に、階層型ニューラル・ネッ
トワークの入出力間の因果関係を評価するに好適なニュ
ーラル・ネットワークの評価方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a neural network evaluation method, and more particularly to a neural network evaluation method suitable for evaluating causal relationships between inputs and outputs of a hierarchical neural network.

【０００２】0002

【従来の技術】階層型ニューラル・ネットワークは、ニ
ューロンをモデル化したユニットの階層状結合により構
成され、入力信号を非線形変換でき、この非線形変換の
関数形を学習により構築できる特徴がある。このため、
この特徴を利用して、“ニューラル・コンピュータ”（
東京電気大学出版局、昭和６３−４）、“ニューラル・
ネットワーク情報処理”（産業図書、昭和６３−７）に
記載されているように、種々の分野への応用が試みられ
ている。2. Description of the Related Art Hierarchical neural networks are constructed by hierarchical combinations of units modeled on neurons, and are capable of nonlinear transformation of input signals, and are characterized by the ability to construct the functional form of this nonlinear transformation through learning. For this reason,
Taking advantage of this feature, “neural computers” (
Tokyo Denki University Press, 1986-4), “Neural
As described in "Network Information Processing" (Sangyo Tosho, 1986-7), attempts have been made to apply it to various fields.

【０００３】階層型ニューラル・ネットワークの構成例
及びユニットの構成を、それぞれ図１，図２に示す。前
記非線形変換の機能は、ユニットの入出力関数ｆ（ｘ）
の非線形性に依存しており、次式で表わされる関数が一
般的に使用されている。An example of the configuration of a hierarchical neural network and a unit configuration are shown in FIGS. 1 and 2, respectively. The function of the nonlinear transformation is the input/output function f(x) of the unit.
The function expressed by the following equation is generally used.

【０００４】0004

【数１】[Math 1]

【０００５】ここで、ｘ：入力 θ：しきい値この入出力関数ｆ（ｘ）の非線形性及びネットワークの
非線形変換機能については、“ニューラル・ネットワー
クによる連続写像の近似実現について”（電子情報通信
学会技術研究報告，ＭＢＥ８８−９，１９８８−４），
“ニューラル・ネットワークのｃａｐａｂｉｌｉｔｙに
ついて”（電子情報通信学会技術研究報告、ＭＢＥ８８
−５２，１９８８−７）に記載されているように、数学
理論の面から検討され、階層型ニューラル・ネットワー
クが連続写像の実現機構としてある種の万能性を持って
いることが証明されている。Here, x: Input θ: Threshold Regarding the nonlinearity of this input/output function f(x) and the nonlinear transformation function of the network, please refer to "About Realization of Approximation of Continuous Mapping by Neural Network" (Electronic Information and Communication Academic Technical Research Report, MBE88-9, 1988-4),
“About the capability of neural networks” (IEICE technical research report, MBE88
52, 1988-7), it has been studied from the perspective of mathematical theory and it has been proven that hierarchical neural networks have a certain degree of versatility as a mechanism for realizing continuous mapping. .

【０００６】[0006]

【発明が解決しようとする課題】上記従来技術は、階層
型ニューラル・ネットワークが連続写像の実現機構とし
てある種の万能性を持っていることを証明しているが、
ある連続写像を近似する階層型ニューラル・ネットワー
クの設計について、具体的な方法を提供していない。ま
た、学習により形成されたネットワークから入出力関係
の構造を抽出するための理論的背景も与えられていない
。このため、従来は、ネットワークは、ブラック・ボッ
クスとして扱われていた。[Problem to be Solved by the Invention] The above-mentioned prior art proves that hierarchical neural networks have a certain degree of versatility as a mechanism for realizing continuous mapping.
It does not provide a specific method for designing a hierarchical neural network that approximates a certain continuous mapping. Furthermore, no theoretical background has been provided for extracting the structure of input-output relationships from networks formed through learning. For this reason, networks have traditionally been treated as black boxes.

【０００７】本発明の目的は、ニューラル・ネットワー
クの入出力間の因果関係を評価する方法を提供するにあ
る。An object of the present invention is to provide a method for evaluating causal relationships between inputs and outputs of a neural network.

【０００８】[0008]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、ニューロンをモデル化したユニットの階
層状結合により構成したニューラル・ネットワークの応
用システムにおいて、ユニットの入出力関数を級数展開
し、この展開式を用いて、ネットワークを非線形回帰式
で表わすようにした。[Means for Solving the Problems] In order to achieve the above object, the present invention provides a neural network application system configured by hierarchically connecting units modeled on neurons, in which the input/output functions of the units are expanded into a series. Then, using this expansion formula, the network was expressed by a nonlinear regression formula.

【０００９】[0009]

【作用】ニューラル・ネットワークを非線形回帰式で表
わすことにより、ネットワークの入出力間の因果関係が
明確になる。[Operation] By expressing the neural network using a nonlinear regression equation, the causal relationship between the input and output of the network becomes clear.

【００１０】0010

【実施例】以下、本発明の一実施例を図３により説明す
る。本実施例は、対象システム１の入出力特性をニュー
ラル・ネットワークに学習させる学習システム２，学習
後のニューラル・ネットワークにより対象システム１の
入出力特性を推定する推定システム３，学習後のニュー
ラル・ネットワークを評価する評価システム４、から構
成される。[Embodiment] An embodiment of the present invention will be described below with reference to FIG. This embodiment includes a learning system 2 that causes a neural network to learn the input/output characteristics of the target system 1, an estimation system 3 that estimates the input/output characteristics of the target system 1 using the neural network after learning, and a neural network after learning. The system consists of an evaluation system 4 that evaluates.

【００１１】対象システム１は、プラント等のように入
力に対して出力が対応付けられるものであれば、どのよ
うなものでもよい。The target system 1 may be any type of system, such as a plant, as long as outputs can be associated with inputs.

【００１２】推定システム３は、階層型ニューラル・ネ
ットワークを用いて、対象システム１の入力ｘ（ベクト
ル）に対する出力ｙ（ベクトル）を推定する。この関係
は、次式で表わされる。The estimation system 3 estimates the output y (vector) of the target system 1 for the input x (vector) using a hierarchical neural network. This relationship is expressed by the following equation.

【００１３】[0013]

【数２】[Math 2]

【００１４】ここで、ｇ：非線形関係（ベクトル）ｙ：
出力ｙの推定値（ベクトル）階層型ニューラル・ネットワークは、図１に示すように
、図２に示すユニットの階層状結合により構成される。各層のユニットの入出力関係は、次式で表わされる。Here, g: nonlinear relationship (vector) y:
Estimated value (vector) of output y A hierarchical neural network, as shown in FIG. 1, is constructed by a hierarchical combination of units shown in FIG. 2. The input/output relationship of the units in each layer is expressed by the following equation.

【００１５】[0015]

【数３】[Math 3]

【００１６】[0016]

【数４】[Math 4]

【００１７】ここで、ｕｊ（ｋ）：第ｋ層の第ｊユニッ
トへの入力の総和ｖｊ（ｋ）：第ｋ層の第ｊユニットの出力ｗｉｊ（ｋ−
１，ｋ）：第（ｋ−１）層の第ｉユニットから第ｋ層の
第ｊユニットへの結合の重み係数ｆ：各ユニットの入出
力関数を与える関数（入出力関数）なお、第１層（入力層）の各ユニットは、ユニットの入
力と同じものを出力する。数２の非線形関数ｇの特性は
、層の個数，各層のユニットの個数、重み係数ｗｉｊ（
ｋ−１，ｋ）が変わると変化する。したがって、これら
を調整することにより、対象システム１の入出力特性を
表わす非線形関数ｇが得られる。特に、この重み係数ｗ
ｉｊ（ｋ−１，ｋ）の調整は、学習により実現できる。Here, uj(k): sum of inputs to the j-th unit of the k-th layer vj(k): output wij(k-
1, k): Weighting coefficient for coupling from the i-th unit of the (k-1)th layer to the j-th unit of the k-th layer f: Function that provides the input/output function of each unit (input/output function) Each unit in the layer (input layer) outputs the same thing as the unit's input. The characteristics of the nonlinear function g in Equation 2 are the number of layers, the number of units in each layer, and the weighting coefficient wij (
k-1, k) changes. Therefore, by adjusting these, a nonlinear function g representing the input/output characteristics of the target system 1 can be obtained. In particular, this weighting factor w
Adjustment of ij (k-1, k) can be realized by learning.

【００１８】学習システム２は、対象システム１の入出
力特性を表わす非線形関数ｇを学習により構築する。次
に、この学習のアルゴリズムについて説明する。The learning system 2 constructs a nonlinear function g representing the input/output characteristics of the target system 1 through learning. Next, this learning algorithm will be explained.

【００１９】先ず、学習用データとして入出力の組（ｘ
ｔ，ｙｔ）が与えられたとき、次式に示す誤差の２乗和
を損失関数ｒとして定義する。First, an input/output set (x
t, yt) is given, the sum of squared errors shown in the following equation is defined as the loss function r.

【００２０】[0020]

【数５】[Math 5]

【００２１】ここで、ｗ：ニューラル・ネットワークの
結合の重み係数をすべてまとめたものｖｊ（ｍ）（ｗ，ｘｔ）：入力ｘｔ　と重みｗから総合
的に得られる第ｎ層（出力層）の第ｊユニットの出力ｗ
の修正量Δｗは、損失関数ｒのｗについての勾配（ｇｒ
ａｄｉｅｎｔ）から求められ、次式で表わされる。Here, w: A collection of all the weighting coefficients of connections in the neural network vj (m) (w, xt): The nth layer (output layer) obtained comprehensively from the input xt and the weight w. Output of j-th unit w
The correction amount Δw is the gradient (gr
adient) and is expressed by the following formula.

【００２２】[0022]

【数６】[Math 6]

【００２３】数６の右辺の▽ｒ　の各成分は、次式のよ
うに変形できる。Each component of ▽r on the right side of equation 6 can be transformed as shown in the following equation.

【００２４】[0024]

【数７】[Math 7]

【００２５】数７に数４を代入して整理すると、次式が
導かれる。By substituting the equation 4 into the equation 7 and sorting it out, the following equation is derived.

【００２６】[0026]

【数８】[Math. 8]

【００２７】ｋ≠ｍのとき、数８の右辺の∂ｒ／∂ｕｊ
（ｋ）は、次式により求められる。When k≠m, ∂r/∂uj on the right side of equation 8
(k) is obtained by the following equation.

【００２８】[0028]

【数９】[Math. 9]

【００２９】数９に数３，数４を代入して、整理すると
次式が得られる。By substituting Equation 3 and Equation 4 into Equation 9 and rearranging, the following equation is obtained.

【００３０】[0030]

【数１０】[Math. 10]

【００３１】ここで、ｆ′：ｆの導関数∂ｒ／∂ｕｊ（
ｋ）＝ｄｊ（ｋ）とおくと、数６，数１０は、次式で表
わされる。Here, f': derivative of f ∂r/∂uj(
k)=dj(k), Equations 6 and 10 are expressed by the following equations.

【００３２】[0032]

【数１１】[Math. 11]

【００３３】[0033]

【数１２】[Math. 12]

【００３４】また、ｋ＝ｍのとき、∂ｒ／∂ｕｊ（ｍ）
＝ｄｊ（ｍ）は、数５より次式で求められる。[0034] Also, when k=m, ∂r/∂uj(m)
=dj(m) is obtained from Equation 5 using the following equation.

【００３５】[0035]

【数１３】[Math. 13]

【００３６】数１１，数１２，数１３を用いると、結合
の重み係数ｗｉｊ（ｋ−１，ｋ）の修正が、ｋ＝ｍから
ｋ＝２に向って、再帰的に計算できる。すなわち、出力
層での理想出力と実際の出力との誤差が、出力層から入
力層の方向へ、信号の伝播と逆の方向にｗｉｌ（ｋ，ｋ
＋１）で重み付けた和をとりながら伝播していく。これ
が、誤差逆伝播学習アルゴリズムである。Using Equations 11, 12, and 13, the modification of the connection weighting coefficient wij(k-1,k) can be calculated recursively from k=m to k=2. In other words, the error between the ideal output and the actual output at the output layer is wil(k, k
+1) and propagates while calculating the weighted sum. This is the error backpropagation learning algorithm.

【００３７】入出力関数ｆがすべてのユニットに共通で
、数１で与えられる場合、ｆ′は次式で表わされる。When the input/output function f is common to all units and is given by equation 1, f' is expressed by the following equation.

【００３８】[0038]

【数１４】[Math. 14]

【００３９】数３と数１４より、次式が導かれる。From Equations 3 and 14, the following equation is derived.

【００４０】[0040]

【数１５】[Math. 15]

【００４１】なお、学習を滑らかに速く収束させるため
に、数１１は次式のように修正することができる。Note that in order to converge the learning smoothly and quickly, equation 11 can be modified as shown in the following equation.

【００４２】[0042]

【数１６】[Math. 16]

【００４３】ここで、α：小さな正の定数（α＝１−ε
としてもよい）ｔ：修正の回数（あるいは時刻（離散））評価システム
４は、学習後のニューラル・ネットワークを評価する。すなわち、学習により対象システム１の入出力特性を表
わすニューラル・ネットワークが得られるが、このニュ
ーラル・ネットワークを評価する。このために、ユニッ
トの入出力関数を級数展開し、この展開式を用いて、ネ
ットワークを非線形回帰式で表わし、この非線形回帰式
を用いて、ネットワークを評価する。以下、これについ
て、詳細に説明する。なお、評価システム４は、学習後
のニューラル・ネットワークの解析評価のみならずニュ
ーラル・ネットワークの設計にも利用できる。Here, α: a small positive constant (α=1−ε
) t: number of corrections (or time (discrete)) The evaluation system 4 evaluates the neural network after learning. That is, a neural network representing the input/output characteristics of the target system 1 is obtained through learning, and this neural network is evaluated. For this purpose, the input/output function of the unit is expanded into a series, the network is expressed by a nonlinear regression equation using this expansion, and the network is evaluated using this nonlinear regression equation. This will be explained in detail below. Note that the evaluation system 4 can be used not only for analysis and evaluation of neural networks after learning, but also for designing neural networks.

【００４４】関数の級数展開の１つの方法として、テイ
ラー展開があり、これを利用すると、入出力関数ｆ（ｘ
）は、次式により表わされる。One method of series expansion of a function is Taylor expansion, and when this is used, the input/output function f(x
) is expressed by the following formula.

【００４５】[0045]

【数１７】[Math. 17]

【００４６】数１に示す入出力関数ｆ（ｘ）の導関数は
、次式で表わされ、次数が高くなると急激に式が複雑に
なる。The derivative of the input/output function f(x) shown in Equation 1 is expressed by the following equation, and as the order increases, the equation becomes rapidly complicated.

【００４７】[0047]

【数１８】[Math. 18]

【００４８】この数１８で表わされる導関数のｘ＝ｘ０
　における値を数１７に代入すると、ｆ（ｘ）のテイラ
ー展開式が得られる。x=x0 of the derivative expressed by this number 18
By substituting the value in Equation 17, the Taylor expansion of f(x) is obtained.

【００４９】上記ｆ（ｘ）のテイラー展開式の基本式は
、ｘ０　＝０における入出力関数ｆ（ｘ）のテイラー展
開式、すなわちマクローリン展開式であり、これについ
て誤差評価する。数１７にｘ０　＝０を代入すると、次
式が得られる。The basic formula of the above Taylor expansion of f(x) is the Taylor expansion of the input/output function f(x) at x0 = 0, that is, the Maclaurin expansion, and errors are evaluated for this. By substituting x0 = 0 into Equation 17, the following equation is obtained.

【００５０】[0050]

【数１９】[Math. 19]

【００５１】また、数１８にｘ＝０を代入すると、次式
が導かれる。Further, by substituting x=0 into Equation 18, the following equation is derived.

【００５２】[0052]

【数２０】[Math. 20]

【００５３】この数２０を整理すると、ｘ＝０における
入出力関数ｆ（ｘ）の６次までの導関数の値は、次式で
表わされる。Rearranging this number 20, the value of the derivative up to the sixth order of the input/output function f(x) at x=0 is expressed by the following equation.

【００５４】[0054]

【数２１】[Math. 21]

【００５５】この数２１において、しきい値θ＝０を代
入すると次式が得られる。By substituting the threshold value θ=0 into Equation 21, the following equation is obtained.

【００５６】[0056]

【数２２】[Math. 22]

【００５７】また、数２１において、しきい値θ＝０．
５　及び１を代入すると次式が得られる。In addition, in Equation 21, the threshold value θ=0.
By substituting 5 and 1, the following formula is obtained.

【００５８】[0058]

【数２３】[Math. 23]

【００５９】[0059]

【数２４】[Math. 24]

【００６０】数１９において、７次以上の項を省略して
、数２２，数２３，数２４を代入すると、次式が導かれ
る。In Equation 19, by omitting the terms of order 7 or higher and substituting Equation 22, Equation 23, and Equation 24, the following equation is derived.

【００６１】[0061]

【数２５】[Math. 25]

【００６２】[0062]

【数２６】[Math. 26]

【００６３】[0063]

【数２７】[Math. 27]

【００６４】ここで、ｆｎ（ｘ）：ｆ（ｘ）のｎ次近似
式数２５より、しきい値θが零の場合、入出力関数ｆ（
ｘ）のｘ＝０でのテイラー展開式は、奇数次の項のみに
より表わされることが予想される。これに対して、数２
６，数２７より、しきい値θが非零の場合、このテイラ
ー展開式は、奇数次及び偶数次の項により表わされるこ
とが分かる。Here, fn(x): From the n-th approximation formula of f(x) (25), when the threshold value θ is zero, the input/output function f(
The Taylor expansion of x) at x=0 is expected to be expressed only by odd-order terms. On the other hand, the number 2
6. From Equation 27, it can be seen that when the threshold value θ is non-zero, this Taylor expansion equation is expressed by odd-order and even-order terms.

【００６５】しきい値θ＝０の場合について、入出力関
数ｆ（ｘ）のｎ次近似式ｆｎ（ｘ）数２５で求めた近似
値と真値との比較結果を表１及び図４に示す。この表及
び図から、ｆ（ｘ）の値がほぼ０．１〜０．９となるｘ
の範囲、For the case where the threshold value θ=0, Table 1 and FIG. show. From this table and figure, it can be seen that x where the value of f(x) is approximately 0.1 to 0.9
range of,

【００６６】[0066]

【表１】[Table 1]

【００６７】−２≦ｘ≦２において、近似の次数が大き
くなる程精度が良くなることが分かる。すなわち、１次
，３次，５次近似式で、それぞれ、±１２％，±５％，
±２％の誤差内の近似値が得られている。It can be seen that when -2≦x≦2, the accuracy increases as the order of approximation increases. In other words, the first, third, and fifth approximations are ±12%, ±5%, respectively.
Approximate values within ±2% error have been obtained.

【００６８】また、しきい値θ＝０．５　及び１の場合
について、入出力関数ｆ（ｘ）の６次近似式ｆ６（ｘ）
（数２６及び数２７で求めた近似値と真値との比較結果
を表２、図５及び表３，図６に示す。これらの表及び図
から、ｘの範囲が−２≦ｘ≦２において、θ＝０．５，
１　の場合の誤差が、それぞれ、±５０％，±１４０％
となり、しきい値θが大きくなる程精度が悪くなること
が分かる。In addition, for the cases where the threshold value θ=0.5 and 1, the sixth-order approximation formula f6(x) of the input/output function f(x)
(Table 2, Figure 5, Table 3, and Figure 6 show the comparison results between the approximate values and true values obtained using Equations 26 and 27. From these tables and figures, the range of x is -2≦x≦2. , θ=0.5,
1, the error is ±50% and ±140%, respectively.
It can be seen that the larger the threshold value θ, the worse the accuracy.

【００６９】[0069]

【表２】[Table 2]

【００７０】[0070]

【表３】[Table 3]

【００７１】次に、入出力関数ｆ（ｘ）のテイラー展開
式の特性について説明する。Next, the characteristics of the Taylor expansion of the input/output function f(x) will be explained.

【００７２】しきい値θが零の場合、入出力関数ｆ（ｘ
）のｘ＝０でのテイラー展開式は、奇数次の項のみによ
り表わされ、しきい値θが非零の場合は、奇数次及び偶
数次の項により表わされると先に述べたが、先ず、これ
について説明する。When the threshold value θ is zero, the input/output function f(x
) at x=0 is expressed by only odd-order terms, and when the threshold θ is non-zero, it is expressed by odd-order and even-order terms, First, this will be explained.

【００７３】関数ｇ（ｘ）を次式により定義する。The function g(x) is defined by the following equation.

【００７４】[0074]

【数２８】[Math. 28]

【００７５】数２８に−ｘを代入すると、次式が導かれ
る。By substituting -x into Equation 28, the following equation is derived.

【００７６】[0076]

【数２９】[Math. 29]

【００７７】また、数２８の両辺に−１を掛けると、次
式が得られる。Further, by multiplying both sides of Equation 28 by -1, the following equation is obtained.

【００７８】[0078]

【数３０】[Math. 30]

【００７９】数２９，数３０より次式が成立つ。From Equations 29 and 30, the following equation holds true.

【００８０】[0080]

【数３１】[Math. 31]

【００８１】数３１は、関数ｇ（ｘ）が奇関数であるこ
とを示している。したがって、関数ｇ（ｘ）は、次式で
表わされる。Equation 31 shows that the function g(x) is an odd function. Therefore, the function g(x) is expressed by the following equation.

【００８２】[0082]

【数３２】[Math. 32]

【００８３】ここで、ａ２ｎ＋１：係数一方、数１と数
２８から、次式が成立つ。Here, a2n+1: Coefficient On the other hand, from Equation 1 and Equation 28, the following equation holds true.

【００８４】[0084]

【数３３】[Math. 33]

【００８５】ここで、ｈ（ｘ）：入出力関数ｆ（ｘ）に
おいて、θ＝０としたときの関数この数３３に数３２を
代入すると、次式が得られる。Here, h(x): Input/output function f(x), function when θ=0. Substituting equation 32 into equation 33, the following equation is obtained.

【００８６】[0086]

【数３４】[Math. 34]

【００８７】数３４より、しきい値θが零の場合、入出
力関数ｆ（ｘ）のｘ＝０でのテイラー展開式が奇数次の
項のみにより表わされることが分かる。From Equation 34, it can be seen that when the threshold value θ is zero, the Taylor expansion of the input/output function f(x) at x=0 is expressed by only odd-order terms.

【００８８】次に、数２８から次式が導かれる。Next, the following equation is derived from Equation 28.

【００８９】[0089]

【数３５】[Math. 35]

【００９０】数１と数３５より、次式が得られる。From equations 1 and 35, the following equation is obtained.

【００９１】[0091]

【数３６】[Math. 36]

【００９２】数３６に数３２を代入すると、次式が導か
れる。By substituting the number 32 into the number 36, the following equation is derived.

【００９３】[0093]

【数３７】[Math. 37]

【００９４】数３７を変形すると、次式が得られる。By transforming Equation 37, the following equation is obtained.

【００９５】[0095]

【数３８】[Math. 38]

【００９６】ここで、ｂｉ　：係数（θの関数）数３８
より、しきい値θが非零の場合、入出力関数ｆ（ｘ）の
ｘ＝０でのテイラー展開式が、奇数次及び偶数次の項に
より表示されることが分かる。Here, bi: coefficient (function of θ) number 38
From this, it can be seen that when the threshold value θ is non-zero, the Taylor expansion equation of the input/output function f(x) at x=0 is expressed by odd-order and even-order terms.

【００９７】さらに、先に、しきい値θが零の場合、入
出力関数ｆ（ｘ）のｘ＝０でのテイラー展開式でｘの６
次の項まで使用すると、かなり精度の良い近似値が得ら
れるが、しきい値θが非零の場合は、精度が悪いことが
分かった。しかし、この問題は、容易に解決できる。す
なわち、数３３と数１から次式が導かれる。Furthermore, first, when the threshold value θ is zero, the Taylor expansion of the input/output function f(x) at x=0 is
It was found that if the following terms are used, a fairly accurate approximation value can be obtained, but if the threshold value θ is non-zero, the accuracy is poor. However, this problem can be easily solved. That is, the following equation is derived from Equation 33 and Equation 1.

【００９８】[0098]

【数３９】[Math. 39]

【００９９】数３９は、ｈ（ｘ）をθだけ平行移動する
と、ｆ（ｘ）となることを示している。したがって、し
きい値θが非零の場合、入出力関数ｆ（ｘ）のテイラー
展開式で良い精度を得るには、θ＝０の場合のｆ（ｘ）
のｘ＝０でのテイラー展開式をθだけ平行移動し、これ
を使用すればよい。Equation 39 shows that when h(x) is translated by θ, it becomes f(x). Therefore, when the threshold θ is non-zero, in order to obtain good accuracy with the Taylor expansion of the input/output function f(x), f(x) when θ=0
It is sufficient to translate the Taylor expansion equation at x=0 by θ and use this.

【０１００】先に、テイラー展開式を利用して、入出力
関数ｆ（ｘ）の非線形性について検討できることを示し
た。引続いて、このテイラー展開式を利用して、階層型
ニューラル・ネットワークの非線形変換機能について検
討できることを示す。なお、階層型ネットワークのうち
で基本となるのは、３層型ネットワークであり、これを
対象にして説明する。Previously, it was shown that the nonlinearity of the input/output function f(x) can be studied using the Taylor expansion equation. Next, we show that this Taylor expansion can be used to study the nonlinear transformation function of hierarchical neural networks. Note that the basic type of hierarchical network is a three-layer network, and this will be explained below.

【０１０１】３層型ネットワークのうちで、比較的単純
な２入力１出力のネットワークについて検討する。図７
にその構成を示す。なお、展開を簡単にするために、入
出力関数ｆ（ｘ）は、全てのユニットで同じ関数を使用
するものとする。Among the three-layer networks, a relatively simple two-input one-output network will be considered. Figure 7
The structure is shown below. Note that in order to simplify the expansion, it is assumed that the same input/output function f(x) is used in all units.

【０１０２】先ず、入力層のユニットの出力ｖｊ（１）
（ｊ＝１，２）は、ネットワークの定義より次式で与え
られる。First, the output vj(1) of the input layer unit
(j=1, 2) is given by the following equation from the network definition.

【０１０３】[0103]

【数４０】[Math. 40]

【０１０４】次に、中間層のユニットへの入力の総和ｕ
ｊ（２）（ｊ＝１，２，３，…，Ｎ）は、次式で与えら
れる。Next, the sum of inputs to the units in the middle layer u
j(2) (j=1, 2, 3,..., N) is given by the following equation.

【０１０５】[0105]

【数４１】[Math. 41]

【０１０６】また、中間層のユニットの出力ｖｊ（２）
（ｊ＝１，２，３，…，Ｎ）は、数３８で用いると次式
で表わされる。[0106] Also, the output vj (2) of the unit in the middle layer
(j=1, 2, 3, . . . , N) is expressed by the following equation when used in Equation 38.

【０１０７】[0107]

【数４２】[Math. 42]

【０１０８】出力層のユニットへの入力の総和ｕｊ（３
）（ｊ＝１）は、次式で与えられる。The sum of inputs to the output layer unit uj(3
) (j=1) is given by the following equation.

【０１０９】[0109]

【数４３】[Math. 43]

【０１１０】また、出力層のユニットの出力ｖｊ（３）
（ｊ＝１）は、数３８を用いると次式で表わされる。[0110] Also, the output vj (3) of the output layer unit
(j=1) is expressed by the following equation using Equation 38.

【０１１１】[0111]

【数４４】[Math. 44]

【０１１２】数４４に数４２を代入すると、次式が得ら
れる。By substituting equation 42 into equation 44, the following equation is obtained.

【０１１３】[0113]

【数４５】[Math. 45]

【０１１４】この数４５に数４０を代入すると、ｖ１（
３）をｙと表わすと、次式が導かれる。[0114] Substituting the number 40 into this number 45, v1(
If 3) is expressed as y, the following equation is derived.

【０１１５】[0115]

【数４６】[Math. 46]

【０１１６】数４６を展開して整理すると共に、次式が
得られる。By expanding and rearranging Equation 46, the following equation is obtained.

【０１１７】[0117]

【数４７】[Math. 47]

【０１１８】ここで、ｂｍｎ：ｂｉ　，ｗｉｊ（ｋ，ｌ
）の関数数４７より、図７に示すニューラル・ネットワ
ークは、中間層のユニットの個数Ｎを増加させると、重
み係数ｗ１ｉ（１，２），ｗ２ｉ（１，２），ｗｉ１（
２，３）（ｉ＝１，２，３，…，Ｎ）の個数が増加して
調整の自由度が増加し、より高次で複雑な非線関数を近
似できることが分かる。Here, bmn:bi, wij(k, l
), the neural network shown in FIG. 7 has weighting coefficients w1i (1, 2), w2i (1, 2), wi1 (
2, 3) (i=1, 2, 3, . . . , N) increases, the degree of freedom of adjustment increases, and it is possible to approximate higher-order and more complex nonlinear functions.

【０１１９】次に、しきい値ユニットの機能について、
ｆ（ｘ）のテイラー展開式を利用して検討できることを
示す。Next, regarding the function of the threshold unit,
We will show that it can be studied using the Taylor expansion formula of f(x).

【０１２０】しきい値ユニットは、常に１を出力し、階
層型ニューラル・ネットワークの各ユニットの入出力関
数のしきい値をユニット毎に変化させる機能がある。す
なわち、これによりユニット毎に入出力関数の平行移動
量を変化させることができる。このしきい値ユニットを
組込んだ２入力１出力ネットワークについて以下検討す
る。図８にその構成を示す。なお、ここでは、入力層の
しきい値ユニットは、入力の個数から除外している。ま
た、展開を簡単にするために、入出力関数ｆ（ｘ）は、
全てのユニットで同じ関数を使用するものとする。The threshold unit always outputs 1 and has a function of changing the threshold value of the input/output function of each unit of the hierarchical neural network for each unit. That is, this allows the amount of parallel movement of the input/output function to be changed for each unit. A two-input one-output network incorporating this threshold unit will be discussed below. Figure 8 shows its configuration. Note that here, the threshold unit of the input layer is excluded from the number of inputs. Also, to simplify the expansion, the input/output function f(x) is
The same function shall be used in all units.

【０１２１】先ず、入力層のユニットの出力ｖｊ（１）
（ｊ＝０，１，２）は、ネットワークの定義より次式で
与えられる。First, the output vj(1) of the input layer unit
(j=0, 1, 2) is given by the following equation from the network definition.

【０１２２】[0122]

【数４８】[Math. 48]

【０１２３】次に、中間層のユニットへの入力の総和ｕ
ｊ（２）（ｊ＝０，１，２，３，…，Ｎ）は次式で与え
られる。Next, the sum of inputs to the units in the middle layer u
j(2) (j=0, 1, 2, 3,..., N) is given by the following equation.

【０１２４】[0124]

【数４９】[Math. 49]

【０１２５】[0125]

【数５０】[Number 50]

【０１２６】このとき、中間層のユニットの出力ｖｊ（
２）（ｊ＝０，１，２，３，…，Ｎ）は、数３８を用い
ると次式で表わされる。At this time, the output vj(
2) (j=0, 1, 2, 3,..., N) can be expressed by the following equation using Equation 38.

【０１２７】[0127]

【数５１】[Math. 51]

【０１２８】また、出力層のユニットへの総和ｕｊ（３
）（ｊ＝１）は、次式で与えられる。[0128] Also, the summation uj(3
) (j=1) is given by the following equation.

【０１２９】[0129]

【数５２】[Math. 52]

【０１３０】このとき、出力層のユニットの出力ｖｊ（
３）（ｊ＝１）は、数３８を用いると次式で表わされる
。At this time, the output vj(
3) (j=1) can be expressed by the following equation using Equation 38.

【０１３１】[0131]

【数５３】[Math. 53]

【０１３２】数５３に数５１を代入すると、次式が得ら
れる。By substituting equation 51 into equation 53, the following equation is obtained.

【０１３３】[0133]

【数５４】[Math. 54]

【０１３４】この数５４に数４８を代入すると共に、ｖ
１（３）をｙと表わすと、次式が導かれる。[0134] While substituting the number 48 into this number 54, v
If 1(3) is expressed as y, the following equation is derived.

【０１３５】[0135]

【数５５】[Math. 55]

【０１３６】数５５を展開して整理すると、数４６を展
開して得られる数４７と同形の式が導かれる。この場合
、しきい値ユニットを導入したことにより、数４６より
数４７の方がｗ０１（２，３），ｗ０ｉ（１，２）（ｉ
＝１，２，３，…，Ｎ）の分だけ重み係数の個数が増加
し、入出力関数のテイラー展開式を平行移動させる自由
度が増加する。これにより高次で複雑な非線形関数の近
似に大きい調整の自由度が生じて、近似精度が向上する
ことが分かる。When formula 55 is expanded and rearranged, an expression having the same form as formula 47 obtained by expanding formula 46 is derived. In this case, by introducing the threshold unit, number 47 is better than number 46 with w01 (2, 3), w0i (1, 2) (i
=1, 2, 3, . . . , N), the number of weighting coefficients increases, and the degree of freedom for translating the Taylor expansion of the input/output function increases. It can be seen that this allows a large degree of freedom in adjustment in the approximation of high-order, complex nonlinear functions, and improves the approximation accuracy.

【０１３７】先に、入出力関数ｆ（ｘ）の非線形性及び
階層型ニューラル・ネットワークの非線形変換機能につ
いてｆ（ｘ）のテイラー展開式を利用して検討できるこ
とを示した。引続いて、以下の項目について説明する。Previously, it was shown that the nonlinearity of the input/output function f(x) and the nonlinear transformation function of the hierarchical neural network can be studied using the Taylor expansion equation of f(x). Next, the following items will be explained.

【０１３８】（１）入出力関数ｆ（ｘ）の高次導関数値
の簡易導出法（２）ニューラル・ネットワークの他の構成法（３）ニ
ューラル・ネットワークの構造決定の１方法先ず、入出
力関数ｆ（ｘ）の高次導関数値の簡易導出法について説
明する。先に、入出力関数ｆ（ｘ）の６次までの導関数
を導出し、ｘ＝０における導関数の値を求めた。しかしながら、導関数の次数が高くなると急激に式が複
雑になり、式の導出及び値の計算に非常に時間が掛かる
という問題がある。この問題を解決する方法について、
以下、説明する。(1) A simple method for deriving the higher-order derivative value of the input/output function f(x) (2) Other methods for constructing a neural network (3) One method for determining the structure of a neural network First, input/output A simple method for deriving the higher-order derivative value of the function f(x) will be explained. First, derivatives up to the sixth order of the input/output function f(x) were derived, and the value of the derivative at x=0 was determined. However, there is a problem in that as the order of the derivative increases, the equation rapidly becomes more complex, and it takes a very long time to derive the equation and calculate the value. For information on how to resolve this issue,
This will be explained below.

【０１３９】先ず、入出力関数ｆ（ｘ）において、θ＝
０のときの関数ｈ（ｘ）は、先に説明したように数３３
で表わされる。この関数ｈ（ｘ）のマクローリン展開式
は、次式で与えられる。First, in the input/output function f(x), θ=
The function h(x) when 0 is, as explained earlier, the number 33
It is expressed as The Maclaurin expansion of this function h(x) is given by the following equation.

【０１４０】[0140]

【数５６】[Number 56]

【０１４１】数５６は、次式のように変形できる。[0141] Equation 56 can be transformed as shown in the following equation.

【０１４２】[0142]

【数５７】[Math. 57]

【０１４３】[0143]

【数５８】[Number 58]

【０１４４】また、数３３は、次式のように変形できる
。[0144] Furthermore, Equation 33 can be transformed as shown in the following equation.

【０１４５】[0145]

【数５９】[Math. 59]

【０１４６】[0146]

【数６０】[Number 60]

【０１４７】この数６０を数５９に代入して整理すると
、次式が導かれる。By substituting this number 60 into number 59 and rearranging it, the following equation is derived.

【０１４８】[0148]

【数６１】[Number 61]

【０１４９】数５７と数６１が等しいとして整理すると
、次式が得られる。If we rearrange the equations 57 and 61 as being equal, we obtain the following equation.

【０１５０】[0150]

【数６２】[Number 62]

【０１５１】数６２において、両辺のｘのｎ次の係数が
一致するためには、次式が成立つ必要がある。In Equation 62, in order for the n-th coefficients of x on both sides to match, the following equation must hold.

【０１５２】[0152]

【数６３】[Number 63]

【０１５３】この数６３の一般化式は、次式で表わされ
る。The generalized equation of number 63 is expressed by the following equation.

【０１５４】[0154]

【数６４】[Number 64]

【０１５５】さらに数３９と数５７より、入出力関数ｆ
（ｘ）は、次式で表わされる。Furthermore, from equations 39 and 57, the input/output function f
(x) is expressed by the following formula.

【０１５６】[0156]

【数６５】[Number 65]

【０１５７】数６４を用いて、ｃｎ　（ｎ＝０，１，…
，１４）を求めると、表４に示すようになり、次数ｎが
大きくなる程係数ｃｎ　が急速に小さくなることが分か
る。[0157] Using Equation 64, cn (n=0, 1,...
, 14) as shown in Table 4, and it can be seen that the coefficient cn decreases rapidly as the order n increases.

【０１５８】[0158]

【表４】[Table 4]

【０１５９】また、これらの値を用いて、入出力関数ｆ
（ｘ）でθ＝０のときの関数ｈ（ｘ）のマクローリン展
開式数５７において、ｎ次で打切ったときの近似式、す
なわちｎ次近似式ｆｎ（ｘ）の推定値を求めると表５及
び図９に示すようになる。[0159] Also, using these values, the input/output function f
In Maclaurin expansion formula number 57 of function h(x) when θ = 0 in 5 and FIG. 9.

【０１６０】[0160]

【表５】[Table 5]

【０１６１】この表及び図から、ｆ（ｘ）の値がほぼ０
．１〜０．９となるｘの範囲、−２≦ｘ≦２において、
７次，９次，１１次，１３次近似式で、それぞれ、±０
．７８％，±０．３２％，±０．１３％，±０．０５％
の誤差内の近似値が得られることが分かる。すなわち、
近似の打切り次数が大きくなる程精度が良くなることが
分かる。ただ、ｘの範囲が、−３≦ｘ≦３の場合、７次
，９次，１１次，１３次近似式でも、それぞれ、±２２
％，±２０％，±１８％，±１６．７％　の誤差内の近
似値となり、−２≦ｘ≦２の場合と比較して誤差がかな
り大きい。このことから、入出力関数ｆ（ｘ）は、ｘの
範囲が広がる程非線形度が急激に大きく、非常に高次の
近似式でも誤差は小さくならないことが分かる。[0161] From this table and figure, the value of f(x) is almost 0.
．． In the range of x from 1 to 0.9, -2≦x≦2,
±0 for 7th, 9th, 11th, and 13th approximations, respectively.
．． 78%, ±0.32%, ±0.13%, ±0.05%
It can be seen that an approximate value within the error of can be obtained. That is,
It can be seen that the larger the truncation order of approximation, the better the accuracy. However, when the range of x is -3≦x≦3, even the 7th, 9th, 11th, and 13th approximations are ±22
%, ±20%, ±18%, ±16.7%, and the error is considerably larger than in the case of -2≦x≦2. From this, it can be seen that the degree of nonlinearity of the input/output function f(x) increases rapidly as the range of x widens, and the error does not decrease even with a very high-order approximation formula.

【０１６２】次に、ニューラル・ネットワークの他の構
成法について、ｆ（ｘ）のテイラー展開式を利用して検
討できることを説明する。Next, it will be explained that another method of constructing a neural network can be studied using the Taylor expansion formula of f(x).

【０１６３】先に、図７に示す３層型ニューラル・ネッ
トワークを対象にして、非線形変換処理機能について検
討できることを説明した。[0163] Earlier, it was explained that the nonlinear transformation processing function can be studied using the three-layer neural network shown in Fig. 7 as a target.

【０１６４】このネットワークは、入力層，中間層，出
力層からなり、それぞれの層で、線形，非線形，非線形
の変換を行っている。このため、この構成を線形−非線
形−非線形構成と呼ぶことにする。この構成は、ニュー
ラル・ネットワークの基本構成であるが、他の構成とし
て、（１）線形−非線形−線形構成，（２）線形−線形
−非線形構成も考えられる。ｆ（ｘ）のテイラー展開式
を利用すると、これらの構成についても検討できること
を以下説明する。[0164] This network consists of an input layer, an intermediate layer, and an output layer, and each layer performs linear, nonlinear, and nonlinear transformations. Therefore, this configuration will be referred to as a linear-nonlinear-nonlinear configuration. This configuration is the basic configuration of a neural network, but other configurations include (1) linear-nonlinear-linear configuration and (2) linear-linear-nonlinear configuration. It will be explained below that these configurations can also be studied by using the Taylor expansion of f(x).

【０１６５】先ず、線形−非線形−線形構成の場合であ
るが、図１０にこの線形−非線形−線形構成のニューラ
ル・ネットワークを示す。なお、入出力の個数は、２入
力１出力とする。また、展開を簡単にするために、入出
力関数ｆ（ｘ）は、全てのユニットで同じ関数を使用す
るものとする。First, in the case of a linear-nonlinear-linear configuration, FIG. 10 shows a neural network with this linear-nonlinear-linear configuration. Note that the number of inputs and outputs is 2 inputs and 1 output. Furthermore, in order to simplify the expansion, it is assumed that the same input/output function f(x) is used in all units.

【０１６６】入力層のユニットの出力ｖｊ（１）（ｊ＝
１，２）は、図７に示す線形−非線形−線形構成と同様
、数４０で与えられ、また、中間層のユニットの入力の
総和ｕｊ（２）（ｊ＝１，２，３，…，Ｎ）及び出力ｖ
ｊ（２）（ｊ＝１，２，３，…，Ｎ）も、それぞれ数４
１及び数４２に与えられる。さらに、出力層のユニット
への火力の総和ｕｊ（３）（ｊ＝１）も、同様に数４３
で表わされる。ただし出力層のユニットの出力ｖｊ（３
）（ｊ＝１）は、次式に示すように、入力の総和ｕｊ（
３）（ｊ＝１）をそのまま出力した値として求められる
。[0166] Output vj (1) of input layer unit (j=
1, 2) is given by equation 40, similar to the linear-nonlinear-linear configuration shown in FIG. N) and output v
j(2) (j=1, 2, 3,..., N) is also the number 4
1 and number 42. Furthermore, the summation uj(3) (j=1) of the firepower to the units in the output layer is similarly expressed as Equation 43.
It is expressed as However, the output vj (3
) (j=1) is the sum of inputs uj(
3) It is obtained as a value that is output as is (j=1).

【０１６７】[0167]

【数６６】[Number 66]

【０１６８】数４２を数６６に代入すると、次式が得ら
れる。By substituting equation 42 into equation 66, the following equation is obtained.

【０１６９】[0169]

【数６７】[Number 67]

【０１７０】数４０を数６７に代入すると共に、ｖ１（
３）をｙで表わすと、次式が導かれる。[0170] While substituting the number 40 into the number 67, v1(
When 3) is expressed by y, the following equation is derived.

【０１７１】[0171]

【数６８】[Number 68]

【０１７２】数６８を展開して整理すると、数４６を展
開した数４７と同形の式が得られる。これより、図１０
に示す構成の階層型ネットワークも、図７に示す線形−
非線形−非線形構成のネットワークと同様、中間層のユ
ニットの個数を増加させると、重み係数ｗ１ｉ（１，２
），ｗ２ｉ（１，２），ｗｉ１（２，３）（ｉ＝１，２
，３，…，Ｎ）の個数が増加して調整の自由度が増加し
、より高次で複雑な非線形関数を近似できることが分か
る。ただ、数６８より数４６の方が、非線形変換を２回
行う分、より非線形度の高い関数を近似できる。When formula 68 is expanded and rearranged, an expression having the same form as formula 47 obtained by expanding formula 46 is obtained. From this, Figure 10
The hierarchical network with the configuration shown in FIG.
Similar to a network with a nonlinear-nonlinear configuration, when the number of units in the hidden layer increases, the weighting coefficient w1i (1, 2
), w2i (1, 2), wi1 (2, 3) (i=1, 2
, 3, . However, Equation 46 can approximate a function with a higher degree of nonlinearity than Equation 68 because the nonlinear transformation is performed twice.

【０１７３】次に、線形−線形−非線形構成の場合であ
るが、図１１に線形−線形−非線形構成のニューラル・
ネットワークを示す。なお、この場合も、入出力の個数
は、２入力１出力とする。また、展開を簡単にするため
に入出力関数は、全てのユニットで同じ関数を使用する
ものとする。Next, in the case of a linear-linear-nonlinear configuration, FIG.
Show network. In this case as well, the number of inputs and outputs is 2 inputs and 1 output. Also, in order to simplify the expansion, it is assumed that the same input/output functions are used in all units.

【０１７４】入力層のユニットの出力ｖｊ（１）（ｊ＝
１，２）は、図７に示す線形−非線形−非線形構成と同
様、数４０で与えられ、中間層のユニットの入力の総和
ｕｊ（２）（ｊ＝１，２，３，…，Ｎ）は、数４１で与
えられる。このとき、中間層のユニットの出力ｖｊ（２
）（ｊ＝１，２，３，…，Ｎ）は、次式に示すように入
力の総和ｕｊ（２）（ｊ＝１，２，３，…，Ｎ）をその
まま出力した値として求められる。Output vj (1) (j=
1, 2) is given by Equation 40, similar to the linear-nonlinear-nonlinear configuration shown in FIG. is given by equation 41. At this time, the output vj(2
) (j = 1, 2, 3, ..., N) is obtained as the value obtained by outputting the input summation uj (2) (j = 1, 2, 3, ..., N) as is, as shown in the following formula. .

【０１７５】[0175]

【数６９】[Number 69]

【０１７６】また、出力層のユニットへの入力層の総和
ｕｊ（３）（ｊ＝１）は、図７に示す線形−非線形−非
線形構成と同様数４３で与えられる。さらに、出力層の
ユニットの出力ｖｊ（３）（ｊ＝１）は、同様に数４４
で表わされる。数６９を数４４に代入すると、次式が得
られる。Further, the summation uj(3) (j=1) of the input layer to the unit of the output layer is given by Equation 43 as in the linear-nonlinear-nonlinear configuration shown in FIG. Furthermore, the output vj(3) (j=1) of the unit in the output layer is similarly expressed by the equation 44.
It is expressed as By substituting the number 69 into the number 44, the following equation is obtained.

【０１７７】[0177]

【数７０】[Number 70]

【０１７８】数４０を数７０に代入すると共に、ｖ１（
３）をｙと表わすと、次式が導かれる。[0178] While substituting the number 40 into the number 70, v1(
If 3) is expressed as y, the following equation is derived.

【０１７９】[0179]

【数７１】[Math. 71]

【０１８０】数７１は、展開して整理すると、数４６を
展開した数４７と同形の式が得られる。しかしながら、
この場合は、数４６と違って、中間層のユニットの個数
を増加させても非線形関数の近似の自由度は増加せず、
任意の高次非線形関数の近似は難しい。すなわち、数７
１は、次式のように変形できる。When Equation 71 is expanded and rearranged, an expression having the same form as Equation 47 obtained by expanding Equation 46 is obtained. however,
In this case, unlike Equation 46, increasing the number of units in the intermediate layer does not increase the degree of freedom in approximating the nonlinear function.
Approximating arbitrary high-order nonlinear functions is difficult. In other words, number 7
1 can be transformed as shown in the following equation.

【０１８１】[0181]

【数７２】[Number 72]

【０１８２】[0182]

【数７３】[Number 73]

【０１８３】数７２のパラメータは、実質Ｗ１，Ｗ２の
２個であり、中間層のユニットの個数を２個以上にして
も、自由度はユニット１個の場合と同じである。There are actually two parameters, W1 and W2, in Equation 72, and even if the number of units in the intermediate layer is two or more, the degree of freedom is the same as in the case of one unit.

【０１８４】次に、ニューラル・ネットワークの構造決
定の１方法について、ｆ（ｘ）のテイラー展開式を利用
して検討できることを説明する。Next, it will be explained that one method for determining the structure of a neural network can be studied using the Taylor expansion formula of f(x).

【０１８５】階層型ネットワークにより実現される非線
形関数の特性は、層の個数，各層のユニットの個数，重
み係数が変わると変化する。したがって、これらを調整
することにより、目的に適合する特性を持った非線形関
数が得られる。このうち、重み係数の調整は、学習によ
り実現できる。しかしながら、層の個数，各層のユニッ
トの個数の調整は、試行錯誤的に実施している。ここで
は、これらのうち中間層のユニットの個数決定のための
１つの方法を提案する。The characteristics of the nonlinear function realized by the hierarchical network change when the number of layers, the number of units in each layer, and the weighting coefficient change. Therefore, by adjusting these, a nonlinear function with characteristics suitable for the purpose can be obtained. Of these, adjustment of the weighting coefficients can be realized by learning. However, the number of layers and the number of units in each layer are adjusted by trial and error. Here, we propose one method for determining the number of units in the middle layer.

【０１８６】説明を簡単にするために、図１２に示す線
形−非線形−線形構成の１入力１出力系を考える。この
とき、入力ｘと出力ｙの関係は、数６８から導かれ、次
式で表わされる。To simplify the explanation, consider a one-input, one-output system with a linear-nonlinear-linear configuration shown in FIG. At this time, the relationship between the input x and the output y is derived from Equation 68 and is expressed by the following equation.

【０１８７】[0187]

【数７４】[Number 74]

【０１８８】数７４を書下すと、次式が得られる。By writing down Equation 74, the following equation is obtained.

【０１８９】[0189]

【数７５】[Number 75]

【０１９０】図１２に示すニューラル・ネットワークで
模擬する関数として、次式で表わされる関数を考える。As a function to be simulated by the neural network shown in FIG. 12, consider a function expressed by the following equation.

【０１９１】[0191]

【数７６】[Number 76]

【０１９２】ここで、ｄｉ　：係数数７５と数７６を一致させるには、次式が成立つ必要が
ある。Here, di : In order to match the number of coefficients 75 and 76, the following equation needs to hold true.

【０１９３】[0193]

【数７７】[Number 77]

【０１９４】数７６で表わされる関数のｘの６次以上の
係数が零（ｄ６＝ｄ７＝ｄ８＝…＝０）の場合、数７７
は、７個の式から成る連立方程式となる。ただし、この
範囲として、入出力関数ｆ（ｘ）で６次以上の係数の影
響が小さい範囲を考える。この連立方程式は、中間層の
ユニットの個数により未知数（重み係数）の個数が変化
し、それにより解決が求まるかどうかが決まる。[0194] When the coefficient of the sixth order or higher of x of the function expressed by Equation 76 is zero (d6=d7=d8=...=0), Equation 77
is a simultaneous equation consisting of seven equations. However, as this range, consider a range in which the influence of coefficients of order 6 or higher is small in the input/output function f(x). In this simultaneous equation, the number of unknowns (weighting coefficients) changes depending on the number of units in the intermediate layer, and this determines whether or not a solution can be found.

【０１９５】[0195]

【表６】[Table 6]

【０１９６】この関数を表６に示す。この表から分かる
よう、中間層のユニットの個数が３個以下の場合は、未
知数（重み係数）の個数が６個以下となり、式の個数よ
り未知数の個数が小さいので解は求まらない。ところが
、中間層のユニットの個数が４個以上の場合は、未知数
（重み係数）の個数が８個以上となり、式の個数より未
知の個数が大きくなり解は求まる。ただ、未知数の個数
と式の個数の差だけ自由度があり、この差の個数分の未
知数を任意に指定できる。このことは、誤差逆伝播学習
アルゴリズムにより重み係数を決定する場合、初期値に
より重み係数の収束値が異なることと対応している。This function is shown in Table 6. As can be seen from this table, when the number of units in the intermediate layer is 3 or less, the number of unknowns (weighting coefficients) is 6 or less, and the number of unknowns is smaller than the number of equations, so a solution cannot be found. However, when the number of units in the intermediate layer is four or more, the number of unknowns (weighting coefficients) is eight or more, and the number of unknowns becomes larger than the number of equations, and a solution can be found. However, there is a degree of freedom equal to the difference between the number of unknowns and the number of equations, and you can arbitrarily specify as many unknowns as this difference. This corresponds to the fact that when weighting coefficients are determined by an error backpropagation learning algorithm, the convergence value of the weighting coefficients differs depending on the initial value.

【０１９７】[0197]

【発明の効果】本発明によれば、入出力関数の級数展開
式を利用して、ニューラル・ネットワークを非線形回帰
式で表わすことにより、ネットワークの入出力間の因果
関係が明瞭になる。According to the present invention, by expressing a neural network as a nonlinear regression equation using a series expansion equation of an input/output function, the causal relationship between the input and output of the network becomes clear.

[Brief explanation of the drawing]

【図１】階層型ニューラル・ネットワークの一例を示す
構成図である。FIG. 1 is a configuration diagram showing an example of a hierarchical neural network.

【図２】階層型ニューラル・ネットワークのユニットの
一例を示す構成図である。FIG. 2 is a configuration diagram showing an example of a unit of a hierarchical neural network.

【図３】本発明の一実施例を示す図である。FIG. 3 is a diagram showing an embodiment of the present invention.

【図４】入出力関数と近似式とを比較したグラフである
。FIG. 4 is a graph comparing input/output functions and approximate expressions.

【図５】他の入出力関数と近似式とを比較したグラフで
ある。FIG. 5 is a graph comparing other input/output functions and approximate expressions.

【図６】他の入出力関数と近似式とを比較したグラフで
ある。FIG. 6 is a graph comparing other input/output functions and approximate expressions.

【図７】２入力１出力の３層型ネットワークの一例を示
す構成図である。FIG. 7 is a configuration diagram showing an example of a three-layer network with two inputs and one output.

【図８】しきい値ユニットを組込んだ２入力１出力の３
層型ネットワークの一例を示す構成図である。[Figure 8] 2-input 1-output 3 with built-in threshold unit
FIG. 1 is a configuration diagram showing an example of a layered network.

【図９】入出力関数と近似式とを比較したグラフである
。FIG. 9 is a graph comparing input/output functions and approximate expressions.

【図１０】２入力１出力の３層型ネットワークの別の例
を示す構成図である。FIG. 10 is a configuration diagram showing another example of a three-layer network with two inputs and one output.

【図１１】２入力１出力の３層型ネットワークの別の例
を示す構成図である。FIG. 11 is a configuration diagram showing another example of a three-layer network with two inputs and one output.

【図１２】１入力１出力の３層型ネットワークの一例を
示す構成図である。FIG. 12 is a configuration diagram showing an example of a three-layer network with one input and one output.

[Explanation of symbols]

ｆ（ｘ）…入出力関数。 f(x)...input/output function.

Claims

[Claims]

Claim 1: An application system of a neural network configured by a hierarchical combination of units modeled on neurons, in which the input/output functions of the units are expanded into a series, and this expansion formula is used to form the network using a nonlinear regression formula. A method for evaluating neural networks characterized by the representation of neural networks.

2. A neural network evaluation method according to claim 1, characterized in that a sigmoidal function is used as an input/output function of the unit.

3. A neural network evaluation method according to claim 1, characterized in that a Taylor expansion is used as a series expansion of the input/output function of the unit.

4. A method for evaluating a neural network according to claim 1, characterized in that a nonlinear regression expression of the neural network is used for analysis of the neural network.

5. A method for evaluating a neural network according to claim 1, characterized in that a nonlinear regression expression of the neural network is used in designing the neural network.

6. The neural network evaluation method according to claim 1, wherein a nonlinear regression expression of the neural network is used to acquire knowledge from the neural network.