JP2018116420A

JP2018116420A - Perturbation learning device and method thereof

Info

Publication number: JP2018116420A
Application number: JP2017006062A
Authority: JP
Inventors: 志栞小仁所; Shiori Konisho; 晃洋鴻野; Akihiro Kono; 岡　宗一; Soichi Oka; 宗一岡
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-01-17
Filing date: 2017-01-17
Publication date: 2018-07-26
Anticipated expiration: 2037-01-17
Also published as: JP6655032B2

Abstract

PROBLEM TO BE SOLVED: To provide a perturbation learning device capable of processing a weight learning at fast speed.SOLUTION: A perturbation learning device includes: a branching unit 10 that outputs not-to-be-learned data X acquired by branching, element by element, learning data containing n number of elements, and to-be-learned data Xacquired by branching, element by element, from the header of the learning data by m times for each j pieces; a weight computing unit 20 that calculates pieces of weight learning data Dto Dby multiplying each element of the not-to-be-learned data X by a corresponding weight coefficient; a perturbation learning unit 30 that calculates j pieces of perturbation weight learning data by multiplying the to-be-learned data Xby a corresponding perturbation weight coefficient; a second branching unit 40 that branches each piece of the weight learning data into j+1 pieces; a coupling unit 50 that calculates a first output value O summing all pieces of the weight learning data branched into j+1 pieces, and j number of second output values Oδacquired by substituting any one of the j+1 pieces of weight learning data by the perturbation weight learning data corresponding to such weight learning data; and a learning unit 60 that repeats a calculation of updating each weight coefficient for each j piece by a unit of m times.SELECTED DRAWING: Figure 1

Description

本発明は、摂動学習の大規模な並列計算を高速に行う摂動学習装置とその方法に関する。 The present invention relates to a perturbation learning apparatus and method for performing large-scale parallel computation of perturbation learning at high speed.

摂動学習は、アナログ演算器用の学習アルゴリズムである。例えばニューラルネットワークのシナプス重み等の学習すべきパラメータを微少量変化させ、誤差関数に及ぼす影響をモニタすることで、パラメータの修正量を算出する方法である。 Perturbation learning is a learning algorithm for an analog computing unit. For example, a parameter correction amount is calculated by changing a parameter to be learned such as a synaptic weight of a neural network by a minute amount and monitoring an influence on an error function.

例えば非特許文献１に同時摂動最適化法が開示されている。また、例えば非特許文献２に多周波振動学習法が開示されている。 For example, Non-Patent Document 1 discloses a simultaneous perturbation optimization method. Further, for example, Non-Patent Document 2 discloses a multi-frequency vibration learning method.

前田裕ほか１名、「同時摂動学習則を用いたニューラルネットワークによる２軸駆動型ロボットアームの追値制御」、電学論Ｃ、123巻、9号、2003年Hiroshi Maeda et al., “Additional control of 2-axis robot arm by neural network using simultaneous perturbation learning rule”, Denki Theory C, Vol. 123, No. 9, 2003 宮尾浩ほか３名、「アラログニューラルネットワークのための多周波振動学習法とそれを適用した学習ＬＳＩ」、電子情報通信学会論文誌Ａ、Vol.179-A、No.4、pp.917-929、1996年4月Hiroshi Miyao et al., "Multi-frequency vibration learning method for neural network and learning LSI using it", IEICE Transactions A, Vol.179-A, No.4, pp.917-929 April 1996

摂動学習を高速に行うためには、重み更新の並列処理が重要になる。非特許文献２に開示された方法では、信号を周波数多重することで並列処理を行う。しかし、帯域やクローストーク等の問題から周波数の多重数は数百が限界である。 In order to perform perturbation learning at high speed, parallel processing of weight update is important. In the method disclosed in Non-Patent Document 2, parallel processing is performed by frequency-multiplexing signals. However, the number of multiplexed frequencies is limited to several hundred due to problems such as bandwidth and crosstalk.

一方、ニューラルネットワークの摂動学習は、ニューロン同士を連結したシナプスの重みを更新することによって行われる。ニューラルネットワークを基本とした機械学習技術の一つであるディープラーニング（深層学習）では、学習データ数及びシナプス数が大規模化し、シナプス数が数億個を越える場合もある。 On the other hand, perturbation learning of a neural network is performed by updating the weight of a synapse that connects neurons. In deep learning, which is one of machine learning technologies based on neural networks, the number of learning data and the number of synapses is large, and the number of synapses may exceed several hundred million.

したがって、従来の周波数多重を用いた数百スケールの並列処理では十分な高速化が期待できないという課題がある。 Therefore, there is a problem that sufficient speedup cannot be expected in the parallel processing of several hundred scales using the conventional frequency multiplexing.

本発明は、この課題に鑑みてなされたものであり、重み学習を高速に処理できる摂動学習装置とその方法を提供することを目的とする。 The present invention has been made in view of this problem, and an object of the present invention is to provide a perturbation learning apparatus and method that can process weight learning at high speed.

本実施形態の一態様に係る摂動学習装置は、ｎ個の要素から成る学習データを要素ごとに分岐した学習非対象データと、前記学習データの先頭からｊ個ずつｍ回に分けて要素ごとに分岐した学習対象データを出力する分岐部と、前記学習非対象データの前記要素ごとに、それぞれ対応する重み係数を乗じて重み学習データを計算する重み演算部と、前記学習対象データに対応する前記重み係数に、該重み係数を補正する摂動係数を加えた摂動重み係数を生成し、該摂動重み係数を、対応する前記学習対象データに乗じてｊ個の摂動重み学習データを計算する摂動重み演算部と、前記重み学習データのそれぞれをｊ＋１個に分岐する第２分岐部と、ｊ＋１個に分岐された重み学習データの全てを合計した第１出力値と、ｊ＋１個の重み学習データの何れか１個を、該重み学習データに対応する前記摂動重み学習データに置き代えて他の全ての重み学習データと合計したｊ個の第２出力値とを計算する結合部と、ｊ個設けられ、前記第１出力値と前記第２出力値からｊ個ごとに前記重み係数をそれぞれ更新する計算を、前記ｍ回の単位で繰り返す学習部とを備えることを要旨とする。 The perturbation learning device according to an aspect of the present embodiment includes learning non-target data obtained by branching learning data composed of n elements for each element and j elements from the beginning of the learning data divided into m times for each element. A branch unit that outputs the branched learning target data, a weight calculation unit that calculates weight learning data by multiplying a corresponding weighting factor for each element of the learning non-target data, and the learning unit data corresponding to the learning target data A perturbation weight calculation that generates a perturbation weight coefficient by adding a perturbation coefficient that corrects the weight coefficient to the weight coefficient, and multiplies the corresponding learning object data by the perturbation weight coefficient. , A second branching unit for branching each of the weight learning data into j + 1 pieces, a first output value obtained by summing all of the j + 1 pieces of weight learning data, and j + 1 weight learning data. A combining unit that calculates j second output values that are totaled with all other weight learning data by replacing any one of the above with the perturbation weight learning data corresponding to the weight learning data; The gist is provided with a learning unit that is provided and repeats the calculation of updating the weighting coefficient for every j pieces from the first output value and the second output value in units of m times.

本実施形態の他の態様に係る摂動学習装置は、ｎ個の波長を含む波長多重光を、ｊ＋１個の波長を含む多波長光と、該多波長光に含まれない波長の単波長光とに分岐し、前記多波長光にｎ個の学習データの先頭からｊ個ずつの前記学習データをｍ回に分けて付与した学習対象信号と、前記単波長光に対応する重みを付与した学習非対象信号とを生成する学習データ生成部と、前記学習対象信号を、波長の異なるｊ個の単波長光と、それ以外の波長を含むｊ個の多波長光とに２分岐する光分岐部と、ｊ個の前記多波長光のそれぞれに、対応する重み係数を乗じて第１重み学習データを計算する第１重み演算部と、ｊ個の前記単波長光のそれぞれに対応する重み係数に、該重み係数を補正する摂動係数を加えた摂動重み係数を生成し、該摂動重み係数を、対応する前記単波長光に乗じて摂動重み学習データを計算する摂動重み演算部と、前記学習非対象信号のそれぞれに、対応する重み係数を乗じて第２重み学習データを計算する第２重み演算部と、ｊ個の前記第１重み学習データとｊ個の摂動重み学習データを入力とし、ｊ個の前記第１重み学習データを全て結合した第１結合値と、ｊ個の前記第１重み学習データの何れか１個を、該第１重み学習データに対応する前記摂動重み学習データに置き代えて他の全ての第１重み学習データと結合したｊ個の第２結合値とを出力する波長周回器と、前記第１結合値とｎ−ｊ個の前記第２重み学習データを結合した第１出力値と、ｊ個の前記第２結合値のそれぞれとｎ−ｊ個の前記第２重み学習データを結合した第２出力値を生成する光結合器と、ｊ個設けられ、前記第１出力値と前記第２出力値からｊ個ごとに前記重み係数をそれぞれ更新する計算を、前記ｍ回の単位で繰り返す学習部とを備えることを要旨とする。 The perturbation learning device according to another aspect of the present embodiment includes wavelength multiplexed light including n wavelengths, multi-wavelength light including j + 1 wavelengths, and single-wavelength light having a wavelength not included in the multi-wavelength light. A learning target signal obtained by dividing the multi-wavelength light by j pieces of learning data from the beginning of n pieces of learning data and giving a weight corresponding to the single wavelength light. A learning data generation unit that generates a target signal; and an optical branching unit that splits the learning target signal into two pieces of j single-wavelength light having different wavelengths and j multi-wavelength light including other wavelengths; , A first weight calculator that calculates first weight learning data by multiplying each of the j multi-wavelength lights by a corresponding weight coefficient, and a weight coefficient corresponding to each of the j single-wavelength lights, Generating a perturbation weighting coefficient by adding a perturbation coefficient for correcting the weighting coefficient; A perturbation weight calculator that calculates the perturbation weight learning data by multiplying the corresponding single wavelength light by a number, and a second weight learning data that multiplies each of the learning non-target signals by a corresponding weight coefficient. A two-weight operation unit, j first weight learning data and j perturbation weight learning data as inputs, and a first combined value obtained by combining all the j first weight learning data; J second combination values in which any one of the first weight learning data is replaced with the perturbation weight learning data corresponding to the first weight learning data and combined with all the other first weight learning data; , A first output value obtained by combining the first combined value and nj second weight learning data, and each of the j second combined values and nj Optical coupling for generating a second output value by coupling the second weight learning data And j learning units that repeat the calculation for updating the weighting factor for each j from the first output value and the second output value in units of m times. .

本実施形態の一態様に係る摂動学習方法は、上記の摂動学習装置が行う摂動学習方法であって、分岐部が、ｎ個の要素から成る学習データを要素ごとに分岐した学習非対象データと、前記学習データの先頭からｊ個ずつｍ回に分けて要素ごとに分岐した学習対象データを出力し、重み演算部が、前記学習非対象データの前記要素ごとに、それぞれ対応する重み係数を乗じて重み学習データを計算し、摂動重み演算部が、前記学習対象データに対応する前記重み係数に、該重み係数を補正する摂動係数を加えた摂動重み係数を生成し、該摂動重み係数を、対応する前記学習対象データに乗じてｊ個の摂動重み学習データを計算し、第２分岐部が、前記重み学習データのそれぞれをｊ＋１個に分岐し、結合部が、ｊ＋１個に分岐された重み学習データの全てを合計した第１出力値と、ｊ＋１個の重み学習データの何れか１個を、該重み学習データに対応する前記摂動重み学習データに置き代えて他の全ての重み学習データと合計したｊ個の第２出力値とを計算し、ｊ個設けられた学習部が、前記第１出力値と前記第２出力値からｊ個ごとに前記重み係数をそれぞれ更新する計算を、前記ｍ回の単位で繰り返すことを要旨とする。 A perturbation learning method according to an aspect of the present embodiment is a perturbation learning method performed by the perturbation learning device, wherein the branching unit includes learning non-target data obtained by branching learning data including n elements for each element. , Output learning target data that is divided into m pieces j times from the beginning of the learning data, and the weight calculation unit multiplies the corresponding weight coefficient for each element of the learning non-target data. The weight learning data is calculated, and a perturbation weight calculation unit generates a perturbation weight coefficient obtained by adding a perturbation coefficient for correcting the weight coefficient to the weight coefficient corresponding to the learning target data, and the perturbation weight coefficient is Multiplying the corresponding learning target data to calculate j perturbation weight learning data, the second branching unit branches each of the weighted learning data into j + 1 pieces, and the combining unit branches into j + 1 weights Learning The first output value obtained by summing all of the data and j + 1 weight learning data is replaced with the perturbation weight learning data corresponding to the weight learning data and summed with all other weight learning data. The j second output values are calculated, and the j learning units update the weighting coefficient for each j from the first output value and the second output value. The gist is to repeat it in units of times.

本発明によれば、重み学習を高速に処理できる摂動学習装置とその方法を提供することができる。 According to the present invention, it is possible to provide a perturbation learning apparatus and method that can process weight learning at high speed.

本発明の第１実施形態に係る摂動学習装置の機能構成例を示す図である。It is a figure which shows the function structural example of the perturbation learning apparatus which concerns on 1st Embodiment of this invention. 本実施形態に係る摂動学習装置の動作フローを示す図である。It is a figure which shows the operation | movement flow of the perturbation learning apparatus which concerns on this embodiment. 図１に示す摂動学習装置の分岐部〜結合部の間の接続関係を示す図である。It is a figure which shows the connection relationship between the branch part-coupling | bond part of the perturbation learning apparatus shown in FIG. 第１出力値と第２出力値の例を示す図である。It is a figure which shows the example of a 1st output value and a 2nd output value. 図１に示す学習部のより具体的な構成例を示す図である。It is a figure which shows the more specific structural example of the learning part shown in FIG. 図５に示す学習部の変形例を示す図である。It is a figure which shows the modification of the learning part shown in FIG. 本発明の第２実施形態に係る摂動学習装置の機能構成例を示す図である。It is a figure which shows the function structural example of the perturbation learning apparatus which concerns on 2nd Embodiment of this invention. 図７に示す学習データ生成部のより具体的な構成例を示す図である。It is a figure which shows the more specific structural example of the learning data generation part shown in FIG. 図７に示す分岐部のより具体的な構成例を示す図である。It is a figure which shows the more specific structural example of the branch part shown in FIG. 図７に示す波長周回器の作用を模式的に示す図である。It is a figure which shows typically the effect | action of the wavelength circulator shown in FIG. 図７に示す光結合部を構成する光結合器と光分岐器の作用を模式的に示す図である。It is a figure which shows typically the effect | action of the optical coupler and optical splitter which comprise the optical coupling part shown in FIG. 図７に示す光結合部を構成する光結合器の作用を模式的に示す図である。It is a figure which shows typically the effect | action of the optical coupler which comprises the optical coupling part shown in FIG. 比較例との比較結果を示す図であり、（ａ）はフィルタの消費電力を示す、図（ｂ）は構成部品点数を示す図である。It is a figure which shows the comparison result with a comparative example, (a) shows the power consumption of a filter, FIG.5 (b) is a figure which shows the number of components.

以下、本発明の実施形態について図面を用いて説明する。複数の図面中同一のものに
は同じ参照符号を付し、説明は繰り返さない。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. The same reference numerals are given to the same components in a plurality of drawings, and the description will not be repeated.

〔第１実施形態〕
図１に、第１実施形態に係る摂動学習装置１の機能構成例を示す。図２に、摂動学習装置１の動作フローを示す。 [First Embodiment]
FIG. 1 shows a functional configuration example of the perturbation learning device 1 according to the first embodiment. FIG. 2 shows an operation flow of the perturbation learning device 1.

摂動学習装置１は、分岐部１０、重み演算部２０、摂動重み演算部３０、第２分岐部４０、結合部５０、及び学習部６０を備える。摂動学習装置１は、例えば階層型ニューラルネットワークの学習演算を高速に処理する。 The perturbation learning device 1 includes a branching unit 10, a weight calculation unit 20, a perturbation weight calculation unit 30, a second branching unit 40, a combining unit 50, and a learning unit 60. The perturbation learning apparatus 1 processes, for example, a learning operation of a hierarchical neural network at high speed.

分岐部１０は、ｎ個の要素から成る学習データを要素ごとに分岐した学習非対象データＸと、該学習データの先頭からｊ個ずつｍ回に分けて要素ごとに分岐した学習対象データＸ_ｊを出力する（ステップＳ１）。学習非対象データＸは、学習する対象が画像であるならば、ピクセルに対応するｎ個の成分を持つベクトルである（式（１））。 The branching unit 10 includes learning non-target data X obtained by branching learning data composed of n elements for each element, and learning target data X _j branched for each element j times from the beginning of the learning data. Is output (step S1). The learning non-target data X is a vector having n components corresponding to pixels if the learning target is an image (formula (1)).

重み演算部２０は、学習非対象データＸの要素ごとに、それぞれ対応する重み係数を乗じて重み学習データＤ_１〜Ｄ_ｎを計算する（ステップＳ２）。重み演算部２０は、学習非対象データＸに対して連結されるニューロン値を算出する。ニューロン同士の連結の強さを表すシナプスの重みとは、学習非対象データＸと同様にｎ個の成分を持つベクトルである（式（２））。 The weight calculator 20 calculates the weight learning data D _{1 to} D _n by multiplying the corresponding weight coefficient for each element of the learning non-target data X (step S2). The weight calculator 20 calculates a neuron value to be connected to the learning non-target data X. The synaptic weight representing the strength of connection between neurons is a vector having n components as in the learning non-target data X (formula (2)).

摂動重み演算部３０は、学習対象データＸ_ｊに対応する重み係数に、該重み係数を補正する摂動係数δを加えた摂動重み係数を生成し、該摂動重み係数を、対応する学習対象データＸ_ｊに乗じてｊ個の摂動重み学習データＤδ_ｉを計算する（ステップＳ３）。摂動重み係数は、ｊ個の成分を持つベクトルである（式（３））。 The perturbation weight calculation unit 30 generates a perturbation weight coefficient obtained by adding a perturbation coefficient δ for correcting the weight coefficient to a weight coefficient corresponding to the learning target data X _j , and the perturbation weight coefficient is converted to the corresponding learning target data X _By multiplying _j , j perturbation weight learning data Dδ _i are calculated (step S3). The perturbation weight coefficient is a vector having j components (formula (3)).

第２分岐部４０は、重み学習データＤ_１〜Ｄ_ｎのそれぞれをｊ＋１個に分岐する（ステップＳ４）。つまり、第２分岐部４０は、１個の重み学習データＤ_１をＪ＋１個のＤ_１′に分岐する。同様に重み学習データＤ_ｉ、及び重み学習データＤ_ｎを、Ｊ＋１個のＤ_ｉ′,Ｄ_ｎ′に分岐する。 The second branching unit 40 branches each of the weight learning data D _{1 to} D _n into j + 1 pieces (step S4). That is, the second branching unit 40 branches one weight learning data D ₁ into J + 1 D ₁ ′. Similarly, the weight learning data D _i and the weight learning data D _n are branched into J + 1 D _i ′ and D _n ′.

結合部５０は、第２分岐部４０でｊ＋１個に分岐された重み学習データＤ_１′〜Ｄ_ｎ′の全てを合計した第１出力値Ｏ（式（４））と、ｊ＋１個の重み学習データＤ_１′〜Ｄ_ｎ′の何れか１個を、該重み学習データＤ_１′〜Ｄ_ｎ′に対応する摂動重み学習データＤδ_１〜Ｄδ_ｎに置き代えて他の全ての重み学習データと合計したｊ個の第２出力値Ｏδ_１〜Ｏδ_ｊ（式（５））とを計算する（ステップＳ５）。 The combining unit 50 includes a first output value O (equation (4)) obtained by summing all of the weight learning data D ₁ ′ to D _n ′ branched to j + 1 by the second branching unit 40 and j + 1 weight learning. one either data _D 1 '~D _n', and all other weights training data instead placed perturbation weight learning data Dδ ₁ ~Dδ _n corresponding to heavy saw training data _D 1 'to D _n' The total j second output values Oδ _{1 to} Oδ _j (formula (5)) are calculated (step S5).

学習部６０は、摂動重み演算部３０で同時に計算する重み係数と同じ数のｊ個設けられ、第１出力値Ｏと第２出力値Ｏδ_ｉからｊ個ごとに重み係数ｗ_ｉを更新する計算をｍ回の単位で繰り返す（ステップＳ６）。更新する重み係数ｗ_{ｉ＿ｎｅｗ}を求める計算は、その値が収束するまで繰り返される（ステップＳ７のＮｏ）。 The learning unit 60 is provided with the same number of j as the weighting coefficients simultaneously calculated by the perturbation weight calculating unit 30 and updates the weighting factor w _i every j from the first output value O and the second output value Oδ _i. Is repeated in units of m times (step S6). The calculation for _obtaining the weighting factor w _{i_new} to be updated is repeated until the value converges (No in step S7).

重み係数ｗ_{ｉ＿ｎｅｗ}が収束すると、重み係数は収束した値に更新される（ステップＳ８）。以上説明したステップＳ１〜Ｓ８の処理は、ｍ回の単位で繰り返される。 When the weight coefficient _{wi_new} converges, the weight coefficient is updated to the converged value (step S8). The processes in steps S1 to S8 described above are repeated in units of m times.

以上説明した本実施形態の摂動学習装置１によれば、重み更新の計算を並列に処理するので重み学習を高速に処理できる。また、一度に学習する学習データの数をｊ個に限定するので、摂動学習装置１の構成部品点数を削減することもできる。以降、図面を参照して各機能構成部を更に詳しく説明する。 According to the perturbation learning device 1 of the present embodiment described above, the weight update calculation is processed in parallel, so that the weight learning can be processed at high speed. Further, since the number of learning data to be learned at a time is limited to j, the number of component parts of the perturbation learning device 1 can be reduced. Hereinafter, each functional component will be described in more detail with reference to the drawings.

（分岐部）
図３に、分岐部１０〜結合部５０のより具体的な接続例を示す。分岐部１０は、ｎ個の光分波器１０_１〜１０_ｎと、マルチプレクサ１１とで構成される。 (Branch part)
In FIG. 3, the more specific example of a connection of the branch part 10-the coupling | bond part 50 is shown. The branching unit 10 includes n optical demultiplexers 10 ₁ to 10 _n and a multiplexer 11.

光分波器１０_１〜１０_ｎは、ｎ個の要素から成る学習データを、それぞれ２分岐し、一方のＡ_１〜Ａ_ｎを第２分岐部４０に出力し、他方のＡ_１〜Ａ_ｎをマルチプレクサ１１に出力する。マルチプレクサ１１は、学習部から与えられるｍの値に応じてｍｊ＋１〜（ｍ＋１）ｊの範囲の学習データＡ_ｍｊ＋１〜Ａ_{（ｍ＋１）ｊ}を選択して摂動重み演算部３０に出力する。 The optical demultiplexers 10 ₁ to 10 _n each branch the learning data composed of n elements into two parts, and output _one A _{1 to} _An to the second branch unit 40 and the other A _{1 to} _An. Is output to the multiplexer 11. The multiplexer 11 selects learning data A _{mj + 1 to} A _{(m + 1) j} in the range of mj + 1 to (m + 1) j according to the value of m given from the learning unit, and outputs it to the perturbation weight calculation unit 30.

例えば、ｎ＝14、ｊ＝5、とした場合、学習対象データＸ_ｊは、Ｘ_ｊ＝ｘ_１，ｘ_２，ｘ_３，ｘ_４，ｘ_５、Ｘ_ｊ＝ｘ_６，ｘ_７，ｘ_８，ｘ_９，ｘ_１０、Ｘ_ｊ＝ｘ_１１，ｘ_１２，ｘ_１３，ｘ_１４、が3回に分けて出力される。ｍは0以上の整数であり、Ｘ_ｊは次式で表せる。 For example, when n = 14 and j = 5, the learning target data X _j is X _j = x ₁ , x ₂ , x ₃ , x ₄ , x ₅ , X _j = x ₆ , x ₇ , x _8. , X ₉ , x ₁₀ , X _j = x ₁₁ , x ₁₂ , x ₁₃ , x ₁₄ are output in three times. m is an integer of 0 or more, and X _j can be expressed by the following equation.

光分波器１０_１〜１０_ｎは、例えば一方のＡ_１〜Ａ_ｎの光強度をｊ＋1分のｊとし他方のＡ_ｍｊ＋１〜Ａ_{（ｍ＋１）ｊ}の光強度をｊ＋１分の１とする。これは、後段の重み演算部２０が、学習対象データＸ_ｊが生成される回数、学習データＸを重複して参照することによる。 Optical demultiplexer ₁₀ 1 to 10 _n, for example the light intensity of one _A 1 to A _n and j + 1 minute j other _{_{A mj + 1 ~A (m +}} 1) the light intensity of _j to 1 j + 1 minute. This is because the weight calculation unit 20 in the subsequent stage refers to the learning data X redundantly, the number of times the learning target data X _j is generated.

本実施形態では、学習対象とする学習データの数をｊ個に減らすことで、構成部品点数を削減している。構成部品点数の削減数の具体例については後述する。 In the present embodiment, the number of component parts is reduced by reducing the number of learning data to be learned to j. A specific example of the reduced number of component parts will be described later.

（重み演算部）
重み演算部２０は、例えば液晶を用いたｎ個の光減衰フィルタ２０_０〜２０_ｎで構成することができる。その場合、重みｗ_ｉは、光減衰フィルタの透過率によって与えられる。重み演算部２０に入力された学習非対象データＡ_１〜Ａ_ｎに対し、光減衰フィルタの透過率によって学習データＸの各々を、ｗ_１,ｗ_２,…,ｗ_ｉ,…,ｗ_ｎ倍に変換する。 (Weight calculation part)
The weight calculation unit 20 can be composed of, for example, n light attenuation filters 20 _{0 to} 20 _n using liquid crystal. In that case, the weight w _i is given by the transmittance of the light attenuation filter. Learning is input to the weight calculating unit 20 with respect to non-object data _A 1 to A _n, each of the learning data X by the transmittance of the light attenuating _{_{filter, w 1, w 2, ...}} , w i, ..., w n times Convert to

重み演算部２０の出力する重み学習データは、学習データＸと重みｗ_ｉの積演算値であり、Ｄ_ｉと表記する（式（７））。ｉは、１〜ｎの整数である。 Weight learning data output from the weight calculation section 20 is a product operation value of the learning data X and the weight w _i, is denoted as D _i (Equation (7)). i is an integer of 1 to n.

重み演算部２０として光減衰フィルタを用いる場合、電気光学（ＥＯ）変調器、音響光学（ＡＯ）変調器、ＭＥＭＳミラーを利用できる。光減衰フィルタの透過率の値は、入力される信号の光強度による影響を受けない。よって、透過率の制御が比較的容易であり、与えたい透過率と実際に与えた透過率の差を小さくして計算誤差を小さくすることができる。また、光減衰フィルタの透過率は、1以上であることは原理的に不可能である。そこでｗ_ｉ＞1の場合、変換係数ｑ＞1を用いて1＞ｗｉ/ｑ=Ｗｉ′＞1/qに変換すれば良い。具体的には、後段の学習部６０で信号レベルをｑ倍に変換すれば良い。 When an optical attenuation filter is used as the weight calculation unit 20, an electro-optic (EO) modulator, an acousto-optic (AO) modulator, or a MEMS mirror can be used. The transmittance value of the light attenuation filter is not affected by the light intensity of the input signal. Therefore, the transmittance can be controlled relatively easily, and the difference between the desired transmittance and the actually applied transmittance can be reduced to reduce the calculation error. In addition, it is impossible in principle that the transmittance of the light attenuation filter is 1 or more. Therefore, when w _i > 1, the conversion coefficient q> 1 may be used to convert 1> wi / q = Wi ′> 1 / q. Specifically, the signal level may be converted to q times by the learning unit 60 in the subsequent stage.

また、重み演算部２０は、光増幅器で構成しても良い。その場合、半導体光増幅器（ＳＯＡ）を利用できる。光増幅器では、微小な光信号を増幅して、観測可能な光強度を出力する。よって、ニューラルネットワークの規模が大きくなり、ｎが増加し、学習データの要素へ与えられる光強度が微小になったとしても、学習データＸと重みｗ_ｉの積演算値の計算が可能である。また、光増幅器の増幅率は1以下であることは原理的に不可能である。そこでｗ_ｉ＜1の場合、変換係数ｑ′＜1を用いる。この場合の信号レベルの変換は、上記の光減衰フィルタを用いた場合と同じである。 Further, the weight calculation unit 20 may be configured with an optical amplifier. In that case, a semiconductor optical amplifier (SOA) can be used. The optical amplifier amplifies a minute optical signal and outputs an observable light intensity. Therefore, even if the scale of the neural network increases, n increases, and the light intensity given to the elements of the learning data becomes minute, the product operation value of the learning data X and the weight w _i can be calculated. In addition, it is impossible in principle that the amplification factor of the optical amplifier is 1 or less. Therefore, when w _i <1, the conversion coefficient q ′ <1 is used. The signal level conversion in this case is the same as when the above-described optical attenuation filter is used.

（摂動重み演算部）
摂動重み演算部３０は、入力された学習対象データＸ_ｊに対して摂動重み係数を乗じて、摂動重み学習データＤδ_iを計算する（式（８））。摂動重み係数は、重み係数に、該重み係数を補正する摂動係数δを加えたものである。摂動重み演算部３０は、重み演算部２０と同様に例えば光減衰フィルタ、又は光増幅器で構成できる。 (Perturbation weight calculator)
The perturbation weight calculation unit 30 multiplies the input learning target data X _j by a perturbation weight coefficient to calculate perturbation weight learning data Dδ _i (formula (8)). The perturbation weight coefficient is obtained by adding a perturbation coefficient δ for correcting the weight coefficient to the weight coefficient. The perturbation weight calculation unit 30 can be configured by, for example, an optical attenuation filter or an optical amplifier, similarly to the weight calculation unit 20.

摂動重み演算部３０は、ｍの値に応じた摂動重み学習データＤδ_iを計算する。例えば、ｎ＝14、ｊ＝5、とした場合、ｍ=0では、学習対象データｘ_１〜ｘ_５のそれぞれに、摂動重み係数ｗ_１＋δ〜ｗ_５＋δを乗じた摂動重み学習データＤδ_１〜Ｄδ_５を計算する。ｍ=1では、学習対象データｘ_６〜ｘ_１０のそれぞれに、摂動重み係数ｗ_６＋δ〜ｗ_１０＋δを乗じた摂動重み学習データＤδ_６〜Ｄδ_１０を計算する。ｍ=2では、学習対象データｘ_１１〜ｘ_１４のそれぞれに、摂動重み係数ｗ_１１＋δ〜ｗ_１４＋δを乗じた摂動重み学習データＤδ_１１〜Ｄδ_１４を計算する。 The perturbation weight calculator 30 calculates perturbation weight learning data Dδ _i according to the value of m. For example, when n = 14 and j = 5, when m = 0, the perturbation weight learning data Dδ ₁ obtained by multiplying the learning target data x _{1 to} x ₅ by the perturbation weight coefficients w ₁ + δ to w ₅ + δ, respectively. to calculate the ~Dδ _5. When m = 1, perturbation weight learning data Dδ _{6 to} Dδ ₁₀ are calculated by multiplying the learning target data x _{6 to} x ₁₀ by the perturbation weight coefficients w ₆ + δ to w ₁₀ + δ. When m = 2, perturbation weight learning data Dδ _{11 to} Dδ ₁₄ are calculated by multiplying the learning target data x _{11 to} x ₁₄ by the perturbation weight coefficients w ₁₁ + δ to w ₁₄ + δ.

（第２分岐部）
第２分岐部４０は、重み学習データＤ_＊のそれぞれをｊ＋１個に分岐する（ステップＳ４）。第２分岐部４０は、重み学習データＤ_１をＪ＋１個のＤ_１′に分岐する。同様に重み学習データＤ_ｉ、及び重み学習データＤ_ｎを、Ｊ＋１個のＤ_ｉ′,Ｄ_ｎ′に分岐する。 (Second branch)
The second branching unit 40 branches each of the weight learning data D _* into j + 1 pieces (step S4). The second branching unit 40 branches the weight learning data D ₁ into J + 1 D ₁ ′. Similarly, the weight learning data D _i and the weight learning data D _n are branched into J + 1 D _i ′ and D _n ′.

第２分岐部４０は、分岐部１０と同様に光分波器で構成することができる。光分波器としては、石英平面光導波路（ＰＬＣ）、フレキシブルポリマー光導波路、及び融着延伸型ファイバーカプラ等を用いることができる。 The second branching unit 40 can be configured by an optical demultiplexer as with the branching unit 10. As the optical demultiplexer, a quartz planar optical waveguide (PLC), a flexible polymer optical waveguide, a fusion-stretch fiber coupler, or the like can be used.

（結合部）
結合部５０は、第２分岐部４０でｊ＋１個に分岐された重み学習データＤ_１′〜Ｄ_ｎ′の全てを合計した第１出力値Ｏ（式（４））と、ｊ＋１個の重み学習データＤ_＊の何れか１個を、該重み学習データＤ_＊に対応する摂動重み学習データＤδ_＊に置き代えて他の全ての重み学習データＤ_＊と合計したｊ個の第２出力値Ｏδ_＊（式（５））とを計算する。 (Joining part)
The combining unit 50 includes a first output value O (equation (4)) obtained by summing all of the weight learning data D ₁ ′ to D _n ′ branched to j + 1 by the second branching unit 40 and j + 1 weight learning. one one of the data D _*, heavy unlearned data D _* corresponding to perturbation weight learning data d? _* every place of the j-number of the sum of all other weighting the learning data D _* second output value Oderuta _* (Equation (5)) is calculated.

図４に、第１出力値Ｏと第２出力値Ｏδ_＊の例を示す。図４は、学習データＸの要素数ｎをｎ=14、一度に学習する学習対象データの数ｊをｊ=5とした場合を示す。 FIG. 4 shows an example of the first output value O and the second output value Oδ _* . FIG. 4 shows a case where the number n of elements of learning data X is n = 14 and the number j of learning object data to be learned at a time is j = 5.

この例の場合、結合部５０は、第１出力値Ｏと第２出力値Ｏδ_１〜Ｏδ_５を３回に分けて出力する。３回に分けて出力する第１出力値Ｏは、全て同じ値である。 In the case of this example, the combining unit 50 outputs the first output value O and the second output values Oδ _{1 to} Oδ ₅ in three times. The first output values O that are output in three steps are all the same value.

一方、第２出力値Ｏδ_ｉは、３回に分けて出力される値が全て異なる。１回目のＯδ_１は、Ｄ_１（ｘ_１ｗ_１）がＤδ_１に置き代えられ、他の全ての重み学習データと合計した値である。１回目のＯδ_２は、Ｄ_２（ｘ₂ｗ₂）がＤδ_２に置き代えられる。同様に、１回目のＯδ_５は、Ｄ_５（ｘ₅ｗ₅）がＤδ_５に置き代えられて他の重み学習データと合計した値である。 On the other hand, the second output value Oδ _i is different in all the values output in three steps. The first Oδ ₁ is a value obtained by replacing D ₁ (x ₁ w ₁ ) with Dδ ₁ and adding all the other weight learning data. In the first Oδ ₂ , D ₂ (x ₂ w ₂ ) is replaced with Dδ ₂ . Similarly, the first Oδ ₅ is a value obtained by replacing D ₅ (x ₅ w ₅ ) with Dδ ₅ and adding it to other weight learning data.

２回目のＯδ_１は、Ｄ₆（ｘ₆ｗ₆）がＤδ₆に置き代えられ、他の全ての重み学習データと合計した値である。２回目のＯδ_２は、Ｄ₇（ｘ₇ｗ₇）がＤδ₇に置き代えられる。同様に、２回目のＯδ_５は、Ｄ₁₀（ｘ₁₀ｗ₁₀）がＤδ₁₀に置き代えられて他の重み学習データと合計した値である。 The second Oδ ₁ is a value obtained by replacing D ₆ (x ₆ w ₆ ) with Dδ ₆ and adding all the other weight learning data. In the second Oδ ₂ , D ₇ (x ₇ w ₇ ) is replaced with Dδ ₇ . Similarly, the second Oδ ₅ is a value obtained by adding D ₁₀ (x ₁₀ w ₁₀ ) to Dδ ₁₀ and adding it to other weight learning data.

３回目のＯδ_１は、Ｄ₁₁（ｘ₁₁ｗ₁₁）がＤδ₁₁に置き代えられ、他の全ての重み学習データと合計した値である。３回目のＯδ_２は、Ｄ₁₂（ｘ₁₂ｗ₁₂）がＤδ₁₂に置き代えられる。同様に、３回目のＯδ_５は、Ｄ₁₅（ｘ₁₅ｗ₁₅）がこの例では存在しないので出力なしである。 The third Oδ ₁ is a value obtained by replacing D ₁₁ (x ₁₁ w ₁₁ ) with Dδ ₁₁ and adding all the other weight learning data. In the third Oδ ₂ , D ₁₂ (x ₁₂ w ₁₂ ) is replaced with Dδ ₁₂ . Similarly, the third Oδ ₅ has no output because D ₁₅ (x ₁₅ w ₁₅ ) does not exist in this example.

（学習部）
図５に、学習部６０_ｉのより具体的な構成例を示す。学習部６０_ｉは、例えばアナログ電気回路で構成することができる。 (Learning Department)
FIG. 5 shows a more specific configuration example of the learning unit 60 _i . The learning unit 60 _i can be configured by an analog electric circuit, for example.

学習部６０_ｉは、３個の差動増幅器６１_ｉ，６２_ｉ，６３_ｉ,６５_ｉ、及び計算部６４_ｉを備える。学習部６０_ｉは、摂動重み演算部３０を構成する例えば光減衰フィルタと同じ数だけ設けられる。 The learning unit 60 _i includes three differential amplifiers 61 _i , 62 _i , 63 _i , 65 _i , and a calculation unit 64 _i . The learning units 60 _i are provided in the same number as, for example, the optical attenuation filters constituting the perturbation weight calculation unit 30.

差動増幅器６１_ｉは、結合部５０が出力する第１出力値Ｏと、外部から入力される学習目標を表す教師値Ｔとの差分である第１コスト値Ｅ（Ｏ−Ｔ）を算出する。差動増幅器６２_ｉは、結合部５０が出力する第２出力値Ｏ_δｉと、教師値Ｔとの差分である第２コスト値Ｅ_δｉを算出する。差動増幅器４３_ｉは、第１コスト値Ｅと第２コスト値Ｅ_δｉの差分Ｅ−Ｅ_δｉを算出する。 The differential amplifier 61 _i calculates a first cost value E (OT) that is a difference between the first output value O output from the combining unit 50 and the teacher value T that represents the learning target input from the outside. . The differential amplifier 62 _i calculates a second cost value E _δi that is a difference between the second output value O _δi output from the combining unit 50 and the teacher value T. Differential amplifier 43 _i calculates a first cost value E and the difference E-E _.delta.i the second cost value E _.delta.i.

計算部６４_ｉは、外部から入力される強調重みｗ_ｉを補正する摂動係数δと学習速度の傾きを表す学習係数η、及び差動増幅器６３_ｉが出力する差分Ｅ−Ｅ_δｉを入力として、重みｗ_ｉを更新する変化量Δｗ_{ｉ＿ｎｅｗ}を次式で計算する。 The calculation unit 64 _i receives, as inputs, a perturbation coefficient δ that corrects an emphasis weight w _i that is input from the outside, a learning coefficient η that represents the gradient of the learning speed, and the difference EE _δi that is output from the differential amplifier 63 _i . A change amount Δw _{i_new} for updating the weight w _i is calculated by the following equation.

差動増幅器６５_ｉは、１回前の重みｗ_{ｉ＿ｏｌｄ}に変化量Δｗ_{ｊ＿ｎｅｗ}を加えて更新した重みｗ_{ｉ＿ｎｅｗ}を出力する。 Differential amplifier 65 _i outputs the weights _{w i - new} updating by adding the change amount [Delta] w _{J_new} once before the weight _{w i_old.}

なお、図６に示すように差動増幅器６１_ｉと６３_ｉの間に第１乗算器６６_ｉ、及び差動増幅器６２_ｉと６３_ｉの間に第２乗算器６７_ｉを設けても良い。第１乗算器６６_ｉは、第１コスト値Ｅ（Ｏ−Ｔ）を正の値に変換する。第１乗算器６６_ｉは、第１コスト値Ｅ（Ｏ−Ｔ）に同じ値を乗ずる二乗演算器で構成しても良いし、第１コスト値Ｅ（Ｏ−Ｔ）の値が負の場合に−１を乗ずる乗算器で構成しても良い。第２乗算器６７_ｉも同様である。 It is also possible to provide a second multiplier 67 _i between the first multiplier 66 _i, and a differential amplifier 62 _i and 63 _i between the differential amplifier 61 _i and 63 _i, as shown in FIG. The first multiplier 66 _i converts the first cost value E (OT) to a positive value. The first multiplier 66 _i may be configured by a square calculator that multiplies the first cost value E (OT) by the same value, or when the first cost value E (OT) is negative. Alternatively, the multiplier may be multiplied by −1. The same applies to the second multiplier 67 _i .

第１乗算器６６_ｉと第２乗算器６７_ｉを備えることで、変化量Δｗ_{ｉ＿ｎｅｗ}の計算精度を高めることができる。 By providing the first multiplier 66 _i and the second multiplier 67 _i , the calculation accuracy of the change amount Δw _{i_new} can be increased.

〔第２実施形態〕
図７に、第２実施形態に係る摂動学習装置２の機能構成例を示す。摂動学習装置２は、波長多重光を用いて、例えばニューラルネットワークの重み学習を行う学習装置である。 [Second Embodiment]
FIG. 7 shows a functional configuration example of the perturbation learning device 2 according to the second embodiment. The perturbation learning device 2 is a learning device that performs weight learning of a neural network, for example, using wavelength multiplexed light.

摂動学習装置２は、学習データ生成部７０、光分岐部１２、第１重み演算部２１、第２重み演算部２２、摂動重み演算部３１、波長周回器８０、光結合部９０、及び学習部６０を備える。 The perturbation learning device 2 includes a learning data generation unit 70, an optical branching unit 12, a first weight calculation unit 21, a second weight calculation unit 22, a perturbation weight calculation unit 31, a wavelength circulator 80, an optical coupling unit 90, and a learning unit. 60.

学習データ生成部７０は、ｎ個の波長を含む波長多重光を、ｊ＋１個の波長を含む多波長光と、該多波長光に含まれない波長ごとの単波長光とに分岐し、多波長光にｎ個の学習データの先頭からｊ個ずつの学習データをｍ回に分けて付与した学習対象信号と、単波長光に対応する重みを付与した学習非対象信号とを生成する。 The learning data generation unit 70 branches the wavelength multiplexed light including n wavelengths into multi-wavelength light including j + 1 wavelengths and single-wavelength light for each wavelength not included in the multi-wavelength light. A learning target signal in which j pieces of learning data from the beginning of the n pieces of learning data are provided to the light divided m times and a learning non-target signal to which a weight corresponding to the single wavelength light is given are generated.

波長多重光は、ｎ個の波長成分を持つ光源を用いて生成する。光源としては、ｎ個の波長成分を持つ光を生成できれば良い。したがって、単波長コヒーレント光源をｎ個用意し、各光源から照射された光を光結合器によって合波して生成して生成しても良い。また、多波長光源を用いても良いし、インコーヒーレント光源を用いても良い。学習データ生成部７０について、詳しくは後述する。 Wavelength multiplexed light is generated using a light source having n wavelength components. As the light source, it is only necessary to generate light having n wavelength components. Therefore, n single-wavelength coherent light sources may be prepared, and the light emitted from each light source may be generated and combined by an optical coupler. In addition, a multi-wavelength light source or an incoherent light source may be used. The learning data generation unit 70 will be described in detail later.

光分岐部１２は、ｊ＋１個の波長を含む学習対象信号を、ある波長のｊ個の単波長光Ｂ_１〜Ｂ_ｊと、それ以外の波長を含むｊ個の多波長光Ａ_１〜Ａ_ｊとに２分岐する。例えば、ｊ個の単波長光の波長は最も短い波長λ_０であり、ｊ個の多波長光は波長λ_１〜λ_ｊの波長を含む。光分岐部１２は、例えば、ＡＷＧ、回折格子、プリズム、及びＷＤＭカプラ等の波長分岐素子で構成できる。 The optical branching unit 12 receives a learning target signal including j + 1 wavelengths from j single-wavelength lights B _{1 to} B _j having a certain wavelength and j multi-wavelength lights A _{1 to} A _j including other wavelengths. Two branches. For example, the wavelength of j single-wavelength lights is the shortest wavelength λ ₀ , and the j multi-wavelength lights include wavelengths λ _{1 to} λ _j . The optical branching unit 12 can be composed of wavelength branching elements such as AWG, diffraction grating, prism, and WDM coupler, for example.

第１重み演算部２１は、ｊ個の多波長光のそれぞれに対応する重み係数を乗じて第１重み学習データＤ_１〜Ｄ_ｊを計算する。第１重み学習データＤ_１〜Ｄ_ｊは、次式で表せる。ここでｉは、１〜ｊの整数である。 The first weight calculation unit 21 calculates the first weight learning data D _{1 to} D _j by multiplying the weight coefficient corresponding to each of the j multi-wavelength lights. The first weight learning data D _{1 to} D _j can be expressed by the following equation. Here, i is an integer of 1 to j.

ここでＡ_１は、λ_１〜λ_ｊの波長を含む多波長光である。よって、例えばＤ_１は、多波長光に重みｗ_ｍｊ＋１を乗じた値である。この作用は、第１実施形態で説明した重み演算部２０と同様に、例えば光減衰フィルタで実現できる。 Here, A ₁ is multi-wavelength light including wavelengths λ _{1 to} λ _j . Thus, for example, D ₁ is a value obtained by multiplying the multi-wavelength light by the weight w _{mj + 1} . This effect can be realized by, for example, an optical attenuation filter, similarly to the weight calculation unit 20 described in the first embodiment.

摂動重み演算部３１は、ｊ個の単波長光のそれぞれに対応する重み係数に、該重み係数を補正する摂動係数δを加えた摂動重み係数を生成し、該摂動重み係数を、対応する単波長光に乗じて摂動重み学習データを計算する。摂動重み学習データＤδ_１〜Ｄδ_ｊは、次式で表せる。ここでＢ_１は、λ_０の波長の単波長光である。 The perturbation weight calculation unit 31 generates a perturbation weight coefficient obtained by adding a perturbation coefficient δ for correcting the weight coefficient to a weight coefficient corresponding to each of the j single-wavelength lights, and the perturbation weight coefficient The perturbation weight learning data is calculated by multiplying the wavelength light. The perturbation weight learning data Dδ _{1 to} Dδ _j can be expressed by the following equations. Here, B ₁ is single wavelength light having a wavelength of λ ₀ .

第２重み演算部２２は、学習非対象信号のそれぞれに、対応する重み係数を乗じて第２重み学習データＤ_ｊ＋１〜Ｄ_ｎを計算する。例えば、第２重み学習データＤ_ｊ＋１は次式で表せる。第２重み学習データＤ_ｎは、式（１２）の添え字がｊ＋１→ｎに変わる。 The second weight calculation unit 22 calculates second weight learning data D _{j + 1 to} D _n by multiplying each learning non-target signal by a corresponding weight coefficient. For example, the second weight learning data D _{j + 1} can be expressed by the following equation. In the second weight learning data D _n , the subscript of Expression (12) changes from j + 1 → n.

波長周回器８０は、ｊ個の第１重み学習データＤ_１〜Ｄ_ｊとｊ個の摂動重み学習データＤδ_１〜Ｄδ_ｊを入力とし、ｊ個の第１重み学習データＤ_１〜Ｄ_ｊを全て結合した第１結合値Ｄ１と、ｊ個の第１重み学習データＤ_１〜Ｄ_ｊの何れか１個を、該第１重み学習データＤ_１〜Ｄ_ｊに対応する摂動重み学習データＤδ_１〜Ｄδ_ｊに置き代えて他の全ての第１重み学習データと結合したｊ個の第２結合値Ｄ２_１〜Ｄ２_ｊとを出力する。 The wavelength circulator 80 receives j pieces of first weight learning data D _{1 to} D _j and j pieces of perturbation weight learning data Dδ _{1 to} Dδ _j, and inputs the j pieces of first weight learning data D _{1 to} D _j . a first coupling value D1 which all bonds, one or of the j first weight learning data _D 1 to D _j, perturbation weight learning data d? ₁ corresponding to the first weight learning data _D 1 to D _j ˜Dδ _j are output, and _j second combined values D2 _{1 to} D2 j combined with all the other first weight learning data are output.

波長周回器８０は、例えばＡＷＧで構成できる。波長周回器８０は、多波長光りが入力されると、多波長光を波長ごとに分岐して、入力ポートから波長の整数倍離れた距離の出力ポートに出力し、多波長光を生成する作用をする。この作用を用いて和演算する積演算値を選択する。波長周回器８０について、詳しくは後述する。 The wavelength circulator 80 can be composed of, for example, AWG. When the multi-wavelength light is input, the wavelength circulator 80 branches the multi-wavelength light for each wavelength and outputs the multi-wavelength light to an output port at a distance that is an integer multiple of the wavelength away from the input port to generate multi-wavelength light. do. A product operation value to be summed is selected using this action. The wavelength circulator 80 will be described in detail later.

光結合部９０は、第１結合値Ｄ１とｎ−ｊ個の第２重み学習データＤ_ｊ＋１〜Ｄ_ｎを結合した第１出力値Ｏと、第２結合値Ｄ２_１〜Ｄ２_ｊのそれぞれと第２重み学習データＤ_ｊ＋１〜Ｄ_ｎを結合した第２出力値Ｏδ_１〜Ｏδ_ｊを生成する。第１出力値Ｏと第２出力値Ｏδ_ｉは次式で表せる。但し、ｊ≦ｎ≦２ｊの場合である。 The optical coupling unit 90 includes a first output value O obtained by combining the first combined value D1 and n−j pieces of second weight learning data D _{j + 1 to} D _n , second combined values D2 _{1 to} D2 _j , and the first output value O. Second output values Oδ _{1 to} Oδ _j obtained by combining the two weight learning data D _{j + 1 to} D _n are generated. The first output value O and the second output value Oδ _i can be expressed by the following equations. However, it is a case of j <= n <= 2j.

最後の和演算は、波長周回器８０で光結合を行った光信号の光強度を受光器によって読み取ることで行う。なお、受光器の表記は省略している。 The final sum operation is performed by reading the light intensity of the optical signal optically coupled by the wavelength circulator 80 with a light receiver. The notation of the light receiver is omitted.

学習部６０は、第１出力値Ｏと第２出力値Ｏδ_１〜Ｏδ_ｊからｊ個ごとに重み係数をそれぞれ更新する計算を、ｍ回の単位で繰り返す。学習部６０は、参照符号から明らかなように第１実施形態と同じである。 The learning unit 60 repeats the calculation of updating the weighting factor for each _j from the first output value O and the second output values Oδ _{1 to} Oδ _j in units of m times. The learning unit 60 is the same as that of the first embodiment as is apparent from the reference numerals.

以上説明したように本実施形態の摂動学習装置２によれば、重み更新の計算を並列に処理するので重み学習を高速に処理できる。また、一度に学習する学習データの数をｊ個に限定するので、摂動学習装置２の構成部品点数を削減することもできる。以降、図面を参照して各機能構成部を更に詳しく説明する。 As described above, according to the perturbation learning apparatus 2 of the present embodiment, the weight update calculation is processed in parallel, so that the weight learning can be processed at high speed. Further, since the number of learning data to be learned at a time is limited to j, the number of component parts of the perturbation learning device 2 can be reduced. Hereinafter, each functional component will be described in more detail with reference to the drawings.

（学習データ生成部）
図８に、より具体的な学習データ生成部７０の機能構成例を示す。学習データ生成部７０は、波長分岐部７１、波長合成部７２、光分岐部７３、第１学習データフィルタ７４、第２学習データフィルタ７５、及び光結合部７６を備える。 (Learning data generator)
FIG. 8 shows a more specific functional configuration example of the learning data generation unit 70. The learning data generation unit 70 includes a wavelength branching unit 71, a wavelength synthesizing unit 72, an optical branching unit 73, a first learning data filter 74, a second learning data filter 75, and an optical coupling unit 76.

波長分岐部７１は、ｎ個の波長を含む波長多重光を、波長ごと（λ_０，…，λ_ｊ，…λ_ｎ）の単波長光に分波する。波長分岐部７１は、上記の通りＡＷＧ等の波長分岐素子で構成される。 The wavelength branching unit 71 demultiplexes wavelength multiplexed light including n wavelengths into single wavelength light for each wavelength (λ ₀ ,..., Λ _j ,... Λ _n ). The wavelength branching unit 71 is configured by a wavelength branching element such as AWG as described above.

例えば、最も波長の短い波長λ_０〜λ_ｊまでのＪ＋１個の単波長光は、波長合成部７２に入力され、波長λ_ｊ＋１〜λ_ｎまでのｎ−ｊ個の単波長光は第２学習データフィルタ７５に入力される。 For example, most single-wavelength light shorter to wavelength λ ₀ ~λ _{j J} + 1 single wavelength is inputted to the wavelength combining unit 72, n-j-number of single-wavelength light and the second learning to wavelength λ _{j + 1} ~λ _n Input to the data filter 75.

波長合成部７２は、波長分岐部７１で分波した最短の波長からｊ＋１番目に長い波長までの単波長光を合波する。波長合成部７２は、波長分岐部７１と同様に波長分岐素子で構成できる。 The wavelength synthesizer 72 multiplexes single wavelength light from the shortest wavelength demultiplexed by the wavelength branching unit 71 to the j + 1st longest wavelength. The wavelength synthesizer 72 can be configured with a wavelength branching element in the same manner as the wavelength branching unit 71.

なお、波長分岐部７１と波長合成部７２を、１組の波長分岐素子で構成する代わりに光合成器であるカプラを用いても良い。光合成器を用いた場合は、損失無しで結合することは原理的に不可能であるため、変換係数ｑ＞１を用い、後段の学習部６０において信号をｑ倍すれば良い。 It should be noted that the wavelength branching unit 71 and the wavelength synthesizing unit 72 may be a coupler which is an optical combiner instead of a single set of wavelength branching elements. In the case of using a photo combiner, since it is impossible in principle to combine without loss, the conversion coefficient q> 1 is used, and the signal may be multiplied by q in the subsequent learning unit 60.

光分岐部７３は、波長合成部７２の出力する信号をｊ個に等分岐する。光分岐部７３は、石英平面光導波路（ＰＬＣ）、フレキシブルポリマー光導波路等で構成できる。 The optical branching unit 73 equally divides the signal output from the wavelength combining unit 72 into j signals. The optical branching unit 73 can be configured by a quartz flat optical waveguide (PLC), a flexible polymer optical waveguide, or the like.

第１学習データフィルタ７４は、光分岐部７３の出力するｊ個の信号のそれぞれに、ｎ個の学習データの先頭からｊ個ずつｍ回に分けて学習データを付与した学習対象信号を生成する。学習データｘ_１〜ｘ_ｊは、学習部６０から入力されるｍによって次式に示すように変化する。 The first learning data filter 74 generates a learning target signal in which learning data is assigned to each of the j signals output from the optical branching unit 73 by dividing the j learning data into m times from the beginning of the n learning data. . The learning data x _{1 to} x _j change as indicated by the following equation depending on m input from the learning unit 60.

ｊ＝5，ｎ＝14とした場合、ｍ＝０ではｘ_１＝ｘ_１，…，ｘ_５＝ｘ_５、ｍ＝1ではｘ_１＝ｘ_６，…，ｘ_１０＝ｘ_５、ｍ＝2ではｘ_１＝ｘ₁₁，…，ｘ₃＝ｘ₁₄である。このように、学習データは、ｊ個ずつｍ回に分けて付与される。 When j = 5 and n = 14, when m = 0, x ₁ = x ₁ ,..., x ₅ = x ₅ , and when m = 1, x ₁ = x ₆ ,..., x ₁₀ = x ₅ , m = 2 Then, x ₁ = x ₁₁ ,..., X ₃ = x ₁₄ . In this way, the learning data is given j times m times.

第２学習データフィルタ７５は、波長分岐部７１の出力するｎ−ｊ個の信号のそれぞれに、ｎ−ｊ個の学習データを付与した学習非対象信号を生成する。ｍ＝０では波長λ_ｊ＋１の単波長光にｘ_６、波長λ_ｎの単波長光にｘ_ｎの学習データが付与される。 The second learning data filter 75 generates a learning non-target signal in which n−j learning data is added to each of the n−j signals output from the wavelength branching unit 71. When m = 0, x ₆ learning data is assigned to single wavelength light of wavelength λ _{j + 1} and x _n learning data is assigned to single wavelength light of wavelength λ _n .

光結合部７６は、第１学習データフィルタ７４のｊ個の出力信号を結合した学習対象信号を生成する。学習対象信号は、光分岐部１２に出力する。 The optical coupling unit 76 generates a learning target signal obtained by combining the j output signals of the first learning data filter 74. The learning target signal is output to the optical branching unit 12.

（第２実施形態の分岐部）
図９に、光分岐部１２のより具体的な機能構成例を示す。光分岐部１２は、波長分岐部１２_１、波長合成部１２_２、第１光分岐部１２_３、及び第２光分岐部１２_４を備える。 (Branching part of 2nd Embodiment)
FIG. 9 shows a more specific functional configuration example of the optical branching unit 12. Optical branching unit 12 is provided with the wavelength branching unit 12 _1, the wavelength combining unit 12 _2, the first optical branching unit 12 _3, and a second optical branching unit 12 _4.

波長分岐部１２_１と波長合成部１２_２の関係は、学習データ生成部７０の波長分岐部７１と波長合成部７２の関係と同じである。この例では、波長分岐部１２_１と波長合成部１２_２で学習対象信号に含まれるλ_０〜λ_ｊの波長を、最も短い波長λ_０の単波長光と、それ以外の波長を含む多波長光に分岐する。 Relationship between the wavelength branching unit 12 ₁ and the wavelength combining unit 12 ₂ is the same as the relationship between the wavelength branching unit 71 and the wavelength combining unit 72 of the learning data generating unit 70. In this example, the wavelengths of λ ₀ to λ _j included in the learning target signal in the wavelength branching unit 12 ₁ and the wavelength synthesizing unit 12 ₂ are the single wavelength light having the shortest wavelength λ ₀ and multiple wavelengths including other wavelengths. Branch to light.

第１光分岐部１２_３は、多波長光をｊ個の多波長光Ａ_１〜Ａ_ｊに分岐する。第２光分岐部１２_４は、単波長光をｊ個の単波長光Ｂ_１〜Ｂ_ｊに分岐する。 The first optical splitter 12 ₃ branches the multiple wavelength light in the j multi-wavelength light _A 1 to A _j. Second optical branching unit 12 ₄ branches the single-wavelength light in the j single-wavelength light _B 1 .about.B _j.

（波長周回器）
図１０に、波長周回器８０の作用を模式的に示す。波長周回器８０は、第１重み学習データＤ_１〜Ｄ_ｊと摂動重み学習データＤδ_１〜Ｄδ_ｊを入力として、第１結合値Ｄ１と第２結合値Ｄ２_１〜Ｄ２_ｊを出力する。 (Wavelength circulator)
FIG. 10 schematically shows the operation of the wavelength circulator 80. The wavelength circulator 80 receives the first weight learning data D _{1 to} D _j and the perturbation weight learning data Dδ _{1 to} Dδ _j, and outputs the first combined value D1 and the second combined values D2 _{1 to} D2 _j .

第１結合値Ｄ１は、ｊ個の第１重み学習データＤ_１〜Ｄ_ｊの全てを合計（結合）した値である。第２結合値Ｄ２_１は、第１重み学習データＤ_１を摂動重み学習データＤδ_１に置き代えて、摂動重み学習データＤδ_１と第１重み学習データＤ_１を除く他の第１重み学習データＤ_２〜Ｄ_ｊを合計した値である。 The first combined value D1 is a value obtained by summing (combining) all of the j pieces of first weight learning data D _{1 to} D _j . The second combined value D2 ₁ is obtained by replacing the first weight learning data D ₁ with the perturbation weight learning data Dδ ₁ and other first weight learning data excluding the perturbation weight learning data Dδ ₁ and the first weight learning data D _1. It is a value obtained by summing D _{2 to} D _j .

第２結合値Ｄ２_ｊは、第１重み学習データＤ_ｊを摂動重み学習データＤδ_ｊに置き代えて、摂動重み学習データＤδ_ｊと第１重み学習データＤ_ｊを除く他の第１重み学習データＤ_１〜Ｄ_ｊ−１を合計した値である。 The second combined value D2 _j replaces the first weight learning data D _j with the perturbation weight learning data Dδ _j , and other first weight learning data excluding the perturbation weight learning data Dδ _j and the first weight learning data D _j. It is a value obtained by summing D _{1 to} D _j−1 .

このように、波長周回器８０は、ｊ個の第１重み学習データＤ_１〜Ｄ_ｊを全て結合した第１結合値Ｄ１と、ｊ個の第１重み学習データＤ_１〜Ｄ_ｊの何れか１個を、該第１重み学習データに対応する摂動重み学習データに置き代えて他の全ての第１重み学習データと結合したｊ個の第２結合値Ｄ２_１〜Ｄ２_ｊとを出力する。 Thus, the wavelength orbiting unit 80 includes a first coupling value D1 bound all the j first weight learning data _D 1 to D _j, one of the j first weight learning data _D 1 to D _j one, to output the first weight learning corresponding to the data perturbation weight learning data every place the j second coupling value D2 ₁ ~ D2 coupled with all the other first weight learning data _j.

（光結合部）
光結合部９０は、光結合器９１、光分岐器９２、及び光結合器９３を備える。図１１に、光結合器９１と光分岐器９２の作用を模式的に示す。図１２に、光結合器９３の作用を模式的に示す。 (Optical coupling part)
The optical coupler 90 includes an optical coupler 91, an optical splitter 92, and an optical coupler 93. FIG. 11 schematically shows the operation of the optical coupler 91 and the optical splitter 92. FIG. 12 schematically shows the operation of the optical coupler 93.

光結合器９１は、今回学習しない重みを付与した非学習対象信号Ｄ_ｊ＋１〜Ｄ_ｎを結合（合波）する。今回とは、ｍ回の回数の何れかのことである。 The optical coupler 91 combines (combines) the non-learning target signals D _{j + 1 to} D _n to which weights that are not learned this time are given. This time is one of m times.

光分岐器９２は、結合した非学習対象信号Ｄ_ｊ＋１〜Ｄ_ｎをｊ＋１個に等分岐する。ｊ＋１とは、同時に学習する重みの数＋１の数である。 The optical branching device 92 equally branches the combined non-learning target signals D _{j + 1 to} D _n into j + 1. j + 1 is the number of weights to be learned simultaneously + 1.

光結合器９３は、波長周回器８０を通過した今回学習する重みを付与した第１結合値Ｄ１及び第２結合値Ｄ２_１〜Ｄ２_ｊのそれぞれに、非学習対象信号Ｄ_ｊ＋１〜Ｄ_ｎを結合させて第１出力値Ｏと第２出力値Ｏδ_１〜Ｏδ_ｊを生成する。 The optical coupler 93 couples the non-learning target signals D _{j + 1 to} D _n to the first coupling value D1 and the second coupling values D2 _{1 to} D2 _j to which the current learning weight passed through the wavelength circulator 80 is given. Thus, the first output value O and the second output values Oδ _{1 to} Oδ _j are generated.

図１２に示す第１出力値Ｏと第２出力値Ｏδ_１〜Ｏδ_ｊの関係は、図４に示した第１実施形態の両者の関係と同じである。つまり、本実施形態は、波長多重光に学習する重みが付与されている点で異なるだけで作用効果は第１実施形態と同じである。 The relationship between the first output value O and the second output values Oδ _{1 to} Oδ _j shown in FIG. 12 is the same as the relationship between both in the first embodiment shown in FIG. That is, the present embodiment is the same as the first embodiment except that the present embodiment is different only in that a learning weight is given to the wavelength multiplexed light.

（比較例との対比）
以上説明した摂動学習装置１，２によれば重み学習を高速化できる。そこで、本実施形態の効果を確認する目的で、比較例と摂動学習装置１（図１）の学習速度の比較を行った。学習速度の比較は、ニューラルネットワークのハードウェアにおいて、１秒間に書き換え可能なパラメータの更新回数ＣＵＰＳ（Connections Updated Per Second）を比較することで行った。 (Contrast with comparative example)
According to the perturbation learning devices 1 and 2 described above, the weight learning can be speeded up. Therefore, for the purpose of confirming the effect of the present embodiment, the learning speeds of the comparative example and the perturbation learning device 1 (FIG. 1) were compared. The learning speed was compared by comparing the number of parameter updates CUPS (Connections Updated Per Second) that can be rewritten per second in the hardware of the neural network.

シミュレーションの条件は、比較例を、ニューラルネットワークの規模を10000ニューロンであるとき、動作クロック3GHz、8コアのＣＰＵを使用とした。比較例のＣＰＵによる重み書き換え数は、1.8MCUPSと試算された。 The simulation conditions were as follows: when the neural network scale was 10,000 neurons, an operation clock of 3 GHz and an 8-core CPU were used. The number of weight rewrites by the CPU of the comparative example was estimated to be 1.8 MCUPS.

本実施形態の構成で、摂動重み演算部３１で同時に計算する重みの数を１０個としてシミュレーションした。条件は、入力フィルタとして用いる空間光り変調器の時定数を0.1ns、重みフィルタとして用いる空間光変調器の時定数を0.1ns、受光器として用いるアバランシェフォトダイオードの時定数を1ns、重み変化量を演算するアナログ回路の遅延を0.1ns、重み更新量を指令するコントローラの遅延を1nsと仮定した。 In the configuration of this embodiment, the number of weights simultaneously calculated by the perturbation weight calculation unit 31 is 10 and simulated. The conditions are: the time constant of the spatial light modulator used as the input filter is 0.1 ns, the time constant of the spatial light modulator used as the weight filter is 0.1 ns, the time constant of the avalanche photodiode used as the light receiver, and the weight change amount The delay of the analog circuit to calculate is assumed to be 0.1 ns, and the delay of the controller that commands the weight update amount is assumed to be 1 ns.

本実施形態の重み書き換え数は100MCPUSと試算された。比較例に対して50倍以上に高速化することができた。 The number of weight rewrites in this embodiment is estimated to be 100 MCPUS. The speed could be increased 50 times or more compared to the comparative example.

また、本実施形態によれば低消費電力化が図れる。図１３に、比較した結果の一例を示す。図１３（ａ）に、フィルタの消費電力を比較した例、図１３（ｂ）に構成部品点数を比較した例を示す。 Further, according to the present embodiment, low power consumption can be achieved. FIG. 13 shows an example of the comparison result. FIG. 13A shows an example in which the power consumption of the filter is compared, and FIG. 13B shows an example in which the number of components is compared.

ニューラルネットワークの規模は10000ニューロンで比較した。同時に学習する重みの数を１０個とし、フィルタサイズを800×600ピクセル、消費電力を50VA、データ１成分当たりのフィルタサイズを8ピクセルと仮定した。 The scale of the neural network was compared with 10,000 neurons. It was assumed that the number of weights to be learned simultaneously was 10, the filter size was 800 × 600 pixels, the power consumption was 50 VA, and the filter size per data component was 8 pixels.

その条件で、比較例のフィルタの消費電力は183.3kVAと試算された。一方、本実施形態は16.7kVAと試算された。比較例に対して消費電力を約10分の1にすることができた。 Under these conditions, the power consumption of the filter of the comparative example was estimated to be 183.3 kVA. On the other hand, this embodiment was estimated to be 16.7 kVA. Compared with the comparative example, the power consumption could be reduced to about 1/10.

また、本実施形態の摂動学習装置１，２は、当該装置を構成する構成部品点数を削減することもできる。図１３（ｂ）に示すように、ニューロンに依存しないで本実施形態の構成部品点数は、比較例に対して一桁少なくて済む。なお、第２実施形態では波長周回器８０を用いるため、部品点数が増加する（二点鎖線）。但し、増加分はニューロン数に依存せず10数個で一定である。よって、摂動学習装置２でも十分に構成分品点数を削減することができる。 Moreover, the perturbation learning apparatuses 1 and 2 of this embodiment can also reduce the number of components constituting the apparatus. As shown in FIG. 13B, the number of component parts of this embodiment can be reduced by an order of magnitude compared to the comparative example without depending on neurons. In the second embodiment, since the wavelength circulator 80 is used, the number of parts increases (two-dot chain line). However, the increment is not more than the number of neurons and is constant at a dozen. Therefore, the perturbation learning device 2 can sufficiently reduce the number of component parts.

以上説明したように本実施形態の摂動学習装置１，２によれば、重み学習を、高速化、低消費電力化、及び小型化することができる。 As described above, according to the perturbation learning devices 1 and 2 of the present embodiment, weight learning can be speeded up, reduced in power consumption, and reduced in size.

摂動学習装置２は、光部品を用いて構成した例で説明を行った。光をキャリアとして用いると、３次元空間に光配線を形成することができ、ショートの恐れや配線の取り回しといったデメリットが少ない。 The perturbation learning device 2 has been described with an example configured using optical components. When light is used as a carrier, an optical wiring can be formed in a three-dimensional space, and there are few demerits such as a short circuit and wiring.

なお、本発明はこの例に限定されない。例えば、摂動学習装置１の分岐部１０は、不等分岐の例で説明したが、等分岐する分岐部１０で構成しても良い。等分岐する場合は、摂動重み演算部３０と結合部５０との間に、信号をｊ分の１に減衰する減衰器を設ければ良い。また、光部品を用いずに本実施形態の摂動学習装置１を構成しても良い。光は電磁波の一種である。よって、例えば分岐部１０は、電磁波分岐器で構成することが可能である。また、結合部５０も電磁波結合器で構成することが可能である。このように本発明は、上記の実施形態に限定されるものではなく、その要旨の範囲内で変形が可能である。 Note that the present invention is not limited to this example. For example, the branching unit 10 of the perturbation learning device 1 has been described with an example of unequal branching, but may be configured with a branching unit 10 that branches equally. In the case of equal branching, an attenuator that attenuates the signal to 1 / j may be provided between the perturbation weight calculation unit 30 and the coupling unit 50. Moreover, you may comprise the perturbation learning apparatus 1 of this embodiment, without using an optical component. Light is a type of electromagnetic wave. Therefore, for example, the branching unit 10 can be configured by an electromagnetic wave branching device. The coupling unit 50 can also be configured by an electromagnetic wave coupler. Thus, the present invention is not limited to the above-described embodiment, and can be modified within the scope of the gist thereof.

本実施の形態は、例えば階層型ニューラルネットワークの学習装置に適用することができ、光コンピューティングなどの分野に利用可能である。 The present embodiment can be applied to, for example, a learning device for a hierarchical neural network, and can be used in fields such as optical computing.

１、２：摂動学習装置
１０、１２：分岐部
１０_１、１０_ｉ、１０_ｎ：光分波器
１１：マルチプレクサ
１２、７３：光分岐部
１２_１：波長分岐部
１２_２：波長合成部
１２_３：第１光分岐部
１２_４：第２光分岐部
２０：重み演算部
２０_０、２０_ｉ、２０_ｎ：光減衰フィルタ
３０：摂動重み演算部
４０：第２分岐部
５０：結合部
６０、６０_１、６０_ｊ：学習部
６１_ｊ，６２_ｊ，６３_ｊ,６５_ｉ：差動増幅器
６４_ｊ：計算部
７０：学習データ生成部
７４：第１学習データフィルタ
７５：第２学習データフィルタ
８０：波長周回器
７６、９０：光結合部
９１：光結合器
９２：光分岐器 1, 2: Perturbation learning device 10, 12: Branching unit 10 ₁ , 10 _i , 10 _n : Optical demultiplexer 11: Multiplexer 12, 73: Optical branching unit 12 ₁ : Wavelength branching unit 12 ₂ : Wavelength synthesizing unit 12 ₃ : First optical branching unit 12 ₄ : second optical branching unit 20: weight calculation units 20 ₀ , 20 _i , 20 _n : optical attenuation filter 30: perturbation weight calculation unit 40: second branching unit 50: coupling units 60 and 60 ₁ , 60 _j : learning unit 61 _j , 62 _j , 63 _j , 65 _i : differential amplifier 64 _j : calculation unit 70: learning data generation unit 74: first learning data filter 75: second learning data filter 80: wavelength Circulators 76, 90: optical coupler 91: optical coupler 92: optical splitter

Claims

a learning non-target data obtained by branching learning data consisting of n elements for each element; a branching unit that outputs learning target data branched for each element j times from the beginning of the learning data; and
A weight calculation unit that calculates weight learning data by multiplying a corresponding weighting factor for each element of the learning non-target data,
A perturbation weight coefficient obtained by adding a perturbation coefficient for correcting the weight coefficient to the weight coefficient corresponding to the learning target data is generated, and the perturbation weight coefficient is multiplied by the corresponding learning target data to obtain j perturbation weights. A perturbation weight calculator for calculating learning data;
A second branching unit for branching each of the weight learning data into j + 1 pieces;
The first output value obtained by summing all the j + 1 weight learning data and any one of the j + 1 weight learning data are replaced with the perturbation weight learning data corresponding to the weight learning data. A combining unit that calculates all the weight learning data of j and the total of j second output values;
a perturbation learning comprising: a learning unit which is provided with j and repeats the calculation of updating the weighting coefficient for each j from the first output value and the second output value in units of m times. apparatus.

The learning data is an optical signal,
The branching unit sets the light intensity of the n learning non-target data as a signal having a light intensity of j + 1 / j, and the light intensity of the learning target data as a signal having a light intensity of j + 1 /. The perturbation learning device according to claim 1.

The learning unit is provided in the same number as the perturbation weight calculation unit, and the teacher value is subtracted from each of the first cost value obtained by subtracting the teacher value representing the learning target from the first output value and the second output value. The calculation for obtaining the amount of change for updating the weighting coefficient from the second cost value, the perturbation coefficient, and the learning coefficient representing the gradient of the learning speed is repeated in units of m times. The perturbation learning device described in 1.

When the learning unit sets the first cost value as E, the second cost value as E _δi , the perturbation coefficient as δ, and the learning coefficient as η,

4. The perturbation learning device according to claim 3, wherein the change amount [Delta] _{wi_new} is calculated by the above formula.

5. The perturbation learning according to claim 4, further comprising: a first multiplier that converts the first cost value into a positive value; and a second multiplier that converts the second cost value into a positive value. apparatus.

Wavelength multiplexed light including n wavelengths is branched into multi-wavelength light including j + 1 wavelengths and single-wavelength light having a wavelength not included in the multi-wavelength light, and n learning data is included in the multi-wavelength light. A learning data generation unit that generates a learning target signal to which j pieces of learning data from the top of the learning data are given m times, and a learning non-target signal to which a weight corresponding to the single wavelength light is given,
An optical branching unit that bifurcates the learning target signal into j single-wavelength lights having different wavelengths and j multi-wavelength lights including other wavelengths;
a first weight calculator that calculates first weight learning data by multiplying each of the j multi-wavelength lights by a corresponding weight coefficient;
A perturbation weight coefficient is generated by adding a perturbation coefficient for correcting the weight coefficient to a weight coefficient corresponding to each of the j single-wavelength lights, and the perturbation weight coefficient is multiplied by the corresponding single wavelength light to perturb. A perturbation weight calculator for calculating weight learning data;
A second weight calculator that calculates second weight learning data by multiplying each of the learning non-target signals by a corresponding weight coefficient;
The j first weight learning data and the j perturbation weight learning data are input, and a first combined value obtained by combining all the j first weight learning data, and the j first weight learning data. Wavelength circulator for outputting j second combined values obtained by replacing any one of them with the perturbation weight learning data corresponding to the first weight learning data and combining with all the other first weight learning data When,
The first output value obtained by combining the first combined value and nj second weight learning data, and the j second combined value and nj second weight learning data are combined. An optical coupler for generating a second output value,
a perturbation learning comprising: a learning unit which is provided with j and repeats the calculation of updating the weighting coefficient for each j from the first output value and the second output value in units of m times. apparatus.

The learning data generation unit
a wavelength branching unit that demultiplexes wavelength multiplexed light including n wavelengths for each wavelength;
A wavelength combining unit that combines wavelength components from the shortest wavelength demultiplexed by the wavelength branching unit to the j + 1st longest wavelength;
An optical branching unit for equally branching the signal output from the wavelength multiplexing unit into j pieces;
A first learning data filter that assigns the learning data to each of j signals output from the optical branching unit by dividing j times from the beginning of n learning data into m times;
A second learning data filter that generates a learning non-target signal obtained by adding nj learning data to each of the nj signals output from the optical branching unit;
The perturbation learning device according to claim 6, further comprising: an optical coupling unit configured to generate a learning target signal obtained by combining the j output signals of the first learning data filter.

A perturbation learning method performed by a perturbation learning device,
A branching unit outputs learning non-target data obtained by branching learning data composed of n elements for each element, and learning target data branched for each element j times from the beginning of the learning data,
A weight calculation unit calculates weight learning data by multiplying a corresponding weighting factor for each element of the learning non-target data,
A perturbation weight calculation unit generates a perturbation weight coefficient obtained by adding a perturbation coefficient for correcting the weight coefficient to the weight coefficient corresponding to the learning target data, and multiplies the corresponding learning target data by the perturbation weight coefficient. To calculate j perturbation weight learning data,
A second branching unit branches each of the weight learning data into j + 1 pieces;
The combining unit adds any one of the first output value obtained by summing all the weight learning data branched into j + 1 pieces and the j + 1 weight learning data to the perturbation weight learning data corresponding to the weight learning data. Replacing all other weight learning data with the summed j second output values,
A perturbation learning method characterized in that j learning units repeat the calculation of updating the weighting factor for each j from the first output value and the second output value in units of m times. .