WO2025041292A1 - Training device, inference device, training inference device, program, training inference system, training method, and inference method

Info

Publication number: WO2025041292A1
Application number: PCT/JP2023/030253
Authority: WO
Inventors: 裕史鹿毛
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2023-08-23
Filing date: 2023-08-23
Publication date: 2025-02-27
Anticipated expiration: 2026-02-23
Also published as: JPWO2025041292A1

Abstract

In the present invention, a pattern detection device (100) comprises: a training data generation unit (101) that generates training data in which a plurality of feature vectors corresponding respectively to a plurality of training target images are arranged in time series; a true value data construction unit (104) that constructs true value data in which a plurality of true value vectors, each indicating true/false for each of the plurality of training target images, are arranged so as to correspond to the time series of the plurality of feature vectors; and a reservoir circuit training unit (107) that uses the training data and the true value data to train a three-layer neural network using reservoir computing.

Description

Learning device, inference device, learning inference device, program, learning inference system, learning method, and inference method

The present disclosure relates to a learning device, an inference device, a learning inference device, a program, a learning inference system, a learning method, and an inference method.

Machine learning is divided into CNNs (Convolutional Neural Networks) and RNNs (Recurrent Neural Networks).
With CNNs, the accuracy of static pattern detection has been improving as a deep learning technology. RNNs, on the other hand, have recurrent connections that CNNs lack, and have therefore been used exclusively for learning time-series signals.

In general, a CNN requires larger-scale hardware resources than an RNN. In contrast, reservoir computing, which is one RNN technique, has the advantage that learning is possible with fewer resources than with CNNs and other RNN methods. In addition, because reservoir computing does not require learning inside the intermediate layer, it has the further advantage that the intermediate layer can be implemented as a circuit using a general physical medium.

Reservoir computing has been applied extensively to time-series signal learning, but its scope of application can be expanded by applying it to the learning and detection of static two-dimensional patterns. As an example of such technology, Non-Patent Document 1 describes a method for implementing two-dimensional pattern detection learning using reservoir computing.

Non-Patent Document 1: Paugam-Moisy et al., "Delay learning and polychronization for reservoir computing", Neurocomputing 71 (7-9), pp. 1143-1158, 2008.

In the conventional technology, the coupling constants between the intermediate layer units of reservoir computing must be learned, so the inherent feature of reservoir computing, namely that a general physical medium can be used to construct the intermediate layer, is lost. Consequently, even if a physical medium usable for constructing the intermediate layer were available, constructing a learning network with it would become more difficult.

Therefore, an object of one or more aspects of the present disclosure is to make it possible to easily construct a learning network based on reservoir computing.

A learning device according to one aspect of the present disclosure is characterized by comprising: a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series; a true-value data construction unit that constructs true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors; and a learning unit that uses the learning data and the true-value data to train a three-layer neural network that uses reservoir computing.

An inference device according to one aspect of the present disclosure is characterized by comprising: an application data generation unit that generates, as application data, a feature vector from an inference target image, which is an image subject to inference; and a processing unit that trains a three-layer neural network that uses reservoir computing, using learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series and true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors, generates a trained three-layer neural network using the coupling constants of the three-layer neural network obtained as a result of the training, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.

A learning inference device according to one aspect of the present disclosure is characterized by comprising: a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series; a true-value data construction unit that constructs true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors; a learning unit that uses the learning data and the true-value data to train a three-layer neural network that uses reservoir computing, and obtains the coupling constants of the three-layer neural network as a result of the training; an application data generation unit that generates, as application data, a feature vector from an inference target image, which is an image subject to inference; and a processing unit that generates a trained three-layer neural network using the obtained coupling constants, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.

A program according to a first aspect of the present disclosure is characterized by causing a computer to function as: a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series; a true-value data construction unit that constructs true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors; and a learning unit that uses the learning data and the true-value data to train a three-layer neural network that uses reservoir computing.

A program according to a second aspect of the present disclosure is characterized by causing a computer to function as: an application data generation unit that generates, as application data, a feature vector from an inference target image, which is an image subject to inference; and a processing unit that trains a three-layer neural network that uses reservoir computing, using learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series and true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors, generates a trained three-layer neural network using the coupling constants of the three-layer neural network obtained as a result of the training, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.

A learning inference system according to one aspect of the present disclosure is characterized by comprising: a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series; a true-value data construction unit that constructs true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors; a learning unit that uses the learning data and the true-value data to train a three-layer neural network that uses reservoir computing, and obtains the coupling constants of the three-layer neural network as a result of the training; an application data generation unit that generates, as application data, a feature vector from an inference target image, which is an image subject to inference; and a processing unit that generates a trained three-layer neural network using the obtained coupling constants, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.

A learning method according to one aspect of the present disclosure is characterized by: generating learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series; constructing true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors; and using the learning data and the true-value data to train a three-layer neural network that uses reservoir computing.

An inference method according to one aspect of the present disclosure is characterized by: generating, as application data, a feature vector from an inference target image, which is an image subject to inference; training a three-layer neural network that uses reservoir computing, using learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series and true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors; generating a trained three-layer neural network using the coupling constants of the three-layer neural network obtained as a result of the training; and inferring whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.

According to one or more aspects of the present disclosure, a learning network based on reservoir computing can be constructed easily.

FIG. 1 is a block diagram schematically showing the configuration of a pattern detection device as a learning inference device according to Embodiment 1.
FIG. 2 is a schematic diagram showing an example of a learning image, which is a sample image including a pattern detection target.
FIG. 3 is a schematic diagram showing an example of extracting, from a learning image, a plurality of partial images to be judged as true images or false images.
FIG. 4 is a schematic diagram for explaining an example of a learning image set.
FIG. 5 is a schematic diagram for explaining the processing in the learning phase in Embodiment 1.
FIG. 6 is a schematic diagram for explaining an example of learning a plurality of learning images in Embodiment 1.
FIG. 7 is a schematic diagram for explaining the processing in the application phase in Embodiment 1.
FIG. 8 is a block diagram schematically showing the configuration of a computer.
FIG. 9 is a block diagram schematically showing the configuration of a pattern detection device as a learning inference device according to Embodiment 2.
FIG. 10 is a schematic diagram for explaining the processing in the learning input data designation unit and the orthogonal filter application unit in the learning phase in Embodiment 2.
FIG. 11 is a schematic diagram showing an example of a two-dimensional Walsh-Hadamard filter of order N = 8.
FIG. 12 is a schematic diagram for explaining the processing in the application phase in Embodiment 2.
FIG. 13 is a block diagram schematically showing the configuration of a pattern detection device according to Embodiment 3.
FIG. 14 is a schematic diagram for explaining the processing in the learning input data designation unit, the orthogonal filter application unit, and the range conversion unit in the learning phase in Embodiment 3.
FIG. 15 is a graph showing the hyperbolic tangent function.
FIG. 16 is a schematic diagram for explaining the processing in the application phase in Embodiment 3.

Embodiment 1.
FIG. 1 is a block diagram schematically showing the functional configuration of a pattern detection device 100 as a learning inference device according to Embodiment 1.
The pattern detection device 100 includes a learning data generation unit 101, a true-value data construction unit 104, a reservoir circuit learning unit 107, an application data input unit 111, a reservoir circuit processing unit 112, and an application data output unit 113.

The pattern detection device 100 performs processing that uses an Echo State Network, the discrete-time implementation of reservoir computing, and operates in two phases: a learning phase and an application phase.

The learning data generation unit 101 generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in a time series.
In Embodiment 1, the learning data generation unit 101 extracts pixel values in a predetermined order from each of the plurality of learning target images, which are two-dimensional images, and generates each of the plurality of feature vectors by arranging the extracted pixel values in the order of extraction.
The learning data generation unit 101 includes a learning input data designation unit 102 and a learning feature vector storage unit 103.

In the learning phase, the learning input data designation unit 102 converts one learning image to generate a feature vector. Each partial image included in a learning image is also referred to as a learning target image. The learning input data designation unit 102 converts a plurality of learning target images to generate a plurality of feature vectors. Then, in the learning phase, the learning input data designation unit 102 arranges these feature vectors so that they have a fixed time length, thereby generating a learning feature vector that serves as the learning data.

In the learning phase, the learning feature vector storage unit 103 stores the learning feature vector generated by the learning input data designation unit 102.

The true-value data construction unit 104 constructs true-value data in which a plurality of true-value vectors, each indicating whether the corresponding one of the plurality of learning target images is true or false, are arranged so as to correspond to the time series of the plurality of feature vectors generated from the plurality of learning target images.
The true-value data construction unit 104 includes a learning output data designation unit 105 and a learning true-value vector storage unit 106.

In the learning phase, the learning output data designation unit 105 generates, as a learning true-value vector, the true-value vectors that represent the true-value data for the individual feature vectors, such that they correspond one-to-one with the true values of the individual feature vectors included in the learning feature vector generated by the learning input data designation unit 102.

In the learning phase, the learning true-value vector storage unit 106 stores the individual true-value vectors included in the learning true-value vector generated by the learning output data designation unit 105 so that they correspond to the individual feature vectors included in the learning feature vector stored in the learning feature vector storage unit 103.
The learning true-value vector storage unit 106 then inputs the learning true-value vector, as the true-value data, to the reservoir circuit learning unit 107.
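The pairing of feature vectors and true-value vectors along a common time axis can be sketched as follows. This is a minimal illustration in Python with NumPy, not the implementation of the disclosure. The function name `build_time_series`, the parameter `hold` (an assumed number of discrete time steps for which each vector is presented), and the sample values are all hypothetical.

```python
import numpy as np

# True-value vectors used in the text: (1, 0)^T for a true image, (0, 1)^T for a false image.
TRUE_VEC = np.array([1.0, 0.0])
FALSE_VEC = np.array([0.0, 1.0])

def build_time_series(feature_vectors, labels, hold=1):
    """Arrange feature vectors and their true-value vectors along one time axis.

    feature_vectors: list of 1-D arrays of equal length, one per learning target image
    labels: list of bool, True for a true image and False for a false image
    hold: assumed number of time steps each vector is held (hypothetical parameter)
    Returns (U, D) with shapes (T, n_features) and (T, 2), where T = len(labels) * hold.
    """
    if len(feature_vectors) != len(labels):
        raise ValueError("each feature vector needs exactly one label")
    # np.repeat along axis 0 keeps the rows in order: a, a, ..., b, b, ...
    U = np.repeat(np.stack(feature_vectors), hold, axis=0)
    truth = np.stack([TRUE_VEC if is_true else FALSE_VEC for is_true in labels])
    D = np.repeat(truth, hold, axis=0)
    return U, D

features = [np.array([0.1, 0.2, 0.3]), np.array([0.9, 0.8, 0.7])]
U, D = build_time_series(features, [True, False], hold=2)
```

Row k of `D` is the true-value vector for row k of `U`, which is the one-to-one correspondence the storage units are described as maintaining.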

The reservoir circuit learning unit 107 functions as a learning unit that uses the learning data generated by the learning data generation unit 101 and the true-value data constructed by the true-value data construction unit 104 to train a three-layer neural network that uses reservoir computing.

In the learning phase, the reservoir circuit learning unit 107 applies the Echo State Network learning algorithm to the three-layer neural network based on the learning feature vector held in the learning feature vector storage unit 103 and the learning true-value vector held in the learning true-value vector storage unit 106.
The reservoir circuit learning unit 107 then stores the coupling constants of the three-layer neural network as the result of learning by the Echo State Network learning algorithm.

The application data input unit 111 in Embodiment 1 functions as an application data generation unit that generates, as application data, a feature vector from an inference target image, which is an image subject to inference.
In Embodiment 1, the application data input unit 111 extracts pixel values in a predetermined order from the inference target image, which is a two-dimensional image, and generates the application data by arranging the extracted pixel values in the order of extraction.

For example, in the application phase, the application data input unit 111 accepts a new two-dimensional image as input and converts it to obtain its feature vector. This two-dimensional image is the inference target image.

The reservoir circuit processing unit 112 functions as a processing unit that generates a trained three-layer neural network using the coupling constants of the three-layer neural network obtained as a result of learning in the reservoir circuit learning unit 107, and infers whether the inference target image is true or false by inputting the application data from the application data input unit 111 to the trained three-layer neural network.

For example, in the application phase, the reservoir circuit processing unit 112 constructs the trained three-layer neural network using the coupling constants stored in the reservoir circuit learning unit 107. The reservoir circuit processing unit 112 applies the trained Echo State Network to the trained three-layer neural network. As shown in FIG. 7, described later, the trained Echo State Network includes an input layer 112a, an intermediate layer 112b, and an output layer 112c.
The reservoir circuit processing unit 112 then inputs the feature vector obtained by the application data input unit 111 to the input layer 112a as input data.

The application data output unit 113 obtains the output data from the output layer 112c and outputs it as the determination result.
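The application phase can be illustrated with the following minimal sketch, which feeds one feature vector through a reservoir with fixed coupling constants and reads the two output units. Several points are assumptions made for illustration only: the tanh activation, the omission of output feedback, the number of update steps, and the decision rule that compares the two output units. The function name `infer_true_false` and the toy matrices are hypothetical.

```python
import numpy as np

def infer_true_false(u, W_in, W_res, W_out, leak=1.0, steps=1):
    """Classify one feature vector with fixed (already learned) coupling constants.

    u: feature vector of the inference target image
    W_in: input-to-intermediate coupling matrix, shape (n_res, n_in)
    W_res: intermediate-layer coupling matrix, shape (n_res, n_res)
    W_out: intermediate-to-output coupling matrix, shape (2, n_res)
    """
    x = np.zeros(W_res.shape[0])
    for _ in range(steps):
        # Leaky update of the intermediate-layer state (assumed form).
        x = (1.0 - leak) * x + leak * np.tanh(W_in @ u + W_res @ x)
    y = W_out @ x
    # Assumed decision rule: the first unit encodes "true", the second "false".
    label = "true" if y[0] > y[1] else "false"
    return label, y

# Toy coupling constants chosen by hand so the result is easy to follow.
W_in = np.array([[1.0], [-1.0]])
W_res = np.zeros((2, 2))
W_out = np.eye(2)

label_pos, y_pos = infer_true_false(np.array([1.0]), W_in, W_res, W_out)
label_neg, y_neg = infer_true_false(np.array([-1.0]), W_in, W_res, W_out)
```

With these toy values, a positive input drives the first output unit above the second and a negative input does the opposite, so the two calls yield different labels.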

In this embodiment, a two-dimensional image is used as an example of the pattern detection target.
The plurality of partial images extracted from a learning image form the learning input image set. Each of the partial images is a learning target image.
Among the partial images, a partial image that contains the detection target is a true image (true), and any other partial image is a false image (false). The true-value vector (1, 0)^T (where T denotes the transpose of the vector) is assigned to a true image, and the true-value vector (0, 1)^T is assigned to a false image. These true-value vectors, as true-value data for two output layer units, are associated with the partial images included in the learning input image set to form the learning output data set.
The following describes an example of operation when a learning image set consisting of the learning input image set and the learning output data set is learned by an Echo State Network, divided into the learning phase and the application phase.

First, the learning phase is described.
FIG. 2 is a schematic diagram showing an example of a learning image, which is a sample image including a pattern detection target.
In this example, the six apples present in the learning image are learned as the detection targets.

For this purpose, as shown in FIG. 3 for example, a plurality of rectangular frames are designated in the learning image IM, and the images within those rectangular frames are extracted as partial images P01 to P12, each of which is to be judged as a true image or a false image.
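Cropping partial images from designated rectangular frames can be sketched as follows. This is only an illustration: the frame convention `(top, left, height, width)` in pixels, the function name `extract_partial_images`, and the small test image are assumptions and do not come from the disclosure.

```python
import numpy as np

def extract_partial_images(image, frames):
    """Crop one partial image per rectangular frame.

    image: 2-D array (grayscale learning image)
    frames: list of (top, left, height, width) tuples in pixels (assumed convention)
    """
    patches = []
    for top, left, height, width in frames:
        inside = (top >= 0 and left >= 0
                  and top + height <= image.shape[0]
                  and left + width <= image.shape[1])
        if not inside:
            raise ValueError("frame lies outside the image")
        # Copy so that later edits to a patch do not modify the learning image.
        patches.append(image[top:top + height, left:left + width].copy())
    return patches

learning_image = np.arange(48).reshape(6, 8)  # stand-in for the learning image IM
frames = [(0, 0, 2, 3), (3, 4, 3, 4)]
patches = extract_partial_images(learning_image, frames)
```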

FIG. 4 is a schematic diagram for explaining an example of a learning image set.
In the learning image set, true images, which are detection targets, are labeled "true", and false images, which are non-detection targets, are labeled "false".

The true images shown in FIG. 4 are partial images extracted from the rectangular frames that contain the six apples present in the learning image IM shown in FIG. 3. To learn these true images as detection targets, the true-value vector (1, 0)^T is given as true-value data.

The false images shown in FIG. 4 are partial images extracted from rectangular frames designated at random in regions of the learning image IM shown in FIG. 3 that do not contain any of the six apples. To learn these false images as non-detection targets, the true-value vector (0, 1)^T is given as true-value data.
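One way to designate random rectangular frames that avoid the detection targets is rejection sampling, sketched below. The text does not say how the random frames are chosen, so the whole procedure is an assumption: here a candidate frame is rejected if it shares any pixel with a true-image frame, which is a stricter condition than the text requires. The names `overlaps` and `sample_false_frames` and all numeric values are hypothetical.

```python
import numpy as np

def overlaps(a, b):
    """Return True if two (top, left, height, width) frames share at least one pixel."""
    a_top, a_left, a_h, a_w = a
    b_top, b_left, b_h, b_w = b
    return (a_top < b_top + b_h and b_top < a_top + a_h
            and a_left < b_left + b_w and b_left < a_left + a_w)

def sample_false_frames(image_shape, true_frames, size, count, rng, max_tries=10000):
    """Draw up to `count` random frames of the given size that avoid all true frames."""
    height, width = size
    rows, cols = image_shape
    frames = []
    for _ in range(max_tries):
        if len(frames) == count:
            break
        # Generator.integers excludes the upper bound, hence the +1.
        top = int(rng.integers(0, rows - height + 1))
        left = int(rng.integers(0, cols - width + 1))
        candidate = (top, left, height, width)
        if not any(overlaps(candidate, t) for t in true_frames):
            frames.append(candidate)
    return frames

rng = np.random.default_rng(0)
true_frames = [(10, 10, 20, 20)]  # stand-in for a frame around one apple
false_frames = sample_false_frames((100, 100), true_frames, (16, 16), 6, rng)
```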

The processing performed when the Echo State Network learns one partial image from the learning image set, whose true value is either a true image or a false image, is described below with reference to FIG. 5.

FIG. 5 is a schematic diagram for explaining the processing in the learning phase.
First, two-dimensional image data representing one partial image used for training the Echo State Network is stored in the learning image input buffer 102a of the learning input data designation unit 102.

Next, in order to convert the partial image represented by the two-dimensional image data stored in the learning image input buffer 102a into a feature vector, the learning input data designation unit 102 converts the resolution of the partial image and stores the converted data in the feature vector conversion image buffer 102b.

Next, the learning input data designation unit 102 converts the resolution-converted partial image (the converted partial image) into a feature vector and stores it in the feature vector buffer 102c. To vectorize the converted partial image, the learning input data designation unit 102 may, for example, start at the upper-left corner pixel, read out the pixel values one pixel at a time along the row, proceed in order from the top row to the bottom row, and continue until the lower-right corner pixel has been read.
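The two steps above, resolution conversion followed by row-by-row vectorization, can be sketched as follows for a grayscale image. The text does not name a resolution conversion method, so nearest-neighbour sampling is used here purely as an assumption. The function names `resize_nearest` and `to_feature_vector` and the 4x4 sample patch are hypothetical.

```python
import numpy as np

def resize_nearest(image, out_h, out_w):
    """Nearest-neighbour resolution conversion (assumed method, not named in the text)."""
    rows = np.arange(out_h) * image.shape[0] // out_h
    cols = np.arange(out_w) * image.shape[1] // out_w
    # Pick one source pixel for every target pixel.
    return image[rows[:, None], cols]

def to_feature_vector(image):
    """Read pixel values row by row, from the upper-left pixel to the lower-right pixel."""
    return image.reshape(-1)

patch = np.arange(16, dtype=float).reshape(4, 4)  # stand-in for one partial image
converted = resize_nearest(patch, 2, 2)           # plays the role of buffer 102b
feature_vector = to_feature_vector(converted)     # plays the role of buffer 102c
```

Reducing the resolution before vectorization fixes the length of the feature vector, so partial images cut from frames of different sizes can share one input layer.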

Next, the true value of the one partial image input to the learning input data designation unit 102, in other words, the true-value vector associated with a true image if the partial image contains the detection target and with a false image otherwise, is stored in the true-value vector storage buffer 105a of the learning output data designation unit 105.

Here, the number of elements of the true-value vector may be any number equal to or greater than one. For example, the number of elements may be set to two, and learning may be performed by associating the column vector (1, 0)^T with a true image and the column vector (0, 1)^T with a false image.

　以上のように、特徴ベクトル用バッファ１０２ｃに保存された、部分画像から得られる特徴ベクトルと、真値ベクトル格納バッファ１０５ａに保存された真値ベクトルと、リザバー回路学習部１０７が有するＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋに関して適切なネットワーク初期設定を行えば、下記の参考文献１に記載された学習アルゴリズムにより、正画像と、偽画像とを併せて２枚の画像をＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋで学習できる。

As described above, given the feature vector obtained from the partial image and stored in the feature vector buffer 102c, the true value vector stored in the true value vector storage buffer 105a, and appropriate initial network settings for the Echo State Network of the reservoir circuit learning unit 107, the Echo State Network can learn two images, one positive image and one false image, using the learning algorithm described in Reference 1 below.

　ここで、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋの学習は、中間層から出力層に至る結合定数のみを学習すればよく、その具体的計算手法について説明する。
　まず、リザバー回路学習部１０７は、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋの入力層１０７ａ、中間層１０７ｂ及び出力層１０７ｃを相互に結合する結線について、入力層１０７ａから中間層１０７ｂへの結合定数行列をＷ_ｉｎ、中間層１０７ｂの内部ユニットの相互的な結合定数行列をＷ_ｒｅｓ、出力層１０７ｃから中間層１０７ｂへの結合定数行列をＷ_ｆｂとする。また、離散時刻ｋにおける入力層１０７ａへの入力信号をｕ［ｋ］、中間層１０７ｂの内部ユニットの状態をｘ［ｋ］、出力層１０７ｃの出力信号をｙ［ｋ］としたとき、下記の（１）式の関係がある。 Here, the learning of the Echo State Network requires learning only the coupling constants from the intermediate layer to the output layer, and a specific calculation method for this will be described.
First, for the connections that mutually connect the input layer 107a, the intermediate layer 107b, and the output layer 107c of the Echo State Network, the reservoir circuit learning unit 107 denotes the coupling constant matrix from the input layer 107a to the intermediate layer 107b as W_in, the mutual coupling constant matrix of the internal units of the intermediate layer 107b as W_res, and the coupling constant matrix from the output layer 107c to the intermediate layer 107b as W_fb. In addition, when the input signal to the input layer 107a at discrete time k is u[k], the state of the internal units of the intermediate layer 107b is x[k], and the output signal of the output layer 107c is y[k], the relationship of the following equation (1) holds.

　（１）式において、λは、ｌｅａｋｉｎｇ　ｒａｔｅと呼ばれ、１時刻前のｘ［ｋ］の活動をｘ［ｋ＋１］の活動にどの程度反映させるか、言い換えると、過去の中間層の内部状態履歴をどれだけ時間軸方向に引きずるかを意味する。λ＝１のとき、ｘ［ｋ］は、ｘ［ｋ＋１］の更新に一切反映されない、言い換えると、内部状態履歴を時間軸方向に一切引きずらないことを意味する。 In equation (1), λ is called the leaking rate, and indicates the extent to which the activity of x[k] from one time point before is reflected in the activity of x[k+1], in other words, how much of the past internal state history of the hidden layer is dragged along the time axis. When λ=1, x[k] is not reflected at all in the update of x[k+1], in other words, the internal state history is not dragged along the time axis at all.
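The image of equation (1) is not reproduced in this text. As a hedged reconstruction only, a leaky-integrator state update commonly used for Echo State Networks can be written with the symbols defined above as follows. The activation function f (often tanh) and the exact time indices are assumptions and may differ from the original equation.

```latex
x[k+1] = (1-\lambda)\,x[k]
       + \lambda\, f\!\left( W_{in}\,u[k+1] + W_{res}\,x[k] + W_{fb}\,y[k] \right)
```

In this assumed form, setting λ=1 removes the leak term (1−λ)x[k], so the previous internal state is no longer carried over directly into x[k+1].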

　さらに、中間層１０７ｂから出力層１０７ｃへの結合定数行列をＷ_ｏｕｔ、時刻１からＴまでのｕ［ｋ］と、ｘ［ｋ］とを列ベクトルとしてまとめた行列をＸ、真値データであるｙ［ｋ］を列ベクトルとしてまとめた行列をＹ_{ｔａｒｇｅｔ}としたとき、Ｘと、Ｙ_{ｔａｒｇｅｔ}とには、下記の（２）式の関係がある。 Furthermore, let W_out be the coupling constant matrix from the intermediate layer 107b to the output layer 107c, let X be the matrix whose columns collect u[k] and x[k] for times 1 to T, and let Y_target be the matrix whose columns collect the true value data y[k]. Then X and Y_target satisfy the relationship of the following equation (2).

　上記の参考文献１に記載されているＲｉｄｇｅ　Ｒｅｇｒｅｓｓｉｏｎを用いて、（１）式をＷ_ｏｕｔについて解くと、下記の（３）式のようになる。 When equation (1) is solved for W_out using the Ridge Regression described in Reference 1 above, the following equation (3) is obtained.

　これにより、部分画像から導いた特徴ベクトルと、中間層１０７ｂの内部状態から構成されるＸと、真値ベクトルから構成されるＹ_{ｔａｒｇｅｔ}とから、Ｗ_ｏｕｔを算出することができる。 As a result, W_out can be calculated from X, which is composed of the feature vector derived from the partial image and the internal state of the intermediate layer 107b, and Y_target, which is composed of the true value vector.
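The calculation of W_out from X and Y_target can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the reservoir circuit learning unit 107: the layer sizes, the tanh activation, the spectral-radius scaling, the omission of the feedback matrix W_fb, and the closed-form ridge solution W_out = Y_target X^T (X X^T + βI)^{-1} are all assumed here, and the names `collect_states` and `train_readout` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

n_in, n_res = 4, 30        # input and reservoir sizes (illustrative values)
leak, beta = 0.5, 1e-6     # leaking rate (lambda) and ridge parameter (beta)

# Fixed random weights; only W_out is learned. W_fb is omitted in this sketch.
W_in = rng.uniform(-0.5, 0.5, (n_res, n_in))
W_res = rng.uniform(-0.5, 0.5, (n_res, n_res))
W_res *= 0.9 / np.max(np.abs(np.linalg.eigvals(W_res)))  # scale spectral radius to 0.9


def collect_states(U):
    """Run the reservoir over the columns of U; stack [u[k]; x[k]] as columns of X."""
    x = np.zeros(n_res)
    cols = []
    for u in U.T:
        x = (1 - leak) * x + leak * np.tanh(W_in @ u + W_res @ x)
        cols.append(np.concatenate([u, x]))
    return np.array(cols).T


def train_readout(X, Y_target, beta):
    """Assumed ridge solution: W_out = Y_target X^T (X X^T + beta I)^-1."""
    A = X @ X.T + beta * np.eye(X.shape[0])
    return np.linalg.solve(A, X @ Y_target.T).T


# One positive and one false feature vector, each held for l time steps.
l = 10
pos = rng.uniform(0, 1, n_in)
neg = rng.uniform(0, 1, n_in)
U = np.column_stack([pos] * l + [neg] * l)
Y_target = np.column_stack([[1, 0]] * l + [[0, 1]] * l).astype(float)

X = collect_states(U)
W_out = train_readout(X, Y_target, beta)
Y = W_out @ X  # network output for the training sequence
```

Solving the linear system with `np.linalg.solve` avoids forming an explicit inverse, which is a common numerical choice rather than something stated in the text.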

　なお、（３）式は、Ｒｉｄｇｅ　Ｒｅｇｒｅｓｓｉｏｎの計算プロセスを示しており、βは、Ｒｉｄｇｅ　Ｒｅｇｒｅｓｓｉｏｎにおける最適化パラメータである。
　仮に、（２）式に対してＲｉｄｇｅ　Ｒｅｇｒｅｓｓｉｏｎを利用せず、Ｍｏｏｒｅ－Ｐｅｎｒｏｓｅの擬似逆行列を利用して、Ｗ_ｏｕｔを直接算出する場合の解は、下記の（４）式で得られる。 It should be noted that equation (3) shows the calculation process of Ridge Regression, and β is an optimization parameter in Ridge Regression.
If Ridge Regression is not applied to equation (2) and W_out is instead calculated directly using the Moore-Penrose pseudoinverse, the solution is given by the following equation (4).

　しかしながら、（４）式は、Ｘの次元が大きい場合に擬似逆行列の計算量が嵩むため、Ｒｉｄｇｅ　Ｒｅｇｒｅｓｓｉｏｎを利用して擬似逆行列計算を回避し、計算量を抑え込む。この場合のＲｉｄｇｅ　Ｒｅｇｒｅｓｓｉｏｎによる最適化計算は、下記の（５）式で与えられる。 However, since the amount of calculation required for the pseudo-inverse matrix in equation (4) increases when the dimension of X is large, Ridge Regression is used to avoid the pseudo-inverse matrix calculation and reduce the amount of calculation. The optimization calculation using Ridge Regression in this case is given by the following equation (5).

　（５）式で、仮にβ||Ｗ_ｏｕｔ||^２の項がない場合、Ｗ_ｏｕｔのノルムサイズが巨大になると、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋの出力が不安定になる。このため、（５）式の計算過程にβ||Ｗ_ｏｕｔ||^２の項を含めて最適化計算を実行し、ノルムサイズが調節されたＷ_ｏｕｔを得る。 In equation (5), without the term β∥W_out∥², the norm of W_out can become very large, which makes the output of the Echo State Network unstable. For this reason, the term β∥W_out∥² is included in the optimization of equation (5) to obtain a W_out whose norm is kept under control.
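The images of equations (2) to (5) are not reproduced in this text. The following is a hedged reconstruction in standard ridge-regression notation using the symbols defined above. The identity matrix I, the pseudoinverse notation X^{+}, and the unnormalized squared norm are assumptions, and the original equations may differ in normalization.

```latex
\begin{align}
Y_{target} &= W_{out}\, X \tag{2} \\
W_{out} &= Y_{target}\, X^{T} \left( X X^{T} + \beta I \right)^{-1} \tag{3} \\
W_{out} &= Y_{target}\, X^{+} \tag{4} \\
W_{out} &= \arg\min_{W_{out}} \left( \lVert W_{out} X - Y_{target} \rVert^{2}
          + \beta \lVert W_{out} \rVert^{2} \right) \tag{5}
\end{align}
```

Under these assumptions, setting the gradient of the objective in (5) with respect to W_out to zero gives W_out (X X^T + βI) = Y_target X^T, which rearranges to (3). When β approaches 0 and X X^T is invertible, (3) coincides with (4).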

　以上では、一つの部分画像をＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋに学習させる例を説明したが、ここでは、正画像又は偽画像の真値が付与された複数枚の学習用画像をＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋで学習する例について、図６を用いて説明する。 Above, we have explained an example of training the Echo State Network on one partial image, but here we will use Figure 6 to explain an example of training multiple learning images with true values of either positive or false images assigned to them using the Echo State Network.

　なお、図６においては、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋに学習させる複数枚の二次元画像である学習用画像として、いずれも、学習用入力データ指定部１０２によって変換された特徴ベクトルが利用される。個々の二次元画像を特徴ベクトルに変換する過程については、図５を用いて説明した上述の手続きが利用されればよい。 In FIG. 6, the feature vectors converted by the learning input data designation unit 102 are used as the learning images, which are multiple two-dimensional images that are trained by the Echo State Network. The process of converting each two-dimensional image into a feature vector can be performed using the procedure described above with reference to FIG. 5.

　図６において、学習用入力データ指定部１０２には、ｎ枚（ｎは、２以上の整数）の正画像Ｔ_１，Ｔ_２，・・・，Ｔ_ｎと、ｍ枚（ｍは、２以上の整数）の偽画像Ｆ_１，Ｆ_２，・・・，Ｆ_ｍが特徴ベクトルの形式で時系列順に格納されている。さらに、個々の正画像及び偽画像については、いずれも、特徴ベクトルがｌ枚連続して格納されている。 In FIG. 6, n positive images T_1, T_2, ..., T_n (n is an integer of 2 or more) and m false images F_1, F_2, ..., F_m (m is an integer of 2 or more) are stored in the learning input data designation unit 102 in time-series order in the form of feature vectors. Furthermore, for each positive image and each false image, l feature vectors are stored consecutively.

　言い換えると、学習用入力データ指定部１０２には、合計ｌ×（ｎ＋ｍ）個の特徴ベクトルが時系列順に格納され、上記の（２）式における行列Ｘの列ベクトルを構成している。この時系列順に呼応する形で、学習用出力データ指定部１０５には個々の正画像及び偽画像に対応付けられた合計ｌ×（ｎ＋ｍ）個の真値ベクトルが、時系列順に格納され、上記の（２）式における行列Ｙ_{ｔａｒｇｅｔ}の列ベクトルを構成している。 In other words, a total of l×(n+m) feature vectors are stored in the learning input data designation unit 102 in time-series order and constitute the column vectors of the matrix X in equation (2) above. In correspondence with this time-series order, a total of l×(n+m) true value vectors associated with the individual positive and false images are stored in the learning output data designation unit 105 in time-series order and constitute the column vectors of the matrix Y_target in equation (2) above.

　具体的な真値ベクトルとして、上述同様、正画像には、列ベクトル（１，０）^Ｔ、偽画像には、列ベクトル（０，１）^Ｔが対応付けられればよい。 As specific true value vectors, as described above, the column vector (1,0)^T may be associated with a positive image and the column vector (0,1)^T with a false image.
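The arrangement of l×(n+m) feature vectors and their true value vectors in time-series order can be sketched as follows. This is an illustration only: the function name `build_training_sequence`, the choice to place all positive images before all false images, and the use of l repeated copies of one feature vector per image are assumptions, since the ordering in FIG. 6 is not reproduced in this text.

```python
def build_training_sequence(positive_vectors, false_vectors, l):
    """Hold each feature vector for l consecutive time steps and pair it with its truth vector.

    Returns (inputs, targets), two lists of equal length l * (n + m).
    """
    positive_truth, false_truth = (1, 0), (0, 1)
    inputs, targets = [], []
    for vectors, truth in ((positive_vectors, positive_truth),
                           (false_vectors, false_truth)):
        for vector in vectors:
            for _ in range(l):
                inputs.append(list(vector))
                targets.append(truth)
    return inputs, targets


# Hypothetical data: n = 2 positive images, m = 3 false images, l = 4 time steps each.
positives = [[0.1, 0.2], [0.3, 0.4]]
falses = [[0.5, 0.6], [0.7, 0.8], [0.9, 1.0]]
inputs, targets = build_training_sequence(positives, falses, l=4)
```

Each entry of `inputs` corresponds to one time step of the input sequence, and the entry of `targets` at the same index corresponds to the matching column of Y_target.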

　次に、実施の形態１における適用フェーズについて、図７を用いて説明する。
　まず、一つの部分画像を示す二次元画像データが、適用データ入力部１１１が有する適用画像入力バッファ１１１ａに格納される。 Next, the application phase in the first embodiment will be described with reference to FIG. 7.
First, two-dimensional image data representing one partial image is stored in the application image input buffer 111a of the application data input unit 111.

　次に、適用画像入力バッファ１１１ａに格納されている二次元画像データで示される部分画像を特徴ベクトル化するために、適用データ入力部１１１は、その部分画像の解像度を変換して、変換後のデータを特徴ベクトル変換用画像バッファ１１１ｂに格納する。 Next, in order to convert the partial image represented by the two-dimensional image data stored in the application image input buffer 111a into a feature vector, the application data input unit 111 converts the resolution of the partial image and stores the converted data in the image buffer for feature vector conversion 111b.

　次に、適用データ入力部１１１は、解像度変換された変換部分画像を特徴ベクトル化して特徴ベクトル用バッファ１１１ｃに保存する。この変換部分画像をベクトル化するためには、適用データ入力部１１１は、例えば、変換部分画像の左上隅画素をスタートとして順に左行方向に１画素ずつ画素値を読み出し、順次上端行から下端行へ進み、右下端画素まで画素値を読み出すことにより、ベクトル化すればよい。 Next, the application data input unit 111 converts the resolution-converted partial image into a feature vector and stores it in the feature vector buffer 111c. To vectorize the converted partial image, the application data input unit 111 may, for example, start at the upper-left corner pixel, read the pixel values one pixel at a time along the row, proceed in order from the top row to the bottom row, and continue until the lower-right corner pixel has been read.

　リザバー回路処理部１１２は、リザバー回路学習部１０７での学習の結果として取得された、三層ニューラルネットワークの結合定数を用いて、学習済み三層ニューラルネットワークを生成する。リザバー回路処理部１１２は、学習済み三層ニューラルネットワークに対して学習済みのＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋを適用する。学習済みのＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋを有したリザバー回路処理部１１２は、新たな二次元画像を列ベクトルとして適用データ入力部１１１が構築した特徴ベクトルの入力を受け入れる。
　学習済のＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋは、入力層１１２ａ、中間層１１２ｂ及び出力層１１２ｃを備える。そして、リザバー回路処理部１１２は、学習済みのＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋにより出力ベクトルを算出して、適用データ出力部１１３の出力ベクトル格納バッファ１１３ａに、その出力ベクトルを格納する。 The reservoir circuit processing unit 112 generates a trained three-layered neural network using the coupling constants of the three-layered neural network acquired as a result of training by the reservoir circuit training unit 107. The reservoir circuit processing unit 112 applies a trained Echo State Network to the trained three-layered neural network. The reservoir circuit processing unit 112 having the trained Echo State Network accepts input of a feature vector constructed by the application data input unit 111 with a new two-dimensional image as a column vector.
The trained Echo State Network includes an input layer 112a, an intermediate layer 112b, and an output layer 112c. The reservoir circuit processing unit 112 calculates an output vector using the trained Echo State Network, and stores the output vector in an output vector storage buffer 113a of the application data output unit 113.

　そして、適用データ出力部１１３では、判定回路１１３ｂが、出力ベクトルが検知対象であることを意味する正画像か、あるいは非検知対象である偽画像かを判定する。この正画像あるいは偽画像の判定については、判定回路１１３ｂは、例えば、要素数２の出力ベクトルに対し、各要素についてしきい値判定を適用して判定してもよく、あるいは線形分離又はその他の機械学習手法を利用して判定を行ってもよい。 Then, in the application data output unit 113, the judgment circuit 113b judges whether the output vector is a positive image, meaning that it is a detection target, or a false image, meaning that it is not a detection target. In order to judge whether the image is positive or false, the judgment circuit 113b may, for example, apply a threshold judgment to each element of an output vector having two elements, or may use linear separation or other machine learning methods to make the judgment.
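The per-element threshold judgment mentioned above for a two-element output vector can be sketched as follows. This is only one possible reading: the function name `classify_output`, the threshold value 0.5, and the "undecided" result for ambiguous outputs are assumptions not stated in the text.

```python
def classify_output(y, threshold=0.5):
    """Judge a 2-element output vector (positive score, false score) by thresholding each element."""
    positive_score, false_score = y
    if positive_score >= threshold and false_score < threshold:
        return "positive"
    if false_score >= threshold and positive_score < threshold:
        return "false"
    return "undecided"  # both or neither element passed the threshold
```

For example, an output close to (1,0)^T is judged as a positive image and an output close to (0,1)^T as a false image, mirroring the true value vectors used in learning.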

　以上により、リザバーコンピューティングによる学習ネットワークを容易に構築することができる。そして、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋを利用して静的な二次元画像に含まれる特定の対象を、部分画像の矩形位置として検出することが可能になる。この識別能力をさらに高めるために、下記の参考文献２に記載されたアンサンブル学習と呼ばれる機械学習の手法が知られており、識別性能の低い識別関数を複数組み合わせることで、高い識別性能を確保することができる。このアンサンブル学習を利用して、学習済みのＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋを一つの識別関数と見て、複数のＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋを組み合わせることにより、より高精度の識別関数を構築することができる。
　参考文献２：Ｉ．　Ｄ．　Ｍｉｅｎｙｅ　ａｎｄ　Ｙ．　Ｓｕｎ，　“Ａ　Ｓｕｒｖｅｙ　ｏｆ　Ｅｎｓｅｍｂｌｅ　Ｌｅａｒｎｉｎｇ：　Ｃｏｎｃｅｐｔｓ，　Ａｌｇｏｒｉｔｈｍｓ，　Ａｐｐｌｉｃａｔｉｏｎｓ，　ａｎｄ　Ｐｒｏｓｐｅｃｔｓ”，　ＩＥＥＥ　Ａｃｃｅｓｓ　Ｖｏｌ．　１０，　ｐｐ．　９９１２９－９９１４９，　２０２２． As a result, a learning network using reservoir computing can be easily constructed. Then, it becomes possible to detect a specific object contained in a static two-dimensional image as a rectangular position of a partial image using the Echo State Network. In order to further improve this discrimination ability, a machine learning technique called ensemble learning described in Reference 2 below is known, and high discrimination performance can be ensured by combining multiple discrimination functions with low discrimination performance. Using this ensemble learning, a trained Echo State Network can be regarded as one discrimination function, and multiple Echo State Networks can be combined to construct a discrimination function with higher accuracy.
Reference 2: I. D. Mienye and Y. Sun, “A Survey of Ensemble Learning: Concepts, Algorithms, Applications, and Prospects”, IEEE Access Vol. 10, pp. 99129-99149, 2022.
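Combining several trained Echo State Networks as described above can be sketched as follows. The text does not state how the individual outputs are combined, so averaging the two-element output vectors and picking the larger element is an assumption, as is the function name `ensemble_decision`.

```python
def ensemble_decision(outputs):
    """Average the 2-element output vectors of several networks and pick the larger element."""
    count = len(outputs)
    mean_positive = sum(y[0] for y in outputs) / count
    mean_false = sum(y[1] for y in outputs) / count
    return "positive" if mean_positive > mean_false else "false"


# Hypothetical outputs of three separately trained networks for the same partial image.
decision = ensemble_decision([(0.9, 0.1), (0.4, 0.6), (0.7, 0.3)])
```

Other combination rules surveyed in Reference 2, such as majority voting or weighted voting, could be substituted for the averaging step.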

　なお、実施の形態１では、リザバーコンピューティング回路への入力パターンを列ベクトルとして表現し、各列ベクトルが一定の時間長を持って入力パターンを構成し、出力データには入力パターンの識別カテゴリに相当する教師信号を対応づけることにより、リザバーコンピューティング回路の学習でリザバーコンピューティング回路の中間層が持つ動的平衡点に入力パターンを対応付けることができる。これにより、リザバーコンピューティング回路を利用してパターン検出を実現することが可能になる。 In the first embodiment, the input pattern to the reservoir computing circuit is expressed as a column vector, each column vector has a certain time length and constitutes an input pattern. By associating the output data with a teacher signal corresponding to the classification category of the input pattern, the input pattern can be associated with the dynamic equilibrium point of the intermediate layer of the reservoir computing circuit during learning of the reservoir computing circuit. This makes it possible to realize pattern detection using the reservoir computing circuit.

　以上に記載されたパターン検出装置１００は、例えば、図８に示されているようなコンピュータ１０により実現することができる。
　コンピュータ１０は、ＨＤＤ（Ｈａｒｄ　Ｄｉｓｋ　Ｄｒｉｖｅ）又はＳＳＤ（Ｓｏｌｉｄ　Ｓｔａｔｅ　Ｄｒｉｖｅ）等のストレージ１１と、メモリ１２と、ＣＰＵ（Ｃｅｎｔｒａｌ　Ｐｒｏｃｅｓｓｉｎｇ　Ｕｎｉｔ）等のプロセッサ１３と、処理回路１４と、リザバー回路１５とを備える。 The above-described pattern detection device 100 can be realized by, for example, a computer 10 as shown in FIG. 8.
The computer 10 includes a storage 11 such as a hard disk drive (HDD) or a solid state drive (SSD), a memory 12, a processor 13 such as a central processing unit (CPU), a processing circuit 14, and a reservoir circuit 15.

　処理回路１４は、　単一回路、複合回路、プログラムで動作するプロセッサ、プログラムで動作する並列プロセッサ、ＡＳＩＣ（Ａｐｐｌｉｃａｔｉｏｎ　Ｓｐｅｃｉｆｉｃ　Ｉｎｔｅｇｒａｔｅｄ　Ｃｉｒｃｕｉｔ）又はＦＰＧＡ（Ｆｉｅｌｄ　Ｐｒｏｇｒａｍｍａｂｌｅ　Ｇａｔｅ　Ａｒｒａｙ）等により構成される。 The processing circuit 14 is composed of a single circuit, a composite circuit, a processor operated by a program, a parallel processor operated by a program, an ASIC (Application Specific Integrated Circuit), an FPGA (Field Programmable Gate Array), etc.

　例えば、学習用入力データ指定部１０２、学習用出力データ指定部１０５及び適用データ入力部１１１は、処理回路１４により構成することができる。
　学習用特徴ベクトル格納部１０３及び学習用真値ベクトル格納部１０６は、メモリ１２により構成することができる。
　リザバー回路学習部１０７及びリザバー回路処理部１１２は、ストレージ１１に記憶されているプログラムをメモリ１２にロードして、そのプログラムをプロセッサ１３が実行して、プロセッサ１３がリザバー回路１５を利用することで構成することができる。
　適用データ出力部１１３は、ストレージ１１に記憶されているプログラムをメモリ１２にロードして、そのプログラムをプロセッサ１３が実行することで実現することができる。 For example, the learning input data designation unit 102, the learning output data designation unit 105, and the application data input unit 111 can be configured by the processing circuit 14.
The learning feature vector storage unit 103 and the learning true value vector storage unit 106 can be configured by the memory 12.
The reservoir circuit learning unit 107 and the reservoir circuit processing unit 112 can be configured by loading a program stored in the storage 11 into the memory 12, having the processor 13 execute the program, and having the processor 13 utilize the reservoir circuit 15.
The application data output unit 113 can be realized by loading a program stored in the storage 11 into the memory 12 and having the processor 13 execute the program.

　以上のプログラムは、図示しないリーダ／ライタを介して、図示しない記録媒体から、あるいは、図示しない通信Ｉ／Ｆ（ＩｎｔｅｒＦａｃｅ）を介してネットワークから、ストレージ１１にダウンロードされ、それから、メモリ１２上にロードされてプロセッサ１３により実行されてもよい。また、リーダ／ライタを介して、図示しない記録媒体から、あるいは、図示しない通信Ｉ／Ｆを介してネットワークから、メモリ１２上に直接ロードされ、プロセッサ１３により実行されてもよい。
　言い換えると、プログラムは、記録媒体等のプログラムプロダクトにより提供されてもよい。
　以上のように、パターン検出装置１００は、処理回路網により実現することができる。 The above programs may be downloaded to the storage 11 from a recording medium (not shown) via a reader/writer (not shown) or from a network via a communication I/F (Interface) (not shown), and then loaded onto the memory 12 and executed by the processor 13. Alternatively, the programs may be directly loaded onto the memory 12 from a recording medium (not shown) via a reader/writer or from a network via a communication I/F (not shown), and executed by the processor 13.
In other words, the program may be provided by a program product such as a recording medium.
As described above, the pattern detection device 100 can be realized by processing circuitry.

実施の形態２．
　図９は、実施の形態２に係る学習推論装置としてのパターン検出装置２００の構成を概略的に示すブロック図である。
　パターン検出装置２００は、学習データ生成部２０１と、真値データ構築部１０４と、リザバー回路学習部１０７と、適用データ生成部２１０と、リザバー回路処理部１１２と、適用データ出力部１１３とを備える。
　実施の形態２では、学習データ生成部２０１及び適用データ生成部２１０の一部として機能する直交フィルタ適用部２２０が設けられている。 Embodiment 2.
FIG. 9 is a block diagram showing a schematic configuration of a pattern detection apparatus 200 as a learning and inference apparatus according to the second embodiment.
The pattern detection device 200 includes a learning data generation unit 201, a true value data construction unit 104, a reservoir circuit learning unit 107, an application data generation unit 210, a reservoir circuit processing unit 112, and an application data output unit 113.
In the second embodiment, an orthogonal filter application unit 220 that functions as a part of the learning data generation unit 201 and the application data generation unit 210 is provided.

　実施の形態２に係るパターン検出装置２００の真値データ構築部１０４、リザバー回路学習部１０７、リザバー回路処理部１１２及び適用データ出力部１１３は、実施の形態１に係るパターン検出装置１００の真値データ構築部１０４、リザバー回路学習部１０７、リザバー回路処理部１１２及び適用データ出力部１１３と同様である。 The true value data construction unit 104, the reservoir circuit learning unit 107, the reservoir circuit processing unit 112, and the application data output unit 113 of the pattern detection device 200 according to the second embodiment are similar to the true value data construction unit 104, the reservoir circuit learning unit 107, the reservoir circuit processing unit 112, and the application data output unit 113 of the pattern detection device 100 according to the first embodiment.

　但し、実施の形態２に係るパターン検出装置２００のリザバー回路処理部１１２は、適用フェーズにおいて、直交フィルタ適用部２２０においてフィルタ処理された特徴ベクトルを、入力データとして、入力層１１２ａに入力する。 However, in the application phase, the reservoir circuit processing unit 112 of the pattern detection device 200 according to the second embodiment inputs the feature vector filtered by the orthogonal filter application unit 220 to the input layer 112a as input data.

　学習データ生成部２０１は、複数の学習対象画像のそれぞれに対応する複数の特徴ベクトルが時系列で配置された学習データを生成する。
　実施の形態２では、学習データ生成部２０１は、二次元画像である複数の学習対象画像の各々に対して直交フィルタを適用した結果である処理画像において予め定められた順番で画素値を抽出して、抽出された順番でその抽出された画素値を並べることで複数の特徴ベクトルの各々を生成する。 The learning data generating unit 201 generates learning data in which a plurality of feature vectors corresponding to a plurality of learning target images are arranged in time series.
In embodiment 2, the training data generation unit 201 extracts pixel values in a predetermined order from a processed image, which is the result of applying an orthogonal filter to each of a plurality of training images, which are two-dimensional images, and generates each of a plurality of feature vectors by arranging the extracted pixel values in the order in which they were extracted.

　学習データ生成部２０１は、学習用入力データ指定部２０２と、直交フィルタ適用部２２０と、学習用特徴ベクトル格納部１０３とを備える。
　実施の形態２に係るパターン検出装置２００の学習用特徴ベクトル格納部１０３は、実施の形態１に係るパターン検出装置１００の学習用特徴ベクトル格納部１０３と同様である。 The learning data generating unit 201 includes a learning input data specifying unit 202 , an orthogonal filter applying unit 220 , and a learning feature vector storage unit 103 .
The training feature vector storage unit 103 of the pattern detection device 200 according to the second embodiment is similar to the training feature vector storage unit 103 of the pattern detection device 100 according to the first embodiment.

　学習用入力データ指定部２０２は、学習フェーズにおいて、一つの学習用画像の解像度を変換した画像を、直交フィルタ適用部２２０に与える。 In the learning phase, the learning input data specification unit 202 provides the orthogonal filter application unit 220 with an image in which the resolution of one learning image has been converted.

　適用データ生成部２１０は、二次元画像である推論対象画像に対して直交フィルタを適用した結果である処理画像において予め定められた順番で画素値を抽出して、抽出された順番でその抽出された画素値を並べることで適用データを生成する。
　適用データ生成部２１０は、適用データ入力部２１１と、直交フィルタ適用部２２０とを備える。 The application data generation unit 210 extracts pixel values in a predetermined order from a processed image, which is the result of applying an orthogonal filter to the inference target image, which is a two-dimensional image, and generates application data by arranging the extracted pixel values in the order in which they were extracted.
The application data generation unit 210 includes an application data input unit 211 and an orthogonal filter application unit 220.

　適用データ入力部２１１は、適用フェーズにおいて、新たに二次元画像の入力を受け付けて、その二次元画像の解像度を変換した画像を、直交フィルタ適用部２２０に与える。 In the application phase, the application data input unit 211 accepts new input of a two-dimensional image and provides the two-dimensional image with its resolution converted to the orthogonal filter application unit 220.

　直交フィルタ適用部２２０は、学習用入力データ指定部２０２からの画像に直交フィルタを適用して特徴ベクトルを生成する。そして、直交フィルタ適用部２２０は、フィルタ処理された複数の特徴ベクトルが一定の時間長を持つように構成することで、学習用特徴ベクトルを生成し、その特徴ベクトルを学習用特徴ベクトル格納部１０３に与える。
　また、直交フィルタ適用部２２０は、適用データ入力部２１１からの画像に直交フィルタを適用して特徴ベクトルを生成し、その特徴ベクトルをリザバー回路処理部１１２に与える。 The orthogonal filter application unit 220 generates a feature vector by applying an orthogonal filter to the image from the learning input data designation unit 202. The orthogonal filter application unit 220 then generates a learning feature vector by configuring a plurality of filtered feature vectors to have a certain time length, and provides the feature vector to the learning feature vector storage unit 103.
Moreover, the orthogonal filter application unit 220 applies an orthogonal filter to the image from the application data input unit 211 to generate a feature vector, and provides the feature vector to the reservoir circuit processing unit 112.

　図１０は、実施の形態２における学習フェーズにおける学習用入力データ指定部２０２及び直交フィルタ適用部２２０での処理を説明するための概略図である。
　ここでは、直交フィルタ適用部２２０で用いる直交フィルタとして、フィルタ係数が「１」と、「－１」との２値からなるＷａｌｓｈ－Ｈａｄａｍａｒｄフィルタが利用される例を示す。また、実施の形態１と同様に、学習用入力データ指定部２０２に保持されている入力データを変換する動作のうち、学習用画像セットから正画像又は偽画像を真値として持つ一つの画像をＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋで学習する場合の学習フェーズでの処理について説明する。 FIG. 10 is a schematic diagram for explaining the processing in the learning input data designation unit 202 and the orthogonal filter application unit 220 in the learning phase in the second embodiment.
Here, an example is shown in which a Walsh-Hadamard filter whose filter coefficients take the two values "1" and "-1" is used as the orthogonal filter in the orthogonal filter application unit 220. As in the first embodiment, among the operations that convert the input data held in the learning input data designation unit 202, the processing in the learning phase is described for the case where the Echo State Network learns one image from the learning image set whose true value is either a positive image or a false image.

　まず、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋの学習に用いる一つの部分画像を示す二次元画像データが、学習用入力データ指定部２０２が有する学習画像入力バッファ２０２ａに格納される。 First, two-dimensional image data showing one partial image to be used for learning the Echo State Network is stored in the learning image input buffer 202a of the learning input data designation unit 202.

　次に、学習画像入力バッファ２０２ａに格納されている二次元画像データで示される部分画像を特徴ベクトル化するために、学習用入力データ指定部２０２は、その部分画像の解像度を変換して、変換後のデータを特徴ベクトル変換用画像バッファ２０２ｂに格納する。 Next, in order to convert the partial image represented by the two-dimensional image data stored in the training image input buffer 202a into a feature vector, the training input data designation unit 202 converts the resolution of the partial image and stores the converted data in the feature vector conversion image buffer 202b.

　次に、直交フィルタ適用部２２０の直交フィルタ適用処理ユニット２２０ａは、特徴ベクトル変換用画像バッファ２０２ｂに格納されている二次元画像に対し、フィルタ処理を行い、その結果を直交フィルタ出力バッファ２２０ｂに格納する。 Next, the orthogonal filter application processing unit 220a of the orthogonal filter application section 220 performs filter processing on the two-dimensional image stored in the feature vector conversion image buffer 202b, and stores the result in the orthogonal filter output buffer 220b.

　次に、直交フィルタ適用部２２０は、直交フィルタ出力バッファ２２０ｂに保存されているデータから、実施の形態１と同様に、特徴ベクトルを生成し、その特徴ベクトルを、特徴ベクトル用バッファ２２０ｃに格納する。 Next, the orthogonal filter application unit 220 generates a feature vector from the data stored in the orthogonal filter output buffer 220b, in the same manner as in embodiment 1, and stores the feature vector in the feature vector buffer 220c.

　以降、Ｅｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋによる学習フェーズの処理は実施の形態１と同様である。ここで、直交フィルタ適用部２２０が有する直交フィルタ適用処理ユニット２２０ａにおいて使用する直交フィルタの例として、下記の参考文献３に記載されているＷａｌｓｈ－Ｈａｄａｍａｒｄフィルタがある。 Then, the learning phase processing by the Echo State Network is the same as in embodiment 1. Here, an example of an orthogonal filter used in the orthogonal filter application processing unit 220a of the orthogonal filter application section 220 is the Walsh-Hadamard filter described in Reference 3 below.

　参考文献３：Ｙ．　Ｈｅｌ－Ｏｒ　ａｎｄ　Ｈ．　Ｈｅｌ－Ｏｒ，　“Ｒｅａｌ－Ｔｉｍｅ　Ｐａｔｔｅｒｎ　Ｍａｔｃｈｉｎｇ　Ｕｓｉｎｇ　Ｐｒｏｊｅｃｔｉｏｎ　Ｋｅｒｎｅｌｓ”，　ＩＥＥＥ　Ｔｒａｎｓａｃｔｉｏｎｓ　ｏｎ　Ｐａｔｔｅｒｎ　Ａｎａｌｙｓｉｓ　ａｎｄ　Ｍａｃｈｉｎｅ　Ｉｎｔｅｌｌｉｇｅｎｃｅ，　２７（９），　ｐｐ．１４３０－１４４５，　２００５． Reference 3: Y. Hel-Or and H. Hel-Or, “Real-Time Pattern Matching Using Projection Kernels”, IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(9), pp. 1430-1445, 2005.

　Ｗａｌｓｈ－Ｈａｄａｍａｒｄフィルタは、１次元として定義され、さらに１次元と１次元の積として二次元のＷａｌｓｈ－Ｈａｄａｍａｒｄフィルタを定義することができる。 A Walsh-Hadamard filter is defined in one dimension, and a two-dimensional Walsh-Hadamard filter can then be defined as the product of two one-dimensional filters.

　図１１は、二次元のＷａｌｓｈ－Ｈａｄａｍａｒｄフィルタの階数Ｎ＝８の例であり、８×８個の係数からなる１個の要素フィルタを、空間周波数を細かくする形で縦方向及び横方向に８×８個の要素フィルタを並べたもので、縦横８×８画素の画像に適用可能である。 FIG. 11 shows an example of a two-dimensional Walsh-Hadamard filter of order N=8. Each element filter consists of 8×8 coefficients, and 8×8 such element filters are arranged vertically and horizontally in order of increasing spatial frequency. The filter can be applied to an image of 8×8 pixels.

　ここでＷａｌｓｈ－Ｈａｄａｍａｒｄフィルタの階数Ｎは２のべき乗である必要があり、Ｎ＝２，４，８，１６，３２，・・・を取り得る。
　図１１に示されている個々のフィルタについて、白は「＋１」、黒は「－１」の係数を意味する。言い換えると、このＷａｌｓｈ－Ｈａｄａｍａｒｄフィルタを８×８画素の画像に適用する際は乗算を必要とせず、加減算だけでフィルタ演算がなされるため、空間周波数分解の高速演算が可能になる。 Here, the order N of the Walsh-Hadamard filter must be a power of 2, and can be N=2, 4, 8, 16, 32, and so on.
For each filter shown in FIG. 11, white means a coefficient of "+1" and black means a coefficient of "-1." In other words, applying this Walsh-Hadamard filter to an image of 8×8 pixels requires no multiplication; the filter calculation is performed by addition and subtraction alone, which enables high-speed spatial frequency decomposition.
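The addition-and-subtraction-only property described above can be illustrated with a fast Walsh-Hadamard transform. This is a sketch under stated assumptions, not the processing of the orthogonal filter application unit 220: the function names `fwht` and `wht2d` are hypothetical, and the coefficients come out in natural (Hadamard) order rather than in the increasing-spatial-frequency order shown in FIG. 11, although the set of coefficients is the same.

```python
def fwht(values):
    """Fast Walsh-Hadamard transform of a list whose length is a power of 2.

    Only additions and subtractions are used (natural/Hadamard ordering).
    """
    a = list(values)
    n = len(a)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of 2")
    h = 1
    while h < n:
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                a[i], a[i + h] = a[i] + a[i + h], a[i] - a[i + h]
        h *= 2
    return a


def wht2d(image):
    """Separable 2-D transform: 1-D transform of every row, then of every column."""
    rows = [fwht(row) for row in image]
    cols = [fwht(col) for col in zip(*rows)]
    return [list(r) for r in zip(*cols)]


# A constant 8x8 image has only the lowest-frequency (DC) coefficient.
flat = [[1] * 8 for _ in range(8)]
coeffs = wht2d(flat)
```

The two-dimensional transform is computed as a product of one-dimensional transforms along rows and columns, which corresponds to defining the two-dimensional filter from one-dimensional filters.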

　次に、実施の形態２における適用フェーズについて、図１２を用いて説明する。
　まず、一つの部分画像を示す二次元画像データが、適用データ入力部２１１が有する適用画像入力バッファ２１１ａに格納される。 Next, the application phase in the second embodiment will be described with reference to FIG. 12.
First, two-dimensional image data representing one partial image is stored in the application image input buffer 211a of the application data input unit 211.

　次に、適用画像入力バッファ２１１ａに格納されている二次元画像データで示される部分画像を特徴ベクトル化するために、適用データ入力部２１１は、その部分画像の解像度を変換して、変換後のデータを特徴ベクトル変換用画像バッファ２１１ｂに格納する。 Next, in order to convert the partial image represented by the two-dimensional image data stored in the application image input buffer 211a into a feature vector, the application data input unit 211 converts the resolution of the partial image and stores the converted data in the image buffer for feature vector conversion 211b.

　次に、直交フィルタ適用部２２０の直交フィルタ適用処理ユニット２２０ａは、特徴ベクトル変換用画像バッファ２１１ｂに格納されている二次元画像に対し、フィルタ処理を行い、その結果を直交フィルタ出力バッファ２２０ｂに格納する。 Next, the orthogonal filter application processing unit 220a of the orthogonal filter application section 220 performs filter processing on the two-dimensional image stored in the feature vector conversion image buffer 211b, and stores the result in the orthogonal filter output buffer 220b.

　次に、直交フィルタ適用部２２０は、直交フィルタ出力バッファ２２０ｂに保存されているデータから、実施の形態１と同様に、特徴ベクトルを生成し、その特徴ベクトルを、特徴ベクトル用バッファ２２０ｃに格納する。 Next, the orthogonal filter application unit 220 generates a feature vector from the data stored in the orthogonal filter output buffer 220b, as in the first embodiment, and stores the feature vector in the feature vector buffer 220c.

　学習済みのＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋを有したリザバー回路処理部１１２は、新たな二次元画像を列ベクトルとして直交フィルタ適用部２２０が構築した特徴ベクトルの入力を受け入れる。
　学習済のＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋは、入力層１１２ａ、中間層１１２ｂ及び出力層１１２ｃを備える。そして、リザバー回路処理部１１２は、学習済みのＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋにより出力ベクトルを算出して、適用データ出力部１１３の出力ベクトル格納バッファ１１３ａに、その出力ベクトルを格納する。 The reservoir circuit processing unit 112 having the trained echo state network receives an input of a feature vector constructed by the orthogonal filter application unit 220 with a new two-dimensional image as a column vector.
The trained Echo State Network includes an input layer 112a, an intermediate layer 112b, and an output layer 112c. The reservoir circuit processing unit 112 calculates an output vector using the trained Echo State Network, and stores the output vector in an output vector storage buffer 113a of the application data output unit 113.

　そして、適用データ出力部１１３では、判定回路１１３ｂが、出力ベクトルが検知対象であることを意味する正画像か、あるいは非検知対象である偽画像かを判定する。 Then, in the application data output unit 113, the determination circuit 113b determines whether the output vector is a positive image, meaning that it is a detection target, or a false image, meaning that it is not a detection target.

　なお、実施の形態２においては、実施の形態１と同様、学習フェーズにおいて、学習用入力データ指定部２０２に複数枚の学習用画像、学習用出力データ指定部１０５に、個々の学習用画像に紐づけられた真値ベクトルを複数個束ねて格納した上で、リザバー回路学習部１０７がＥｃｈｏ　Ｓｔａｔｅ　Ｎｅｔｗｏｒｋの学習を行ってもよい。 In the second embodiment, similarly to the first embodiment, in the learning phase, a plurality of learning images may be stored in the learning input data designation unit 202, and a plurality of true value vectors linked to each learning image may be stored in the learning output data designation unit 105, and then the reservoir circuit learning unit 107 may learn the Echo State Network.

　なお、実施の形態２では、直交フィルタ適用部２２０で用いる直交フィルタとして、Ｗａｌｓｈ－Ｈａｄａｍａｒｄフィルタを利用した例を説明したが、別の直交フィルタが利用されてもよい。例えば、上記の参考文献３で紹介されている、離散コサイン変換（ＤＣＴ：Ｄｉｓｃｒｅｔｅ　Ｃｏｓｉｎｅ　Ｔｒａｎｓｆｏｒｍ）又は高速フーリエ変換（ＦＦＴ：Ｆａｓｔ　Ｆｏｕｒｉｅｒ　Ｔｒａｎｓｆｏｒｍ）を使って、二次元画像が特徴ベクトルに変換されてもよい。 In the second embodiment, an example was described in which a Walsh-Hadamard filter was used as the orthogonal filter used in the orthogonal filter application unit 220, but another orthogonal filter may be used. For example, a two-dimensional image may be converted into a feature vector using a discrete cosine transform (DCT) or a fast Fourier transform (FFT) as introduced in the above reference 3.

　なお、実施の形態２では、リザバーコンピューティング回路に学習させる学習用パターンを特徴空間にマッピングしたときに、直交変換を利用して、真値の異なる学習用パターンの特徴空間内における距離を拡大することができるため、リザバーコンピューティングによる学習ネットワークを容易に構築することができるとともに、リザバーコンピューティング回路を利用したパターン検出の識別性能を向上させることが可能になる。 In addition, in the second embodiment, when the learning patterns to be trained by the reservoir computing circuit are mapped into the feature space, the distance in the feature space of the learning patterns having different true values can be expanded using an orthogonal transformation, so that a learning network using reservoir computing can be easily constructed and the discrimination performance of pattern detection using the reservoir computing circuit can be improved.

　以上に記載されたパターン検出装置２００も、図８に示されているようなコンピュータ１０で実現することができる。
　例えば、直交フィルタ適用部２２０も、ストレージ１１に記憶されているプログラムをメモリ１２にロードして、そのプログラムをプロセッサ１３が実行すること、又は、処理回路１４により構成することができる。 The above-described pattern detection device 200 can also be realized by a computer 10 as shown in FIG. 8.
For example, the orthogonal filter application unit 220 can also be configured by loading a program stored in the storage 11 into the memory 12 and having the processor 13 execute the program, or by the processing circuit 14.

実施の形態３．
　図１３は、実施の形態３に係るパターン検出装置３００の構成を概略的に示すブロック図である。
　パターン検出装置３００は、学習データ生成部３０１と、真値データ構築部１０４と、リザバー回路学習部１０７と、適用データ生成部３１０と、リザバー回路処理部１１２と、適用データ出力部１１３とを備える。
　実施の形態３では、学習データ生成部３０１及び適用データ生成部３１０の一部として機能する直交フィルタ適用部３２０及び値域変換部３２１が設けられている。 Embodiment 3.
FIG. 13 is a block diagram illustrating a schematic configuration of a pattern detection apparatus 300 according to the third embodiment.
The pattern detection device 300 includes a learning data generation unit 301, a true value data construction unit 104, a reservoir circuit learning unit 107, an application data generation unit 310, a reservoir circuit processing unit 112, and an application data output unit 113.
In the third embodiment, an orthogonal filter application unit 320 and a range conversion unit 321 that function as part of the learning data generation unit 301 and the application data generation unit 310 are provided.

The true value data construction unit 104, the reservoir circuit learning unit 107, the reservoir circuit processing unit 112, and the application data output unit 113 of the pattern detection device 300 according to Embodiment 3 are the same as the corresponding units of the pattern detection device 100 according to Embodiment 1.
However, in the application phase, the reservoir circuit processing unit 112 of the pattern detection device 300 according to Embodiment 3 inputs the feature vector converted by the value range conversion unit 321 to the input layer 112a as input data.

The learning data generation unit 301 generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series.
In Embodiment 3, the learning data generation unit 301 generates each of the feature vectors as follows. It applies an orthogonal filter to each of the learning target images, which are two-dimensional images, to obtain a processed image; applies a function that converts pixel values into a predetermined value range to the processed image to obtain a converted image; extracts pixel values from the converted image in a predetermined order; and arranges the extracted pixel values in the order of extraction.

The learning data generation unit 301 includes a learning input data specification unit 202, an orthogonal filter application unit 320, a value range conversion unit 321, and a learning feature vector storage unit 103.
The learning input data specification unit 202 of the pattern detection device 300 according to Embodiment 3 is the same as the learning input data specification unit 202 of the pattern detection device 200 according to Embodiment 2.
The learning feature vector storage unit 103 of the pattern detection device 300 according to Embodiment 3 is the same as the learning feature vector storage unit 103 of the pattern detection device 100 according to Embodiment 1.

The application data generation unit 310 generates, as application data, a feature vector from an inference target image, which is the image to be subjected to inference.
In Embodiment 3, the application data generation unit 310 generates the application data as follows. It applies an orthogonal filter to the inference target image, which is a two-dimensional image, to obtain a processed image; applies a function that converts pixel values into a predetermined value range to the processed image to obtain a converted image; extracts pixel values from the converted image in a predetermined order; and arranges the extracted pixel values in the order of extraction.

The application data generation unit 310 includes an application data input unit 211, an orthogonal filter application unit 320, and a value range conversion unit 321.
The application data input unit 211 of the pattern detection device 300 according to Embodiment 3 is the same as the application data input unit 211 of the pattern detection device 200 according to Embodiment 2.

The orthogonal filter application unit 320 applies an orthogonal filter to the image from the learning input data specification unit 202 and provides the filtered image data to the value range conversion unit 321.
The orthogonal filter application unit 320 likewise applies the orthogonal filter to the image from the application data input unit 211 and provides the filtered image data to the value range conversion unit 321.

In the learning phase, the value range conversion unit 321 generates a feature vector by applying a function that converts the value range to the image data obtained when the orthogonal filter application unit 320 filters the image from the learning input data specification unit 202. The value range conversion unit 321 then arranges the plurality of feature vectors to which the function has been applied so that they have a fixed time length, thereby generating learning feature vectors, and provides the learning feature vectors to the learning feature vector storage unit 103.
In the application phase, the value range conversion unit 321 generates a feature vector by applying the function that converts the value range to the image data obtained when the orthogonal filter application unit 320 filters the image from the application data input unit 211, and provides the feature vector to the reservoir circuit processing unit 112.

FIG. 14 is a schematic diagram for explaining the processing performed by the learning input data specification unit 202, the orthogonal filter application unit 320, and the value range conversion unit 321 in the learning phase of Embodiment 3.
Here, as in Embodiment 2, an example is shown in which a Walsh-Hadamard filter, whose filter coefficients take only the two values "1" and "-1", is used as the orthogonal filter in the orthogonal filter application unit 320.
The hyperbolic tangent function tanh(x) is used as the nonlinear function for value range conversion in the value range conversion unit 321. The following describes the operation of confining the output values of the Walsh-Hadamard filter, which are output from the orthogonal filter application unit 320, to the range between "-1" and "1" and constructing a feature vector from them.

Next, the orthogonal filter application processing unit 320a of the orthogonal filter application unit 320 filters the two-dimensional image stored in the feature vector conversion image buffer 202b and stores the result in the orthogonal filter output buffer 320b.

Next, in the value range conversion unit 321, the value range conversion function application processing unit 321a converts the values of the filtered image data stored in the orthogonal filter output buffer 320b using its value range conversion function, and the data having the converted values is stored in the value range conversion function output buffer 321b.

The value range conversion unit 321 then generates a feature vector from the data stored in the value range conversion function output buffer 321b in the same manner as in Embodiment 1, and stores the feature vector in the feature vector buffer 321c.
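The three learning-phase steps above (orthogonal filtering, value range conversion, and feature vector construction) can be sketched as follows. This is only an illustration under stated assumptions: the Walsh-Hadamard filter is modelled as a separable two-dimensional transform H·X·Hᵀ with a Sylvester-construction Hadamard matrix, the image sides are powers of two, and pixel values are read out in raster order. The names `hadamard_matrix` and `learning_feature_vector` and the `gain` parameter are hypothetical and are not taken from the specification.

```python
import numpy as np

def hadamard_matrix(n: int) -> np.ndarray:
    """Sylvester-construction Hadamard matrix with entries +1 / -1."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = np.array([[1.0]])
    while h.shape[0] < n:
        h = np.block([[h, h], [h, -h]])
    return h

def learning_feature_vector(image: np.ndarray, gain: float = 1.0) -> np.ndarray:
    """Orthogonal filter -> value range conversion -> feature vector."""
    rows, cols = image.shape
    # Corresponds to the orthogonal filter output buffer 320b.
    filtered = hadamard_matrix(rows) @ image @ hadamard_matrix(cols).T
    # Corresponds to the value range conversion function output buffer 321b.
    # `gain` is a hypothetical scale factor, not described in the source.
    converted = np.tanh(gain * filtered)
    # Corresponds to the feature vector buffer 321c (raster order assumed).
    return converted.reshape(-1)

image = np.arange(16, dtype=float).reshape(4, 4) / 15.0   # toy 4 x 4 image in [0, 1]
feature = learning_feature_vector(image)
print(feature.shape, feature.min(), feature.max())
```

The same function could serve the application phase, since the specification applies the identical filter and value range conversion to the inference target image.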

An example of the value range conversion function of the value range conversion function application processing unit 321a is the hyperbolic tangent function.
The hyperbolic tangent function tanh(x) is expressed by equation (6) below, and FIG. 15 shows its graph.

tanh(x) = (e^x - e^(-x)) / (e^x + e^(-x))   (6)

In Embodiment 2, the feature vector is constructed directly from the values in the orthogonal filter output buffer 220b, which holds the calculation result of the orthogonal filter application unit 220. In Embodiment 3, by contrast, even when the distribution of the orthogonal filter outputs for the input images varies widely, the value range conversion unit 321 applies tanh(x) to confine the values to the range -1 < f(x) < 1, which stabilizes the operation of the reservoir circuit learning unit 107 during learning.
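The bounding effect described above can be checked numerically with a short sketch. The sample values below are hypothetical filter outputs chosen to differ by several orders of magnitude; they are not taken from the specification.

```python
import numpy as np

# Hypothetical orthogonal filter outputs with widely varying magnitudes.
filter_outputs = np.array([-2040.0, -3.5, -0.2, 0.0, 0.2, 3.5, 2040.0])

# Value range conversion with the hyperbolic tangent function.
converted = np.tanh(filter_outputs)

print("raw range:      ", filter_outputs.min(), filter_outputs.max())
print("converted range:", converted.min(), converted.max())
```

Note that in double-precision arithmetic very large inputs round to exactly -1.0 or 1.0, so the strict inequality -1 < f(x) < 1 holds mathematically but the computed values can reach the endpoints.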

Next, the application phase in Embodiment 3 will be described with reference to FIG. 16.
First, two-dimensional image data representing one partial image is stored in the application image input buffer 211a of the application data input unit 211.

Next, the orthogonal filter application processing unit 320a of the orthogonal filter application unit 320 filters the two-dimensional image stored in the feature vector conversion image buffer 211b and stores the result in the orthogonal filter output buffer 320b.

The reservoir circuit processing unit 112, which has the trained Echo State Network, receives as input the feature vector that the value range conversion unit 321 has constructed as a column vector from the new two-dimensional image.
The trained Echo State Network includes an input layer 112a, an intermediate layer 112b, and an output layer 112c. The reservoir circuit processing unit 112 calculates an output vector using the trained Echo State Network and stores the output vector in the output vector storage buffer 113a of the application data output unit 113.
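The forward pass through the input layer, intermediate layer, and output layer can be sketched as follows. This assumes the commonly used Echo State Network update x(t) = tanh(W_in u(t) + W_res x(t-1)) with a linear readout y(t) = W_out x(t); this section of the specification does not state the update rule, and the function name `esn_infer`, the layer sizes, and the random weights are all hypothetical stand-ins.

```python
import numpy as np

def esn_infer(u_seq, w_in, w_res, w_out):
    """Run an echo state network over a sequence of feature vectors.

    u_seq : (T, n_in)      feature vectors in time order
    w_in  : (n_res, n_in)  fixed input weights
    w_res : (n_res, n_res) fixed recurrent weights of the intermediate layer
    w_out : (n_out, n_res) readout weights obtained by learning
    """
    state = np.zeros(w_res.shape[0])
    outputs = []
    for u in u_seq:
        state = np.tanh(w_in @ u + w_res @ state)   # intermediate layer update
        outputs.append(w_out @ state)               # output layer (linear readout)
    return np.array(outputs)

rng = np.random.default_rng(0)
n_in, n_res, n_out = 16, 30, 2
w_in = rng.uniform(-0.5, 0.5, (n_res, n_in))
w_res = rng.uniform(-0.5, 0.5, (n_res, n_res))
w_res *= 0.9 / np.max(np.abs(np.linalg.eigvals(w_res)))   # rescale to spectral radius 0.9
w_out = rng.uniform(-0.5, 0.5, (n_out, n_res))            # stand-in for trained weights

u_seq = np.tanh(rng.normal(size=(5, n_in)))               # five feature vectors in (-1, 1)
y_seq = esn_infer(u_seq, w_in, w_res, w_out)
print(y_seq.shape)
```

Each row of `y_seq` plays the role of one output vector that would be stored in the output vector storage buffer 113a.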

In Embodiment 3, as in Embodiment 1, the reservoir circuit learning unit 107 may train the Echo State Network in the learning phase after a plurality of learning images have been stored in the learning input data specification unit 202 and a plurality of true value vectors, each linked to an individual learning image, have been bundled and stored in the learning output data specification unit 105.
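Learning from a bundle of images and their true value vectors can be sketched as follows. The sketch assumes ridge regression of the readout weights, which is a common way to train an Echo State Network but is not stated in this section, and it assumes that each image's feature vector is held for a fixed number of time steps (`hold`) and that true and false are encoded as the vectors [1, 0] and [0, 1]. All names, sizes, and the toy data are hypothetical.

```python
import numpy as np

def collect_states(u_seq, w_in, w_res):
    """Drive the reservoir with the input sequence and record every state."""
    state = np.zeros(w_res.shape[0])
    states = []
    for u in u_seq:
        state = np.tanh(w_in @ u + w_res @ state)
        states.append(state)
    return np.array(states)                               # (T, n_res)

def train_readout(states, targets, ridge=1e-2):
    """Ridge regression: W_out^T = (X^T X + ridge * I)^-1 X^T Y."""
    gram = states.T @ states + ridge * np.eye(states.shape[1])
    return np.linalg.solve(gram, states.T @ targets).T    # (n_out, n_res)

rng = np.random.default_rng(0)
n_in, n_res, hold = 16, 100, 5

# Fixed (untrained) input and reservoir weights.
w_in = rng.uniform(-0.5, 0.5, (n_res, n_in))
w_res = rng.uniform(-0.5, 0.5, (n_res, n_res))
w_res *= 0.9 / np.max(np.abs(np.linalg.eigvals(w_res)))   # spectral radius 0.9

# Toy data: one prototype feature vector per class, plus small noise.
proto = {True: np.tanh(rng.normal(size=n_in)), False: np.tanh(rng.normal(size=n_in))}
labels = [True, False, True, True, False, False, True, False]

# Learning data and true value data arranged in the same time series.
u_seq = np.array([proto[lab] + 0.05 * rng.normal(size=n_in)
                  for lab in labels for _ in range(hold)])
d_seq = np.array([[1.0, 0.0] if lab else [0.0, 1.0]
                  for lab in labels for _ in range(hold)])

states = collect_states(u_seq, w_in, w_res)
w_out = train_readout(states, d_seq)                      # learned readout weights
pred = (states @ w_out.T).argmax(axis=1)                  # 0 means true, 1 means false
truth = d_seq.argmax(axis=1)
print("accuracy on the learning data:", (pred == truth).mean())
```

Only the readout weights are learned; the input and reservoir weights stay fixed, which is what keeps reservoir computing inexpensive to train.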

As described above, according to Embodiment 3, the value range conversion unit 321 confines the value of each element of the feature vector to the range between "-1" and "1" before the feature vector is input to the reservoir circuit processing unit 112 having the trained Echo State Network. This makes it possible to easily construct a learning network based on reservoir computing and to stabilize the processing operation of the reservoir circuit processing unit 112.

In Embodiment 3, narrowing the value range of the input data supplied to the reservoir computing circuit stabilizes the operation of the reservoir computing circuit, which makes it possible to improve the discrimination performance of pattern detection using the reservoir computing circuit.

In Embodiments 1 to 3 described above, the pattern detection devices 100 to 300 have been described as devices that perform the processing of both the learning phase and the inference phase, but Embodiments 1 to 3 are not limited to such devices.
For example, Embodiments 1 to 3 can also be configured as a learning device that performs the processing of the learning phase and an inference device that performs the processing of the inference phase.
Furthermore, the processing performed by the pattern detection devices 100 to 300 according to Embodiments 1 to 3 may be distributed among a plurality of computers that are connected to a network and can transmit and receive data to and from each other. In other words, Embodiments 1 to 3 may be configured as a pattern detection system, that is, a learning inference system made up of a plurality of computers.

100, 200, 300 Pattern detection device; 101, 201, 301 Learning data generation unit; 102, 202 Learning input data specification unit; 103 Learning feature vector storage unit; 104 True value data construction unit; 105 Learning output data specification unit; 106 Learning true value vector storage unit; 107 Reservoir circuit learning unit; 210, 310 Application data generation unit; 111, 211 Application data input unit; 112 Reservoir circuit processing unit; 113 Application data output unit; 220, 320 Orthogonal filter application unit; 321 Value range conversion unit.

Claims

A learning device comprising:
a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series;
a true value data construction unit that constructs true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors; and
a learning unit that uses the learning data and the true value data to train a three-layer neural network using reservoir computing.

The learning device according to claim 1, wherein
each of the plurality of learning target images is a two-dimensional image, and
the learning data generation unit generates each of the plurality of feature vectors by extracting pixel values from the two-dimensional image in a predetermined order and arranging the extracted pixel values in the order of extraction.

The learning device according to claim 1, wherein
each of the plurality of learning target images is a two-dimensional image, and
the learning data generation unit generates each of the plurality of feature vectors by extracting pixel values in a predetermined order from a processed image that is a result of applying an orthogonal filter to the two-dimensional image, and arranging the extracted pixel values in the order of extraction.

The learning device according to claim 1, wherein
each of the plurality of learning target images is a two-dimensional image, and
the learning data generation unit generates each of the plurality of feature vectors by extracting pixel values in a predetermined order from a converted image and arranging the extracted pixel values in the order of extraction, the converted image being a result of applying a function that converts pixel values into a predetermined value range to a processed image that is a result of applying an orthogonal filter to the two-dimensional image.

An inference device comprising:
an application data generation unit that generates a feature vector as application data from an inference target image, which is an image to be subjected to inference; and
a processing unit that generates a trained three-layer neural network using coupling constants of a three-layer neural network, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network, the coupling constants being obtained as a result of training the three-layer neural network using reservoir computing with learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series, and true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors.

The inference device according to claim 5, wherein
the inference target image is a two-dimensional image, and
the application data generation unit generates the application data by extracting pixel values from the two-dimensional image in a predetermined order and arranging the extracted pixel values in the order of extraction.

The inference device according to claim 5, wherein
the inference target image is a two-dimensional image, and
the application data generation unit generates the application data by extracting pixel values in a predetermined order from a processed image that is a result of applying an orthogonal filter to the two-dimensional image, and arranging the extracted pixel values in the order of extraction.

The inference device according to claim 5, wherein
the inference target image is a two-dimensional image, and
the application data generation unit generates the application data by extracting pixel values in a predetermined order from a converted image and arranging the extracted pixel values in the order of extraction, the converted image being a result of applying a function that converts pixel values into a predetermined value range to a processed image that is a result of applying an orthogonal filter to the two-dimensional image.

a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series;
a true value data construction unit that constructs true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors;
a learning unit that uses the learning data and the true value data to train a three-layer neural network using reservoir computing, and obtains coupling constants of the three-layer neural network as a result of the learning;
an application data generation unit that generates a feature vector as application data from an inference target image, which is an image to be subjected to inference; and
a processing unit that generates a trained three-layer neural network using the obtained coupling constants, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.

A program that causes a computer to function as:
a learning data generation unit that generates learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series;
a true value data construction unit that constructs true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors; and
a learning unit that uses the learning data and the true value data to train a three-layer neural network using reservoir computing.

A program that causes a computer to function as:
an application data generation unit that generates a feature vector as application data from an inference target image, which is an image to be subjected to inference; and
a processing unit that generates a trained three-layer neural network using coupling constants of a three-layer neural network, and infers whether the inference target image is true or false by inputting the application data to the trained three-layer neural network, the coupling constants being obtained as a result of training the three-layer neural network using reservoir computing with learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series, and true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors.

A learning method comprising:
generating learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series;
constructing true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors; and
training a three-layer neural network using reservoir computing by using the learning data and the true value data.

An inference method comprising:
generating a feature vector as application data from an inference target image, which is an image to be subjected to inference;
generating a trained three-layer neural network using coupling constants of a three-layer neural network, the coupling constants being obtained as a result of training the three-layer neural network using reservoir computing with learning data in which a plurality of feature vectors, each corresponding to one of a plurality of learning target images, are arranged in time series, and true value data in which a plurality of true value vectors, each indicating true or false for one of the plurality of learning target images, are arranged in correspondence with the time series of the plurality of feature vectors; and
inferring whether the inference target image is true or false by inputting the application data to the trained three-layer neural network.