JP2018181184A

JP2018181184A - Evaluation device, evaluation method, and program therefor

Info

Publication number: JP2018181184A
Application number: JP2017083501A
Authority: JP
Inventors: 安藤　丹一; Tanichi Ando; 丹一安藤
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 2017-04-20
Filing date: 2017-04-20
Publication date: 2018-11-15
Anticipated expiration: 2037-04-20
Also published as: WO2018193934A1; JP6917004B2

Abstract

PROBLEM TO BE SOLVED: To provide an evaluation device, an evaluation method, and a program therefor that can deal with the case where control performance of a learning module is degraded after additional learning.SOLUTION: An evaluation device 60 evaluating a second learning module additionally learning a first learning module comprises: a learning target receiving part 203 that receives a learning target to be achieved by the second learning module; an evaluation part 204 that evaluates the second learning module in terms of at least evaluation items included in the learning target to create evaluation data; a determination part 205 that determines whether the second learning module achieves the learning target by using the learning target and evaluation data; and a learning module selection part 206 that, when the second learning module is determined not to achieve the learning target, acquires a third learning module different from the second learning module on the basis of at least the learning target.SELECTED DRAWING: Figure 6

Description

本発明は、評価装置、評価方法及びそのプログラムに関する。 The present invention relates to an evaluation device, an evaluation method and a program thereof.

従来より、ニューラルネットワークの技術を用いてシステムの制御を行うことが知られている。例えば、特許文献１には、ニューラルネットワークを用いて、複数のかごの中から乗り場へ移動させるべき最適なかごを選択するエレベータシステムが開示されている。 It is known in the prior art to control a system using neural network technology. For example, Patent Document 1 discloses an elevator system that uses a neural network to select an optimal car to be moved from a plurality of cars to a landing.

また、引用文献２には、ニューラルネットワークを用いたパターン認識装置において誤認識を低減するために、ニューラルネットワークに対して追加学習を行う方法が開示されている。 Further, Patent Document 2 discloses a method of performing additional learning on a neural network in order to reduce erroneous recognition in a pattern recognition apparatus using the neural network.

特開２００５−２５５２８９号公報JP, 2005-255289, A 特開平９−６２６４８号公報JP-A-9-62648

ニューラルネットワークの技術を用いた制御システムでは、制御性能を向上等させるためにニューラルネットワークの追加学習を行うことがある。例えば、最初の学習時には、図１１の時刻１から時刻２の期間に得られたデータからランダムに選択した学習データＡを用い、その後、図１１の時刻２から時刻３の期間に得られたデータからランダムに選択した学習データＢを用いて追加学習を行う場合が考えられる。 In control systems using neural network technology, additional learning of neural networks may be performed to improve control performance. For example, at the time of the first learning, learning data A randomly selected from data obtained in the period of time 1 to time 2 in FIG. 11 is used, and then data obtained in the period of time 2 to time 3 in FIG. There may be a case where additional learning is performed using learning data B randomly selected from.

このような場合に、学習データの選択をランダムに行っていることに起因して、学習データＢが不適切な学習データを含む場合がある。このように不適切な学習データを含む追加学習データＢを用いて追加学習を行った場合、追加学習後のニューラルネットワークの制御性能は、追加学習以前のニューラルネットワークと比較して性能が劣化することがある。 In such a case, the learning data B may contain inappropriate learning data due to random selection of the learning data. As described above, when additional learning is performed using additional learning data B including inappropriate learning data, the control performance of the neural network after the additional learning is degraded compared to the neural network before the additional learning. There is.

また、時刻２から時刻３の期間において特殊なイベントが発生していた場合は、追加学習後のニューラルネットワークが、特殊なイベントに特化したものとなる可能性がある。その結果、追加学習後のニューラルネットワークの制御性能は、追加学習以前のニューラルネットワークと比較して性能が劣化してしまう恐れがある。 In addition, when a special event occurs in the period from time 2 to time 3, there is a possibility that the neural network after the additional learning becomes specialized for the special event. As a result, the control performance of the neural network after the additional learning may be degraded compared to the neural network before the additional learning.

そこで、本発明は、追加学習後に学習モジュールの制御性能が劣化した場合に対処し得る評価装置、評価方法及びそのプログラムを提供することを目的とする。 Then, this invention aims at providing the evaluation apparatus which can cope with the case where the control performance of a learning module deteriorates after additional learning, an evaluation method, and its program.

本発明の一態様に係る第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価装置は、第２の学習モジュールが達成すべき学習目標を受け付ける学習目標受付部と、少なくとも学習目標に含まれる評価項目について第２の学習モジュールの評価を行い、評価データを生成する評価部と、学習目標と評価データとを用いて、第２の学習モジュールが学習目標を達成したか否か判定する判定部と、学習目標を達成しないと判定される場合、少なくとも学習目標に基づいて、第２の学習モジュールと異なる第３の学習モジュールを取得する学習モジュール選択部とを備える。 An evaluation apparatus for evaluating a second learning module which additionally learns a first learning module according to an aspect of the present invention includes: a learning target receiving unit that receives a learning target to be achieved by the second learning module; Whether the second learning module has achieved the learning goal by using the evaluation unit that evaluates the second learning module about the evaluation items included in the goal and generates evaluation data, and the learning goal and the evaluation data A determination unit is provided, and a learning module selection unit that acquires a third learning module different from the second learning module based on at least the learning target when it is determined that the learning target is not achieved.

この態様によれば、追加学習後に学習モジュールの制御性能が学習目標を達成しなかった場合に、学習目標を達成しなかった学習モジュールを使用し続けることを回避することができるので、学習モジュールを用いるシステムの信頼性を高めることができる。例えば、学習目標を達成しなかった学習モジュールの使用を回避することで、システム全体の処理精度の低下を防ぐことができる。また、学習目標を達成しない場合に、少なくとも学習目標に基づいて、学習目標を達成しなかった学習モジュールと異なる学習モジュールを取得する構成により、学習モジュールを用いて制御を行う制御装置、評価を行う評価装置のいずれも複数の学習モジュールを保持する必要がないので、学習モジュールを記録するためのハードウェア資源を最小限にすることができる。 According to this aspect, if the control performance of the learning module does not achieve the learning goal after the additional learning, it is possible to avoid continuing to use the learning module that did not achieve the learning goal. The reliability of the system used can be increased. For example, by avoiding the use of a learning module that did not achieve the learning goal, it is possible to prevent a decrease in the processing accuracy of the entire system. In addition, when not achieving the learning goal, a control device that performs control using the learning module is evaluated, by using a configuration that acquires a learning module different from the learning module that did not achieve the learning goal based on at least the learning goal. Since none of the evaluation devices need to hold multiple learning modules, the hardware resources for recording learning modules can be minimized.

上記評価装置において、第３の学習モジュールは、第１の学習モジュールとしてもよい。この態様によれば、性能劣化前の学習モジュールを用いてシステムの制御を続行することができるので、学習モジュールを用いるシステムの安定性の低下を防ぐことができる。 In the evaluation device, the third learning module may be a first learning module. According to this aspect, since control of the system can be continued using the learning module prior to performance degradation, it is possible to prevent deterioration in the stability of the system using the learning module.

上記評価装置において、学習モジュールが学習目標を達成することができなかった要因を推定して、学習モジュールの追加学習に用いる学習データに関する要因改善データを生成する要因推定部をさらに備えてもよい。この態様によれば、性能劣化した学習モジュールから有益な情報を得ることができる。このような情報を活用することにより、学習モジュールの精度向上に必要となる学習データや処理を低減することができ、ＣＰＵの負荷を低減することができる。 The evaluation device may further include a factor estimation unit configured to estimate factor that the learning module could not achieve the learning target, and to generate factor improvement data related to learning data used for additional learning of the learning module. According to this aspect, useful information can be obtained from the degraded learning module. By utilizing such information, it is possible to reduce learning data and processing required to improve the accuracy of the learning module, and to reduce the load on the CPU.

上記評価装置において、学習モジュール選択部は、要因改善データに基づいて学習指示を生成してもよい。この態様によれば、性能劣化した学習モジュールが有する問題を克服し得る、効率的な追加学習を学習モジュールにさせることができる。この効率的な追加学習をさせた学習モジュールをシステムに用いることで、最短時間で、学習モジュールの更新による性能劣化を改善することができる。 In the above-described evaluation device, the learning module selection unit may generate a learning instruction based on the factor improvement data. According to this aspect, it is possible to cause the learning module to perform efficient additional learning, which can overcome the problem of the degraded learning module. By using the learning module that has been subjected to this efficient additional learning in the system, it is possible to improve the performance deterioration due to the updating of the learning module in the shortest time.

上記評価装置において、要因推定部は、推定した要因を出力し、推定した要因に関するユーザ入力を受信して、ユーザ入力に基づいて要因改善データを生成してもよい。この態様によれば、より信頼性の高い情報に基づいて、性能劣化要因に関する情報を生成することができる。 In the evaluation device, the factor estimation unit may output the estimated factor, receive a user input regarding the estimated factor, and generate factor improvement data based on the user input. According to this aspect, it is possible to generate the information on the performance deterioration factor based on the more reliable information.

本発明の他の態様に係る、第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価方法は、第２の学習モジュールが達成すべき学習目標を受け付ける工程と、少なくとも学習目標に含まれる評価項目について第２の学習モジュールの評価を行い、評価データを生成する工程と、学習目標と評価データとを用いて、第２の学習モジュールが学習目標を達成したか否か判定する工程と、学習目標を達成しないと判定される場合、少なくとも学習目標に基づいて、第２の学習モジュールと異なる第３の学習モジュールを取得する工程とを含む。 According to another aspect of the present invention, an evaluation method for evaluating a second learning module obtained by additionally learning a first learning module includes the steps of: receiving a learning target to be achieved by the second learning module; The second learning module is evaluated with respect to the evaluation items included in, and it is determined whether the second learning module has achieved the learning target, using the step of generating the evaluation data and the learning target and the evaluation data. And a step of acquiring a third learning module different from the second learning module based on at least the learning target if it is determined that the learning target is not achieved.

本発明の他の態様に係るプログラムは、コンピュータに、第１の学習モジュールを追加学習させた第２の学習モジュールを評価させるためのプログラムであって、第２の学習モジュールが達成すべき学習目標を受け付ける処理と、少なくとも学習目標に含まれる評価項目について第２の学習モジュールの評価を行い、評価データを生成する処理と、学習目標と評価データとを用いて、第２の学習モジュールが学習目標を達成したか否か判定する処理と、学習目標を達成しないと判定される場合、第２の学習モジュールと異なる第３の学習モジュールを取得する処理とを実行させる。 A program according to another aspect of the present invention is a program for causing a computer to evaluate a second learning module to which the first learning module is additionally learned, and a learning goal to be achieved by the second learning module. The second learning module uses the learning goal and the evaluation data to evaluate the second learning module for the process of accepting the second learning module for at least the evaluation item included in the learning goal, and the learning goal for the second learning module. And a process of acquiring a third learning module different from the second learning module when it is determined that the learning goal is not achieved.

本発明によれば、追加学習後に学習モジュールの制御性能が性能劣化した場合に対処し得る評価装置、評価方法及びそのプログラムを提供することができる。 According to the present invention, it is possible to provide an evaluation apparatus, an evaluation method, and a program thereof that can cope with the case where the control performance of the learning module degrades after the additional learning.

本発明の実施形態に係る学習システムのネットワーク構成を示す図である。It is a figure showing the network composition of the learning system concerning the embodiment of the present invention. 本発明の実施形態に係る学習装置の物理的構成を示す図である。It is a figure showing physical composition of a learning device concerning an embodiment of the present invention. 本発明の実施形態に係る追加学習制御装置の物理的構成を示す図である。It is a figure showing physical constitution of an additional learning control device concerning an embodiment of the present invention. 本発明の実施形態に係る学習装置の機能ブロック図である。It is a functional block diagram of a learning device concerning an embodiment of the present invention. 本発明の実施形態に係る追加学習制御装置の機能ブロック図である。It is a functional block diagram of an additional learning control device concerning an embodiment of the present invention. 本発明の実施形態に係る追加学習制御装置によって実行される追加学習処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the additional learning process performed by the additional learning control apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置によって実行される追加学習処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the additional learning process performed by the additional learning control apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る追加学習制御装置によって出力される画面の一例である。It is an example of the screen output by the additional learning control apparatus which concerns on embodiment of this invention. 本発明の別の実施形態に係る学習システムのネットワーク構成を示す図である。It is a figure which shows the network configuration of the learning system which concerns on another embodiment of this invention. 本発明の実施形態に係る評価装置の物理的構成を示す図である。It is a figure which shows the physical structure of the evaluation apparatus which concerns on embodiment of this invention. 学習データを選択する際の時系列を示す図である。It is a figure which shows the time series at the time of selecting learning data.

添付図面を参照して、本発明の実施形態について説明する。なお、以下の実施形態は、本発明の理解を容易にするためのものであり、本発明を限定して解釈するためのものではない。また、本発明は、その要旨を逸脱しない限り、さまざまな変形が可能である。さらに、当業者であれば、以下に述べる各要素を均等なものに置換した実施形態を採用することが可能であり、係る実施形態も本発明の範囲に含まれる。 Embodiments of the present invention will be described with reference to the accompanying drawings. The following embodiments are for the purpose of facilitating the understanding of the present invention, and are not for the purpose of limiting and interpreting the present invention. Moreover, the present invention can be variously modified without departing from the scope of the invention. Further, those skilled in the art can adopt an embodiment in which each of the elements described below is replaced with an equivalent one, and such embodiments are also included in the scope of the present invention.

（ネットワーク構成）
図１を参照して、本発明の所定の実施形態に係る学習システム１のネットワーク構成について説明する。学習システム１は、学習装置１０、追加学習制御装置２０、１又は複数のセンサ３０及び記憶装置４０を備える。学習装置１０は、通信ネットワークＮを介して、追加学習制御装置２０、１又は複数のセンサ３０及び記憶装置４０に接続される。通信ネットワークＮは、有線又は無線回線により構成された有線通信網及び無線通信網のいずれであってもよく、インターネットやＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ（ＬＡＮ）であってよい。 (Network configuration)
A network configuration of a learning system 1 according to a predetermined embodiment of the present invention will be described with reference to FIG. The learning system 1 includes a learning device 10, an additional learning control device 20, one or more sensors 30, and a storage device 40. The learning device 10 is connected to the additional learning control device 20, one or more sensors 30 and a storage device 40 via the communication network N. The communication network N may be any of a wired communication network and a wireless communication network configured by a wired or wireless channel, and may be the Internet or a Local Area Network (LAN).

学習装置１０は、記憶装置４０に記憶された学習データに基づいて、学習モジュールの学習を行い、学習済モジュールを記憶装置４０に記憶する。本実施形態に係る学習装置１０は、学習モジュールを備えるが、学習モジュールは、学習装置１０と別体の装置に備えられてもよい。 The learning device 10 learns the learning module based on the learning data stored in the storage device 40, and stores the learned module in the storage device 40. The learning device 10 according to the present embodiment includes a learning module, but the learning module may be provided in a device separate from the learning device 10.

なお、学習モジュールとは、学習能力を備えた専用若しくは汎用のハードウェア若しくはソフトウェアの一単位、又は、当該ハードウェア若しくはソフトウェアの一単位の組合せを含む。当該学習を行う学習モジュールには、学習データによりすでに何らかの学習を行っているものもあれば、学習前のものも含む。ここで、学習能力とは、あるタスクの処理能力を、学習データから得られる経験に基づいて向上させることのできる能力をいう。 The learning module includes one unit of dedicated or general-purpose hardware or software having a learning ability, or a combination of one unit of the hardware or software. Some learning modules that perform the learning include those that have already performed some learning using learning data, and those that have not yet learned. Here, the learning ability refers to the ability to improve the processing ability of a certain task based on the experience obtained from the learning data.

追加学習制御装置２０は、学習済モジュールを用いて、入力データの特徴に応じた出力データを出力する。本実施形態に係る追加学習制御装置２０は、学習済モジュール又は当該学習済モジュールの複製物を学習装置１０から取得して、学習モジュールとして設定する。追加学習制御装置２０は、設定された学習モジュールを用いて出力した出力データに対する評価を行い、評価データを出力することができる。追加学習制御装置２０は、設定された学習モジュールが達成すべき学習目標と評価データとを用いて、現在設定されている学習モジュールが学習目標を達成したか否か判定することができる。学習目標を達成していないと判定される場合、追加学習制御装置２０は、例えば以前設定されていた学習モジュールを学習装置１０から取得して、学習モジュールとして設定してもよい。なお、追加学習制御装置２０は、後述する評価装置６０の機能構成を備えており、評価装置６０を実質的に含むものである。 The additional learning control device 20 outputs the output data according to the feature of the input data using the learned module. The additional learning control device 20 according to the present embodiment acquires a learned module or a copy of the learned module from the learning device 10, and sets it as a learning module. The additional learning control device 20 can evaluate the output data output using the set learning module, and can output evaluation data. The additional learning control device 20 can use the learning target to be achieved by the set learning module and the evaluation data to determine whether the currently set learning module has achieved the learning target. When it is determined that the learning goal is not achieved, the additional learning control device 20 may acquire, for example, a learning module set previously, from the learning device 10 and set it as a learning module. The additional learning control device 20 has a functional configuration of an evaluation device 60 described later, and substantially includes the evaluation device 60.

なお、学習済モジュールの複製物とは、学習済モジュールの機能を再現することができる専用若しくは汎用のハードウェア若しくはソフトウェアの一単位、又は、当該ハードウェア若しくはソフトウェアの一単位の組合せを含む。 The copy of the learned module includes one unit of dedicated or general-purpose hardware or software capable of reproducing the function of the learned module, or a combination of one unit of the hardware or software.

学習済モジュールの複製物は、必ずしも学習能力を備えていなくてもよい。また、学習済モジュールの構成と、学習済モジュールの複製物の構成は、必ずしも一致していなくてもよい。また、学習済モジュールの複製物は、いわゆる蒸留によって得られる学習モジュールを含む。すなわち、学習済モジュールの複製物は、学習済モジュールの機能を保つように、学習済モジュールと構造が異なる他の学習モジュールを学習させることで得られる、学習済みの当該他の学習モジュールを含む。 The duplicate of the learned module may not necessarily have the learning ability. Also, the configuration of the learned module and the configuration of the duplicate of the learned module do not necessarily have to match. Also, a copy of the learned module includes a learning module obtained by so-called distillation. That is, the copy of the learned module includes the learned other learning module obtained by learning the learning module and another learning module having a different structure so as to maintain the function of the learned module.

ここで、当該他の学習モジュールは、学習済モジュールよりも構造が単純であってよく、よりデプロイに適したものであってよいし、当該他の学習モジュールの学習には、学習済モジュールの出力データを用いてよい。なお、学習済モジュールの複製物は、学習モジュールの学習過程において、オーバーフィッティングを防ぐ正則化の方法を変えたり、バックプロパゲーションの学習率を変えたり、重み係数の更新アルゴリズムを変えたりして得られる学習済モジュールを含む。 Here, the other learning module may be simpler in structure than the learned module and may be more suitable for deployment, and the output of the learned module may be used for learning of the other learning module. Data may be used. The copy of the learned module is obtained by changing the regularization method to prevent overfitting, changing the learning rate of back propagation, changing the algorithm of updating the weighting coefficient, or the like in the learning process of the learning module. Containing learned modules.

また、学習済モジュール又は当該学習済モジュールの複製物を取得するとは、学習済モジュールの機能を追加学習制御装置２０において再現するために必要な情報を取得することをいう。例えば、学習モジュールがニューラルネットワークを含む場合、学習済モジュール又は当該学習済モジュールの複製物を取得するとは、少なくとも、ニューラルネットワークのレイヤ数、各レイヤに関するノード数、ノード間を繋ぐリンクの重みパラメータ、各ノードに関するバイアスパラメータ及び各ノードに関する活性化関数の関数形に関する情報を取得することをいう。 Further, acquiring a learned module or a duplicate of the learned module means acquiring information necessary for reproducing the function of the learned module in the additional learning control device 20. For example, when the learning module includes a neural network, acquiring the learned module or a copy of the learned module means at least the number of layers of the neural network, the number of nodes related to each layer, and the weight parameter of the link connecting the nodes. It means obtaining information on the bias parameter for each node and the function form of the activation function for each node.

センサ３０は、物理量を検出する物理量センサ、化学量を検出する化学量センサ、情報を検出する情報センサのいずれであってもよいが、これらに限られるものではなく、任意のセンサを含み得る。物理量センサは、例えば光を検出して画像データや動画データを出力するカメラや、人の心拍を検出して心拍データを出力する心拍センサ、人の血圧を検出して血圧データを出力する血圧センサ及び人の体温を検出して体温データを出力する体温センサ等のバイタルセンサを含み、その他任意の物理量を検出して電気的信号を出力するセンサを含む。化学量センサは、例えばガスセンサ、湿度センサ、イオンセンサを含み、その他任意の化学量を検出して電気信号を出力するセンサを含む。情報センサは、例えば統計データから特定のパターンを検出するセンサを含み、その他任意の情報を検出するセンサを含む。 The sensor 30 may be any of a physical quantity sensor that detects a physical quantity, a chemical quantity sensor that detects a chemical quantity, and an information sensor that detects information, but is not limited to these and may include any sensor. The physical quantity sensor is, for example, a camera that detects light and outputs image data or moving image data, a heart rate sensor that detects heart rate of a person and outputs heart rate data, and a blood pressure sensor that detects blood pressure of a person and outputs blood pressure data And a vital sensor such as a body temperature sensor that detects body temperature of a person and outputs body temperature data, and includes a sensor that detects any other physical quantity and outputs an electrical signal. The stoichiometry sensor includes, for example, a gas sensor, a humidity sensor, an ion sensor, and a sensor that detects an arbitrary stoichiometry and outputs an electrical signal. The information sensor includes, for example, a sensor that detects a specific pattern from statistical data, and also includes a sensor that detects any other information.

記憶装置４０は、センサ３０によって出力されたセンシングデータを記憶する。また、記憶装置４０は、学習装置１０によって出力された学習済モジュールを記憶する。図１では、記憶装置４０を単一の記憶部として示しているが、記憶装置４０は、１又は複数のファイルサーバによって構成されてよい。 The storage device 40 stores sensing data output by the sensor 30. The storage device 40 also stores the learned module output by the learning device 10. Although FIG. 1 illustrates the storage device 40 as a single storage unit, the storage device 40 may be configured by one or more file servers.

なお、図１において、学習装置１０、追加学習制御装置２０及び記憶装置４０は、それぞれ別体として構成されているが、これらを一体として構成してもよい。すなわち、学習装置１０、追加学習制御装置２０及び記憶装置４０の全てを一体として構成してもよく、学習装置１０、追加学習制御装置２０及び記憶装置４０のうちの２つを選択的に一体として構成してもよい。このとき、一体として構成された、学習装置１０、追加学習制御装置２０及び記憶装置４０の各要素間は、内部バスを介して接続される。 In addition, in FIG. 1, although the learning apparatus 10, the additional learning control apparatus 20, and the memory | storage device 40 are each comprised separately, you may comprise these integrally. That is, all of the learning device 10, the additional learning control device 20, and the storage device 40 may be integrally configured, and two of the learning device 10, the additional learning control device 20, and the storage device 40 may be selectively integrated. It may be configured. At this time, the respective elements of the learning device 10, the additional learning control device 20, and the storage device 40 which are integrally configured are connected via an internal bus.

（物理的構成：学習装置）
図２を参照して、本発明の所定の実施形態に係る学習装置１０の物理的構成について説明する。学習装置１０は、制御部１０ａと、記憶部１０ｂと、通信部１０ｃと、入力部１０ｄと、表示部１０ｅを有する。これら各構成は、バスを介して相互にデータ送受信可能に接続される。 (Physical configuration: learning device)
The physical configuration of the learning device 10 according to a predetermined embodiment of the present invention will be described with reference to FIG. The learning device 10 includes a control unit 10a, a storage unit 10b, a communication unit 10c, an input unit 10d, and a display unit 10e. These components are mutually connected so as to be able to transmit and receive data via a bus.

制御部１０ａは、ハードウェアプロセッサに相当するＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ（ＣＰＵ）、及びメモリに相当するＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ（ＲＡＭ）を含む。ＣＰＵが記憶部１０ｂに記憶されているプログラムをＲＡＭに展開し、ＲＡＭに展開された当該プログラムを解釈及び実行することにより、制御部１０ａは、後述する図４の各部として機能する。 The control unit 10a includes a central processing unit (CPU) corresponding to a hardware processor, and a random access memory (RAM) corresponding to a memory. The CPU develops the program stored in the storage unit 10 b in the RAM, interprets and executes the program developed in the RAM, and the control unit 10 a functions as each unit in FIG. 4 described later.

なお、ハードウェアプロセッサの種類はＣＰＵに限定されない。例えば、ハードウェアプロセッサとして、ＣＰＵ、ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ（ＧＰＵ）、Ｆｉｅｌｄ−ｐｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ（ＦＰＧＡ）、ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ（ＤＳＰ）、ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ（ＡＳＩＣ）を単独で、又は、組合せて使用することができる。ＲＡＭは、データの書き換えが可能な記憶部であり、例えば半導体記憶素子で構成される。ＲＡＭは、ＣＰＵが実行するアプリケーション等のプログラムやデータを一時的に記憶する。 The type of hardware processor is not limited to the CPU. For example, using a CPU, a graphics processing unit (GPU), a field-programmable gate array (FPGA), a digital signal processor (DSP), an application specific integrated circuit (ASIC) alone or in combination as a hardware processor Can. The RAM is a storage unit in which data can be rewritten, and is formed of, for example, a semiconductor storage element. The RAM temporarily stores programs and data such as applications executed by the CPU.

記憶部１０ｂは、例えばＨａｒｄＤｉｓｋＤｒｉｖｅ（ＨＤＤ）やＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ（ＳＤＤ）等の不揮発性の記憶媒体である。記憶部１０ｂは、ＣＰＵが実行するプログラム及びデータを記憶する。 The storage unit 10 b is a non-volatile storage medium such as, for example, a hard disk drive (HDD) or a solid state drive (SDD). The storage unit 10 b stores programs and data executed by the CPU.

通信部１０ｃは、学習装置１０を通信ネットワークＮに接続するハードウェアインタフェースである。 The communication unit 10 c is a hardware interface that connects the learning device 10 to the communication network N.

入力部１０ｄは、ユーザからの入力を受け付けるものであり、例えば、キーボードやマウス、タッチパネルで構成される。 The input unit 10 d receives an input from the user, and is configured of, for example, a keyboard, a mouse, and a touch panel.

表示部１０ｅは、ＣＰＵによる処理結果を視覚的に表示するものであり、例えば、ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ（ＬＣＤ）により構成される。 The display unit 10 e visually displays the processing result of the CPU, and is configured of, for example, a Liquid Crystal Display (LCD).

学習装置１０は、例えば一般のパーソナルコンピュータのＣＰＵによって本実施形態に係る学習プログラムを実行することで構成されてよい。学習プログラムは、ＲＡＭや記憶部１０ｂ等のコンピュータによって読み取り可能な記憶媒体に記憶されて提供されてもよいし、通信部１０ｃにより接続される通信ネットワークＮを介して提供されてもよい。これらの物理的な構成は例示であって、必ずしも独立した構成でなくてもよい。 The learning device 10 may be configured, for example, by executing a learning program according to the present embodiment by a CPU of a general personal computer. The learning program may be stored and provided in a computer-readable storage medium such as a RAM or the storage unit 10b, or may be provided via a communication network N connected by the communication unit 10c. These physical configurations are examples and may not necessarily be independent configurations.

（物理的構成：追加学習制御装置）
図３を参照して、本発明の所定の実施形態に係る追加学習制御装置２０の物理的構成について説明する。追加学習制御装置２０も、学習装置１０と同様に、ＣＰＵ及びＲＡＭを含む制御部２０ａ、データ等を記憶する記憶部２０ｂ、ネットワークＮと接続するための通信部２０ｃ、ユーザからの入力を受け付ける入力部２０ｄ、表示部２０ｅ等を有する。これら各構成は、バスを介して相互にデータ送受信可能に接続される。ＣＰＵが記憶部２０ｂに記憶されているプログラムをＲＡＭに展開し、ＲＡＭに展開された当該プログラムを解釈及び実行することにより、制御部２０ａは、後述する図５の各部として機能する。 (Physical configuration: additional learning control device)
The physical configuration of the additional learning control device 20 according to the predetermined embodiment of the present invention will be described with reference to FIG. Similar to the learning device 10, the additional learning control device 20 also includes a control unit 20a including a CPU and a RAM, a storage unit 20b for storing data and the like, a communication unit 20c for connecting to the network N, and an input for receiving input from the user. It has a unit 20d, a display unit 20e and the like. These components are mutually connected so as to be able to transmit and receive data via a bus. The CPU develops the program stored in the storage unit 20b in the RAM, interprets and executes the program developed in the RAM, and the control unit 20a functions as each unit in FIG. 5 described later.

追加学習制御装置２０は、例えば一般のパーソナルコンピュータのＣＰＵによって追加学習制御プログラムを実行することで構成されてよい。追加学習制御プログラムは、ＲＡＭや記憶部２０ｂ等のコンピュータによって読み取り可能な記憶媒体に記憶されて提供されてもよいし、通信部２０ｃにより接続される通信ネットワークＮを介して提供されてもよい。 The additional learning control device 20 may be configured, for example, by executing the additional learning control program by the CPU of a general personal computer. The additional learning control program may be provided by being stored in a computer-readable storage medium such as a RAM or the storage unit 20b, or may be provided via the communication network N connected by the communication unit 20c.

（機能構成：学習装置）
図４を参照して、本発明の所定の実施形態に係る学習装置１０の機能構成について説明する。学習装置１０は、学習指示受付部１０１、学習データ取得部１０２、学習制御部１０３、学習モジュール１０４、学習済モジュール出力部１０５及び学習済モジュール抽出部１０６を備える。 (Functional configuration: learning device)
The functional configuration of the learning device 10 according to the predetermined embodiment of the present invention will be described with reference to FIG. The learning device 10 includes a learning instruction receiving unit 101, a learning data acquisition unit 102, a learning control unit 103, a learning module 104, a learned module output unit 105, and a learned module extraction unit 106.

学習指示受付部１０１は、入力部１０ｄを介したユーザからの学習指示、又は通信部１０ｃを介した追加学習制御装置２０からの学習指示を受け付けて、学習指示に含まれる情報を後述の学習データ取得部１０２に引き渡す。本実施形態では、学習指示には、学習データ取得条件、入力パラメータの指定等が含まれる。学習データ取得条件とは、学習モジュール１０４を学習させるための学習データとして使用できるデータのうちでも、ユーザからの学習指示を満たすために必要となる条件をいう。例えばセンサ３０で取得されるデータのうち、取得日時を指定したものでもよい。入力パラメータとは、学習指示に含まれる情報のうち、学習済モジュールの制御の性能に影響を与える要因をいう。 The learning instruction receiving unit 101 receives a learning instruction from the user via the input unit 10 d or a learning instruction from the additional learning control device 20 via the communication unit 10 c, and the information included in the learning instruction is learning data described later. Delivered to the acquisition unit 102. In the present embodiment, the learning instruction includes learning data acquisition conditions, designation of input parameters, and the like. Among the data that can be used as learning data for causing the learning module 104 to learn, the learning data acquisition condition is a condition that is necessary to satisfy a learning instruction from the user. For example, among the data acquired by the sensor 30, an acquisition date and time may be designated. The input parameter is a factor that affects the control performance of the learned module among the information included in the learning instruction.

学習データ取得部１０２は、学習データ取得条件を受信して、受信した学習データ取得条件に基づいて記憶装置４０から学習データを取得する。 The learning data acquisition unit 102 receives the learning data acquisition condition, and acquires learning data from the storage device 40 based on the received learning data acquisition condition.

学習制御部１０３は、学習データ取得部１０２が取得した学習データを用いて、学習モジュール１０４を学習させる。学習制御部１０３は、学習指示受付部１０１で受け付けた学習指示に基づいて学習を完了させる。学習が完了したと判断する基準は、例えば所定個数の学習データによる学習をした場合でもよい。または学習済モジュールの制御の性能が後述の学習目標を満たした場合に、学習を完了してもよい。学習が完了すると、学習制御部１０３は、学習済モジュールを記憶装置４０に記憶する。この際、本実施形態では、学習制御部１０３は、学習済モジュールを一意に識別可能な学習モジュール識別子及び学習データ取得条件と関連付けて学習済モジュールを記憶する。 The learning control unit 103 causes the learning module 104 to learn using the learning data acquired by the learning data acquisition unit 102. The learning control unit 103 completes the learning based on the learning instruction received by the learning instruction receiving unit 101. The criteria for determining that learning has been completed may be, for example, when learning is performed using a predetermined number of learning data. Alternatively, the learning may be completed when the control performance of the learned module satisfies the learning target described later. When the learning is completed, the learning control unit 103 stores the learned module in the storage device 40. At this time, in the present embodiment, the learning control unit 103 stores the learned module in association with the learning module identifier that can uniquely identify the learned module and the learning data acquisition condition.

ここで、記憶装置４０に記憶される学習済モジュールは、その更新履歴がわかるようにバージョン管理されることが望ましい。バージョン管理は、学習モジュール識別子そのもので行ってもよいし、別途設けたバージョン情報で行ってもよい。 Here, it is desirable that the learned modules stored in the storage device 40 be version-controlled so that the update history can be known. Version management may be performed by the learning module identifier itself or by separately provided version information.

学習モジュール１０４は、機械学習を実現するためのモジュールである。ここでは、学習モジュール１０４の一例としてニューラルネットワークを適用した実施例について説明する。しかしながら、ニューラルネットワークは学習モジュール１０４の一例にすぎず、学習装置１０は、学習モジュール１０４として他の構成を適用してもよい。 The learning module 104 is a module for realizing machine learning. Here, an embodiment to which a neural network is applied as an example of the learning module 104 will be described. However, the neural network is only an example of the learning module 104, and the learning device 10 may apply another configuration as the learning module 104.

学習済モジュール出力部１０５は、学習済モジュールと学習モジュール識別子とを例えば追加学習制御装置２０のような外部に出力する。 The learned module output unit 105 outputs the learned module and the learning module identifier to the outside such as the additional learning control device 20, for example.

学習済モジュール抽出部１０６は、学習モジュール抽出条件を受信して、受信した学習モジュール抽出条件に基づいて記憶装置４０から学習済モジュールを取得する。本実施形態では、学習モジュール抽出条件には、現在設定されている学習モジュールの学習モジュール識別子、及び抽出ポイントが含まれる。抽出ポイントは、性能が劣化する前の日付や現在設定されている学習モジュールから遡るバージョンの指定等、抽出対象となる学習モジュールを指定可能な情報を含む。例えば、抽出ポイントは、「２０１７年１２月３１日以前」、「現在設定されている学習モジュールの１つ前のバージョン」とすることができる。 The learned module extraction unit 106 receives the learning module extraction condition, and acquires the learned module from the storage device 40 based on the received learning module extraction condition. In the present embodiment, the learning module extraction condition includes the learning module identifier of the learning module currently set and the extraction point. The extraction point includes information that can specify a learning module to be extracted, such as a date before performance degradation or specification of a version retroactively set from a currently set learning module. For example, the extraction point can be “before December 31, 2017”, “previous version of currently set learning module”.

（機能構成：追加学習制御装置）
図５を参照して、本発明の所定の実施形態に係る追加学習制御装置２０の機能構成について説明する。追加学習制御装置２０は、学習済モジュール受付部２０１、学習モジュール２０２、学習目標受付部２０３、評価部２０４、判定部２０５、学習モジュール選択部２０６、要因推定部２０７、制御部２０８及びデータベース（ＤＢ）２０９を備える。 (Functional configuration: additional learning control device)
The functional configuration of the additional learning control device 20 according to the predetermined embodiment of the present invention will be described with reference to FIG. The additional learning control device 20 includes a learned module reception unit 201, a learning module 202, a learning target reception unit 203, an evaluation unit 204, a determination unit 205, a learning module selection unit 206, a factor estimation unit 207, a control unit 208, and a database (DB ) 209.

学習済モジュール受付部２０１は、学習済モジュールと学習モジュール識別子とを受け付けて、受け付けた学習済モジュールを学習モジュール２０２として設定する。本実施形態では、学習済モジュール受付部２０１は、学習装置１０の学習済モジュール出力部１０５から学習済モジュールと学習モジュール識別子とを受け付けて、受け付けた学習済モジュールを学習モジュール２０２として設定する。なお、学習済モジュール受付部２０１は、学習済モジュールを記憶装置４０から受け付けて、学習モジュール２０２として設定してもよい。ここでは、学習モジュール２０２の一例としてニューラルネットワークを適用した実施例について説明する。しかしながら、ニューラルネットワークは学習モジュール２０２の一例にすぎず、追加学習制御装置２０は、学習モジュール２０２として他の構成を適用してもよい。 The learned module receiving unit 201 receives the learned module and the learning module identifier, and sets the received learned module as the learning module 202. In the present embodiment, the learned module receiving unit 201 receives a learned module and a learning module identifier from the learned module output unit 105 of the learning device 10, and sets the received learned module as the learning module 202. Note that the learned module receiving unit 201 may receive the learned module from the storage device 40 and set it as the learning module 202. Here, an embodiment to which a neural network is applied as an example of the learning module 202 will be described. However, the neural network is only an example of the learning module 202, and the additional learning control device 20 may apply another configuration as the learning module 202.

学習目標受付部２０３は、入力部２０ｄを介して、学習モジュール２０２が達成すべき学習目標を受け付けて、受け付けた学習目標をＤＢ２０９に記憶する。本実施形態では、学習目標には、評価項目及び条件が含まれる。評価項目は、学習モジュール２０２を評価するための項目であり、例えば、学習モジュールが出力する出力データの精度を判断するために用いられる項目である。 The learning target receiving unit 203 receives the learning target to be achieved by the learning module 202 via the input unit 20 d, and stores the received learning target in the DB 209. In the present embodiment, the learning goal includes evaluation items and conditions. The evaluation item is an item for evaluating the learning module 202, and is an item used to determine the accuracy of output data output from the learning module, for example.

条件は、評価項目に対する条件として、例えば、評価項目「制御部２０８が制御する装置に対する外部操作回数／日」に対して、「当該評価項目が基準値ｘ以下」、「当該評価項目が直前に設定されていた学習モジュールより小さい」、評価項目「制御部２０８が制御する装置の消費電力量／月」に対して、「当該評価項目が基準値ｙ以下」、「当該評価項目が直前に設定されていた学習モジュールより小さい」、評価項目「所定の時間内に学習モジュール２０２が算出した値が許容変化率を超える回数／月」に対して、「当該評価項目が基準値ｚ以下」、「当該評価項目が直前に設定されていた学習モジュールより小さい」等とすることができる。 As the condition for the evaluation item, for example, “the evaluation item is less than or equal to the reference value x”, “the evaluation item is immediately before” with respect to the evaluation item “the number of external operations for the device controlled by the control unit 208 / day”. "The evaluation item is less than or equal to the reference value y", "the evaluation item is set immediately before" for the evaluation item "energy consumption of the device controlled by the control unit 208 / month", which is smaller than the learning module set. “The evaluation item is less than or equal to the reference value z”, “for the evaluation item“ the number of times the value calculated by the learning module 202 exceeds the allowable change rate / month ”within the predetermined time”, The evaluation item may be smaller than that of the learning module set immediately before.

評価部２０４は、記憶装置４０に記憶されたセンシングデータを用いて学習モジュール２０２の評価を行って評価データを生成し、学習済モジュール受付部２０１が受け付けた学習モジュール識別子と関連付けて、生成した評価データをＤＢ２０９に記憶する。本実施形態では、評価部２０４は、少なくとも学習目標に含まれる評価項目について評価を行い、評価データを生成することができる。評価部２０４は、学習モジュール２０２が設定されてから一定の期間の経過後に自動で評価を行うこともできるし、別途、ユーザからの評価指示を受け付けて評価を行ってもよい。例えば、評価部２０４は、学習目標を受け付けることに応答して、評価を行ってもよい。なお、評価部２０４は学習目標に含まれる評価項目の他に、予め設定された評価項目について評価を行ってもよい。 The evaluation unit 204 evaluates the learning module 202 using sensing data stored in the storage device 40 to generate evaluation data, and associates the generated evaluation data with the learning module identifier received by the learned module reception unit 201. The data is stored in DB 209. In the present embodiment, the evaluation unit 204 can perform evaluation on at least an evaluation item included in the learning target, and generate evaluation data. The evaluation unit 204 can automatically perform evaluation after a certain period of time has elapsed since the learning module 202 was set, or may separately perform evaluation by accepting an evaluation instruction from a user. For example, the evaluation unit 204 may perform the evaluation in response to receiving the learning target. In addition to the evaluation items included in the learning target, the evaluation unit 204 may evaluate the evaluation items set in advance.

判定部２０５は、学習目標受付部２０３が受け付けた学習目標とＤＢ２０９に記憶された評価データとを用いて、現在設定されている学習モジュールが、学習目標を達成したか否か判定する。学習目標を達成したと判定される場合、判定部２０５は、処理を終了する。 The determination unit 205 determines whether the currently set learning module has achieved the learning target, using the learning target received by the learning target receiving unit 203 and the evaluation data stored in the DB 209. If it is determined that the learning goal has been achieved, the determination unit 205 ends the process.

学習モジュール選択部２０６は、学習目標を達成していないと判定される場合に、学習装置１０の学習済モジュール抽出部１０６に後述するとおり学習モジュール抽出条件を送信し、その応答として得られた学習済モジュールを学習モジュール２０２として設定する。学習モジュール選択部２０６が学習済モジュール抽出部１０６に学習モジュール抽出条件を送信することにより得られた学習済モジュールは、学習モジュール２０２として設定される前に、再度学習目標を達成するか評価されてもよい。 When it is determined that the learning goal is not achieved, the learning module selection unit 206 transmits the learning module extraction condition to the learned module extraction unit 106 of the learning device 10 as described later, and learning obtained as a response thereof The completed module is set as the learning module 202. The learned module obtained by the learning module selection unit 206 transmitting the learning module extraction condition to the learned module extraction unit 106 is evaluated whether it achieves the learning goal again before being set as the learning module 202. It is also good.

また、学習モジュール選択部２０６は、後述の要因推定部２０７に、学習目標を達成することができなかった要因の推定を指示して、要因改善データを得てもよい。その後、要因推定部２０７から得られた要因改善データを用いて、学習モジュール選択部２０６は、学習装置１０に再追加学習を指示してもよい。 The learning module selection unit 206 may also obtain factor improvement data by instructing a factor estimation unit 207 described later to estimate a factor that could not achieve the learning target. Thereafter, using the factor improvement data obtained from the factor estimation unit 207, the learning module selection unit 206 may instruct the learning device 10 to perform readditional learning.

要因推定部２０７は、現在設定されている学習モジュールが学習目標を達成していないと判定される場合に、学習目標を達成することができなかった要因を推定する。本実施形態では、要因推定部２０７は、学習モジュール選択部２０６からの指示に応答して、学習目標と記憶装置４０のセンシングデータとを用いて、学習目標を達成することができなかった要因を推定する。学習目標を達成することができなかった要因を推定するために、記憶装置４０のセンシングデータを統計処理する。 The factor estimating unit 207 estimates factors that can not achieve the learning goal when it is determined that the currently set learning module does not achieve the learning goal. In the present embodiment, in response to the instruction from the learning module selection unit 206, the factor estimation unit 207 uses the learning target and the sensing data of the storage device 40, and causes the factor estimation unit 207 to fail to achieve the learning target. presume. The sensing data of the storage device 40 is statistically processed to estimate the factor that could not achieve the learning goal.

制御部２０８は、学習モジュール２０２が算出した値を用いて、制御を行う。本実施形態では学習モジュールを用いてシステムの制御を行う一例として制御を行う制御部を示すが、後述するように、本発明の実施形態は、学習モジュールを用いて処理を実行する様々なシステムに適用することができる。 The control unit 208 performs control using the value calculated by the learning module 202. Although a control unit that performs control is illustrated as an example of performing control of a system using a learning module in the present embodiment, an embodiment of the present invention applies to various systems that execute processing using a learning module as described later. It can apply.

ＤＢ２０９には、学習目標ＤＢ２０９１及び評価データＤＢ２０９２が記憶される。本実施形態では、学習目標ＤＢ２０９１には、学習目標受付部２０３が受け付けた学習目標に含まれる、評価項目及び条件が記憶される。また、評価データＤＢ２０９２には、評価部２０４が生成した評価データと評価対象である学習モジュールの学習モジュール識別子とが関連付けて記憶される。 A learning target DB 2091 and an evaluation data DB 2092 are stored in the DB 209. In the present embodiment, the learning goal DB 2091 stores evaluation items and conditions included in the learning goal accepted by the learning goal accepting unit 203. Further, the evaluation data DB 2092 stores the evaluation data generated by the evaluation unit 204 and the learning module identifier of the learning module to be evaluated in association with each other.

（追加学習処理）
［第１実施形態］
図６のフローチャートに沿って、追加学習制御装置２０によって実行される追加学習処理の第１実施形態について説明する。本発明の実施形態は、学習モジュールを用いて処理を実行する様々なシステムに適用することができ、その分野は特に限定されないが、以下の説明においては空調制御システムを例として記載する。 (Additional learning process)
First Embodiment
A first embodiment of the additional learning process performed by the additional learning control device 20 will be described along the flowchart of FIG. Embodiments of the present invention can be applied to various systems that execute processing using a learning module, and the field is not particularly limited, but in the following description, an air conditioning control system will be described as an example.

第１実施形態では、制御部２０８は例えば、空調制御部である。学習モジュール２０２はセンサ３０によって出力された現在室温、外気温、湿度等の値や既知の日時、部屋の体積等を入力パラメータとし、室温設定値を算出する。追加学習制御装置２０の制御部２０８は、学習モジュール２０２によって算出された室温設定値を用いて空調制御を行う。ここでは、制御部２０８が学習モジュール２０２を用いて実際に空調制御を行い、評価部２０４は、制御部２０８による制御結果について評価を行う実施例について説明する。しかしながら、代替として、追加学習制御装置２０はシミュレーション部（図示せず）を備えてもよい。代替の実施例では、シミュレーション部が学習モジュール２０２を用いて空調制御を行い、評価部２０４は、シミュレーション部によるシミュレーション結果について評価を行うことができる。 In the first embodiment, the control unit 208 is, for example, an air conditioning control unit. The learning module 202 calculates the room temperature setting value using the current room temperature, the outside air temperature, the value of humidity, etc., the known date and time, the volume of the room, etc. output by the sensor 30 as input parameters. The control unit 208 of the additional learning control device 20 performs air conditioning control using the room temperature setting value calculated by the learning module 202. Here, an embodiment will be described in which the control unit 208 actually performs air conditioning control using the learning module 202, and the evaluation unit 204 evaluates the control result by the control unit 208. However, alternatively, the additional learning control device 20 may include a simulation unit (not shown). In an alternative embodiment, the simulation unit may perform air conditioning control using the learning module 202, and the evaluation unit 204 may evaluate the simulation results by the simulation unit.

また、ここでは、追加学習制御装置２０が学習モジュール２０２及び制御部２０８を備えた実施例について説明するが、代替として、追加学習制御装置２０とは別体の制御装置が学習モジュール及び制御部を備えてもよい。 Further, although an embodiment in which the additional learning control device 20 includes the learning module 202 and the control unit 208 will be described here, alternatively, a control device separate from the additional learning control device 20 includes the learning module and the control unit. You may have.

学習装置１０の学習制御部１０３によって事前に行われた学習の結果、記憶装置４０には、２０１０年１月１日から２０１２年１２月３１日までの期間の学習データを用いて学習したニューラルネットワーク０と、２０１０年１月１日から２０１４年１２月３１日までの期間の学習データを用いて学習したニューラルネットワーク１と、２０１０年１月１日から２０１６年１２月３１日までの期間の学習データを用いて学習したニューラルネットワーク２とが記憶されているものとする。ニューラルネットワーク０は、学習モジュール識別子０と関連付けて記憶装置４０に記憶されている。ニューラルネットワーク１は、学習モジュール識別子１と関連付けて記憶装置４０に記憶されている。ニューラルネットワーク２は、学習モジュール識別子２と関連付けて記憶装置４０に記憶されている。 As a result of learning performed in advance by the learning control unit 103 of the learning device 10, a neural network learned using the learning data of the period from January 1, 2010 to December 31, 2012 in the storage device 40 0 and neural network 1 learned using learning data from January 1, 2010 to December 31, 2014, and learning from January 1, 2010 to December 31, 2016 It is assumed that a neural network 2 learned using data is stored. The neural network 0 is stored in the storage device 40 in association with the learning module identifier 0. The neural network 1 is stored in the storage device 40 in association with the learning module identifier 1. The neural network 2 is stored in the storage device 40 in association with the learning module identifier 2.

追加学習制御装置２０は、２０１７年１月３１日まで、ニューラルネットワーク１を学習モジュール２０２として設定しているものとする。記憶装置４０には、センサ３０によって出力されたセンシングデータが随時記憶されている。すなわち、記憶装置４０には、ニューラルネットワーク１を用いて空調制御を行っていた期間中の、現在室温や外気温等の入力パラメータに関するセンシングデータ、及び室温設定値に対する外部操作回数や消費電力量等の評価項目に関するセンシングデータが記憶されている。また、評価部２０４によって以前行われた評価の結果、追加学習制御装置２０の評価データＤＢ２０９２には、ニューラルネットワーク０の評価データ０、及びニューラルネットワーク１の評価データ１が記憶されている。 It is assumed that the additional learning control device 20 sets the neural network 1 as the learning module 202 until January 31, 2017. The storage device 40 stores sensing data output by the sensor 30 as needed. That is, in the storage device 40, sensing data regarding input parameters such as the current room temperature and the outside air temperature, and the number of times of external operation with respect to the room temperature setting value, power consumption etc. during the air conditioning control using the neural network 1. The sensing data on the evaluation items of are stored. Further, as a result of the evaluation performed previously by the evaluation unit 204, evaluation data 0 of the neural network 0 and evaluation data 1 of the neural network 1 are stored in the evaluation data DB 2092 of the additional learning control device 20.

Ｓ６０１において、追加学習制御装置２０の学習済モジュール受付部２０１は、ニューラルネットワークと学習モジュール識別子とを受け付けて、受け付けたニューラルネットワークを学習モジュール２０２として設定する。本実施形態では、２０１７年２月１日に、学習済モジュール受付部２０１は、学習装置１０の学習済モジュール出力部１０５から、ニューラルネットワーク２と学習モジュール識別子２と受け付けて、受け付けたニューラルネットワーク２を学習モジュール２０２として設定する。２０１７年２月１日以降、制御部２０８は、設定されたニューラルネットワーク２によって算出された室温設定値を用いて空調制御を行っている。その間、前述したように、記憶装置４０には、ニューラルネットワーク２を用いて空調制御を行っている期間中の、センサ３０によって出力されたセンシングデータが随時記憶されている。 In S601, the learned module receiving unit 201 of the additional learning control device 20 receives the neural network and the learning module identifier, and sets the received neural network as the learning module 202. In this embodiment, on February 1, 2017, the learned module reception unit 201 receives the neural network 2 and the learning module identifier 2 from the learned module output unit 105 of the learning device 10, and receives the received neural network 2 Are set as the learning module 202. From February 1, 2017, the control unit 208 performs air conditioning control using the room temperature set value calculated by the set neural network 2. Meanwhile, as described above, the storage device 40 stores the sensing data output by the sensor 30 as needed during the air conditioning control using the neural network 2.

Ｓ６０２において、追加学習制御装置２０の学習目標受付部２０３は、入力部２０ｄを介して、学習モジュール２０２が達成すべき学習目標を受け付けて、受け付けた学習目標をＤＢ２０９の学習目標ＤＢ２０９１に記憶する。本実施形態では、２０１７年４月１日に、入力部２０ｄを介して追加学習制御装置２０の管理者から学習目標を受け付けて、学習目標受付部２０３は、受け付けた学習目標を学習目標ＤＢ２０９１に記憶する。学習目標には、評価項目「室温設定値に対する外部操作回数／日」と、条件「直前に設定されていた学習モジュールより少ない」とが含まれるものとする。 In S602, the learning target receiving unit 203 of the additional learning control device 20 receives the learning target to be achieved by the learning module 202 via the input unit 20d, and stores the received learning target in the learning target DB 2091 of the DB 209. In the present embodiment, on April 1, 2017, a learning goal is received from the administrator of the additional learning control apparatus 20 via the input unit 20d, and the learning goal receiving unit 203 transfers the received learning goal to the learning goal DB 2091. Remember. The learning target includes the evaluation items “the number of external operations per day for the room temperature setting value / day” and the condition “less than the learning module set immediately before”.

Ｓ６０３において、追加学習制御装置２０の評価部２０４は、学習モジュール２０２の評価を行って評価データを生成し、学習済モジュール受付部２０１が受け付けた学習モジュール識別子と関連付けて、生成した評価データをＤＢ２０９の評価データＤＢ２０９２に記憶する。本実施形態では、Ｓ６０２で学習目標を受け付けることに応答して、評価部２０４は、ニューラルネットワーク２を用いて空調制御が行われた期間中に記憶装置４０に記憶されたセンシングデータを用いて、ニューラルネットワーク２の評価を行う。 In step S603, the evaluation unit 204 of the additional learning control device 20 evaluates the learning module 202 to generate evaluation data, and associates the generated evaluation data with the learning module identifier received by the learned module receiving unit 201. Are stored in the evaluation data DB 2092 of In the present embodiment, in response to the acceptance of the learning target in S602, the evaluation unit 204 uses sensing data stored in the storage device 40 during the period in which air conditioning control is performed using the neural network 2. The neural network 2 is evaluated.

ここで、評価部２０４は、Ｓ６０２で受け付けた学習目標に含まれる評価項目のみならず、予め設定された他の評価項目について評価を行ってもよい。このような構成とすることで、今後の学習目標において、異なる評価項目について、以前設定されていた学習モジュールとの比較条件が指定される場合に対応することができる。本実施形態では、評価部２０４は、学習目標に含まれる評価項目「室温設定値に対する外部操作回数／日」のみならず、予め設定された他の評価項目について評価を行って評価データ２を生成し、学習モジュール識別子２と関連付けて、評価データ２を評価データＤＢ２０９２に記憶する。 Here, the evaluation unit 204 may evaluate not only the evaluation items included in the learning target received in S602 but also other evaluation items set in advance. With such a configuration, it is possible to cope with the case where a comparison condition with a learning module that has been set previously is designated for different evaluation items in future learning goals. In the present embodiment, the evaluation unit 204 generates evaluation data 2 by evaluating not only the evaluation item “the number of external operations with respect to the room temperature setting value / day” included in the learning target but also other evaluation items set in advance. The evaluation data 2 is stored in the evaluation data DB 2092 in association with the learning module identifier 2.

次に、Ｓ６０４において、追加学習制御装置２０の判定部２０５は、Ｓ６０２で学習目標受付部２０３が受け付けた学習目標と評価データＤＢ２０９２に記憶された評価データとを用いて、現在設定されている学習モジュールが、学習目標を達成したか否か判定する。学習目標を達成したと判定される場合（Ｓ６０４：Ｙｅｓ）、追加学習制御装置２０は、処理を終了する。 Next, in step S604, the determination unit 205 of the additional learning control device 20 uses the learning target received by the learning target receiving unit 203 in step S602 and the evaluation data stored in the evaluation data DB 2092 to perform learning that is currently set. The module determines whether the learning goal has been achieved. If it is determined that the learning goal has been achieved (S604: Yes), the additional learning control device 20 ends the process.

本実施形態では、判定部２０５は、評価項目「室温設定値に対する外部操作回数／日」、及び、条件「直前に設定されていた学習モジュールより少ない」を含む学習目標と、評価データ１及び評価データ２とを用いて、ニューラルネットワーク２が、学習目標を達成したか否か判定する。その結果、学習目標を達成していないと判定され（Ｓ６０４：Ｎｏ）、処理はＳ６０５に進む。 In the present embodiment, the determination unit 205 includes a learning target including the evaluation item “the number of external operations per day for the room temperature setting value / day” and the condition “less than the learning module set immediately before”, evaluation data 1 and evaluation. Using the data 2, it is determined whether the neural network 2 has achieved the learning goal. As a result, it is determined that the learning target is not achieved (S604: No), and the process proceeds to S605.

Ｓ６０５において、追加学習制御装置２０の学習モジュール選択部２０６は、学習装置１０の学習済モジュール抽出部１０６に学習モジュール抽出条件を送信し、その応答として得られた学習モジュールを学習モジュール２０２として設定し、処理を終了する。本実施形態では、学習モジュール選択部２０６は、学習済モジュール抽出部１０６に学習モジュール抽出条件「現在設定されている学習モジュールの学習モジュール識別子：学習モジュール識別子２、抽出ポイント：現在設定されている学習モジュールの１つ前のバージョン」を送信し、応答として得たニューラルネットワーク１を学習モジュール２０２として設定する。 In S605, the learning module selection unit 206 of the additional learning control device 20 transmits the learning module extraction condition to the learned module extraction unit 106 of the learning device 10, and sets the learning module obtained as the response as the learning module 202. , End the process. In the present embodiment, the learning module selection unit 206 causes the learning module extraction unit 106 to set the learning module extraction condition “learning module identifier of learning module currently set: learning module identifier 2; extraction point: learning currently set The “previous version of the module” is transmitted, and the neural network 1 obtained as a response is set as the learning module 202.

［第２実施形態］
第１実施形態では、学習目標を達成していないと判定されたニューラルネットワーク２について、更新前のバージョンであるニューラルネットワーク１に戻す例について説明した。第２実施形態では、学習目標を達成していないと判定された場合に、ニューラルネットワークをさらに追加学習させる例について説明する。 Second Embodiment
In the first embodiment, an example has been described in which the neural network 2 determined as not achieving the learning target is returned to the neural network 1 that is the version before update. In the second embodiment, an example will be described in which the neural network is additionally trained when it is determined that the learning target is not achieved.

図７は、追加学習制御装置２０によって実行される追加学習処理の流れを示すフローチャートである。第２実施形態では第１実施形態と共通の事柄についての記述を省略し、異なる点についてのみ説明する。図７のＳ７０１からＳ７０５は、図６のＳ６０１からＳ６０５と同じ処理であるので、これらの処理の詳細な説明は省略する。 FIG. 7 is a flowchart showing the flow of the additional learning process performed by the additional learning control device 20. In the second embodiment, the description of matters common to the first embodiment will be omitted, and only different points will be described. Since S701 to S705 in FIG. 7 are the same processes as S601 to S605 in FIG. 6, detailed descriptions of these processes will be omitted.

Ｓ７０６において、学習モジュール選択部２０６は、追加学習制御装置２０の要因推定部２０７に、学習目標を達成することができなかった要因の推定を指示し、要因推定部２０７は、学習目標を達成することができなかった要因を推定する。本実施形態では、要因推定部２０７は、学習目標に含まれる評価項目「室温設定値に対する外部操作回数／日」について、記憶装置４０のセンシングデータを用いて要因を推定する。例えば、要因推定部２０７は、ニューラルネットワーク２に用いた学習データの期間である２０１０年１月１から２０１６年１２月３１日までの期間のセンシングデータを用いて、要因を推定する。 In S706, the learning module selection unit 206 instructs the factor estimation unit 207 of the additional learning control device 20 to estimate the factor that could not achieve the learning goal, and the factor estimation unit 207 achieves the learning goal. Estimate the factors that could not be done. In the present embodiment, the factor estimating unit 207 estimates factors using the sensing data of the storage device 40 for the evaluation item “the number of external operations with respect to the room temperature setting value / day” included in the learning target. For example, the factor estimating unit 207 estimates factors using sensing data of a period from January 1, 2010 to December 31, 2016, which is a period of learning data used for the neural network 2.

本実施形態では、要因推定部２０７は、既知のアルゴリズムを使用して、例えば学習目標に含まれる評価項目「室温設定値に対する外部操作回数／日」が他の日の値から大きく外れて多かった日について、考えられるパターンを見出す。例えば、特定の期間「２０１６年７月１日から２０１６年７月３１日まで」において「室温設定値に対する外部操作回数／日」が多いというパターンを見出したものとする。なお、パターンについては期間によるものだけでなく、入力パラメータに含まれる外気温、湿度等の値が所定の値と異なる値が続いたときに、「入力パラメータの値」が大きい又は少ないというパターンを見出してもよい。また、入力パラメータに含まれる部屋の体積が変わっていた場合に、「部屋の体積」が変化したというパターンを見出してもよい。 In the present embodiment, the factor estimating unit 207 uses, for example, a known algorithm, the evaluation item “the number of external operations to the room temperature setting value / day” included in the learning target is largely deviated from the values of other days. Find possible patterns for the day. For example, it is assumed that a pattern is found that “the number of external operations per day with respect to the room temperature setting value / day” is high in a specific period “from July 1, 2016 to July 31, 2016”. In addition, the pattern is not limited to the period, but when the values such as the outside air temperature and humidity included in the input parameter continue to differ from the predetermined values, the pattern in which the “input parameter value” is large or small You may find out. In addition, when the volume of the room included in the input parameter is changed, a pattern may be found that the "room volume" has changed.

Ｓ７０７において、要因推定部２０７は、外れ値のパターンを見出したか否か判定する。外れ値のパターンを見出していない場合（Ｓ７０７：Ｎｏ）、追加学習制御装置２０は処理を終了する。一方、外れ値のパターンを見出した場合（Ｓ７０７：Ｙｅｓ）、Ｓ７０８に進み、要因推定部２０７は、見出した外れ値のパターンを出力してもよい。本実施形態では、図８に示されるように、見出した外れ値のパターンを示すテキスト８０１「２０１６年７月１日から２０１６年７月３１日において、室温設定値に対する外部操作回数／日が多い。」、パターンが性能劣化要因か否かを設定する性能劣化要因ラジオボタン８０２、詳細設定ボタン８０３、及び決定ボタン８０４を含む要因確定画面を出力する。詳細設定ボタンについては後述する。 In S 707, the factor estimating unit 207 determines whether a pattern of outliers has been found. When the outlier pattern is not found (S 707: No), the additional learning control device 20 ends the process. On the other hand, when a pattern of outliers is found (S 707: Yes), the process proceeds to S 708, and the factor estimating unit 207 may output the found out value pattern. In the present embodiment, as shown in FIG. 8, a text 801 indicating the found outlier pattern “the number of external operations per day with respect to the room temperature setting value is large from July 1, 2016 to July 31, 2016. The factor determination screen including the performance deterioration factor radio button 802 for setting whether the pattern is the performance deterioration factor, the detail setting button 803, and the determination button 804 is output. The detail setting button will be described later.

Ｓ７０９において、要因推定部２０７は、ユーザ入力を受信したか否か判定する。ユーザ入力を受信した場合、Ｓ７１０に進み、要因推定部２０７は、ユーザ入力に基づいて要因改善データを生成し、生成した要因改善データを学習モジュール選択部２０６に戻す。要因改善データには、Ｓ７０８において要因推定部２０７が出力した外れ値のパターンのうち、性能劣化要因ラジオボタン８０２で性能劣化要因として選択されること等により、学習データに含めることが望ましくないことが指示されたパターンが含まれる。 In step S709, the factor estimating unit 207 determines whether a user input has been received. If the user input is received, the process proceeds to S 710, the factor estimating unit 207 generates factor improvement data based on the user input, and returns the generated factor improvement data to the learning module selection unit 206. In the factor improvement data, it is not desirable to include in the learning data by being selected as the performance deterioration factor by the performance deterioration factor radio button 802 among the patterns of the outliers outputted by the factor estimation unit 207 in S708. It contains the indicated pattern.

本実施形態では、追加学習制御装置２０の管理者は、２０１６年７月１日から２０１６年７月３１日までの期間において室内工事が行われていたことを確認し、Ｓ７０８において出力された画面において、性能劣化要因ラジオボタン８０２で性能劣化要因として選択し、決定ボタン８０４を押下したものとする。その結果、要因推定部２０７は、受信したユーザ入力に基づいて要因改善データ「２０１６年７月１日から２０１６年７月３１日までの期間を除く」を生成し、生成した要因改善データを学習モジュール選択部２０６に戻す。 In the present embodiment, the administrator of the additional learning control device 20 confirms that indoor work has been performed in the period from July 1, 2016 to July 31, 2016, and the screen output in S 708 It is assumed that the performance deterioration factor radio button 802 is selected as the performance deterioration factor and the determination button 804 is pressed. As a result, the factor estimating unit 207 generates factor improvement data “except for the period from July 1, 2016 to July 31, 2016” based on the received user input, and learns the factor improvement data that has been generated. It returns to the module selection unit 206.

Ｓ７１１において、学習モジュール選択部２０６は、要因改善データに基づいて学習指示を生成し、生成した学習指示を学習装置１０の学習指示受付部１０１に送信して、処理を終了する。本実施形態では、学習モジュール選択部２０６は、「学習データ取得条件：２０１０年１月１日から２０１６年１２月３１日まで、２０１６年７月１日から２０１６年７月３１日までの期間を除く」を含む学習指示を学習指示受付部１０１に送信する。 In S711, the learning module selection unit 206 generates a learning instruction based on the factor improvement data, transmits the generated learning instruction to the learning instruction receiving unit 101 of the learning device 10, and ends the process. In the present embodiment, the learning module selection unit 206 sets “the learning data acquisition condition: a period from January 1, 2016 to December 31, 2016, from July 1, 2016 to July 31, 2016. A learning instruction including “excluding” is transmitted to the learning instruction receiving unit 101.

なお、本実施形態では、Ｓ７０７で外れ値のパターンを見出したと判定される場合に、Ｓ７１０でユーザの入力に基づいて要因改善データを生成したが、Ｓ７０８からＳ７１０を省略してユーザの入力を受け付けることなく、要因推定部２０７が、見出した外れ値のパターンに基づいて要因改善データを生成してもよい。 In the present embodiment, when it is determined that the outlier pattern is found in S707, the factor improvement data is generated based on the user's input in S710, but S708 to S710 are omitted and the user's input is accepted. Instead, the factor estimation unit 207 may generate factor improvement data based on the found outlier pattern.

このようにすることで、学習装置１０は、受信した学習指示に基づいて、ニューラルネットワークをさらに追加学習させることができる。本実施形態では、学習装置１０は、２０１６年７月１日から２０１６年７月３１日までの期間を除く学習データを用いて、ニューラルネットワーク１をさらに追加学習させることができる。 By doing this, the learning device 10 can additionally learn the neural network based on the received learning instruction. In the present embodiment, the learning device 10 can additionally learn the neural network 1 by using the learning data excluding the period from July 1, 2016 to July 31, 2016.

［第３実施形態］
第２実施形態では、学習目標を達成していないと判定された場合に、学習データに含めることが望ましくないデータを除外して、ニューラルネットワークをさらに追加学習させる例について説明した。第３実施形態では、学習目標を達成していないと判定された場合に、ニューラルネットワークに対してさらに学習させるべきデータを追加して、ニューラルネットワークを追加学習させる例について説明する。 Third Embodiment
In the second embodiment, an example has been described in which, when it is determined that the learning target is not achieved, data that is not desirable to be included in the learning data is excluded and the neural network is additionally learned. In the third embodiment, an example will be described in which, when it is determined that the learning target is not achieved, data to be learned further is added to the neural network to additionally learn the neural network.

図７を参照して、第３実施形態について説明する。第３実施形態においても、第１実施形態及び第２実施形態と共通の事柄についての記述を省略し、異なる点についてのみ説明する。図７のＳ７０１からＳ７０５までは、第１実施形態と同じ処理であるので、これらの処理の詳細な説明は省略する。 A third embodiment will be described with reference to FIG. Also in the third embodiment, descriptions of matters in common with the first embodiment and the second embodiment will be omitted, and only differences will be described. Since S701 to S705 in FIG. 7 are the same processes as in the first embodiment, detailed description of these processes is omitted.

Ｓ７０６において、要因推定部２０７は、学習目標を達成することができなかった要因を推定する。本実施形態では、特定の時間帯「毎週土曜日の８時から９時」において「室温設定値に対する外部操作回数」が多いというパターンを見出したものとする。なお、パターンについては期間によるものだけでなく、入力パラメータに含まれる外気温、湿度等の値が所定の値と異なる値が続いたときに、「入力パラメータの値」が大きい又は少ないというパターンを見出してもよい。また、入力パラメータに含まれる部屋の体積が変わっていた場合に、「部屋の体積」が変化したというパターンを見出してもよい。 In S706, the factor estimating unit 207 estimates factors that could not achieve the learning goal. In the present embodiment, it is assumed that a pattern is found that "the number of external operations with respect to the room temperature setting value" is large in a specific time zone "8 to 9 o'clock every Saturday". In addition, the pattern is not limited to the period, but when the values such as the outside air temperature and humidity included in the input parameter continue to differ from the predetermined values, the pattern in which the “input parameter value” is large or small You may find out. In addition, when the volume of the room included in the input parameter is changed, a pattern may be found that the "room volume" has changed.

Ｓ７０７において、要因推定部２０７は、外れ値のパターンを見出したか否か判定する。本実施形態では、外れ値のパターンを見出したのでＳ７０８に進み、要因推定部２０７は、見出した外れ値のパターンを出力してもよい。本実施形態では、見出した外れ値のパターンを示すテキスト８０１「毎週土曜日の８時から９時において、室温設定値に対する外部操作が多い。」、性能劣化要因ラジオボタン８０２、詳細設定ボタン８０３、及び決定ボタン８０４を含む要因確定画面を出力する。 In S 707, the factor estimating unit 207 determines whether a pattern of outliers has been found. In the present embodiment, since the pattern of outliers is found, the process proceeds to S 708, and the factor estimating unit 207 may output the found pattern of outliers. In the present embodiment, a text 801 indicating the found outlier pattern “There are many external operations to the room temperature setting value from 8 o'clock to 9 o'clock every Saturday”, the performance deterioration factor radio button 802, the detail setting button 803, and The factor determination screen including the determination button 804 is output.

本実施形態では、追加学習制御装置２０の管理者は、毎週土曜日は平日よりも少ない数のユーザが部屋を利用することから、室温設定値を平日と比較して高く設定することが望ましいことを確認し、Ｓ７０９において出力された画面において、性能劣化要因ラジオボタン８０２で非性能劣化要因として選択し、詳細設定ボタン８０３を押下したものとする。例えば詳細設定ボタン８０３を押下することにより、詳細設定画面が表示され、管理者は、この詳細設定画面において入力パラメータの調整を行ってもよい。代替として、追加学習制御装置２０が、入力パラメータの調整を行ってもよい。 In the present embodiment, the administrator of the additional learning control device 20 preferably sets the room temperature setting value higher than that on weekdays since every Saturday a smaller number of users than weekdays use a room. In the screen output in S709, the performance degradation factor radio button 802 is selected as the non-performance degradation factor, and the detail setting button 803 is pressed. For example, when the detail setting button 803 is pressed, a detail setting screen is displayed, and the administrator may adjust the input parameter on the detail setting screen. Alternatively, the additional learning control device 20 may adjust the input parameters.

Ｓ７０９において、要因推定部２０７は、ユーザ入力を受信したか否か判定する。本実施形態では、管理者が、詳細設定画面において入力パラメータの調整をし、その後、決定ボタン８０４を押下したものとする。その結果、処理はＳ７１０に進み、要因推定部２０７は、ユーザ入力に基づいて要因改善データを生成し、生成した要因改善データを学習モジュール選択部２０６に戻す。本実施形態では、要因改善データには、詳細設定画面で指定された入力パラメータに関する設定が含まれる。 In step S709, the factor estimating unit 207 determines whether a user input has been received. In this embodiment, it is assumed that the administrator adjusts the input parameter on the detail setting screen and then presses the determination button 804. As a result, the process advances to step S710, and the factor estimation unit 207 generates factor improvement data based on the user input, and returns the generated factor improvement data to the learning module selection unit 206. In the present embodiment, the factor improvement data includes the setting regarding the input parameter designated on the detail setting screen.

Ｓ７１１において、学習モジュール選択部２０６は、要因改善データに基づいて学習指示を生成し、生成した学習指示を学習装置１０の学習指示受付部１０１に送信して、処理を終了する。本実施形態では、学習モジュール選択部２０６は、詳細設定画面で指定された入力パラメータに関する設定を含む学習指示を学習指示受付部１０１に送信する。 In S711, the learning module selection unit 206 generates a learning instruction based on the factor improvement data, transmits the generated learning instruction to the learning instruction receiving unit 101 of the learning device 10, and ends the process. In the present embodiment, the learning module selection unit 206 transmits, to the learning instruction receiving unit 101, a learning instruction including the setting related to the input parameter specified on the detail setting screen.

このようにすることで、学習装置１０は、受信した学習指示に基づいて、ニューラルネットワークをさらに追加学習させることができる。本実施形態では、学習装置１０は、追加学習制御装置２０が見出したパターンが性能劣化要因ではなく、ニューラルネットワークにさらに学習させるべきパターンである場合に、追加の学習データを用いて、ニューラルネットワーク２をさらに追加学習させることができる。 By doing this, the learning device 10 can additionally learn the neural network based on the received learning instruction. In the present embodiment, the learning device 10 uses the additional learning data when the pattern found by the additional learning control device 20 is not a performance degradation factor but a pattern to be made to further learn by the neural network. Can be further trained.

［別の実施形態］
上述したとおり、別の実施形態では、図９に示されるように、学習システム１は、学習済モジュール受付部２０１と、学習モジュール２０２と、制御部２０８とを備えた制御装置５０を備えてもよい。学習システム１が制御装置５０を備える本実施形態では、学習システム１は、学習目標受付部２０３、評価部２０４、判定部２０５、学習モジュール選択部２０６、要因推定部２０７及びＤＢ２０９を備えた評価装置６０を備える。前述したように、追加学習制御装置２０は、評価装置６０を含む。 Another Embodiment
As described above, in another embodiment, as illustrated in FIG. 9, the learning system 1 includes the control device 50 including the learned module reception unit 201, the learning module 202, and the control unit 208. Good. In the present embodiment in which the learning system 1 includes the control device 50, the learning system 1 includes the learning target receiving unit 203, the evaluation unit 204, the determination unit 205, the learning module selection unit 206, the factor estimation unit 207, and the DB 209 60 is provided. As described above, the additional learning control device 20 includes the evaluation device 60.

評価装置６０は、図１０に示されるように、ＣＰＵ及びＲＡＭを含む制御部６０ａ、ＤＢ２０９のデータ等を記憶する記憶部６０ｂ、ネットワークＮと接続するための通信部６０ｃ、ユーザからの入力を受け付ける入力部６０ｄ、表示部６０ｅ等を備えている。これら各構成は、バスを介して相互にデータ送受信可能に接続される。ＣＰＵが記憶部に記憶されているプログラムをＲＡＭに展開し、ＲＡＭに展開された当該プログラムを解釈及び実行することにより、制御部６０ａは、図９の各部として機能する。 As shown in FIG. 10, the evaluation device 60 receives a control unit 60a including a CPU and a RAM, a storage unit 60b for storing data of the DB 209, a communication unit 60c for connecting to the network N, and an input from a user. An input unit 60d, a display unit 60e and the like are provided. These components are mutually connected so as to be able to transmit and receive data via a bus. The control unit 60a functions as each unit in FIG. 9 by the CPU developing the program stored in the storage unit in the RAM and interpreting and executing the program expanded in the RAM.

評価装置６０は、例えば一般のパーソナルコンピュータのＣＰＵによって追加学習プログラムを実行することで構成されてよい。追加学習プログラムは、ＲＡＭや記憶部６０ｂ等のコンピュータによって読み取り可能な記憶媒体に記憶されて提供されてもよいし、通信部により接続される通信ネットワークＮを介して提供されてもよい。 The evaluation device 60 may be configured, for example, by executing an additional learning program by a CPU of a general personal computer. The additional learning program may be provided by being stored in a computer-readable storage medium such as a RAM or the storage unit 60b, or may be provided via the communication network N connected by the communication unit.

本明細書において説明した各処理を実施するプログラムは、記録媒体に記憶させてもよい。この記録媒体を用いれば、コンピュータに上記プログラムをインストールすることにより、当該コンピュータを評価装置６０又は追加学習制御装置２０として機能させることができる。ここで、上記プログラムを記憶した記録媒体は、非一過性の記録媒体であってもよい。非一過性の記録媒体は特に限定されないが、例えば、ＣＤ−ＲＯＭ等の記録媒体であってもよい。 A program for performing each process described in the present specification may be stored in a recording medium. If this recording medium is used, the computer can be made to function as the evaluation device 60 or the additional learning control device 20 by installing the program in the computer. Here, the recording medium storing the program may be a non-transitory recording medium. The non-transitory recording medium is not particularly limited, but may be, for example, a recording medium such as a CD-ROM.

上記の実施形態の一部又は全部は、以下の付記のようにも記載され得るが、以下には限られない。 Some or all of the above embodiments may be described as in the following appendices, but are not limited to the following.

（付記１）
少なくとも１つのメモリと、前記メモリと接続された少なくとも１つのハードウェアプロセッサとを備えた、第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価装置であって、
前記ハードウェアプロセッサが、
前記第２の学習モジュールが達成すべき学習目標を受け付け、
少なくとも前記学習目標に含まれる評価項目について前記第２の学習モジュールの評価を行い、評価データを生成し、
前記学習目標と前記評価データとを用いて、前記第２の学習モジュールが前記学習目標を達成したか否か判定し、
前記学習目標を達成しないと判定される場合、少なくとも前記学習目標に基づいて、前記第２の学習モジュールと異なる第３の学習モジュールを取得する、
評価装置。 (Supplementary Note 1)
An evaluation device for evaluating a second learning module additionally learning a first learning module, comprising: at least one memory and at least one hardware processor connected to the memory.
The hardware processor is
Accepting a learning goal to be achieved by the second learning module;
Evaluating the second learning module with respect to at least an evaluation item included in the learning target to generate evaluation data;
Using the learning goal and the evaluation data, determine whether the second learning module has achieved the learning goal,
When it is determined that the learning goal is not achieved, a third learning module different from the second learning module is acquired based on at least the learning goal.
Evaluation device.

（付記２）
第１の学習モジュールを追加学習させた第２の学習モジュールを評価する評価方法であって、
少なくとも１つ以上のハードウェアプロセッサによって、前記第２の学習モジュールが達成すべき学習目標を受け付け、
前記ハードウェアプロセッサによって、少なくとも前記学習目標に含まれる評価項目について前記第２の学習モジュールの評価を行い、評価データを生成し、
前記ハードウェアプロセッサによって、前記学習目標と前記評価データとを用いて、前記第２の学習モジュールが前記学習目標を達成したか否か判定し、
前記学習目標を達成しないと判定される場合、少なくとも前記学習目標に基づいて、前記第２の学習モジュールと異なる第３の学習モジュールを取得する、
評価方法。 (Supplementary Note 2)
An evaluation method for evaluating a second learning module obtained by additionally learning a first learning module, comprising:
The second learning module receives the learning goal to be achieved by the at least one hardware processor;
The hardware processor evaluates the second learning module for at least an evaluation item included in the learning target, and generates evaluation data.
The hardware processor uses the learning goal and the evaluation data to determine whether the second learning module has achieved the learning goal.
When it is determined that the learning goal is not achieved, a third learning module different from the second learning module is acquired based on at least the learning goal.
Evaluation method.

１…学習システム、１０…学習装置、１０ａ…制御部、１０ｂ…記憶部、１０ｃ…通信部、１０ｄ…入力部、１０ｅ…表示部、１０１…学習指示受付部、１０２…学習データ取得部、１０３…学習制御部、１０４…学習モジュール、１０５…学習済モジュール出力部、１０６…学習済モジュール抽出部、２０…追加学習制御装置、２０ａ…制御部、２０ｂ…記憶部、２０ｃ…通信部、２０ｄ…入力部、２０ｅ…表示部、２０１…学習済モジュール受付部、２０２…学習モジュール、２０３…学習目標受付部、２０４…評価部、２０５…判定部、２０６…学習モジュール選択部、２０７…要因推定部、２０８…制御部、２０９…ＤＢ、２０９１…学習目標ＤＢ、２０９２…評価データＤＢ、３０…センサ、４０…記憶装置、５０…制御装置、６０…評価装置、６０ａ…制御部、６０ｂ…記憶部、６０ｃ…通信部、６０ｄ…入力部、６０ｅ…表示部、８０１…テキスト、８０２…性能劣化要因ラジオボタン、８０３…詳細設定ボタン、８０４…決定ボタン DESCRIPTION OF SYMBOLS 1 ... Learning system, 10 ... Learning apparatus, 10a ... Control part, 10b ... Storage part, 10c ... Communications part, 10d ... Input part, 10e ... Display part, 101 ... Learning instruction reception part, 102 ... Learning data acquisition part, 103 ... learning control unit, 104 ... learning module, 105 ... learned module output unit, 106 ... learned module extraction unit, 20 ... additional learning control device, 20a ... control unit, 20b ... storage unit, 20c ... communication unit, 20d ... Input unit 20e: Display unit 201: Learned module reception unit 202: Learning module 203: Learning target reception unit 204: Evaluation unit 205: Determination unit 206: Learning module selection unit 207: Factor estimation unit , 208: control unit, 209: DB, 2091: learning target DB, 2092: evaluation data DB, 30: sensor, 40: storage device, 50: control device, 6 ... evaluation device, 60a ... control unit, 60b ... storage unit, 60c ... communication unit, 60d ... input unit, 60e ... display unit, 801 ... text, 802 ... performance degradation factor radio button, 803 ... detailed setting button, 804 ... determination button

Claims

An evaluation apparatus for evaluating a second learning module that additionally learns a first learning module, comprising:
A learning target receiving unit that receives a learning target to be achieved by the second learning module;
An evaluation unit that evaluates the second learning module with respect to at least an evaluation item included in the learning target, and generates evaluation data;
A determination unit that determines whether the second learning module has achieved the learning target, using the learning target and the evaluation data;
A learning module selection unit that acquires a third learning module different from the second learning module based on at least the learning target when it is determined that the learning target is not achieved.

The evaluation device according to claim 1, wherein the third learning module is the first learning module.

The method according to claim 1, further comprising: a factor estimating unit configured to estimate factor that the second learning module could not achieve the learning target and to generate factor improvement data related to learning data used for additional learning of the learning module. The evaluation device according to 1.

The evaluation device according to claim 3, wherein the learning module selection unit generates a learning instruction based on the factor improvement data.

The evaluation device according to claim 3, wherein the factor estimation unit outputs the estimated factor, receives a user input related to the estimated factor, and generates the factor improvement data based on the user input.

It is an evaluation method which evaluates the 2nd learning module which made the 1st learning module additionally learn, and a computer provided with a control part is
Receiving a learning goal to be achieved by the second learning module;
Evaluating the second learning module with respect to at least an evaluation item included in the learning target to generate evaluation data;
Determining whether the second learning module has achieved the learning goal using the learning goal and the evaluation data;
Obtaining a third learning module different from the second learning module based on at least the learning target, when it is determined that the learning target is not achieved.

A program for causing a computer to evaluate a second learning module that additionally learns a first learning module,
A process of receiving a learning goal to be achieved by the second learning module;
A process of evaluating the second learning module for at least an evaluation item included in the learning target, and generating evaluation data;
A process of determining whether the second learning module has achieved the learning goal using the learning goal and the evaluation data;
A program for executing a process of acquiring a third learning module different from the second learning module when it is determined that the learning goal is not achieved.