JP2020021301A

JP2020021301A - Training data evaluation device, training data evaluation method, and program

Info

Publication number: JP2020021301A
Application number: JP2018144881A
Authority: JP
Inventors: 宏俊安岡; Hirotoshi Yasuoka; 洋桑島; Hiroshi Kuwajima
Original assignee: Denso Corp
Current assignee: Denso Corp
Priority date: 2018-08-01
Filing date: 2018-08-01
Publication date: 2020-02-06
Anticipated expiration: 2038-08-01
Also published as: JP7095467B2

Abstract

To provide a training data evaluation device for evaluating contributions of training data against requirements for identifying a subject to be inferred.SOLUTION: A training data evaluation device 1 comprises: a batch data generation section 10 for splitting training data into a plurality of batch data; a training processing section 11 for training a model by sequentially using the plurality of batch data; a training progress storage section 22 in which, through being trained by the training processing section 11, models in steps obtained through changes of the model by sequentially applying the batch data thereto, and information of the batch data generating each model, are stored; a follow-up processing section 12 which, by applying test data to the plurality of models stored in the training progress storage section 22, evaluates each model for selecting batch data on the basis of evaluation results of each model; and an outputting section 13 for outputting a result of the selected batch data.SELECTED DRAWING: Figure 1

Description

本発明は、機械学習において用いられる訓練データの訓練データ評価装置、訓練データ評価方法、およびプログラムに関する。 The present invention relates to a training data evaluation device, a training data evaluation method, and a program for training data used in machine learning.

近年、機械学習システムの研究が盛んに行われている。機械学習システムは非常に高性能化しており、例えば、セキュリティや自動運転車等へのアプリケーションが検討されている。 In recent years, research on machine learning systems has been actively conducted. Machine learning systems have become extremely sophisticated, and applications to security and self-driving cars, for example, are being studied.

特許文献１は、学習モデルの予測性能を高めるために、対象のデータに対して適切な機械学習のアルゴリズムを選択する機械学習管理の発明を開示している。この発明では、同じデータに対して機械学習アルゴリズムを変えながら何度もモデルの作成と評価を繰り返すモデル探索を行う。このモデル探索を繰り返すときに、過去に実施したモデル探索の過程で生成し、キャッシュに格納されたデータを再利用する。 Patent Literature 1 discloses a machine learning management invention that selects an appropriate machine learning algorithm for target data in order to improve the prediction performance of a learning model. In the present invention, a model search is repeatedly performed on the same data by repeatedly creating and evaluating a model while changing a machine learning algorithm. When this model search is repeated, data generated in the process of the model search performed in the past and stored in the cache is reused.

機械学習システムにおいては、推論の対象を特定する要件（例えば、自動運転でいえば、道路を走行する車両の検出等）を定め、予め多数の訓練データを用いて、当該推論を行うためのモデルを学習する（非特許文献１）。 In a machine learning system, a requirement for specifying an object of inference (for example, detection of a vehicle running on a road in the case of automatic driving) is determined, and a model for performing the inference using a large amount of training data in advance. Is learned (Non-Patent Document 1).

特開２０１７−２２８０８６号公報JP 2017-228086 A

Laura L. Pullum Brian J. Taylor Majorie A. Darrah「Guidance for the Verification and Validation of Neural Networks」Laura L. Pullum Brian J. Taylor Majorie A. Darrah `` Guidance for the Verification and Validation of Neural Networks ''

機械学習システムにおいて、要件に紐づく訓練データのデータセットは、機械学習に用いた全ての訓練データであった。すなわち、全ての訓練データを用いて訓練を行った結果が、要件に合っているかどうか、という観点でモデルの評価が行われることが一般的であった。 In the machine learning system, the training data set associated with the requirements was all training data used for machine learning. That is, the model is generally evaluated from the viewpoint of whether or not the result of training using all the training data meets the requirements.

ところで、新規にモデルを作成する際に、過去のモデル開発に使った訓練データを再利用することがある。この場合、新規のモデルにおいても過去の要件項目を引き継ぐ場合には、過去のモデル開発の訓練データを再利用するだけでなく、どの訓練データが過去のモデル生成に有効であったかが分かると、新規のモデル開発を効率良く行うことができる。 By the way, when a new model is created, training data used in past model development may be reused. In this case, when inheriting the past requirement items even in the new model, not only reuse the training data of the past model development but also know which training data was effective for the past model generation. Model development can be performed efficiently.

本発明は、上記背景に鑑み、推論対象を特定する要件に対する訓練データの寄与を評価する訓練データ評価装置を提供することを目的とする。 In view of the above background, an object of the present invention is to provide a training data evaluation device that evaluates the contribution of training data to a requirement for specifying an inference target.

本発明は上記課題を解決するために以下の技術的手段を採用する。特許請求の範囲及びこの項に記載した括弧内の符号は、ひとつの態様として後述する実施の形態に記載の具体的手段との対応関係を示す一例であって、本発明の技術的範囲を限定するものではない。 The present invention employs the following technical means to solve the above problems. The reference numerals in the claims and the parentheses described in this section are examples showing the correspondence with specific means described in the embodiment described below as one aspect, and limit the technical scope of the present invention. It does not do.

本発明の訓練データ評価装置（１）は、訓練データを複数のバッチデータに分けるバッチデータ生成部（１０）と、複数のバッチデータを順次用いてモデルの訓練を行う訓練処理部（１１）と、前記バッチデータを順次適用した訓練によって変化していく過程のモデルと、それぞれのモデルを生成したバッチデータの情報とを記憶した訓練経過記憶部（２２）と、前記訓練経過記憶部（２２）に記憶された複数のモデルにテストデータを適用して、前記各モデルを評価し、各モデルの評価結果に基づいてバッチデータを選定する追跡処理部（１２）と、前記バッチデータの選定結果を出力する出力部（１３）とを備える。 A training data evaluation device (1) according to the present invention includes a batch data generation unit (10) for dividing training data into a plurality of batch data, and a training processing unit (11) for training a model by sequentially using the plurality of batch data. A training progress storage unit (22) storing a model of a process of being changed by the training in which the batch data is sequentially applied, and information of the batch data for generating each model; and a training progress storage unit (22). A tracking processing unit (12) for applying the test data to the plurality of models stored in the storage unit, evaluating each of the models, and selecting batch data based on the evaluation result of each model; An output unit (13) for outputting.

訓練データを複数のバッチデータに分け、それぞれのバッチデータを用いて訓練を行ったモデルをテストすることにより、どのバッチデータが要件を満たすモデルの生成につながっているかを評価することができる。 By dividing the training data into a plurality of batch data and testing the trained model using each batch data, it is possible to evaluate which batch data has led to the generation of a model satisfying the requirements.

本発明の別の態様の訓練データ評価装置（５）は、訓練データの中から選定された選定データを評価する訓練データ評価装置（５）であって、前記選定データを除いた前記訓練データに基づいてモデルを生成し、生成されたモデルにテストデータを適用してモデルを評価することによって、前記選定データの検証を行う検証部（１３）を備える。ここで、選定データは、所定の精度で要件に適合する推論を行えるモデルを生成できるとして選定されたものであり、選定の方法は問わない。 A training data evaluation device (5) according to another aspect of the present invention is a training data evaluation device (5) for evaluating selected data selected from training data. A verification unit configured to generate a model based on the selected model and to apply the test data to the generated model to evaluate the model, thereby verifying the selected data; Here, the selection data is selected as a model capable of generating a model capable of performing inference that meets requirements with predetermined accuracy, and the selection method is not limited.

このように選定データを除く訓練データを用いて生成したモデルを評価することにより選定データを検証できる。すなわち、選定データを除く訓練データによって生成したモデルの評価が低い場合には、選定データの評価が高いことが確認される。逆に、選定データを除く訓練データによって生成したモデルの評価が高い場合には、訓練データ全体の評価が高いと考えられ、選定データだけが殊更に評価が高いというわけではないことが分かる。 Thus, the selection data can be verified by evaluating the model generated using the training data excluding the selection data. That is, when the evaluation of the model generated by the training data excluding the selection data is low, it is confirmed that the evaluation of the selection data is high. Conversely, when the evaluation of the model generated by the training data excluding the selection data is high, it is considered that the evaluation of the entire training data is high, and it can be seen that only the selection data is not particularly high.

本発明の訓練データ評価方法は、訓練データを複数のバッチデータに分けるステップ（Ｓ１０）と、複数のバッチデータを順次用いてモデルの訓練を行うステップ（Ｓ１２）と、前記バッチデータを順次適用した訓練によって変化していく過程のモデルと、それぞれのモデルを生成したバッチデータの情報とを訓練経過記憶部（２２）に記憶するステップ（Ｓ１３）と、前記訓練経過記憶部（２２）に記憶された複数のモデルにテストデータを適用して、前記各モデルを評価し、各モデルの評価結果に基づいてバッチデータを選定するステップ（Ｓ１７）と、前記バッチデータの選定結果を出力するステップ（Ｓ１８）とを備える。 In the training data evaluation method of the present invention, the step of dividing the training data into a plurality of batch data (S10), the step of training a model by sequentially using the plurality of batch data (S12), and the batch data are sequentially applied. A step (S13) of storing in the training progress storage unit (22) a model in the process of being changed by the training and information of the batch data for generating each model; and storing the training progress storage unit (22) in the training progress storage unit (22). Applying the test data to the plurality of models, evaluating each of the models, selecting batch data based on the evaluation results of each model (S17), and outputting the batch data selection results (S18). ).

本発明のプログラムは、訓練データを評価するためのプログラムであって、コンピュータに、訓練データを複数のバッチデータに分けるステップと、複数のバッチデータを順次用いてモデルの訓練を行うステップと、前記バッチデータを順次適用した訓練によって変化していく過程のモデルと、それぞれのモデルを生成したバッチデータの情報とを訓練経過記憶部に記憶したステップと、前記訓練経過記憶部に記憶された複数のモデルにテストデータを適用して、前記各モデルを評価し、各モデルの評価結果に基づいてバッチデータを選定するステップと、前記バッチデータの選定結果を出力するステップとを実行させる。 The program of the present invention is a program for evaluating training data, a computer, a step of dividing the training data into a plurality of batch data, a step of training the model using a plurality of batch data sequentially, and A step of storing in the training progress storage unit the model of the process of changing by the training in which the batch data is sequentially applied, and the information of the batch data that generated each model; and a plurality of steps stored in the training progress storage unit. A step of applying the test data to the model to evaluate each of the models, selecting batch data based on an evaluation result of each model, and outputting the batch data selection result is executed.

本発明によれば、訓練データのうちのどの訓練データがモデルの評価につながっているかを評価することができる。 According to the present invention, it is possible to evaluate which training data of the training data leads to the evaluation of the model.

第１の実施の形態の訓練データ評価装置の構成を示す図である。It is a figure showing composition of a training data evaluation device of a 1st embodiment. （ａ）訓練データを示す模式図である。（ｂ）訓練データをバッチデータに分けた例を示す図である。(A) It is a schematic diagram which shows training data. (B) It is a figure showing an example which divided training data into batch data. 追跡結果記憶部に記憶されたデータの例を示す図である。FIG. 4 is a diagram illustrating an example of data stored in a tracking result storage unit. 第１の実施の形態の訓練データ評価装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the training data evaluation apparatus of 1st Embodiment. 第２の実施の形態の訓練データ評価装置の構成を示す図である。It is a figure showing the composition of the training data evaluation device of a 2nd embodiment. 第２の実施の形態の訓練データ評価装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the training data evaluation apparatus of 2nd Embodiment. 第３の実施の形態の訓練データ評価装置の構成を示す図である。It is a figure showing the composition of the training data evaluation device of a 3rd embodiment. 訓練データ選定部による訓練データの選定について説明するための図である。It is a figure for explaining selection of training data by a training data selection part. 第３の実施の形態の訓練データ評価装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the training data evaluation apparatus of 3rd Embodiment. 第４の実施の形態の訓練データ評価装置の構成を示す図である。It is a figure showing the composition of the training data evaluation device of a 4th embodiment. 第４の実施の形態の訓練データ評価装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the training data evaluation apparatus of 4th Embodiment. 別の例に係る訓練データ評価装置の構成を示す図である。It is a figure showing the composition of the training data evaluation device concerning other examples.

以下、本発明の実施の形態の訓練データ評価装置について図面を参照して説明する。以下で説明する実施の形態では、ニューラルネットワークモデルを例として説明するが、本発明は、別のモデルを訓練する訓練データの評価にも用いることができる。 Hereinafter, a training data evaluation device according to an embodiment of the present invention will be described with reference to the drawings. In the embodiment described below, a neural network model will be described as an example, but the present invention can also be used for evaluating training data for training another model.

（第１の実施の形態）
図１は、第１の実施の形態の訓練データ評価装置１の構成を示す図である。訓練データ評価装置１は、評価の対象となる訓練データを記憶した訓練データ記憶部２０を有している。訓練データによって生成するモデルの要件は、例えば、「自車線上の自動車を認識すること」であり、このための訓練データは、フロントガラスから撮影した画像に自動車を示す境界ボックスを付した大量の画像である。 (First Embodiment)
FIG. 1 is a diagram illustrating a configuration of a training data evaluation device 1 according to the first embodiment. The training data evaluation device 1 has a training data storage unit 20 that stores training data to be evaluated. The requirement of the model generated by the training data is, for example, `` recognizing the car on the own lane '', and the training data for this is a large amount of images with a bounding box indicating the car on the image taken from the windshield. It is an image.

本実施の形態の訓練データ評価装置１は、大量の訓練データを評価して、「自車線上の自動車を認識すること」という要件を満たすモデルを生成するのに適した訓練データを選定するものである。モデルとしては、Ｎ層畳み込みニューラルネットワークのモデルを用いる。 The training data evaluation device 1 of the present embodiment evaluates a large amount of training data and selects training data suitable for generating a model that satisfies the requirement of “recognizing a car on the own lane”. It is. As a model, an N-layer convolutional neural network model is used.

訓練データ評価装置１は、バッチデータ生成部１０と、訓練処理部１１と、追跡処理部１２と、出力部１３とを有している。バッチデータ生成部１０は、訓練データ記憶部２０に記憶された大量の訓練データをバッチデータに分ける機能を有する。 The training data evaluation device 1 has a batch data generation unit 10, a training processing unit 11, a tracking processing unit 12, and an output unit 13. The batch data generation unit 10 has a function of dividing a large amount of training data stored in the training data storage unit 20 into batch data.

図２（ａ）は、訓練データ記憶部２０に記憶された大量の訓練データを示す模式図である。図２（ａ）に示す一つ一つの四角は、フロントガラスから撮影した画像に自動車を示す境界ボックスを付した画像を模したものである。バッチデータ生成部１０は、図２（ｂ）に示すように、訓練データをバッチデータに分ける。図２（ｂ）では、９つのデータを一つのバッチとしているが、これは例であって、一つのバッチに含める訓練データの数はいくつでもよい。なお、バッチデータは、１つの訓練データで構成されていてもよい。 FIG. 2A is a schematic diagram illustrating a large amount of training data stored in the training data storage unit 20. Each square shown in FIG. 2A simulates an image obtained by adding a bounding box indicating an automobile to an image taken from a windshield. The batch data generator 10 divides the training data into batch data, as shown in FIG. In FIG. 2 (b), nine data are taken as one batch, but this is an example, and any number of training data may be included in one batch. Note that the batch data may be composed of one training data.

バッチデータ生成部１０は、生成したバッチデータをバッチデータ記憶部２１に記憶する。なお、バッチデータ記憶部２１にはバッチに含まれる訓練データ自体を記憶してもよいし、訓練データ自体を記憶しないでバッチに含まれる訓練データのＩＤを記憶してもよい。後者の構成の場合には、実際に訓練を行う際には、訓練データ記憶部２０から訓練データを読み出すことになる。 The batch data generation unit 10 stores the generated batch data in the batch data storage unit 21. Note that the batch data storage unit 21 may store the training data itself included in the batch, or may store the ID of the training data included in the batch without storing the training data itself. In the case of the latter configuration, when training is actually performed, the training data is read from the training data storage unit 20.

訓練処理部１１は、訓練データを用いてモデルの訓練を行い、モデルを生成する処理を行う。訓練処理部１１は、バッチデータを順次用いてモデルの訓練を行う。訓練処理部１１は、例えば、図２（ｂ）に示すバッチ１００の訓練データを用いてモデルを訓練してモデルＭ１００を生成し、次に、モデルＭ１００に対してバッチ１０１の訓練データを用いて訓練してモデルＭ１０１を生成する。このように、バッチデータを順次適用してモデルを更新していく。訓練処理部１１は、更新されていくモデルのデータとそのモデルを生成するのに用いたバッチデータを特定するデータを訓練経過記憶部２２に記憶する。 The training processing unit 11 performs training of the model using the training data, and performs a process of generating the model. The training processing unit 11 performs training of the model by using the batch data sequentially. The training processing unit 11 generates a model M100 by training the model using the training data of the batch 100 shown in FIG. 2B, for example, and then uses the training data of the batch 101 for the model M100. Training is performed to generate a model M101. Thus, the model is updated by sequentially applying the batch data. The training processing unit 11 stores in the training progress storage unit 22 the data of the model to be updated and the data specifying the batch data used to generate the model.

追跡処理部１２は、訓練経過記憶部２２に記憶されたモデルに対して、テストデータを適用して追跡評価を行う。テストデータは、訓練データとは異なるデータであり、テストデータ記憶部２３に記憶されている。訓練経過記憶部２２に、モデルＭ１００→モデルＭ１０１→・・・というモデルの訓練経過が記憶されているとき、追跡処理部１２は、モデルＭ１００、モデルＭ１０１、・・・のそれぞれに対して、テストデータを適用してモデルの評価を行う。追跡処理部１２は、各モデルに対してテストデータを適用して評価した評価結果を追跡結果記憶部２４に記憶する。 The tracking processing unit 12 performs a tracking evaluation by applying test data to the model stored in the training progress storage unit 22. The test data is different from the training data, and is stored in the test data storage unit 23. When the training progress of the model “model M100 → model M101 →...” Is stored in the training progress storage unit 22, the tracking processing unit 12 performs a test on each of the models M100, M101,. Evaluate the model by applying the data. The tracking processing unit 12 stores an evaluation result evaluated by applying test data to each model in the tracking result storage unit 24.

図３は、追跡結果記憶部２４に記憶されたデータの例を示す図である。横軸は訓練進捗を示し、縦軸は要件を満たす度合いを示している。一番左にプロットされた「バッチ１０１」は、バッチ１０１によって訓練したモデルＭ１００に対してテストデータを適用して得られた評価結果を示す。左から二番目にプロットされた「バッチ１０２」は、バッチ１０１で訓練されたモデルＭ１００に対してさらにバッチ１０２で訓練して得られたモデルＭ１０２の評価結果を示す図である。このように図３に示すプロットは、右に進むにしたがって多くの訓練データが用いられているので、評価結果が安定していく。 FIG. 3 is a diagram illustrating an example of data stored in the tracking result storage unit 24. The horizontal axis indicates the training progress, and the vertical axis indicates the degree to which the requirements are satisfied. “Batch 101” plotted on the far left indicates an evaluation result obtained by applying test data to the model M100 trained by the batch 101. “Batch 102” plotted second from the left is a diagram showing an evaluation result of the model M 102 obtained by further training in the batch 102 with respect to the model M 100 trained in the batch 101. In this manner, the plot shown in FIG. 3 uses more training data as it goes to the right, so that the evaluation result becomes stable.

追跡処理部１２は、追跡結果記憶部２４に記憶されたデータに基づいて、要件に適したモデルの生成に寄与したバッチデータを選定する。追跡処理部１２は、選定したバッチデータを選定データ記憶部２５に記憶する。なお、追跡結果に基づいて、バッチデータを選定する手法はいろいろと考えられる。 The tracking processing unit 12 selects, based on the data stored in the tracking result storage unit 24, batch data that has contributed to generation of a model suitable for the requirement. The tracking processing unit 12 stores the selected batch data in the selected data storage unit 25. There are various methods for selecting batch data based on the tracking result.

例えば、評価結果が最良のモデルに対応するバッチデータを選定することができる。または、評価結果が良い方から所定個数のモデルに対応するバッチデータを選定してもよい。あるいは、ニューラルネットワークから出力される判定の確信度（ＳＯＦＴＭＡＸの出力値等）が最も高いモデル又は高い方から所定個数のモデルに対応するバッチデータを選定してもよい。 For example, batch data corresponding to a model having the best evaluation result can be selected. Alternatively, batch data corresponding to a predetermined number of models may be selected from the one with the better evaluation result. Alternatively, batch data corresponding to a model with the highest degree of certainty (e.g., an output value of SOFTMAX) of the judgment output from the neural network or a predetermined number of models from the higher model may be selected.

また、評価結果の変化に着目してバッチデータを選定してもよい。例えば、モデルの評価結果を最後に良い方向へと変化させたバッチデータを選定してもよいし、モデルの評価結果を所定の閾値より大きく良い方向へ変化させたバッチデータを選定してもよい。 Further, the batch data may be selected by focusing on a change in the evaluation result. For example, batch data in which the evaluation result of the model is finally changed to a better direction may be selected, or batch data in which the evaluation result of the model has been changed to a better direction than a predetermined threshold may be selected. .

出力部１３は、追跡処理部１２にて選定したバッチデータを示すデータを出力する。この際、追跡結果記憶部２４に記憶されたデータ（図３参照）を合わせて出力してもよい。これにより、バッチデータの選定理由を理解することができる。 The output unit 13 outputs data indicating the batch data selected by the tracking processing unit 12. At this time, the data (see FIG. 3) stored in the tracking result storage unit 24 may be output together. Thereby, the reason for selecting the batch data can be understood.

図４は、第１の実施の形態の訓練データ評価装置１の動作を示す図である。訓練データ評価装置１は、訓練データ記憶部２０に記憶されている大量の訓練データを分けてバッチデータを生成する（Ｓ１０）。続いて、訓練データ評価装置１は、どの順序でバッチデータを用いて訓練を行うのか、訓練順序を決定する（Ｓ１１）。 FIG. 4 is a diagram illustrating an operation of the training data evaluation device 1 according to the first embodiment. The training data evaluation device 1 divides a large amount of training data stored in the training data storage unit 20 and generates batch data (S10). Subsequently, the training data evaluation device 1 determines the training order in which order the training is performed using the batch data (S11).

続いて、訓練データ評価装置１は、バッチデータを使って訓練を行い（Ｓ１２）、訓練によって生成されたモデルとその訓練に用いたバッチデータを特定するデータを訓練経過記憶部２２に記憶する（Ｓ１３）。訓練データ評価装置１は、全バッチデータの処理を終了したか否かを判定する（Ｓ１４）。全バッチデータについて処理を終了していない場合には（Ｓ１４でＮＯ）、次のバッチデータを用いて、モデルをさらに訓練する（Ｓ１２）。 Subsequently, the training data evaluation device 1 performs training using the batch data (S12), and stores the model generated by the training and the data specifying the batch data used for the training in the training progress storage unit 22 ( S13). The training data evaluation device 1 determines whether the processing of all batch data has been completed (S14). If the processing has not been completed for all the batch data (NO in S14), the model is further trained using the next batch data (S12).

全バッチデータについて訓練を終了した場合は（Ｓ１４でＹＥＳ）、訓練データ評価装置１は、訓練経過記憶部２２から訓練過程のモデルを読み出し（Ｓ１５）、テストデータを用いて訓練経過のモデルのテストを行い、訓練経過のモデルの評価をする（Ｓ１６）。続いて、訓練データ評価装置１は、訓練経過のモデルの評価結果に基づいて、要件に適したモデルを生成したバッチデータを選定し（Ｓ１７）、選定結果を出力する（Ｓ１８）。 When the training has been completed for all the batch data (YES in S14), the training data evaluation device 1 reads the training process model from the training progress storage unit 22 (S15), and tests the training progress model using the test data. Is performed, and the model of the training process is evaluated (S16). Subsequently, the training data evaluation device 1 selects batch data that has generated a model suitable for the requirements based on the evaluation result of the training progress model (S17), and outputs the selection result (S18).

以上、本実施の形態の訓練データ評価装置１の構成について説明したが、上記した訓練データ評価装置１のハードウェアの例は、ＣＰＵ、ＲＡＭ、ＲＯＭ、ハードディスク、ディスプレイ、キーボード、マウス、通信インターフェース等を備えたコンピュータである。上記した各機能を実現するモジュールを有するプログラムをＲＡＭまたはＲＯＭに格納しておき、ＣＰＵによって当該プログラムを実行することによって、上記した訓練データ評価装置１が実現される。このようなプログラムも本発明の範囲に含まれる。 The configuration of the training data evaluation device 1 according to the present embodiment has been described above. Examples of the hardware of the training data evaluation device 1 include a CPU, a RAM, a ROM, a hard disk, a display, a keyboard, a mouse, a communication interface, and the like. It is a computer provided with. The above-described training data evaluation device 1 is realized by storing a program having modules for realizing the above-described functions in a RAM or a ROM and executing the program by the CPU. Such a program is also included in the scope of the present invention.

第１の実施の形態の訓練データ評価装置１は、訓練データを複数のバッチデータに分け、それぞれのバッチデータを用いて訓練を行ったモデルをテストすることにより、どのバッチデータが要件を満たすモデルの生成につながっているかを評価することができる。 The training data evaluation device 1 according to the first embodiment divides training data into a plurality of batch data, tests a model that has been trained using each batch data, and determines which batch data satisfies the requirements. Can be evaluated whether or not it has been generated.

（第２の実施の形態）
図５は、第２の実施の形態の訓練データ評価装置２の構成を示す図である。第２の実施の形態の訓練データ評価装置２の基本的な構成は、第１の実施の形態の訓練データ評価装置１と同じであるが、第２の実施の形態の訓練データ評価装置２は、訓練過程のモデルを記憶する訓練経過記憶部２２を備えていない。第２の実施の形態の訓練データ評価装置２は、モデルの訓練を行いながら、モデルの評価を行う点が異なる。 (Second embodiment)
FIG. 5 is a diagram illustrating a configuration of the training data evaluation device 2 according to the second embodiment. The basic configuration of the training data evaluation device 2 of the second embodiment is the same as that of the training data evaluation device 1 of the first embodiment, but the training data evaluation device 2 of the second embodiment has And a training progress storage unit 22 for storing a training process model. The training data evaluation device 2 of the second embodiment is different in that the model evaluation is performed while the model is trained.

図６は、第２の実施の形態の訓練データ評価装置２の動作を示す図である。訓練データ評価装置２は、訓練データ記憶部２０に記憶されている大量の訓練データを分けてバッチデータを生成する（Ｓ２０）。続いて、訓練データ評価装置２は、どの順序でバッチデータを用いて訓練を行うのか、訓練順序を決定する（Ｓ２１）。 FIG. 6 is a diagram illustrating an operation of the training data evaluation device 2 according to the second embodiment. The training data evaluation device 2 divides a large amount of training data stored in the training data storage unit 20 and generates batch data (S20). Subsequently, the training data evaluation device 2 determines a training order in which order the training is performed using the batch data (S21).

続いて、訓練データ評価装置２は、バッチデータを使って訓練を行い（Ｓ２２）、訓練によって生成されたモデルのテストを行い、訓練経過のモデルの評価をする（Ｓ２３）。訓練データ評価装置２は、訓練経過のモデルの評価とそのモデルの生成に得られたバッチデータを特定するデータを追跡結果記憶部２４に記憶する（Ｓ２４）。訓練データ評価装置２は、全てのバッチデータを用いたか否かに基づいて、訓練を終了するか否かを判定する（Ｓ２５）。全バッチデータの処理を終了していない場合には（Ｓ２５でＮＯ）、次のバッチデータを用いて、モデルの訓練および評価を行う（Ｓ２２〜Ｓ２４）。全バッチデータの処理を終了した場合は（Ｓ２５でＹＥＳ）、訓練データ評価装置２は、訓練経過のモデルの評価結果に基づいて、要件に適したモデルを生成したバッチデータを選定し（Ｓ２６）、選定結果を出力する（Ｓ２７）。 Subsequently, the training data evaluation device 2 performs training using the batch data (S22), tests the model generated by the training, and evaluates the model of the training process (S23). The training data evaluation device 2 stores, in the tracking result storage unit 24, data that specifies the batch data obtained for the evaluation of the training progress model and the generation of the model (S24). The training data evaluation device 2 determines whether to end the training based on whether all the batch data has been used (S25). If the processing of all batch data has not been completed (NO in S25), the model is trained and evaluated using the next batch data (S22 to S24). When the processing of all the batch data is completed (YES in S25), the training data evaluation device 2 selects the batch data that has generated the model suitable for the requirement based on the evaluation result of the model of the training progress (S26). , And outputs the selection result (S27).

第２の実施の形態の訓練データ評価装置２は、第１の実施の形態の訓練データ評価装置１と同様に、どのバッチデータが要件を満たすモデルの生成につながっているかを評価することができることに加え、バッチデータを用いたモデルの訓練を行いつつ、訓練過程で得られたモデルのテストを行うので、訓練過程で得られたモデルを残しておく必要がない。 Like the training data evaluation device 1 of the first embodiment, the training data evaluation device 2 of the second embodiment can evaluate which batch data has led to the generation of a model satisfying the requirements. In addition, since the model obtained during the training process is tested while training the model using the batch data, it is not necessary to keep the model obtained during the training process.

（第３の実施の形態）
図７は、第３の実施の形態の訓練データ評価装置３の構成を示す図である。第３の実施の形態の訓練データ評価装置３は、上記した第１の実施の形態の訓練データ評価装置１と同様にバッチデータの評価を行うが、バッチを組み替えてバッチデータの評価を繰り返し行う。そして、異なる試行で選定されたバッチに共通して含まれる訓練データを選定する。すなわち、第１の実施の形態では、バッチを単位として、評価の高い訓練データを選定していたのに対し、第３の実施の形態では各訓練データの単位で評価の高い訓練データを選定する。 (Third embodiment)
FIG. 7 is a diagram illustrating a configuration of the training data evaluation device 3 according to the third embodiment. The training data evaluation device 3 of the third embodiment evaluates batch data in the same manner as the training data evaluation device 1 of the above-described first embodiment, but repeatedly performs batch data evaluation by rearranging batches. . Then, training data that is commonly included in batches selected in different trials is selected. That is, in the first embodiment, training data with high evaluation is selected in batches, whereas in the third embodiment, training data with high evaluation is selected in units of training data. .

第３の実施の形態の訓練データ評価装置３は、繰返処理部３０を有している。繰返処理部３０は、バッチデータ生成部１０、訓練処理部１１および追跡処理部１２を有している。バッチデータ生成部１０は、訓練データからバッチデータを生成するが、繰り返しのたびに異なるバッチデータを生成する。訓練処理部１１および追跡処理部１２は、バッチデータ生成部１０にて生成されたバッチデータに対して、第１の実施の形態の訓練データ評価装置１と同様に、モデルの生成とそのモデルの評価を行い、評価結果に基づいてバッチデータを選定する。追跡処理部１２は、選定したバッチデータを選定バッチデータ記憶部２５に記憶する。続いて、訓練データ選定部１４は、選定されたバッチデータに共通に含まれる訓練データを選定する。 The training data evaluation device 3 according to the third embodiment has a repetition processing unit 30. The repetition processing unit 30 includes a batch data generation unit 10, a training processing unit 11, and a tracking processing unit 12. The batch data generation unit 10 generates batch data from the training data, but generates different batch data for each repetition. The training processing unit 11 and the tracking processing unit 12 generate a model and generate a model of the model with respect to the batch data generated by the batch data generation unit 10 in the same manner as the training data evaluation device 1 according to the first embodiment. Perform an evaluation and select batch data based on the evaluation results. The tracking processing unit 12 stores the selected batch data in the selected batch data storage unit 25. Subsequently, the training data selection unit 14 selects training data commonly included in the selected batch data.

図８は、訓練データ選定部１４による訓練データの選定について説明するための図である。図８には、繰り返し処理のＭ回目の試行において良い結果を得たバッチデータと、Ｎ回目の試行において良い結果を得たバッチデータの例を示している。訓練データ選定部１４は、異なる試行において得られたバッチデータに共通して含まれる訓練データを選定する。図８に示す例では、網掛けをしたデータＡとデータＢが両方のバッチデータに共に含まれているので、訓練データ選定部１４は、データＡとデータＢを選定する。 FIG. 8 is a diagram for describing selection of training data by the training data selection unit 14. FIG. 8 shows an example of batch data that obtained a good result in the M-th trial of the repetitive processing and batch data that obtained a good result in the N-th trial. The training data selection unit 14 selects training data commonly included in batch data obtained in different trials. In the example shown in FIG. 8, the data A and the data B, which are shaded, are included in both the batch data, so the training data selecting unit 14 selects the data A and the data B.

図８では、２回の選定結果に共通して含まれるデータを選定する例を挙げたが、訓練データ選定部１４は、Ｋ回（例えば、３回等）の結果に共通して含まれるデータを選定することとしてもよいし、すべての結果に共通して含まれるデータを選定することとしてもよい。 FIG. 8 shows an example of selecting data that is commonly included in the results of the two selections. However, the training data selection unit 14 determines the data that is commonly included in the results of the K times (for example, three times). May be selected, or data included in all the results may be selected.

図９は、第３の実施の形態の訓練データ評価装置３の動作を示すフローチャートである。訓練データ評価装置３は、訓練データ記憶部２０に記憶されている大量の訓練データを分けてバッチデータを生成する（Ｓ３０）。続いて、訓練データ評価装置３は、どの順序でバッチデータを用いて訓練を行うのか、訓練順序を決定する（Ｓ３１）。続いて、訓練データ評価装置３は、バッチデータを使って訓練を行い（Ｓ３２）、訓練によって生成されたモデルとその訓練に用いたバッチデータを特定するデータを訓練経過記憶部２２に記憶する（Ｓ３３）。訓練データ評価装置３は、全バッチデータの処理を終了したか否かを判定する（Ｓ３４）。全バッチデータの処理を終了していない場合には（Ｓ３４でＮＯ）、次のバッチデータを用いて、モデルをさらに訓練する（Ｓ３２）。 FIG. 9 is a flowchart illustrating the operation of the training data evaluation device 3 according to the third embodiment. The training data evaluation device 3 divides a large amount of training data stored in the training data storage unit 20 and generates batch data (S30). Subsequently, the training data evaluation device 3 determines the training order in which order the training is performed using the batch data (S31). Subsequently, the training data evaluation device 3 performs training using the batch data (S32), and stores the model generated by the training and the data specifying the batch data used for the training in the training progress storage unit 22 ( S33). The training data evaluation device 3 determines whether the processing of all batch data has been completed (S34). If the processing of all batch data has not been completed (NO in S34), the model is further trained using the next batch data (S32).

全バッチデータの処理を終了した場合は（Ｓ３４でＹＥＳ）、訓練データ評価装置３は、訓練経過記憶部２２から訓練過程のモデルを読み出し（Ｓ３５）、テストデータを用いて訓練経過のモデルのテストを行い、訓練経過のモデルの評価をする（Ｓ３６）。続いて、訓練データ評価装置３は、訓練経過のモデルの評価結果に基づいて、要件に適したモデルの生成に寄与したバッチデータを選定する（Ｓ３７）。 When the processing of all the batch data is completed (YES in S34), the training data evaluation device 3 reads the model of the training process from the training progress storage unit 22 (S35), and tests the model of the training process using the test data. Is performed, and the model of the training process is evaluated (S36). Subsequently, the training data evaluation device 3 selects the batch data that has contributed to the generation of the model suitable for the requirement, based on the evaluation result of the training progress model (S37).

次に、訓練データ評価装置３は、訓練を終了するか否かを判定する（Ｓ３８）。訓練を終了しないと判定された場合（Ｓ３８でＮＯ）、訓練データ評価装置３は、バッチデータを生成し直し（Ｓ３０）、新たなバッチデータを用いて上記した処理を繰り返す（Ｓ３１〜Ｓ３７）。訓練を終了すると判定された場合（Ｓ３８でＹＥＳ）、訓練データ評価装置３は、選定されたバッチデータに共通して含む訓練データを抽出し（Ｓ３９）、抽出結果を出力する（Ｓ４０）。 Next, the training data evaluation device 3 determines whether to end the training (S38). If it is determined that the training is not to be ended (NO in S38), the training data evaluation device 3 regenerates the batch data (S30), and repeats the above processing using the new batch data (S31 to S37). When it is determined that the training is to be ended (YES in S38), the training data evaluation device 3 extracts training data commonly included in the selected batch data (S39), and outputs an extraction result (S40).

第３の実施の形態の訓練データ評価装置３は、バッチデータを組み直してバッチデータの評価を行い、選定されたバッチデータに共通に含まれる訓練データを選定するので、バッチの単位よりもきめ細かく、要件を満たすモデルの生成に寄与する訓練データを選定できる。 The training data evaluation device 3 of the third embodiment reassembles the batch data and evaluates the batch data, and selects the training data that is commonly included in the selected batch data. Training data that contributes to the generation of a model that meets the requirements can be selected.

（第４の実施の形態）
図１０は、第４の実施の形態の訓練データ評価装置４の構成を示す図である。第４の実施の形態の訓練データ評価装置４の基本的な構成は第１の実施の形態の訓練データ評価装置１と同じであるが、第４の実施の形態の訓練データ評価装置４は、選定されたバッチデータの検証を行う検証部１５をさらに備えている。検証部１５は、選定されたバッチデータを除く訓練データを用いてモデルを生成し、生成したモデルにテストデータを適用して評価を行う。 (Fourth embodiment)
FIG. 10 is a diagram illustrating a configuration of the training data evaluation device 4 according to the fourth embodiment. The basic configuration of the training data evaluation device 4 according to the fourth embodiment is the same as that of the training data evaluation device 1 according to the first embodiment, but the training data evaluation device 4 according to the fourth embodiment includes: It further includes a verification unit 15 for verifying the selected batch data. The verification unit 15 generates a model using the training data excluding the selected batch data, and performs an evaluation by applying test data to the generated model.

図１１は、第４の実施の形態の訓練データ評価装置４において検証の動作を示すフローチャートである。訓練データ評価装置４は、訓練データ記憶部２０から訓練データを読み出し（Ｓ５０）、読み出した訓練データから選定されたバッチデータを除外する（Ｓ５１）。次に、訓練データ評価装置４は、バッチデータを除外した訓練データによって訓練を行ったモデルを生成し（Ｓ５２）、生成したモデルにテストデータを適用して、モデルの評価を行う（Ｓ５３）。訓練データ評価装置４は、その評価結果を出力する（Ｓ５４）。 FIG. 11 is a flowchart illustrating a verification operation in the training data evaluation device 4 according to the fourth embodiment. The training data evaluation device 4 reads the training data from the training data storage unit 20 (S50), and excludes the selected batch data from the read training data (S51). Next, the training data evaluation device 4 generates a model trained by the training data excluding the batch data (S52), and evaluates the model by applying test data to the generated model (S53). The training data evaluation device 4 outputs the evaluation result (S54).

このように選定データを除く訓練データを用いて生成したモデルをテストして評価することにより選定されたバッチデータを検証できる。すなわち、選定されたバッチデータを除く訓練データによって生成したモデルの評価が低い場合には、選定されたバッチデータの評価が高いことが確認される。逆に、選定されたバッチデータを除く訓練データによって生成したモデルの評価が高い場合には、訓練データ全体の評価が高いと考えられ、選定されたバッチデータだけが殊更に評価が高いというわけではないことが分かる。 As described above, the batch data selected can be verified by testing and evaluating the model generated using the training data excluding the selected data. That is, when the evaluation of the model generated by the training data excluding the selected batch data is low, it is confirmed that the evaluation of the selected batch data is high. Conversely, when the model generated by the training data excluding the selected batch data has a high evaluation, the evaluation of the entire training data is considered to be high, and only the selected batch data has a particularly high evaluation. I understand that there is no.

以上、本発明の訓練データ評価装置について、実施の形態を挙げて詳細に説明したが、本発明は上記した実施の形態に限定されるものではない。上記した実施の形態においては、モデルの評価結果に基づいてバッチデータを選定する手法をいくつか説明したが、これらの複数の手法を用いてバッチデータを選定し、それらの和をとってもよい。 As described above, the training data evaluation device of the present invention has been described in detail with reference to the embodiments. However, the present invention is not limited to the above embodiments. In the above-described embodiment, several methods for selecting batch data based on the evaluation result of the model have been described. However, batch data may be selected using a plurality of these methods, and the sum thereof may be calculated.

モデルの生成過程において、訓練の初期はモデルが大きく変化するため、その評価が大きく変動しやすい。そこで、初期に生成されたモデルについてはその評価を行わないこととしてもよい。初期に生成されたモデルとは、例えば、バッチデータ全体に対する割合で「初期」を規定してもよく、例えば、全体の５分の１のバッチデータを用いるまでを「初期」としてもよい。また、適用する訓練データの絶対数によって「初期」を規定してもよく、例えば、１０００枚のバッチデータを用いるまでを「初期」としてもよい。 In the process of generating a model, the model greatly changes at the beginning of training, and its evaluation tends to fluctuate greatly. Therefore, the model generated at the beginning may not be evaluated. The model generated at the beginning may define “initial” as a percentage of the entire batch data, for example, and may define “initial” until one-fifth of the batch data is used. The “initial” may be defined by the absolute number of training data to be applied. For example, the “initial” may be used until batch data of 1000 sheets is used.

上記した第４の実施の形態では、第１の実施の形態の訓練データ評価装置１にて選定したバッチデータに対して検証を行う装置を例として説明したが、第１の実施の形態の訓練データ評価装置１にて選定した訓練データ以外の訓練データに対して検証を行うことができる。 In the above-described fourth embodiment, an example has been described in which the apparatus for verifying the batch data selected by the training data evaluation apparatus 1 of the first embodiment is used as an example. The data evaluation device 1 can verify the training data other than the training data selected.

図１２は、訓練データ評価装置の別の例を示す図である。図１２に示す訓練データ評価装置５は、データ評価部３１と、検証部１５と、出力部１３とを備えている。データ評価部３１は、訓練データ記憶部２０に記憶された大量の訓練データの中からモデルの要件に合ったデータを選定する機能を有している。データ評価部３１がデータを選定する方法は、限定されず、いかなる方法を採用してもよい。 FIG. 12 is a diagram illustrating another example of the training data evaluation device. The training data evaluation device 5 illustrated in FIG. 12 includes a data evaluation unit 31, a verification unit 15, and an output unit 13. The data evaluation unit 31 has a function of selecting data that meets the requirements of the model from a large amount of training data stored in the training data storage unit 20. The method by which the data evaluation unit 31 selects data is not limited, and any method may be adopted.

検証部１５は、選定されたデータがモデルの要件に合っているかどうかを検証する。検証部１５は、訓練データから選定データを除き、選定データを除いた訓練データを用いてモデルを生成する。検証部１５は、生成されたモデルに対してテストデータを適用して、（訓練データ−選定データ）で生成されたモデルの評価を行う。このモデルの評価が高いか低いかによって、選定データの検証を行うことができる。 The verification unit 15 verifies whether the selected data meets the requirements of the model. The verification unit 15 removes the selection data from the training data, and generates a model using the training data from which the selection data has been removed. The verification unit 15 applies the test data to the generated model to evaluate the model generated by (training data-selection data). Based on whether the evaluation of the model is high or low, the selected data can be verified.

本発明は、機械学習において用いられる訓練データの評価を行う装置として有用である。 The present invention is useful as an apparatus for evaluating training data used in machine learning.

１〜５訓練データ評価装置，１０バッチデータ生成部，１１訓練処理部，
１２追跡処理部，１３出力部，１４訓練データ選定部，１５検証部，
２０訓練データ記憶部，２１バッチデータ記憶部，２２訓練経過記憶部，
２３テストデータ記憶部，２４追跡結果記憶部，２５選定バッチデータ記憶部，
２６選定訓練データ記憶部，３０繰返処理部，３１データ評価部 1-5 training data evaluation device, 10 batch data generation unit, 11 training processing unit,
12 tracking processing unit, 13 output unit, 14 training data selection unit, 15 verification unit,
20 training data storage unit, 21 batch data storage unit, 22 training progress storage unit,
23 test data storage unit, 24 tracking result storage unit, 25 selected batch data storage unit,
26 selection training data storage unit, 30 repetition processing unit, 31 data evaluation unit

Claims

A batch data generator (10) for dividing the training data into a plurality of batch data;
A training processing unit (11) for training a model by sequentially using a plurality of batch data;
A training progress storage unit (22) storing a model of a process of changing by the training to which the batch data is sequentially applied, and information of the batch data for generating each model;
A tracking processing unit (12) that applies test data to a plurality of models stored in the training progress storage unit (22), evaluates each model, and selects batch data based on an evaluation result of each model; ,
An output unit (13) for outputting the selection result of the batch data;
A training data evaluation device (1) comprising:

The training data evaluation device (1) according to claim 1, wherein the tracking processing unit (12) selects batch data corresponding to a model having the best accuracy rate for test data.

The model is a model of a neural network,
The training data evaluation device (1) according to claim 1, wherein the tracking processing unit (12) selects the batch data based on a certainty factor of a determination output from a neural network model.

The training data evaluation device (1) according to claim 1, wherein the tracking processing unit (12) selects batch data that has finally changed the evaluation result of the model in a better direction.

The training data evaluation device (1) according to claim 1, wherein the tracking processing unit (12) selects batch data in which an evaluation result of the model is changed in a better direction than a predetermined threshold.

The training data evaluation device (1) according to claim 1, wherein the tracking processing unit (12) selects batch data by a plurality of different methods, and calculates a sum of the selected batch data.

The batch data generating unit (10) generates different batch data, repeats the processing by the training processing unit (11) and the tracking processing unit (12), and selects a batch processing unit ( 30)
A training data selection unit (14) for selecting training data commonly included in the batch data selected by the repetition processing unit (30);
The training data evaluation device (3) according to claim 1, comprising:

The training data evaluation according to claim 1, wherein the tracking processing unit (12) does not evaluate an initially generated model among a plurality of models stored in the training progress storage unit (22). Apparatus (1).

A model is generated using the training data excluding the batch data selected by the tracking processing unit (12), and the model is evaluated by applying test data to the model. The training data evaluation device (4) according to claim 1, further comprising a verification unit (15) for verifying the selected batch data.

A training data evaluation device (5) for evaluating selected data selected from training data,
A training unit (13) for generating a model based on the training data excluding the selection data and applying a test data to the generated model to evaluate the model, thereby verifying the selection data; Data evaluation device (5).

Dividing the training data into a plurality of batch data (S10);
Training the model by sequentially using a plurality of batch data (S12);
A step (S13) of storing in the training progress storage unit (22) a model in a process of changing by the training in which the batch data is sequentially applied, and information of the batch data that generated each model;
Applying test data to a plurality of models stored in the training progress storage unit (22) to evaluate each of the models, and selecting batch data based on an evaluation result of each model (S17);
Outputting the selection result of the batch data (S18);
Training data evaluation method comprising:

A program for evaluating training data, comprising:
Dividing the training data into a plurality of batch data;
Training the model using a plurality of batch data sequentially;
Storing the model of the process of changing by the training in which the batch data is sequentially applied, and information of the batch data that generated each model in a training progress storage unit;
Applying test data to a plurality of models stored in the training progress storage unit, evaluating each of the models, and selecting batch data based on an evaluation result of each model;
Outputting the selection result of the batch data;
A program that executes