JP2023104464A

JP2023104464A - Machine learning data set evaluation support device and machine learning data set evaluation support method

Info

Publication number: JP2023104464A
Application number: JP2022005461A
Authority: JP
Inventors: Hironobu Kuruma (來間 啓伸); Tomoyuki Myojin (明神 智之); Naoto Sato (佐藤 直人); Hideto Noguchi (野口 秀人)
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2022-01-18
Filing date: 2022-01-18
Publication date: 2023-07-28

Abstract

PROBLEM TO BE SOLVED: To enable the construction of a refined learning data set that avoids inappropriate learning and insufficient learning.

SOLUTION: A machine learning data set evaluation support device 100 includes: a learning data set holding unit 116 that holds a learning data set and label information used for supervised machine learning; an inference execution unit 111 that inputs a predetermined evaluation data set to a plurality of trained AI models and obtains a plurality of inference results; a difference analysis unit 113 that analyzes differences among the inference results from the confidence and the stability of the inference results in each of the plurality of trained AI models; a data evaluation unit 114 that evaluates the evaluation data set based on the result of the difference analysis; and a learning data set editing unit 115 that edits the learning data set and the label information based on the evaluation result of the evaluation data set.

SELECTED DRAWING: Figure 1

Description

The present invention relates to a machine learning data set evaluation support device and a machine learning data set evaluation support method.

Machine learning technology that makes use of various kinds of big data, such as IoT data, has been introduced in a very wide range of fields, including monitoring and control of various devices, failure prediction, and object recognition during automated driving. In these fields, supervised machine learning is useful because humans provide the correct-answer data, which gives high learning accuracy and fast learning.

In supervised machine learning, data is collected from the real world, and a learning data set (training data and test data) is created by attaching to each item the output value expected when that item is input to the AI model, that is, the correct label. The training data is used as teacher data to train the AI model, and the test data is used to evaluate the accuracy of the trained AI model.

As prior art related to such machine learning, a learning data generation device has been proposed that, in view of the difficulty of cleansing data appropriately, generates learning data suited to a machine learning algorithm from input data (see Patent Document 1).

This technology is a learning data generation device having processing means that performs cleansing processing on input data in order to generate learning data, and generating means that has a machine learning algorithm and uses it to generate a learning model based on the learning data. The generating means has a plurality of machine learning algorithms of different types, and the processing means performs a first cleansing process on the input data corresponding to each of the plurality of machine learning algorithms.

Patent Document 1: JP 2020-38514 A

The requirements desired of the learning data set described above are as follows: the learning data set covers the expected input data; a robust AI model (one that is stable against small deviations in input values) can be constructed; and the labels assigned to the learning data are appropriate.

However, with a learning data set that has simply been collected without any particular care, it is difficult to control how learning progresses, so unintended learning may occur. Examples include cases where learning data is insufficient, cases where learning data with different correct labels lie inadvertently close to each other, and cases where features other than those intended to be learned become dominant.

In the prior art, on the other hand, cleansing of the learning data is performed based on the characteristics of the machine learning algorithm. The learning data therefore cannot be refined by feeding back the characteristics of the inferences made by the generated learning model.

Accordingly, an object of the present invention is to provide a technique that enables the construction of a refined learning data set that avoids inappropriate learning and insufficient learning.

A machine learning data set evaluation support device of the present invention that solves the above problem comprises: a learning data set holding unit that holds a learning data set and label information used for supervised machine learning; an inference execution unit that inputs a predetermined evaluation data set to a plurality of trained AI models and obtains a plurality of inference results; a difference analysis unit that analyzes differences among the plurality of inference results from the confidence and the stability of the inference results in each of the plurality of trained AI models; a data evaluation unit that evaluates the evaluation data set based on the result of the difference analysis; and a learning data set editing unit that edits the learning data set and the label information held in the learning data set holding unit based on the evaluation result of the evaluation data set.

In the machine learning data set evaluation support method of the present invention, an information processing device holds, in a storage device, a learning data set and label information used for supervised machine learning, and executes: a process of inputting a predetermined evaluation data set to a plurality of trained AI models and obtaining a plurality of inference results; a process of analyzing differences among the plurality of inference results from the confidence and the stability of the inference results in each of the plurality of trained AI models; a process of evaluating the evaluation data set based on the result of the difference analysis; and a process of editing the learning data set and the label information held in the storage device based on the evaluation result of the evaluation data set.

According to the present invention, it is possible to construct a refined learning data set that avoids inappropriate learning and insufficient learning.

FIG. 1 is a block diagram showing an example of the functional configuration of the learning data set evaluation support device of this embodiment.
FIG. 2 is a diagram showing an example of the hardware configuration of the learning data set evaluation support device of this embodiment.
FIG. 3 is a diagram showing an example flow of the learning data set evaluation support method of this embodiment.
FIG. 4 is a flow diagram of processing for obtaining AI models trained with varied learning data sets in this embodiment.
FIG. 5 is a flow diagram of processing for obtaining AI models trained with varied model structures in this embodiment.
FIG. 6 is a flow diagram of inference execution processing in this embodiment, in which inference by the trained AI models is executed sequentially.
FIG. 7 is a flow diagram of inference execution processing in this embodiment, in which inference by the trained AI models is executed in parallel.
FIG. 8 is a flow diagram of inference execution processing in this embodiment, using AI models whose structure is changed at inference time.
FIG. 9 is a flow diagram of inference result difference analysis processing in this embodiment.
FIG. 10 is a diagram showing an example of a method of calculating the confidence and stability of inference results in this embodiment.
FIG. 11 is a diagram showing an example of an analysis result of the difference analysis unit in this embodiment.
FIG. 12 is a flow diagram of data evaluation processing in this embodiment.
FIG. 13 is a diagram showing an example of threshold setting processing performed by the data evaluation unit in this embodiment.
FIG. 14 is a flow diagram of learning data set editing processing in this embodiment.
FIG. 15 is a diagram showing an example of a method of determining similar data performed by the learning data set editing unit in this embodiment.
FIG. 16 is a diagram showing an example of a method of generating similar data in this embodiment.
FIG. 17 is a block diagram showing the functional configuration of the learning data set evaluation support device in Example 1.
FIG. 18 is a diagram showing an example of evaluation data and examples of inference results and difference analysis results in Example 1.
FIG. 19 is a diagram showing examples of a high-confidence threshold, a low-confidence threshold, and a low-stability threshold in Example 1.
FIG. 20 is a diagram showing an example of evaluation results of evaluation data in Example 1.
FIG. 21 is an example of learning data set editing processing in Example 1, showing a case where the label of the learning data matches the inference result.
FIG. 22 is an example of learning data set editing processing in Example 1, showing a case where the label of the learning data does not match the inference result.
FIG. 23 is a diagram showing an example of learning data set editing processing in Example 1.
FIG. 24 is a diagram showing an example of learning data set editing processing in Example 1.
FIG. 25 is a block diagram showing the functional configuration of the learning data set evaluation support device in Example 2.
FIG. 26 is a diagram showing a display example of the confidence of inference results in Example 2.
FIG. 27 is a diagram showing a display example of the stability of inference results in Example 2.
FIG. 28 is a diagram showing examples of a high-confidence threshold, a low-confidence threshold, and a low-stability threshold in Example 2.
FIG. 29 is a diagram showing an example of evaluation results of evaluation data in Example 2.

<Configuration example of machine learning data set evaluation support device>
Embodiments of the present invention will be described in detail below with reference to the drawings. FIG. 1 is a diagram showing an example of the functional configuration of the machine learning data set evaluation support device 100 of this embodiment. The machine learning data set evaluation support device 100 shown in FIG. 1 is a computer that enables the construction of a refined learning data set that avoids inappropriate learning and insufficient learning.

As shown in FIG. 1, the machine learning data set evaluation support device 100 of this embodiment has an evaluation data set holding unit 110, an inference execution unit 111, a trained AI model holding unit 112, a difference analysis unit 113, a data evaluation unit 114, a learning data set editing unit 115, and a learning data set holding unit 116.

Of these, the evaluation data set holding unit 110 holds an evaluation data set 125. The trained AI model holding unit 112 holds trained AI models 118. The learning data set holding unit 116 holds a learning data set 126 and label information 127.

The learning data set 126 is, for example, training data collected from the real world. The label information 127 is information on the output expected when the learning data set, which is the training data, is input to the AI model, that is, information on the correct labels.

The evaluation data set 125 is the data to be input to each of the AI models trained in this way, that is, the trained AI models 118. A plurality of inference results are thereby obtained from the trained AI models 118.

Specifically, the machine learning data set evaluation support device 100 can be assumed to be a server device, a personal computer, or the like.

<Hardware configuration>
The hardware configuration of the machine learning data set evaluation support device 100 of this embodiment is as shown in FIG. 2.

That is, the machine learning data set evaluation support device 100 includes a storage device 101, a memory 103, an arithmetic device 104, an input device 105, an output device 106, and a communication device 107.

Of these, the storage device 101 is composed of an appropriate non-volatile storage element such as an SSD (Solid State Drive) or a hard disk drive.

The memory 103 is composed of a volatile storage element such as a RAM.

The arithmetic device 104 is a CPU that executes the program 102 held in the storage device 101, for example by reading it into the memory 103, and thereby performs overall control of the device itself as well as various determination, calculation, and control processes.

The input device 105 is a device such as a keyboard, mouse, or microphone that accepts key input and voice input from the user.

The output device 106 is a device such as a display or speaker that outputs the results of processing by the arithmetic device 104.

The communication device 107 is assumed to be a network interface card or the like that connects to an appropriate network and handles communication with user terminals and other equipment. If the machine learning data set evaluation support device 100 is a stand-alone machine, the communication device 107 can be omitted.

In addition to the program 102 for implementing the functions required of the machine learning data set evaluation support device of this embodiment, the storage device 101 stores at least the evaluation data set 125 of the evaluation data set holding unit 110, the learning data set 126 and label information 127 of the learning data set holding unit 116, and the trained AI models 118 of the trained AI model holding unit 112. Details of these will be described later.

When executed by the arithmetic device 104, the program 102 implements the functions of the inference execution unit 111, the difference analysis unit 113, the data evaluation unit 114, and the learning data set editing unit 115.

Of these, the inference execution unit 111 inputs the evaluation data set held in the evaluation data set holding unit 110 to the plurality of trained AI models 118 and obtains a plurality of inference results.

The inference execution unit 111 may use, as the plurality of trained AI models 118, AI models trained with the learning data set or the model structure varied at training time.

The inference execution unit 111 may also use, as the plurality of trained AI models, trained AI models whose model structure is varied at inference time.

The inference execution unit 111 may execute the plurality of trained AI models sequentially to obtain the plurality of inference results.

Alternatively, the inference execution unit 111 may execute the plurality of trained AI models in parallel to obtain the plurality of inference results simultaneously.
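The difference between sequential and parallel execution of the trained AI models can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Model` type, the function names, and the toy threshold classifiers that stand in for the trained AI models 118 are not taken from the publication.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence

# Hypothetical model type: maps one evaluation sample to a class index.
Model = Callable[[Sequence[float]], int]


def infer_sequential(models: List[Model], sample: Sequence[float]) -> List[int]:
    """Run each trained model one after another on the same sample."""
    return [model(sample) for model in models]


def infer_parallel(models: List[Model], sample: Sequence[float]) -> List[int]:
    """Run all trained models concurrently; results keep the order of `models`."""
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        return list(pool.map(lambda model: model(sample), models))


# Toy stand-ins for trained AI models: threshold classifiers on the first feature.
models: List[Model] = [lambda x, t=t: int(x[0] > t) for t in (0.3, 0.5, 0.7)]

print(infer_sequential(models, [0.6]))  # [1, 1, 0]
print(infer_parallel(models, [0.6]))    # [1, 1, 0]
```

Both variants return one inference result per model for the same evaluation data; only the scheduling differs, so the later difference analysis can be written independently of which variant is used.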

The difference analysis unit 113 analyzes the differences among the plurality of inference results obtained by the inference execution unit 111 from the confidence and the stability of the inference results in each of the plurality of trained AI models 118.
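The publication leaves the concrete calculation of confidence and stability to FIG. 10, which is not reproduced in this passage. The following is therefore only a sketch under assumed definitions: each model is assumed to output a class-probability vector, confidence is taken as the mean probability the models assign to the majority label, and stability as the fraction of models that predict that label. The function name `analyze_difference` is hypothetical.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def analyze_difference(probabilities: List[Sequence[float]]) -> Tuple[int, float, float]:
    """Summarize the outputs of several models for one evaluation sample.

    Assumed definitions (not taken from the publication):
      confidence = mean probability assigned to the majority label
      stability  = fraction of models whose top prediction is the majority label
    """
    # Top-scoring class index for each model.
    predicted = [max(range(len(p)), key=p.__getitem__) for p in probabilities]
    label, votes = Counter(predicted).most_common(1)[0]
    confidence = sum(p[label] for p in probabilities) / len(probabilities)
    stability = votes / len(probabilities)
    return label, confidence, stability


# Three models, two classes: two models prefer class 0, one prefers class 1.
label, confidence, stability = analyze_difference([[0.9, 0.1], [0.8, 0.2], [0.4, 0.6]])
print(label, round(confidence, 3), round(stability, 3))
```

Other definitions, such as the variance of the outputs across models, would fit the same interface.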

The data evaluation unit 114 evaluates the evaluation data set 125 based on the result of the difference analysis obtained by the difference analysis unit 113.

The learning data set editing unit 115 edits the learning data set 126 and the label information 127 held in the learning data set holding unit 116 based on the evaluation result of the evaluation data set 125.

The learning data set editing unit 115 may delete from the learning data set 126 any learning data that is similar to evaluation data for which the confidence of the inference result obtained by the inference execution unit 111 is greater than a predetermined threshold and whose correct label differs from the inference result for that evaluation data.

The learning data set editing unit 115 may also delete from the learning data set 126 any learning data that is similar to evaluation data for which the confidence of the inference result obtained by the inference execution unit 111 is smaller than a predetermined threshold.

The learning data set editing unit 115 may also generate learning data similar to evaluation data for which the confidence of the inference result obtained by the inference execution unit 111 is greater than a predetermined threshold and the stability is smaller than a predetermined threshold, and add the generated data to the learning data set 126.
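The three editing rules can be sketched as a single decision function. This is an illustration under stated assumptions rather than the method of the publication: similarity is assumed to be Euclidean distance within a hypothetical `radius`, the threshold names `high_conf`, `low_conf`, and `low_stab` are invented for the example, and generation of similar data is reduced to a flag because the generation method is described elsewhere (FIG. 16).

```python
import math
from typing import List, Sequence, Tuple


def plan_edits(
    eval_item: Sequence[float],
    inferred_label: int,
    confidence: float,
    stability: float,
    train_data: List[Sequence[float]],
    train_labels: List[int],
    *,
    high_conf: float,
    low_conf: float,
    low_stab: float,
    radius: float,
) -> Tuple[List[int], bool]:
    """Return (indices of learning data to delete, whether to add similar data)."""
    delete: List[int] = []
    for i, (x, y) in enumerate(zip(train_data, train_labels)):
        # Assumed similarity test: Euclidean distance within `radius`.
        if math.dist(eval_item, x) > radius:
            continue
        if confidence > high_conf and y != inferred_label:
            delete.append(i)  # confident inference contradicts the stored label
        elif confidence < low_conf:
            delete.append(i)  # similar to evaluation data with low confidence
    # Confident but unstable: similar learning data should be generated and added.
    augment = confidence > high_conf and stability < low_stab
    return delete, augment


train_data = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0]]
train_labels = [0, 1, 1]
print(plan_edits([0.0, 0.0], 0, 0.95, 0.5, train_data, train_labels,
                 high_conf=0.9, low_conf=0.4, low_stab=0.6, radius=0.5))
# ([1], True)
```

Returning a plan instead of mutating the data set keeps the decision separate from the actual edit of the learning data set 126 and label information 127.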

Although it will be described later with reference to FIG. 25 rather than FIG. 1, the machine learning data set evaluation support device 100 also has an analysis result display unit 117 in addition to the units described above. The analysis result display unit 117 displays the analysis results of the difference analysis unit 113 on the output device 106.

<Flow example>
The actual procedure of the machine learning data set evaluation support method of this embodiment will be described below with reference to the drawings. The various operations corresponding to the machine learning data set evaluation support method described below are implemented by a program that the machine learning data set evaluation support device 100 reads into memory or the like and executes. This program consists of code for performing the various operations described below.

FIG. 3 is a diagram showing an example flow of the machine learning data set evaluation support method of this embodiment. In this case, the machine learning data set evaluation support device 100 acquires the trained AI models 118 (301). The trained AI models 118 are held in advance in the trained AI model holding unit 112 of the storage device 101.

Next, the inference execution unit 111 of the machine learning data set evaluation support device 100 inputs one piece of evaluation data from the evaluation data set 125 of the evaluation data set holding unit 110 to the trained AI models 118 obtained in step 301 and executes inference processing (302). Details of this inference processing will be described later with reference to FIGS. 4 to 8.

Next, the difference analysis unit 113 of the machine learning data set evaluation support device 100 analyzes the confidence and stability of the results of the inference processing in step 302 and records them in, for example, the memory 103 (303).

Next, the machine learning data set evaluation support device 100 determines whether the evaluation data processed in the preceding steps was the last of the unprocessed evaluation data included in the evaluation data set 125 (304).

If the result of this determination is that it was not the last evaluation data (304: NO), the machine learning data set evaluation support device 100 returns the processing to step 302.

On the other hand, if the result of the determination is that it was the last evaluation data (304: YES), the data evaluation unit 114 of the machine learning data set evaluation support device 100 evaluates the evaluation data and records the result in, for example, the memory 103 (305). Details of the processing of the data evaluation unit 114 will be described later with reference to FIG. 12 and other drawings.

The learning data set editing unit 115 of the machine learning data set evaluation support device 100 then edits the learning data set 126 and the label information 127 of the learning data set holding unit 116 (306) and ends this flow. Details of the processing of the learning data set editing unit 115 will be described later with reference to FIG. 14 and other drawings. The processing up to this point produces a refined learning data set that avoids inappropriate learning and insufficient learning.
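The control flow of steps 302 to 306 can be summarized in a short sketch. The function `run_flow` and the toy `analyze`, `evaluate`, and `edit` callables are hypothetical placeholders for the difference analysis unit 113, the data evaluation unit 114, and the learning data set editing unit 115; only the ordering of the steps follows the text.

```python
from collections import Counter
from typing import Any, Callable, List, Sequence


def run_flow(
    models: Sequence[Callable[[Any], Any]],
    evaluation_set: Sequence[Any],
    analyze: Callable[[List[Any]], Any],
    evaluate: Callable[[List[Any]], Any],
    edit: Callable[[Any], Any],
) -> Any:
    """Mirror the loop of FIG. 3: infer and analyze per item, then evaluate and edit."""
    analyses: List[Any] = []
    for sample in evaluation_set:                      # loop closed by step 304
        results = [model(sample) for model in models]  # step 302: inference
        analyses.append(analyze(results))              # step 303: difference analysis
    evaluation = evaluate(analyses)                    # step 305: data evaluation
    return edit(evaluation)                            # step 306: data set editing


# Toy stand-ins (assumptions for illustration only).
models = [lambda x, t=t: int(x > t) for t in (0.3, 0.5, 0.7)]
agreement = lambda results: Counter(results).most_common(1)[0][1] / len(results)
flag_unstable = lambda analyses: [i for i, a in enumerate(analyses) if a < 1.0]

print(run_flow(models, [0.1, 0.6, 0.9], agreement, flag_unstable, lambda flagged: flagged))
# [1]
```

In the toy run only the second evaluation item, on which the three models disagree, is flagged for attention.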

<Flow example 1 of the inference execution unit>
Next, FIG. 4 shows an example flow of processing in which the inference execution unit 111 of this embodiment generates a plurality of AI models by training with varied learning data sets.

In this case, the inference execution unit 111 of the machine learning data set evaluation support device 100 acquires an untrained AI model from, for example, the input device 105 (401).

Next, the inference execution unit 111 sets the counter i to 1 (402).

Next, the inference execution unit 111 acquires a plurality of learning data items from the learning data set 126 of the learning data set holding unit 116 (403).

Next, the inference execution unit 111 acquires, from the label information 127 of the learning data set holding unit 116, the labels of the learning data obtained in step 403 (404).

Next, the inference execution unit 111 trains the untrained AI model obtained in step 401 using the learning data and labels obtained in steps 403 and 404 as teacher data (405).

The inference execution unit 111 then stores the trained AI model 118 obtained by the training in step 405 in the trained AI model holding unit 112 (406).

Here, the inference execution unit 111 of the machine learning data set evaluation support device 100 increments the value of the counter i (407) and proceeds to step 408.

Next, the inference execution unit 111 determines whether the counter i has exceeded a predetermined value T (408).

If the result of this determination is that the value of i has not exceeded T (408: NO), the inference execution unit 111 returns the processing to step 403. On returning to step 403, the next training target may be learning data in the learning data set 126 that has not yet been processed; alternatively, learning data may be added, changed, or deleted according to a predetermined rule or a user operation and then used as the training target.

On the other hand, if the result of the determination is that the value of i has exceeded T (408: YES), the inference execution unit 111 ends this flow.
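The loop of steps 402 to 408 can be sketched as follows. The nearest-centroid trainer is a deliberately trivial stand-in for step 405, and drawing a random subset is only one assumed way of varying the learning data at step 403; the names `train_centroid_model`, `train_model_pool`, and `subset_size` are hypothetical.

```python
import random
from typing import Callable, Dict, List, Sequence

Sample = Sequence[float]


def train_centroid_model(data: List[Sample], labels: List[int]) -> Callable[[Sample], int]:
    """Toy trainer (stand-in for step 405): nearest-centroid classifier."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for x, y in zip(data, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for k, v in enumerate(x):
            acc[k] += v
        counts[y] = counts.get(y, 0) + 1
    centroids = {y: [v / counts[y] for v in acc] for y, acc in sums.items()}

    def predict(x: Sample) -> int:
        # Pick the class whose centroid has the smallest squared distance to x.
        return min(centroids, key=lambda y: sum((a - b) ** 2 for a, b in zip(x, centroids[y])))

    return predict


def train_model_pool(data: List[Sample], labels: List[int], num_models: int,
                     subset_size: int, seed: int = 0) -> List[Callable[[Sample], int]]:
    """Train `num_models` models (T in the text), each on a different subset."""
    rng = random.Random(seed)
    pool = []
    i = 1                                                 # step 402
    while i <= num_models:                                # step 408: stop once i exceeds T
        idx = rng.sample(range(len(data)), subset_size)   # step 403: pick learning data
        subset = [data[j] for j in idx]
        subset_labels = [labels[j] for j in idx]          # step 404: matching labels
        pool.append(train_centroid_model(subset, subset_labels))  # steps 405 and 406
        i += 1                                            # step 407
    return pool


data = [[0.0], [0.2], [1.0], [1.2]]
labels = [0, 0, 1, 1]
pool = train_model_pool(data, labels, num_models=3, subset_size=3)
print([model([0.1]) for model in pool])  # [0, 0, 0]
```

Because each model sees a slightly different subset, its decision boundary shifts slightly, which is what later makes differences among the inference results informative.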

<Flow example 2 of the inference execution unit>
Next, FIG. 5 shows an example flow of processing in which the inference execution unit 111 of this embodiment obtains AI models trained with varied model structures.

この場合、機械学習データセット評価支援装置１００の推論実行部１１１は、例えば、入力装置１０５から学習前ＡＩモデルを取得する（５０１）。 In this case, the inference execution unit 111 of the machine learning dataset evaluation support device 100 acquires the pre-learning AI model from the input device 105, for example (501).

続いて、機械学習データセット評価支援装置１００の推論実行部１１１は、引数ｉに１をセットする（５０２）。 Subsequently, the inference execution unit 111 of the machine learning dataset evaluation support device 100 sets 1 to the argument i (502).

次に、機械学習データセット評価支援装置１００の推論実行部１１１は、学習データセット保持部１１５の学習データセット１２６から、複数の学習データを取得する（５０３）。 Next, the inference executing unit 111 of the machine learning dataset evaluation support device 100 acquires a plurality of learning data from the learning dataset 126 of the learning dataset holding unit 115 (503).

続いて、機械学習データセット評価支援装置１００の推論実行部１１１は、学習データセット保持部１１５のラベル情報１２７から、上述のステップ５０３で得た学習データのラベルを取得する（５０４）。 Subsequently, the inference execution unit 111 of the machine learning dataset evaluation support device 100 acquires the label of the learning data obtained in step 503 from the label information 127 of the learning dataset holding unit 115 (504).

Next, the inference execution unit 111 of the machine learning dataset evaluation support device 100 modifies the structure of the pre-learning AI model obtained in step 501 (505). This modification can be, for example, randomly deleting one node of a predetermined layer (for example, one of the intermediate layers) of the neural network that constitutes the AI model, or deleting one edge between that node and a predetermined node of another layer.

The inference execution unit 111 then trains the pre-learning AI model whose structure was modified in step 505 by supplying the learning data and labels obtained in steps 503 and 504 as teacher data (506).

The inference execution unit 111 of the machine learning dataset evaluation support device 100 then stores the trained AI model 118 obtained by the training in step 506 in the trained AI model holding unit 112 (507).

Here, the inference execution unit 111 increments the value of the loop counter i (508) and proceeds to step 509.

Next, the inference execution unit 111 determines whether the loop counter i has exceeded a predetermined value T (509).

If the determination shows that the value of i has not exceeded T (509: NO), the inference execution unit 111 returns the processing to step 505. If the value of i has exceeded T (509: YES), the inference execution unit 111 ends this flow.
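The loop of steps 502 to 509 can be sketched as follows. This is a minimal illustration, not the implementation described in the publication: the dictionary-based `structure` representation, the `train` callback, and the choice to apply each modification to a fresh copy of the original pre-learning model are all assumptions made for the example.

```python
import copy
import random


def remove_random_hidden_node(structure, rng):
    """Step 505 (assumed form): return a copy with one hidden node removed at random."""
    modified = copy.deepcopy(structure)
    layer = rng.randrange(len(modified["hidden_nodes"]))
    nodes = modified["hidden_nodes"][layer]
    if len(nodes) > 1:  # never empty a layer completely
        nodes.pop(rng.randrange(len(nodes)))
    return modified


def build_model_pool(pre_learning_model, data, labels, T, train, seed=0):
    """Steps 502-509: return T trained models, each with a modified structure.

    `train` is a hypothetical callback: train(structure, data, labels) -> model.
    """
    rng = random.Random(seed)
    pool = []  # stands in for the trained AI model holding unit 112
    i = 1  # step 502
    while True:
        structure = remove_random_hidden_node(pre_learning_model, rng)  # step 505
        pool.append(train(structure, data, labels))  # steps 506-507
        i += 1  # step 508
        if i > T:  # step 509
            return pool
```

Because every pass starts from the unmodified structure, the T models differ only in which single node was removed; a cumulative variant would instead pass the previous `structure` back into step 505.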

<Flow example 3 of the inference execution unit>
Next, FIG. 6 shows an example flow of the inference execution processing by the inference execution unit 111 of this embodiment, in which inference by the trained AI models is executed sequentially.

In this case, the inference execution unit 111 of the machine learning dataset evaluation support device 100 acquires one piece of evaluation data from the evaluation data set 125 of the evaluation data set holding unit 110 (601).

Subsequently, the inference execution unit 111 acquires T trained AI models 118 from the trained AI model holding unit 112 (602).

Next, the inference execution unit 111 sets the loop counter i to 1 (603).

Subsequently, the inference execution unit 111 of the machine learning dataset evaluation support device 100 inputs the evaluation data obtained in step 601 to the i-th trained AI model 118 and executes inference (604).

The inference execution unit 111 then stores the result of the inference in step 604 in, for example, the memory 103 (605).

The inference execution unit 111 of the machine learning dataset evaluation support device 100 then increments the value of the loop counter i by one (606).

Subsequently, the inference execution unit 111 determines whether the value of i has exceeded T (607).

If the determination shows that the value of i has not exceeded T (607: NO), the inference execution unit 111 returns the processing to step 604.

If the value of i has exceeded T (607: YES), the inference execution unit 111 ends this flow.
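A minimal sketch of this sequential flow is shown below. It assumes, purely for illustration, that each trained AI model can be called as a function on one piece of evaluation data; the function name is hypothetical.

```python
def infer_sequentially(models, sample):
    """Steps 603-607: run each of the T trained models on one sample, in order."""
    results = []  # stands in for the memory 103
    for model in models:  # i = 1 .. T
        results.append(model(sample))  # steps 604-605
    return results
```

Only one model is active at a time, so peak resource use is that of a single model.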

<Flow example 4 of the inference execution unit>
Next, FIG. 7 shows an example flow of the inference execution processing in this embodiment, in which inference by the trained AI models is executed in parallel.

In this case, the machine learning dataset evaluation support device 100 acquires one piece of evaluation data from the evaluation data set holding unit 110 (701).

Subsequently, the machine learning dataset evaluation support device 100 acquires T trained AI models 118 from the trained AI model holding unit 112 (702).

Next, the machine learning dataset evaluation support device 100 deploys the T trained AI models 118 obtained in step 702 to the inference execution unit 111 (703).

Subsequently, the inference execution unit 111 inputs the evaluation data obtained in step 701 to the T trained AI models 118 deployed in step 703 and executes inference (704).

Next, the inference execution unit 111 stores the results of the inference in step 704 in, for example, the memory 103 (705), and ends this flow.
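The parallel variant can be sketched with Python's standard thread pool. The use of threads is an assumption made for brevity (the publication does not say how the models are run in parallel), and models are again assumed to be plain callables.

```python
from concurrent.futures import ThreadPoolExecutor


def infer_in_parallel(models, sample, max_workers=None):
    """Steps 703-705: submit one inference per model, collect results in model order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(model, sample) for model in models]
        return [future.result() for future in futures]
```

For CPU-bound models written in pure Python, a process pool or batched execution on an accelerator would likely be needed to obtain a real speed-up.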

<Flow example 5 of the inference execution unit>
Next, FIG. 8 shows an example flow of the inference execution processing in this embodiment, in which an AI model whose structure is varied at inference time is used.

In this case, the inference execution unit 111 of the machine learning dataset evaluation support device 100 acquires one piece of evaluation data from the evaluation data set holding unit 110 (801).

The inference execution unit 111 also acquires a trained AI model 118 from the trained AI model holding unit 112 (802).

Next, the inference execution unit 111 sets the loop counter i to 1 (803).

Subsequently, the inference execution unit 111 partially deactivates the components of the trained AI model 118 obtained in step 802 (804). Here, the components are, for example, the nodes and edges that constitute the neural network, and partial deactivation can be, for example, setting the weight values of a node to zero or deleting an edge.

Next, the inference execution unit 111 inputs the evaluation data obtained in step 801 to the trained AI model 118 that was partially deactivated in step 804 and executes inference (805).

The inference execution unit 111 then stores the result of the inference in step 805 in, for example, the memory 103 (806).

Subsequently, the inference execution unit 111 of the machine learning dataset evaluation support device 100 increments the value of the loop counter i (807) and determines whether the value of i has exceeded the predetermined value T (808).

If the determination shows that the value of i has not exceeded T (808: NO), the inference execution unit 111 returns the processing to step 804.

If the value of i has exceeded T (808: YES), the inference execution unit 111 ends this flow.
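The following sketch illustrates steps 803 to 808 on a toy model. The logistic-regression `predict` function, the `drop_rate` parameter, and the choice to zero individual weights at random are assumptions for illustration; the publication only states that nodes or edges are partially deactivated.

```python
import math
import random


def predict(weights, bias, x):
    """Toy stand-in for a trained model: logistic regression on feature vector x."""
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    return 1.0 / (1.0 + math.exp(-z))


def infer_with_deactivation(weights, bias, x, T, drop_rate=0.2, seed=0):
    """Steps 803-808: T inferences, each with a random subset of weights zeroed."""
    rng = random.Random(seed)
    results = []  # stands in for the memory 103
    for _ in range(T):  # i = 1 .. T
        masked = [0.0 if rng.random() < drop_rate else w for w in weights]  # step 804
        results.append(predict(masked, bias, x))  # steps 805-806
    return results
```

A single stored model yields T different inference results, which is what allows this variant to reduce the number of models that must be trained and held.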

<Flow example of the difference analysis unit>
Next, FIG. 9 shows an example flow of the inference result difference analysis processing in the difference analysis unit 113 of this embodiment. FIG. 10 shows an example of how the confidence and stability of the inference results are calculated in this embodiment.

In this case, the difference analysis unit 113 of the machine learning dataset evaluation support device 100 acquires from the memory 103 the T inference results produced by the inference execution unit 111 (901).

The difference analysis unit 113 of the machine learning dataset evaluation support device 100 then calculates the confidence of the inference results obtained in step 901 and stores it in, for example, the memory 103 (902).

Next, the machine learning dataset evaluation support device 100 calculates the stability of the inference results obtained in step 901, stores it in, for example, the memory 103 (903), and ends this flow.

As a method by which the difference analysis unit 113 calculates the confidence and stability, for example, as shown in FIG. 10, when the inference result of a classification problem is given by formula 1201 and the mean of the probabilities of the inference results of the T trained AI models 118 is given by formula 1202, the confidence can be calculated by formula 1203 and the stability by formula 1204.

Similarly, when the inference result of a regression problem is given by formula 1205 and the mean of the probabilities of the inference results of the T trained AI models 118 is given by formula 1206, the confidence can be calculated by formula 1207 and the stability by formula 1208.
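Formulas 1201 to 1208 appear only in FIG. 10 and are not reproduced in the text, so the following is merely an illustrative sketch of how a confidence and a stability might be derived from the T probabilities that one piece of evaluation data belongs to class k. The two definitions used here (distance of the mean probability from 0.5, and one minus twice the population standard deviation) are assumptions chosen for the example and should not be read as formulas 1203 and 1204.

```python
from statistics import fmean, pstdev


def mean_probability(probs):
    """Mean over the T models of the probability of 'class k'."""
    return fmean(probs)


def confidence(probs):
    """Assumed measure: distance of the mean probability from 0.5, scaled to [0, 1].

    High both when the models agree on 'class k' and when they agree on 'not class k'.
    """
    return abs(2.0 * fmean(probs) - 1.0)


def stability(probs):
    """Assumed measure: 1 minus twice the population standard deviation, in [0, 1]."""
    return 1.0 - 2.0 * pstdev(probs)
```

With these definitions, T models that all output 0.9 give a high confidence and a stability of 1, whereas models split between 0 and 1 give a confidence and a stability of 0.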

The confidence and stability calculated as described above can be output, for example, in the form shown in FIG. 11. In the confidence distribution diagram 1401, each point represents, for the multiple inference results obtained for one piece of evaluation data, the mean probability of being judged to be class k and the confidence of the inference result. Likewise, in the stability distribution diagram 1402, each point represents, for the multiple inference results obtained for one piece of evaluation data, the mean probability of being judged to be class k and the stability of the inference result.

<Flow example of the data evaluation unit>
Next, FIG. 12 shows an example flow of the data evaluation unit 114 in this embodiment. In this case, the data evaluation unit 114 of the machine learning dataset evaluation support device 100 sets a high-confidence threshold, a low-confidence threshold, and a low-stability threshold (1001).

The thresholds can be set, for example, in any of the following ways: using default values given in advance; setting them mechanically so that the classified evaluation data make up a fixed proportion; setting them mechanically so that regions of high data density are separated from regions of low data density; or having an operator set them with reference to the difference analysis results (see FIG. 11).
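The second option, setting a threshold mechanically so that a fixed proportion of the evaluation data is classified, could look like the sketch below. The function names and the rounding rule are hypothetical; ties in the values may make the selected proportion only approximate.

```python
def threshold_for_top_fraction(values, fraction):
    """Threshold t such that roughly `fraction` of the values satisfy value >= t."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values, reverse=True)
    k = max(1, round(fraction * len(ordered)))
    return ordered[k - 1]


def threshold_for_bottom_fraction(values, fraction):
    """Threshold t such that roughly `fraction` of the values satisfy value < t."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    k = min(len(ordered) - 1, max(1, round(fraction * len(ordered))))
    return ordered[k]
```

The first function suits the high-confidence threshold (data at or above it are extracted), and the second suits the low-confidence and low-stability thresholds (data below them are extracted).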

As a conceptual example of concrete threshold setting, as shown in the distribution diagram 1401 of FIG. 13, the data around "1" have a high confidence that the inference result is "class k", and the data around "2" have a high confidence that the inference result is "not class k". The high-confidence threshold is therefore set so as to cover the data around "1" and "2".

Also, as shown in the distribution diagram 1402, in a situation where data with a low inference-result confidence give unstable inference results and are difficult to recognize reliably, a low-confidence threshold satisfying high-confidence threshold > low-confidence threshold is set from the distribution of the low-confidence data.

Also, as shown in the distribution diagram 1403, in a situation where the inference results for data with a low inference-result stability may converge stably through further learning, the low-stability threshold is set from the distribution of the low-stability data.

The data evaluation unit 114 of the machine learning dataset evaluation support device 100 then extracts and records the evaluation data whose confidence is equal to or higher than the high-confidence threshold set in step 1001 (1002).

Next, the data evaluation unit 114 extracts and records the evaluation data whose confidence is lower than the low-confidence threshold set in step 1001 (1003).

Subsequently, the data evaluation unit 114 extracts and records the evaluation data whose stability is lower than the low-stability threshold set in step 1001 (1004), and ends this flow.
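Steps 1002 to 1004 amount to three independent filters over the evaluation data. The sketch below assumes each record is a hypothetical `(data_id, confidence, stability)` tuple; the record layout and the returned dictionary keys are illustrative only.

```python
def evaluate_data(records, high_conf, low_conf, low_stab):
    """Steps 1002-1004: group evaluation data by the three thresholds of step 1001."""
    if not high_conf > low_conf:
        raise ValueError("high-confidence threshold must exceed low-confidence threshold")
    records = list(records)  # allow any iterable; it is scanned three times
    return {
        "high_confidence": [r[0] for r in records if r[1] >= high_conf],  # step 1002
        "low_confidence": [r[0] for r in records if r[1] < low_conf],     # step 1003
        "low_stability": [r[0] for r in records if r[2] < low_stab],      # step 1004
    }
```

A record can appear in more than one group, for example high confidence together with low stability, which is the case that leads to data being added later in the editing flow.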

<Flow example of the learning data set editing unit>
Next, FIG. 14 shows an example of the processing flow in the learning data set editing unit 115 in this embodiment. In this case, the learning data set editing unit 115 of the machine learning dataset evaluation support device 100 acquires evaluation data from the evaluation data set holding unit 110 (1101).

Subsequently, the learning data set editing unit 115 of the machine learning dataset evaluation support device 100 determines whether the evaluation data shows low confidence (1102).

If the determination shows that the evaluation data shows low confidence (1102: YES), the learning data set editing unit 115 acquires, from the learning data set 126 of the learning data set holding unit 116, all learning data similar to the evaluation data obtained in step 1101 (1109).

The learning data set editing unit 115 also acquires, from the label information 127 of the learning data set holding unit 116, the labels of the learning data obtained in step 1109 (1110), and advances the processing to step 1111.

On the other hand, if the determination shows that the evaluation data does not show low confidence (1102: NO), the learning data set editing unit 115 of the machine learning dataset evaluation support device 100 determines whether the evaluation data shows high confidence (1103).

If the determination shows that the evaluation data does not show high confidence (1103: NO), the learning data set editing unit 115 advances the processing to step 1116.

If the determination shows that the evaluation data shows high confidence (1103: YES), the learning data set editing unit 115 determines whether the evaluation data shows low stability (1104).

If the determination shows that the evaluation data does not show low stability (1104: NO), the learning data set editing unit 115 acquires, from the learning data set 126 of the learning data set holding unit 116, all learning data similar to the evaluation data (1105).

As a method for judging the "similarity" mentioned above, one using, for example, formulas 1501 and 1502 shown in FIG. 15 can be assumed. Alternatively, as shown in FIG. 16, a configuration 1503 can also be assumed in which an AI model trained on similar data and their degrees of similarity is prepared and used to judge the degree of similarity.

Subsequently, the learning data set editing unit 115 of the machine learning dataset evaluation support device 100 acquires, from the label information 127 of the learning data set holding unit 116, the labels of the learning data obtained in step 1105 (1106).

The learning data set editing unit 115 then compares the labels acquired in step 1106 with the inference result for the evaluation data obtained in step 1101 (1107).

If the comparison shows that the label and the inference result match (1108: YES), the learning data set editing unit 115 advances the processing to step 1116.

If the label and the inference result do not match (1108: NO), the learning data set editing unit 115 deletes that learning data from the learning data set 126 of the learning data set holding unit 116 (1111).

The learning data set editing unit 115 also deletes the label of that learning data from the label information 127 of the learning data set holding unit 116 (1112), and advances the processing to step 1116.

In this way, when the confidence and stability of the evaluation data are both high and the label of learning data similar to the evaluation data does not match the inference result for the evaluation data, deleting that learning data and its label makes it possible to exclude learning data whose correct label is inappropriate (see FIG. 22). The learning data set editing unit 115 may display information on the learning data and labels to be deleted on the output device 106.

When the confidence of the evaluation data is low, deleting the learning data similar to that evaluation data makes it possible to eliminate learning data with different correct labels that lie inadvertently close to one another, and learning data in which features other than those intended for learning are dominant (see FIG. 23). The learning data set editing unit 115 may display information on the learning data and labels to be deleted on the output device 106.

When the confidence of the evaluation data is high and its stability is low, missing learning data can be supplemented (see FIG. 24). The learning data set editing unit 115 may display information on the learning data to be supplemented on the output device 106.

If the determination in step 1104 described above shows that the evaluation data shows low stability (1104: YES), the learning data set editing unit 115 of the machine learning dataset evaluation support device 100 generates data similar to the evaluation data (1113).

As shown in FIG. 16, the similar data can be generated by, for example, a method 1601 of generating data in which the feature values of the source data are slightly changed, a method 1602 of generating data in which the source data is slightly rotated, deformed, or discolored, or a method 1603 of generating data by using a data-generating neural network.

The learning data set editing unit 115 of the machine learning dataset evaluation support device 100 then adds the learning data generated in step 1113 to the learning data set 126 of the learning data set holding unit 116 (1114).

Subsequently, the learning data set editing unit 115 adds the labels for the learning data generated in step 1113 to the label information 127 of the learning data set holding unit 116 (1115), and advances the processing to step 1116.
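As an illustration of steps 1113 to 1115, the sketch below uses the first generation method (slightly changing the feature values of the source data). The Gaussian noise, its `scale`, and the decision to let the caller supply the label for the generated data are assumptions; the text does not specify how the label of generated data is determined.

```python
import random


def generate_similar(features, n, scale=0.01, seed=0):
    """Step 1113 (assumed form): n copies of `features` with small Gaussian noise added."""
    rng = random.Random(seed)
    return [[x + rng.gauss(0.0, scale) for x in features] for _ in range(n)]


def augment_with_similar(dataset, labels, features, label, n=3):
    """Steps 1114-1115: append generated data and their labels in lockstep."""
    for new_features in generate_similar(features, n):
        dataset.append(new_features)  # step 1114
        labels.append(label)          # step 1115
```

Keeping the two lists the same length mirrors the way the learning data set 126 and the label information 127 are always updated together in the flow.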

The learning data set editing unit 115 of the machine learning dataset evaluation support device 100 then determines whether no unprocessed evaluation data remain in the evaluation data set 125 (1116). If unprocessed data remain, that is, if the evaluation data handled in the current pass is not the last evaluation data (1116: NO), the learning data set editing unit 115 returns the processing to step 1101.

If no unprocessed data remain, that is, if the evaluation data handled in the current pass was the last evaluation data (1116: YES), the learning data set editing unit 115 ends this flow.
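The branching of FIG. 14 as described above can be condensed into the following sketch. The record layout, the `is_similar` and `generate_similar` callbacks, the per-item label comparison, and the use of the inference result as the label of generated data are all assumptions made to keep the example self-contained.

```python
def edit_learning_dataset(evaluations, train_set, is_similar, generate_similar):
    """Steps 1101-1116 over all evaluation data; `train_set` is edited in place.

    evaluations: dicts with keys 'x', 'prediction', 'low_conf', 'high_conf', 'low_stab'
    train_set:   list of (x, label) pairs
    """
    for ev in evaluations:  # steps 1101 / 1116
        if ev["low_conf"]:  # 1102: YES -> delete all similar data (1109-1112)
            train_set[:] = [(x, y) for x, y in train_set
                            if not is_similar(x, ev["x"])]
        elif not ev["high_conf"]:  # 1103: NO -> nothing to edit
            continue
        elif ev["low_stab"]:  # 1104: YES -> add generated data (1113-1115)
            train_set.extend((x, ev["prediction"])
                             for x in generate_similar(ev["x"]))
        else:  # 1104: NO -> delete similar data with mismatching label (1105-1112)
            train_set[:] = [(x, y) for x, y in train_set
                            if not (is_similar(x, ev["x"]) and y != ev["prediction"])]
    return train_set
```

Checking low confidence first reproduces the order of steps 1102 and 1103, so evaluation data that are neither low nor high in confidence leave the learning data set untouched.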

<Specific example: Example 1>
Next, the machine learning dataset evaluation support technology of this embodiment is described using more specific examples. FIG. 17 is a block diagram showing the functional configuration of the machine learning dataset evaluation support device 100 in Example 1.

The basic configuration is the same as that shown in FIG. 1. The difference is that the AI model held by the trained AI model holding unit 112 of the inference execution unit 111 is a sign identification AI model 118A. In other words, the trained AI model in this case identifies a road sign from an image of that road sign.

FIG. 18 therefore shows examples of four pieces of evaluation data 1801, numbered 1 to 4, for a road sign showing the value "60", together with their inference results 1802 and difference analysis results 1803. FIG. 19 shows examples of the thresholds 1901 to 1903 applied to those difference analysis results.

FIG. 20 shows the evaluation results 2001 for each piece of evaluation data. The evaluation result for evaluation data 1 is "the confidence is equal to or higher than the high-confidence threshold and the stability is equal to or higher than the low-stability threshold, so both the confidence and the stability are high". The evaluation result for evaluation data 2 is "the confidence is lower than the low-confidence threshold, so the confidence is low". The evaluation result for evaluation data 3 is "the stability is lower than the low-stability threshold, so the stability is low". The evaluation result for evaluation data 4 is "the confidence is equal to or higher than the high-confidence threshold and the stability is equal to or higher than the low-stability threshold, so both the confidence and the stability are high".

Based on these evaluation results, for evaluation data 1, similar data whose labels match the inference result are kept, as shown in FIG. 21. For evaluation data 4, similar data whose labels do not match the inference result are deleted together with their labels, as shown in FIG. 22. For evaluation data 2, the learning data are deleted together with their labels, as shown in FIG. 23. For evaluation data 3, similar data are added, as shown in FIG. 24.

<Specific example: Example 2>
A form in which the machine learning dataset evaluation support device 100 further includes an analysis result display unit 117, as shown in FIG. 25, can also be assumed. In this case, the analysis result display unit 117 causes the output device 106 to display distribution diagrams 2601 and 2602 showing the confidence of the inference results, as shown in FIG. 26. Similarly, the analysis result display unit 117 causes the output device 106 to display distribution diagrams 2701 and 2702 showing the stability of the inference results, as shown in FIG. 27.

As shown in FIG. 28, the analysis result display unit 117 also causes the output device 106 to display the high-confidence, low-confidence, and low-stability thresholds 2801 to 2803, which are based on the inference results shown in FIGS. 26 and 27.

As shown in FIG. 29, the analysis result display unit 117 also causes the output device 106 to display the evaluation results 2901 for the evaluation data.

The best mode for carrying out the present invention has been described specifically above, but the present invention is not limited to it and can be modified in various ways without departing from its gist.

According to this embodiment, analyzing the inference characteristics of the trained AI models in terms of confidence and stability makes it possible to construct a learning data set from which data that cause inappropriate learning have been removed and to which data for insufficiently learned cases have been added. This in turn makes it possible to construct a refined learning data set that avoids inappropriate learning and insufficient learning.

本明細書の記載により、少なくとも次のことが明らかにされる。すなわち、本実施形態の機械学習データセット評価支援装置において、前記解析結果を出力装置に表示する処理をさらに実行するものである、としてもよい。 At least the following will be clarified by the description of this specification. That is, the machine learning dataset evaluation support device of the present embodiment may further execute a process of displaying the analysis result on an output device.

これによれば、複数の推論結果の間の差異を、推論結果の確信度及び推論結果の安定度の観点でユーザに明示できる。 According to this, the difference between a plurality of inference results can be clearly indicated to the user in terms of the certainty of the inference result and the stability of the inference result.

また、本実施形態の機械学習データセット評価支援装置において、前記推論実行部は、前記複数の学習済みＡＩモデルとして、学習時に学習データセットないしモデル構造を変化させて訓練したＡＩモデルを用いる、としてもよい。 Further, in the machine learning dataset evaluation support device of the present embodiment, the inference execution unit uses an AI model trained by changing the learning dataset or model structure at the time of learning as the plurality of trained AI models. good too.

これによれば、推論結果として、学習データセットやモデル構造のゆらぎを踏まえたものを効率良く得られることとなる。 According to this, it is possible to efficiently obtain an inference result based on fluctuations in the learning data set and the model structure.

また、本実施形態の機械学習データセット評価支援装置において、前記推論実行部は、前記複数の学習済みＡＩモデルとして、推論時にモデル構造を変化させた学習済みＡＩモデルを用いる、としてもよい。 Further, in the machine learning dataset evaluation support device of the present embodiment, the inference execution unit may use, as the plurality of trained AI models, trained AI models whose model structure is changed during inference.

これによれば、１つのモデルを順次変化させることによるモデル数の削減、モデルの推論結果の履歴をモデルの変化に反映することによる推論結果の解析の高精度化を図ることができる。ひいては、不適切な学習や学習不足を回避する、より洗練された学習データセットの構成が可能となる。 According to this, it is possible to reduce the number of models by sequentially changing one model, and to improve the accuracy of the analysis of the inference results by reflecting the history of the inference results of the models in the model changes. As a result, it is possible to construct a more sophisticated learning data set that avoids inappropriate learning and insufficient learning.

Further, in the machine learning dataset evaluation support device of the present embodiment, the inference execution unit may execute the plurality of trained AI models sequentially to obtain the plurality of inference results.

Because only one model is executed at a time, this reduces the computational resources required for inference execution.

Further, in the machine learning dataset evaluation support device of the present embodiment, the inference execution unit may execute the plurality of trained AI models in parallel to obtain the plurality of inference results simultaneously.

Because the plurality of inference results is obtained simultaneously, this shortens the processing time.
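The sequential and parallel execution modes described above can be contrasted with a small sketch. Here each "model" is a plain Python callable, and the thread pool from the standard library stands in for whatever parallel execution mechanism an actual device would use; the function names and the threshold models are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

def infer_all(model, evaluation_data):
    # Apply one model to every item of the evaluation dataset.
    return [model(x) for x in evaluation_data]

def run_sequential(models, evaluation_data):
    # One model at a time: only a single model is active at any moment.
    return [infer_all(model, evaluation_data) for model in models]

def run_parallel(models, evaluation_data, max_workers=None):
    # All models are submitted at once; results are collected in model order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(infer_all, model, evaluation_data) for model in models]
        return [future.result() for future in futures]

def make_threshold_model(threshold):
    # Hypothetical stand-in for a trained AI model: a fixed decision threshold.
    return lambda x: int(x >= threshold)

models = [make_threshold_model(t) for t in (0.3, 0.5, 0.7)]
data = [0.1, 0.4, 0.6, 0.9]
print(run_sequential(models, data))
print(run_parallel(models, data))
```

Both functions return the same results in the same order, so the choice between them is a trade-off between peak resource use and elapsed time. In CPython, threads mainly help when the inference code releases the global interpreter lock; a process pool may be a better fit otherwise.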

Further, in the machine learning dataset evaluation support device of the present embodiment, the learning dataset editing unit may further execute a process of deleting, from the learning dataset, learning data that is similar to evaluation data for which the certainty of the inference result is greater than a predetermined threshold and whose correct label differs from the inference result for that evaluation data.

This makes it possible to eliminate learning data whose correct labels are inappropriate.
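The deletion rule above can be sketched as follows. The similarity measure (Euclidean distance within a fixed `radius`), the data layout, and the function name `remove_suspect_labels` are assumptions made for this example; the passage does not state how similarity is measured.

```python
import math

def remove_suspect_labels(training, evaluated, conf_threshold, radius):
    """training:  list of (features, correct_label)
    evaluated: list of (features, inferred_label, certainty)"""
    # Keep only evaluation items whose certainty exceeds the threshold.
    confident = [(f, inferred) for f, inferred, conf in evaluated if conf > conf_threshold]
    kept = []
    for features, label in training:
        # Suspect: close to a confidently inferred item, yet labeled differently.
        suspect = any(
            math.dist(features, ef) <= radius and label != inferred
            for ef, inferred in confident
        )
        if not suspect:
            kept.append((features, label))
    return kept

training = [((0.0, 0.0), "cat"), ((0.1, 0.0), "dog"), ((5.0, 5.0), "dog")]
evaluated = [((0.0, 0.1), "cat", 0.95), ((5.0, 5.1), "cat", 0.40)]
print(remove_suspect_labels(training, evaluated, conf_threshold=0.8, radius=0.5))
```

In this toy data, the second training item sits next to a confidently inferred "cat" but is labeled "dog", so it is the only item removed; the low-certainty evaluation item does not trigger any deletion.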

Further, in the machine learning dataset evaluation support device of the present embodiment, the learning dataset editing unit may further execute a process of deleting, from the learning dataset, learning data similar to evaluation data for which the certainty of the inference result is smaller than a predetermined threshold.

This makes it possible to eliminate learning data with different correct labels that inadvertently lie close to one another, and learning data dominated by features other than those the learning is intended to capture.
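A minimal sketch of this second deletion rule follows. As before, Euclidean distance within a fixed `radius` is an assumed similarity measure, and the function name is hypothetical. The sketch returns the removed items as well as the kept ones so that a user could review them, which is a design choice of this example rather than something stated above.

```python
import math

def split_near_low_confidence(training, evaluated, conf_threshold, radius):
    """training:  list of (features, correct_label)
    evaluated: list of (features, certainty)"""
    # Evaluation items whose certainty falls below the threshold.
    uncertain = [f for f, conf in evaluated if conf < conf_threshold]
    kept, removed = [], []
    for features, label in training:
        # A training item is dropped when it lies near any uncertain item.
        near = any(math.dist(features, u) <= radius for u in uncertain)
        (removed if near else kept).append((features, label))
    return kept, removed

training = [((1.0,), "a"), ((1.1,), "b"), ((3.0,), "b")]
evaluated = [((1.05,), 0.3), ((3.0,), 0.9)]
kept, removed = split_near_low_confidence(training, evaluated, 0.5, 0.2)
print(kept, removed)
```

Here two differently labeled training items sit on either side of a low-certainty evaluation item, which is the "inadvertent proximity" situation; both are separated out.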

Further, in the machine learning dataset evaluation support device of the present embodiment, the learning dataset editing unit may further execute a process of generating learning data similar to evaluation data for which the certainty of the inference result is greater than a predetermined threshold and the stability is smaller than a predetermined threshold, and adding the generated data to the learning dataset.

This makes it possible to supplement learning data that is lacking.
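The generation step can be sketched as follows, under several assumptions that are not stated above: similar data is produced by adding small Gaussian noise to the features, the inferred class is attached as a provisional label, and the names and default parameters are hypothetical. In practice the generated items and their labels would presumably need review before being added.

```python
import random

def augment_near_unstable(evaluated, conf_threshold, stab_threshold,
                          copies=3, noise=0.05, seed=0):
    """evaluated: list of (features, inferred_label, certainty, stability)"""
    rng = random.Random(seed)
    generated = []
    for features, label, conf, stab in evaluated:
        # Target items: high certainty but low stability across the models.
        if conf > conf_threshold and stab < stab_threshold:
            for _ in range(copies):
                # Assumed notion of "similar": the same point with small jitter.
                jittered = tuple(x + rng.gauss(0.0, noise) for x in features)
                generated.append((jittered, label))
    return generated

evaluated = [
    ((0.5, 0.5), "a", 0.9, 0.4),  # certain but unstable: augmented
    ((2.0, 2.0), "b", 0.9, 0.9),  # certain and stable: skipped
    ((3.0, 3.0), "b", 0.3, 0.2),  # uncertain: skipped
]
new_items = augment_near_unstable(evaluated, conf_threshold=0.8, stab_threshold=0.6)
print(len(new_items))
```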

100 Machine learning dataset evaluation support device
101 Storage device
102 Program
103 Memory
104 Arithmetic device
105 Input device
106 Output device
107 Communication device
110 Evaluation dataset holding unit
111 Inference execution unit
112 Trained AI model holding unit
113 Difference analysis unit
114 Data evaluation unit
115 Learning dataset editing unit
116 Learning dataset holding unit
117 Analysis result display unit
118 Trained AI model
125 Evaluation dataset
126 Learning dataset
127 Label information

Claims

1. A machine learning dataset evaluation support device comprising:
a learning dataset holding unit that holds a learning dataset and label information used for supervised machine learning;
an inference execution unit that inputs a predetermined evaluation dataset to a plurality of trained AI models and obtains a plurality of inference results;
a difference analysis unit that analyzes the difference between the plurality of inference results from the certainty of the inference result and the stability of the inference result in each of the plurality of trained AI models;
a data evaluation unit that evaluates the evaluation dataset based on the analysis result of the difference; and
a learning dataset editing unit that edits the learning dataset and label information held in the learning dataset holding unit based on the evaluation result of the evaluation dataset.

2. The machine learning dataset evaluation support device according to claim 1, further executing a process of displaying the analysis result on an output device.

3. The machine learning dataset evaluation support device according to claim 1, wherein the inference execution unit uses, as the plurality of trained AI models, AI models trained with the learning dataset or the model structure varied at training time.

4. The machine learning dataset evaluation support device according to claim 1, wherein the inference execution unit uses, as the plurality of trained AI models, trained AI models whose model structure is changed at inference time.

5. The machine learning dataset evaluation support device according to claim 1, wherein the inference execution unit executes the plurality of trained AI models sequentially to obtain the plurality of inference results.

6. The machine learning dataset evaluation support device according to claim 1, wherein the inference execution unit executes the plurality of trained AI models in parallel to obtain the plurality of inference results simultaneously.

7. The machine learning dataset evaluation support device according to claim 1, wherein the learning dataset editing unit further executes a process of deleting, from the learning dataset, learning data that is similar to evaluation data for which the certainty of the inference result is greater than a predetermined threshold and whose correct label differs from the inference result for that evaluation data.

8. The machine learning dataset evaluation support device according to claim 1, wherein the learning dataset editing unit further executes a process of deleting, from the learning dataset, learning data similar to evaluation data for which the certainty of the inference result is smaller than a predetermined threshold.

9. The machine learning dataset evaluation support device according to claim 1, wherein the learning dataset editing unit further executes a process of generating learning data similar to evaluation data for which the certainty of the inference result is greater than a predetermined threshold and the stability is smaller than a predetermined threshold, and adding the generated learning data to the learning dataset.

10. A machine learning dataset evaluation support method, wherein an information processing device:
holds, in a storage device, a learning dataset and label information used for supervised machine learning; and
executes:
a process of obtaining a plurality of inference results by inputting a predetermined evaluation dataset into a plurality of trained AI models;
a process of analyzing the difference between the plurality of inference results from the certainty of the inference result and the stability of the inference result in each of the plurality of trained AI models;
a process of evaluating the evaluation dataset based on the analysis result of the difference; and
a process of editing the learning dataset and label information held in the storage device based on the evaluation result of the evaluation dataset.