WO2021079436A1 - 検出方法、検出プログラム及び情報処理装置 - Google Patents
検出方法、検出プログラム及び情報処理装置 Download PDFInfo
- Publication number
- WO2021079436A1 WO2021079436A1 PCT/JP2019/041547 JP2019041547W WO2021079436A1 WO 2021079436 A1 WO2021079436 A1 WO 2021079436A1 JP 2019041547 W JP2019041547 W JP 2019041547W WO 2021079436 A1 WO2021079436 A1 WO 2021079436A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- detection
- output result
- training data
- class
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
- G06F18/2178—Validation; Performance evaluation; Active pattern learning techniques based on feedback of a supervisor
- G06F18/2185—Validation; Performance evaluation; Active pattern learning techniques based on feedback of a supervisor the supervisor being an automated module, e.g. intelligent oracle
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
- G06F18/2193—Validation; Performance evaluation; Active pattern learning techniques based on specific statistical tests
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/2431—Multiple classes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/245—Classification techniques relating to the decision surface
- G06F18/2453—Classification techniques relating to the decision surface non-linear, e.g. polynomial classifier
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
- G06N5/045—Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
Definitions
- the present invention relates to a detection method and the like.
- system Since the machine learning model judges and classifies according to the teacher data learned at the time of system development, the accuracy of the machine learning model deteriorates if the tendency of the input data changes during the system operation.
- FIG. 27 is a diagram for explaining the deterioration of the machine learning model due to the change in the tendency of the input data.
- the machine learning model described here is a model that classifies the input data into one of the first class, the second class, and the third class, and is pre-learned based on the teacher data before the system operation. To do.
- Teacher data includes training data and validation data.
- distribution 1A shows the distribution of input data at the initial stage of system operation.
- Distribution 1B shows the distribution of the input data at the time when T1 time has passed from the initial stage of system operation.
- Distribution 1C shows the distribution of the input data when T2 hours have passed since the initial stage of system operation. It is assumed that the tendency (feature amount, etc.) of the input data changes with the passage of time. For example, if the input data is an image, the tendency of the input data changes according to the season and the time zone even if the same subject is captured.
- the determination boundary 3 indicates the boundary of the model application areas 3a to 3c.
- the model application area 3a is an area in which training data belonging to the first class is distributed.
- the model application area 3b is an area in which training data belonging to the second class is distributed.
- the model application area 3c is an area in which training data belonging to the third class is distributed.
- the asterisk is the input data belonging to the first class, and it is correct that it is classified into the model application area 3a when it is input to the machine learning model.
- the triangle marks are input data belonging to the second class, and it is correct that they are classified into the model application area 3b when input to the machine learning model.
- the circles are input data belonging to the third class, and it is correct that they are classified into the model application area 3a when they are input to the machine learning model.
- distribution 1A all input data is distributed in the normal model application area. That is, the input data of the star mark is located in the model application area 3a, the input data of the triangle mark is located in the model application area 3b, and the input data of the circle mark is located in the model application area 3c.
- the tendency of the input data changes further, and a part of the input data of the star mark moves to the model application area 3b across the determination boundary 3, and is not properly classified, and the correct answer rate. Is decreasing (the accuracy of the machine learning model is deteriorating).
- T 2 statistics Hotelling's T-square
- the data group of the input data and the normal data (training data) is analyzed by the principal component, and the T 2 statistic of the input data is calculated.
- the T 2 statistic is the sum of the squares of the distances from the origin of each standardized principal component to the data.
- the accuracy deterioration of the machine learning model is detected based on the change in the distribution of the T 2 statistic of the input data group. For example, the T 2 statistic of the input data group corresponds to the proportion of outlier data.
- an object of the present invention is to provide a detection method, a detection program, and an information processing device capable of detecting a deterioration in accuracy of a machine learning model.
- the computer executes the following processing.
- the computer inputs data to the first detection model among the multiple detection models that have learned the decision boundaries that classify the feature space of the data into multiple application areas based on multiple training data corresponding to multiple classes. If so, the first output result indicating which of the plurality of application areas the input data is located in is acquired.
- the computer acquires a second output result indicating which of the plurality of application areas the input data is located in. .. Based on the first output result and the second output result, the computer detects the data that causes the accuracy deterioration of the output result of the trained model based on the time change of the data to be streamed.
- FIG. 1 is a diagram for explaining a reference technique.
- FIG. 2 is a diagram for explaining a mechanism for detecting accuracy deterioration of the machine learning model to be monitored.
- FIG. 3 is a diagram (1) showing an example of a model application area according to the reference technique.
- FIG. 4 is a diagram (2) showing an example of a model application area according to the reference technique.
- FIG. 5 is a diagram (1) for explaining the processing of the information processing apparatus according to the present embodiment.
- FIG. 6 is a diagram (2) for explaining the processing of the information processing apparatus according to the present embodiment.
- FIG. 7 is a diagram for explaining the effect of the information processing apparatus according to the present embodiment.
- FIG. 8 is a functional block diagram showing the configuration of the information processing apparatus according to the present embodiment.
- FIG. 9 is a diagram showing an example of the data structure of the training data set.
- FIG. 10 is a diagram for explaining an example of a machine learning model.
- FIG. 11 is a diagram showing an example of the data structure of the inspector table.
- FIG. 12 is a diagram showing an example of the data structure of the training data table.
- FIG. 13 is a diagram showing an example of the data structure of the operation data table.
- FIG. 14 is a diagram showing an example of the classification surface of the inspector M0.
- FIG. 15 is a diagram comparing the classification planes of the inspectors M0 and M2.
- FIG. 16 is a diagram showing a classification surface of each inspector.
- FIG. 17 is a diagram showing an example of a classification surface in which the classification surfaces of all inspectors are overlapped.
- FIG. 18 is a diagram showing an example of the data structure of the output result table.
- FIG. 19 is a diagram showing an example of the data structure of the output result of the output result table.
- FIG. 20 is a diagram (1) for explaining the processing of the detection unit.
- FIG. 21 is a diagram showing changes in the operational data set over time.
- FIG. 22 is a diagram (2) for explaining the processing of the detection unit.
- FIG. 23 is a diagram showing an example of a graph of accuracy deterioration information.
- FIG. 24 is a flowchart (1) showing a processing procedure of the information processing apparatus according to the present embodiment.
- FIG. 25 is a flowchart (2) showing a processing procedure of the information processing apparatus according to the present embodiment.
- FIG. 26 is a diagram showing an example of a hardware configuration of a computer that realizes the same functions as the information processing apparatus according to the present embodiment.
- FIG. 27 is a diagram for explaining the deterioration of the machine learning model due to the change in the tendency of the input data.
- the accuracy deterioration of the machine learning model is detected by using a plurality of monitors in which the model application area is narrowed under different conditions.
- the observer is referred to as an "inspector”.
- FIG. 1 is a diagram for explaining a reference technique.
- the machine learning model 10 is a machine learning model that has been machine-learned using teacher data.
- teacher data includes training data and validation data.
- the training data is used when the parameters of the machine learning model 10 are machine-learned, and the correct answer label is associated with the training data.
- the verification data is data used when verifying the machine learning model 10.
- the inspectors 11A, 11B, and 11C have different decision boundaries because the model application area is narrowed under different conditions. Since the inspectors 11A to 11C have different determination boundaries, the output results may differ even if the same input data is input. In the reference technique, the accuracy deterioration of the machine learning model 10 is detected based on the difference in the output results of the inspectors 11A to 11C. In the example shown in FIG. 1, inspectors 11A to 11C are shown, but accuracy deterioration may be detected by using another inspector. DNN (Deep Neural Network) is used for the models of Inspectors 11A to 11C.
- DNN Deep Neural Network
- FIG. 2 is a diagram for explaining a mechanism for detecting accuracy deterioration of the machine learning model to be monitored.
- the inspectors 11A and 11B will be used for explanation.
- the determination boundary of the inspector 11A is defined as the determination boundary 12A
- the determination boundary of the inspector 11B is defined as the determination boundary 12B.
- the positions of the determination boundary 12A and the determination boundary 12B are different from each other, and the model application area is different.
- the input data is classified into the first class by the inspector 11A.
- the input data is classified into the second class by the inspector 11A.
- the input data is classified into the first class by the inspector 11B.
- the input data is classified into the second class by the inspector 11B.
- input data D T1 is to position the model application area 4A, classified as "first class”.
- input data D T1 is to position the model application area 4B, classified as "first class”. Since the classification result when the input data DT1 is input is the same for the inspector 11A and the inspector 11B, it is determined that there is no deterioration.
- the number of training data is reduced.
- the reference technique randomly reduces the training data for each inspector.
- the number of training data to be reduced is changed for each inspector.
- FIG. 3 is a diagram (1) showing an example of a model application area based on the reference technique.
- the distributions 20A, 20B, and 20C of the training data are shown.
- the distribution 20A is a distribution of training data used when creating the inspector 11A.
- the distribution 20B is a distribution of training data used when creating the inspector 11B.
- the distribution 20C is a distribution of training data used when creating the inspector 11C.
- the star mark is the training data with the correct answer label of the first class.
- the triangle mark is the training data whose correct label is the second class.
- the circles indicate the training data whose correct label is the third class.
- the number of training data used when creating each inspector is in the order of inspector 11A, inspector 11B, and inspector 11C in descending order of number.
- the model application area of the first class is the model application area 21A.
- the model application area of the second class is the model application area 22A.
- the model application area of the third class is the model application area 23A.
- the model application area of the first class is the model application area 21B.
- the model application area of the second class is the model application area 22B.
- the model application area of the third class is the model application area 23B.
- the model application area of the first class is the model application area 21C.
- the model application area of the second class is the model application area 22C.
- the model application area of the third class is the model application area 23C.
- FIG. 4 is a diagram (2) showing an example of a model application area according to the reference technique.
- the distribution 24A is a distribution of training data used when creating the inspector 11A.
- the distribution 24B is a distribution of training data used when creating the inspector 11B.
- the distribution 24C is a distribution of training data used when creating the inspector 11C.
- the explanation of the training data of the stars, triangles, and circles is the same as the explanation given in FIG.
- the number of training data used when creating each inspector is in the order of inspector 11A, inspector 11B, and inspector 11C in descending order of number.
- the model application area of the first class is the model application area 25A.
- the model application area of the second class is the model application area 26A.
- the model application area of the third class is the model application area 27A.
- the model application area of the first class is the model application area 25B.
- the model application area of the second class is the model application area 26B.
- the model application area of the third class is the model application area 27B.
- the model application area of the first class is the model application area 25C.
- the model application area of the second class is the model application area 26C.
- the model application area of the third class is the model application area 27C.
- each model application area is narrowed according to the number of training data, but in the example described in FIG. 4, each model is not affected by the number of training data.
- the applicable area is not narrowed.
- the reference technology cannot create multiple inspectors that narrow the model application area of the specified classification class.
- the information processing device narrows the model application area by excluding training data having a low score for each classification class from the same training data data set as the machine learning model to be monitored.
- training data set the data set of training data is referred to as "training data set”.
- the training dataset contains multiple training data.
- FIG. 5 is a diagram (1) for explaining the processing of the information processing apparatus according to the present embodiment.
- the correct answer label (classification class) of the training data is the first class or the second class will be described.
- the circles indicate the training data whose correct label is the first class.
- the triangle mark is the training data whose correct label is the second class.
- Distribution 30A shows the distribution of the training data set that creates the inspector 11A. It is assumed that the training data set for creating the inspector 11A is the same as the training data set used when training the machine learning model to be monitored.
- the determination boundary between the model application area 31A of the first class and the model application area 32A of the second class is defined as the determination boundary 33A.
- the score value for each training data becomes smaller as it is closer to the determination boundary of the training model. Therefore, by excluding the training data having a small score from the plurality of training data from the training data set, it is possible to generate an inspector in which the application area of the learning model is narrowed.
- DNN existing learning model
- each training data included in the region 34 has a high score because it is far from the decision boundary 33A.
- Each training data contained in region 35 has a low score because it is close to the decision boundary 33A.
- the information processing apparatus creates a new training data set in which each training data included in the area 35 is deleted from the training data set included in the distribution 30A.
- the information processing device creates the inspector 11B by learning the learning model with the new training data set.
- Distribution 30B shows the distribution of the training dataset that creates the inspector 11B.
- the determination boundary between the model application area 31B of the first class and the model application area 32B of the second class is defined as the determination boundary 33B.
- the new training data set since each training data of the region 35 near the decision boundary 33A is excluded, the position of the decision boundary 33B is moved, and the model application area 31B of the first class is applied to the model of the first class. It is narrower than the region 31A.
- FIG. 6 is a diagram (2) for explaining the processing of the information processing apparatus according to the present embodiment.
- the information processing apparatus according to this embodiment can create an inspector that narrows the model application range of a specific classification class.
- the information processing device can narrow the model application area of a specific class by designating a classification class from the training data and excluding the data having a low score.
- each training data is associated with a correct answer label indicating the classification class.
- the process in which the information processing apparatus creates the inspector 11B in which the model application area corresponding to the first class is narrowed will be described.
- the information processing apparatus performs learning using the first training data set excluding the training data having a low score from the training data corresponding to the correct answer label "first class".
- Distribution 30A shows the distribution of the training data set that creates the inspector 11A.
- the training data set for creating the inspector 11A shall be the same as the training data set used when training the machine learning model to be monitored.
- the determination boundary between the model application area 31A of the first class and the model application area 32A of the second class is defined as the determination boundary 33A.
- the information processing device calculates the score of the training data corresponding to the correct answer label "first class" in the training data set included in the distribution 30A, and identifies the training data whose score is less than the threshold value.
- the information processing apparatus creates a new training data set (first training data set) in which the specified training data is excluded from the training data set included in the distribution 30A.
- the information processing device creates the inspector 11B by learning the learning model using the first training data set.
- Distribution 30B shows the distribution of training data that creates the inspector 11B.
- the determination boundary between the model application area 31B of the first class and the model application area 32B of the second class is defined as the determination boundary 33B.
- the first training data set since each training data near the decision boundary 33A is excluded, the position of the decision boundary 33B moves, and the model application area 31B of the first class is moved from the model application area 31A of the first class. Is also getting narrower.
- the information processing apparatus performs learning using the second training data set excluding the training data having a low score from the training data corresponding to the correct answer label "second class".
- the information processing device calculates the score of the training data corresponding to the correct answer label "second class" in the training data set included in the distribution 30A, and identifies the training data whose score is less than the threshold value.
- the information processing apparatus creates a new training data set (second training data set) in which the specified training data is excluded from the training data set included in the distribution 30A.
- the information processing device creates the inspector 11C by learning the learning model using the second training data set.
- Distribution 30C indicates the distribution of training data that creates the inspector 11C.
- the determination boundary between the model application area 31C of the first class and the model application area 32C of the second class is defined as the determination boundary 33C.
- the second training data group since each training data near the decision boundary 33A is excluded, the position of the decision boundary 33C moves, and the model application area 32C of the second class becomes larger than the model application area 32A of the second class. Is also getting narrower.
- the information processing apparatus narrows the model application area by excluding training data having a low score for each classification class from the same training data as the machine learning model to be monitored. be able to.
- FIG. 7 is a diagram for explaining the effect of the information processing apparatus according to this embodiment.
- the reference technique and the information processing apparatus according to the present embodiment create the inspector 11A by learning the learning model using the training data set used in the learning of the machine learning model 10.
- a new training data set is created by randomly excluding the training data from the training data set used in the training of the machine learning model 10.
- the inspector 11B is created by learning the learning model using the created new training data set.
- the model application area of the first class is the model application area 25B.
- the model application area of the second class is the model application area 26B.
- the model application area of the third class is the model application area 27B.
- the model application area 25B is not narrowed.
- the model application area 26A and the model application area 26B are compared, the model application area 26B is not narrowed. Comparing the model application area 27A and the model application area 27B, the model application area 27B is not narrowed.
- the information processing apparatus creates a new training data set excluding the training data having a low score from the training data set used in the training of the machine learning model 10.
- the information processing device creates the inspector 11B by learning the learning model using the created new training data set.
- the model application area of the first class is the model application area 35B.
- the model application area of the second class is the model application area 36B.
- the model application area of the third class is the model application area 37B.
- the model application area 35B is narrower.
- the information processing apparatus by creating a new training data set excluding the training data having a low score from the training data set used in the training of the machine learning model 10.
- the model application area of the inspector can always be narrowed. As a result, it is possible to reduce the steps such as recreating the inspector required when the model application area is not narrowed.
- the information processing apparatus it is possible to create an inspector that narrows the model application range of a specific classification class.
- the created inspector it is possible to explain the cause of the detected accuracy deterioration.
- FIG. 8 is a functional block diagram showing the configuration of the information processing apparatus according to the present embodiment.
- the information processing device 100 includes a communication unit 110, an input unit 120, a display unit 130, a storage unit 140, and a control unit 150.
- the communication unit 110 is a processing unit that executes data communication with an external device (not shown) via a network.
- the communication unit 110 is an example of a communication device.
- the control unit 150 which will be described later, exchanges data with an external device via the communication unit 110.
- the input unit 120 is an input device for inputting various information to the information processing device 100.
- the input unit 120 corresponds to a keyboard, a mouse, a touch panel, and the like.
- the display unit 130 is a display device that displays information output from the control unit 150.
- the display unit 130 corresponds to a liquid crystal display, an organic EL (Electro Luminescence) display, a touch panel, and the like.
- the storage unit 140 has teacher data 141, machine learning model data 142, inspector table 143, training data table 144, operation data table 145, and output result table 146.
- the storage unit 140 corresponds to semiconductor memory elements such as RAM (Random Access Memory) and flash memory (Flash Memory), and storage devices such as HDD (Hard Disk Drive).
- the teacher data 141 has a training data set 141a and verification data 141b.
- the training data set 141a holds various information about the training data.
- FIG. 9 is a diagram showing an example of the data structure of the training data set. As shown in FIG. 9, this training data set associates record numbers with training data with correct labels.
- the record number is a number that identifies a pair of training data and a correct label.
- the training data corresponds to mail spam data, electricity demand forecast, stock price forecast, poker hand data, image data, and the like.
- the correct answer label is information that uniquely identifies any of the classification classes of the first class, the second class, and the third class.
- the verification data 141b is data for verifying the machine learning model trained by the training data set 141a.
- the verification data 141b is given a correct label. For example, when the verification data 141b is input to the machine learning model and the output result output from the machine learning model matches the correct answer label given to the verification data 141b, the training data set 141a causes the machine learning model. Means that was properly learned.
- the machine learning model data 142 is the data of the machine learning model.
- FIG. 10 is a diagram for explaining an example of a machine learning model.
- the machine learning model 50 has a neural network structure, and has an input layer 50a, a hidden layer 50b, and an output layer 50c.
- the input layer 50a, the hidden layer 50b, and the output layer 50c have a structure in which a plurality of nodes are connected by edges.
- the hidden layer 50b and the output layer 50c have a function called an activation function and a bias value, and the edge has a weight.
- the bias value and weight are referred to as "parameters”.
- the probability of each class is output from the nodes 51a, 51b, 51c of the output layer 20c through the hidden layer 20b.
- the probability of the first class is output from the node 51a.
- the probability of the second class is output from the node 51b.
- the probability of the third class is output from the node 51c.
- the probability of each class is calculated by inputting the value output from each node of the output layer 20c into the Softmax function. In this embodiment, the value before being input to the softmax function is referred to as "score".
- the training data corresponding to the correct answer label "first class” when the training data corresponding to the correct answer label "first class" is input to each node included in the input layer 50a, it is a value output from the node 51a and before being input to the softmax function. Is the score of the input training data.
- the training data corresponding to the correct answer label "second class” is input to each node included in the input layer 50a, the value output from the node 51b and before being input to the softmax function is used. It is the score of the input training data.
- the training data corresponding to the correct answer label "third class” is input to each node included in the input layer 50a, the value output from the node 51c and before being input to the softmax function is set. It is the score of the input training data.
- the machine learning model 50 has been trained based on the training data set 141a of the teacher data 141 and the verification data 141b.
- the machine learning is performed so that the output result of each node of the output layer 20c approaches the correct answer label of the input training data.
- the parameters of the model 50 are trained (trained by the error back propagation method).
- the inspector table 143 is a table that holds data of a plurality of inspectors that detect a deterioration in accuracy of the machine learning model 50.
- FIG. 11 is a diagram showing an example of the data structure of the inspector table. As shown in FIG. 11, the inspector table 143 associates the identification information with the inspector. The identification information is information that identifies the inspector. The inspector is the data of the inspector corresponding to the model identification information. The inspector data has a neural network structure, and has an input layer, a hidden layer, and an output layer in the same manner as the machine learning model 50 described with reference to FIG. In addition, different parameters are set for each inspector.
- the inspector of the identification information "M0” is referred to as “inspector M0".
- the inspector of the identification information "M1” is referred to as “inspector M1”.
- the inspector of the identification information "M2” is referred to as “inspector M2”.
- the inspector of the identification information "M3” is referred to as "inspector M3”.
- the training data table 144 has a plurality of training data sets for learning each inspector.
- FIG. 12 is a diagram showing an example of the data structure of the training data table. As shown in FIG. 12, the training data table 144 has data identification information and a training data set. The data identification information is information that identifies the training data set. The training data set is a training data set used when learning each inspector.
- the training data set of the data identification information "D1" is a training data set obtained by excluding the training data of the correct answer label "first class" having a low score from the training data set 141a.
- the training data set of the data identification information "D1" is referred to as "training data set D1".
- the training data set of the data identification information "D2" is a training data set obtained by excluding the training data of the correct answer label "second class" having a low score from the training data set 141a.
- the training data set of the data identification information "D2" is referred to as "training data set D2".
- the training data set of the data identification information "D3" is a training data set obtained by excluding the training data of the correct answer label "third class" having a low score from the training data set 141a.
- the training data set of the data identification information "D3" is referred to as "training data set D3".
- the operational data table 145 has an operational data set that is added over time.
- FIG. 13 is a diagram showing an example of the data structure of the operation data table. As shown in FIG. 13, the operational data table 145 has data identification information and operational data sets.
- the data identification information is information that identifies an operational data set.
- the operational data set contains a plurality of operational data. Operational data corresponds to email spam data, electricity demand forecasts, stock price forecasts, poker hand data, image data, and the like.
- the operational data set of the data identification information "C1" is an operational data set collected after T1 hours have elapsed from the start of operation. In the following description, the operational data set of the data identification information "C1" will be referred to as "operational data set C1".
- the operational data set of the data identification information "C2" is an operational data set collected after T2 (T2> T1) time has elapsed from the start of operation.
- the operational data set of the data identification information "C2" is referred to as "operational data set C2".
- the operational data set of the data identification information "C3" is an operational data set collected after T3 (T3> T2) time has elapsed from the start of operation.
- T3 T3> T2 time has elapsed from the start of operation.
- operational data set C3 the operational data set of the data identification information "C3” will be referred to as "operational data set C3".
- operation data identification information that uniquely identifies the operation data is given to each operation data included in the operation data sets C0 to C3.
- the operation data sets C0 to C3 are data streamed from the external device to the information processing device 100, and the information processing device 100 registers the data streamed operation data sets C0 to C3 in the operation data table 145.
- the output result table 146 is a table for registering the output results of the inspectors M0 to M3 when the operation data sets C0 to C3 are input to the inspectors M0 to M3.
- the control unit 150 includes a first learning unit 151, a calculation unit 152, a creation unit 153, a second learning unit 154, an acquisition unit 155, and a detection unit 156.
- the control unit 150 can be realized by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like.
- the control unit 150 can also be realized by hard-wired logic such as ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array).
- the first learning unit 151 is a processing unit that creates the inspector M0 by acquiring the training data set 141a and learning the parameters of the learning model based on the training data set 141a.
- the training data set 141a is a training data set used when learning the machine learning model 50. Similar to the machine learning model 50, the learning model has a neural network structure, and has an input layer, a hidden layer, and an output layer. In addition, parameters (initial values of parameters) are set in the training data.
- the first learning unit 151 When the training data of the training data set 141a is input to the input layer of the training model, the first learning unit 151 inputs the training model so that the output result of each node of the output layer approaches the correct answer label of the input training data. Update the parameters of (learning by error back propagation method). The first learning unit 151 registers the created data of the inspector M0 in the inspector table 143.
- FIG. 14 is a diagram showing an example of the classification surface of the inspector M0.
- the classification surface is shown on two axes.
- the horizontal axis of the classification surface is the axis corresponding to the first feature amount of the data, and the vertical axis is the axis corresponding to the second feature amount.
- the data may be three-dimensional or higher-dimensional data.
- the decision boundary of the inspector M0 is the decision boundary 60.
- the model application area for the first class of the inspector M0 is the model application area 60A.
- the model application area 60A includes a plurality of training data 61A corresponding to the first class.
- the model application area for the second class of the inspector M0 is the model application area 60B.
- the model application area 60B includes a plurality of training data 61B corresponding to the second class.
- the model application area for the third class of the inspector M0 is the model application area 60C.
- the model application area 60C includes a plurality of training data 61C corresponding to the second class.
- the decision boundary 60 of the inspector M0 and each model application area 60A to 60C are the same as the decision boundary of the machine learning model and each model application area.
- the calculation unit 152 is a processing unit that calculates the score of each training data included in the training data set 141a.
- the calculation unit 152 executes the inspector M0 and inputs the training data to the executed inspector M0 to calculate the score of each training data.
- the calculation unit 152 outputs the score of each training data to the creation unit 153.
- the calculation unit 152 calculates the scores of a plurality of training data corresponding to the correct answer label "first class".
- first training data the training data corresponding to the correct answer label "first class” is referred to as "first training data”.
- the calculation unit 152 inputs the first training data into the input layer of the inspector M0, and calculates the score of the first training data.
- the calculation unit 152 repeatedly executes the above processing for the plurality of first training data.
- the calculation unit 152 outputs the calculation result data (hereinafter, the first calculation result data) in which the record number of the first training data and the score are associated with each other to the creation unit 153.
- the calculation unit 152 calculates the scores of a plurality of training data corresponding to the correct answer label "second class".
- the training data corresponding to the correct answer label “second class” is referred to as “second training data”.
- the calculation unit 152 inputs the second training data into the input layer of the inspector M0, and calculates the score of the second training data.
- the calculation unit 152 repeatedly executes the above processing for the plurality of second training data.
- the calculation unit 152 outputs the calculation result data (hereinafter, the second calculation result data) in which the record number of the second training data and the score are associated with each other to the creation unit 153.
- the calculation unit 152 calculates the scores of a plurality of training data corresponding to the correct answer label "third class".
- the training data corresponding to the correct answer label “third class” is referred to as “third training data”.
- the calculation unit 152 inputs the third training data into the input layer of the inspector M0, and calculates the score of the third training data.
- the calculation unit 152 repeatedly executes the above processing for the plurality of third training data.
- the calculation unit 152 outputs the calculation result data (hereinafter, the third calculation result data) in which the record number of the third training data and the score are associated with each other to the creation unit 153.
- the creation unit 153 is a processing unit that creates a plurality of training data sets based on the scores of each training data.
- the creation unit 153 acquires the first calculation result data, the second calculation result data, and the third calculation result data from the calculation unit 152 as the score data of each training data.
- the creation unit 153 acquires the first calculation result data, among the first training data included in the first calculation result data, the first training data whose score is less than the threshold is specified as the first training data to be excluded. To do.
- the first training data whose score is less than the threshold value is the first training data near the decision boundary 60.
- the creation unit 153 creates a training data set (training data set D1) excluding the first training data to be excluded from the training data set 141a.
- the creation unit 153 registers the training data set D1 in the training data table 144.
- the creation unit 153 acquires the second calculation result data, among the second training data included in the second calculation result data, the second training data whose score is less than the threshold is specified as the second training data to be excluded. To do.
- the second training data whose score is less than the threshold value is the second training data near the decision boundary 60.
- the creation unit 153 creates a training data set (training data set D2) excluding the second training data to be excluded from the training data set 141a.
- the creation unit 153 registers the training data set D2 in the training data table 144.
- the creation unit 153 acquires the third calculation result data, among the third training data included in the third calculation result data, the third training data whose score is less than the threshold is specified as the third training data to be excluded. To do.
- the third training data whose score is less than the threshold value is the third training data near the decision boundary.
- the creation unit 153 creates a training data set (training data set D3) excluding the third training data to be excluded from the training data set 141a.
- the creation unit 153 registers the training data set D3 in the training data table 144.
- the second learning unit 154 is a processing unit that creates a plurality of inspectors M1, M2, and M3 using the training data sets D1, D2, and D3 of the training data table 144.
- the second learning unit 154 creates the inspector M1 by learning the parameters of the learning model based on the training data set D1.
- the training data set D1 is a data set in which the first training data near the decision boundary 60 is excluded.
- the second learning unit 154 prepares the training model so that the output result of each node of the output layer approaches the correct answer label of the input training data. Update the parameters of (learning by error back propagation method).
- the second learning unit 154 creates the inspector M1.
- the second learning unit 154 registers the data of the inspector M1 in the inspector table 143.
- the second learning unit 154 creates the inspector M2 by learning the parameters of the learning model based on the training data set D2.
- the training data set D2 is a data set in which the second training data near the decision boundary 60 is excluded.
- the second learning unit 154 prepares the training model so that the output result of each node of the output layer approaches the correct answer label of the input training data. Update the parameters of (learning by error back propagation method).
- the second learning unit 154 creates the inspector M2.
- the second learning unit 154 registers the data of the inspector M2 in the inspector table 143.
- FIG. 15 is a diagram comparing the classification planes of the inspectors M0 and M2.
- the classification surface of the inspector M0 is defined as the classification surface 60 M0 .
- the classification surface of the inspector M2 is defined as the classification surface 60 M2 .
- the description of the classification surface 60 M0 of the inspector M0 is the same as the description of FIG.
- the decision boundary of the inspector M2 is the decision boundary 64.
- the model application area for the first class of the inspector M2 is the model application area 64A.
- the model application area for the second class of the inspector M2 is the model application area 64B.
- the model application area 64B includes a plurality of training data 65B corresponding to the second class and having a score equal to or higher than the threshold value.
- the model application area for the third class of the inspector M2 is the model application area 64C.
- Classification surface 60 M0 inspector M0 when comparing the classification plane 60 M2 inspector M2, which corresponds to a model application region of the second class, the model application area 64B is narrower than model application area 60B. This is because the second training data near the decision boundary 60 is excluded from the training data set used when learning the inspector M2.
- the second learning unit 154 creates an inspector M3 by learning the parameters of the learning model based on the training data set D3.
- the training data set D3 is a data set in which the third training data near the decision boundary 60 is excluded.
- the second learning unit 154 prepares the training model so that the output result of each node of the output layer approaches the correct answer label of the input training data. Update the parameters of (learning by error back propagation method).
- the second learning unit 154 creates the inspector M3.
- the second learning unit 154 registers the data of the inspector M3 in the inspector table 143.
- FIG. 16 is a diagram showing a classification surface of each inspector.
- the classification surface of the inspector M0 is defined as the classification surface 60 M0 .
- the classification surface of the inspector M1 is defined as the classification surface 60 M1 .
- the classification surface of the inspector M2 is defined as the classification surface 60 M2 .
- the classification surface of the inspector M3 is defined as the classification surface 60 M3 .
- Classification surface 60 M0 inspector M0 and description on the classification plane 60 M2 Inspector M2 is the same as described in Figure 15.
- the decision boundary of the inspector M1 is the decision boundary 62.
- the model application area for the first class of the inspector M1 is the model application area 62A.
- the model application area for the second class of the inspector M1 is the model application area 62B.
- the model application area for the third class of the inspector M1 is the model application area 62C.
- the decision boundary of the inspector M3 is the decision boundary 66.
- the model application area for the first class of the inspector M3 is the model application area 66A.
- the model application area for the second class of the inspector M3 is the model application area 66B.
- the model application area for the third class of the inspector M3 is the model application area 66C.
- Classification surface 60 M0 inspector M0 when comparing the classification plane 60 M1 inspector M1, which corresponds to the model application area of the first class, the model application area 62A is narrower than model application area 60A. This is because the first training data near the decision boundary 60 (score is less than the threshold value) is excluded from the training data set used when learning the inspector M1.
- Classification surface 60 M0 inspector M0 when comparing the classification plane 60 M2 inspector M2, which corresponds to a model application region of the second class, the model application area 64B is narrower than model application area 60B. This is because the second training data near the decision boundary 60 (score is less than the threshold value) is excluded from the training data set used when learning the inspector M2.
- Classification surface 60 M0 inspector M0 when comparing the classification plane 60 M3 inspector M3, which corresponds to a model application region of the third class, model application region 66C is narrower than model application region 60C. This is because the third training data near the decision boundary 60 (score is less than the threshold value) is excluded from the training data set used when learning the inspector M3.
- FIG. 17 is a diagram showing an example of a classification surface in which the classification surfaces of all inspectors are overlapped. As shown in FIG. 17, the decision boundaries 60, 62, 65, and 66 are different, and the model application areas of the first, second, and third classes are also different.
- the acquisition unit 155 is a processing unit that inputs operational data whose feature amount changes with the passage of time to a plurality of inspectors and acquires an output result.
- the acquisition unit 155 acquires the data of the inspectors M0 to M2 from the inspector table 143 and executes the inspectors M0 to M2.
- the acquisition unit 155 inputs the operation data sets C0 to C3 stored in the operation data table 145 into the inspectors M0 to M2, acquires the output results, and registers them in the output result table 146.
- FIG. 18 is a diagram showing an example of the data structure of the output result table.
- the identification information that identifies the inspector in the output result table 146, the identification information that identifies the inspector, the data identification information that identifies the input operational data set, and the output result are associated with each other.
- the output result corresponding to the identification information "M0" and the data identification information "C0" is the output result when each operation data of the operation data set C0 is input to the inspector M0.
- FIG. 19 is a diagram showing an example of the data structure of the output result of the output result table.
- any of the output results included in the output result table 146 corresponds to one of the output results.
- the operation data identification information and the classification class are associated with the output result.
- the operational data identification information is information that uniquely identifies the operational data.
- the classification class is information that uniquely identifies the classification class in which the operational data is classified. For example, it is shown that the output result (classification class) when the operation data of the operation data identification information "OP1001" is input to the corresponding inspector is the first class.
- the detection unit 156 is a processing unit that detects data that is a factor of the output result of the machine learning model 50 based on the time change of the data based on the output result table 146.
- FIG. 20 is a diagram for explaining the processing of the detection unit.
- the inspectors M0 and M1 will be used for explanation.
- the decision boundary of the inspector M0 is set to the decision boundary 70A
- the decision boundary of the inspector M1 is set to the decision boundary 70B.
- the positions of the decision boundary 70A and the decision boundary 70B are different from each other, and the model application area is different.
- one operational data included in the operational data set is appropriately referred to as an "instance".
- the instance When the instance is located in the model application area 71A, the instance is classified into the first class by the inspector M0. When the instance is located in the model application area 72A, the instance is classified into the second class by the inspector M0.
- the instance When the instance is located in the model application area 71B, the instance is classified into the first class by the inspector M1. When the instance is located in the model application area 72B, the instance is classified into the second class by the inspector M1.
- the instance I1 T1 when the instance I1 T1 is input to the inspector M0 at the initial operation time T1, the instance I1 T1 is located in the model application area 71A and is therefore classified as the “first class”.
- the instance I2 T1 is input to the inspector M0, the instance I2 T1 is located in the model application area 71A and is therefore classified as the “first class”.
- the instance I3 T1 is input to the inspector M0, the instance I3 T1 is located in the model application area 72A and is therefore classified as a “second class”.
- the instance I1 T1 When the instance I1 T1 is input to the inspector M1 at the initial operation time T1, the instance I1 T1 is located in the model application area 71B and is therefore classified into the “first class”.
- the instance I2 T1 When the instance I2 T1 is input to the inspector M1, the instance I2 T1 is located in the model application area 71B and is therefore classified as the “first class”.
- the instance I3 T1 When the instance I3 T1 is input to the inspector M1, the instance I3 T1 is located in the model application area 72B and is therefore classified as a “second class”.
- the detection unit 156 Since the classification results classified when the instances I1 T1 , I2 T1 , and I3 T1 are input to the inspectors M0 and M1 at the initial time T1 of the operation are the same, the detection unit 156 deteriorates the accuracy of the machine learning model 50. Is not detected.
- the instances I1 T1 , I2 T1 , and I3 T1 become the instances I1 T2 , I2 T2 , and I3 T2 .
- the instance I1 T2 is input to the inspector M0
- the instance I1 T2 is located in the model application area 71A and is therefore classified as the “first class”.
- the instance I2 T1 is located in the model application area 71A and is therefore classified as the “first class”.
- the instance I3 T2 is input to the inspector M0, the instance I3 T2 is located in the model application area 72A and is therefore classified as a “second class”.
- instances I1 T2 is to position the model application area 72B, are classified as "second class”.
- the instance I2 T2 is input to the inspector M1
- the instance I2 T2 is located in the model application area 71B and is therefore classified as the “first class”.
- the instance I3 T2 is input to the inspector M1
- the instance I3 T2 is located in the model application area 72B and is therefore classified as a “second class”.
- the detection unit 156 Since the classification results classified when the instances I1 T1 are input to the inspectors M0 and M1 are different at the time T2 when the time has passed from the initial operation, the detection unit 156 detects the deterioration of the accuracy of the machine learning model 50. In addition, the detection unit 156 can detect the instance I1 T2 that has caused the deterioration of accuracy.
- the detection unit 156 refers to the output result table 146, specifies the classification class when inputting to each inspector for each instance (operation data) of each operation data set, and repeatedly executes the above process.
- FIG. 21 is a diagram showing changes in the operational data set over time.
- FIG. 21 shows the distribution when each operational data set is input to the inspector M0.
- each operation data marked with a circle is originally data belonging to the first class and is classified into the model application area 60A.
- each operation data marked with a triangle is originally data belonging to the second class and is classified into the model application area 60B.
- each operation data marked with a square is originally data belonging to the third class and is classified into the model application area 60C.
- each operation data marked with a circle is included in the model application area 60A.
- Each operation data marked with a triangle is included in the model application area 60B.
- Each operation data marked with a square is included in the model application area 60C. That is, each operational data is appropriately classified into a classification class, and accuracy deterioration is not detected.
- each operation data marked with a circle is included in the model application area 60A.
- Each operation data marked with a triangle is included in the model application area 60B.
- Each operation data marked with a square is included in the model application area 60C.
- the center of each operation data marked with a triangle moves (drifts) to the model application area 60A side, most of the operation data is properly classified into the classification class, and accuracy deterioration is not detected.
- each operation data marked with a circle is included in the model application area 60A.
- Each operation data marked with a triangle is included in the model application areas 60A and 60B.
- Each operation data marked with a square is included in the model application area 60C. Approximately half of each operational data marked with a triangle moves (drifts) to the model application area 60A across the determination boundary, and accuracy deterioration is detected.
- each operation data marked with a circle is included in the model application area 60A.
- Each operation data marked with a triangle is included in the model application area 60A.
- Each operation data marked with a square is included in the model application area 60C.
- Each operation data marked with a triangle moves (drifts) to the model application area 60A across the determination boundary, and accuracy deterioration is detected.
- the detection unit 156 executes the following processing to move each instance in the direction of which classification class the instance is caused by the deterioration of accuracy and the feature amount of the instance. Detect if there is.
- the detection unit 156 refers to the output result table 146 and specifies the classification class when the same instance is input to each inspector M0 to M3.
- the same instance is operational data to which the same operational data identification information is assigned.
- the detection unit 156 determines that the corresponding instance is not caused by the deterioration of accuracy. .. On the other hand, when all the classification classes when the same instance is input to each inspector M0 to M3 are not the same, the detection unit 156 detects the corresponding instance as an instance caused by the deterioration of accuracy.
- the detection unit 156 When the output result when the instance due to the deterioration of accuracy is input to the inspector M0 is different from the output result when the instance is input to the inspector M1, the detection unit 156 indicates that the feature amount of the instance is "the direction of the first class". Detects that it has changed to.
- the detection unit 156 When the output result when the instance due to the deterioration of accuracy is input to the inspector M0 is different from the output result when the instance is input to the inspector M2, the detection unit 156 indicates that the feature amount of the instance is "the direction of the second class". Detects that it has changed to.
- the detection unit 156 When the output result when the instance due to the deterioration of accuracy is input to the inspector M0 is different from the output result when the instance is input to the inspector M3, the detection unit 156 indicates that the feature amount of the instance is "the direction of the third class". Detects that it has changed to.
- the detection unit 156 determines whether or not the instance is caused by the deterioration of accuracy and in which classification class the feature amount of the instance is moving. Is detected.
- the detection unit 156 may generate a graph of changes in the classification class with time changes of the operational data included in each model application area of each inspector based on the output result table 146. For example, the detection unit 156 generates the information of the graphs G0 to G3 as shown in FIG. 22. The detection unit 156 may display the information of the graphs G0 to G3 on the display unit 130.
- FIG. 22 is a diagram (2) for explaining the processing of the detection unit.
- graph G0 is a graph showing changes in the number of operational data located in each class application area when each operational data set is input to the inspector M0.
- the graph G1 is a graph showing a change in the number of operational data located in each class application area when each operational data set is input to the inspector M1.
- the graph G2 is a graph showing a change in the number of operational data located in each class application area when each operational data set is input to the inspector M2.
- the graph G3 is a graph showing a change in the number of operational data located in each class application area when each operational data set is input to the inspector M3.
- the horizontal axis of the graphs G0, G1, G2, and G3 is the axis indicating the passage of time in the operational data set.
- the vertical axis of the graphs G0, G1, G2, and G3 is an axis indicating the number of operational data included in each model area data.
- Line 81 of each graph G0, G1, G2, G3 shows the transition of the number of operational data included in the model application area of the first class.
- Line 82 of each graph G0, G1, G2, G3 shows the transition of the number of operational data included in the model application area of the second class.
- Line 83 of each graph G0, G1, G2, G3 shows the transition of the number of operational data included in the model application area of the third class.
- the detection unit 156 detects a sign of deterioration in the accuracy of the machine learning model 50 by comparing the graph G0 corresponding to the inspector M0 with the graphs G1, G2, G3 corresponding to the other inspectors M1, M2, M3. be able to. In addition, the detection unit 156 can identify the cause of the deterioration in accuracy.
- the detection unit 156 Detects accuracy deterioration (a sign of accuracy deterioration) of the machine learning model 50.
- the lines 81 of the graphs G0 to G3 are increasing and the lines of the lines 82 are decreasing, and each operation data classified into the second class is in the first class. Detects that you are moving to the class application area of.
- the detection unit 156 generates a graph of accuracy deterioration information based on the above detection result.
- FIG. 23 is a diagram showing an example of a graph of accuracy deterioration information.
- the horizontal axis of the graph in FIG. 23 is an axis showing the passage of time in the operational data set.
- the detection unit 156 calculates the degree of agreement between the output results of the inspector M0 and the output results of the other inspectors M1 to M3 among the instances included in the operation data set as accuracy.
- the detection unit 156 may calculate the accuracy by using other conventional techniques.
- the detection unit 156 may display a graph of information deterioration information on the display unit 130.
- the detection unit 156 may output a request for re-learning of the machine learning model 50 to the first learning unit 151 when the accuracy becomes less than the threshold value. For example, the detection unit 156 selects the latest exercise data set from each exercise data set included in the operation data table 145. The detection unit 156 inputs each operation data of the selected operation data set to the inspector M0, specifies the output result, and sets the specified output result as the correct answer label of the exercise data. The detection unit 156 generates a new training data set by repeatedly executing the above processing for each operation data.
- the detection unit 156 outputs a new training data set to the first learning unit 151.
- the first learning unit 151 uses the new training data set to perform re-learning to update the parameters of the machine learning model 50.
- the training data of the new training data set is input to the input layer of the machine learning model 50, the first learning unit 151 so that the output result of each node of the output layer approaches the correct answer label of the input training data.
- Update the parameters of the machine learning model (training by error back propagation method).
- FIG. 24 is a flowchart (1) showing a processing procedure of the information processing apparatus according to the present embodiment.
- the first learning unit 151 of the information processing apparatus 100 acquires the training data set 141a used for learning the machine learning model to be monitored (step S101).
- the first learning unit 151 executes learning of the inspector M0 using the training data set 141a (step S102).
- the information processing device 100 sets the value of i to 1 (step S103).
- the calculation unit 152 of the information processing device 100 inputs the training data of the i-class to the inspector M0 and calculates the score related to the training data (step S104).
- the creation unit 153 of the information processing apparatus 100 creates a training data set Di excluding the training data whose score is less than the threshold value from the training data set 141a, and registers the training data set Di in the training data table 144 (step S105).
- the second learning unit 154 of the information processing device 100 executes learning of a plurality of inspectors M1 to M3 using the plurality of training data sets D1 to D3 (step S108).
- the second learning unit 154 registers the learned plurality of inspectors M1 to M3 in the inspector table 143 (step S109).
- FIG. 25 is a flowchart (2) showing a processing procedure of the information processing apparatus according to this embodiment.
- the acquisition unit 155 of the information processing apparatus 100 acquires an operation data set from the operation data table 145 (step S201).
- the acquisition unit 155 selects one instance from the operational data set (step S202).
- the acquisition unit 155 inputs the selected instance to each inspector M0 to M3, acquires the output result, and registers it in the output result table 146 (step S203).
- the detection unit 156 of the information processing apparatus 100 refers to the output result table 146 and determines whether or not each output result is different (step S204).
- step S205 If the output results are not different (steps S205, No), the detection unit 156 shifts to step S208. If the output results are different (steps S205, Yes), the detection unit 156 shifts to step S206.
- the detection unit 156 detects accuracy deterioration (step S206).
- the detection unit 156 detects the selected instance as a factor of accuracy deterioration (step S207).
- the information processing device 100 determines whether or not all the instances have been selected (step S208).
- step S208, Yes When all the instances are selected (step S208, Yes), the information processing device 100 ends the process. On the other hand, if the information processing apparatus 100 has not selected all the instances (steps S208, No), the information processing apparatus 100 proceeds to step S209.
- the acquisition unit 15 selects one unselected instance from the operation data set (step S209), and proceeds to step S203.
- the information processing device 100 executes the process described with reference to FIG. 25 for each operation data set stored in the operation data table 145.
- the information processing device 100 creates a new training data set excluding the training data having a low score from the training data set 141a used in the training of the machine learning model 50, and uses the new training data to inspectors M1 to M3. By creating, you can always narrow the model application area of the inspector. As a result, it is possible to reduce the steps such as recreating the inspector required when the model application area is not narrowed.
- the information processing apparatus 100 it is possible to create inspectors M1 to M3 in which the model application range of a specific classification class is narrowed.
- the class of training data By changing the class of training data to be reduced, it is possible to always create inspectors for different model application areas, so it is possible to create the requirement "multiple inspectors for different model application areas" required for detecting model accuracy deterioration.
- the created inspector it is possible to explain the cause of the detected accuracy deterioration.
- the information processing device 100 inputs the operation data (instance) of the operation data set into the inspectors M0 to M3, acquires the output results of the inspectors M0 to M3, respectively, and based on each output result, the machine learning model 50 Detects deterioration of accuracy. As a result, the accuracy deterioration of the machine learning model 50 can be detected, and the instance that caused the accuracy deterioration can be detected.
- the inspectors M1 to M3 are created has been described, but other inspectors may be further created to detect the deterioration of accuracy.
- the information processing device 100 When the information processing device 100 detects a deterioration in the accuracy of the machine learning model 50, it creates a new training data set in which a classification class (correct answer label) corresponding to the operation data of the operation data set is set, and the created training data set. Is used to perform retraining of the machine learning model 50. As a result, even if the feature amount of the operational data set changes with the passage of time, the machine learning model corresponding to the change can be learned and the feature amount can be changed.
- a classification class correct answer label
- FIG. 26 is a diagram showing an example of a hardware configuration of a computer that realizes the same functions as the information processing apparatus according to the present embodiment.
- the computer 200 has a CPU 201 that executes various arithmetic processes, an input device 202 that receives data input from a user, and a display 203. Further, the computer 200 has a reading device 204 for reading a program or the like from a storage medium, and an interface device 205 for exchanging data with an external device or the like via a wired or wireless network. The computer 200 has a RAM 206 for temporarily storing various information and a hard disk device 207. Then, each device 201 to 207 is connected to the bus 208.
- the hard disk device 207 has a first learning program 207a, a calculation program 207b, a creation program 207c, a second learning program 207d, an acquisition program 207e, and a detection program 207f.
- the CPU 201 reads out the first learning program 207a, the calculation program 207b, the creation program 207c, the second learning program 207d, the acquisition program 207e, and the detection program 207f and deploys them in the RAM 206.
- the first learning program 207a functions as the first learning process 206a.
- the calculation program 207b functions as the calculation process 206b.
- the creation program 207c functions as the creation process 206c.
- the second learning program 207d functions as the second learning process 206d.
- the acquisition program 207e functions as the acquisition process 206e.
- the detection program 207f functions as the detection process 206f.
- the process of the first learning process 206a corresponds to the process of the first learning unit 151.
- the processing of the calculation process 206b corresponds to the processing of the calculation unit 152.
- the process of the creation process 206c corresponds to the process of the creation unit 153.
- the process of the second learning process 206d corresponds to the process of the second learning unit 154.
- the processing of the acquisition process 206e corresponds to the processing of the acquisition unit 155.
- the processing of the detection process 206f corresponds to the processing of the detection unit 156.
- each program does not necessarily have to be stored in the hard disk device 507 from the beginning.
- each program is stored in a "portable physical medium" such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, or an IC card inserted into the computer 200.
- the computer 200 may read and execute each of the programs 207a to 207f.
- Information processing device 110 Communication unit 120 Input unit 130 Display unit 140 Storage unit 141 Teacher data 141a Training data set 141b Verification data 142 Machine learning model data 143 Inspector table 144 Training data table 145 Operation data table 146 Output result table 150 Control unit 151 1st learning unit 152 Calculation unit 153 Creation unit 154 2nd learning unit 155 Acquisition unit 156 Detection unit
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Computing Systems (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Health & Medical Sciences (AREA)
- Nonlinear Science (AREA)
- Probability & Statistics with Applications (AREA)
- Image Analysis (AREA)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021553208A JP7272455B2 (ja) | 2019-10-23 | 2019-10-23 | 検出方法、検出プログラム及び情報処理装置 |
| PCT/JP2019/041547 WO2021079436A1 (ja) | 2019-10-23 | 2019-10-23 | 検出方法、検出プログラム及び情報処理装置 |
| US17/714,823 US20220230027A1 (en) | 2019-10-23 | 2022-04-06 | Detection method, storage medium, and information processing apparatus |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2019/041547 WO2021079436A1 (ja) | 2019-10-23 | 2019-10-23 | 検出方法、検出プログラム及び情報処理装置 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/714,823 Continuation US20220230027A1 (en) | 2019-10-23 | 2022-04-06 | Detection method, storage medium, and information processing apparatus |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021079436A1 true WO2021079436A1 (ja) | 2021-04-29 |
Family
ID=75619701
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2019/041547 Ceased WO2021079436A1 (ja) | 2019-10-23 | 2019-10-23 | 検出方法、検出プログラム及び情報処理装置 |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20220230027A1 (https=) |
| JP (1) | JP7272455B2 (https=) |
| WO (1) | WO2021079436A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024047758A1 (ja) * | 2022-08-30 | 2024-03-07 | 富士通株式会社 | 訓練データ分布推定プログラム、装置、及び方法 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113269139B (zh) * | 2021-06-18 | 2023-09-26 | 中电科大数据研究院有限公司 | 一种针对复杂场景的自学习大规模警员图像分类模型 |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2019139010A (ja) * | 2018-02-08 | 2019-08-22 | 日本電信電話株式会社 | 音声認識精度劣化要因推定装置、音声認識精度劣化要因推定方法、プログラム |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9165051B2 (en) * | 2010-08-24 | 2015-10-20 | Board Of Trustees Of The University Of Illinois | Systems and methods for detecting a novel data class |
| US9031897B2 (en) * | 2012-03-23 | 2015-05-12 | Nuance Communications, Inc. | Techniques for evaluation, building and/or retraining of a classification model |
| EP3459022A4 (en) * | 2016-05-16 | 2020-02-19 | Purepredictive, Inc. | Predictive drift detection and correction |
| US20180307741A1 (en) * | 2017-04-25 | 2018-10-25 | Intel Corporation | Filtering training data for simpler rbf models |
-
2019
- 2019-10-23 WO PCT/JP2019/041547 patent/WO2021079436A1/ja not_active Ceased
- 2019-10-23 JP JP2021553208A patent/JP7272455B2/ja active Active
-
2022
- 2022-04-06 US US17/714,823 patent/US20220230027A1/en not_active Abandoned
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2019139010A (ja) * | 2018-02-08 | 2019-08-22 | 日本電信電話株式会社 | 音声認識精度劣化要因推定装置、音声認識精度劣化要因推定方法、プログラム |
Non-Patent Citations (1)
| Title |
|---|
| KIM, YOUNGIN: "An Efficient Concept Drift Detection Method for Streaming Data under Limited Labeling", IEICE TRANS. INF. & SYST, vol. E100.D, no. 10, 10 October 2017 (2017-10-10), pages 2537 - 2546, XP055819275, Retrieved from the Internet <URL:https://www.semanticscholar.org/paper/An-Efficient-Concept-Drift-Detection-Method-for-Kim-Park/f1529d882477b8310311bec06de6b9abec982327><DOI:10.1587/transinf.2017EDP7091> [retrieved on 20191217] * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024047758A1 (ja) * | 2022-08-30 | 2024-03-07 | 富士通株式会社 | 訓練データ分布推定プログラム、装置、及び方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220230027A1 (en) | 2022-07-21 |
| JPWO2021079436A1 (https=) | 2021-04-29 |
| JP7272455B2 (ja) | 2023-05-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI858141B (zh) | 產品分類系統及方法 | |
| US10223615B2 (en) | Learning based defect classification | |
| US11347972B2 (en) | Training data generation method and information processing apparatus | |
| JP7748798B2 (ja) | 教師なし異常検出及び高次元センサデータの多数決投票による原因説明のための方法及びシステム | |
| US20210232966A1 (en) | System and method for improving machine learning models based on confusion error evaluation | |
| JP7400827B2 (ja) | 検出方法、検出プログラムおよび情報処理装置 | |
| US20220188707A1 (en) | Detection method, computer-readable recording medium, and computing system | |
| TW202139131A (zh) | 用於影像分類之適應學習 | |
| US20220222581A1 (en) | Creation method, storage medium, and information processing apparatus | |
| WO2018035878A1 (zh) | 缺陷分类方法和缺陷检查系统 | |
| US20230045330A1 (en) | Multi-term query subsumption for document classification | |
| US9292650B2 (en) | Identifying layout pattern candidates | |
| US20220215294A1 (en) | Detection method, computer-readable recording medium, and computng system | |
| US12586021B2 (en) | Method and apparatus for predicting risk, electronic device, computer readable storage medium | |
| Stoyanov et al. | Predictive analytics methodology for smart qualification testing of electronic components | |
| JP2020115289A (ja) | 学習方法、学習プログラムおよび学習装置 | |
| JP2019067299A (ja) | ラベル推定装置及びラベル推定プログラム | |
| CN112633461A (zh) | 应用辅助系统和方法以及计算机可读记录介质 | |
| US20220230027A1 (en) | Detection method, storage medium, and information processing apparatus | |
| CN113269255B (zh) | 用于检测缺陷的方法、设备和计算机可读存储介质 | |
| JP7424507B2 (ja) | 検知プログラム、検知方法および検知装置 | |
| JP7448026B2 (ja) | 検知プログラム、モデル生成プログラム、検知方法、モデル生成方法、検知装置及びモデル生成装置 | |
| US20240303543A1 (en) | Model training method and model training apparatus | |
| US12613940B2 (en) | Clustering-based deviation pattern recognition | |
| KR102881348B1 (ko) | 딥러닝 모듈을 이용한 차량 추천 시스템 및 그 동작 방법 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19949962 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2021553208 Country of ref document: JP Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 19949962 Country of ref document: EP Kind code of ref document: A1 |