WO2023042874A1 - Information processing method, information processing device, and program

Info

Publication number: WO2023042874A1
Application number: PCT/JP2022/034515
Authority: WO
Inventors: 泰一坂本; 克彦清水; 弘之石原; 俊祐吉澤; トマエン; クレモンジャケ; ステフェンチェン; 亮介佐賀
Original assignee: テルモ株式会社; 株式会社ロッケン
Priority date: 2021-09-17
Filing date: 2022-09-15
Publication date: 2023-03-23

Abstract

Provided are an information processing method and the like with which additional learning can be performed appropriately. This information processing method causes a computer to execute a process for: acquiring a learning data set (69) comprising a plurality of sets of learning data (59) each having input data (511) and correct answer data (512) associated with each other, for use in training a learning model (41); performing clustering of at least a part of the learning data set (69) into a plurality of clusters (55); classifying the learning data (59) included in each of the clusters (55) into a set of training data (56), a set of validation data (57), and a set of test data (58); and training the learning model (41) by using, each as a batch, the set of training data (56), the set of validation data (57), and the set of test data (58) in the plurality of clusters (55).

Description

Information processing method, information processing device and program

The present invention relates to an information processing method, an information processing device, and a program.

An image processing device that supports image diagnosis using a learning model generated by supervised machine learning has been proposed (Patent Document 1). An information processing method has been proposed in which additional learning is performed on a generated learning model using data for additional learning (Patent Document 2).

Patent Document 1: JP 2018-175343 A
Patent Document 2: JP 2019-133626 A

When additional learning is performed, the data for additional learning is divided into training data, validation data, and test data, and the training data and validation data are used to adjust the parameters of the learning model. The test data is then used to test the adjusted learning model. However, if the data for additional learning is divided in a biased way, additional learning cannot be performed appropriately.

Similarly, when initial learning is performed, the learning data is divided into training data, validation data, and test data, and the training data and validation data are used to adjust the parameters of the learning model. The test data is then used to test the adjusted learning model. However, if the learning data is divided in a biased way, there is a risk that the accuracy of the learning model will not reach the target because of overfitting or the like.

In one aspect, an object is to provide an information processing method that enables appropriate learning.

In the information processing method, a computer executes processing of: acquiring a learning data set composed of multiple sets of learning data, in each of which input data and correct data are associated, for use in training a learning model; clustering at least a part of the learning data set into a plurality of clusters; dividing the learning data included in each cluster into training data, validation data, and test data; and training the learning model using the training data, the validation data, and the test data, each collected across the plurality of clusters.

In one aspect, it is possible to provide an information processing method that enables appropriate learning.

FIG. 1 is an explanatory diagram illustrating an outline of the additional learning method.
FIG. 2 is an explanatory diagram illustrating an outline of the method of generating a study model.
FIG. 3 is an explanatory diagram illustrating the configuration of the information processing device.
FIG. 4 is an explanatory diagram illustrating the structure of an additional learning data set.
FIG. 5 is an explanatory diagram illustrating the structure of an additional learning data set.
FIG. 6 is an explanatory diagram illustrating the method of calculating the independence evaluation index.
FIG. 7 is a flowchart illustrating the flow of processing of the program.
FIG. 8 is a flowchart illustrating the flow of processing of the problem data extraction subroutine.
FIG. 9 is a flowchart illustrating the flow of processing of the evaluation index calculation subroutine.
FIG. 10 is a flowchart illustrating the flow of processing of the index calculation subroutine.
FIG. 11 is an explanatory diagram illustrating an outline of a method of generating a learning model.
FIG. 12 is a flowchart illustrating the flow of processing of a program according to Embodiment 2.
FIG. 13 is an explanatory diagram illustrating the configuration of an information processing device according to Embodiment 3.
FIG. 14 is a functional block diagram of an information processing device according to Embodiment 4.

[Embodiment 1]
FIG. 1 is an explanatory diagram illustrating the outline of the additional learning method. An additional learning data set 64 consisting of multiple sets of additional learning data 51, in each of which input data 511 and correct data 512 are associated, is prepared. When the input data 511 included in a set of additional learning data 51 is input to the learning model 41, the learning model 41 outputs output data 53. The additional learning data 51 is an example of the learning data, and the additional learning data set 64 is an example of the learning data set.

The learning model 41 is, for example, a classification model that, when image data is input, outputs labels related to subjects appearing in the image data. The learning model 41 may be a segmentation model that, when image data is input, outputs a label corresponding to the subject of each pixel that constitutes the image data. The learning model 41 may be an object detection model that outputs the position and label of an object appearing in the image data when the image data is input.

The input of the learning model 41 is not limited to image data. For example, learning model 41 may be a model that accepts text input and outputs audio, images, or labels. The learning model 41 may be a model that accepts speech and outputs text, images, or labels.

Based on a comparison between the output data 53 and the correct data 512 associated with the input data 511, an evaluation value indicating how accurately the learning model 41 processes the additional learning data 51 is calculated. The following description uses an example in which the evaluation value is defined so that it is high when the output data 53 is close to the correct data 512. Details of the evaluation value will be described later.

Problem data, that is, additional learning data 51 for which the accuracy of the output data 53 is lower than a predetermined threshold, is extracted from the multiple sets of additional learning data 51. The extracted problem data is divided into a plurality of clusters 55 by a known clustering technique such as the k-means method or Ward's method. The extracted problem data may instead be divided automatically into a plurality of clusters 55 based on predetermined rules.
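As a minimal sketch of the clustering step, the following pure-Python k-means groups problem data by feature vector. The source does not state which features are clustered, so the feature tuples, the function name kmeans, and its parameters are assumptions made purely for illustration.

```python
import random


def kmeans(points, k, iterations=20, seed=0):
    """Plain k-means on tuples of floats; returns one cluster index per point.

    `points` stands in for feature vectors of problem data (an assumption:
    the source does not specify the features used for clustering).
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # pick k distinct points as initial centroids
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assign each point to its nearest centroid (squared Euclidean distance).
        assignment = [
            min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            for p in points
        ]
        # Move each centroid to the mean of its members; keep it if the cluster is empty.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return assignment
```

Changing seed or k in this sketch yields a different partition of the same points.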

In FIG. 1, three clusters 55 of a first cluster 551, a second cluster 552 and a third cluster 553 are schematically shown. A point inside each cluster 55 indicates additional learning data 51 determined to be problem data.

FIG. 2 is an explanatory diagram illustrating an overview of the method of generating the study model 43. FIG. 2 schematically shows the processing for the i-th cluster 55i, which is the i-th of the clusters 55. The additional learning data 51 included in the i-th cluster 55i is divided into i-th training data 56i, i-th validation data 57i, and i-th test data 58i.

Using the i-th training data 56i and the i-th validation data 57i, the i-th study model 43i is newly generated by machine learning. The above processing is executed for all the clusters 55, so that as many study models 43 as there are clusters 55 are generated.
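The per-cluster division can be sketched as follows. The 60/20/20 ratio, the shuffling, and the name split_cluster are assumptions for illustration; the source does not specify how a cluster is divided.

```python
import random


def split_cluster(records, ratios=(0.6, 0.2, 0.2), seed=0):
    """Shuffle one cluster's records and split them into train/validation/test.

    The ratios are a hypothetical choice, not taken from the source.
    Very small clusters may end up with an empty validation or test part.
    """
    if abs(sum(ratios) - 1.0) > 1e-9:
        raise ValueError("ratios must sum to 1")
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    n_train = round(n * ratios[0])
    n_val = round(n * ratios[1])
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # whatever remains becomes test data
    return train, val, test
```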

Returning to FIG. 1, the description continues. FIG. 1 schematically shows an example in which a first study model 431 is generated using the first cluster 551, a second study model 432 is generated using the second cluster 552, and a third study model 433 is generated using the third cluster 553.

For each cluster 55, a learnability evaluation index indicating whether or not the study model 43 was properly trained and an independence evaluation index indicating independence between the clusters 55 are evaluated. Details of the learnability evaluation index and the independence evaluation index will be described later.

If one or both of the learnability evaluation index and the independence evaluation index do not show good values, the process returns to the clustering step and clustering is performed again. Different clusters 55 can be formed even from the same group of problem data.

When all the learnability evaluation indices and independence evaluation indices show good values, the additional learning data 51 included in the additional learning data set 64 is distributed to the training data set 61, the validation data set 62, and the test data set 63.

The training data set 61 includes the training data 56 of each cluster 55 and a portion of the additional learning data 51 determined not to be problem data. The validation data set 62 includes the validation data 57 of each cluster 55 and a portion of the additional learning data 51 determined not to be problem data. The test data set 63 includes the test data 58 of each cluster 55 and a portion of the additional learning data 51 determined not to be problem data.
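The composition of the three data sets can be sketched as a simple concatenation. The function name assemble_datasets and the tuple layout are hypothetical, and how the non-problem data is itself divided is left to the caller because the source does not specify it.

```python
def assemble_datasets(cluster_splits, non_problem_split):
    """Build the overall training/validation/test sets.

    cluster_splits: list of (train, val, test) tuples, one per cluster.
    non_problem_split: one (train, val, test) tuple for data that was
    not judged to be problem data (its division is assumed done elsewhere).
    """
    train, val, test = [], [], []
    for c_train, c_val, c_test in list(cluster_splits) + [non_problem_split]:
        train.extend(c_train)
        val.extend(c_val)
        test.extend(c_test)
    return train, val, test
```

Because every cluster contributes to each of the three sets, no cluster is represented in only one of them.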

The training data set 61 and the validation data set 62 are used to readjust the parameters of the learning model 41. The test data set 63 is used to test the retrained learning model 41.

The processing described above prevents bias when the multiple sets of additional learning data 51 included in the additional learning data set 64 are divided into training data, validation data, and test data, so that additional learning of the learning model 41 can be performed appropriately.

FIG. 3 is an explanatory diagram illustrating the configuration of the information processing device 20. The information processing device 20 includes a control unit 21, a main storage device 22, an auxiliary storage device 23, a communication unit 24, a display unit 25, an input unit 26, and a bus. The control unit 21 is an arithmetic control device that executes the program of this embodiment. One or more CPUs (Central Processing Units), GPUs (Graphics Processing Units), TPUs (Tensor Processing Units), multi-core CPUs, or the like are used for the control unit 21. The control unit 21 is connected to each hardware unit of the information processing device 20 via the bus.

The main storage device 22 is a storage device such as SRAM (Static Random Access Memory), DRAM (Dynamic Random Access Memory), or flash memory. The main storage device 22 temporarily stores information needed during the processing performed by the control unit 21 and the program being executed by the control unit 21.

The auxiliary storage device 23 is a storage device such as SRAM, flash memory, hard disk, or magnetic tape. The auxiliary storage device 23 stores the learning model 41, the additional learning data DB 45 recording the additional learning data set 64, the program to be executed by the control unit 21, and various data necessary for executing the program. The learning model 41 and the additional learning data DB 45 may be stored in an external large-capacity storage device connected to the information processing device 20. The communication unit 24 is an interface that performs communication between the information processing device 20 and a network.

The display unit 25 is, for example, a liquid crystal display panel or an organic EL (electro-luminescence) panel. Input unit 26 is, for example, a keyboard or a mouse. The display unit 25 and the input unit 26 may be stacked to form a touch panel.

The information processing device 20 is a general-purpose personal computer, a tablet, a large computer, a virtual machine running on a large computer, or a quantum computer. The information processing apparatus 20 may be configured by hardware such as a plurality of personal computers or large computers that perform distributed processing. The information processing device 20 may be configured by a cloud computing system or a quantum computer.

FIGS. 4 and 5 are explanatory diagrams illustrating the configuration of the additional learning data set 64. The additional learning data set 64 is a data set made up of multiple sets of additional learning data 51 as described above. In FIGS. 4 and 5, the configuration of the additional learning data set 64 is shown in tabular form, with one set of additional learning data 51 per row. Input data 511 are shown in the left column, and correct data 512 are shown in the right column.

FIG. 4 shows an example of an additional learning data set 64 used for additional learning of a learning model 41 that receives image data as input and outputs a label related to the subject appearing in the image data. For example, the additional learning data 51 shown in the top row of FIG. 4 includes input data 511, which is image data used for input, and correct data 512 indicating that the label corresponding to the subject of the input data 511 is "dog".

The control unit 21 inputs the input data 511 to the learning model 41 or the study model 43 and acquires the output data 53. When the output data 53 is "dog", the correct data 512 and the output data 53 match, so the control unit 21 calculates an evaluation value of "1". On the other hand, when the output data 53 is "cat", the correct data 512 and the output data 53 do not match, so the control unit 21 calculates an evaluation value of "0".

Note that the evaluation value is not limited to the binary values "0" and "1". For example, for correct data 512 of "dog", the evaluation value may be "0.5" if a label indicating another four-legged animal, such as "cat", "horse", or "cow", is output, and "0" if any other label is output. In the process of extracting problem data, the control unit 21 extracts the additional learning data 51 whose evaluation value is less than the threshold.
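The graded evaluation value for the classification example can be sketched as below. The set FOUR_LEGGED and the function name evaluation_value are hypothetical; only the values 1, 0.5, and 0 and the example labels come from the description.

```python
# Hypothetical grouping of labels treated as "four-legged animals".
FOUR_LEGGED = {"dog", "cat", "horse", "cow"}


def evaluation_value(correct_label, output_label):
    """Return 1.0 for an exact match, 0.5 for a near miss, 0.0 otherwise."""
    if output_label == correct_label:
        return 1.0
    # Near miss: both labels are four-legged animals, but not the same one.
    if correct_label in FOUR_LEGGED and output_label in FOUR_LEGGED:
        return 0.5
    return 0.0
```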

FIG. 5 shows an example of an additional learning data set 64 used for additional learning of a learning model 41 that is a so-called segmentation model, which receives image data as input and outputs a label corresponding to the subject for each pixel constituting the image data.

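For example, the additional learning data 51 shown in the top row of FIG. 5 includes input data 511, which is image data used for input, and correct data 512 in which a label corresponding to the subject is assigned to each pixel of the input data 511.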

The control unit 21 inputs the input data 511 to the learning model 41 or the study model 43 and acquires the output data 53. The control unit 21 compares the correct data 512 and the output data 53 pixel by pixel to calculate an evaluation value. The evaluation value can be any value usable for evaluating the accuracy of a segmentation model, such as Pixel Accuracy, mIoU (Mean Intersection over Union), or the F value.
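Two of the named metrics can be sketched for label maps stored as nested lists. This is an illustrative implementation under the assumption that both maps have the same shape and that mIoU is averaged over every label appearing in either map; the function names are not from the source.

```python
def _pixel_pairs(correct, output):
    """Flatten two equally shaped 2-D label maps into (correct, output) pairs."""
    return [(c, o) for row_c, row_o in zip(correct, output) for c, o in zip(row_c, row_o)]


def pixel_accuracy(correct, output):
    """Fraction of pixels whose output label equals the correct label."""
    pairs = _pixel_pairs(correct, output)
    return sum(c == o for c, o in pairs) / len(pairs)


def mean_iou(correct, output):
    """Mean over labels of |intersection| / |union| of the label's pixel sets."""
    pairs = _pixel_pairs(correct, output)
    labels = {c for c, _ in pairs} | {o for _, o in pairs}
    ious = []
    for label in labels:
        inter = sum(c == label and o == label for c, o in pairs)
        union = sum(c == label or o == label for c, o in pairs)
        ious.append(inter / union)  # union > 0 because the label occurs somewhere
    return sum(ious) / len(ious)
```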

The control unit 21 extracts problem data whose evaluation value is lower than the threshold from the additional learning data 51. The threshold is, for example, a predetermined constant. The threshold may be determined based on the calculated evaluation values. For example, the control unit 21 may define the threshold so as to extract the additional learning data 51 with the lowest 10% of evaluation values.
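A data-dependent threshold such as "the lowest 10%" can be sketched by ranking the records. The function name and the (record, evaluation value) pair layout are assumptions; ties at the cut-off are broken by sort order in this sketch.

```python
def extract_lowest_fraction(scored, fraction=0.10):
    """Return the records whose evaluation values fall in the lowest `fraction`.

    scored: list of (record, evaluation_value) pairs (hypothetical layout).
    At least one record is returned when the input is non-empty.
    """
    if not scored:
        return []
    ranked = sorted(scored, key=lambda pair: pair[1])  # ascending evaluation value
    k = max(1, int(len(ranked) * fraction))
    return [record for record, _ in ranked[:k]]
```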

Note that the evaluation value used when extracting problem data from the multiple sets of additional learning data 51 and the evaluation value used when calculating the learnability evaluation index and the independence evaluation index may be the same evaluation value or different evaluation values.

The method of calculating the learnability evaluation index will now be described. The learnability evaluation index is defined by equation (1).

An outline of the procedure for calculating the learnability evaluation index L(Fi) for the i-th cluster 55i will be explained. As described using FIG. 2, the control unit 21 divides the additional learning data 51 included in the i-th cluster 55i into the i-th training data 56i, the i-th validation data 57i, and the i-th test data 58i. The control unit 21 generates the i-th study model 43i using the i-th training data 56i and the i-th validation data 57i.

The control unit 21 inputs each input data 511 included in the i-th test data 58i to the i-th study model 43i and acquires the output data 53. The control unit 21 calculates an evaluation value for each additional learning data 51 based on a comparison between the correct data 512 associated with the input data 511 and the output data 53.

The control unit 21 calculates a representative value of the evaluation values obtained for each additional learning data 51. The representative value is, for example, the arithmetic mean, geometric mean, harmonic mean, median, or mode of the evaluation values obtained for each additional learning data 51. The representative value calculated by the control unit 21 is the learnability evaluation index L(Fi) of the i-th cluster 55i. The control unit 21 calculates a learnability evaluation index for each cluster 55.
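Equation (1) is not reproduced in this text. Going by the procedure above, and assuming the arithmetic mean is chosen as the representative value, it could be written as follows. Here M_i is the i-th study model, F_i^{te} is the i-th test data, and e(y, M_i(x)) is the evaluation value for one set of additional learning data; the symbol e is introduced here for illustration and does not appear in the source.

```latex
L(F_i) \;=\; \frac{1}{\lvert F_i^{te} \rvert}
\sum_{(x,\,y) \in F_i^{te}} e\bigl(y,\; M_i(x)\bigr)
```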

The learnability evaluation index is an index related to the accuracy of the generated study model 43. If the learnability evaluation index is less than or equal to the threshold, the generated study model 43 is not a suitable model. If the learnability evaluation index exceeds the threshold, a study model 43 of good accuracy has been generated. A suitable threshold value is, for example, 0.6.

The method of calculating the independence evaluation index will now be described. The independence evaluation index is defined by equation (2). Description of the symbols shared with equation (1) is omitted.

Equation (2) can be transformed into equation (3) based on the definition of the learnability evaluation index shown in equation (1).

An outline of the procedure for calculating the independence evaluation index I(Fi) of the i-th cluster 55i will be explained. First, the learnability evaluation index L(Fi) is calculated according to the procedure described above. The control unit 21 inputs each input data 511 included in the j-th test data 58j to the i-th study model 43i to acquire the output data 53. The control unit 21 calculates an evaluation value for each additional learning data 51 based on a comparison between the correct data 512 associated with the input data 511 and the output data 53.

The control unit 21 calculates a representative value of the evaluation values obtained for each additional learning data 51. The representative value calculated by the control unit 21 is the evaluation index m(i, j) for the combination of the i-th study model 43i and the j-th test data 58j. The same kind of representative value is preferably used for calculating the learnability evaluation index and for calculating the independence evaluation index.

FIG. 6 is an explanatory diagram illustrating the method of calculating the independence evaluation index. The vertical direction indicates i in equations (2) and (3) above, and the horizontal direction indicates j. Each square except those at the right end indicates an evaluation index m(i, j). The squares on the diagonal, surrounded by thick frames, are the learnability evaluation indices, each evaluated by combining a study model 43 with the test data 58 of the cluster 55 used to generate that study model 43.

Returning to formula (3), the explanation continues. The control unit 21 calculates the difference between the learning possibility evaluation index L(Fi) and the evaluation index m(i, j) for each j that is not equal to i. The control unit 21 calculates an arithmetic mean value of the calculated differences. The arithmetic mean value calculated by the control unit 21 is the independence evaluation index I(Fi) of the i-th cluster 55i shown on the right end of FIG.
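Equation (3) is likewise not reproduced in this text. Based on the steps just described, a form consistent with them is sketched below, where n denotes the number of clusters 55 (a symbol assumed here for illustration).

```latex
I(F_i) \;=\; \frac{1}{n - 1} \sum_{j \neq i} \bigl( L(F_i) - m(i, j) \bigr)
```

Under this reading, and since L(F_i) equals m(i, i), a value of I(F_i) close to zero would mean that the i-th study model scores about as well on the test data of other clusters as on its own, which suggests the clusters are hard to tell apart.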

The independence evaluation index is an index of the independence between the clusters 55. If the independence evaluation index is less than or equal to the threshold, the clustering has not been performed properly. If the independence evaluation index exceeds the threshold, the clustering has been performed properly. A suitable threshold value is, for example, 0.6. Note that the threshold for the learnability evaluation index and the threshold for the independence evaluation index may be different values.
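Given a square table of evaluation indices like the one in FIG. 6, both indices and the threshold check can be sketched as follows. The matrix layout m[i][j], the function names, and the use of the arithmetic mean of differences follow the description above; the default thresholds reuse the example value 0.6, and everything else is an illustrative assumption.

```python
def learnability_indices(m):
    """Diagonal entries m[i][i]: each study model scored on its own cluster's test data."""
    return [m[i][i] for i in range(len(m))]


def independence_indices(m):
    """For each i, the mean over j != i of (m[i][i] - m[i][j])."""
    n = len(m)
    if n < 2:
        raise ValueError("at least two clusters are needed")
    result = []
    for i in range(n):
        diffs = [m[i][i] - m[i][j] for j in range(n) if j != i]
        result.append(sum(diffs) / len(diffs))
    return result


def clustering_is_acceptable(m, learn_threshold=0.6, indep_threshold=0.6):
    """True only if every index of both kinds exceeds its threshold."""
    return (all(v > learn_threshold for v in learnability_indices(m))
            and all(v > indep_threshold for v in independence_indices(m)))
```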

FIG. 7 is a flowchart illustrating the flow of program processing. Before the program of FIG. 7 is executed, an untrained model having a structure corresponding to the learning model 41 is prepared. The control unit 21 executes the program described using FIG. 7 when an additional learning data set 64 including a sufficient number of additional learning data 51 is recorded in the additional learning data DB 45.

The control unit 21 starts the problem data extraction subroutine (step S501). The problem data extraction subroutine is a subroutine for extracting problem data from the additional learning data 51 included in the additional learning data set 64. The processing flow of the problem data extraction subroutine will be described later.

The control unit 21 clusters the extracted problem data (step S502). The control unit 21 selects one cluster 55 (step S503). The control unit 21 distributes the additional learning data 51 included in the selected cluster 55 to the training data 56, the validation data 57, and the test data 58 (step S504).

The control unit 21 performs machine learning using the training data 56 and the validation data 57 to generate the study model 43 (step S505). Model generation by supervised machine learning is a conventional technique, so a detailed description is omitted. The control unit 21 records the generated study model 43 in the main storage device 22 or the auxiliary storage device 23 in association with the cluster 55 selected in step S503.

The control unit 21 determines whether or not the processing of all clusters 55 has been completed (step S506). If it is determined that the processing has not ended (NO in step S506), the control unit 21 returns to step S503. If it is determined that the processing has ended (YES in step S506), the control unit 21 starts the evaluation index calculation subroutine (step S507). The evaluation index calculation subroutine is a subroutine for calculating the learnability evaluation index and the independence evaluation index of the study model 43 generated for each cluster 55. The processing flow of the evaluation index calculation subroutine will be described later.

The control unit 21 determines whether or not the learnability evaluation index and the independence evaluation index of the study model 43 satisfy the conditions (step S508). Specifically, when all of the learnability evaluation indices and independence evaluation indices calculated for the study models 43 exceed their predetermined threshold values, the control unit 21 determines that the conditions are satisfied.

If it is determined that the conditions are satisfied (YES in step S508), the control unit 21 distributes the additional learning data 51 included in the additional learning data set 64 to the training data set 61, the validation data set 62, and the test data set 63 (step S509).

Here, the training data set 61 includes the training data 56 distributed in step S504 for each cluster 55, and part of the additional learning data 51 determined not to be problem data. The validation data set 62 includes the validation data 57 distributed in step S504 for each cluster 55 and a portion of the additional learning data 51 determined not to be problem data. The test data set 63 includes the test data 58 distributed in step S504 for each cluster 55 and a portion of the additional learning data 51 determined not to be problem data.

The control unit 21 uses the training data set 61 and the validation data set 62 to perform additional learning of the learning model 41, and uses the test data set 63 to test after the additional learning (step S510). Additional learning processing for a model generated by supervised machine learning has been conventionally performed, and therefore detailed description is omitted. The control unit 21 determines whether or not the accuracy of the learning model 41 has been improved by the additional learning (step S511).

If it is determined that there has been an improvement (YES in step S511), the control unit 21 updates the additional learning data DB 45 and the learning model 41 (step S512). Specifically, the control unit 21 deletes the additional learning data 51 used for the additional learning from the additional learning data set 64 recorded in the additional learning data DB 45. Alternatively, the control unit 21 may record, in the additional learning data set 64, a flag indicating which additional learning data 51 has been used.

The control unit 21 updates the parameters of the learning model 41 recorded in the auxiliary storage device 23. Note that the control unit 21 may record both the learning model 41 before additional learning and the learning model 41 after additional learning in the auxiliary storage device 23. After that, the control unit 21 terminates the process.

If it is determined that the learnability evaluation index and the independence evaluation index of the study model 43 do not satisfy the conditions (NO in step S508), or if it is determined that the accuracy of the learning model 41 has not been improved by the additional learning (NO in step S511), the control unit 21 determines whether or not to end the process (step S521). For example, when step S521 has been repeated a predetermined number of times, the control unit 21 determines to end the process.

If it is determined not to end the process (NO in step S521), the control unit 21 returns to step S502. If it is determined to end the process (YES in step S521), the control unit 21 ends the process. In this case, the additional learning data 51 is not deleted from the additional learning data set 64. After a sufficient number of additional learning data 51 has been added to the additional learning data set 64, the program described using FIG. 7 is executed again.
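The retry structure of steps S502 to S521 can be sketched as a bounded loop. All three callables and the parameter max_attempts are hypothetical stand-ins for the processing described above; this shows control flow only, not the actual training.

```python
def run_additional_learning(problem_data, cluster_fn, indices_ok_fn, retrain_fn,
                            max_attempts=3):
    """Control-flow sketch of steps S502 to S521.

    cluster_fn(problem_data, attempt) -> clusters            (S502)
    indices_ok_fn(clusters) -> bool, covering S503 to S508
    retrain_fn(clusters) -> bool, True if accuracy improved  (S509 to S511)
    Returns True when the learning model was updated (S512), else False.
    """
    for attempt in range(max_attempts):
        clusters = cluster_fn(problem_data, attempt)
        if not indices_ok_fn(clusters):
            continue  # NO in S508: go through S521 back to clustering
        if retrain_fn(clusters):
            return True  # YES in S511: update the model and finish
        # NO in S511: fall through and try a different clustering
    return False  # gave up at S521; the data is kept for a later run
```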

FIG. 8 is a flowchart illustrating the processing flow of the problem data extraction subroutine. The problem data extraction subroutine is a subroutine for extracting problem data from the additional learning data 51 recorded in the additional learning data set 64.

The control unit 21 acquires a set of additional learning data 51 from the additional learning data set 64 (step S531). The control unit 21 inputs the input data 511 included in the acquired additional learning data 51 to the learning model 41 and acquires the output data 53 (step S532).

The control unit 21 calculates an evaluation value based on the correct data 512 included in the additional learning data 51 acquired in step S531 and the output data 53 acquired in step S532 (step S533). The control unit 21 associates the additional learning data 51 acquired in step S531 with the calculated evaluation value, and records them in the auxiliary storage device 23 or the main storage device 22 (step S534).

The control unit 21 determines whether or not the processing of the additional learning data 51 recorded in the additional learning data set 64 has ended (step S535). If it is determined that the processing has not ended (NO in step S535), the control unit 21 returns to step S531.

If it is determined that the processing has ended (YES in step S535), the control unit 21 extracts the additional learning data 51 that is problem data, based on the evaluation values recorded in step S534 (step S536). For example, the control unit 21 extracts additional learning data 51 whose evaluation value is lower than a predetermined threshold. The control unit 21 may instead calculate the threshold from the recorded evaluation values and extract problem data based on the calculated threshold. After that, the control unit 21 ends the process.
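The subroutine of FIG. 8 can be sketched with the model and the evaluation function passed in as callables. The names, the (input, correct) pair layout, and the fixed threshold are assumptions for illustration.

```python
def extract_problem_data(dataset, model, evaluate, threshold):
    """Score every record with the current model and keep the low scorers.

    dataset: iterable of (input_data, correct_data) pairs (hypothetical layout).
    model: callable mapping input_data to output_data.
    evaluate: callable (correct_data, output_data) -> evaluation value.
    """
    scored = []
    for input_data, correct_data in dataset:              # S531, looped via S535
        output_data = model(input_data)                    # S532
        value = evaluate(correct_data, output_data)        # S533
        scored.append(((input_data, correct_data), value)) # S534
    # S536: records whose evaluation value is below the threshold
    return [record for record, value in scored if value < threshold]
```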

FIG. 9 is a flowchart illustrating the processing flow of the evaluation index calculation subroutine. The evaluation index calculation subroutine is a subroutine for calculating the learnability evaluation index and the independence evaluation index of the study model 43 generated for each cluster 55.

The control unit 21 selects the cluster 55 for which the evaluation index is to be calculated (step S541). The cluster 55 selected in step S541 corresponds to the i-th cluster Fi in equations (1) to (3). The cluster 55 selected in step S541 is an example of a used cluster, that is, a cluster used to generate the study model 43 being processed.

The control unit 21 acquires the study model 43 generated based on the cluster 55 selected in step S541 (step S542). The study model 43 selected in step S542 corresponds to the i-th study model Mi in the equations (1) to (3).

The control unit 21 acquires the test data 58 included in the cluster 55 selected in step S541 (step S543). The test data 58 acquired in step S543 corresponds to the test data Fi^te separated from the i-th cluster 55 in equations (1) to (3).

The control unit 21 starts the index calculation subroutine with the study model 43 acquired in step S542 and the test data 58 acquired in step S543 as arguments (step S544). The index calculation subroutine is a subroutine for calculating an evaluation index m(a, b) that evaluates the a-th study model 43a using the b-th test data 58b. In step S544, a=b=i, and the calculated evaluation index m(i, i) is the learnability evaluation index L(Fi). The processing flow of the index calculation subroutine will be described later.

The control unit 21 selects the cluster 55 whose test data 58 is to be used (step S545). The cluster 55 selected in step S545 corresponds to the j-th cluster Fj in equations (2) and (3). The cluster 55 selected in step S545 is an example of an unused cluster, that is, a cluster not used to generate the study model 43 being processed.

The control unit 21 acquires the test data 58 included in the cluster 55 selected in step S545 (step S546). The test data 58 acquired in step S546 corresponds to the test data Fj^te separated from the j-th cluster 55 in equations (2) and (3).

The control unit 21 starts the index calculation subroutine with the study model 43 acquired in step S542 and the test data 58 acquired in step S546 as arguments (step S547). The index calculation subroutine is the same subroutine as the one started in step S544. In step S547, a=i and b=j, and the evaluation index m(i, j) in equations (2) and (3) is calculated.

The control unit 21 determines whether or not all clusters 55 other than the cluster 55 selected in step S541 have been processed (step S548). If it is determined that the process has not ended (NO in step S548), the control unit 21 returns to step S545.

If it is determined that the processing has ended (YES in step S548), the control unit 21 calculates the independence evaluation index I(Fi) for the i-th cluster 55 based on equation (2) or (3) (step S549).

The control unit 21 determines whether or not the calculation of the learnability evaluation index and the independence evaluation index for all clusters 55 has been completed (step S550). If it is determined that the process has not ended (NO in step S550), the control unit 21 returns to step S541. If it is determined that the process has ended (YES in step S550), the control unit 21 ends the process.

FIG. 10 is a flowchart for explaining the flow of the subroutine for index calculation. The index calculation subroutine is a subroutine for calculating an evaluation index m(a, b) for evaluating the a-th study model 43a using the b-th test data 58b.

The control unit 21 acquires a set of additional learning data 51 from the b-th test data 58b designated as an argument (step S551). The control unit 21 inputs the input data 511 included in the acquired additional learning data 51 to the a-th study model 43a designated by the argument, and acquires the output data 53 (step S552).

The control unit 21 calculates an evaluation value based on the correct data 512 included in the additional learning data 51 acquired in step S551 and the output data 53 acquired in step S552 (step S553). The control unit 21 associates the additional learning data 51 acquired in step S551 with the calculated evaluation value, and records them in the auxiliary storage device 23 or the main storage device 22 (step S554).

The control unit 21 determines whether or not the processing of the additional learning data 51 included in the b-th test data 58b has ended (step S555). If it is determined that the process has not ended (NO in step S555), the control unit 21 returns to step S551.

If it is determined that the process has ended (YES in step S555), the control unit 21 calculates a representative value of the evaluation values recorded in step S554 (step S556). The representative value calculated in step S556 is the evaluation index m(a, b). After that, the control unit 21 terminates the process.
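
As a minimal sketch, the index calculation subroutine of FIG. 10 could be written as follows in Python. The function name `index_calculation`, the callable `evaluate`, and the choice of the arithmetic mean as the representative value are assumptions made for illustration; the specification leaves both the evaluation value and the representative value open at this point.

```python
from statistics import mean
from typing import Callable, Iterable, List, Tuple, TypeVar

X = TypeVar("X")
Y = TypeVar("Y")


def index_calculation(
    model: Callable[[X], Y],
    test_data: Iterable[Tuple[X, Y]],
    evaluate: Callable[[Y, Y], float],
    representative: Callable[[List[float]], float] = mean,  # assumed default
) -> Tuple[float, List[Tuple[X, float]]]:
    """Return the evaluation index m(a, b) and the per-sample records."""
    records: List[Tuple[X, float]] = []
    for input_data, correct_data in test_data:       # step S551
        output_data = model(input_data)              # step S552
        value = evaluate(correct_data, output_data)  # step S553
        records.append((input_data, value))          # step S554
    if not records:
        raise ValueError("test data must contain at least one set")
    # Step S556: the representative value of the recorded evaluation values.
    return representative([value for _, value in records]), records
```

A caller would pass the a-th study model as `model` and the b-th test data as `test_data`; a different representative value, such as the median, can be supplied through `representative`.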

According to the present embodiment, the additional learning data 51 included in the additional learning data set 64 can be distributed to the training data set 61, validation data set 62 and test data set 63 without bias. Therefore, additional learning of the learning model 41 can be appropriately performed.

According to the present embodiment, the additional learning data 51 can be distributed appropriately from the two viewpoints of the learnability evaluation index and the independence evaluation index.

According to the present embodiment, if the additional learning data 51 included in the additional learning data set 64 does not satisfy the conditions for appropriate additional learning, additional learning is not performed and the process waits until more additional learning data 51 has accumulated. Therefore, it is possible to prevent the accuracy of the learning model 41 from deteriorating due to inappropriate additional learning.

[Embodiment 2]
This embodiment relates to generation of a new learning model 41. Descriptions of parts common to the first embodiment are omitted. In the following description, as in the first embodiment, the case where the information processing device 20 generates the learning model 41 is described as an example. The generation of the learning model 41 and the additional learning of the learning model 41 described in the first embodiment may be performed by different hardware.

FIG. 11 is an explanatory diagram showing an overview of the method for generating the learning model 41. A learning data set 69 is prepared which consists of a large number of sets of learning data 59 in which input data 511 and correct data 512 are associated. The learning data set 69 is recorded in a database stored in the auxiliary storage device 23 or in an external large-capacity storage device connected to the information processing device 20.

Since the configurations of the learning data 59 and the learning data set 69 are the same as those of the additional learning data 51 and the additional learning data set 64 described with reference to FIGS. 4 and 5, detailed description thereof is omitted.

The learning data 59 is divided into a plurality of clusters 55 by a known clustering method such as the k-means method or Ward's method. FIG. 11 schematically shows three clusters 55: a first cluster 551, a second cluster 552 and a third cluster 553. Each point inside a cluster 55 indicates one set of learning data 59.

After the learning data 59 included in each cluster 55 is divided into training data 56, validation data 57, and test data 58, a study model 43 is generated by machine learning. The above processing is executed for all the clusters 55, and the same number of study models 43 as the clusters 55 are generated.

For each cluster 55, a learnability evaluation index indicating whether or not the study model 43 was properly trained and an independence evaluation index indicating independence between the clusters 55 are evaluated. The calculation method of the learnability evaluation index is the same as the calculation method described using formula (1) in the first embodiment. The calculation method of the independence evaluation index is the same as the calculation method described using formulas (2) and (3) in the first embodiment.

If one or both of the learnability evaluation index and the independence evaluation index do not show good values, the process returns to the clustering step and clustering is performed again. When all the learnability evaluation indexes and independence evaluation indexes show good values, the learning data 59 included in the learning data set 69 is distributed to the training data set 61, the validation data set 62 and the test data set 63.

The training data set 61 includes the training data 56 of each cluster 55. The validation data set 62 includes the validation data 57 of each cluster 55. The test data set 63 includes the test data 58 of each cluster 55.

Using the training data set 61 and the validation data set 62, the learning model 41 is generated by machine learning. The test data set 63 is used to test the generated learning model 41.

By the processing described above, bias is prevented when the plurality of sets of learning data 59 included in the learning data set 69 are divided into training data 56, validation data 57, and test data 58, so that the learning model 41 can be generated appropriately.
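
The per-cluster distribution followed by pooling can be sketched in Python as follows. This is an illustration under stated assumptions, not the method of the specification: the function name `split_by_cluster`, the split ratios, and the random shuffling inside each cluster are all hypothetical, and the cluster labels are assumed to have been produced beforehand by a clustering method such as k-means or Ward's method.

```python
import random
from collections import defaultdict
from typing import Dict, Hashable, List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_by_cluster(
    learning_data: Sequence[T],
    cluster_labels: Sequence[Hashable],
    ratios: Tuple[float, float, float] = (0.6, 0.2, 0.2),  # assumed ratios
    seed: int = 0,
) -> Dict[str, List[T]]:
    """Split each cluster into training/validation/test and pool the parts."""
    if len(learning_data) != len(cluster_labels):
        raise ValueError("each set of learning data needs exactly one cluster label")

    # Group the learning data by cluster.
    clusters: Dict[Hashable, List[T]] = defaultdict(list)
    for item, label in zip(learning_data, cluster_labels):
        clusters[label].append(item)

    rng = random.Random(seed)
    pooled: Dict[str, List[T]] = {"training": [], "validation": [], "test": []}
    for members in clusters.values():
        members = members[:]          # do not mutate the grouped list
        rng.shuffle(members)
        n_train = int(len(members) * ratios[0])
        n_val = int(len(members) * ratios[1])
        # Every cluster contributes to every pooled data set.
        pooled["training"] += members[:n_train]
        pooled["validation"] += members[n_train:n_train + n_val]
        pooled["test"] += members[n_train + n_val:]
    return pooled
```

Because the split is made inside each cluster before pooling, no cluster can end up entirely in one of the three data sets, which is the kind of bias the processing above is meant to avoid.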

FIG. 12 is a flowchart for explaining the processing flow of the program according to the second embodiment. Prior to executing the program of FIG. 12, an unlearned model having a structure corresponding to the learning model 41 is prepared.

The control unit 21 clusters the learning data 59 recorded in the learning data set 69 (step S561). The control unit 21 selects one cluster 55 (step S562). The control unit 21 distributes the learning data 59 included in the selected cluster 55 to the training data 56, validation data 57 and test data 58 (step S563).

The control unit 21 performs machine learning using the training data 56 and the validation data 57 to generate the study model 43 (step S564). Since model generation by supervised machine learning is a conventional technique, detailed description is omitted. The control unit 21 records the generated study model 43 in the main storage device 22 or the auxiliary storage device 23 in association with the cluster 55 selected in step S562.

The control unit 21 determines whether or not the processing of all clusters 55 has been completed (step S565). If it is determined that the process has not ended (NO in step S565), the control unit 21 returns to step S562. If it is determined that the processing has ended (YES in step S565), the control unit 21 starts the evaluation index calculation subroutine (step S566). The evaluation index calculation subroutine is a subroutine similar to the subroutine described using FIG.

The control unit 21 determines whether the learnability evaluation index and the independence evaluation index of the study models 43 satisfy the conditions (step S567). Specifically, when all of the learnability evaluation indexes and independence evaluation indexes calculated for the respective study models 43 exceed a predetermined threshold value, the control unit 21 determines that the conditions are satisfied.
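
The determination in step S567 reduces to an all-exceed test, which might be sketched as below. The function name `conditions_satisfied` is hypothetical. The default threshold of 0.6 is borrowed from the value recited in the claims; using one shared threshold for both indexes, and treating empty input as not satisfying the conditions, are assumptions.

```python
from typing import Sequence


def conditions_satisfied(
    learnability: Sequence[float],
    independence: Sequence[float],
    threshold: float = 0.6,  # assumed shared threshold
) -> bool:
    """Return True only if every index of every study model exceeds the threshold."""
    if not learnability or not independence:
        return False  # nothing was evaluated, so the conditions cannot be confirmed
    return all(v > threshold for v in learnability) and all(
        v > threshold for v in independence
    )
```

Note that the comparison is strict: an index exactly equal to the threshold does not exceed it, so the conditions are not satisfied in that case.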

If it is determined that the conditions are satisfied (YES in step S567), the control unit 21 distributes the learning data 59 included in the learning data set 69 to the training data set 61, the validation data set 62, and the test data set 63 (step S568).

Here, the training data set 61 is composed of the training data 56 distributed for each cluster 55 in step S563. The validation data set 62 is composed of the validation data 57 distributed for each cluster 55 in step S563. The test data set 63 is composed of the test data 58 distributed for each cluster 55 in step S563.

The control unit 21 uses the training data set 61 and the validation data set 62 to perform machine learning and generate the learning model 41 (step S569). The control unit 21 tests the generated learning model 41 using the test data set 63. The control unit 21 determines, based on the test result, whether or not a learning model 41 having a predetermined accuracy has been generated (step S570).

If it is determined that it has been generated (YES in step S570), the control unit 21 records the parameters of the generated learning model 41 in the auxiliary storage device 23 (step S571). After that, the control unit 21 terminates the process.

If it is determined that the learnability and independence of the study models 43 do not satisfy the conditions (NO in step S567), or if it is determined that a learning model 41 having the predetermined accuracy has not been generated (NO in step S570), the control unit 21 determines whether or not to end the process (step S572). For example, when step S572 has been repeated a predetermined number of times, the control unit 21 determines to end the process.

If it is determined not to end the process (NO in step S572), the control unit 21 returns to step S561. When determining to end the process (YES in step S572), the control unit 21 ends the process. Note that the control unit 21 may notify the user that the appropriate learning model 41 could not be generated before terminating the process.
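
The overall control flow of FIG. 12, including the retry in step S572, can be sketched as a bounded loop. The function `generate_learning_model` and its three callables are hypothetical stand-ins for the processing of the corresponding steps; passing the attempt number to the clustering step so that each retry can produce a different clustering is likewise an assumption.

```python
from typing import Callable, Optional, Tuple, TypeVar

Model = TypeVar("Model")
Split = TypeVar("Split")


def generate_learning_model(
    cluster_and_split: Callable[[int], Split],               # steps S561 to S563
    clusters_acceptable: Callable[[Split], bool],            # steps S564 to S567
    train_and_test: Callable[[Split], Tuple[Model, float]],  # steps S568 to S569
    required_accuracy: float,
    max_attempts: int = 5,                                   # assumed retry limit
) -> Optional[Model]:
    """Return a learning model with the required accuracy, or None if none was generated."""
    for attempt in range(max_attempts):        # step S572 bounds the number of retries
        split = cluster_and_split(attempt)
        if not clusters_acceptable(split):     # NO in step S567: cluster again
            continue
        model, accuracy = train_and_test(split)
        if accuracy >= required_accuracy:      # YES in step S570
            return model                       # step S571 would record its parameters
    return None                                # the caller may notify the user
```

Returning `None` rather than raising an exception leaves it to the caller to decide how to notify the user that an appropriate learning model could not be generated.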

According to the present embodiment, bias can be prevented when distributing the learning data 59 recorded in the learning data set 69 to the training data set 61, the validation data set 62, and the test data set 63, so that a suitable learning model 41 can be generated with a small number of trials.

[Embodiment 3]
FIG. 13 is an explanatory diagram illustrating the configuration of the information processing device 20 according to the third embodiment. This embodiment relates to a mode in which the information processing device 20 is realized by operating a general-purpose computer 90 in combination with a program 97. Descriptions of parts common to the first embodiment are omitted.

The computer 90 includes a reading unit 29 in addition to the aforementioned control unit 21, main storage device 22, auxiliary storage device 23, communication unit 24, display unit 25, input unit 26 and bus.

The program 97 is recorded on a portable recording medium 96. The control unit 21 reads the program 97 via the reading unit 29 and stores it in the auxiliary storage device 23. The control unit 21 may also read the program 97 stored in a semiconductor memory 98, such as a flash memory, installed in the computer 90. Furthermore, the control unit 21 may download the program 97 from another server computer (not shown) connected via the communication unit 24 and a network (not shown) and store it in the auxiliary storage device 23.

The program 97 is installed as a control program of the computer 90, loaded into the main storage device 22 and executed. As described above, the information processing apparatus 20 described in the first embodiment is realized.

[Embodiment 4]
FIG. 14 is a functional block diagram of the information processing device 20 according to the fourth embodiment. The information processing device 20 includes an acquisition unit 81, a clustering unit 82, a distribution unit 83 and a learning unit 84.

The acquisition unit 81 acquires a plurality of sets of learning data 59, in each of which input data 511 and correct data 512 are associated, for use in learning of the learning model 41. The clustering unit 82 clusters the learning data 59 into a plurality of clusters 55.

The distribution unit 83 divides the learning data 59 included in each cluster 55 into training data 56, validation data 57 and test data 58. The learning unit 84 collectively uses the training data 56, validation data 57, and test data 58 of the plurality of clusters 55 for learning of the learning model 41.

The technical features (constituent elements) described in each embodiment can be combined with each other, and new technical features can be formed by combining them.
The embodiments disclosed herein are illustrative in all respects and should not be considered restrictive. The scope of the present invention is indicated not by the description above but by the claims, and is intended to include all modifications within the meaning and scope equivalent to the claims.

20 information processing device
21 control unit
22 main storage device
23 auxiliary storage device
24 communication unit
25 display unit
26 input unit
29 reading unit
41 learning model
43 study model
431 first study model
432 second study model
433 third study model
43a a-th study model
43i i-th study model
45 additional learning data DB
51 additional learning data
511 input data
512 correct data
53 output data
55 cluster
551 first cluster
552 second cluster
553 third cluster
55i i-th cluster
56 training data
56i i-th training data
57 validation data
57i i-th validation data
58 test data
58b b-th test data
58i i-th test data
58j j-th test data
59 learning data
61 training data set
62 validation data set
63 test data set
64 additional learning data set
69 learning data set
81 acquisition unit
82 clustering unit
83 distribution unit
84 learning unit
90 computer
96 portable recording medium
97 program
98 semiconductor memory

Claims

1. An information processing method in which a computer executes a process of:
acquiring a learning data set consisting of a plurality of sets of learning data, in each of which input data and correct data are associated, for use in learning of a learning model;
clustering at least a portion of the learning data set into a plurality of clusters;
dividing the learning data included in each cluster into training data, validation data, and test data; and
collectively using the training data, validation data, and test data of the plurality of clusters for learning of the learning model.
2. The information processing method according to claim 1, wherein
a plurality of study models are generated by machine learning using each of the plurality of clusters,
for each of the generated study models,
a learnability evaluation index regarding the learnability of a used cluster is calculated using the test data separated from the used cluster, the used cluster being the cluster among the plurality of clusters that was used to generate the study model, and
an independence evaluation index regarding the independence of the used cluster from unused clusters is calculated using the test data separated from the unused clusters, the unused clusters being the clusters among the plurality of clusters that were not used to generate the study model, and
the clustering is repeated when the learnability evaluation index and the independence evaluation index do not satisfy a predetermined condition.
3. The information processing method according to claim 2, wherein the learnability evaluation index is defined by formula (1), and the independence evaluation index is defined by formula (2).
4. The information processing method according to claim 3, wherein the clustering is redone unless all of the learnability evaluation indexes and the independence evaluation indexes exceed 0.6.
5. The information processing method according to any one of claims 1 to 4, wherein
the learning data set is an additional learning data set used for additional learning of the learning model,
the input data is input to the learning model, and the accuracy of output data output from the learning model is evaluated based on the output data and the correct data associated with the input data, and
the clustering is performed on those sets of learning data in the additional learning data set for which the accuracy of the output data is equal to or less than a predetermined threshold.
6. The information processing method according to claim 5, wherein the sets of learning data for which the accuracy of the output data exceeds the predetermined threshold are grouped into one cluster.
7. An information processing method in which a computer executes a process of:
acquiring a learning data set consisting of a plurality of sets of learning data, in each of which input data and correct data are associated, for use in learning of a learning model;
clustering the plurality of sets of learning data into a plurality of clusters;
dividing the learning data included in each cluster into training data, validation data, and test data; and
creating a training data set including the training data of each of the plurality of clusters, a validation data set including the validation data of each of the plurality of clusters, and a test data set including the test data of each of the plurality of clusters.
8. An information processing device comprising:
an acquisition unit that acquires a plurality of sets of learning data, in each of which input data and correct data are associated, for use in learning of a learning model;
a clustering unit that clusters the learning data into a plurality of clusters;
a distribution unit that divides the learning data included in each cluster into training data, validation data, and test data; and
a learning unit that collectively uses the training data, validation data, and test data of the plurality of clusters for learning of the learning model.
9. A program for causing a computer to execute a process of:
acquiring a plurality of sets of learning data, in each of which input data and correct data are associated, for use in learning of a learning model;
clustering the learning data into a plurality of clusters;
dividing the learning data included in each cluster into training data, validation data, and test data; and
collectively using the training data, validation data, and test data of the plurality of clusters for learning of the learning model.