JP2021039641A

JP2021039641A - Re-learning method, and computer program

Info

Publication number: JP2021039641A
Application number: JP2019161823A
Authority: JP
Inventors: 北田　成秀; Seishu Kitada; 成秀北田
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2019-09-05
Filing date: 2019-09-05
Publication date: 2021-03-11

Abstract

To provide a technique for generating learning data in a region intended by a user when a learning model is re-learned.SOLUTION: A re-learning method includes displaying a feature quantity space and a plurality of first learning data elements arranged on the feature quantity space according to a feature quantity on a display device, extracting a specific region satisfying a region detection condition, generating a plurality of second learning data elements having the feature quantity in the specific region in association with labels, and re-learning a learning model using the plurality of generated second learning data elements and the labels in association with each of the plurality of second learning data elements.SELECTED DRAWING: Figure 4

Description

本開示は、学習モデルを再学習させる技術に関する。 The present disclosure relates to a technique for retraining a learning model.

従来、特徴量空間の中で、学習データの数が少ない低密度領域を検出して、低密度領域内の学習データを新たに生成して、生成した学習データを用いて学習モデルを再学習させる技術が知られている（特許文献１）。 Conventionally, a low-density region with a small number of training data is detected in the feature space, new training data in the low-density region is generated, and the training model is retrained using the generated training data. The technique is known (Patent Document 1).

特開２０１５−１９１４２６号公報Japanese Unexamined Patent Publication No. 2015-191426

従来の技術において、学習データの数が少ない低密度領域内の学習データを一律に生成して、学習モデルを再学習させる場合、ユーザーが意図しない領域、例えば学習モデルの精度向上に寄与する程度が低い低密度領域についても学習データを生成する必要が生じ得る。 In the conventional technique, when the learning data in the low density region where the number of learning data is small is uniformly generated and the learning model is retrained, the degree of contribution to the improvement of the accuracy of the region not intended by the user, for example, the learning model, is It may be necessary to generate training data even for low density regions.

本開示の一形態によれば、複数の第１学習データ要素と、前記複数の第１学習データ要素に関連付けられたラベルと、を用いて学習された学習モデルを再学習する再学習方法が提供される。前記学習モデルは、前記複数の第１学習データ要素と、前記複数の第１学習データ要素のそれぞれの特徴量との対応を表す学習済み生成モデルを含み、前記再学習方法は、前記特徴量を表すための特徴量空間と、前記特徴量に応じて前記特徴量空間上に配置された前記複数の第１学習データ要素とを、表示装置に表示することと、前記特徴量空間上における特定領域を検出するための領域検出条件を満たす前記特定領域を抽出することと、前記特定領域内の前記特徴量を有する複数の第２学習データ要素を、前記ラベルと関連付けて生成することと、生成した前記複数の第２学習データ要素と、前記複数の第２学習データ要素のそれぞれに関連付けられた前記ラベルとを用いて前記学習モデルを再学習することと、を備える。 According to one form of the present disclosure, there is provided a re-learning method for re-learning a learning model learned using a plurality of first learning data elements and labels associated with the plurality of first learning data elements. Will be done. The learning model includes a trained generation model that represents the correspondence between the plurality of first training data elements and the feature amounts of the plurality of first training data elements, and the retraining method uses the feature amounts. Displaying the feature quantity space for representing and the plurality of first learning data elements arranged on the feature quantity space according to the feature quantity on the display device, and a specific region on the feature quantity space. The specific region satisfying the region detection condition for detecting the above is extracted, and a plurality of second learning data elements having the feature amount in the specific region are generated in association with the label. It comprises retraining the training model using the plurality of second training data elements and the labels associated with each of the plurality of second training data elements.

第１実施形態としての機械学習システムを説明するための図。The figure for demonstrating the machine learning system as 1st Embodiment. 検査対象物を示す図。The figure which shows the inspection object. 判定システムを説明するための図。The figure for demonstrating the judgment system. 学習モデルの再学習方法を示すフローチャート。A flowchart showing how to relearn a learning model. 再学習方法を説明するための第１の図。The first figure for demonstrating the re-learning method. 再学習方法を説明するための第２の図。The second figure for demonstrating the re-learning method. 再学習方法を説明するための第３の図。The third figure for demonstrating the re-learning method. 第２実施形態を説明するための第１の図。The first figure for demonstrating the second embodiment. 第２実施形態を説明するための第２の図。The second figure for demonstrating the second embodiment. 第３実施形態を説明するための第１の図。The first figure for demonstrating a third embodiment. 第３実施形態を説明するための第２の図。The second figure for demonstrating the third embodiment. 第３実施形態を説明するための第３の図。FIG. 3 for explaining a third embodiment.

Ａ．第１実施形態：
図１は、本開示の第１実施形態としての機械学習システム１０を説明するための図である。図２は、検査対象物３９を示す図である。機械学習システム１０は、工場などで製造された検査対象物３９を検査するシステムに用いられ、コンピューター１００のプロセッサーによって実行される。図２に示すように、検査対象物３９には製造上、第１種対象物３１と、第２種対象物３２と、第３種対象物３３との３分類が製造され得る。第１種対象物３１は、欠けや汚れが無い正常な部品である。第２種対象物３２は、欠けがあるが汚れが無い部品である。第３種対象物３３は、欠けは無いが汚れがある部品である。 A. First Embodiment:
FIG. 1 is a diagram for explaining a machine learning system 10 as the first embodiment of the present disclosure. FIG. 2 is a diagram showing an inspection object 39. The machine learning system 10 is used in a system for inspecting an inspection object 39 manufactured in a factory or the like, and is executed by a processor of a computer 100. As shown in FIG. 2, the inspection object 39 can be manufactured into three categories, that is, a first-class object 31, a second-class object 32, and a third-class object 33. The first-class object 31 is a normal part having no chips or stains. The second-class object 32 is a part that is chipped but has no stains. The third-class object 33 is a part that is not chipped but is dirty.

図１に示すように、機械学習システム１０は、ディープラーニングのモデルの一つである変分オートエンコーダー(Variational Autoencoder(VAE))を用いたシステムである。機械学習システム１０は、入力層１２と、エンコーダーである特徴量算出部１４と、デコーダーである復元部１８と、出力層１９と、記憶部１０６と、を備える。機械学習システム１０では、第１学習画像３０が入力層１２に入力された場合に、第１学習画像３０が特徴量Ｚで表されることで圧縮される。また機械学習システム１０では、圧縮されたデータから入力された第１学習画像３０と同一の学習画像３０Ｌが出力層１９から出力される。第１学習画像３０は、検査対象物３９をセンサーの一例であるカメラによって検出した第１学習データ要素である。 As shown in FIG. 1, the machine learning system 10 is a system using a variational autoencoder (VAE), which is one of the models of deep learning. The machine learning system 10 includes an input layer 12, a feature amount calculation unit 14 which is an encoder, a restoration unit 18 which is a decoder, an output layer 19, and a storage unit 106. In the machine learning system 10, when the first learning image 30 is input to the input layer 12, the first learning image 30 is compressed by being represented by the feature amount Z. Further, in the machine learning system 10, the same learning image 30L as the first learning image 30 input from the compressed data is output from the output layer 19. The first learning image 30 is a first learning data element in which the inspection object 39 is detected by a camera which is an example of a sensor.

入力層１２は、圧縮対象となる第１学習画像３０の入力を受け付ける。なお、入力層１２に入力する第１学習画像３０には、検査対象物３９の種類データ要素を関連付けている。種類データ要素は、検査対象物３９が第１種対象物３１、第２種対象物３２、第３種対象物３３のいずれであるかを識別するためのデータ要素である。特徴量算出部１４は、入力層１２に入力された第１学習画像３０から複数の特徴を抽出し、抽出した複数の特徴を用いて潜在変数である特徴量Ｚを算出するニューラルネットワークである。特徴量Ｚは、学習画像３０の次元よりも小さい次元で表される。例えば、特徴量Ｚは１００次元で表される。 The input layer 12 receives the input of the first learning image 30 to be compressed. The first learning image 30 input to the input layer 12 is associated with the type data element of the inspection object 39. The type data element is a data element for identifying whether the inspection object 39 is a type 1 object 31, a type 2 object 32, or a type 3 object 33. The feature amount calculation unit 14 is a neural network that extracts a plurality of features from the first learning image 30 input to the input layer 12 and calculates a feature amount Z which is a latent variable using the extracted plurality of features. The feature amount Z is represented by a dimension smaller than the dimension of the learning image 30. For example, the feature amount Z is represented in 100 dimensions.

復元部１８は、特徴量Ｚを入力として元の第１学習画像３０としての学習画像３０Ｌを復元するニューラルネットワークである。出力層１９は、復元部１８によって復元した学習画像３０Ｌを出力する。特徴量算出部１４や復元部１８は、元の第１学習画像３０と同一の学習画像３０Ｌが出力されるように学習されている。なお、特徴量算出部１４や復元部１８は、ニューラルネットワークに限定されるものではなく、線形回帰やベイズなどの他のアルゴリズムが用いられてもよい。 The restoration unit 18 is a neural network that restores the learning image 30L as the original first learning image 30 by inputting the feature amount Z. The output layer 19 outputs the learning image 30L restored by the restoration unit 18. The feature amount calculation unit 14 and the restoration unit 18 are trained so as to output the same learning image 30L as the original first learning image 30. The feature amount calculation unit 14 and the restoration unit 18 are not limited to the neural network, and other algorithms such as linear regression and Bayes may be used.

記憶部１０６は、ＲＯＭやＲＡＭなどによって構成されている。記憶部１０６には、入力層１２に入力された第１学習画像３０および後述する画像３０Ｉや、特徴量算出部１４によって算出された特徴量Ｚが記憶されている。また、機械学習システム１０の入力層１２や特徴量算出部１４や復元部１８や出力層１９は、記憶部１０６に記憶されている。 The storage unit 106 is composed of a ROM, a RAM, and the like. The storage unit 106 stores the first learning image 30 input to the input layer 12, the image 30I described later, and the feature amount Z calculated by the feature amount calculation unit 14. Further, the input layer 12, the feature amount calculation unit 14, the restoration unit 18, and the output layer 19 of the machine learning system 10 are stored in the storage unit 106.

図３は、判定システム２０を説明するための図である。判定システム２０は、入力された検査対象物３９の画像３０Ｉを元に、予め定めた判定条件を用いて判定された判定結果Ｒｕを出力するシステムであり、コンピューター１００のプロセッサーによって実行される。画像３０Ｉは、検査対象物３９をカメラによって撮像することで取得される。判定結果Ｒｕの分類は、部品として出荷可能な状態を示す「良品」と、出荷不可能な状態を示す「不良品」との２つである。予め定めた判定条件としては、第１判定条件がある。第１判定条件は、第１種対象物３１を「良品」と判定し、第２種対象物３２と第３種対象物３３とを「不良品」と判定する条件である。 FIG. 3 is a diagram for explaining the determination system 20. The determination system 20 is a system that outputs a determination result Ru determined using predetermined determination conditions based on the input image 30I of the inspection object 39, and is executed by the processor of the computer 100. The image 30I is acquired by photographing the inspection object 39 with a camera. Judgment result Ru is classified into two categories, "non-defective product" indicating a state in which it can be shipped as a part, and "defective product" indicating a state in which it cannot be shipped. As a predetermined determination condition, there is a first determination condition. The first determination condition is a condition for determining the type 1 object 31 as a "non-defective product" and determining the type 2 object 32 and the type 3 object 33 as a "defective product".

判定システム２０は、機械学習システム１０の入力層１２、特徴量算出部１４と、を備える。また判定システム２０は、復元部１８に代えて判定部２２を備える。判定部２２は、特徴量算出部１４によって算出された特徴量Ｚを元に、判定結果Ｒｕの分類を行って外部の表示装置５０に出力するニューラルネットワークである。表示装置５０は、液晶モニターなどの表示機能を有する装置である。 The determination system 20 includes an input layer 12 of the machine learning system 10 and a feature amount calculation unit 14. Further, the determination system 20 includes a determination unit 22 instead of the restoration unit 18. The determination unit 22 is a neural network that classifies the determination result Ru based on the feature amount Z calculated by the feature amount calculation unit 14 and outputs it to the external display device 50. The display device 50 is a device having a display function such as a liquid crystal monitor.

判定システム２０の学習は、画像３０Ｉを用いた判定を行う前に実行される。判定システム２０の学習は、検査対象物３９が撮像された複数の第１学習画像３０と、複数の第１学習画像３０に対応付けられた官能評価結果と、を教師データとして用いて実行される。官能評価結果は、判定員が学習画像３０と第１判定条件とを用いて官能判定を行った結果を示す結果ラベルである。第１判定条件を用いたラベルとしての結果ラベルは、上述のごとく、「良品」であることを示す「良品ラベル」と、「不良品」であることを示す「不良品ラベル」とがある。 The learning of the determination system 20 is executed before the determination using the image 30I is performed. The learning of the determination system 20 is executed by using the plurality of first learning images 30 in which the inspection object 39 is captured and the sensory evaluation results associated with the plurality of first learning images 30 as teacher data. .. The sensory evaluation result is a result label indicating the result of the sensory determination performed by the judge using the learning image 30 and the first determination condition. As described above, the result label as a label using the first determination condition includes a "non-defective product label" indicating that it is a "non-defective product" and a "defective product label" indicating that it is a "defective product".

判定部２２は、入力された第１学習画像３０を元に、入力された第１学習画像３０に関連付けられた官能評価結果を判定結果Ｒｕとして出力するように学習する。この第１判定条件と複数の第１学習画像３０とを用いて学習した判定システム２０を学習モデルとも呼ぶ。学習モデルは、複数の第１学習画像３０と、複数の第１学習画像３０のそれぞれの特徴量Ｚとの対応を表す学習済み生成モデル２９を含む。学習済み生成モデル２９は、入力層１２と特徴量算出部１４とを含む。記憶部１０６は、学習モデルの学習に用いられた教師データを記憶する。 The determination unit 22 learns to output the sensory evaluation result associated with the input first learning image 30 as the determination result Ru based on the input first learning image 30. The determination system 20 learned by using the first determination condition and the plurality of first learning images 30 is also called a learning model. The learning model includes a trained generative model 29 representing the correspondence between the plurality of first learning images 30 and the feature quantities Z of the plurality of first learning images 30. The trained generative model 29 includes an input layer 12 and a feature amount calculation unit 14. The storage unit 106 stores the teacher data used for learning the learning model.

図４は、学習モデルの再学習方法を示すフローチャートである。図５は、再学習方法を説明するための第１の図である。図６は、再学習方法を説明するための第２の図である。図７は、再学習方法を説明するための第３の図である。図４に示す再学習方法は、複数の第１学習画像３０と、複数の第１学習画像３０に関連付けられたラベルとを用いて学習された学習モデルを再学習する再学習方法である。 FIG. 4 is a flowchart showing a re-learning method of the learning model. FIG. 5 is a first diagram for explaining a re-learning method. FIG. 6 is a second diagram for explaining the re-learning method. FIG. 7 is a third diagram for explaining the re-learning method. The re-learning method shown in FIG. 4 is a re-learning method for re-learning a learning model learned using a plurality of first learning images 30 and labels associated with the plurality of first learning images 30.

図４に示すように、ステップＳ１０において、コンピューター１００は、学習済み生成モデル２９を用いて第１学習画像３０に対応付けて学習画像３０の特徴量Ｚを出力する。具体的には、記憶部１０６に記憶された複数の第１学習画像３０と複数の第１学習画像３０に対応した特徴量Ｚとを表示装置５０に出力する。そして、ステップＳ２０において、コンピューター１００は、学習用画面６０を表示装置５０に表示する。図５に示すように、学習用画面６０は、特徴量Ｚを表すための特徴量空間６２と、特徴量Ｚに応じて特徴量空間６２上に配置された複数の第１学習画像３０と、領域検索条件の入力を受け付ける入力画面７１とを備える。すなわち、ステップＳ２０の処理は、コンピューター１００が、入力画面７１を表示装置５０に表示する処理を含む。 As shown in FIG. 4, in step S10, the computer 100 uses the trained generative model 29 to output the feature amount Z of the training image 30 in association with the first training image 30. Specifically, the plurality of first learning images 30 stored in the storage unit 106 and the feature amounts Z corresponding to the plurality of first learning images 30 are output to the display device 50. Then, in step S20, the computer 100 displays the learning screen 60 on the display device 50. As shown in FIG. 5, the learning screen 60 includes a feature amount space 62 for representing the feature amount Z, and a plurality of first learning images 30 arranged on the feature amount space 62 according to the feature amount Z. It is provided with an input screen 71 that accepts input of area search conditions. That is, the process of step S20 includes a process in which the computer 100 displays the input screen 71 on the display device 50.

一般に機械学習に用いられる第１学習画像３０の数は膨大である。よって、図５に示すように、コンピューター１００は、学習モデルの学習に用いた複数の第１学習画像３０のうちの一部を抽出して、抽出した複数の第１学習画像３０と、抽出した第１学習画像３０の特徴量Ｚとを表示装置５０に出力してもよい。これにより、ユーザーは特徴量空間６２に配置された学習画像３０を容易に確認できる。なお、コンピューター１００は、学習モデルの学習に用いた複数の学習画像３０の全てを、特徴量Ｚと共に表示装置５０に出力してもよい。またコンピューター１００は、特徴量Ｚを特徴量空間６２上に可視化して表示装置５０に表示させるために、複数次元の特徴量Ｚを２次元に圧縮する。この圧縮は、例えば、t-SNEというアルゴリズムを用いて実行される。なお、コンピューター１００は、複数次元の特徴量Ｚを、１次元や３次元以上の次元に圧縮してもよい。 The number of first learning images 30 generally used for machine learning is enormous. Therefore, as shown in FIG. 5, the computer 100 extracts a part of the plurality of first learning images 30 used for learning the learning model, and extracts the extracted plurality of first learning images 30. The feature amount Z of the first learning image 30 may be output to the display device 50. As a result, the user can easily confirm the learning image 30 arranged in the feature amount space 62. The computer 100 may output all of the plurality of learning images 30 used for learning the learning model to the display device 50 together with the feature amount Z. Further, the computer 100 compresses the multidimensional feature amount Z into two dimensions in order to visualize the feature amount Z on the feature amount space 62 and display it on the display device 50. This compression is performed, for example, using an algorithm called t-SNE. The computer 100 may compress the multidimensional feature amount Z into one dimension or three or more dimensions.

図５に示すように、特徴量空間６２は、次元ｘと次元ｙの２次元の空間である。特徴量空間６２は、特徴量Ｚに応じて複数のブロック領域Ｒｘｙに仕切られている。複数のブロック領域Ｒｘｙはそれぞれ、次元ｘと次元ｙの範囲量が同じである。本実施形態では、複数のブロック領域Ｒｘｙはそれぞれ、次元ｘの範囲量が０．１２５であり、次元ｙの範囲量が０．１２５である。複数の学習画像３０は、特徴量Ｚに応じて複数のブロック領域Ｒｘｙのいずれかに配置される。例えば、左上側に位置する学習画像３０ａの特徴量Ｚにおいて、ｘ次元が０．１００であり、ｙ次元が０．２００であった場合、この特徴量Ｚを含むブロック領域Ｒｘ１ｙ１に学習画像３０ａが配置される。なお、ユーザーの視認性を向上させるために、コンピューター１００は、各ブロック領域Ｒｘｙに３つ以下の学習画像３０が位置するように記憶部１０６の学習画像３０を選択することが好ましい。本実施形態では、コンピューター１００は、各ブロック領域Ｒｘｙに１つ以下の学習画像３０が位置するように記憶部１０６の学習画像３０を選択する。また、コンピューター１００は、第１判定条件下での判定結果Ｒｕを各学習画像３０に対応付けて表示してもよい。こうすることで、ユーザーは、特徴量空間６２に位置する各学習画像３０の判定結果Ｒｕを容易に判別できる。 As shown in FIG. 5, the feature space 62 is a two-dimensional space having dimensions x and y. The feature amount space 62 is divided into a plurality of block regions Rxy according to the feature amount Z. Each of the plurality of block areas Rxy has the same range amount of the dimension x and the dimension y. In the present embodiment, each of the plurality of block regions Rxy has a dimension x range amount of 0.125 and a dimension y range amount of 0.125. The plurality of training images 30 are arranged in any of the plurality of block regions Rxy according to the feature amount Z. For example, in the feature amount Z of the learning image 30a located on the upper left side, when the x-dimension is 0.100 and the y-dimension is 0.200, the learning image 30a is placed in the block region Rx1y1 including the feature amount Z. Be placed. In order to improve the visibility of the user, it is preferable that the computer 100 selects the learning image 30 of the storage unit 106 so that three or less learning images 30 are located in each block area Rxy. In the present embodiment, the computer 100 selects the learning image 30 of the storage unit 106 so that one or less learning images 30 are located in each block area Rxy. Further, the computer 100 may display the determination result Ru under the first determination condition in association with each learning image 30. By doing so, the user can easily determine the determination result Ru of each learning image 30 located in the feature amount space 62.

入力画面７１は、図５に示す領域検出条件の入力を受け付けるための第１条件画面７０と、図６に示す再学習の領域の詳細条件を受け付けるための第２条件画面７２と、第１条件画面７０や第２条件画面７２の入力内容を受け付けてコンピューター１００に実行させるための決定画面７５と、を備える。ステップＳ２０の処理では、図５に示す第１条件画面７０と決定画面７５とが表示装置５０に表示される。第１条件画面７０および第２条件画面７２の詳細については後述する。 The input screen 71 has a first condition screen 70 for accepting the input of the area detection condition shown in FIG. 5, a second condition screen 72 for accepting the detailed condition of the relearning area shown in FIG. 6, and a first condition. A decision screen 75 for receiving the input contents of the screen 70 and the second condition screen 72 and causing the computer 100 to execute the input contents is provided. In the process of step S20, the first condition screen 70 and the determination screen 75 shown in FIG. 5 are displayed on the display device 50. Details of the first condition screen 70 and the second condition screen 72 will be described later.

図４に示すように、コンピューター１００は、ステップＳ３０において、入力画面７１を介して領域検索条件の入力を受け付ける。詳細には、図５に示すように、コンピューター１００は、第１条件画面７０によって領域検索条件の入力を受け付ける。第１条件画面７０は、領域検索条件を入力するための、第１詳細画面７０ａ、第２詳細画面７０ｂ、および、第３詳細画面７０ｃを有する。第１詳細画面７０ａと第２詳細画面７０ｂと第３詳細画面７０ｃとは、領域検索条件を入力する条件画面を構成する。 As shown in FIG. 4, the computer 100 receives the input of the area search condition via the input screen 71 in step S30. Specifically, as shown in FIG. 5, the computer 100 accepts the input of the area search condition on the first condition screen 70. The first condition screen 70 includes a first detail screen 70a, a second detail screen 70b, and a third detail screen 70c for inputting area search conditions. The first detail screen 70a, the second detail screen 70b, and the third detail screen 70c form a condition screen for inputting area search conditions.

第１詳細画面７０ａは、特徴量空間６２における特定領域ＳＲの範囲の入力を受け付ける画面である。第１詳細画面７０ａは、特定領域ＳＲのｘ次元の範囲とｙ次元の範囲と１つのブロック領域Ｒｘｙを単位として入力する。例えば、図５に示すように、第１詳細画面７０ａの１つ目の空欄に「３」、２つ目の空欄に「３」が入力された場合、ｘ次元に並んだ３つのブロック領域Ｒｘｙとｙ次元に並んだ３つのブロック領域Ｒｘｙとで形成された格子状の領域を形成する合計９つのブロック領域Ｒｘｙが特定領域ＳＲの範囲となる。 The first detailed screen 70a is a screen that accepts the input of the range of the specific area SR in the feature amount space 62. On the first detail screen 70a, the x-dimensional range, the y-dimensional range, and one block area Rxy of the specific area SR are input as units. For example, as shown in FIG. 5, when "3" is entered in the first blank of the first detail screen 70a and "3" is entered in the second blank, the three block areas Rxy arranged in the x dimension. A total of nine block regions Rxy forming a grid-like region formed by the three block regions Rxy arranged in the y dimension are the range of the specific region SR.

第２詳細画面７０ｂは、第１詳細画面７０ａによって受け付けた特定領域ＳＲの範囲内における、第１学習画像３０に関連付けられたラベルの分類数の入力を受け付ける画面である。つまり、第２詳細画面７０ｂは、特定領域ＳＲの範囲内の特徴量を有する第１学習画像３０のラベルの分類数の条件を受け付ける。分類数の条件を満たすかどうかの対象となる第１学習画像３０は、第１学習モデルの学習に用いられた、記憶部１０６に記憶された学習画像３０である。つまり、表示装置５０に表示された特徴量空間６２上に配置された一部の学習画像３０だけが対象ではない。ラベルの分類とは、本実施形態では、「良品ラベル」と「不良品ラベル」の２分類である。本実施形態では、ユーザーは、特定領域ＳＲの条件の一つとして、第２詳細画面７０ｂの空欄に分類数を入力する。図５では、分類数が「２」と入力されているため、第１詳細画面７０ａに入力された範囲内に、「良品ラベル」が関連付けられた学習画像３０と、「不良品ラベル」が関連付けられた学習画像３０とが存在することが、領域検索条件の一つの条件となる。 The second detail screen 70b is a screen that accepts the input of the number of classifications of the label associated with the first learning image 30 within the range of the specific area SR received by the first detail screen 70a. That is, the second detail screen 70b accepts the condition of the number of classifications of the label of the first learning image 30 having the feature amount within the range of the specific area SR. The first learning image 30 that is the target of whether or not the condition of the number of classifications is satisfied is the learning image 30 stored in the storage unit 106 used for learning the first learning model. That is, not only a part of the learning images 30 arranged on the feature amount space 62 displayed on the display device 50 is the target. In the present embodiment, the label classification is two classifications, "non-defective product label" and "defective product label". In the present embodiment, the user inputs the number of classifications in the blank of the second detail screen 70b as one of the conditions of the specific area SR. In FIG. 5, since the number of classifications is input as “2”, the learning image 30 to which the “good product label” is associated is associated with the “defective product label” within the range input to the first detail screen 70a. The existence of the learned image 30 is one of the conditions for the area search condition.

第３詳細画面７０ｃは、第１詳細画面７０ａによって受け付けた特定領域ＳＲの範囲内での第１学習データ要素としての第１学習画像３０の密度情報の入力を受け付ける画面である。本実施形態では、第３詳細画面７０ｃは、密度情報の一例であるブロック領域Ｒｘｙでのデータ要素数、すなわちブロック領域Ｒｘｙでの第１学習画像３０の数の入力を受け付ける画面である。密度情報の条件を満たすかどうかの対象となる第１学習画像３０は、第１学習モデルの学習に用いられた、記憶部１０６に記憶された学習画像３０である。図５に示す例では、特定領域ＳＲの範囲内の特徴量を有する第１学習画像３０の４つ以下であることが、領域検索条件の一つの条件となる。なお、密度情報は、データ要素数に限定されるものではない。例えば、密度情報は、「濃密」、「密」、「疎」、「極疎」などのような特定領域ＳＲの範囲内における第１学習データ要素の密度を表すための複数の段階を示す要素や、第１学習画像３０の全データ要素数に対する特定領域ＳＲの範囲内における第１学習画像３０のデータ要素数の割合によって構成されていてもよい。複数の段階を示す要素で第３詳細画面７０ｃにおいて入力を受け付ける場合、第３詳細画面７０ｃには、「濃密」、「密」、「疎」、「極疎」のような密度情報を複数の段落に分けた文字画像が表示され、ユーザーは表示された文字画像から所望の一つを選択する。「濃密」、「密」、「疎」、「極疎」の密度は、例えば、第１学習画像３０の全データ要素数に対する特定領域ＳＲの範囲内における第１学習画像３０のデータ要素数の割合で予め規定されていてもよいし、特定領域ＳＲの範囲内における第１学習画像３０の数で予め規定されていてもよい。 The third detail screen 70c is a screen that accepts the input of the density information of the first learning image 30 as the first learning data element within the range of the specific area SR received by the first detail screen 70a. In the present embodiment, the third detailed screen 70c is a screen that accepts input of the number of data elements in the block area Rxy, which is an example of density information, that is, the number of the first learning images 30 in the block area Rxy. The first learning image 30 that is the target of whether or not the condition of the density information is satisfied is the learning image 30 stored in the storage unit 106 used for learning the first learning model. In the example shown in FIG. 5, one condition of the area search condition is that the number of the first learning images 30 having the feature amount within the range of the specific area SR is four or less. The density information is not limited to the number of data elements. For example, the density information is an element indicating a plurality of stages for expressing the density of the first training data element within the range of the specific region SR such as "dense", "dense", "sparse", and "extremely sparse". Alternatively, it may be composed of the ratio of the number of data elements of the first learning image 30 within the range of the specific area SR to the total number of data elements of the first learning image 30. When input is accepted on the third detail screen 70c with an element indicating a plurality of stages, the third detail screen 70c contains a plurality of density information such as "dense", "dense", "sparse", and "extremely sparse". Character images divided into paragraphs are displayed, and the user selects a desired one from the displayed character images. The density of "dense", "dense", "sparse", and "extremely sparse" is, for example, the number of data elements of the first learning image 30 within the range of the specific area SR with respect to the total number of data elements of the first learning image 30. The ratio may be predetermined, or the number of the first learning images 30 within the range of the specific area SR may be predetermined.

ユーザーによって第１条件画面７０上での入力が行われた後に、決定画面７５がカーソルなどのグラフィカルユーザーインターフェース７４を用いて押下することで、第１条件画面に入力された領域検索条件がコンピューター１００に受け付けられる。これにより、図４のステップＳ３０において、コンピューター１００は、受け付けた領域検索条件を満たす特定領域ＳＲを抽出して、表示装置５０に表示された特徴量空間６２に表示する。例えば、図６に示すように、コンピューター１００は、特定領域ＳＲとして第１特定領域ＳＲ１と第２特定領域ＳＲ２と第３特定領域ＳＲ３とを抽出した場合、抽出した第１特定領域ＳＲ１〜第３特定領域ＳＲ３のそれぞれを太字画像で取り囲むことで表示する。 After the user inputs the input on the first condition screen 70, the determination screen 75 is pressed by using the graphical user interface 74 such as a cursor, so that the area search condition input on the first condition screen is the computer 100. Will be accepted. As a result, in step S30 of FIG. 4, the computer 100 extracts the specific area SR satisfying the received area search condition and displays it in the feature amount space 62 displayed on the display device 50. For example, as shown in FIG. 6, when the computer 100 extracts the first specific area SR1, the second specific area SR2, and the third specific area SR3 as the specific area SR, the extracted first specific area SR1 to third Each of the specific areas SR3 is displayed by surrounding them with a bold image.

また、第１条件画面７０に入力された領域検索条件が受け付けられた場合、コンピューター１００は、図６に示すように第１条件画面７０に代えて第２条件画面７２を表示装置５０に表示する。図４に示すように、コンピューター１００は、ステップＳ３０の次に、ステップＳ４０において、学習モデルを再学習するために用いる特定領域ＳＲ、すなわち学習用特定領域ＬＳＲの入力を受け付ける。学習用特定領域ＬＳＲの入力は、図６に示す第２条件画面７２上で行われる。第２条件画面７２は、学習用特定領域ＬＳＲを入力するための領域入力画面７２ａと、教師データを構成するラベルを入力するためのラベル設定画面７２ｂとを備える。領域入力画面７２ａは、ステップＳ４０において抽出された第１特定領域ＳＲ１〜第３特定領域ＳＲ３のうちで、学習モデルの再学習を行うために用いる第２学習データを生成する領域の入力を受け付けるための画面である。ユーザーは、特徴量空間６２上に表示された第１特定領域ＳＲ１〜第３特定領域ＳＲ３のうちで、学習モデルの再学習に用いる領域を領域入力画面７２ａに入力する。図６では、領域入力画面７２ａには第１特定領域ＳＲ１が入力されている。なお、領域入力画面７２ａは、複数の特定領域ＳＲを入力可能である。 When the area search condition input on the first condition screen 70 is accepted, the computer 100 displays the second condition screen 72 on the display device 50 instead of the first condition screen 70 as shown in FIG. .. As shown in FIG. 4, the computer 100 receives the input of the specific area SR used for re-learning the learning model, that is, the learning specific area LSR in step S40 after step S30. The input of the learning specific area LSR is performed on the second condition screen 72 shown in FIG. The second condition screen 72 includes an area input screen 72a for inputting a specific area LSR for learning, and a label setting screen 72b for inputting labels constituting teacher data. The area input screen 72a accepts the input of the area for generating the second learning data used for re-learning the learning model among the first specific area SR1 to the third specific area SR3 extracted in step S40. It is a screen of. The user inputs the area used for re-learning the learning model from the first specific area SR1 to the third specific area SR3 displayed on the feature amount space 62 on the area input screen 72a. In FIG. 6, the first specific area SR1 is input to the area input screen 72a. The area input screen 72a can input a plurality of specific area SRs.

また、ステップＳ４０において、コンピューター１００は、ラベル設定画面７２ｂにおいて、領域入力画面７２ａで入力された特定領域ＳＲの範囲内でのラベル設定の入力を受け付ける。具体的には、ユーザーが、ラベル設定画面７２ｂにおいて設定するラベルを「不良品」に設定し、カーソルなどのグラフィカルユーザーインターフェース７４で特定領域ＳＲ内のブロック領域Ｒｘｙを押下することで、押下されたブロック領域Ｒｘｙの特徴量については不良品ラベルが関連付けられる。一方で、ユーザーが、ラベル設定画面７２ｂにおいてラベルを「良品」に設定し、グラフィカルユーザーインターフェース７４で特定領域ＳＲ内のブロック領域Ｒｘｙを押下することで、押下されたブロック領域Ｒｘｙの特徴量については良品ラベルが関連付けられる。図７に示す例では、コンピューター１００は、「良品ラベル」が関連付けられたブロック領域Ｒｘｙにはシングルハッチング画像を付し、「不良品ラベル」が関連付けられたブロック領域Ｒｘｙにはクロスハッチング画像を付している。図７に示す状態で決定画面７５が押下されると、図４に示すようにコンピューター１００はステップＳ６０において、複数の第２学習データ要素としての複数の第２学習画像をラベルと関連付けて生成する。具体的には、指定された第１特定領域ＳＲ１内の特徴量を有する第２学習画像を、復元部１８を用いて生成する。例えば、コンピューター１００は、指定された第１特定領域ＳＲ１のブロック領域Ｒｘｙごとに、異なる特徴量を有する複数の第２学習画像を生成する。生成された複数の第２学習画像には、教師データとしてのラベルが関連付けられている。なお、ステップＳ６０において、コンピューター１００は、指定された特定領域ＳＲのうちで、第１学習画像３０の数が予め定めた閾値以下のブロック領域Ｒｘｙについて、複数の第２学習画像を生成してもよい。また、特定領域ＳＲの各ブロック領域Ｒｘｙについて、生成される第２学習画像の数をユーザーが指定してもよい。 Further, in step S40, the computer 100 receives the input of the label setting within the range of the specific area SR input on the area input screen 72a on the label setting screen 72b. Specifically, the user sets the label set on the label setting screen 72b to "defective product" and presses the block area Rxy in the specific area SR with the graphical user interface 74 such as a cursor. A defective label is associated with the feature amount of the block area Rxy. On the other hand, when the user sets the label to "good product" on the label setting screen 72b and presses the block area Rxy in the specific area SR on the graphical user interface 74, the feature amount of the pressed block area Rxy is A good product label is associated. In the example shown in FIG. 7, the computer 100 attaches a single hatched image to the block area Rxy associated with the “non-defective product label” and attaches a cross-hatched image to the block area Rxy associated with the “defective product label”. doing. When the determination screen 75 is pressed in the state shown in FIG. 7, the computer 100 generates a plurality of second learning images as a plurality of second learning data elements in association with the label in step S60 as shown in FIG. .. Specifically, the restoration unit 18 is used to generate a second learning image having a feature amount in the designated first specific region SR1. For example, the computer 100 generates a plurality of second learning images having different feature quantities for each block region Rxy of the designated first specific region SR1. A label as teacher data is associated with the plurality of generated second learning images. In step S60, the computer 100 may generate a plurality of second learning images for the block area Rxy in which the number of the first learning images 30 is equal to or less than a predetermined threshold value in the designated specific area SR. Good. Further, the user may specify the number of second learning images to be generated for each block area Rxy of the specific area SR.

次に、コンピューター１００は、ステップＳ７０において、生成した複数の第２学習画像と、複数の第２学習画像のそれぞれに関連付けられたラベルとを用いて、学習モデルを再学習する。具体的には、生成した複数の第２学習画像と、複数の第２学習画像のそれぞれに関連付けられたラベルとを教師データとして判定システム２０の入力層１２に入力し、入力された第２学習画像に関連付けられたラベルを判定結果Ｒｕとして出力するように学習する。 Next, in step S70, the computer 100 relearns the learning model using the generated second learning image and the label associated with each of the plurality of second learning images. Specifically, the generated plurality of second learning images and the labels associated with each of the plurality of second learning images are input to the input layer 12 of the determination system 20 as teacher data, and the input second learning is input. Learn to output the label associated with the image as the determination result Ru.

上記第１実施形態によれば、領域検出条件の入力を受け付けて、領域検出条件を満たす特定領域ＳＲを用いて再学習を行うための第２学習画像を生成することで、特徴量空間６２におけるユーザーの意図した領域の第２学習画像を生成して学習モデルの再学習を行うことができる。また上記第１実施形態によれば、ユーザーは、入力画面７１を用いて領域検出条件を入力することで、意図した再学習のための領域をより精度良く指定できる。特に上記第１実施形態によれば、ユーザーは、表示装置５０に表示された第１学習画像３０が配置された特徴量空間６２を参照しながら、第１条件画面７０の第１詳細画面７０ａ〜第３詳細画面７０ｃを用いて、意図した再学習のための領域をより一層精度良く指定できる。 According to the first embodiment, by accepting the input of the region detection condition and generating the second learning image for performing re-learning using the specific region SR satisfying the region detection condition, the feature quantity space 62 It is possible to relearn the learning model by generating a second learning image of the area intended by the user. Further, according to the first embodiment, the user can more accurately specify the intended re-learning area by inputting the area detection condition using the input screen 71. In particular, according to the first embodiment, the user refers to the feature amount space 62 in which the first learning image 30 displayed on the display device 50 is arranged, and refers to the first detailed screen 70a to the first condition screen 70. By using the third detail screen 70c, the area for the intended re-learning can be specified more accurately.

Ｂ．第２実施形態：
図８は、第２実施形態を説明するための第１の図である。図９は、第２実施形態を説明するための第２の図である。上記第１実施形態との違いは、領域検出条件の一つとして第１学習画像３０の種類情報が加えられた点である。その他の構成については第１実施形態と同様の構成であるため、同様の構成については同一符号を付すと共に説明を省略する。 B. Second embodiment:
FIG. 8 is a first diagram for explaining a second embodiment. FIG. 9 is a second diagram for explaining the second embodiment. The difference from the first embodiment is that the type information of the first learning image 30 is added as one of the region detection conditions. Since the other configurations are the same as those in the first embodiment, the same reference numerals are given and the description thereof will be omitted.

第２実施形態では、図８に示すように、ステップＳ３０において、入力画面７１Ａを介して領域検索条件の入力が受け付けられる。入力画面７１Ａは、第１詳細画面７０ａ、第２詳細画面７０ｂ、第３詳細画面７０ｃに加え、第４詳細画面７０ｄを有する。第４詳細画面７０ｄは、ラベルを関連付ける因子となる第１学習画像３０の種類に関する情報の入力を受け付ける。第１学習画像３０の種類に関する情報とは、第１種対象物３１〜第３種対象物３３のいずれを撮像した画像であるかを示す情報である。本実施形態では、第１学習画像３０の種類に関する情報は、第１種対象物３１の第１学習画像３０を示す「正常」と、第２種対象物３２の第１学習画像３０を示す「欠け」と、第３種対象物３３の第１学習画像３０示す「汚れ」との３つの種類情報を含む。ユーザーは、第４詳細画面７０ｄに特定領域ＳＲに含まれる第１学習画像３０の種類を、領域検出条件の一つとして入力する。例えば、第４詳細画面７０ｄに、「正常」と「欠け」が入力された場合には、コンピューター１００は、「正常」である第１学習画像３０と、「欠け」である第１学習画像３０との両方が存在する特定領域ＳＲを抽出する。本実施形態では、第１判定条件下では、「正常」や「汚れ」の第１学習画像３０には「良品ラベル」が関連付けられ、「欠け」の第１学習画像３０には「不良品ラベル」が関連付けられる。よって、特徴量空間６２において、異なるラベルが関連付けられる領域ほど、判定システム２０の学習に用いられるデータ要素数を多くすることで、判定システム２０の判定精度を向上できる。そこで、ユーザーは、異なるラベルが関連付けられる領域を領域検出条件の一つとして入力することで、ステップＳ４０において、判定システム２０の判定精度により寄与する領域が特定領域ＳＲとして抽出される。図８に示す第１条件画面７０Ａに入力された領域検出条件が実行された場合、コンピューター１００は、例えば図９に示すように第１特定領域ＳＲ１のみを抽出して表示装置５０に表示する。 In the second embodiment, as shown in FIG. 8, in step S30, the input of the area search condition is accepted via the input screen 71A. The input screen 71A has a fourth detail screen 70d in addition to the first detail screen 70a, the second detail screen 70b, and the third detail screen 70c. The fourth detail screen 70d accepts input of information regarding the type of the first learning image 30, which is a factor for associating the label. The information regarding the type of the first learning image 30 is information indicating which of the first-class object 31 to the third-class object 33 is an image. In the present embodiment, the information regarding the type of the first learning image 30 is "normal" indicating the first learning image 30 of the first type object 31 and "normal" indicating the first learning image 30 of the second type object 32. It includes three types of information: "chip" and "dirt" shown in the first learning image 30 of the third type object 33. The user inputs the type of the first learning image 30 included in the specific area SR on the fourth detail screen 70d as one of the area detection conditions. For example, when "normal" and "missing" are input to the fourth detail screen 70d, the computer 100 has a "normal" first learning image 30 and a "missing" first learning image 30. Extract a specific region SR in which both and are present. In the present embodiment, under the first determination condition, the first learning image 30 of "normal" or "dirt" is associated with a "good product label", and the first learning image 30 of "missing" is associated with a "defective product label". Is associated. Therefore, in the feature quantity space 62, the determination accuracy of the determination system 20 can be improved by increasing the number of data elements used for learning of the determination system 20 as the area where different labels are associated. Therefore, the user inputs an area to which different labels are associated as one of the area detection conditions, and in step S40, the area contributing to the determination accuracy of the determination system 20 is extracted as the specific area SR. When the area detection condition input to the first condition screen 70A shown in FIG. 8 is executed, the computer 100 extracts only the first specific area SR1 and displays it on the display device 50, for example, as shown in FIG.

上記第２実施形態によれば、また上記第２実施形態によれば、第４詳細画面７０ｄを用いて、ユーザーは意図した再学習のための領域をより一層精度良く指定できる。例えば、ユーザーは、判定システム２０の判定結果Ｒｕに影響のある第１学習画像３０の種類データ要素、例えば、判定結果が分かれる２つの種類データ要素が混在することを領域検出条件の一つとして設定することで、再学習させた学習モデルの判定精度を向上できる。 According to the second embodiment and according to the second embodiment, the user can specify the intended re-learning area with higher accuracy by using the fourth detail screen 70d. For example, the user sets as one of the area detection conditions that the type data elements of the first learning image 30 that affect the determination result Ru of the determination system 20, for example, two types of data elements whose determination results are divided are mixed. By doing so, the determination accuracy of the retrained learning model can be improved.

Ｃ．第３実施形態：
図１０は、第３実施形態を説明するための第１の図である。図１１は、第３実施形態を説明するための第２の図である。図１２は、第３実施形態を説明するための第３の図である。第１実施形態との違いは、領域検出条件の一つとして一致率［％］の閾値が加えられた点である。その他の構成については第１実施形態と同様の構成であるため、同様の構成については同一符号を付すと共に説明を省略する。 C. Third Embodiment:
FIG. 10 is a first diagram for explaining a third embodiment. FIG. 11 is a second diagram for explaining the third embodiment. FIG. 12 is a third diagram for explaining the third embodiment. The difference from the first embodiment is that a threshold value of the match rate [%] is added as one of the region detection conditions. Since the other configurations are the same as those in the first embodiment, the same reference numerals are given and the description thereof will be omitted.

第３実施形態では、図１０に示すように、ステップＳ３０において、入力画面７１Ｂを介して領域検索条件の入力が受け付けられる。第１条件画面７０Ｂは、第１詳細画面７０ａ、第２詳細画面７０ｂ、第３詳細画面７０ｃに加え、第５詳細画面７０ｅを有する。第５詳細画面７０ｅは、第１詳細画面７０ａに入力された範囲内において、学習モデルの学習時に用いられたラベルの内容と、学習モデルを用いた判定結果Ｒｕとの一致率に関する情報の入力を受け付ける。つまり、ユーザーは、領域検出条件の一つとして、第５詳細画面７０ｅに０〜１００までの数値を入力する。図１０に示す例では、一致率が９０％以下であることが領域検出条件の一つとして入力されている。 In the third embodiment, as shown in FIG. 10, in step S30, the input of the area search condition is accepted via the input screen 71B. The first condition screen 70B has a fifth detail screen 70e in addition to the first detail screen 70a, the second detail screen 70b, and the third detail screen 70c. The fifth detail screen 70e inputs information on the matching rate between the content of the label used when learning the learning model and the determination result Ru using the learning model within the range input on the first detail screen 70a. Accept. That is, the user inputs a numerical value from 0 to 100 on the fifth detail screen 70e as one of the area detection conditions. In the example shown in FIG. 10, that the matching rate is 90% or less is input as one of the region detection conditions.

ステップＳ４０において、コンピューター１００は、入力画面７１Ｂによって入力された領域検索条件を満たす特定領域ＳＲを抽出して表示装置５０に表示する。コンピューター１００は、領域検出条件の一つである一致率を満たすか否かを以下のように判定する。まず、コンピューター１００は、第１詳細画面７０ａ〜第３詳細画面７０ｃに入力された条件を満たす領域である第１候補特定領域ＣＳＲ１と第２候補特定領域ＣＳＲ２と第３候補特定領域ＣＳＲ３とを抽出する。そしてコンピューター１００は、抽出した候補特定領域ＣＳＲ１〜ＣＳＲ３のそれぞれについて、第５詳細画面７０ｅに入力された一致率の条件を満たすか否かを判定し、一致率の条件を満たす領域を特定領域ＳＲとして抽出する。例えば、図１１に示すように、第１候補特定領域ＣＳＲ１内の特徴量を有する記憶部１０６に記憶された複数の第１学習画像３０について、ラベルの内容と学習モデルによる判定結果Ｒｕとの一致率を算出する。第１候補特定領域ＣＳＲ１における一致率は９５．２％であるため、一致率の条件を満たさない。よって、コンピューター１００は、第１候補特定領域ＣＳＲ１を特定領域ＳＲとして抽出しない。一方で、例えば、第２候補特定領域ＣＳＲ２や第３候補特定領域ＣＳＲ３のそれぞれの一致率が９０％以下である場合には、コンピューター１００は、図１２に示すように第２候補特定領域ＣＳＲ２や第３候補特定領域ＣＳＲ３を、特定領域ＳＲ２，ＳＲ３として抽出して表示装置５０に表示する。 In step S40, the computer 100 extracts the specific area SR satisfying the area search condition input by the input screen 71B and displays it on the display device 50. The computer 100 determines whether or not the match rate, which is one of the area detection conditions, is satisfied as follows. First, the computer 100 extracts the first candidate specific area CSR1, the second candidate specific area CSR2, and the third candidate specific area CSR3, which are areas satisfying the conditions input to the first detailed screen 70a to the third detailed screen 70c. To do. Then, the computer 100 determines whether or not the match rate condition satisfied on the fifth detail screen 70e is satisfied for each of the extracted candidate specific areas CSR1 to CSR3, and sets the area satisfying the match rate condition as the specific area SR. Extract as. For example, as shown in FIG. 11, for a plurality of first learning images 30 stored in the storage unit 106 having a feature amount in the first candidate specific area CSR1, the contents of the label and the determination result Ru by the learning model match. Calculate the rate. Since the match rate in the first candidate specific region CSR1 is 95.2%, the condition of the match rate is not satisfied. Therefore, the computer 100 does not extract the first candidate specific area CSR1 as the specific area SR. On the other hand, for example, when the matching rate of each of the second candidate specific area CSR2 and the third candidate specific area CSR3 is 90% or less, the computer 100 uses the second candidate specific area CSR2 or The third candidate specific area CSR3 is extracted as the specific areas SR2 and SR3 and displayed on the display device 50.

上記第３実施形態によれば、上記第１実施形態と同様の構成を有する点において同様の効果を奏する。例えば、第３実施形態によれば、ユーザーの意図した領域の第２学習画像を生成して学習モデルの再学習を行うことができる。また上記第３実施形態によれば、さらに第５詳細画面７０ｅを用いて、ユーザーは意図した再学習のための領域をより一層精度良く指定できる。例えば、ユーザーは、一致率が低い領域を領域検出条件の一つとして設定することで、一致率の高い領域において第２学習画像を生成することを抑制できる。つまり、学習モデルの再学習の為のデータ要素を低減しつつ、判定精度を向上できる再学習を効率良く実行できる。 According to the third embodiment, the same effect is obtained in that it has the same configuration as the first embodiment. For example, according to the third embodiment, it is possible to generate a second learning image of a region intended by the user and relearn the learning model. Further, according to the third embodiment, the user can further accurately specify the intended re-learning area by using the fifth detail screen 70e. For example, the user can suppress the generation of the second learning image in the region having a high matching rate by setting the region having a low matching rate as one of the region detection conditions. That is, it is possible to efficiently execute re-learning that can improve the determination accuracy while reducing the data elements for re-learning the learning model.

Ｄ．他の実施形態：
Ｄ−１．第１の他の実施形態：
上記各実施形態では、入力画面７１を表示装置５０に表示させることで、ユーザーからの領域検出条件の入力を受け付けていたが、これに限定されるものではない。例えば、コンピューターは、音声認識によってユーザーから領域検出条件の入力を受け付けてもよい。また、入力画面７１を介することなく、ユーザーが直線や曲線や円などによって、特徴量空間６２上に特定領域ＳＲを入力してもよい。 D. Other embodiments:
D-1. The first other embodiment:
In each of the above embodiments, the input screen 71 is displayed on the display device 50 to accept the input of the area detection condition from the user, but the present invention is not limited to this. For example, the computer may accept input of the area detection condition from the user by voice recognition. Further, the user may input the specific area SR on the feature amount space 62 by a straight line, a curve, a circle, or the like without going through the input screen 71.

Ｄ−２．第２の他の実施形態：
上記第２実施形態と第３実施形態とを組み合わせて実行してもよい。つまり、入力画面７１は、第１詳細画面７０ａ〜第５詳細画面７０ｅを有していてもよい。 D-2. Second other embodiment:
The second embodiment and the third embodiment may be combined and executed. That is, the input screen 71 may have the first detail screen 70a to the fifth detail screen 70e.

Ｄ−３．第３の他の実施形態：
上記各実施形態では、学習データ要素は、カメラによって検査対象物３９を撮像した学習画像３０であったが、各種センサーによって取得されるデータ要素であってもよい。例えば、学習データ要素としては、音データ要素や振動データ要素などであってもよい。音データ要素は、例えば打音検査に用いられ、マイク等によって取得されて音波の波形として表される。振動データ要素は、例えば設備診断技術等に利用される振動法に用いられ、加速度センサーやジャイロセンサー等によって取得されて、振動波形として表される。 D-3. Third other embodiment:
In each of the above embodiments, the learning data element is the learning image 30 obtained by capturing the inspection object 39 with the camera, but it may be a data element acquired by various sensors. For example, the learning data element may be a sound data element, a vibration data element, or the like. The sound data element is used, for example, for a tapping sound inspection, is acquired by a microphone or the like, and is represented as a sound wave waveform. The vibration data element is used in a vibration method used in, for example, equipment diagnosis technology, is acquired by an acceleration sensor, a gyro sensor, or the like, and is represented as a vibration waveform.

Ｄ−４．第４の他の実施形態：
上記各実施形態では、領域検出条件は、入力画面７１，７１Ａ，７１Ｂを介してユーザーによって入力されることでコンピューター１００が受け付けたが、これに限定されるものではない。例えば、領域検出条件は、予めコンピューター１００に設定されていてもよい。 D-4. Fourth Other Embodiment:
In each of the above embodiments, the area detection condition is accepted by the computer 100 by being input by the user via the input screens 71, 71A, 71B, but is not limited thereto. For example, the area detection condition may be set in the computer 100 in advance.

Ｅ．他の形態：
本開示は、上述の実施形態に限られるものではなく、その趣旨を逸脱しない範囲において種々の構成で実現することができる。例えば、発明の概要の欄に記載した各形態中の技術的特徴に対応する実施形態中の技術的特徴は、上述の課題の一部又は全部を解決するために、あるいは、上述の効果の一部又は全部を達成するために、適宜、差し替えや、組み合わせを行うことが可能である。また、その技術的特徴が本明細書中に必須なものとして説明されていなければ、適宜、削除することが可能である。 E. Other forms:
The present disclosure is not limited to the above-described embodiment, and can be realized by various configurations within a range not deviating from the gist thereof. For example, the technical features in the embodiments corresponding to the technical features in each form described in the column of the outline of the invention may be used to solve some or all of the above-mentioned problems, or one of the above-mentioned effects. It is possible to replace or combine as appropriate to achieve part or all. Further, if the technical feature is not described as essential in the present specification, it can be deleted as appropriate.

（１）本開示の一形態によれば、複数の第１学習データ要素と、前記複数の第１学習データ要素に関連付けられたラベルと、を用いて学習された学習モデルを再学習する再学習方法が提供される。前記学習モデルは、前記複数の第１学習データ要素と、前記複数の第１学習データ要素のそれぞれの特徴量との対応を表す学習済み生成モデルを含み、前記再学習方法は、前記特徴量を表すための特徴量空間と、前記特徴量に応じて前記特徴量空間上に配置された前記複数の第１学習データ要素とを、表示装置に表示することと、前記特徴量空間上における特定領域を検出するための領域検出条件を満たす前記特定領域を抽出することと、前記特定領域内の前記特徴量を有する複数の第２学習データ要素を、前記ラベルと関連付けて生成することと、生成した前記複数の第２学習データ要素と、前記複数の第２学習データ要素のそれぞれに関連付けられた前記ラベルとを用いて前記学習モデルを再学習することと、を備える。この形態によれば、領域検出条件の入力を受け付けて、領域検出条件を満たす特定領域を用いて再学習を行うための第２学習データ要素を生成することで、特徴量空間におけるユーザーの意図した領域の第２学習データを生成して学習モデルの再学習を行うことができる。 (1) According to one form of the present disclosure, re-learning for re-learning a learning model learned using a plurality of first learning data elements and labels associated with the plurality of first learning data elements. A method is provided. The learning model includes a trained generation model that represents the correspondence between the plurality of first training data elements and the feature amounts of the plurality of first training data elements, and the retraining method uses the feature amounts. Displaying the feature quantity space for representing and the plurality of first learning data elements arranged on the feature quantity space according to the feature quantity on the display device, and a specific region on the feature quantity space. The specific region satisfying the region detection condition for detecting the above is extracted, and a plurality of second learning data elements having the feature amount in the specific region are generated in association with the label. It comprises retraining the training model using the plurality of second training data elements and the labels associated with each of the plurality of second training data elements. According to this form, the user intended in the feature space by accepting the input of the area detection condition and generating the second learning data element for re-learning using the specific area satisfying the area detection condition. The second training data of the region can be generated and the learning model can be retrained.

（２）上記形態であって、さらに、前記領域検出条件の入力を受け付けることを有していてもよい。この形態によれば、ユーザーが領域検出条件を入力できるので、意図した再学習のための領域をより精度良く指定できる。 (2) In the above-mentioned form, it may further have to accept the input of the area detection condition. According to this form, since the user can input the area detection condition, the area for the intended re-learning can be specified more accurately.

（３）上記形態であって、前記入力を受け付けることは、前記領域検出条件の入力を受け付けるための入力画面を表示装置に表示することを含んでもよい。この形態によれば、ユーザーは入力画面を用いて意図した再学習のための領域をより精度良く指定できる。 (3) In the above-described embodiment, accepting the input may include displaying an input screen for accepting the input of the area detection condition on the display device. According to this form, the user can more accurately specify the intended re-learning area using the input screen.

（４）上記形態であって、前記入力画面は、前記特徴量空間における前記特定領域の範囲の入力を受け付ける第１詳細画面と、前記範囲内における前記第１学習データ要素に関連付けられた前記ラベルの分類数の入力を受け付ける第２詳細画面と、前記範囲内での前記第１学習データ要素の密度情報の入力を受け付ける第３詳細画面と、を有してもよい。この形態によれば、第１詳細画面と、第２詳細画面と、第３詳細画面とを用いて、ユーザーは意図した再学習のための領域をより一層精度良く指定できる。 (4) In the above embodiment, the input screen is a first detail screen that accepts input of a range of the specific area in the feature amount space, and a label associated with the first learning data element within the range. It may have a second detail screen that accepts the input of the number of classifications, and a third detail screen that accepts the input of the density information of the first learning data element within the range. According to this form, the user can more accurately specify the intended re-learning area by using the first detail screen, the second detail screen, and the third detail screen.

（５）上記形態であって、前記入力画面は、さらに、前記第１学習データ要素の種類に関する情報の入力を受け付ける第４詳細画面を有してもよい。 (5) In the above embodiment, the input screen may further have a fourth detail screen that accepts input of information regarding the type of the first learning data element.

（６）上記形態であって、前記入力画面は、さらに、前記範囲内の前記特徴量を有する前記第１学習データ要素について、前記学習モデルの学習時に用いられた前記ラベルの内容と、前記学習モデルを用いた判定結果との一致率に関する情報の入力を受け付ける第５詳細画面を有してもよい。この形態によれば、さらに第５詳細画面を用いて、ユーザーは意図した再学習のための領域をより一層精度良く指定できる。 (6) In the above embodiment, the input screen further includes the contents of the label used at the time of learning the learning model and the learning of the first learning data element having the feature amount within the range. It may have a fifth detail screen that accepts input of information regarding the concordance rate with the determination result using the model. According to this form, the user can further accurately specify the intended re-learning area by using the fifth detail screen.

本開示は、上記形態の他に、学習モデルを再学習するためのコンピュータープログラムや、コンピュータープログラムを記録した非一過性の記録媒体、学習モデルを再学習するための再学習装置等の形態で実現することができる。 In addition to the above forms, the present disclosure is in the form of a computer program for re-learning a learning model, a non-transient recording medium on which a computer program is recorded, a re-learning device for re-learning a learning model, and the like. It can be realized.

１０…機械学習システム、１２…入力層、１４…特徴量算出部、１８…復元部、１９…出力層、２０…判定システム、２２…判定部、２９…生成モデル、３０…第１学習画像、３０Ｉ…画像、３０Ｌ…学習画像、３０ａ…学習画像、３１…第１種対象物、３２…第２種対象物、３３…第３種対象物、３９…検査対象物、５０…表示装置、６０…学習用画面、６２…特徴量空間、７０…第１条件画面、７０ａ…第１詳細画面、７０ｂ…第２詳細画面、７０ｃ…第３詳細画面、７０ｄ…第４詳細画面、７０ｅ…第５詳細画面、７１…入力画面、７２…第２条件画面、７２ａ…領域入力画面、７２ｂ…ラベル設定画面、７４…グラフィカルユーザーインターフェース、７５…決定画面、１００…コンピューター、１０６…記憶部、ＣＳＲ１…第１候補特定領域、ＣＳＲ２…第２候補特定領域、ＣＳＲ３…第３候補特定領域、ＬＳＲ…学習用特定領域、Ｒｕ…判定結果、Ｒｘｙ…ブロック領域、ＳＲ…特定領域、ＳＲ１…第１特定領域、ＳＲ２…第２特定領域、ＳＲ３…第３特定領域、 10 ... Machine learning system, 12 ... Input layer, 14 ... Feature amount calculation unit, 18 ... Restoration unit, 19 ... Output layer, 20 ... Judgment system, 22 ... Judgment unit, 29 ... Generation model, 30 ... First learning image, 30I ... image, 30L ... learning image, 30a ... learning image, 31 ... type 1 object, 32 ... type 2 object, 33 ... type 3 object, 39 ... inspection object, 50 ... display device, 60 ... Learning screen, 62 ... Feature space, 70 ... 1st condition screen, 70a ... 1st detail screen, 70b ... 2nd detail screen, 70c ... 3rd detail screen, 70d ... 4th detail screen, 70e ... 5th Detail screen, 71 ... Input screen, 72 ... Second condition screen, 72a ... Area input screen, 72b ... Label setting screen, 74 ... Graphical user interface, 75 ... Decision screen, 100 ... Computer, 106 ... Storage unit, CSR1 ... No. 1 candidate specific area, CSR2 ... 2nd candidate specific area, CSR3 ... 3rd candidate specific area, LSR ... learning specific area, Ru ... judgment result, Rxy ... block area, SR ... specific area, SR1 ... 1st specific area, SR2 ... 2nd specific area, SR3 ... 3rd specific area,

Claims

A re-learning method for re-learning a learning model learned using a plurality of first training data elements and labels associated with the plurality of first training data elements.
The learning model includes a trained generative model that represents the correspondence between the plurality of first training data elements and the feature amounts of the plurality of first training data elements.
The re-learning method is
Displaying the feature amount space for expressing the feature amount and the plurality of first learning data elements arranged on the feature amount space according to the feature amount on the display device.
Extracting the specific region that satisfies the region detection condition for detecting the specific region on the feature space, and
To generate a plurality of second learning data elements having the feature amount in the specific region in association with the label.
A re-learning method comprising re-learning the learning model using the generated plurality of second learning data elements and the labels associated with each of the plurality of second learning data elements.

The re-learning method according to claim 1, further
A re-learning method comprising accepting an input of the area detection condition.

The re-learning method according to claim 2.
Receiving the input is a re-learning method including displaying an input screen for receiving the input of the area detection condition on the display device.

The re-learning method according to claim 3.
The input screen has a first detail screen that accepts input of a range of the specific area in the feature amount space, and a second that accepts input of the number of classifications of the label associated with the first learning data element within the range. A re-learning method including a detail screen and a third detail screen that accepts input of density information of the first learning data element within the range.

The re-learning method according to claim 4.
The re-learning method further comprises a fourth detail screen that accepts input of information regarding the type of the first learning data element.

The re-learning method according to any one of claims 3 to 5.
The input screen further matches the content of the label used at the time of learning the learning model with the determination result using the learning model for the first learning data element having the feature amount within the range. A re-learning method having a fifth detail screen that accepts input of information about the rate.

A computer program for re-learning a learning model learned using a plurality of first training data elements and labels associated with the plurality of first training data elements.
The learning model includes a trained generative model that represents the correspondence between the plurality of first training data elements and the feature amounts of the plurality of first training data elements.
The computer program
A function of displaying the feature amount space for expressing the feature amount and the plurality of first learning data elements arranged on the feature amount space according to the feature amount on a display device.
A function of extracting the specific area satisfying the area detection condition for detecting the specific area in the feature space, and
A function of generating a plurality of second learning data elements having the feature amount in the specific region in association with the label, and
A computer program that causes a computer to execute a function of retraining the learning model using the generated plurality of second training data elements and the label associated with each of the plurality of second training data elements. ..