WO2021210413A1 - Discrimination target classification method, program employed in same, and discriminating device - Google Patents

Discrimination target classification method, program employed in same, and discriminating device

Info

Publication number
WO2021210413A1
Authority
WO
WIPO (PCT)
Prior art keywords
feature
multidimensional
multidimensional feature
feature vector
discrimination
Application number
PCT/JP2021/014130
Other languages
French (fr)
Japanese (ja)
Inventor
貴之 石黒
星野 仁
Original Assignee
株式会社Roxy
Application filed by 株式会社Roxy
Priority to JP2022515303A (JPWO2021210413A1/ja)
Publication of WO2021210413A1 (en)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Definitions

  • The present invention relates to a method for classifying discrimination targets, a program used in the classification method, and a discriminating device.
  • Patent Document 1 describes a method in which a convolutional neural network performs convolutional-layer computation on an image to be discriminated, a feature quantity (a multidimensional feature vector) is extracted from the first fully connected layer after the convolutional-layer computation is completed, a multivariate analysis is executed using the extracted multidimensional feature vector, and the discrimination target is classified based on the result of the multivariate analysis.
  • However, the multidimensional feature vector extracted from the first fully connected layer after the convolutional-layer computation is completed is not always effective as a feature quantity for classifying the discrimination target, and there is still room for improvement in classification accuracy.
  • The present invention has been made in view of the above, and one of its objects is to provide a technique that contributes to improved classification accuracy.
  • In the classification method of the discrimination target, (a) data of the discrimination target is acquired; (b) a first multidimensional feature vector is extracted from the data of the discrimination target by a feature extractor using a neural network including deep learning; (c) the extracted first multidimensional feature vector is converted by a feature converter using a plurality of fully connected layers into a second multidimensional feature vector of lower dimension than the first multidimensional feature vector; and (d) the discrimination target is classified by a classifier using statistical machine learning based on the converted second multidimensional feature vector.
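  • As a minimal sketch of this extract, convert, classify pipeline (the module names, layer sizes, and the choice of PyTorch are illustrative assumptions, not taken from the patent; a statistical classifier for step (d) is sketched further below):

```python
import torch
import torch.nn as nn

class FeatureExtractor(nn.Module):
    """Step (b): CNN producing the first (high-dimensional) feature vector."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )

    def forward(self, x):               # x: (N, 3, 64, 64) image batch
        return self.conv(x).flatten(1)  # first multidimensional feature vector

class FeatureConverter(nn.Module):
    """Step (c): fully connected layers lowering the dimension."""
    def __init__(self, in_dim=32 * 16 * 16, out_dim=8):
        super().__init__()
        self.fc = nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU(),
                                nn.Linear(128, out_dim))

    def forward(self, v):
        return self.fc(v)               # second multidimensional feature vector
```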
  • the "data to be discriminated” in the present invention typically corresponds to the image data to be discriminated, and the image data to be discriminated literally corresponds to this, but has been learned. In the learning stage of the model, it is a concept that includes sample data with a teacher signal.
  • the "statistical machine learning device” in the present invention refers to machine learning other than machine learning using a neural network including deep learning, such as gradient boosting, support vector machine, random forest, neural network, and Gaussian normalization. , Sansamble inspection, etc. correspond to this.
  • According to the present invention, the low-dimensional second multidimensional feature vector, which is a feature converted by the feature converter and is effective for classifying the discrimination target, is used for the classification, so the classification of the discrimination target can be realized with high accuracy. Moreover, since the feature extractor that extracts the first multidimensional feature vector, i.e. the diverse feature quantities of the data of the discrimination target, the feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector into the low-dimensional second multidimensional feature vector, and the classifier that classifies the discrimination target based on the second multidimensional feature vector converted by the feature converter are separate components, the training of the feature extractor and the feature converter and the training of the classifier can be carried out separately when improving the classification accuracy of the discrimination target. That is, when the low classification accuracy stems from low feature extraction and conversion accuracy of the feature extractor and the feature converter, the feature extractor and the feature converter are retrained, and when it stems from low classification accuracy of the classifier, only the classifier is retrained, so the classification accuracy can be improved efficiently.
  • According to a preferred form of the program of the present invention, a program for classifying the discrimination target is configured. The program causes one or more computers to execute each step of the classification method of the discrimination target according to any of the above-described aspects of the present invention.
  • The program may be recorded on a computer-readable recording medium, for example a hard disk, ROM, SSD, flash memory (USB memory, SD card, etc.), floppy disk, CD, or DVD, may be distributed from one computer to another via a transmission medium, for example a communication network such as the Internet or a LAN, or may be exchanged in any other manner.
  • By having one computer execute the program, or having a plurality of computers execute its steps in a shared manner, each step of the classification method of the discrimination target according to any of the above-described aspects is executed, so the same effects as the above-described classification method of the present invention, for example accurate classification of the discrimination target, can be obtained.
  • The discriminating device includes: a data acquisition unit that acquires data of a discrimination target; a feature extractor using a neural network including deep learning that extracts a first multidimensional feature vector from the acquired data of the discrimination target; a feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector extracted by the feature extractor into a second multidimensional feature vector of lower dimension than the first multidimensional feature vector; and a classifier using statistical machine learning that classifies the discrimination target using the second multidimensional feature vector.
  • the "data to be discriminated" in the present invention typically corresponds to the image data to be discriminated, and the image data to be discriminated literally corresponds to this, but feature extraction.
  • the "statistical machine learning device" in the present invention refers to machine learning other than machine learning using a neural network including deep learning, such as gradient boosting, support vector machine, random forest, neural network, and Gaussian normalization. , Sansamble inspection, etc. correspond to this.
  • According to the present invention, the low-dimensional second multidimensional feature vector, which is a feature converted by the feature converter and is effective for classifying the discrimination target, is used for the classification, so the classification of the discrimination target can be realized with high accuracy. Moreover, since the feature extractor that extracts the first multidimensional feature vector, i.e. the diverse feature quantities of the data of the discrimination target, the feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector into the low-dimensional second multidimensional feature vector, and the classifier that classifies the discrimination target based on the second multidimensional feature vector converted by the feature converter are separate components, the training of the feature extractor and the feature converter and the training of the classifier can be carried out separately when improving the classification accuracy of the discrimination target. That is, when the low classification accuracy stems from low feature extraction accuracy of the feature extractor and the feature converter, the feature extractor and the feature converter are retrained, and when it stems from low classification accuracy of the classifier, only the classifier is retrained, so the classification accuracy can be improved efficiently.
  • The discriminating device further includes a storage unit that stores the second multidimensional feature vectors, and a calculation unit that calculates distance values between the plurality of second multidimensional feature vectors stored in the storage unit. In the learning stage, the feature extractor and the feature converter learn using the error backpropagation method and the gradient descent method based on the distance values calculated by the calculation unit.
  • According to this form, the classification of the discrimination target is performed using second multidimensional feature vectors that are effective for the classification and that were extracted and converted in the learning stage of the feature extractor and the feature converter, so the classification accuracy can be improved still further.
  • According to the present invention, the classification accuracy can be improved.
  • As shown in FIG. 1, the computer 1 that functions as the discriminating device according to the present embodiment is configured as a microprocessor system centered on a CPU 2, and includes a ROM 4 that stores various processing programs, a RAM 6 that temporarily stores data, a GPU 8 that performs the computation and matrix operations required for image processing, a hard disk (HDD) 10 serving as a large-capacity memory that stores various data including application programs (simply called applications) and image data, and an input/output interface (I/F) 12 for exchanging data with external devices such as a camera 70.
  • In the following, for convenience of explanation, the computer 1 as the discriminating device according to the present embodiment will be described as a device that discriminates whether the discrimination target is an OK product or an NG product. The computer 1 is an example of an implementation configuration corresponding to the "discriminating device" in the present invention.
  • The computer 1 also includes an input device 14 such as a keyboard and mouse with which a user inputs various commands, and a display 60 that displays various information. The CPU 2, ROM 4, RAM 6, GPU 8, HDD 10, I/F 12, input device 14, display 60, and so on are electrically connected by a bus 80 so that various control signals and data can be exchanged among them.
  • The computer 1 has a function of executing an operation corresponding to an input operation when the user operates the cursor or the like displayed on the display 60 via the input device 14. The computer 1 functions as the discriminating device according to the present embodiment when various processes are executed by an application stored in the HDD 10, specifically a discrimination application that executes discrimination processing of the discrimination target. In the present embodiment the discriminating device is realized by the computer 1, but it may also be realized as a dedicated device.
  • Further, as shown in FIG. 2, in the computer 1, the above-mentioned hardware resources such as the CPU 2, ROM 4, RAM 6, GPU 8, HDD 10, I/F 12, input device 14, and display 60 cooperate with software such as the discrimination application and the visualization program according to the present embodiment to configure an image display control unit 20, an image acquisition unit 22, an area designation unit 24, a feature extractor 26, a feature converter 27, a classifier 28, a distance calculator 29, a two-dimensional graphing data generation unit 30, a storage unit 33, and the like as functional blocks. In other words, each of these units (the image display control unit 20, image acquisition unit 22, area designation unit 24, feature extractor 26, feature converter 27, classifier 28, distance calculator 29, two-dimensional graphing data generation unit 30, and storage unit 33) is a function realized by the components shown in FIG. 1 operating alone or in cooperation in response to commands from the CPU 2 executing the application loaded from the HDD 10 into the RAM 6.
  • The feature extractor 26, the feature converter 27, and the display 60 (a window 62 described later) are an example of an implementation configuration corresponding to the "visualization device" in the present invention.
  • When the discrimination application is started, the image display control unit 20 displays a predetermined window 62 on a screen 61 of the display 60, as shown in FIG. 1. The image display control unit 20 displays on the window 62 a sample image with a teacher signal based on the teacher-signal-attached sample image data 34a (see FIG. 2) selected by the user operating the cursor or the like displayed on the display 60 via the input device 14, displays a discrimination target image on the window 62 based on the discrimination target image data 34b (see FIG. 2) acquired by the image acquisition unit 22, and displays a designated area image on the window 62 based on the designated area image data 34c (see FIG. 2) acquired by the area designation unit 24.
  • The image display control unit 20 also displays a two-dimensional graph on the window 62 based on the model two-dimensional graphing data 36a (see FIG. 2) stored in the storage unit 33, and displays a discrimination two-dimensional graph on the window 62 based on the discrimination two-dimensional graphing data 36b generated by the two-dimensional graphing data generation unit 30. Further, the image display control unit 20 displays the discrimination result from the classifier 28 in the window 62.
  • The sample image with a teacher signal is an image used for training the feature extractor 26 and the feature converter 27.
  • The discrimination target image typically corresponds to an image of a target whose classification (OK product or NG product) is unknown, but, for the purpose of verifying the learning results of the feature extractor 26 and the feature converter 27 and improving learning accuracy, it preferably also includes an image of a target whose classification (OK product or NG product) is known, for example a teacher-signal-attached sample image arbitrarily selected by the user from the plurality of such sample images.
  • The image acquisition unit 22 acquires the teacher-signal-attached sample image data 34a and the discrimination target image data 34b captured by the camera 70 (see FIG. 1), and supplies these acquired image data to the image display control unit 20, the feature extractor 26, the storage unit 33, and the like.
  • When an arbitrary area of the teacher-signal-attached sample image or the discrimination target image displayed on the window 62 is clicked or dragged by an input operation via the user's input device 14, the area designation unit 24 acquires the image data of that area (hereinafter referred to as "designated area image data") 34c, and supplies the acquired designated area image data 34c (see FIG. 2) to the image display control unit 20, the feature extractor 26, the storage unit 33, and the like.
  • The feature extractor 26 extracts first multidimensional feature vectors 90a, 90b, 90c from the plurality of teacher-signal-attached sample image data 34a, the discrimination target image data 34b, and the designated area image data 34c, and supplies the extracted first multidimensional feature vectors 90a, 90b, 90c to the feature converter 27.
  • The feature extractor 26 is configured using a convolutional neural network (CNN), as shown in FIGS. 3 and 4.
  • The first multidimensional feature vectors 90a, 90b, and 90c are each an example of an implementation configuration corresponding to the "first multidimensional feature vector" in the present invention.
  • In the feature extractor 26, a so-called convolution process, which extracts the features of the image data 34a, 34b, 34c from the plurality of teacher-signal-attached sample image data 34a, the discrimination target image data 34b, and the designated area image data 34c without losing those features, is performed a plurality of times using predetermined filters (not shown), and the result is then flattened into a column vector.
  • The feature converter 27 has a plurality of fully connected layers 27a and 27b and, using the fully connected layers 27a and 27b, executes a process of lowering the dimension of the first multidimensional feature vectors 90a, 90b, 90c extracted by the feature extractor 26. Specifically, the first multidimensional feature vectors 90a, 90b, 90c are converted by fully connected processing into second multidimensional feature vectors 92a, 92b, 92c of lower dimension than the first multidimensional feature vectors 90a, 90b, 90c, and the results are supplied to the classifier 28, the distance calculator 29, the two-dimensional graphing data generation unit 30, and the storage unit 33.
  • Note that the first multidimensional feature vectors 90a, 90b, 90c may instead be converted into the low-dimensional second multidimensional feature vectors 92a, 92b, 92c by a pooling process such as global max pooling or global average pooling in place of the fully connected processing.
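  • A sketch of this pooling alternative, assuming the extractor's pre-flattening output is an (N, C, H, W) feature map (the shapes are illustrative):

```python
import torch

def pool_to_low_dim(feature_map: torch.Tensor, mode: str = "avg") -> torch.Tensor:
    """Reduce an (N, C, H, W) CNN feature map to an (N, C) vector,
    replacing the fully connected conversion with global pooling."""
    if mode == "max":                       # global max pooling
        return feature_map.amax(dim=(2, 3))
    return feature_map.mean(dim=(2, 3))     # global average pooling

fmap = torch.randn(4, 32, 16, 16)   # e.g. extractor output before flattening
second_vec = pool_to_low_dim(fmap)  # shape (4, 32): low-dimensional vectors
```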
  • The converted second multidimensional feature vectors 92a, 92b, and 92c are each stored in the storage unit 33. The second multidimensional feature vectors 92a, 92b, and 92c are examples of an implementation configuration corresponding to the "second multidimensional feature vector" in the present invention.
  • The feature extractor 26 and the feature converter 27 are trained in advance so that first multidimensional feature vectors 90a, 90b, 90c and second multidimensional feature vectors 92a, 92b, 92c that allow the classification of the discrimination target to be discriminated accurately can be obtained, and the trained model 35 is stored in the storage unit 33.
  • The classifier 28 functions when discriminating the classification of the discrimination target. The classification of the discrimination target, that is, whether the discrimination target is an OK product or an NG product, is determined based on the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27, and the determination result is supplied to the image display control unit 20. Specifically, using computation parameters machine-learned with the second multidimensional feature vectors, a one-dimensional numerical value indicating the degree of certainty that the second multidimensional feature vectors 92b, 92c of the discrimination target represent an OK product is calculated; if the value is equal to or greater than a set threshold, the target is classified as an OK product, and if it is less than the threshold, it is classified as an NG product.
  • The classifier 28 is configured using so-called statistical machine learning such as gradient boosting, support vector machines, random forests, neural networks, Gaussian normalization, and ensemble methods.
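  • As a minimal sketch of such a thresholded OK/NG decision, assuming a scikit-learn gradient-boosting model trained on stored second feature vectors (the data, dimensions, and threshold are illustrative):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

# Training phase (illustrative data): stored second multidimensional
# feature vectors with teacher labels, 1 = OK product, 0 = NG product.
X_train = np.random.rand(200, 8)
y_train = np.random.randint(0, 2, 200)
clf = GradientBoostingClassifier().fit(X_train, y_train)

def classify(second_vec: np.ndarray, threshold: float = 0.5) -> str:
    """One-dimensional certainty score compared against a set threshold."""
    certainty = clf.predict_proba(second_vec.reshape(1, -1))[0, 1]
    return "OK" if certainty >= threshold else "NG"

print(classify(np.random.rand(8)))  # usage example
```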
  • The distance calculator 29 functions when generating the trained model 35, and when verifying the trained model 35 and improving its accuracy.
  • When generating the trained model 35, the distance calculator 29 calculates the distances between the second multidimensional feature vectors 92a of the plurality of teacher-signal-attached sample image data 34a, and stores the calculated distances in the storage unit 33 as distance data 38. The calculated distances are fed back to the feature extractor 26 and the feature converter 27 (see FIG. 4A).
  • When verifying the trained model 35 and improving its accuracy, the distance calculator 29 calculates the distances between the second multidimensional feature vectors 92a of the plurality of teacher-signal-attached sample image data 34a and the second multidimensional feature vectors 92b, 92c of the discrimination target image data 34b and the designated area image data 34c, and stores the calculated distances in the storage unit 33 as distance data 38.
  • The feedback to the feature extractor 26 and the feature converter 27 is performed by modifying the parameters of the feature extractor 26 and the feature converter 27 so that the distance between the second multidimensional feature vectors 92a of OK products becomes small and the distance between the second multidimensional feature vectors 92a of OK products and those of NG products becomes large. In such parameter modification (training of the feature extractor 26 and the feature converter 27), the second multidimensional feature vectors 92a are visualized (graphed in two dimensions), and the trained model 35 (the feature extractor 26 and the feature converter 27) is modified accordingly.
  • In this way, the trained model 35 is generated and stored in the storage unit 33. Note that training the trained model 35 is synonymous with training the feature extractor 26 and the feature converter 27.
  • In the present embodiment, deep metric learning is used to modify the parameters of the feature extractor 26 and the feature converter 27. That is, the first multidimensional feature vector 90a is extracted by the feature extractor 26, the extracted first multidimensional feature vector 90a is converted into the second multidimensional feature vector 92a by the feature converter 27, and the feature extractor 26 and the feature converter 27 are trained by the error backpropagation method and the gradient descent method based on the distance values calculated between the converted second multidimensional feature vectors 92a.
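  • A minimal sketch of one common deep-metric-learning formulation, a contrastive loss over pairs of second feature vectors (the patent does not specify the exact loss; the modules reuse the illustrative sketch given earlier):

```python
import torch
import torch.nn.functional as F

def contrastive_loss(v1, v2, same_class, margin=1.0):
    """Pull second feature vectors of the same class together and push
    vectors of different classes at least `margin` apart."""
    d = F.pairwise_distance(v1, v2)                    # distance values
    loss_same = same_class * d.pow(2)
    loss_diff = (1.0 - same_class) * F.relu(margin - d).pow(2)
    return (loss_same + loss_diff).mean()

# Illustrative training step, using the earlier FeatureExtractor/FeatureConverter:
# params = list(extractor.parameters()) + list(converter.parameters())
# opt = torch.optim.SGD(params, lr=1e-3)               # gradient descent
# loss = contrastive_loss(converter(extractor(xa)),
#                         converter(extractor(xb)), y_same)
# loss.backward()                                      # error backpropagation
# opt.step()
```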
  • The two-dimensional graphing data generation unit 30 generates the model two-dimensional graphing data 36a based on the second multidimensional feature vectors 92a of the plurality of teacher-signal-attached sample image data 34a, and generates the discrimination two-dimensional graphing data 36b based on the second multidimensional feature vectors 92b, 92c of the discrimination target image data 34b and the designated area image data 34c.
  • The model two-dimensional graphing data 36a and the discrimination two-dimensional graphing data 36b are generated by treating each of the second multidimensional feature vectors 92a, 92b, 92c as a set of as many feature quantities as it has dimensions. That is, when the second multidimensional feature vectors 92a, 92b, 92c are n-dimensional column vectors (f1, f2, f3, ..., fn-1, fn), each is treated as data consisting of the set of n feature quantities f1, f2, f3, ..., fn-1, fn, as shown in FIG. 5.
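  • A sketch of this kind of visualization, plotting each n-dimensional second feature vector as its n feature values connected by a line (matplotlib and the axis orientation are illustrative choices):

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_second_vectors(vectors: np.ndarray, labels: np.ndarray) -> None:
    """Draw each n-dimensional second feature vector as a connected line
    over its feature indices 1..n, one curve per sample."""
    idx = np.arange(1, vectors.shape[1] + 1)
    for vec, label in zip(vectors, labels):
        plt.plot(idx, vec, color="tab:blue" if label == 1 else "tab:red")
    plt.xlabel("feature index (1..n)")
    plt.ylabel("feature value f_i")
    plt.show()

plot_second_vectors(np.random.rand(10, 8), np.random.randint(0, 2, 10))
```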
  • When, for the purpose of verifying the learning result of the trained model 35 (the feature extractor 26 and the feature converter 27) and improving learning accuracy, the user selects one arbitrary teacher-signal-attached sample image data item (hereinafter referred to as "arbitrary sample image data 34a'") from the plurality of teacher-signal-attached sample image data 34a stored in the storage unit 33, the two-dimensional graphing data generation unit 30 extracts from the model two-dimensional graphing data 36a the portion generated based on the second multidimensional feature vector 92a of that data (hereinafter referred to as the "arbitrary second multidimensional feature vector 92a'"); the extracted data is referred to as the "selected two-dimensional graphing data 36a'".
  • The storage unit 33 is secured in at least one of the RAM 6 and the HDD 10, and stores the teacher-signal-attached sample image data 34a captured by the camera 70 (see FIG. 1), the discrimination target image data 34b, the designated area image data 34c acquired by the area designation unit 24, the trained model 35, the second multidimensional feature vectors 92a, 92b, 92c, and the distance data 38 calculated by the distance calculator 29.
  • FIG. 6 is a main flowchart showing an example of the classification determination routine.
  • When the routine is executed, the feature extractor 26 using the trained model 35 (the trained feature extractor 26) first reads the discrimination target image data 34b, the arbitrary sample image data 34a', or the designated area image data 34c (step S10), and executes a process of extracting the first multidimensional feature vectors 90a, 90b, 90c from the read discrimination target image data 34b, arbitrary sample image data 34a', or designated area image data 34c (step S12).
  • Next, the feature converter 27 using the trained model 35 (the trained feature converter 27) executes a process of converting the first multidimensional feature vectors 90a, 90b, 90c extracted by the feature extractor 26 into the second multidimensional feature vectors 92a, 92b, 92c (step S14).
  • The classifier 28 then uses the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27 to execute a process of classifying not only images of targets whose classification is unknown but also arbitrary sample images and images of arbitrary areas of these images designated by the user (designated area images) (step S16), outputs the discrimination result (OK product or NG product) according to the classification (step S18), and ends the routine.
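  • Tying the steps together, a sketch of the S10 to S18 flow under the same illustrative modules and classifier as the earlier sketches:

```python
def classification_routine(image, extractor, converter, clf, threshold=0.5):
    """Steps S10-S18: read an image, extract, convert, classify, output."""
    x = image.unsqueeze(0)             # S10: read one image tensor (C, H, W)
    first_vec = extractor(x)           # S12: first multidimensional feature vector
    second_vec = converter(first_vec)  # S14: second multidimensional feature vector
    certainty = clf.predict_proba(second_vec.detach().numpy())[0, 1]
    return "OK" if certainty >= threshold else "NG"   # S16/S18: result
```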
  • According to the classification determination routine, the first multidimensional feature vectors 90a, 90b, 90c are converted into the second multidimensional feature vectors 92a, 92b, 92c by the feature converter 27 trained in advance using deep metric learning, so the dimension can be reduced without losing the features that are effective for classifying the discrimination target.
  • Moreover, the training of the trained model 35 (the feature extractor 26 and the feature converter 27) using deep metric learning is visualized, and the training is carried out while visually confirming the learning state (individuality) of the trained model 35 until that state becomes a desired one, so the first multidimensional feature vectors can be appropriately converted into second multidimensional feature vectors 92a, 92b, 92c that are effective for classifying the discrimination target.
  • FIG. 7A is an explanatory diagram showing a state in which the trained model 35 is in an appropriate learning state (individuality), and FIG. 7B is an explanatory diagram showing a state in which the learning state (individuality) of the trained model 35 is not appropriate.
  • the reference numeral "Og” is a graph generated by the two-dimensional graphing data generation unit 30 based on the second multidimensional feature vector 92a of the sample image data 34a with a plurality of teacher signals of the OK product.
  • the symbol “Ng” is a graph generated by the two-dimensional graphing data generation unit 30 based on the second multidimensional feature vector 92a of the sample image data 34a with a plurality of teacher signals of the NG product.
  • FIGS. 7A and 7B are both visualizations of trained models 35 for which the one-dimensional numerical indexes generally used to evaluate the learning state (individuality) of a trained model 35, such as the accuracy rate, precision, and recall, are the same or similar.
  • As shown in FIG. 7A, when the trained model 35 is in an appropriate learning state (individuality), the model two-dimensional graphs Og and the model two-dimensional graphs Ng are displayed clearly separated and the boundary between them is clear, and by using the feature converter 27 based on the trained model 35 whose learning state (individuality) is appropriate, the second multidimensional feature vectors 92a, 92b, 92c effective for classifying the discrimination target can be appropriately obtained.
  • On the other hand, as shown in FIG. 7B, when the learning state (individuality) of the trained model 35 is not appropriate, the model two-dimensional graphs Og and the model two-dimensional graphs Ng are displayed close to each other and the boundary becomes unclear; if the feature converter 27 based on such a trained model 35 is used, the second multidimensional feature vectors 92a, 92b, 92c effective for classifying the discrimination target cannot be appropriately obtained.
  • By visualizing the learning state (individuality) of the trained model 35 in this way, it is possible to evaluate whether the trained model 35 is in a learning state (individuality) that can appropriately discriminate the classification of the discrimination target, so an appropriate trained model 35 can be generated with a smaller amount of sample data than before. Being able to generate an appropriate trained model 35 in turn widens the range in which a threshold appropriate for the classification of the discrimination target by the classifier 28 can be set. As a result, erroneous discrimination due to disturbances, for example the influence of external light when acquiring the image to be discriminated, can be suppressed, and accurate discrimination results can be obtained stably.
  • Furthermore, in addition to the model two-dimensional graphs Og and Ng, the states of the second multidimensional feature vector 92b of the discrimination target and the second multidimensional feature vector 92c of the user-designated area are displayed as the discrimination two-dimensional graph Dg, so the discrimination status of the discrimination target and the classification of the designated area can be visualized. As a result, the user can visually confirm the discrimination status of the discrimination target and of the portion of concern (for example, whether a scratch on the discrimination target can be discriminated as NG), and if necessary the training of the feature extractor 26 and the feature converter 27, or the training of the classifier 28, can be redone.
  • Since the classifier 28 classifies the discrimination target using the second multidimensional feature vectors 92a, 92b, 92c, which have features effective for the classification of the discrimination target, the classification of the discrimination target can be realized with high accuracy.
  • The classification of the discrimination target by the classifier 28 is performed by calculating, based on the computation parameters machine-learned using the plurality of second multidimensional feature vectors 92a stored in the storage unit 33, a one-dimensional numerical value indicating the certainty that the second multidimensional feature vectors 92b, 92c of the discrimination target represent an OK product, and comparing it with the set threshold.
  • For example, when the shapes of the discrimination two-dimensional graph Dg and the selected two-dimensional graph Cg are separated from the model two-dimensional graphs Ng and differ in shape, yet the target is determined to be an NG product, or when their shapes differ from the model two-dimensional graphs Og, yet the target is determined to be an OK product, it can be assumed that the computation parameters or the threshold of the classifier 28 are not set appropriately. In this way, the cause of an erroneous determination can be easily identified, and countermeasures can be taken appropriately.
  • The discrimination two-dimensional graph Dg is a graph generated by the two-dimensional graphing data generation unit 30 based on the second multidimensional feature vectors 92b, 92c of the discrimination target image data 34b and the designated area image data 34c. The selected two-dimensional graph Cg is a graph generated by the two-dimensional graphing data generation unit 30 based on the selected two-dimensional graphing data 36a'.
  • According to the present embodiment, the first multidimensional feature vectors 90a, 90b, 90c are extracted from the plurality of teacher-signal-attached sample image data 34a, the discrimination target image data 34b, and the designated area image data 34c using the trained feature extractor 26; the extracted first multidimensional feature vectors 90a, 90b, 90c are converted with the trained feature converter 27 into the second multidimensional feature vectors 92a, 92b, 92c, which are lower in dimension than the first multidimensional feature vectors 90a, 90b, 90c and effective for classifying the discrimination target; and the discrimination target is classified based on the converted second multidimensional feature vectors 92a, 92b, 92c. Therefore, the classification of the discrimination target can be realized with high accuracy.
  • Moreover, since the feature extractor 26 that extracts the first multidimensional feature vectors 90a, 90b, 90c, i.e. the diverse feature quantities of the plurality of teacher-signal-attached sample image data 34a, the discrimination target image data 34b, and the designated area image data 34c, the feature converter 27 that converts the first multidimensional feature vectors 90a, 90b, 90c into the low-dimensional second multidimensional feature vectors 92a, 92b, 92c, and the classifier 28 that classifies the discrimination target based on the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27 are separate components, the training of the feature extractor 26 and the feature converter 27 and the training of the classifier 28 can be carried out separately when improving the classification accuracy of the discrimination target. That is, when the low classification accuracy stems from low feature extraction and conversion accuracy of the feature extractor 26 and the feature converter 27, the feature extractor 26 and the feature converter 27 are retrained, and when it stems from low classification accuracy of the classifier 28, only the classifier 28 is retrained, so the classification accuracy can be improved efficiently.
  • In the present embodiment, the discriminating device has been described as a device that discriminates whether the discrimination target is an OK product or an NG product, but the present invention is not limited to this. For example, the discriminating device may be applied to a device that discriminates into which of a plurality of classifications the discrimination target falls.
  • In the present embodiment, the model two-dimensional graphing data 36a and the discrimination two-dimensional graphing data 36b use a Cartesian coordinate system in which the vertical axis carries the column numbers 1, 2, 3, ..., n-1, n and the horizontal axis carries the feature quantity, and a two-dimensional graph in which the feature quantities f1, f2, f3, ..., fn-1, fn are plotted and connected by curves or straight lines, but the present invention is not limited to this.
  • In the present embodiment, the input device 14 has been described as a keyboard and a mouse, but it may also include a pointing device such as a touch panel, buttons, dials, touch sensors, touch pads, and the like.
  • In the present embodiment, the large-capacity memory that stores various data including image data is the HDD 10, but the present invention is not limited to this; a flash memory (USB memory, SD card, etc.), an SSD, a floppy disk, a CD, a DVD, or the like may be used as the large-capacity memory instead.
  • In the present embodiment, the various applications including the visualization program are stored in the HDD 10, but the present invention is not limited to this; the applications may be distributed to the computer 1 serving as the discriminating device according to the present embodiment from another computer via a transmission medium, for example a communication network such as the Internet or a LAN.
  • The present embodiment shows an example of a mode for carrying out the present invention; therefore, the present invention is not limited to the configuration of the present embodiment.

Abstract

[Problem] To improve classification accuracy. [Solution] First multidimensional feature vectors 90a, 90b, 90c are extracted from a plurality of sets of teacher-signal-attached sample image data 34a, discrimination target image data 34b, or designated area image data 34c, using a trained feature extractor 26; the extracted first multidimensional feature vectors 90a, 90b, 90c are converted, using a trained feature converter 27, into second multidimensional feature vectors 92a, 92b, 92c having fewer dimensions than the first multidimensional feature vectors 90a, 90b, 90c and effective for classification of a discrimination target; and classification of the discrimination target is performed on the basis of the converted second multidimensional feature vectors 92a, 92b, 92c. As a result, classification of the discrimination target can be implemented with high accuracy.

Description

Classification method of discrimination target, program used therefor, and discrimination device

The present invention relates to a method for classifying discrimination targets, a program used in the classification method, and a discriminating device.

Japanese Patent Application Laid-Open No. 2019-211288 (Patent Document 1) describes a method in which a convolutional neural network performs convolutional-layer computation on an image to be discriminated, a feature quantity (a multidimensional feature vector) is extracted from the first fully connected layer after the convolutional-layer computation is completed, a multivariate analysis is executed using the extracted multidimensional feature vector, and the discrimination target is classified based on the result of the multivariate analysis.

In this classification method, the extraction of the multidimensional feature vector (using a convolutional neural network) and the classification of the discrimination target (using multivariate analysis) are realized as separate components. This makes it possible to easily change classification criteria such as non-defective and defective products.

JP-A-2019-211288
However, the multidimensional feature vector extracted from the first fully connected layer after the convolutional-layer computation is completed is not always effective as a feature quantity for classifying the discrimination target, and there is still room for improvement in classification accuracy.

The present invention has been made in view of the above, and one of its objects is to provide a technique that contributes to improved classification accuracy.
According to a preferred form of the classification method of the discrimination target according to the present invention, (a) data of the discrimination target is acquired; (b) a first multidimensional feature vector is extracted from the data of the discrimination target by a feature extractor using a neural network including deep learning; (c) the extracted first multidimensional feature vector is converted by a feature converter using a plurality of fully connected layers into a second multidimensional feature vector of lower dimension than the first multidimensional feature vector; and (d) the discrimination target is classified by a classifier using statistical machine learning based on the converted second multidimensional feature vector. Here, the "data of the discrimination target" in the present invention typically corresponds to image data of the discrimination target, that is, image data of an object that literally needs to be discriminated, but in the learning stage of the trained model the concept also includes sample data with a teacher signal. Further, the "statistical machine learner" in the present invention means machine learning other than machine learning using a neural network including deep learning; for example, gradient boosting, support vector machines, random forests, neural networks, Gaussian normalization, ensemble methods, and the like correspond to this.

According to the present invention, the low-dimensional second multidimensional feature vector, which is a feature converted by the feature converter and is effective for classifying the discrimination target, is used for the classification, so the classification of the discrimination target can be realized with high accuracy. Moreover, since the feature extractor that extracts the first multidimensional feature vector, i.e. the diverse feature quantities of the data of the discrimination target, the feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector into the low-dimensional second multidimensional feature vector, and the classifier that classifies the discrimination target based on the second multidimensional feature vector converted by the feature converter are separate components, the training of the feature extractor and the feature converter and the training of the classifier can be carried out separately when improving the classification accuracy. That is, when the low classification accuracy stems from low feature extraction and conversion accuracy of the feature extractor and the feature converter, the feature extractor and the feature converter are retrained, and when it stems from low classification accuracy of the classifier, only the classifier is retrained, so the classification accuracy can be improved efficiently.
According to a preferred form of the program according to the present invention, a program for classifying the discrimination target is configured. The program causes one or more computers to execute each step of the classification method of the discrimination target according to any of the above-described aspects of the present invention. The program may be recorded on a computer-readable recording medium, for example a hard disk, ROM, SSD, flash memory (USB memory, SD card, etc.), floppy disk, CD, or DVD, may be distributed from one computer to another via a transmission medium, for example a communication network such as the Internet or a LAN, or may be exchanged in any other manner.

According to the present invention, by having one computer execute the program, or having a plurality of computers execute its steps in a shared manner, each step of the classification method of the discrimination target according to any of the above-described aspects is executed, so the same effects as the above-described classification method of the present invention, for example accurate classification of the discrimination target, can be obtained.
According to a preferred form of the discriminating device according to the present invention, the device includes: a data acquisition unit that acquires data of a discrimination target; a feature extractor using a neural network including deep learning that extracts a first multidimensional feature vector from the acquired data of the discrimination target; a feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector extracted by the feature extractor into a second multidimensional feature vector of lower dimension than the first multidimensional feature vector; and a classifier using statistical machine learning that classifies the discrimination target using the second multidimensional feature vector. Here, the "data of the discrimination target" in the present invention typically corresponds to image data of the discrimination target, that is, image data of an object that literally needs to be discriminated, but in the learning stage of the feature extractor and the feature converter the concept also includes image data of samples with a teacher signal. Further, the "statistical machine learner" in the present invention means machine learning other than machine learning using a neural network including deep learning; for example, gradient boosting, support vector machines, random forests, neural networks, Gaussian normalization, ensemble methods, and the like correspond to this.

According to the present invention, the low-dimensional second multidimensional feature vector, which is a feature converted by the feature converter and is effective for classifying the discrimination target, is used for the classification, so the classification of the discrimination target can be realized with high accuracy. Moreover, since the feature extractor that extracts the first multidimensional feature vector, i.e. the diverse feature quantities of the data of the discrimination target, the feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector into the low-dimensional second multidimensional feature vector, and the classifier that classifies the discrimination target based on the second multidimensional feature vector converted by the feature converter are separate components, the training of the feature extractor and the feature converter and the training of the classifier can be carried out separately when improving the classification accuracy. That is, when the low classification accuracy stems from low feature extraction accuracy of the feature extractor and the feature converter, the feature extractor and the feature converter are retrained, and when it stems from low classification accuracy of the classifier, only the classifier is retrained, so the classification accuracy can be improved efficiently.
According to a further form of the discriminating device according to the present invention, the device further includes a storage unit that stores the second multidimensional feature vectors, and a calculation unit that calculates distance values between the plurality of second multidimensional feature vectors stored in the storage unit. In the learning stage, the feature extractor and the feature converter learn using the error backpropagation method and the gradient descent method based on the distance values calculated by the calculation unit.

According to this form, the classification of the discrimination target is performed using second multidimensional feature vectors that are effective for the classification and that were extracted and converted in the learning stage of the feature extractor and the feature converter, so the classification accuracy can be improved still further.

According to the present invention, the classification accuracy can be improved.
FIG. 1 is a configuration diagram outlining the configuration of the computer 1 that functions as the discriminating device according to the embodiment of the present invention.
FIG. 2 is a functional block diagram showing the functional configuration of the computer 1 that functions as the discriminating device according to the embodiment of the present invention.
FIG. 3 is a configuration diagram outlining the configuration of the feature extractor 26.
FIG. 4 is an explanatory diagram outlining the generation of the trained model 35 and the classification of the discrimination target.
FIG. 5 is an explanatory diagram outlining the two-dimensional graphing of the second multidimensional feature vectors 92a, 92b, 92c.
FIG. 6 is a flowchart showing an example of the classification determination routine.
FIG. 7 is an explanatory diagram showing the learning states (individuality) of trained models 35 having the same numerical indexes.
FIG. 8 is an explanatory diagram showing a state in which the model two-dimensional graphs Og, Ng, the discrimination two-dimensional graph Dg, and the selected two-dimensional graph Cg are displayed in the window 62.
FIG. 9 is an explanatory diagram showing the state of the discrimination two-dimensional graph Dg and the selected two-dimensional graph Cg when the computation parameters or the threshold of the classifier 28 are not set appropriately.
Next, the best mode for carrying out the present invention will be described using an embodiment.
As shown in FIG. 1, the computer 1 that functions as the discriminating device according to the present embodiment is configured as a microprocessor system centered on a CPU 2, and includes a ROM 4 that stores various processing programs, a RAM 6 that temporarily stores data, a GPU 8 that performs the computation and matrix operations required for image processing, a hard disk (HDD) 10 serving as a large-capacity memory that stores various data including application programs (simply called applications) and image data, and an input/output interface (I/F) 12 for exchanging data with external devices such as a camera 70. In the following, for convenience of explanation, the computer 1 as the discriminating device according to the present embodiment will be described as a device that discriminates whether the discrimination target is an OK product or an NG product. The computer 1 is an example of an implementation configuration corresponding to the "discriminating device" in the present invention.

The computer 1 also includes an input device 14 such as a keyboard and mouse with which a user inputs various commands, and a display 60 that displays various information. The CPU 2, ROM 4, RAM 6, GPU 8, HDD 10, I/F 12, input device 14, display 60, and so on are electrically connected by a bus 80 so that various control signals and data can be exchanged among them.

The computer 1 has a function of executing an operation corresponding to an input operation when the user operates the cursor or the like displayed on the display 60 via the input device 14. The computer 1 functions as the discriminating device according to the present embodiment when various processes are executed by an application stored in the HDD 10, specifically a discrimination application that executes discrimination processing of the discrimination target. In the present embodiment the discriminating device is realized by the computer 1, but it may also be realized as a dedicated device.

Further, as shown in FIG. 2, in the computer 1, the above-mentioned hardware resources such as the CPU 2, ROM 4, RAM 6, GPU 8, HDD 10, I/F 12, input device 14, and display 60 cooperate with software such as the discrimination application and the visualization program according to the present embodiment, by one or both of them, to configure an image display control unit 20, an image acquisition unit 22, an area designation unit 24, a feature extractor 26, a feature converter 27, a classifier 28, a distance calculator 29, a two-dimensional graphing data generation unit 30, a storage unit 33, and the like as functional blocks. In other words, each of these units (the image display control unit 20, image acquisition unit 22, area designation unit 24, feature extractor 26, feature converter 27, classifier 28, distance calculator 29, two-dimensional graphing data generation unit 30, and storage unit 33) can be said to be a function realized by the components shown in FIG. 1 (the CPU 2, ROM 4, RAM 6, GPU 8, HDD 10, I/F 12, input device 14, display 60, and so on) operating alone or in cooperation in response to commands from the CPU 2 executing the application loaded from the HDD 10 into the RAM 6. These units are electrically connected by a bus line 82 such as an address bus or a data bus. The feature extractor 26, the feature converter 27, and the display 60 (a window 62 described later) are an example of an implementation configuration corresponding to the "visualization device" in the present invention.
 When the discrimination application is started, the image display control unit 20 displays a predetermined window 62 on a screen 61 of the display 60, as shown in FIG. 1. The image display control unit 20 also displays, in the window 62, a sample image with a teacher signal based on sample image data 34a with a teacher signal (see FIG. 2) selected by the user operating the cursor or the like on the display 60 via the input device 14, a discrimination target image based on discrimination target image data 34b (see FIG. 2) acquired by the image acquisition unit 22, and a designated area image based on designated area image data 34c (see FIG. 2) acquired by the area designation unit 24.
 Further, the image display control unit 20 displays, in the window 62, a two-dimensional graph based on model two-dimensional graphing data 36a (see FIG. 2) stored in the storage unit 33, and a discrimination two-dimensional graph based on discrimination two-dimensional graphing data 36b generated by the two-dimensional graphing data generation unit 30. The image display control unit 20 also displays the discrimination result produced by the classifier 28 in the window 62. A sample image with a teacher signal is an image used for training the feature extractor 26 and the feature converter 27. The discrimination target image typically corresponds to an image of a target whose classification (OK product or NG product) is unknown, but it also suitably includes an image of a target whose classification (OK product or NG product) is known and is used to verify the training results of the feature extractor 26 and the feature converter 27 or to improve their accuracy, for example a sample image with a teacher signal arbitrarily selected by the user from among a plurality of such sample images.
 The image acquisition unit 22 acquires the sample image data 34a with teacher signals and the discrimination target image data 34b captured by a camera 70 (see FIG. 1), and supplies the acquired image data to the image display control unit 20, the feature extractor 26, the storage unit 33, and the like.
 When an arbitrary area of the sample image with a teacher signal or of the discrimination target image displayed in the window 62 is clicked or dragged through an input operation via the input device 14, the area designation unit 24 acquires the image data of that area (hereinafter referred to as "designated area image data") 34c, and supplies the acquired designated area image data 34c (see FIG. 2) to the image display control unit 20, the feature extractor 26, the storage unit 33, and the like.
 The feature extractor 26 extracts first multidimensional feature vectors 90a, 90b, 90c from the plurality of sample image data 34a with teacher signals, the discrimination target image data 34b, and the designated area image data 34c, and supplies the extracted first multidimensional feature vectors 90a, 90b, 90c to the feature converter 27. In the present embodiment, the feature extractor 26 uses a convolutional neural network (CNN), as shown in FIGS. 3 and 4. The first multidimensional feature vectors 90a, 90b, 90c are each an example of an implementation configuration corresponding to the "first multidimensional feature vector" in the present invention.
 As shown in FIG. 3, the feature extractor 26 extracts the first multidimensional feature vectors 90a, 90b, 90c by applying a so-called convolution process, which uses predetermined filters (not shown) to extract features from the image data 34a, 34b, 34c without losing those features, a plurality of times to the sample image data 34a with teacher signals, the discrimination target image data 34b, and the designated area image data 34c, and then flattening the result into a column vector.
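 The convolve-then-flatten extraction described above can be illustrated with a minimal Python/PyTorch sketch. This is not the patented implementation: the layer count, channel widths, input size, and the class name FeatureExtractor are illustrative assumptions.

```python
import torch
import torch.nn as nn

class FeatureExtractor(nn.Module):
    """Stacked convolutions followed by flattening into a column vector."""
    def __init__(self, in_channels: int = 3):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, height, width)
        h = self.conv(x)
        # Flatten each stack of feature maps into one first
        # multidimensional feature vector per image.
        return torch.flatten(h, start_dim=1)

# Example: a 64x64 RGB image yields a 32*16*16 = 8192-dimensional vector.
v1 = FeatureExtractor()(torch.randn(1, 3, 64, 64))
print(v1.shape)  # torch.Size([1, 8192])
```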
 As shown in FIG. 4, the feature converter 27 has a plurality of fully connected layers 27a, 27b, and uses these fully connected layers 27a, 27b to reduce the dimensionality of the first multidimensional feature vectors 90a, 90b, 90c extracted by the feature extractor 26. Specifically, a fully connected process converts the first multidimensional feature vectors 90a, 90b, 90c into second multidimensional feature vectors 92a, 92b, 92c of lower dimension than the first multidimensional feature vectors 90a, 90b, 90c, and these are supplied to the classifier 28, the distance calculator 29, the two-dimensional graphing data generation unit 30, and the storage unit 33. Instead of the fully connected process, a pooling process such as global max pooling or global average pooling may be used to convert the first multidimensional feature vectors 90a, 90b, 90c into the lower-dimensional second multidimensional feature vectors 92a, 92b, 92c. The second multidimensional feature vectors 92a, 92b, 92c are each stored in the storage unit 33, and are each an example of an implementation configuration corresponding to the "second multidimensional feature vector" in the present invention.
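 A corresponding sketch of the feature converter follows, again under assumed dimensions: an 8192-dimensional first feature vector reduced to 32 dimensions through two fully connected layers standing in for layers 27a and 27b. The sizes and class name are assumptions for illustration only.

```python
import torch
import torch.nn as nn

class FeatureConverter(nn.Module):
    """Two fully connected layers reducing the first multidimensional
    feature vector to a lower-dimensional second one."""
    def __init__(self, in_dim: int = 8192, hidden_dim: int = 256, out_dim: int = 32):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, hidden_dim)   # stands in for layer 27a
        self.fc2 = nn.Linear(hidden_dim, out_dim)  # stands in for layer 27b
        self.act = nn.ReLU()

    def forward(self, v1: torch.Tensor) -> torch.Tensor:
        return self.fc2(self.act(self.fc1(v1)))

v2 = FeatureConverter()(torch.randn(1, 8192))
print(v2.shape)  # torch.Size([1, 32]) -- the second multidimensional feature vector
```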
 The feature extractor 26 and the feature converter 27 are trained in advance so that first multidimensional feature vectors 90a, 90b, 90c and second multidimensional feature vectors 92a, 92b, 92c that allow the classification of the discrimination target to be determined accurately can be obtained, and the result is stored in the storage unit 33 as a trained model 35.
 As shown in FIG. 4(b), the classifier 28 functions when determining the classification of the discrimination target. Based on the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27, it determines the classification of the discrimination target, that is, whether the discrimination target is an OK product or an NG product, and supplies the result to the image display control unit 20. Specifically, using computation parameters machine-learned with the second multidimensional feature vectors 92a, 92b, 92c, the classifier computes a one-dimensional numerical value indicating the confidence that the second multidimensional feature vector 92a, 92b, 92c of the discrimination target represents an OK product; if the value is equal to or greater than a set threshold, the target is classified as an OK product, and if it is below the threshold, as an NG product. In the present embodiment, the classifier 28 uses so-called statistical machine learning such as gradient boosting, support vector machines, random forests, neural networks, Gaussian normalization, or ensemble techniques.
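 As an illustration of this confidence-and-threshold classification, a minimal scikit-learn sketch follows. The choice of gradient boosting, the dummy training data, and the threshold value 0.7 are assumptions for illustration, not values from the specification.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

# v2_train: (n_samples, 32) second multidimensional feature vectors (dummy),
# y_train:  1 for OK products, 0 for NG products (dummy labels).
v2_train = np.random.randn(100, 32)
y_train = np.random.randint(0, 2, size=100)

clf = GradientBoostingClassifier().fit(v2_train, y_train)

THRESHOLD = 0.7  # assumed confidence threshold, tuned per application

def classify(v2: np.ndarray) -> str:
    # One-dimensional confidence that the target is an OK product.
    confidence = clf.predict_proba(v2.reshape(1, -1))[0, 1]
    return "OK" if confidence >= THRESHOLD else "NG"

print(classify(np.random.randn(32)))
```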
 The distance calculator 29 functions when the trained model 35 is generated and when the trained model 35 is verified or its accuracy is improved. When generating the trained model 35, the distance calculator 29 computes the distances between the second multidimensional feature vectors 92a of the plurality of sample image data 34a with teacher signals, and stores the computed distances in the storage unit 33 as distance data 38. The computed distances are fed back to the feature extractor 26 and the feature converter 27 (see FIG. 4(a)). On the other hand, when verifying the trained model 35 or improving its accuracy, the distance calculator 29 computes the distances between the second multidimensional feature vectors 92a of the plurality of sample image data 34a with teacher signals and the second multidimensional feature vectors 92b, 92c of the discrimination target image data 34b and the designated area image data 34c, and stores the computed distances in the storage unit 33 as distance data 38.
 In the present embodiment, the feedback to the feature extractor 26 and the feature converter 27 is performed by modifying the parameters of the feature extractor 26 and the feature converter 27 using the error backpropagation method and the gradient descent method with a loss function such as the triplet loss, so that the distances between the second multidimensional feature vectors 92a of OK products, and the distances between the second multidimensional feature vectors 92a of OK products and those of NG products, are relatively optimized. This parameter modification (training of the feature extractor 26 and the feature converter 27) is carried out, while visualizing the second multidimensional feature vectors 92a (as two-dimensional graphs) to grasp the training state (individuality) of the trained model 35 (the feature extractor 26 and the feature converter 27), until that training state (individuality) reaches a desired condition. The trained model 35 is thus generated and stored in the storage unit 33. In the present embodiment, training the trained model 35 and training the feature extractor 26 and the feature converter 27 are synonymous.
 Here, in the present embodiment, deep metric learning is used to modify the parameters of the feature extractor 26 and the feature converter 27. That is, deep metric learning is applied to the series of processes in which the feature extractor 26 extracts the first multidimensional feature vectors 90a, the feature converter 27 converts the extracted first multidimensional feature vectors 90a into the second multidimensional feature vectors 92a, the distance calculator 29 computes the distances between the converted second multidimensional feature vectors 92a, and the parameters of the feature extractor 26 and the feature converter 27 are modified by the error backpropagation method and the gradient descent method based on the computed distances.
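 This feedback loop can be sketched as follows, reusing the FeatureExtractor and FeatureConverter classes from the earlier sketches. The triplet construction (OK products as anchor/positive, NG products as negative), the margin, and the optimizer settings are illustrative assumptions.

```python
import torch

extractor = FeatureExtractor()   # defined in the sketches above
converter = FeatureConverter()
optimizer = torch.optim.SGD(
    list(extractor.parameters()) + list(converter.parameters()), lr=1e-3)
triplet_loss = torch.nn.TripletMarginLoss(margin=1.0)

def embed(images: torch.Tensor) -> torch.Tensor:
    # First feature vector, then second (lower-dimensional) feature vector.
    return converter(extractor(images))

# anchor/positive: OK-product sample images; negative: NG-product sample images
# (dummy batches here).
anchor, positive, negative = (torch.randn(8, 3, 64, 64) for _ in range(3))

for _ in range(10):  # training iterations
    loss = triplet_loss(embed(anchor), embed(positive), embed(negative))
    optimizer.zero_grad()
    loss.backward()   # error backpropagation
    optimizer.step()  # gradient descent update
```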
 Based on the second multidimensional feature vectors 92a of the plurality of sample image data 34a with teacher signals, the two-dimensional graphing data generation unit 30 generates the model two-dimensional graphing data 36a, and based on the second multidimensional feature vectors 92b, 92c of the discrimination target image data 34b and the designated area image data 34c, it generates the discrimination two-dimensional graphing data 36b.
 Specifically, the model two-dimensional graphing data 36a and the discrimination two-dimensional graphing data 36b are generated by treating each second multidimensional feature vector 92a, 92b, 92c as a set of as many feature values as it has dimensions. For example, when the second multidimensional feature vectors 92a, 92b, 92c are n-dimensional column vectors (f1, f2, f3, ..., fn-1, fn), data are generated from the set of the n feature values f1, f2, f3, ..., fn-1, fn that can display, as shown in FIG. 5, a two-dimensional graph in which these feature values f1, f2, f3, ..., fn-1, fn are plotted in a coordinate system (orthogonal coordinate system) whose vertical axis carries the element numbers 1, 2, 3, ..., n-1, n and whose horizontal axis carries the feature values f1, f2, f3, ..., fn-1, fn, connected by a curve or straight line.
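 A minimal matplotlib sketch of such a graph follows, with element numbers on the vertical axis and feature values on the horizontal axis as described; the vector length n = 32 and the dummy values are assumptions.

```python
import numpy as np
import matplotlib.pyplot as plt

# v2: one second multidimensional feature vector (here n = 32, dummy values).
v2 = np.random.randn(32)
indices = np.arange(1, len(v2) + 1)

# Feature value on the horizontal axis, element number on the vertical
# axis, points connected by a line, as in the graph of FIG. 5.
plt.plot(v2, indices, marker="o")
plt.xlabel("feature value f1 ... fn")
plt.ylabel("element number 1 ... n")
plt.show()
```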
 When the user selects any one sample image data 34a with a teacher signal from among the plurality of sample image data 34a with teacher signals stored in the storage unit 33, for the purpose of verifying the training results of the trained model 35 (the feature extractor 26 and the feature converter 27) or improving training accuracy, the two-dimensional graphing data generation unit 30 extracts, from the model two-dimensional graphing data 36a, the model two-dimensional graphing data 36a generated based on the second multidimensional feature vector 92a (hereinafter, "arbitrary second multidimensional feature vector 92a'") of that selected sample image data 34a (hereinafter, "arbitrary sample image data 34a'"); this extracted data is hereinafter referred to as "selected two-dimensional graphing data 36a'".
 The storage unit 33 is secured in at least one of the RAM 6 and the HDD 10, and stores the sample image data 34a with teacher signals and the discrimination target image data 34b captured by the camera 70 (see FIG. 1), the designated area image data 34c acquired by the area designation unit 24, the trained model 35, the model two-dimensional graphing data 36a and the discrimination two-dimensional graphing data 36b generated by the two-dimensional graphing data generation unit 30, the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27, and the distance data 38 computed by the distance calculator 29.
 Next, the operation of the computer 1 as the discriminating device according to the present embodiment, in particular its operation when determining the classification of a discrimination target, will be described. FIG. 6 is a main flowchart showing an example of the classification discrimination routine.
 When the classification discrimination routine is executed, the feature extractor 26 using the trained model 35 (the trained feature extractor 26) first reads the discrimination target image data 34b, the arbitrary sample image data 34a', or the designated area image data 34c (step S10), and executes a process of extracting the first multidimensional feature vectors 90a, 90b, 90c from the read discrimination target image data 34b, arbitrary sample image data 34a', or designated area image data 34c (step S12).
 Subsequently, the feature converter 27 using the trained model 35 (the trained feature converter 27) executes a process of converting the first multidimensional feature vectors 90a, 90b, 90c extracted by the feature extractor 26 into the second multidimensional feature vectors 92a, 92b, 92c (step S14). Then, using the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27, the classifier 28 executes a process of classifying the discrimination target (not only images of targets whose classification is unknown and must be determined, but also arbitrary sample images and images of arbitrary areas of those images designated by the user (designated area images)) (step S16), outputs the discrimination result (OK product or NG product) according to that classification (step S18), and terminates the routine.
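 Pulled together, steps S10 through S18 can be sketched as one function, reusing extractor, converter, and classify from the earlier sketches; all of these names and shapes are illustrative assumptions rather than the patented implementation.

```python
import torch

def classification_routine(image: torch.Tensor) -> str:
    """One pass of the routine of FIG. 6 (steps S10-S18)."""
    with torch.no_grad():
        v1 = extractor(image.unsqueeze(0))  # step S12: first feature vector
        v2 = converter(v1)                  # step S14: second feature vector
    return classify(v2.squeeze(0).numpy())  # steps S16-S18: classify, output OK/NG

print(classification_routine(torch.randn(3, 64, 64)))
```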
 Here, because the first multidimensional feature vectors 90a, 90b, 90c are converted into the second multidimensional feature vectors 92a, 92b, 92c by the feature converter 27 trained in advance using deep metric learning, the dimensionality can be reduced without losing features that are effective for classifying the discrimination target.
 In the present embodiment, the training of the trained model 35 (the feature extractor 26 and the feature converter 27) using deep metric learning is carried out while visually confirming the training state (individuality) of the trained model 35 (the feature extractor 26 and the feature converter 27) through visualization, until that training state (individuality) reaches a desired condition, so that the first multidimensional feature vectors can be appropriately converted into second multidimensional feature vectors 92a, 92b, 92c that are effective for classifying the discrimination target.
 FIG. 7(a) is an explanatory diagram showing a state in which the trained model 35 has an appropriate training state (individuality), and FIG. 7(b) is an explanatory diagram showing a state in which the training state (individuality) of the trained model 35 is not appropriate. In the figures, the symbol "Og" denotes a graph generated by the two-dimensional graphing data generation unit 30 based on the second multidimensional feature vectors 92a of the plurality of sample image data 34a with teacher signals of OK products, and the symbol "Ng" denotes a graph generated by the two-dimensional graphing data generation unit 30 based on the second multidimensional feature vectors 92a of the plurality of sample image data 34a with teacher signals of NG products. FIGS. 7(a) and 7(b) visualize trained models 35 for which the one-dimensional numerical indices generally used to evaluate the training state (individuality) of a trained model 35, such as accuracy, precision, and recall, are all the same or close to one another.
 As shown in FIG. 7, when the trained model 35 has an appropriate training state (individuality), the model two-dimensional graph Og and the model two-dimensional graph Ng are displayed clearly separated from each other and the boundary between them is distinct; by using the feature converter 27 based on a trained model 35 with such an appropriate training state (individuality), second multidimensional feature vectors 92a, 92b, 92c that are effective for classifying the discrimination target can be appropriately obtained.
 On the other hand, when the training state (individuality) of the trained model 35 is not appropriate, the model two-dimensional graph Og and the model two-dimensional graph Ng are displayed close to each other and the boundary between them becomes unclear; if the feature converter 27 based on a trained model 35 with such an inappropriate training state (individuality) is used, second multidimensional feature vectors 92a, 92b, 92c that are effective for classifying the discrimination target cannot be appropriately obtained.
 By visualizing the training state (individuality) of the trained model 35 in this way, it can be evaluated whether the trained model 35 has a training state (individuality) that allows the classification of the discrimination target to be determined appropriately, and an appropriate trained model 35 can be generated with a smaller amount of sample data than before. Being able to generate an appropriate trained model 35 in turn allows a wide setting range for the threshold suitable for the classifier 28 to classify the discrimination target. This suppresses erroneous discrimination due to disturbances, for example the influence of external light when capturing the image of the discrimination target, so that accurate discrimination results can be obtained stably.
 Moreover, in the present embodiment, as shown in FIG. 8, in addition to the model two-dimensional graphs Og, Ng, the state of the second multidimensional feature vector 92b of the discrimination target and of the second multidimensional feature vector 92c of the area designated by the user is displayed as a discrimination two-dimensional graph Dg, so the discrimination status of the classification of the discrimination target and of the designated area can be visualized. This allows the user to visually confirm the discrimination status of the discrimination target and of any portion of interest (for example, when the discrimination target bears a flaw, whether that flaw is being discriminated as NG), and, depending on that status (for example, when an OK product has been discriminated as NG or, conversely, an NG product has been discriminated as OK), the training of the feature extractor 26 and the feature converter 27, or the training of the classifier 28, can be redone.
 Because the classifier 28 classifies the discrimination target using the second multidimensional feature vectors 92a, 92b, 92c, which carry features effective for such classification, the discrimination target can be classified with high accuracy.
 In the present embodiment, the classifier 28 classifies the discrimination target by computing, with computation parameters machine-learned using the plurality of second multidimensional feature vectors 92a stored in the storage unit 33, a one-dimensional numerical value indicating the confidence that the second multidimensional feature vectors 92b, 92c of the discrimination target represent an OK product, and comparing it with the set threshold.
 Further, in the present embodiment, because the training state (individuality) of the trained model 35 is visualized and the feature extractor 26 and feature converter 27 are configured separately from the classifier 28, when an erroneous determination occurs in the classification of the discrimination target, the cause of the erroneous determination can be identified easily and addressed appropriately.
 For example, as shown in FIG. 9(a), when the shapes of the discrimination two-dimensional graph Dg and the selected two-dimensional graph Cg are far from and different in shape from the model two-dimensional graph Ng and yet the target is judged to be an NG product, or, as shown in FIG. 9(b), when the shapes of the discrimination two-dimensional graph Dg and the selected two-dimensional graph Cg are far from and different in shape from the model two-dimensional graph Og and yet the target is judged to be an OK product, that is, when the features of OK products and NG products are captured but the classification into OK and NG products is wrong, it can be concluded that the machine learning of the computation parameters of the classifier 28 or the threshold setting is not appropriate. In this case, the situation can be addressed by redoing the machine learning of the computation parameters of the classifier 28 or resetting the threshold. This makes it possible to improve the determination accuracy of the determination device easily and quickly, and to improve the reliability of the trained model 35. The discrimination two-dimensional graph Dg is a graph generated by the two-dimensional graphing data generation unit 30 based on the second multidimensional feature vectors 92b, 92c of the discrimination target image data 34b and the designated area image data 34c. The selected two-dimensional graph Cg is a graph generated by the two-dimensional graphing data generation unit 30 based on the selected two-dimensional graphing data 36a'.
 According to the discriminating device of the present embodiment described above, the trained feature extractor 26 extracts the first multidimensional feature vectors 90a, 90b, 90c from the plurality of sample image data 34a with teacher signals, the discrimination target image data 34b, and the designated area image data 34c; the trained feature converter 27 converts the extracted first multidimensional feature vectors 90a, 90b, 90c into second multidimensional feature vectors 92a, 92b, 92c that are lower in dimension than the first multidimensional feature vectors 90a, 90b, 90c and effective for classifying the discrimination target; and the discrimination target is classified based on the converted second multidimensional feature vectors 92a, 92b, 92c, so the discrimination target can be classified with high accuracy.
 Further, according to the discriminating device of the present embodiment, the feature extractor 26 that extracts the first multidimensional feature vectors 90a, 90b, 90c, which are the diverse feature quantities of the discrimination targets (the plurality of sample image data 34a with teacher signals, the discrimination target image data 34b, and the designated area image data 34c), the feature converter 27 that converts the first multidimensional feature vectors 90a, 90b, 90c into the lower-dimensional second multidimensional feature vectors 92a, 92b, 92c, and the classifier 28 that classifies the discrimination target based on the second multidimensional feature vectors 92a, 92b, 92c converted by the feature converter 27 are configured separately, so when improving the classification accuracy of the discrimination target, the training of the feature extractor 26 and the feature converter 27 can be carried out separately from the training of the classifier. That is, when low classification accuracy stems from low feature extraction and conversion accuracy of the feature extractor 26 and the feature converter 27, the feature extractor 26 and the feature converter 27 can be retrained, and when it stems from low classification accuracy of the classifier 28, only the classifier 28 can be retrained, so classification accuracy can be improved efficiently.
 In the present embodiment, the discriminating device has been described as a device that determines whether the discrimination target is an OK product or an NG product, but the invention is not limited to this. For example, the discriminating device may be applied to a device that determines into which of a plurality of classifications the discrimination target falls.
 In the present embodiment, the second multidimensional feature vectors 92a, 92b, 92c are graphed as a two-dimensional graph in which the feature values f1, f2, f3, ..., fn-1, fn are plotted in a coordinate system (orthogonal coordinate system) whose vertical axis carries the element numbers 1, 2, 3, ..., n-1, n and whose horizontal axis carries the feature values f1, f2, f3, ..., fn-1, fn, connected by a curve or straight line, but the graphing is not limited to this.
 In the present embodiment, the input device 14 has been described as a keyboard and a mouse, but it may include a pointing device such as a touch panel, as well as buttons, dials, touch sensors, touch pads, and the like.
 In the present embodiment, the large-capacity memory that stores various data including image data is the HDD 10, but the invention is not limited to this. A flash memory (USB memory, SD card, etc.), an SSD, a floppy disk, a CD, a DVD, or the like may be used as the large-capacity memory for storing various data including image data.
 In the present embodiment, the various applications including the visualization program are stored in the HDD 10, but the invention is not limited to this. For example, the various applications including the visualization program may be distributed from another computer to the computer 1 serving as the discriminating device according to the present embodiment via a transmission medium, for example a communication network such as the Internet or a LAN.
 The present embodiment shows one example of a mode for carrying out the present invention. Therefore, the present invention is not limited to the configuration of the present embodiment.
1      Computer 1 (discriminating device)
2      CPU
4      ROM
6      RAM
8      GPU
10     HDD
12     Input/output interface
14     Input device
20     Image display control unit
22     Image acquisition unit
24     Area designation unit
26     Feature extractor (feature extractor)
27     Feature converter (feature converter)
27a    Fully connected layer (plurality of fully connected layers)
27b    Fully connected layer (plurality of fully connected layers)
28     Classifier (classifier)
29     Distance calculator
30     Two-dimensional graphing data generation unit
33     Storage unit
34a    Sample image data with teacher signal
34b    Discrimination target image data
34c    Designated area image data
35     Trained model (trained model)
36a    Model two-dimensional graphing data
36b    Discrimination two-dimensional graphing data
38     Distance data
60     Display
62     Window
80     Bus
82     Bus line
90a    First multidimensional feature vector (first multidimensional feature vector)
90b    First multidimensional feature vector (first multidimensional feature vector)
90c    First multidimensional feature vector (first multidimensional feature vector)
92a    Second multidimensional feature vector (second multidimensional feature vector)
92a'   Arbitrary second multidimensional feature vector (second multidimensional feature vector)
92b    Second multidimensional feature vector (second multidimensional feature vector)
92c    Second multidimensional feature vector (second multidimensional feature vector)
fn     Feature value (feature value)
n      Element number
Og     Model two-dimensional graph for OK products
Ng     Model two-dimensional graph for NG products
Dg     Discrimination two-dimensional graph
Cg     Selected two-dimensional graph

Claims (4)

  1.  A method for classifying a discrimination target, comprising:
     (a) acquiring data of the discrimination target;
     (b) extracting a first multidimensional feature vector from the data of the discrimination target by a feature extractor using a neural network including deep learning;
     (c) converting the extracted first multidimensional feature vector into a second multidimensional feature vector of lower dimension than the first multidimensional feature vector by a feature converter using a plurality of fully connected layers; and
     (d) classifying the discrimination target by a classifier using statistical machine learning, based on the converted second multidimensional feature vector.
  2.  A program for classifying a discrimination target, the program causing one or more computers to execute each step of the method for classifying a discrimination target according to claim 1.
  3.  A discriminating device comprising:
     a data acquisition unit that acquires data of a discrimination target;
     a feature extractor using a neural network including deep learning that extracts a first multidimensional feature vector from the acquired data of the discrimination target;
     a feature converter using a plurality of fully connected layers that converts the first multidimensional feature vector extracted by the feature extractor into a second multidimensional feature vector of lower dimension than the first multidimensional feature vector; and
     a classifier using statistical machine learning that classifies the discrimination target using the second multidimensional feature vector.
  4.  The discriminating device according to claim 3, further comprising:
     a storage unit that stores second multidimensional feature vectors; and
     a calculation unit that calculates distance values between the plurality of second multidimensional feature vectors stored in the storage unit,
     wherein, in a training stage, the feature extractor and the feature converter learn using an error backpropagation method and a gradient descent method based on the distance values calculated by the calculation unit.

PCT/JP2021/014130 2020-04-17 2021-04-01 Discrimination target classification method, program employed in same, and discriminating device WO2021210413A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2022515303A JPWO2021210413A1 (en) 2020-04-17 2021-04-01

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2020-073809 2020-04-17
JP2020073809 2020-04-17

Publications (1)

Publication Number Publication Date
WO2021210413A1 true WO2021210413A1 (en) 2021-10-21

Family

ID=78084280

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2021/014130 WO2021210413A1 (en) 2020-04-17 2021-04-01 Discrimination target classification method, program employed in same, and discriminating device

Country Status (2)

Country Link
JP (1) JPWO2021210413A1 (en)
WO (1) WO2021210413A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2014203135A (en) * 2013-04-01 2014-10-27 キヤノン株式会社 Signal processor, signal processing method, and signal processing system
WO2018170421A1 (en) * 2017-03-17 2018-09-20 Magic Leap, Inc. Room layout estimation methods and techniques
JP2019507935A (en) * 2016-03-11 2019-03-22 エヌイーシー ラボラトリーズ アメリカ インクNEC Laboratories America, Inc. Deep deformation network for object landmark detection
JP2020060879A (en) * 2018-10-05 2020-04-16 オムロン株式会社 Learning device, image generator, method for learning, and learning program


Also Published As

Publication number Publication date
JPWO2021210413A1 (en) 2021-10-21

Similar Documents

Publication Publication Date Title
Krause et al. Using visual analytics to interpret predictive machine learning models
JP6547275B2 (en) INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM
US20020165839A1 (en) Segmentation and construction of segmentation classifiers
US20200103353A1 (en) Information processing apparatus, information processing method, and storage medium
JP7213701B2 (en) Machining equipment, machining method, machining program, and inspection equipment
JP2020042519A (en) Abnormality detection device, abnormality detection method, and abnormality detection program
JP6950504B2 (en) Abnormal candidate extraction program, abnormal candidate extraction method and abnormal candidate extraction device
Nair et al. A life cycle on processing large dataset-LCPL
JP6602013B2 (en) Information processing apparatus, information processing method, and program
JP2023139296A (en) Signal processing method, signal processing apparatus, and signal processing program
WO2021210413A1 (en) Discrimination target classification method, program employed in same, and discriminating device
JP2020047010A (en) Information estimation device and information estimation method
EP3923228B1 (en) Data analysis method, device and program
WO2021210414A1 (en) Visualization method, program using same, visualization device, and discrimination device provided with same
Hu et al. Shape-driven coordinate ordering for star glyph sets via reinforcement learning
WO2020167156A1 (en) Method for debugging a trained recurrent neural network
Chinnasamy et al. An outlier based bi-level neural network classification system for improved classification of cardiotocogram data
Schwenke et al. Abstracting Local Transformer Attention for Enhancing Interpretability on Time Series Data.
Neshatian et al. Genetic programming for performance improvement and dimensionality reduction of classification problems
JP2018151913A (en) Information processing system, information processing method, and program
Aazi et al. Feature selection for multiclass support vector machines
WO2022172445A1 (en) Information processing device, information processing method and information processing program
KR102236802B1 (en) Device and method for feature extraction of data for diagnostic models
WO2024004127A1 (en) Information processing system
Nanaware et al. Exploratory data analysis using dimension reduction

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21789349

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022515303

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21789349

Country of ref document: EP

Kind code of ref document: A1