CN113658109A

CN113658109A - Glass defect detection method based on field loss prediction active learning

Info

Publication number: CN113658109A
Application number: CN202110831710.9A
Authority: CN
Inventors: 刘贵松; 解修蕊; 张邵楷; 占求港; 黄鹂; 杨新
Original assignee: Southwestern University Of Finance And Economics
Current assignee: Southwestern University Of Finance And Economics
Priority date: 2021-07-22
Filing date: 2021-07-22
Publication date: 2021-11-16

Abstract

The invention discloses a glass defect detection method based on field loss prediction active learning, which defines a loss prediction model, wherein the model can be trained and optimized together with a multi-label classification model, and carries out loss prediction on glass samples in an unlabelled sample pool, the first K glass samples with the largest predicted loss values are taken as high-value samples selected by active learning after sequencing, and the high-value samples are added into a labeled sample set after labeling for training the multi-label classification model so as to identify the defect types in glass to be detected. By the method, the classification accuracy of the model under a small multi-label image sample size can be effectively improved, so that the model effect can be expected by using a small data size, and the sample labeling cost is reduced.

Description

Glass defect detection method based on field loss prediction active learning

Technical Field

The invention belongs to the technical field of convolutional neural networks, and relates to a glass defect detection method based on field loss prediction active learning.

Background

The glass has important significance for urbanization and modern industry, and for the field of glass defect detection, a large amount of glass defect detection work is carried out every day, and most of the glass defect detection work is carried out by manual screening. However, manual identification of glass defects is costly and inefficient. Professional glass defect detecting instruments are expensive, occupied space of equipment is large, and the glass defect detecting instruments are not easy to popularize in common enterprises. There is a strong need for a low cost glass defect detection scheme for these enterprises. The glass may usually have a plurality of different defects, so that the glass belongs to the identification of glass images and belongs to the multi-label classification problem. Correctly identifying and classifying the categories of the defects has important significance for reducing the cost of a factory and improving the efficiency.

Multi-label image classification differs from single-label classification in that the samples to be classified may belong to two or more classes simultaneously, and there is a more complex relationship between the classes. The multi-label classification algorithm is mainly divided into two types: one is a Problem transformation method (Problem transformation methods); the other is an Algorithm conversion method (Algorithm adaptation methods). The basic idea of the problem transformation method is to transform the multi-label classification problem into a simple two-classification problem and then solve the multi-label problem using known methods. The algorithm conversion method is generally to optimize the traditional single-label classification algorithm so as to adapt to the multi-label classification data set. Some classical multi-label learning algorithms are: probabilistic model-based methods, decision tree-based methods, support vector machine-based methods, neural network-based methods, KNN-based methods.

The core idea is to find the training sample with the most value through some heuristic strategies, so that the model can achieve or even exceed the expected effect by using as few labeled samples as possible. Therefore, the method can be well used in the field of glass defect detection. The concept of active learning was proposed by Simon in 1974. Subsequently, the active learning method is infinite in many fields, and is further generalized into classical scenarios such as a generative member Query (Membership Query Synthesis), a streaming-Based Selective learning method (Stream-Based Sampling), and an active learning method Based on an unlabeled sample Pool (Pool-Based Sampling). In comparison, the active learning method based on the unlabeled sample pool forms a large number of unlabeled samples into the unlabeled sample pool, and the samples with the most value are screened from the unlabeled sample pool by designing a sample screening strategy and are preferentially labeled. In addition, with the heat of the internet and the continuous promotion of data acquisition technology, a large amount of unlabelled data can be acquired in many fields at low cost. Therefore, active learning methods based on unlabeled sample pools are most popular and widely used in different fields, and are very important in machine learning and data mining applications.

Disclosure of Invention

The invention aims to: the method is characterized in that a loss prediction active learning model is provided, the most effective image of the classification model is selected to be labeled and used for model training, so that the model can better identify the defects of which types exist in the glass image with the lowest labeling cost, and the expected effect is achieved.

The technical scheme adopted by the invention is as follows:

a glass defect detection method based on field loss prediction active learning comprises the following steps: inputting unmarked glass images to be processed, randomly selecting partial images, and submitting the partial images to an expert for marking to serve as an initial marking data set;

step 2: constructing a multi-label classification model and a loss prediction model;

and step 3: training a multi-label classification model and a loss prediction model by using a current labeling data set;

and 4, step 4: selecting samples from all glass samples which are not marked by using a loss prediction model, and adding a marked data set after marking;

and 5: and (5) repeating the step (3) and the step (4) until the budget is exhausted or the expected effect is achieved, and obtaining the trained multi-label classification model.

Further, the multi-label classification model in the step 2 is composed of an image feature learning model and a graph convolution neural network model;

wherein the multi-label classification loss function is as follows:

wherein C represents the number of categories, sigma represents a sigmoid function,

represents the prediction score under category c;

the loss prediction model in the step 2 is composed of a plurality of loss prediction modules, each loss prediction module comprises a whole local average pooling layer and a whole connection layer, the input of the loss prediction module is the characteristics of a middle layer of the multi-label classification model, and the characteristics of each loss prediction module are connected and then subjected to dot product operation with the output of the graph convolution neural network in the multi-label classification model through the whole connection layer to obtain the predicted loss;

wherein the loss function for the predicted loss is as follows:

wherein,

ξ is a predefined boundary;

the final loss function is defined as: loss ═ L_target+λ·L_loss(ii) a Where λ is a weight parameter.

Further, based on the current labeled data set, a back propagation algorithm is used for training the multi-label classification model and the loss prediction model in the step 2.

Further, based on the trained loss prediction model obtained in the step 3, performing loss prediction and sequencing on the glass samples in the unlabeled sample pool, and taking the top K samples as important samples selected by active learning; after labeling, add to the labeled sample set.

Further, repeating the step 3 and the step 4 until the budget is exhausted or the expected effect is achieved, and obtaining the trained multi-label image classification model.

In summary, due to the adoption of the technical scheme, the invention has the beneficial effects that:

1. the labeling of the multi-label glass image is complex, the requirement on labeling experts is high, and the labeling cost of the dimensional sample is high. Through experimental tests of a plurality of data sets by the embodiment of the invention, under the condition of the same sample amount, the classification accuracy of the model trained by using the selected sample is higher than that of the model trained by randomly selecting the sample; meanwhile, the invention can achieve more than 80% of the accuracy of the training models of all the training sets by using the sample size which is 50% lower than that of all the training sets. The method has the advantages that the method is remarkably improved in the field of active learning, and high classification accuracy is achieved on the premise of reducing the labeling cost.

2. In the existing multi-label active learning research, a query strategy mostly selects one sample at a time. The active learning method provided by the invention is batch mode query, selects a batch of valuable active learning samples by one-time query, can greatly improve the active learning query efficiency, and is suitable for a multi-label classification model based on deep learning.

3. The method enables the multi-label active learning method to be practically applied to multi-label glass image classification scenes, and expands the application scenes of the multi-label active learning method.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are required to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and that for those skilled in the art, other relevant drawings can be obtained according to the drawings without inventive effort, wherein:

FIG. 1 is a schematic view of a glass image provided in example 1 of the present invention;

fig. 2 is a schematic flow chart of an active learning method for loss prediction according to embodiment 1 of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It is to be understood that the specific embodiments described herein are for purposes of illustration only and are not to be construed as limiting the invention, i.e., the described embodiments are merely a subset of the embodiments of the invention and not all embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.

Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the invention without making any creative effort, fall within the protection scope of the invention.

It is noted that relational terms such as "first" and "second," and the like, may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.

The features and properties of the present invention are described in further detail below with reference to examples.

Example one

Referring to fig. 2, the method of embodiment 1 of the present invention includes the following specific steps:

s101: randomly selecting a certain number of images from 3500 training sets of glass image data sets, submitting the images to an expert for marking to serve as an initial marking data set, wherein the images are shown in FIG. 1: (ii) a

S102: constructing a multi-label classification model and a loss prediction model; the multi-label classification model consists of an image feature learning model and a graph convolution neural network model. The image feature learning model uses ResNet-101 here, and the atlas neural network model consists of 2-layer stacked GCNs. The multi-label classification loss function used is as follows:

wherein C represents the number of categories, σ represents the Sigmoid function,

indicates the prediction score under category c.

Defining a loss prediction model, as shown in fig. 2, wherein the loss prediction model is composed of a plurality of loss prediction modules, each loss prediction module comprises a global pooling layer and a full-link layer, the input of the loss prediction module is the characteristics of the middle layer of the multi-label classification model, and the characteristics of each loss prediction module are connected and then subjected to dot product operation with the output of the graph convolution neural network in the multi-label classification model through the full-link layer to obtain the predicted loss. The loss function for the predicted loss is as follows:

wherein,

ξ is a predefined boundary;

defining the final joint loss function as: loss ═ L_target+λ·L_loss(ii) a Wherein λ is a weight parameter set to 0.25;

s103: training a multi-label classification model and a loss prediction model defined by S102 based on the current label data set for 30 times;

s104: using a loss prediction model to predict and sort the loss of the samples in the unlabeled sample pool, and taking the top K samples as important glass samples selected by active learning; after labeling, adding a labeled sample set;

s105: and repeating S103 and S104 until the budget is exhausted or the expected effect is achieved, and obtaining the trained multi-label classification model.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents and improvements made by those skilled in the art within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. A glass defect detection method based on field loss prediction active learning is characterized by comprising the following steps:

step 1: inputting an unmarked glass image to be processed, randomly selecting a partial image, submitting the partial image to an expert for marking, and taking the partial image as an initial marking data set;

2. The method for detecting glass defects based on active learning of domain loss prediction according to claim 1, wherein the method comprises the following steps: the multi-label classification model in the step 2 consists of an image feature learning model and a graph convolution neural network model;

wherein the multi-label classification loss function is as follows:

represents the prediction score under category c;

the loss prediction model in the step 2 is composed of a plurality of loss prediction modules, each loss prediction module comprises a global average pooling layer and a full-connection layer, the input of the loss prediction module is the characteristics of a middle layer of the multi-label classification model, and the characteristics of each loss prediction module are connected and then subjected to dot product operation with the output of the graph convolution neural network in the multi-label classification model through the full-connection layer to obtain the predicted loss;

wherein the loss function for the predicted loss is as follows:

wherein,

ξ is a predefined boundary;

3. The method for detecting glass defects based on active learning of domain loss prediction according to claim 1, wherein the method comprises the following steps: and (3) training the multi-label classification model and the loss prediction model in the step (2) by using a back propagation algorithm based on the current labeled data set.

4. The method for detecting glass defects based on active learning of domain loss prediction according to claim 1, wherein the method comprises the following steps: based on the trained loss prediction model obtained in the step 3, performing loss prediction and sequencing on glass samples in the unlabeled sample pool, and taking the top K samples as important samples selected by active learning; after labeling, add to the labeled sample set.

5. The method for detecting glass defects based on active learning of domain loss prediction according to claim 1, wherein the method comprises the following steps: and (5) repeating the step (3) and the step (4) until the budget is exhausted or the expected effect is achieved, and obtaining the trained multi-label image classification model.