CN114663392A

CN114663392A - Knowledge distillation-based industrial image defect detection method

Info

Publication number: CN114663392A
Application number: CN202210285918.XA
Authority: CN
Inventors: 雷可; 杨俊杰; 郑军
Original assignee: Matrixtime Robotics Shanghai Co ltd
Current assignee: Matrixtime Robotics Shanghai Co ltd
Priority date: 2022-03-22
Filing date: 2022-03-22
Publication date: 2022-06-24

Abstract

The invention relates to the technical field of image processing, in particular to a knowledge distillation-based industrial image defect detection method, which trains a teacher network and a student network by using a normal image set, takes the difference between characteristic graphs generated by the teacher network and the student network as a loss function, only updates the parameters of the student network, stops training under the condition that the loss does not decrease any more, and obtains the trained student network; inputting an image to be detected to a teacher network and a student network, recording the difference between feature maps of different scales of the teacher network and the student network as an abnormal score standard, sampling the feature maps to the size of the input image, multiplying corresponding positions to obtain the final abnormal score of each pixel point on the image, setting a threshold value, and determining an abnormal area if the threshold value is larger than the threshold value. According to the invention, the mode of extracting OK image features from the student network to the learning teacher network is adopted, and when NG images are encountered, the difference between the student network and the teacher network is highlighted, so that the purpose of detecting the NG images is achieved.

Description

Knowledge distillation-based industrial image defect detection method

Technical Field

The invention relates to the technical field of image processing, in particular to an industrial image defect detection method based on knowledge distillation.

Background

With the rapid development of deep learning in recent years, more and more deep learning algorithms are applied to various industries. The knowledge distillation algorithm in the deep learning algorithm has important research and application values for application in industrial image defect detection.

The method comprises the following steps of 1, most of detection pictures are OK samples, the proportion of the defect pictures in the total detection pictures is small, the requirement that a supervision algorithm needs to label defects in advance is not very favorable, 2, the occurrence positions and the forms of the defects on the industrial images are random, the types of the defects are difficult to define, 3, the number difference among the types of the defects on the industrial images is large, 4, the defects on the industrial images are unknown, and a plurality of defects can appear along with the aging of equipment.

For the current situation and problems of the defect detection of the current industrial image summarized above, the traditional image algorithm and the supervised deep learning algorithm have certain limitations on the degree of solution of the scene, and the final detection effect is generally difficult to achieve an ideal effect. Therefore, an industrial image defect detection method based on knowledge distillation is provided.

Disclosure of Invention

Based on the technical problems in the background art, the invention provides the industrial image defect detection method based on knowledge distillation, which does not need to screen and label a data set in a supervision algorithm, reduces a large amount of manual workload, and solves the problem that the traditional image algorithm and the supervision deep learning algorithm have certain limitation on the solution degree of industrial image defect detection.

The invention provides the following technical scheme: a knowledge distillation-based industrial image defect detection method comprises the following steps:

s1, firstly, a normal image set I containing no defect is given₁，I₂，...，I_nPreprocessing each picture in the input image set; constructing teacher network and student network, inputting image I in training process_kThe teacher network and the student network respectively output three feature maps with different scales, the difference between the teacher network feature map and the student network feature maps is used as a loss function, the SGD method is used in the training process, the parameters of the student network are updated, the parameters of the teacher network remain unchanged, the training is stopped under the condition that the loss does not decrease any more, and the trained student network is obtained;

s2, preprocessing the input image to be detected, simultaneously inputting the preprocessed image into a teacher network and a student network to respectively obtain three feature graphs of different scales, and recording the difference between the feature graphs of different scales of the teacher network and the feature graphs of different scales of the student network as an abnormal score standard:

the final anomaly score is calculated as follows:

the up-sampling mode adopts bilinear interpolation, each characteristic image is up-sampled to the size of an input image, the more abnormal part on the image to be detected is, the higher the score is, and the image area larger than the thresh part is regarded as the abnormal area on the detection image by setting a threshold thresh.

Preferably, I_k∈R^{wxh xc}W is the input image width, h is the input image height, and c is the input image channel number;

the teacher network outputs three characteristic graphs with different scales as

The student network outputs three characteristic graphs with different scales as

Preferably, the loss function is defined as follows:

i.e. the sum of the distances at the corresponding locations (i, j) for each feature scale.

Preferably, the preprocessing is histogram equalization processing for eliminating the influence of image gray level non-uniformity caused in the shooting process.

Preferably, the teacher network is a deep convolutional network pre-trained on imagenet, the student network and the teacher network have the same structure, and the parameters of the student network do not use the parameters in the pre-trained network, but are initialized randomly.

The invention provides an industrial image defect detection method based on knowledge distillation, which is used for detecting industrial defects by using a knowledge distillation method in deep learning, does not need to screen and label a data set in a supervision algorithm, and reduces a large amount of manual workload. By designing a knowledge distillation network and a related loss function, the student network is trained to learn normal image feature extraction in the teacher network. In the inference process, the difference between the two networks is compared to be used as the abnormal score of each pixel point on the current image, and the abnormal area can be determined by setting a threshold value.

Drawings

FIG. 1 is a flow chart of the training process of the present invention;

FIG. 2 is a flow chart of the inference process of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The invention provides a technical scheme that: a knowledge distillation-based industrial image defect detection method is divided into two parts, namely a training part and an inference part.

As shown in FIG. 1, the training part is mainly divided into the following steps, firstly, a normal image set I without defects is given₁，I₂，...，I_nAnd preprocessing each picture in the input image set, mainly histogram equalization, so as to eliminate the influence of uneven image gray scale caused in the shooting process.

The second step is to build a teacher network, which is a deep convolutional network pre-trained on imagenet, typically resnet 50.

And thirdly, constructing a student network, wherein the student network and the teacher network have the same structure, but the parameters of the student network do not use the parameters in the pre-training network, but are randomly initialized.

The fourth step is to construct the whole training network and input images I in the training process_kIn which I_k∈R^{wxh xc}W is the width of the input image, h is the height of the input image, c is the number of channels of the input image, and the teacher network outputs three feature maps with different scales

The student network outputs three feature maps with different scales

Using the difference between the teacher network and the student network profiles as a loss function, the loss function is defined as follows

I.e., the sum of the distances of l2 at the corresponding locations (i, j) for each feature scale.

And updating parameters of the student network by using an SGD method in the training process, keeping the parameters of the teacher network unchanged, and stopping training under the condition that the loss is not reduced any more to obtain the trained student network.

As shown in fig. 2, the inference part is the following steps,

and (4) preprocessing the input image to be detected, wherein the specific operation and the training process are consistent.

Inputting the preprocessed image I into a teacher network to obtain three characteristic graphs with different scales

Simultaneously inputting the images into a student network to obtain three characteristic graphs with different scales

The difference between the feature maps of different scales of the teacher network and the student network is recorded as an abnormal score standard:

the final anomaly score is calculated as follows:

wherein the upsampling mode adopts bilinear interpolation. The respective feature maps are up-sampled to the input image size, w, h. The more abnormal places on the image to be detected are, the higher the score is, and by setting a threshold thresh, the image area larger than the thresh is regarded as the abnormal area on the detected image.

The teacher network and the student network of the distillation network in the prior art are generally complex, the student network is simple, the teacher network and the student network in the application are of the same structure, the learning process in the prior art needs to label original data, the method in the application does not need to be implemented only by completely dropping OK images, the central idea of the algorithm in the application is a mode of hopeing the student network to learn the characteristics of the OK images in the teacher network, and when NG images are encountered, the difference between the student network and the teacher network is highlighted, so that the purpose of detecting the NG images is achieved.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art should be considered to be within the technical scope of the present invention, and the technical solutions and the inventive concepts thereof according to the present invention should be equivalent or changed within the scope of the present invention.

Claims

1. A knowledge distillation-based industrial image defect detection method is characterized by comprising the following steps: the method comprises the following steps:

s1, firstly, a training normal image set I containing no defects is given₁，I₂，...，I_nFor each picture in the input image setCarrying out pretreatment; constructing teacher network and student network, inputting image I in training process_kThe teacher network and the student network respectively output three feature maps with different scales, the difference between the teacher network feature map and the student network feature maps is used as a loss function, the SGD method is used in the training process, the parameters of the student network are updated, the parameters of the teacher network remain unchanged, the training is stopped under the condition that the loss does not decrease any more, and the trained student network is obtained;

s2, preprocessing the input image to be detected, simultaneously inputting the preprocessed image into a teacher network and a student network to respectively obtain three feature graphs with different scales, and recording the difference between the feature graphs with different scales of the teacher network and the feature graphs with different scales of the student network as an abnormal score standard:

the final anomaly score is calculated as follows:

2. The knowledge-based distillation industrial image defect detection method according to claim 1, wherein: i is_k∈R^{wxh xc}W is the input image width, h is the input image height, and c is the input image channel number;

3. The knowledge-based distillation industrial image defect detection method according to claim 1, wherein: the loss function is defined as follows:

4. The knowledge-based distillation industrial image defect detection method according to claim 1, wherein: the preprocessing is histogram equalization processing and is used for eliminating the influence of uneven image gray scale caused in the shooting process.

5. The knowledge-based distillation industrial image defect detection method according to claim 1, wherein: the teacher network is a deep convolutional network pre-trained on imagenet, the student network and the teacher network have the same structure, and parameters of the student network do not use parameters in the pre-trained network, but are initialized randomly.