CN112634205A

CN112634205A - Micro-feature detection algorithm based on image

Info

Publication number: CN112634205A
Application number: CN202011412891.3A
Authority: CN
Inventors: 孙长进; 刘立庄; 陈萍; 孔彩凤
Original assignee: Jiangsu Kingen Intelligent Technology Co ltd
Current assignee: Jiangsu Kingen Intelligent Technology Co ltd
Priority date: 2020-12-04
Filing date: 2020-12-04
Publication date: 2021-04-09

Abstract

The invention discloses a micro-feature detection algorithm based on an image, and particularly relates to the technical field of detection algorithms, wherein the technical scheme is as follows: s1, inputting a picture, obtaining a three-layer feature pyramid after a feature extraction network consisting of depth separable convolution, and obtaining a prediction result of the input picture, namely the offset, the category result and the confidence of each picture position frame after each layer of feature graph passes one layer of convolution layer; s2, obtaining the actual position of each prediction frame through the position offset; obtaining category information through logistic regression; s3, the confidence coefficient is the product of the probability of the object existing in the prediction frame and the intersection ratio of the real frame and the prediction frame, the invention has the advantages that: YOLOv3-Lite was detectable for either long range cracks, small cracks at rivets, or small cracks in aircraft engine interior blades that were dimly lit and cluttered in background.

Description

Micro-feature detection algorithm based on image

Technical Field

The invention relates to the field of detection algorithms, in particular to a micro-feature detection algorithm based on an image.

Background

The airplane fault can cause serious hidden danger to the flight safety if not eliminated in time, so that the airline companies are required to timely troubleshoot and maintain the structural damage of the airplane, the future development prospect of the civil aviation industry is wide at present, the number of airports and the number of airplanes in China are continuously increased, more and more groups select the airplanes as travel tools due to the convenience of the airplanes, but the airplane safety accidents can also occur every year, most of the reasons are the faults of the airplane, the mechanical property of the material can decay with the increase of the service life of the airplane, the crack damage is easy to generate, especially for the old airplane, the possibility of generating the damage is higher because of the long service life, therefore, in order to ensure the flight safety of the airplane, the airline company can perform maintenance work on one airplane for 3 times or more on average every day, wherein the maintenance work includes inspection before flight, after flight and inspection at stations for 1 time or more.

The prior art has the following defects: the existing method for detecting the damage of the airplane mainly comprises visual detection and nondestructive detection, wherein the nondestructive detection technology comprises ultrasonic detection, ray detection, penetration detection and other technologies, and in addition, detection means such as infrared, microwave, acoustic vibration, industrial CT and the like play an important role, the infrared detection technology is taken as an example, the method adopts the radiation principle to scan the temperature change caused by defects on the surface of the airplane and obtain the damage information according to the change, although the nondestructive detection technology is mature, the method still has problems, such as the incomplete training system of professional technicians, the problem of delay in the purchase and use of nondestructive detection equipment and the like, compared with the nondestructive detection, the visual detection is more convenient and easy to operate, the method is a main detection mode in the maintenance of the airplane and the cargo aircraft, the visual detection accounts for 90 percent and 80 percent respectively according to statistics, visual inspection is one of the most common ways to inspect and maintain the structures of an airplane, and all-around inspection is performed by crew members during the flying interval of the airplane, but the number of airplanes increases faster, and the training speed of the crew members is difficult to match.

Therefore, it is necessary to invent an image-based micro-feature detection algorithm.

Disclosure of Invention

The invention provides a micro-feature detection algorithm based on an image, which is characterized in that the offset, the category result and the confidence of a position frame of an input image are obtained by respectively passing each layer of feature map through one layer of convolution layer, then the actual position of each grid prediction frame can be obtained through the position offset, and category information is obtained through logistic regression, then the probability of crack damage in the frame is predicted according to a formula, and finally the final network prediction result is obtained through a non-maximum inhibition method, so that the problems of visual detection and nondestructive detection of a damage detection method are mainly solved.

In order to achieve the above purpose, the invention provides the following technical scheme: an image-based micro-feature detection algorithm, comprising the steps of:

s1, inputting a picture, obtaining a three-layer feature pyramid after a feature extraction network consisting of depth separable convolution, and obtaining a prediction result of the input picture, namely the offset, the category result and the confidence of each picture position frame after each layer of feature graph passes one layer of convolution layer;

s2, obtaining the actual position of each prediction frame through the position offset; obtaining category information through logistic regression;

s3, the confidence is the product of the probability of the object existing in the prediction box and the intersection of the real box and the prediction box:

s4, wherein P_r(Object) is the probability that there is fracture damage in the prediction box,

IoU of a prediction frame and a crack damage label frame, and finally obtaining a final network prediction result by a non-maximum value inhibition method;

s5, completing feature extraction of the Yolov3-Lite by depth separable convolution design, wherein the feature extraction structure of the Yolov3-Lite has 52 depth separable convolution layers;

s6, a three-layer feature pyramid of YOLOv3-Lite is used for detecting cracks with different sizes, the dimension of an input image is 416 x 416, a feature graph with 13 x 13 output dimension is obtained through the last layer of a feature extraction network and marked as f1, the 27 th layer is connected with f1 of up sampling, the feature graph with 26 x 26 output dimension is marked as f2, finally, the feature graph with 52 x 52 dimension is obtained through calculation of the 10 th layer of the network and is connected with f2 of up sampling, the connection part is constructed by a residual error network, the residual error network can combine low-level semantic information and high-level semantic information, under the connection mode, the network can effectively learn the crack features with different sizes, and finally, the three layers form the feature pyramid;

s7, dividing the data set into ten parts randomly in the training process, wherein nine parts are training sets, one part is a verification set, the network training comprises two stages, the first stage adopts a model pre-trained on the ImageNet data set, freezes the whole feature extraction layer, only trains the last three convolutional layers, the second stage trains the parameters of the whole network, the size of each batch in the first stage is set to be 10, the learning rate is set to be 0.001, the Adam optimization mode is adopted, the data set is called a round each time the network training is finished, if the network training is carried out for 3 rounds, the loss of the verification set is not reduced, the learning rate is reduced to be 1/10, the training is stopped when the error of the verification set is reduced to be less than 0 after 10 rounds of training, 300 rounds are iterated, the second stage adjusts the parameters of the whole model because the characteristic of the crack is far away from the characteristic of the object in the ImageNet, the Batchsize in the stage is set to be 4, the learning rate is set to 0.0001, adjusted in Adam's optimization and callback as in phase one, and the same end training condition is used for a total of 50 iterations in this phase.

Preferably, in S5, each depth separable convolutional layer includes a depth convolutional layer and a point-by-point convolutional layer, and all layers are followed by a Batch Normalization layer and a ReLU nonlinear layer.

Preferably, in S6, the receptive field of the 13 × 13 feature map is large for detecting large-sized cracks.

Preferably, in S6, the smaller receptive field of the 52 × 52 feature map can be used for detecting smaller cracks.

Preferably, in S6, the 26 × 26 feature map is between the 13 × 13 feature map and the 52 × 52 feature map.

Preferably, in S7, all training is run on the video card of NVIDIA Tesla K20 GPU, and the development environment is tensrflow 1.7.0.

The invention has the beneficial effects that:

the crack damage on the surface of the airplane body far away from the camera is small in proportion in the whole image, the position of the crack can still be accurately detected by the YOLOv3-Lite, and the YOLOv3-Lite can be detected no matter whether the crack is observed remotely and the crack is tiny at a rivet or the crack is tiny at the blade inside the airplane engine with dim light and messy background.

Drawings

FIG. 1 is a schematic diagram of a YOLOv3-Lite network structure provided by the present invention;

FIG. 2 is a schematic diagram of the results of the YOLOv3-Lite experiment provided by the present invention.

Detailed Description

The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.

Referring to fig. 1, the present invention provides an image-based micro-feature detection algorithm, which includes the following steps:

s5, the feature extraction part of YOLOv3-Lite is completed by depth separable convolution design, the feature extraction structure of YOLOv3-Lite has 52 depth separable convolution layers, each depth separable convolution layer comprises a depth convolution layer and a point-by-point convolution layer, and all the layers are followed by a Batch Normalization layer and a ReLU nonlinear layer.

S6, a three-layer feature pyramid of YOLOv3-Lite is used for detecting cracks with different sizes, the dimension of an input image is 416 x 416, a feature map with 13 x 13 output dimensions is obtained through the last layer of a feature extraction network and is marked as f1, the 27 th layer is connected with f1 of up sampling, the feature map with 26 x 26 output dimensions is marked as f2, finally, the feature map with 52 x 52 dimension is obtained through calculation of the 10 th layer of the network and is connected with f2 of up sampling, a connection part is constructed by a residual error network, the residual error network can combine low-level semantic information and high-level semantic information, under the connection mode, the network can effectively learn the crack features with different sizes, the last three layers form the feature pyramid, the sense field of the 13 x 13 feature map is large for detecting the cracks with large sizes, the sense field of the 52 x 52 feature map is smaller for detecting smaller cracks, the 26 × 26 feature map is intermediate between the 13 × 13 feature map and the 52 × 52 feature map.

Feature extraction network architecture

S7, dividing the data set into ten parts randomly in the training process, wherein nine parts are training sets, one part is a verification set, the network training comprises two stages, the first stage adopts a model pre-trained on the ImageNet data set, freezes the whole feature extraction layer, only trains the last three convolutional layers, the second stage trains the parameters of the whole network, the size of each Batch in the first stage is set to be 10, the learning rate is set to be 0.001, the Adam optimization mode is adopted, the data set is called a round each time the network training is finished, if the network training is carried out for 3 rounds, the loss of the verification set is not reduced, the learning rate is reduced to be 1/10, the training is stopped when the error of the verification set is reduced to be less than 0 after 10 rounds of training, 300 rounds are iterated, the second stage adjusts the parameters of the whole model because the characteristic of the crack is far away from the characteristic of the object in the ImageNet, the size of the Batch size in the stage is set to be 4, the learning rate is set to 0.0001, the learning rate is adjusted by adopting the optimization mode of Adam and the callback mode same as the first stage, the same training ending condition is adopted, 50 rounds of iteration are performed in the first stage, all training is performed on a video card of an NVIDIA Tesla K20 GPU, and the development environment is Tensorflow 1.7.0.

The using process of the invention is as follows: when the method is used, an experiment is firstly carried out on a video card of NVIDIA Tesla K20 GPU and a development environment Tensorflow 1.7.0, a data set is randomly divided into ten parts in a training process, nine parts are training sets, one part is a verification set, network training comprises two stages, the first stage adopts a model pre-trained on an ImageNet data set, a whole characteristic extraction layer is frozen, only the last three convolutional layers are trained, the second stage trains parameters of the whole network, the size of each batch in the first stage is set to be 10, the learning rate is set to be 0.001, an Adam optimization mode is adopted, the data set is called one round each time when the data set is traversed once, if the network training is carried out for 3 rounds, the loss of the verification set is not reduced, the learning rate is reduced to the original 1/10, the training is stopped when the error of the verification set is reduced to be less than 0 after the 10 rounds of training, 300 rounds of iteration are performed totally, and the characteristic of a crack is far different from the characteristic of an object in ImageNet, therefore, the second stage adjusts the whole model parameters, the size of the Batch size in the stage is set to be 4, the learning rate is set to be 0.0001, the learning rate is adjusted by adopting the optimization mode of Adam and the callback mode same as the stage one, and the same training ending condition is adopted, and 50 rounds of iteration are performed in the stage;

according to the experimental result of Yolov3-Lite, the left and right four columns are respectively divided into two groups, the left column of each group is the original image, and the right column is the detection result.

The above description is only a preferred embodiment of the present invention, and any person skilled in the art may modify the present invention or modify it into an equivalent technical solution by using the technical solution described above. Therefore, any simple modifications or equivalent substitutions made in accordance with the technical solution of the present invention are within the scope of the claims of the present invention.

Claims

1. An image-based micro-feature detection algorithm, characterized by: the method comprises the following steps:

s7, dividing the data set into ten parts randomly in the training process, wherein nine parts are training sets, one part is a verification set, the network training comprises two stages, the first stage adopts a model pre-trained on the ImageNet data set, freezes the whole feature extraction layer, only trains the last three convolutional layers, the second stage trains the parameters of the whole network, the size of each Batch in the first stage is set to be 10, the learning rate is set to be 0.001, the Adam optimization mode is adopted, the data set is called a round each time the network training is finished, if the network training is carried out for 3 rounds, the loss of the verification set is not reduced, the learning rate is reduced to be 1/10, the training is stopped when the error of the verification set is reduced to be less than 0 after 10 rounds of training, 300 rounds are iterated, the second stage adjusts the parameters of the whole model because the characteristic of the crack is far away from the characteristic of the object in the ImageNet, the size of the Batch size in the stage is set to be 4, the learning rate is set to 0.0001, adjusted in Adam's optimization and callback as in phase one, and the same end training condition is used for a total of 50 iterations in this phase.

2. An image-based micro-feature detection algorithm according to claim 1, wherein: in S5, each depth separable convolutional layer includes a depth convolutional layer and a point-by-point convolutional layer, and all layers are followed by a Batch Normalization and ReLU nonlinear layer.

3. An image-based micro-feature detection algorithm according to claim 2, wherein: in S6, the receptor field of the 13 × 13 feature map is large for detecting large-size cracks.

4. An image-based micro-feature detection algorithm according to claim 1, wherein: in S6, the smaller receptive field of the 52 × 52 feature map can be used to detect smaller cracks.

5. An image-based micro-feature detection algorithm according to claim 1, wherein: in S6, the 26 × 26 feature map is intermediate between the 13 × 13 feature map and the 52 × 52 feature map.

6. An image-based micro-feature detection algorithm according to claim 1, wherein: in S7, all training is run on the video card of NVIDIA Tesla K20 GPU, and the development environment is tensrflow 1.7.0.