CN111524117A

CN111524117A - Tunnel surface defect detection method based on characteristic pyramid network

Info

Publication number: CN111524117A
Application number: CN202010312482.XA
Authority: CN
Inventors: 汪俊; 隆昆; 李大伟; 李虎; 刘树亚
Original assignee: Nanjing University of Aeronautics and Astronautics
Current assignee: Nanjing University of Aeronautics and Astronautics
Priority date: 2020-04-20
Filing date: 2020-04-20
Publication date: 2020-08-11

Abstract

The invention discloses a tunnel surface defect detection method based on a characteristic pyramid network, which comprises the steps of collecting tunnel images with surface defects, constructing a data set, and dividing the images in the data set into a training set and a testing set; preprocessing a tunnel surface image; marking defects on the surface of the tunnel, and marking cracks and hollow areas in the image by adopting a marking tool; constructing a significance detection network for defect detection; and constructing a training loss function, and detecting the tunnel surface defects by using a detection network of the member. The method has the advantages of strong robustness and significance detection effectiveness, and can well locate the correct tunnel surface defects.

Description

Tunnel surface defect detection method based on characteristic pyramid network

Technical Field

The invention relates to a tunnel surface defect detection method, in particular to a tunnel surface defect detection method based on a characteristic pyramid network, and belongs to the field of computer vision and image processing.

Background

In recent years, with the gradual saturation of ground road traffic of various large cities, the situation of traffic jam at a peak is more and more serious, and subways gradually become main public transport means of the cities due to the advantages of convenience, quickness and the like. The development prospect of the subway is good, and each large city is also tightening and widening a subway network and increasing subway lines. The quality of the tunnel is unqualified due to various reasons in the subway construction process, and the quality problems of cracks and the like in the tunnel due to the surrounding construction of the tunnel or natural disasters in the subway tunnel use process can further cause major accidents such as tunnel collapse. Therefore, the defect detection is carried out in the subway construction process and the subway using process, so that the occurrence of major traffic safety accidents can be effectively avoided and prevented. The traditional defect detection method mainly depends on human eyes for detection, consumes a large amount of time, manpower and financial resources, and is not suitable for the requirement of the modern society on the fault detection efficiency, so that the development of a high-precision and high-efficiency tunnel surface defect detection method plays an important role in the development of modern tunnel traffic.

At present, the tunnel surface defect detection method except for manual identification is mainly a method based on the traditional digital image processing technology, and comprises the following steps: (1) a detection method based on threshold processing; (2) detection methods based on edge detection and morphological operations. The detection method based on threshold processing is mainly realized by searching an interested area through gray level, but the detection effect in a complex background is poor, and false detection is easy to occur when the gray level value of a target to be detected is close to that of noise. The processing method based on edge detection and morphological operation mainly comprises the steps of obtaining a defect edge through an edge detection operator, and then determining a defect area through morphological operation.

Disclosure of Invention

The invention aims to provide a tunnel surface defect detection method based on a characteristic pyramid network, and improve the efficiency and the precision of tunnel surface defect detection.

In order to solve the technical problems, the technical scheme adopted by the invention is as follows:

a tunnel surface defect detection method based on a characteristic pyramid network is characterized by comprising the following steps:

the method comprises the following steps: acquiring tunnel images with surface defects, constructing a data set, and dividing the images in the data set into a training set and a testing set;

step two: preprocessing a tunnel surface image;

step three: marking defects on the surface of the tunnel, and marking cracks and hollow areas in the image by adopting a marking tool;

step four: constructing a significance detection network for defect detection;

step five: and constructing a training loss function, and detecting the tunnel surface defects by using the constructed detection network.

Further, the first step is specifically to collect a tunnel image with surface defects, cut key information of the image, construct an image library, perform scaling, turning and mirror image expansion on the image in the image library, and divide the image in the expanded image library into a training sample set and a testing sample set according to a ratio of 4: 1.

Further, the second step is specifically to perform preprocessing on the tunnel surface image and remove image noise by using guided filtering, and the specific process is as follows:

taking the image as a two-dimensional function, assuming that the output and input of the function satisfy a linear relationship within a two-dimensional window:

where q is the value of the output pixel, I is the input image, I and k are the pixel indices, ω_kIs a window, a_kAnd b_kIs the coefficient of the linear function when the window is located at k; taking the gradient on two sides of the above formula:

the difference between the fitting function output value and the true value is:

wherein p is a true value, and e represents a minimum value;

the coefficients of the image function are obtained by the least squares method:

where | ω | is the window ω |_kNumber of middle pixels, μ_kIs that I is in the window omega_kThe average value of (a) is,

is that I is in the window omega_kThe variance in (a) is greater than or equal to,

is that the image p to be filtered is in the window omega_kThe average value of (1);

when calculating the linear coefficients of each window, since each pixel is described by a plurality of linear functions, when specifically calculating the output value of a certain point, all the linear functions including the point are averaged:

in the above formula I_iIs the intensity of the pixel i, ω_kIs the total window containing pixel i, k is its central position,

represents the slope parameter of the linear regression,

representing the linear regression intercept parameter.

Further, the fourth step is specifically to construct a saliency detection network, and construct a feature pyramid network based on the VGG16 network. Two low-level blocks Conv1-2 and Conv2-2 of the VG 16 network are used for extracting low-level features of the image, and because background regions which interfere with the generation of the saliency map exist in the low-level features, a space attention mechanism is adopted to filter out some background details according to the high-level features so as to focus more on the foreground regions;

taking the characteristics generated by three high-level blocks Conv3-3, Conv4-5 and Conv5-3 of the VGG16 network as initial high-level characteristics, and in order to enable the finally extracted high-level characteristics to contain scale and shape invariance, adopting the convolution of holes with expansion rates of 3, 5 and 7 respectively to obtain the context information of multiple receptive fields; then, a channel attention mechanism is adopted to allocate a larger weight value for a channel with high response to the obvious target; cross-channel cascade connection is carried out on the feature mapping of the void convolution layers with different expansion rates and the 1 x 1 dimensional dimension reduction features to obtain three features with different scales and context perception information, and two smaller scale features are promoted to the scale with the same maximum feature scale through up-sampling; and finally, combining the high-level features and the low-level features through cross-channel cascade connection to serve as the output of the context-aware pyramid feature network.

Further, a channel attention mechanism is added after the context perception pyramid features are extracted, and the weighted multi-scale and multi-perception wild features are extracted:

CA＝F(v^h,W)＝σ₁(fc₂((fc₁(v^h,W₁)),W₂)) (7)

where CA stands for channel attention, v^hRepresenting the channel feature vector, W refers to the parameter in the attention block of each channel, σ₁Sigmoid operation is indicated, fc is a full connection layer, and an activation function ReLU is indicated; the final output of the channel attention module is defined as the high level feature f of the feature pyramid^hThe weighting value of (1):

the saliency map of the low-level features contains a large amount of detail information, and in order to obtain a detailed boundary between a salient object and a background, a spatial attention method is adopted to concentrate attention on a foreground area; to increase the receptive field to obtain global information without increasing parameters, convolutional layers with two kernels of 1 × k and k × 1, respectively, are used for high-level feature capture spatial interest:

wherein C is₁、C₂Representing intermediate results of two different orders of convolution operations, conv₁、conv₂Which represents a convolution operation, is a function of,

the parameters in the spatial attention module are represented,

an output of the channel attention module;

then, carrying out normalization processing on the coded spatial feature map mapped to [0,1] by using sigmoid operation:

where SA denotes spatial attention, W denotes a parameter in the spatial attention module, σ₂Representing sigmoid operation;

the final output of the spatial attention module is the low-level feature f^lThe weighting value of (1):

further, the fifth step is to define a loss function in the significance target detection by using the cross entropy between the final significance detection result graph and the theoretical result:

where Y represents the theoretical result, P represents the significance of the network output, α_sA balance parameter representing a positive sample and a negative sample;

the above loss function only provides a general guidance for generating a saliency map, and in order to highlight the generation of the boundary details of a salient object, a laplacian operator is used to obtain a theoretical boundary of the salient object and a boundary of the salient object in the saliency map output by a network:

wherein f represents a feature, x, y are cartesian coordinates of the xy plane, the laplace algorithm is executed using the gradient of the image, directly invoking the convolution operation conv () internally, tanh () is an activation function, K_laplaceIs laplacian, abs () represents absolute value operations;

the generation of the significant object boundary is then supervised by cross entropy loss:

finally, defining the total loss function as the weighted sum of the significant object detection loss and the significant object boundary detection loss:

L＝αL_S+(1-α)L_B(17)

wherein L is a loss function, L_SFor significant target detection loss, L_BFor significant object boundary detection loss α is a weight.

Further, α in the formula (13)_s＝0.528。

Further, in the model training process, firstly, α is set to 1.0, the initial learning rate is 0.01, a rough saliency map is generated through training, then α is adjusted to refine the boundaries of the saliency map, and the adjusted value of α is 0.7.

Compared with the prior art, the invention has the following advantages and effects: the invention provides a tunnel surface defect detection method based on a significance detection network aiming at the problems that the defect detection of the surface of a subway tunnel is difficult to finish with high efficiency and high precision, and the like, wherein a characteristic pyramid attention network is designed and established by considering different characteristics of different levels of characteristics, a context perception pyramid characteristic extraction module is designed aiming at high-level characteristics, and multi-scale void convolution and a channel attention mechanism are used for obtaining the high-level semantic characteristics; using a spatial attention mechanism for low-level features to suppress background noise, attention is focused on salient objects. Experiments show that the network has stronger robustness and significance detection effectiveness, and the method can well locate correct tunnel surface defects.

Drawings

FIG. 1 is a flowchart of a method for detecting defects on a tunnel surface based on a feature pyramid network according to the present invention.

FIG. 2 is a schematic diagram of tunnel surface defect test data according to an embodiment of the present invention.

Fig. 3 is a schematic diagram of a tunnel surface defect detection result according to an embodiment of the present invention.

FIG. 4 is a schematic diagram of a feature pyramid attention network structure according to an embodiment of the present invention.

Detailed Description

To further clarify the working principle and working process of the present invention, the following description is given in conjunction with the accompanying drawings and specific embodiments for facilitating the understanding and implementation of the present invention for those skilled in the art, and the following detailed description is given for the purpose of illustration and explanation, and it is to be understood that the embodiments described herein are only used for the purpose of illustration and explanation, and are not to be construed as limiting the present invention.

As shown in fig. 1, a method for detecting defects on a tunnel surface based on a feature pyramid network of the present invention is characterized by comprising the following steps:

the method comprises the steps of collecting tunnel images with surface defects, cutting key information of the images, constructing an image library, carrying out scaling, turning and mirror image expansion on the images in the image library, and dividing the images in the expanded image library into a training sample set and a testing sample set according to the ratio of 4: 1.

Step two: preprocessing a tunnel surface image;

the method comprises the following steps of preprocessing a tunnel surface image, and removing image noise by using guide filtering, wherein the specific process comprises the following steps:

the difference between the fitting function output value and the true value is:

wherein p is a true value, and e represents a minimum value;

represents the slope parameter of the linear regression,

representing the linear regression intercept parameter.

step four: constructing a significance detection network for defect detection;

and constructing a significance detection network, and constructing a feature pyramid network based on the VGG16 network. Two low-level blocks Conv1-2 and Conv2-2 of the VG 16 network are used for extracting low-level features of the image, and because background regions which interfere with the generation of the saliency map exist in the low-level features, a space attention mechanism is adopted to filter out some background details according to the high-level features so as to focus more on the foreground regions;

Adding a channel attention mechanism after the context perception pyramid feature extraction, and extracting the weighted multi-scale and multi-perception wild features:

CA＝F(v^h,W)＝σ₁(fc₂((fc₁(v^h,W₁)),W₂)) (7)

the parameters in the spatial attention module are represented,

an output of the channel attention module;

step five: and constructing a training loss function, and detecting the tunnel surface defects by using a detection network of the member.

The cross entropy between the final saliency detection result map and the theoretical result is typically used in saliency target detection to define a loss function:

where Y represents the theoretical result, P represents the significance of the network output, α_sRepresenting the balance parameters of the positive and negative samples, α in this example_s＝0.528。

L＝αL_S+(1-α)L_B(17)

In the model training process, firstly, alpha is set to be 1.0, the initial learning rate is 0.01, a rough saliency map is generated through training, then alpha is adjusted to refine the boundaries of the saliency map, and the adjusted optimal value of alpha is 0.7.

And finally, inputting the data to be tested into the network model with the good component to automatically detect the defects of the tunnel surface.

The invention provides a tunnel surface defect detection method based on a significance detection network aiming at the problems that the defect detection of the surface of a subway tunnel is difficult to finish with high efficiency and high precision, and the like, wherein a characteristic pyramid attention network is designed and established by considering different characteristics of different levels of characteristics, a context perception pyramid characteristic extraction module is designed aiming at high-level characteristics, and multi-scale void convolution and a channel attention mechanism are used for obtaining the high-level semantic characteristics; using a spatial attention mechanism for low-level features to suppress background noise, attention is focused on salient objects. Experiments show that the network has stronger robustness and significance detection effectiveness, and the method can well locate correct tunnel surface defects.

The foregoing shows and describes the basic principles, principal steps and advantages of the present invention. It is to be understood that the invention is not limited to the embodiments described above, which are described in the foregoing embodiments and description only to illustrate the principles of the invention, but rather that various changes and modifications may be made without departing from the spirit and scope of the invention, as defined by the appended claims, the description and equivalents thereof.

Claims

1. A tunnel surface defect detection method based on a characteristic pyramid network is characterized by comprising the following steps:

step two: preprocessing a tunnel surface image;

step four: constructing a significance detection network for defect detection;

2. The method for detecting defects on the surface of a tunnel based on a feature pyramid network as claimed in claim 1, wherein: the first step is to collect tunnel images with surface defects, cut key information of the images, construct an image library, zoom, turn and mirror image expansion the images in the image library, and divide the images in the expanded image library into a training sample set and a testing sample set according to a ratio of 4: 1.

3. The method for detecting defects on the surface of a tunnel based on a feature pyramid network as claimed in claim 1, wherein: the second step is specifically to carry out pretreatment on the tunnel surface image and remove image noise by using guide filtering, and the specific process is as follows:

the difference between the fitting function output value and the true value is:

wherein p is a true value, and e represents a minimum value;

represents the slope parameter of the linear regression,

representing the linear regression intercept parameter.

4. The method for detecting the defects of the tunnel surface based on the feature pyramid network as claimed in claim 1, wherein: and the fourth step is specifically to construct a significance detection network, and construct a feature pyramid network based on the VGG16 network. Two low-level blocks Conv1-2 and Conv2-2 of the VG 16 network are used for extracting low-level features of the image, and because background regions which interfere with the generation of the saliency map exist in the low-level features, a space attention mechanism is adopted to filter out some background details according to the high-level features so as to focus more on the foreground regions;

5. The method for detecting defects on the surface of a tunnel based on a feature pyramid network as claimed in claim 4, wherein: adding a channel attention mechanism after the context perception pyramid feature extraction, and extracting the weighted multi-scale and multi-perception wild features:

CA＝F(v^h，W)＝σ₁(fc₂((fc₁(v^h，W₁))，W₂)) (7)

the parameters in the spatial attention module are represented,

an output of the channel attention module;

6. the method for detecting defects on the surface of a tunnel based on a feature pyramid network as claimed in claim 1, wherein: the fifth step is to define a loss function by using the cross entropy between the final significance detection result graph and the theoretical result in the significance target detection:

L＝αL_s+(1-α)L_B(17)

7. The method as claimed in claim 6, wherein α in formula (13) is defined as_s＝0.528。

8. The method for detecting defects on the surface of a tunnel based on a feature pyramid network as claimed in claim 6, wherein: in the model training process, firstly, alpha is set to be 1.0, the initial learning rate is 0.01, a rough saliency map is generated through training, then alpha is adjusted to refine the boundaries of the saliency map, and the adjusted value of alpha is 0.7.