CN111242955A

CN111242955A - Road surface crack image segmentation method based on full convolution neural network

Info

Publication number: CN111242955A
Application number: CN202010070397.7A
Authority: CN
Inventors: 苏超; 董义佳; 王文君; 袁荣耀
Original assignee: Hohai University HHU
Current assignee: Hohai University HHU
Priority date: 2020-01-21
Filing date: 2020-01-21
Publication date: 2020-06-05
Anticipated expiration: 2040-01-21
Also published as: CN111242955B

Abstract

The invention discloses a road surface crack image segmentation method based on a full convolution neural network, which is characterized by comprising the following steps of: the method comprises the following steps: the method comprises the following steps that a camera shoots a pavement crack, a picture data set is established, and images in the data set are divided into two types, namely cracked images and non-cracked images; utilizing Lableme software to label the segmented data of the image with the crack in the data set; dividing the constructed data set into a training set and a test set; constructing a full convolution neural network for image segmentation; training the full convolution neural network by using the divided training set, and optimizing related parameters until a global optimal solution is obtained; and submitting the test concentrated crack image to a full convolution neural network, and outputting an image segmentation result.

Description

Road surface crack image segmentation method based on full convolution neural network

Technical Field

The invention belongs to the field of road quality and safety monitoring, and discloses a road surface crack image segmentation method based on a full convolution neural network.

Background

The detection of the pavement cracks is an important traffic maintenance work for ensuring the driving safety, is a crucial step for road management, and aims to obtain pavement maintenance state information. Cracks are the most common type of damage to pavements. Because the automatic crack detection system has the advantages of high safety, low cost, high efficiency, objectivity and the like, the research of the automatic crack detection system is widely concerned.

Image-based crack detection algorithms have been widely discussed and applied over the past few decades. In earlier studies, methods were more based on traditional digital combination or improved image processing techniques such as thresholding, mathematical morphology, and edge detection. These methods are generally based on photometric and geometric assumptions about the properties of the fracture image. However, these methods are very sensitive to noise because they are performed on a single pixel. This remains a challenging task due to crack non-uniformity and background complexity, such as low contrast to the surrounding pavement, and the potential for near-intensity shadows.

In recent years, deep Convolutional Neural Networks (CNNs) have found widespread use in computer image processing.

Deep neural networks have impressive performance in many computer vision tasks, demonstrating the effectiveness of learned deep features. A depth Full Convolution Network (FCN) model is constructed, semantic segmentation is carried out on a pavement crack image, automatic detection of pavement cracks is achieved, and the method has important significance for monitoring pavement conditions.

Disclosure of Invention

The purpose of the invention is as follows: aiming at the defects of the prior art, the invention provides a road surface crack image segmentation method based on a full convolution neural network, so as to realize crack detection, further improve crack detection efficiency and reduce detection cost.

The technical scheme is as follows: the invention provides a road surface crack image segmentation method based on a full convolution neural network, which comprises the following steps:

the method comprises the following steps that an S1 camera photographs a pavement crack, a picture data set is established, and images in the data set are divided into two types, namely crack-carrying images and crack-free images;

s2, utilizing Lableme software to label the segmentation data of the image with the crack in the data set;

s3 dividing the constructed data set into a training set (3/5 of the data set) and a testing set (2/5 of the data set);

s4, constructing a full convolution neural network for image segmentation;

s5, training the full convolution neural network by using the divided training set, and optimizing related parameters until a global optimal solution is obtained;

and S6, submitting the test concentrated crack image to a full convolution neural network, and outputting an image segmentation result.

Step S2 labels the crack image, and the label is a pixel-level label.

The constructed full convolution neural network specifically comprises the following steps:

the full convolution neural network comprises 1 convolution layer, 16 expansion convolution layers, 1 pyramid sampling layer, 2 decoder layers and 1 semantic output layer.

Step S4 is a full convolution neural network model structure and parameters specifically:

the structure of the full convolution neural network expansion module is specifically as follows:

a. span is 1: the input picture is overlapped with the input through the results of point convolution 1x1 (the activation function is Relu6), depth convolution 3x3 (the activation function is Relu6) and point convolution 1x1 (the activation function is linear) to obtain the output.

b. Span is 2 hours: the input picture is directly output after being subjected to point convolution 1x1 (the activation function is Relu6), depth convolution 3x3 (the activation function is Relu6) and point convolution 1x1 (the activation function is linear).

The full convolution neural network pyramid sampling module has 4 ways, namely sampling the average value with the side length of 10 and averaging the whole characteristic diagram; convolving three cavities with the cavity ratios of 1, 2 and 4 respectively, and extracting information from the characteristic diagram by using different receptive fields; the output results of the 4 paths are scaled to the same size and then combined in a stacked manner.

In the full convolution neural network, a decoder 0 is connected with an extended convolution 3#, 16 characteristic graphs are input, and 32 characteristic graphs are output; the decoder 1 is connected to the convolutional layer, and inputs 8 feature maps and outputs 32 feature maps.

The full convolution neural network parameter optimization uses a Moment optimizer, the momentum value of the optimizer is 0.9, the initial learning rate is 0.0001, a descending learning rate updating strategy is adopted, and the attenuation is 0.9 times of the learning rate of the previous step after every 2000 steps; during training, two pictures are taken to calculate the gradient of the error pair network parameters, and the network parameters are further updated.

The target function of the full convolution neural network for image segmentation is the average of the cross entropy of a given label and the prediction result prediction of the neural network at each pixel:

has the advantages that: compared with the prior art, the road surface crack image segmentation method based on the full convolution neural network has the following beneficial effects:

1. the recognition efficiency is high: the method comprises the steps of establishing a picture data set, and dividing images in the data set into two types, namely a cracked type and a non-cracked type; utilizing Lableme software to label the segmented data of the image with the crack in the data set; dividing the constructed data set into a training set and a test set; constructing a full convolution neural network for image segmentation; the full convolution neural network is trained by utilizing the divided training set, relevant parameters are optimized until a global optimal solution is obtained, and the picture recognition efficiency is effectively improved.

2. The crack detection model is suitable for the detection task of the road surface image of the same material after training is completed, and has wide applicability.

3. According to the crack identification method, the detection vehicle with the camera is used for driving on a road to obtain an original image, and then the crack condition is evaluated through neural network processing, so that the requirements on manpower, material resources and financial resources are reduced.

Drawings

FIG. 1 is an overall flow chart designed based on the method of the present invention;

FIG. 2 is a diagram of a full convolutional neural network architecture in accordance with the present invention;

FIG. 3 shows the segmentation result of the road surface crack image by the method of the present invention.

Detailed Description

The present invention is further illustrated by the following figures and specific examples, which are to be understood as illustrative only and not as limiting the scope of the invention, which is to be given the full breadth of the appended claims and any and all equivalent modifications thereof which may occur to those skilled in the art upon reading the present specification.

In the present embodiment, the road surface crack image is subjected to image segmentation to detect cracks. Firstly, preparing a data set, photographing a campus road by using a camera, acquiring an original crack image, and establishing the data set. And then, cutting the image acquired by the camera on a computer, wherein the size of the cut image is 320X320, and after the data set is enhanced, the data set comprises 6000 pictures. Wherein, the number of the images with cracks and without cracks is 3000, and the number of the images for model training and testing is 1800 images and 1200 images respectively. And then, carrying out segmentation data annotation on the image with the crack in the data set by using Lableme software. The construction and training work of the model is completed under a Tensorflow framework, the batch size is all set to be 2, a Moment optimizer is used for optimizing a cost function, the initial learning rate is set to be 0.0001, a descending learning rate updating strategy is adopted, and the attenuation is 0.9 times of the learning rate of the previous step after every 2000 steps. During training, two pictures are taken to calculate the gradient of the error to the network parameters, so that the network parameters are updated. The neural network input image size is unified into a picture of 320 × 320 size. The fracture image segmentation results are shown in fig. 3.

The experimental platform environment is configured as follows:

intel (R) core (TM) i7CPU @3.20GHz, NVIDIA GTX1080Ti video card, 16G DDR4 memory (main frequency 2400MHz), 11G video memory (frequency 1.4 GHz). The whole training process is completed on the GPU in an accelerated mode.

Referring to fig. 1, the method for segmenting the road surface crack image comprises the following specific steps:

s1, photographing the pavement cracks by using a camera, establishing a data set, and dividing images in the data set into two types, namely cracks and no cracks;

s2, utilizing Lableme software to label the segmented data of the image with the crack in the data set;

s3, dividing the constructed data set into a training set (3/5 of the data set) and a testing set (2/5 of the data set);

s4, constructing a full convolution neural network for image segmentation;

the full convolution neural network consists of 1 convolution layer, 16 expansion convolution layers, 1 pyramid sampling layer, 2 decoder layers and 1 semantic output layer.

The structure and parameters of the full product neural network model in step S4 are as follows:

s6, submitting the test concentrated crack image to a full convolution neural network, and outputting an image segmentation result;

The full convolution neural network pyramid sampling module has 4 ways, namely sampling the average value with the side length of 10 and averaging the whole characteristic diagram; three hole convolutions with hole ratios 1, 2, 4, respectively, are used to extract information from the feature map using different receptive fields. The output results of the 4 paths are scaled to the same size and then combined in a stacked manner.

The full convolution neural network parameter optimization uses a Moment optimizer, the momentum value of the optimizer is 0.9, the initial learning rate is 0.0001, a descending learning rate updating strategy is adopted, and the attenuation is 0.9 times of the learning rate of the previous step after every 2000 steps. During training, two pictures are taken to calculate the gradient of the error to the network parameters, so that the network parameters are updated.

it should be noted that, for those skilled in the art, without departing from the principle of the present invention, several improvements and modifications can be made, and these improvements and modifications should also be construed as the protection scope of the present invention. All the components not specified in the present embodiment can be realized by the prior art.

Claims

1. A road surface crack image segmentation method based on a full convolution neural network is characterized by comprising the following steps: the method comprises the following steps:

s3 dividing the constructed data set into a training set and a testing set;

s4, constructing a full convolution neural network for image segmentation;

2. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1, wherein the image segmentation method comprises the following steps: step S2 labels the crack image, and the label is a pixel-level label.

3. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1, characterized in that: the constructed full convolution neural network specifically comprises the following steps:

4. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1 or 3, wherein the image segmentation method comprises the following steps:

5. the road surface crack image segmentation method based on the full convolution neural network as claimed in claim 4, wherein the method comprises the following steps: the structure of the full convolution neural network expansion module is specifically as follows:

6. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1, wherein the method comprises the following steps: the full convolution neural network pyramid sampling module has 4 ways, namely sampling the average value with the side length of 10 and averaging the whole characteristic diagram; convolving three cavities with the cavity ratios of 1, 2 and 4 respectively, and extracting information from the characteristic diagram by using different receptive fields; the output results of the 4 paths are scaled to the same size and then combined in a stacked manner.

7. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1, wherein the method comprises the following steps: in the full convolution neural network, a decoder 0 is connected with an extended convolution 3#, 16 characteristic graphs are input, and 32 characteristic graphs are output; the decoder 1 is connected to the convolutional layer, and inputs 8 feature maps and outputs 32 feature maps.

8. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1, wherein the method comprises the following steps: the full convolution neural network parameter optimization uses a Moment optimizer, the momentum value of the optimizer is 0.9, the initial learning rate is 0.0001, a descending learning rate updating strategy is adopted, and the attenuation is 0.9 times of the learning rate of the previous step after every 2000 steps; during training, two pictures are taken to calculate the gradient of the error pair network parameters, and the network parameters are further updated.

9. The road surface crack image segmentation method based on the full convolution neural network as claimed in claim 1, wherein the method comprises the following steps: the target function of the full convolution neural network for image segmentation is the average of the cross entropy of a given label and the prediction result prediction of the neural network at each pixel: