CN113537033A

CN113537033A - Building rubbish remote sensing image identification method based on deep learning

Info

Publication number: CN113537033A
Application number: CN202110785190.2A
Authority: CN
Inventors: 颜子健; 董静薇
Original assignee: Harbin University of Science and Technology
Current assignee: Harbin University of Science and Technology
Priority date: 2021-07-12
Filing date: 2021-07-12
Publication date: 2021-10-22

Abstract

A building rubbish remote sensing image recognition method based on deep learning belongs to the field of remote sensing image recognition. The existing remote sensing image identification method is easy to interfere, and the whole information can not be mined, so that the identification precision is low. A building rubbish remote sensing image recognition method based on deep learning comprises the steps of preprocessing an obtained remote sensing image to obtain a remote sensing image data set; expanding a remote sensing image data set sample, adding an L2 regularization punishment item to the seventh layer of the neural network, and training a network model added with an L2 regularization punishment item by using the expanded data set to obtain a target identification model; the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab is calculated to realize the improvement of the semantic segmentation algorithm; and image recognition is carried out by utilizing an improved recognition model and an algorithm. The method identifies the accuracy draft, can monitor the processing progress of illegal stacking, and realizes dynamic tracking monitoring and purification of the urban environment.

Description

Building rubbish remote sensing image identification method based on deep learning

Technical Field

The invention relates to a building rubbish remote sensing image identification method based on deep learning.

Background

The remote sensing image identification roughly goes through the following processes: traditional remote sensing image identification methods based on pixel, such as maximum likelihood method and K-Means mean value method, but the image spectrum brightness information is easy to be interfered, and the whole information can not be mined, so that the method is easy to generate 'salt and pepper noise', and is only used as a contrast item or a preprocessing method at present; based on the object-oriented remote sensing identification method, although the advantage of rich attribute features of the polygonal object is exerted, the object is easy to over-segment or under-segment, and the segmentation scale is not easy to determine; the image semantics based on the image elements are segmented into popular research directions for the current remote sensing image recognition, the characteristics of strong self-learning capability and fault-tolerant capability of the image semantics based on the image elements are derived from deep learning, and the research and implementation of thousands of classification methods are established.

However, the effectiveness of the deep learning method also depends on the richness of the training data, and a large amount of sample data becomes a necessary condition for research. The sample expansion can be obtained by using a simple data enhancement method such as inversion, and can also be obtained by using a machine learning method such as generation countermeasure network to perform picture synthesis. In addition, the deep learning related research method often requires that the feature distribution of training data and test data is similar, but the requirement is difficult to achieve in practical project application, so the deep learning method is difficult to be applied in engineering projects due to the data requirement. Migration learning is a branch of deep learning research methods and is a great research hotspot at present.

Disclosure of Invention

The invention aims to solve the problems that the existing remote sensing image identification method is easy to interfere and low in identification precision due to the fact that the existing remote sensing image identification method cannot mine whole information, and provides a building rubbish remote sensing image identification method based on deep learning.

A building rubbish remote sensing image identification method based on deep learning is achieved through the following steps:

firstly, preprocessing an acquired remote sensing image to obtain a remote sensing image data set;

expanding a remote sensing image data set sample, adding an L2 regularization punishment item into the seventh layer of the neural network, and training a network model added with an L2 regularization punishment item by using the expanded data set to obtain a target identification model;

step three, improving a semantic segmentation algorithm by calculating the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab;

and step four, carrying out image recognition by using the improved recognition model and the improved algorithm.

Preferably, the step of preprocessing the acquired remote sensing image to obtain a remote sensing image data set in the step one includes:

and performing remote sensing image preprocessing operation of orthorectification and image fusion on the remote sensing image by adopting an ENVI platform, and performing histogram equalization operation on the result data.

Preferably, the step of expanding the remote sensing image dataset sample described in the step two fuses features of a plurality of images in a manner of improving generation of a countermeasure network, and the specific steps include:

the GAN generates new data on the basis of the original data set, and the GAN generation countermeasure network comprises two models: generating a model and a discrimination model, wherein the representative symbols of the two models are G and D respectively; the game implementation using these two models generates a competing network,

and if z is random noise and x is real data, the generative network and the discriminant network are respectively represented by G and D, wherein D can be regarded as a two-classifier, and then the cross entropy is adopted for representation, and the description is as follows:

minmaxV＝Ex～pdata(x)[logD(x)]+Ez～pz(z)[log(1-D(G(z)))]

wherein, logD (x) of the first item represents the judgment of the discriminator on the real data, and log (1-D (G (z)) of the second item represents the synthesis and judgment of the data; through such a Max-min game, G and D are respectively optimized to train the required generating network and the required discriminating network circularly and alternately until a Nash equilibrium point is reached;

the DCGAN provides a training framework on the basis of the GAN, the DCGAN conducts training guidance on the GAN, replaces a full connection layer with a convolution layer, removes a pooling layer, and introduces development results of a discriminant model into a generated model by adopting Batch Normalization (BN) technology.

Preferably, the step three of improving the recognition model and the recognition semantic segmentation algorithm by calculating the mliou ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep lab includes:

a semantic segmentation method of deep Lab is adopted, the condition randomness and the convolution neural network are combined, the full convolution network is used as a basis, and each layer is continuously optimized; then, the mIOU ratio of the intersection and union of the real value and the predicted value is calculated, and the improvement of the semantic segmentation algorithm is realized.

The invention has the beneficial effects that:

the invention applies the deep learning algorithm to the semantic segmentation of the remote sensing image and to the identification of the construction waste in the remote sensing image, thereby saving more manpower and material resources. The method is characterized in that a sample expansion experiment for image generation is carried out aiming at the problem of few samples of a building remote sensing data set, urban building rubbish is detected from the aspect of semantic segmentation of remote sensing images, a reliable data source expansion method is provided for urban building rubbish remote sensing monitoring, and technical support is provided for building rubbish stockpiling management. Meanwhile, the earth observation remote sensing technology has the characteristics of long-distance detection, large-area coverage, short revisit period and the like, whether the construction waste is cleared or not can be found quickly through the research, the current situation information such as the stacking area of the construction waste can be mastered, the illegal stacking processing progress can be monitored, and the urban environment can be dynamically tracked, monitored and purified.

Drawings

FIG. 1 is a flow chart of a method of the present invention;

FIG. 2 is a basic model for generating a countermeasure network to which the present invention relates;

FIG. 3 is a diagram of a DCGAN generator according to the present invention;

FIG. 4 shows the structure of DeepLabv3ASPP according to the present invention.

Detailed Description

The first embodiment is as follows:

in the embodiment, as shown in fig. 1, the method for identifying the remote sensing image of the construction waste based on deep learning is implemented by the following steps:

The second embodiment is as follows:

different from the first specific embodiment, in the first method for identifying remote sensing images of construction waste based on deep learning of the present embodiment, the step of preprocessing the acquired remote sensing images to obtain a remote sensing image data set includes:

due to the influence of the overall positioning precision of a remote sensing information platform and the error rate of a sensor, the reason that layers of panchromatic images and multispectral images in a satellite remote sensing technology image can not be aligned and the like, more preprocessing operations are needed to be carried out on the satellite remote sensing image, the remote sensing image preprocessing operations of orthorectification and image fusion are carried out on the remote sensing image by adopting an ENVI platform, and histogram equalization operation is carried out on the result data; the relative position precision of the remote sensing data is improved, the image quality is improved, and the data characteristics are enhanced.

The third concrete implementation mode:

different from the first or second specific embodiments, in the method for identifying the remote sensing images of the construction waste based on deep learning of the second embodiment, the step of expanding the remote sensing image data set sample fuses the features of the plurality of images in a mode of improving and generating a countermeasure network, and the specific steps include:

the method is characterized in that a sample expansion experiment of image generation is carried out aiming at the problem of few samples of a building remote sensing data set, urban building rubbish is detected from the aspect of semantic segmentation of remote sensing images, a reliable data source expansion method is provided for urban building rubbish remote sensing monitoring, and technical support is provided for building rubbish stockpiling management. The generation of the confrontation network is improved, the semantic segmentation network precision is improved, and the production image is closer to a real image.

Common data expansion is implemented by flipping, randomly cropping, rotating, locally distorting the image, and using GAN (creating a competing network) methods. Training of artificial intelligence requires a large number of data sets, which can be costly if collected and labeled all by human labor.

The GAN generates new data on the basis of the original data set so as to train a more robust model, and the GAN generation countermeasure network comprises two models: generating a model and a discrimination model, wherein the representative symbols of the two models are G and D respectively; the game using the two models realizes the generation of the countermeasure network, thereby leading the two models to improve the overall competition effect.

minmaxV＝Ex～pdata(x)[logD(x)]+Ez～pz(z)[log(1-D(G(z)))]

wherein, logD (x) of the first item represents the judgment of the discriminator on the real data, and log (1-D (G (z)) of the second item represents the synthesis and judgment of the data; through such a Max-min game, G and D are respectively optimized to train the required generating network and the required discriminating network circularly and alternately until a Nash equilibrium point is reached; generating a basic model of the countermeasure network is shown in FIG. 2;

the DCGAN provides a training framework on the basis of the GAN, the DCGAN conducts training guidance on the GAN, replaces a full connection layer with a convolution layer, removes a pooling layer, and introduces development results of a discriminant model into a generated model by adopting Batch Normalization (BN) technology. The structure of the DCGAN generator is shown in fig. 3. In addition, DCGAN also emphasizes the importance and guidance of hidden layer analysis and visual counting on GAN training

The fourth concrete implementation mode:

different from the third specific embodiment, in the method for identifying a building waste remote sensing image based on deep learning of the third embodiment, the step of improving the identification model and the identification semantic segmentation algorithm by calculating the mliou ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep lab includes:

by adopting a semantic segmentation method of deep Lab, by combining conditional randomness and a convolutional neural network, and using a full convolutional network as a basis, each layer is continuously optimized, so that the change of a receptive field can be ensured in the semantic segmentation process, and the position information in the aspect of space can also be reserved; then, the mIOU ratio of the intersection and union of the real value and the predicted value is calculated, and the improvement of the semantic segmentation algorithm is realized, so that the accuracy is increased.

The Deeplab can extract different features and control different resolutions, an algorithm has a unique solution, and each level is continuously optimized by combining conditional randomness and a convolutional neural network and using a full convolutional network as a basis, so that the change of a receptive field can be ensured in a semantic segmentation process, and the position information in the aspect of space can be also reserved. The DeepLabv3ASPP structure is shown in FIG. 4.

The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes may be made to the present invention by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A building rubbish remote sensing image identification method based on deep learning is characterized in that: the method is realized by the following steps:

2. The building rubbish remote sensing image recognition method based on deep learning of claim 1, characterized in that: the step one of preprocessing the acquired remote sensing image to obtain a remote sensing image data set comprises the following steps:

3. The construction waste remote sensing image recognition method based on deep learning according to claim 1 or 2, characterized in that: step two the step of expanding the remote sensing image data set sample fuses the characteristics of a plurality of images in a mode of improving and generating a countermeasure network, and the specific steps comprise:

minmaxV＝Ex～pdata(x)[logD(x)]+Ez～pz(z)[log(1-D(G(z)))]

4. The building rubbish remote sensing image recognition method based on deep learning of claim 3 is characterized in that: step three, the step of improving the recognition model and the recognition semantic segmentation algorithm by calculating the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab comprises the following steps: