CN113537033A - Building rubbish remote sensing image identification method based on deep learning - Google Patents

Building rubbish remote sensing image identification method based on deep learning Download PDF

Info

Publication number
CN113537033A
CN113537033A CN202110785190.2A CN202110785190A CN113537033A CN 113537033 A CN113537033 A CN 113537033A CN 202110785190 A CN202110785190 A CN 202110785190A CN 113537033 A CN113537033 A CN 113537033A
Authority
CN
China
Prior art keywords
remote sensing
sensing image
network
data set
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110785190.2A
Other languages
Chinese (zh)
Inventor
颜子健
董静薇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin University of Science and Technology
Original Assignee
Harbin University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin University of Science and Technology filed Critical Harbin University of Science and Technology
Priority to CN202110785190.2A priority Critical patent/CN113537033A/en
Publication of CN113537033A publication Critical patent/CN113537033A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Biophysics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

A building rubbish remote sensing image recognition method based on deep learning belongs to the field of remote sensing image recognition. The existing remote sensing image identification method is easy to interfere, and the whole information can not be mined, so that the identification precision is low. A building rubbish remote sensing image recognition method based on deep learning comprises the steps of preprocessing an obtained remote sensing image to obtain a remote sensing image data set; expanding a remote sensing image data set sample, adding an L2 regularization punishment item to the seventh layer of the neural network, and training a network model added with an L2 regularization punishment item by using the expanded data set to obtain a target identification model; the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab is calculated to realize the improvement of the semantic segmentation algorithm; and image recognition is carried out by utilizing an improved recognition model and an algorithm. The method identifies the accuracy draft, can monitor the processing progress of illegal stacking, and realizes dynamic tracking monitoring and purification of the urban environment.

Description

Building rubbish remote sensing image identification method based on deep learning
Technical Field
The invention relates to a building rubbish remote sensing image identification method based on deep learning.
Background
The remote sensing image identification roughly goes through the following processes: traditional remote sensing image identification methods based on pixel, such as maximum likelihood method and K-Means mean value method, but the image spectrum brightness information is easy to be interfered, and the whole information can not be mined, so that the method is easy to generate 'salt and pepper noise', and is only used as a contrast item or a preprocessing method at present; based on the object-oriented remote sensing identification method, although the advantage of rich attribute features of the polygonal object is exerted, the object is easy to over-segment or under-segment, and the segmentation scale is not easy to determine; the image semantics based on the image elements are segmented into popular research directions for the current remote sensing image recognition, the characteristics of strong self-learning capability and fault-tolerant capability of the image semantics based on the image elements are derived from deep learning, and the research and implementation of thousands of classification methods are established.
However, the effectiveness of the deep learning method also depends on the richness of the training data, and a large amount of sample data becomes a necessary condition for research. The sample expansion can be obtained by using a simple data enhancement method such as inversion, and can also be obtained by using a machine learning method such as generation countermeasure network to perform picture synthesis. In addition, the deep learning related research method often requires that the feature distribution of training data and test data is similar, but the requirement is difficult to achieve in practical project application, so the deep learning method is difficult to be applied in engineering projects due to the data requirement. Migration learning is a branch of deep learning research methods and is a great research hotspot at present.
Disclosure of Invention
The invention aims to solve the problems that the existing remote sensing image identification method is easy to interfere and low in identification precision due to the fact that the existing remote sensing image identification method cannot mine whole information, and provides a building rubbish remote sensing image identification method based on deep learning.
A building rubbish remote sensing image identification method based on deep learning is achieved through the following steps:
firstly, preprocessing an acquired remote sensing image to obtain a remote sensing image data set;
expanding a remote sensing image data set sample, adding an L2 regularization punishment item into the seventh layer of the neural network, and training a network model added with an L2 regularization punishment item by using the expanded data set to obtain a target identification model;
step three, improving a semantic segmentation algorithm by calculating the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab;
and step four, carrying out image recognition by using the improved recognition model and the improved algorithm.
Preferably, the step of preprocessing the acquired remote sensing image to obtain a remote sensing image data set in the step one includes:
and performing remote sensing image preprocessing operation of orthorectification and image fusion on the remote sensing image by adopting an ENVI platform, and performing histogram equalization operation on the result data.
Preferably, the step of expanding the remote sensing image dataset sample described in the step two fuses features of a plurality of images in a manner of improving generation of a countermeasure network, and the specific steps include:
the GAN generates new data on the basis of the original data set, and the GAN generation countermeasure network comprises two models: generating a model and a discrimination model, wherein the representative symbols of the two models are G and D respectively; the game implementation using these two models generates a competing network,
and if z is random noise and x is real data, the generative network and the discriminant network are respectively represented by G and D, wherein D can be regarded as a two-classifier, and then the cross entropy is adopted for representation, and the description is as follows:
minmaxV=Ex~pdata(x)[logD(x)]+Ez~pz(z)[log(1-D(G(z)))]
wherein, logD (x) of the first item represents the judgment of the discriminator on the real data, and log (1-D (G (z)) of the second item represents the synthesis and judgment of the data; through such a Max-min game, G and D are respectively optimized to train the required generating network and the required discriminating network circularly and alternately until a Nash equilibrium point is reached;
the DCGAN provides a training framework on the basis of the GAN, the DCGAN conducts training guidance on the GAN, replaces a full connection layer with a convolution layer, removes a pooling layer, and introduces development results of a discriminant model into a generated model by adopting Batch Normalization (BN) technology.
Preferably, the step three of improving the recognition model and the recognition semantic segmentation algorithm by calculating the mliou ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep lab includes:
a semantic segmentation method of deep Lab is adopted, the condition randomness and the convolution neural network are combined, the full convolution network is used as a basis, and each layer is continuously optimized; then, the mIOU ratio of the intersection and union of the real value and the predicted value is calculated, and the improvement of the semantic segmentation algorithm is realized.
The invention has the beneficial effects that:
the invention applies the deep learning algorithm to the semantic segmentation of the remote sensing image and to the identification of the construction waste in the remote sensing image, thereby saving more manpower and material resources. The method is characterized in that a sample expansion experiment for image generation is carried out aiming at the problem of few samples of a building remote sensing data set, urban building rubbish is detected from the aspect of semantic segmentation of remote sensing images, a reliable data source expansion method is provided for urban building rubbish remote sensing monitoring, and technical support is provided for building rubbish stockpiling management. Meanwhile, the earth observation remote sensing technology has the characteristics of long-distance detection, large-area coverage, short revisit period and the like, whether the construction waste is cleared or not can be found quickly through the research, the current situation information such as the stacking area of the construction waste can be mastered, the illegal stacking processing progress can be monitored, and the urban environment can be dynamically tracked, monitored and purified.
Drawings
FIG. 1 is a flow chart of a method of the present invention;
FIG. 2 is a basic model for generating a countermeasure network to which the present invention relates;
FIG. 3 is a diagram of a DCGAN generator according to the present invention;
FIG. 4 shows the structure of DeepLabv3ASPP according to the present invention.
Detailed Description
The first embodiment is as follows:
in the embodiment, as shown in fig. 1, the method for identifying the remote sensing image of the construction waste based on deep learning is implemented by the following steps:
firstly, preprocessing an acquired remote sensing image to obtain a remote sensing image data set;
expanding a remote sensing image data set sample, adding an L2 regularization punishment item into the seventh layer of the neural network, and training a network model added with an L2 regularization punishment item by using the expanded data set to obtain a target identification model;
step three, improving a semantic segmentation algorithm by calculating the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab;
and step four, carrying out image recognition by using the improved recognition model and the improved algorithm.
The second embodiment is as follows:
different from the first specific embodiment, in the first method for identifying remote sensing images of construction waste based on deep learning of the present embodiment, the step of preprocessing the acquired remote sensing images to obtain a remote sensing image data set includes:
due to the influence of the overall positioning precision of a remote sensing information platform and the error rate of a sensor, the reason that layers of panchromatic images and multispectral images in a satellite remote sensing technology image can not be aligned and the like, more preprocessing operations are needed to be carried out on the satellite remote sensing image, the remote sensing image preprocessing operations of orthorectification and image fusion are carried out on the remote sensing image by adopting an ENVI platform, and histogram equalization operation is carried out on the result data; the relative position precision of the remote sensing data is improved, the image quality is improved, and the data characteristics are enhanced.
The third concrete implementation mode:
different from the first or second specific embodiments, in the method for identifying the remote sensing images of the construction waste based on deep learning of the second embodiment, the step of expanding the remote sensing image data set sample fuses the features of the plurality of images in a mode of improving and generating a countermeasure network, and the specific steps include:
the method is characterized in that a sample expansion experiment of image generation is carried out aiming at the problem of few samples of a building remote sensing data set, urban building rubbish is detected from the aspect of semantic segmentation of remote sensing images, a reliable data source expansion method is provided for urban building rubbish remote sensing monitoring, and technical support is provided for building rubbish stockpiling management. The generation of the confrontation network is improved, the semantic segmentation network precision is improved, and the production image is closer to a real image.
Common data expansion is implemented by flipping, randomly cropping, rotating, locally distorting the image, and using GAN (creating a competing network) methods. Training of artificial intelligence requires a large number of data sets, which can be costly if collected and labeled all by human labor.
The GAN generates new data on the basis of the original data set so as to train a more robust model, and the GAN generation countermeasure network comprises two models: generating a model and a discrimination model, wherein the representative symbols of the two models are G and D respectively; the game using the two models realizes the generation of the countermeasure network, thereby leading the two models to improve the overall competition effect.
And if z is random noise and x is real data, the generative network and the discriminant network are respectively represented by G and D, wherein D can be regarded as a two-classifier, and then the cross entropy is adopted for representation, and the description is as follows:
minmaxV=Ex~pdata(x)[logD(x)]+Ez~pz(z)[log(1-D(G(z)))]
wherein, logD (x) of the first item represents the judgment of the discriminator on the real data, and log (1-D (G (z)) of the second item represents the synthesis and judgment of the data; through such a Max-min game, G and D are respectively optimized to train the required generating network and the required discriminating network circularly and alternately until a Nash equilibrium point is reached; generating a basic model of the countermeasure network is shown in FIG. 2;
the DCGAN provides a training framework on the basis of the GAN, the DCGAN conducts training guidance on the GAN, replaces a full connection layer with a convolution layer, removes a pooling layer, and introduces development results of a discriminant model into a generated model by adopting Batch Normalization (BN) technology. The structure of the DCGAN generator is shown in fig. 3. In addition, DCGAN also emphasizes the importance and guidance of hidden layer analysis and visual counting on GAN training
The fourth concrete implementation mode:
different from the third specific embodiment, in the method for identifying a building waste remote sensing image based on deep learning of the third embodiment, the step of improving the identification model and the identification semantic segmentation algorithm by calculating the mliou ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep lab includes:
by adopting a semantic segmentation method of deep Lab, by combining conditional randomness and a convolutional neural network, and using a full convolutional network as a basis, each layer is continuously optimized, so that the change of a receptive field can be ensured in the semantic segmentation process, and the position information in the aspect of space can also be reserved; then, the mIOU ratio of the intersection and union of the real value and the predicted value is calculated, and the improvement of the semantic segmentation algorithm is realized, so that the accuracy is increased.
The Deeplab can extract different features and control different resolutions, an algorithm has a unique solution, and each level is continuously optimized by combining conditional randomness and a convolutional neural network and using a full convolutional network as a basis, so that the change of a receptive field can be ensured in a semantic segmentation process, and the position information in the aspect of space can be also reserved. The DeepLabv3ASPP structure is shown in FIG. 4.
The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes may be made to the present invention by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (4)

1. A building rubbish remote sensing image identification method based on deep learning is characterized in that: the method is realized by the following steps:
firstly, preprocessing an acquired remote sensing image to obtain a remote sensing image data set;
expanding a remote sensing image data set sample, adding an L2 regularization punishment item into the seventh layer of the neural network, and training a network model added with an L2 regularization punishment item by using the expanded data set to obtain a target identification model;
step three, improving a semantic segmentation algorithm by calculating the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab;
and step four, carrying out image recognition by using the improved recognition model and the improved algorithm.
2. The building rubbish remote sensing image recognition method based on deep learning of claim 1, characterized in that: the step one of preprocessing the acquired remote sensing image to obtain a remote sensing image data set comprises the following steps:
and performing remote sensing image preprocessing operation of orthorectification and image fusion on the remote sensing image by adopting an ENVI platform, and performing histogram equalization operation on the result data.
3. The construction waste remote sensing image recognition method based on deep learning according to claim 1 or 2, characterized in that: step two the step of expanding the remote sensing image data set sample fuses the characteristics of a plurality of images in a mode of improving and generating a countermeasure network, and the specific steps comprise:
the GAN generates new data on the basis of the original data set, and the GAN generation countermeasure network comprises two models: generating a model and a discrimination model, wherein the representative symbols of the two models are G and D respectively; the game implementation using these two models generates a competing network,
and if z is random noise and x is real data, the generative network and the discriminant network are respectively represented by G and D, wherein D can be regarded as a two-classifier, and then the cross entropy is adopted for representation, and the description is as follows:
minmaxV=Ex~pdata(x)[logD(x)]+Ez~pz(z)[log(1-D(G(z)))]
wherein, logD (x) of the first item represents the judgment of the discriminator on the real data, and log (1-D (G (z)) of the second item represents the synthesis and judgment of the data; through such a Max-min game, G and D are respectively optimized to train the required generating network and the required discriminating network circularly and alternately until a Nash equilibrium point is reached;
the DCGAN provides a training framework on the basis of the GAN, the DCGAN conducts training guidance on the GAN, replaces a full connection layer with a convolution layer, removes a pooling layer, and introduces development results of a discriminant model into a generated model by adopting Batch Normalization (BN) technology.
4. The building rubbish remote sensing image recognition method based on deep learning of claim 3 is characterized in that: step three, the step of improving the recognition model and the recognition semantic segmentation algorithm by calculating the mIOU ratio of the intersection and the union of the real value and the predicted value in the semantic segmentation method of deep Lab comprises the following steps:
a semantic segmentation method of deep Lab is adopted, the condition randomness and the convolution neural network are combined, the full convolution network is used as a basis, and each layer is continuously optimized; then, the mIOU ratio of the intersection and union of the real value and the predicted value is calculated, and the improvement of the semantic segmentation algorithm is realized.
CN202110785190.2A 2021-07-12 2021-07-12 Building rubbish remote sensing image identification method based on deep learning Pending CN113537033A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110785190.2A CN113537033A (en) 2021-07-12 2021-07-12 Building rubbish remote sensing image identification method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110785190.2A CN113537033A (en) 2021-07-12 2021-07-12 Building rubbish remote sensing image identification method based on deep learning

Publications (1)

Publication Number Publication Date
CN113537033A true CN113537033A (en) 2021-10-22

Family

ID=78127465

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110785190.2A Pending CN113537033A (en) 2021-07-12 2021-07-12 Building rubbish remote sensing image identification method based on deep learning

Country Status (1)

Country Link
CN (1) CN113537033A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115147729A (en) * 2022-06-27 2022-10-04 浙江大学 Solid waste landfill site risk identification method based on sky ground data and intelligent algorithm
CN116091953A (en) * 2023-04-11 2023-05-09 耕宇牧星(北京)空间科技有限公司 Building rubbish identification method based on grouping wavelet calibration network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110598784A (en) * 2019-09-11 2019-12-20 北京建筑大学 Machine learning-based construction waste classification method and device
CN111027445A (en) * 2019-12-04 2020-04-17 安徽工程大学 Target identification method for marine ship
CN112489054A (en) * 2020-11-27 2021-03-12 中北大学 Remote sensing image semantic segmentation method based on deep learning
CN112861752A (en) * 2021-02-23 2021-05-28 东北农业大学 Crop disease identification method and system based on DCGAN and RDN

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110598784A (en) * 2019-09-11 2019-12-20 北京建筑大学 Machine learning-based construction waste classification method and device
CN111027445A (en) * 2019-12-04 2020-04-17 安徽工程大学 Target identification method for marine ship
CN112489054A (en) * 2020-11-27 2021-03-12 中北大学 Remote sensing image semantic segmentation method based on deep learning
CN112861752A (en) * 2021-02-23 2021-05-28 东北农业大学 Crop disease identification method and system based on DCGAN and RDN

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115147729A (en) * 2022-06-27 2022-10-04 浙江大学 Solid waste landfill site risk identification method based on sky ground data and intelligent algorithm
CN116091953A (en) * 2023-04-11 2023-05-09 耕宇牧星(北京)空间科技有限公司 Building rubbish identification method based on grouping wavelet calibration network

Similar Documents

Publication Publication Date Title
Chen et al. Vehicle detection in high-resolution aerial images via sparse representation and superpixels
Sánchez-Rodríguez et al. Automated detection and decomposition of railway tunnels from Mobile Laser Scanning Datasets
CN112232199A (en) Wearing mask detection method based on deep learning
CN113537033A (en) Building rubbish remote sensing image identification method based on deep learning
CN113689445B (en) High-resolution remote sensing building extraction method combining semantic segmentation and edge detection
Kahraman et al. Road extraction techniques from remote sensing images: A review
CN113505670A (en) Remote sensing image weak supervision building extraction method based on multi-scale CAM and super-pixels
Babawuro et al. Satellite imagery cadastral features extractions using image processing algorithms: A viable option for cadastral science
Yang et al. C-RPNs: Promoting object detection in real world via a cascade structure of Region Proposal Networks
Tran et al. Pp-linknet: Improving semantic segmentation of high resolution satellite imagery with multi-stage training
CN110929670A (en) Muck truck cleanliness video identification and analysis method based on yolo3 technology
Xiao et al. 3D urban object change detection from aerial and terrestrial point clouds: A review
Yang et al. Superpixel image segmentation-based particle size distribution analysis of fragmented rock
CN105469099B (en) Pavement crack detection and identification method based on sparse representation classification
Yadav et al. Building change detection using multi-temporal airborne LiDAR data
Su et al. Which CAM is better for extracting geographic objects? A perspective from principles and experiments
Sharifi et al. Efficiency evaluating of automatic lineament extraction by means of remote sensing (Case study: Venarch, Iran)
Wang et al. Pavement crack detection using attention u-net with multiple sources
Wolf et al. Applicability of neural networks for image classification on object detection in mobile mapping 3d point clouds
Xu et al. Crowd density estimation based on improved Harris & OPTICS Algorithm
Chen et al. All-in-one YOLO architecture for safety hazard detection of environment along high-speed railway
Böhm et al. Diskmask: Focusing object features for accurate instance segmentation of elongated or overlapping objects
Li et al. Evaluation of super-resolution on bird detection performance based on deep convolutional networks
Yan et al. Building Extraction at Amodal-Instance-Segmentation Level: Datasets and Framework
Purohit et al. ConeQuest: A Benchmark for Cone Segmentation on Mars

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20211022

WD01 Invention patent application deemed withdrawn after publication