CN110059772B - Remote sensing image semantic segmentation method based on multi-scale decoding network - Google Patents

Remote sensing image semantic segmentation method based on multi-scale decoding network Download PDF

Info

Publication number
CN110059772B
CN110059772B CN201910397121.7A CN201910397121A CN110059772B CN 110059772 B CN110059772 B CN 110059772B CN 201910397121 A CN201910397121 A CN 201910397121A CN 110059772 B CN110059772 B CN 110059772B
Authority
CN
China
Prior art keywords
network
convolution
remote sensing
pooling
sensing image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910397121.7A
Other languages
Chinese (zh)
Other versions
CN110059772A (en
Inventor
张笑钦
肖智恒
李东阳
樊明宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wenzhou University
Original Assignee
Wenzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wenzhou University filed Critical Wenzhou University
Priority to CN201910397121.7A priority Critical patent/CN110059772B/en
Publication of CN110059772A publication Critical patent/CN110059772A/en
Application granted granted Critical
Publication of CN110059772B publication Critical patent/CN110059772B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a remote sensing image semantic segmentation method based on a multi-scale decoding network, which comprises the following steps of: randomly cutting a high-resolution remote sensing image used for training and a label graph corresponding to the high-resolution remote sensing image into small images, dividing a network structure into two parts, namely encoding and multi-scale decoding, doubling the resolution of encoded information through an inverse pooling path and an inverse convolution path, connecting the encoded information with a result of cavity convolution through a channel, recovering a characteristic image to the original size through inverse convolution up-sampling, inputting an output label graph into a PPB (Peer-to-Peer) module to perform multi-scale aggregation processing, and finally updating network parameters in a random gradient reduction mode by taking cross entropy as a loss function; and inputting the small images sequentially cut from the test picture into a neural network to predict the corresponding label graphs, and splicing the label graphs into the original size. According to the technical scheme, the segmentation precision of the model is improved, the complexity of the network is reduced, and the training time is saved.

Description

Remote sensing image semantic segmentation method based on multi-scale decoding network
Technical Field
The invention relates to the technical field of machine vision, in particular to a remote sensing image semantic segmentation method based on a multi-scale decoding network.
Background
Semantic segmentation is an important issue of general attention in the fields of unmanned driving, medical image analysis, geographic information systems and the like. Semantic segmentation is to segment different objects in a picture from the level of pixels, label each pixel in an original picture, and classify the pixel into different labels, and the segmentation precision includes understanding of information in the picture. The remote sensing image has the characteristics of complex imaging, high picture pixel and large information amount, so that how to rapidly and accurately extract useful information from the remote sensing image by using an artificial intelligence technology is a research hotspot in the field of machine vision.
Semantic segmentation based on neural networks has been studied more. FCN (full probabilistic network) is a classic framework for image semantic segmentation, which is trained in an end-to-end manner and used for semantic segmentation of a trained classification network; to restore the resolution of the image, the FCN also upsamples using deconvolution. Unlike FCN, SegNet upsamples using an inverse pooling method, so that the network parameters are much less than FCN. Compared with FCN and SegNet, U-Net has a more symmetric coding and decoding structure, and the jump connection from the coding to the decoding part facilitates the recovery of position information, but also makes the network structure complex, requiring more training time. The network structure often uses pooling to increase the receptive field, but pooling causes a decrease in spatial resolution when the receptive field is increased. Although the receptive field is enlarged and the loss of resolution is avoided through the hole convolution, and the information with different scales can be captured by utilizing the convolution with different hole rates, the hole convolution adopts a sparse sampling mode to cause the local information to be lost, so that the long-distance information lacks correlation. In semantic segmentation a large field can provide more global information, but local information is ignored. How to balance the sizes of the receptive fields is considered as one of the keys for improving the semantic segmentation precision, however, on the premise of ensuring the segmentation precision, reducing the complexity of the model and the training time are also the problems to be considered.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a remote sensing image semantic segmentation method based on a multi-scale decoding network, which can improve the segmentation precision of a model, reduce the complexity of the network and save the training time.
In order to achieve the purpose, the invention provides the following technical scheme: a remote sensing image semantic segmentation method based on a multi-scale decoding network comprises the following steps:
(1) randomly cutting a high-resolution remote sensing image used for training and a label graph corresponding to the high-resolution remote sensing image into small images of 256 multiplied by 256 pixels, wherein the cut images are divided into two parts, one part is used as a training set of a network, and the other part is used as a verification set;
(2) the network structure is divided into two parts of encoding and decoding, the front 16 layers of VGG16 of a classification network are used as an encoding network, the decoding network is composed of three paths of an inverse pooling path, an inverse convolution path and a void convolution path, the resolution of encoded information is doubled through the inverse pooling path and the inverse convolution path, the encoded information and the void convolution result are subjected to channel connection, a characteristic image is restored to the original size through inverse convolution upsampling, an output label image is input into a PPB module to be subjected to multi-scale aggregation processing, and finally, the network parameters are updated in a random gradient descending mode by taking cross entropy as a loss function;
(3) the test picture is sequentially cut into small images of 256 multiplied by 256 pixels, the small images are input to a neural network to predict corresponding label graphs, and then the label graphs are spliced into an original size.
Preferably, step (2) comprises the sub-steps of:
(1.1) randomly cutting the high-pixel remote sensing image into image fragments with specified sizes;
and (1.2) adopting the first 16 layers of the VGG network as an encoding network to extract semantic features of the preprocessed image fragments.
Preferably, step (2) further comprises the sub-steps of:
(2.1) recovering the size of the characteristic image by deconvolution and inverse pooling, combining the deconvolution with the inverse pooling for up-sampling, adding the inverse pooling after the fifth pooling of the VGG network, and obtaining a first characteristic map by convolution of 3 × 3 and 1 × 1;
(2.2) after the fifth pooling of the VGG network, connecting convolution of 3 x 3 and 1 x 1, expanding the size of the feature map by deconvolution of 4 x 4 with the step size of 2, cutting the feature map according to the size of the first feature map to obtain a second feature map, and after the fourth pooling of the VGG network, generating a third feature map by convolution of 3 x 3 with the hole rate of 2;
(2.3) connecting the characteristic graphs generated by the 3 paths, and integrating information of different scales to enable the network to select an optimal combination; and then, restoring the feature map to the original size by using convolution of step size 16 and 32 multiplied by 32, and outputting a prediction label through a softmax layer to obtain a semantic segmentation image.
Preferably, the method for manufacturing the prediction tag comprises the following steps:
(3.1) performing 3 x 3 convolution on the decoded label, and performing down-sampling on the obtained feature map through global average pooling of different scales;
(3.2) up-sampling the down-sampling result, and aggregating the down-sampling result into an eigentensor in a depth connection mode;
(3.3) reducing dimensions by using 1 × 1 convolution to obtain a prediction label.
Preferably, the invention adopts a computer with an Intel Core-i5 central processing unit and 4 gigabytes of memory and establishes an algorithm framework for migrating the semantic segmentation of the remote sensing image of the VGG network by using Matlab language.
Preferably, the parameters are updated by a random gradient descent with momentum of 0.9, using cross entropy as a loss function.
The invention has the advantages that: compared with the prior art, the method has the advantages that,
1. the semantic segmentation model of the remote sensing image based on the VGG provided by the invention has a good segmentation effect on the remote sensing image with high resolution.
2. The remote sensing image semantic segmentation model provided by the invention greatly reduces the time consumption of network training on the premise of ensuring the segmentation precision.
3. The decoding mode combining the three paths also provides a new idea for semantic segmentation of the image.
The invention is further described with reference to the drawings and the specific embodiments in the following description.
Drawings
FIG. 1 is a schematic diagram of a remote sensing image semantic segmentation model for migrating a VGG network in an embodiment of the present invention;
FIG. 2 is a schematic diagram of a network architecture according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a PPB module according to an embodiment of the present invention;
FIG. 4 is a graph illustrating a loss function according to an embodiment of the present invention;
FIG. 5 is a graph illustrating the verification accuracy according to an embodiment of the present invention.
Detailed Description
In the description of the present embodiment, it should be noted that, as the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", "front", "rear", etc. appear, the indicated orientation or positional relationship thereof is based on the orientation or positional relationship shown in the drawings, and is only for convenience of description and simplification of description, but does not indicate or imply that the indicated device or element must have a specific orientation, be constructed and operated in a specific orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" as appearing herein are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
Referring to fig. 1, fig. 2, fig. 3, fig. 4 and fig. 5, the invention discloses a remote sensing image semantic segmentation method based on a multi-scale decoding network, which comprises the following steps:
(1) randomly cutting a high-resolution remote sensing image used for training and a label graph corresponding to the high-resolution remote sensing image into small images of 256 multiplied by 256 pixels, wherein the cut images are divided into two parts, one part is used as a training set of a network, and the other part is used as a verification set;
(2) the network structure is divided into two parts of encoding and decoding, the front 16 layers of VGG16 of a classification network are used as an encoding network, the decoding network is composed of three paths of an inverse pooling path, an inverse convolution path and a void convolution path, the resolution of encoded information is doubled through the inverse pooling path and the inverse convolution path, the encoded information and the void convolution result are subjected to channel connection, a characteristic image is restored to the original size through inverse convolution upsampling, an output label image is input into a PPB module to be subjected to multi-scale aggregation processing, and finally, the network parameters are updated in a random gradient descending mode by taking cross entropy as a loss function;
(3) the test picture is sequentially cut into small images of 256 multiplied by 256 pixels, the small images are input to a neural network to predict corresponding label graphs, and then the label graphs are spliced into an original size.
The following 4 indexes are adopted for quantitatively evaluating the quality of image segmentation
Global precision: sigmainii/∑iti
Average precision: (1/n)cl)∑inii/ti
Average overlapping ratio: (1/n)cl)∑inii/(ti+∑jnjj-nii)
Weighted overlap ratio: (Sigma)ktk)-1itinii/(ti+∑jnij-nii)
Wherein n isijIs the number of i-type pixel points predicted as j-type, and the total number of i-type pixel points is
Figure GDA0002959068350000061
Preferably, step (2) comprises the sub-steps of:
(1.1) randomly cutting the high-pixel remote sensing image into image fragments with specified sizes;
and (1.2) adopting the first 16 layers of the VGG network as an encoding network to extract semantic features of the preprocessed image fragments.
Preferably, step (2) further comprises the sub-steps of:
(2.1) recovering the size of the characteristic image by deconvolution and inverse pooling, combining the deconvolution with the inverse pooling for up-sampling, adding the inverse pooling after the fifth pooling of the VGG network, and obtaining a first characteristic map by convolution of 3 × 3 and 1 × 1;
(2.2) after the fifth pooling of the VGG network, connecting convolution of 3 x 3 and 1 x 1, expanding the size of the feature map by deconvolution of 4 x 4 with the step size of 2, cutting the feature map according to the size of the first feature map to obtain a second feature map, and after the fourth pooling of the VGG network, generating a third feature map by convolution of 3 x 3 with the hole rate of 2;
(2.3) connecting the characteristic graphs generated by the 3 paths, and integrating information of different scales to enable the network to select an optimal combination; and then, restoring the feature map to the original size by using convolution of step size 16 and 32 multiplied by 32, and outputting a prediction label through a softmax layer to obtain a semantic segmentation image.
Preferably, the method for manufacturing the prediction tag comprises the following steps:
(3.1) performing 3 x 3 convolution on the decoded label, and performing down-sampling on the obtained feature map through global average pooling of different scales;
(3.2) up-sampling the down-sampling result, and aggregating the down-sampling result into an eigentensor in a depth connection mode;
(3.3) reducing dimensions by using 1 × 1 convolution to obtain a prediction label.
Preferably, the invention adopts a computer with an Intel Core-i5 central processing unit and 4 gigabytes of memory and establishes an algorithm framework for migrating the semantic segmentation of the remote sensing image of the VGG network by using Matlab language.
Preferably, the parameters are updated by a random gradient descent with momentum of 0.9, using cross entropy as a loss function.
The present embodiment is described in further detail below:
conv: convolution operation (convolution)
Pooling: an operation similar to downsampling;
ReLu: an activation function, mathematical form max (0, x);
softmax: suppose V is an array, ViIs the ith element of V, which is mathematically represented as:
Figure GDA0002959068350000071
deconv: a transposed convolution operation (deconvolution) may be used for the upsampling.
Un zooling: and (4) inverse pooling, which can be used for upsampling.
Scaled Conv: and (4) carrying out void convolution, wherein the cavity rate is utilized to improve the receptive field of a convolution result without reducing the resolution.
Details of the network architecture:
the pooling layer is typically used to extract abstract features and filter out noisy activations but can cause input feature resolution contraction and information loss. Deconvolution and inverse pooling are commonly used to recover the size of the feature image, deconvolution and inverse pooling are combined to perform upsampling, inverse pooling is added after the fifth pooling of the VGG network, and convolution of 3 × 3 and 1 × 1 is used to obtain a first feature map; in addition, after the fifth pooling of the VGG network, the convolution of 3 x 3 and 1 x 1 is carried out, then the size of the feature map is enlarged by the deconvolution of 4 x 4 with the step size of 2, and the feature map is cut according to the size of the first feature map to obtain a second feature map.
A third profile is generated after the fourth pooling of VGG networks using a convolution of 3 × 3 with a hole rate of 2.
And finally, connecting the third dimensions of the characteristic diagrams generated by the 3 paths, and integrating information of different dimensions to enable the network to select an optimal combination. The feature map is then restored to the original size by 32 x 32 convolution of step size 16.
As shown in fig. 3, after convolution processing of 3 × 3, performing global average pooling of 4 times, 8 times, 16 times and 32 times on the output features respectively to construct 4 pooled pyramids, finally performing dimensionality reduction by using 1 × 1 convolution, and outputting prediction label mapping through a softmax layer, that is, outputting prediction labels through the softmax layer to obtain a semantic segmentation image.
The remote sensing image semantic segmentation model of the migration VGG network adopts cross entropy as a loss function, and updates parameters through random gradient descent with momentum of 0.9. The loss function and the verification accuracy of the network training process are shown in fig. 4 and 5.
The hardware and programming language for the specific operation of the method of the invention are not limited, and the writing can be completed by any language, so that other working modes are not described any more.
The semantic segmentation model of the remote sensing image based on the VGG provided by the invention has a good segmentation effect on the remote sensing image with high resolution.
The remote sensing image semantic segmentation model provided by the invention greatly reduces the time consumption of network training on the premise of ensuring the segmentation precision.
The decoding mode combining the three paths also provides a new idea for semantic segmentation of the image.
The above embodiments are described in detail for the purpose of further illustrating the present invention and should not be construed as limiting the scope of the present invention, and the skilled engineer can make insubstantial modifications and variations of the present invention based on the above disclosure.

Claims (4)

1. A remote sensing image semantic segmentation method based on a multi-scale decoding network is characterized by comprising the following steps: the method comprises the following steps:
(1) randomly cutting a high-resolution remote sensing image used for training and a label graph corresponding to the high-resolution remote sensing image into small images of 256 multiplied by 256 pixels, wherein the cut images are divided into two parts, one part is used as a training set of a network, and the other part is used as a verification set;
(2) the network structure is divided into two parts of encoding and multi-scale decoding, the front 16 layers of VGG16 of the classification network are used as an encoding network, the multi-scale decoding network is composed of three paths of an inverse pooling path, an inverse convolution path and a void convolution path, the resolution of encoded information is doubled through the inverse pooling path and the inverse convolution path, the encoded information and the void convolution result are subjected to channel connection, the characteristic image is restored to the original size through sampling on the inverse convolution, the output label image is input into a PPB module to be subjected to multi-scale aggregation processing, and finally, the network parameters are updated in a random gradient descending mode by taking cross entropy as a loss function;
a PPB module: performing global average pooling on the output characteristics by 4 times, 8 times, 16 times and 32 times after convolution processing of 3 × 3 respectively, constructing 4 pooled pyramids, finally performing dimensionality reduction by using 1 × 1 convolution, and outputting prediction label mapping through a softmax layer;
(3) sequentially cutting a test picture into small images of 256 multiplied by 256 pixels, inputting the small images into a neural network to predict corresponding label graphs, and splicing the label graphs into an original size;
the step (2) comprises the following substeps:
(1.1) randomly cutting the high-pixel remote sensing image into image fragments with specified sizes;
(1.2) extracting semantic features of the preprocessed image fragments by adopting the first 16 layers of the VGG network as a coding network;
the step (2) further comprises the following substeps:
(2.1) recovering the size of the characteristic image by deconvolution and inverse pooling, combining the deconvolution with the inverse pooling for up-sampling, adding the inverse pooling after the fifth pooling of the VGG network, and obtaining a first characteristic map by convolution of 3 × 3 and 1 × 1;
(2.2) after the fifth pooling of the VGG network, connecting convolution of 3 x 3 and 1 x 1, expanding the size of the feature map by deconvolution of 4 x 4 with the step size of 2, cutting the feature map according to the size of the first feature map to obtain a second feature map, and after the fourth pooling of the VGG network, generating a third feature map by convolution of 3 x 3 with the hole rate of 2;
(2.3) connecting the characteristic graphs generated by the 3 paths, integrating information of different scales, and using the information as a decoding network to enable the model to select an optimal combination; and then, restoring the feature map to the original size by using convolution of step size 16 and 32 multiplied by 32, and outputting a prediction label through a softmax layer to obtain a semantic segmentation image.
2. The remote sensing image semantic segmentation method based on the multi-scale decoding network according to claim 1, characterized in that: the manufacturing method of the prediction label comprises the following steps:
(3.1) performing 3 x 3 convolution on the decoded label, and performing down-sampling on the obtained feature map through global average pooling of different scales;
(3.2) up-sampling the down-sampling result, and aggregating the down-sampling result into an eigentensor in a depth connection mode;
(3.3) reducing dimensions by using 1 × 1 convolution to obtain a prediction label.
3. The remote sensing image semantic segmentation method based on the multi-scale decoding network according to claim 1, characterized in that: a computer with an Intel Core-i5 central processing unit and a 4G byte memory is adopted, and an Matlab language is used for constructing an algorithm framework for semantic segmentation of remote sensing images of a multi-scale decoding network.
4. The remote sensing image semantic segmentation method based on the multi-scale decoding network according to claim 1, characterized in that: the parameters were updated by a random gradient descent with momentum of 0.9 using cross entropy as a loss function.
CN201910397121.7A 2019-05-14 2019-05-14 Remote sensing image semantic segmentation method based on multi-scale decoding network Active CN110059772B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910397121.7A CN110059772B (en) 2019-05-14 2019-05-14 Remote sensing image semantic segmentation method based on multi-scale decoding network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910397121.7A CN110059772B (en) 2019-05-14 2019-05-14 Remote sensing image semantic segmentation method based on multi-scale decoding network

Publications (2)

Publication Number Publication Date
CN110059772A CN110059772A (en) 2019-07-26
CN110059772B true CN110059772B (en) 2021-04-30

Family

ID=67323037

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910397121.7A Active CN110059772B (en) 2019-05-14 2019-05-14 Remote sensing image semantic segmentation method based on multi-scale decoding network

Country Status (1)

Country Link
CN (1) CN110059772B (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110503052A (en) * 2019-08-27 2019-11-26 西南交通大学 A kind of image, semantic dividing method based on improvement U-NET network
CN110569790B (en) * 2019-09-05 2023-03-28 中国人民解放军61540部队 Residential area element extraction method based on texture enhancement convolutional network
US11514694B2 (en) 2019-09-20 2022-11-29 Samsung Electronics Co., Ltd. Teaching GAN (generative adversarial networks) to generate per-pixel annotation
CN110674742B (en) * 2019-09-24 2023-04-07 电子科技大学 Remote sensing image road extraction method based on DLinkNet
CN110781923B (en) * 2019-09-27 2023-02-07 重庆特斯联智慧科技股份有限公司 Feature extraction method and device
CN110781895B (en) * 2019-10-10 2023-06-20 湖北工业大学 Image semantic segmentation method based on convolutional neural network
CN111126451A (en) * 2019-12-01 2020-05-08 复旦大学 Method for dual semantic segmentation
CN111161277B (en) * 2019-12-12 2023-04-18 中山大学 Natural image matting method based on deep learning
CN111079649B (en) * 2019-12-17 2023-04-07 西安电子科技大学 Remote sensing image ground feature classification method based on lightweight semantic segmentation network
CN111080652B (en) * 2019-12-23 2023-03-10 西安电子科技大学 Optical remote sensing image segmentation method based on multi-scale lightweight cavity convolution
CN111209813B (en) * 2019-12-27 2021-01-22 南京航空航天大学 Remote sensing image semantic segmentation method based on transfer learning
CN111259905B (en) * 2020-01-17 2022-05-31 山西大学 Feature fusion remote sensing image semantic segmentation method based on downsampling
CN111553921B (en) * 2020-02-19 2023-04-25 中山大学 Real-time semantic segmentation method based on channel information sharing residual error module
CN111368843B (en) * 2020-03-06 2022-06-10 电子科技大学 Method for extracting lake on ice based on semantic segmentation
CN111460936A (en) * 2020-03-18 2020-07-28 中国地质大学(武汉) Remote sensing image building extraction method, system and electronic equipment based on U-Net network
CN111401379A (en) * 2020-03-24 2020-07-10 北方民族大学 Deep L abv3plus-IRCNet image semantic segmentation algorithm based on coding and decoding structure
CN111798460B (en) * 2020-06-17 2023-08-01 南京信息工程大学 Satellite image segmentation method
CN111738268B (en) * 2020-07-22 2023-11-14 浙江大学 Semantic segmentation method and system for high-resolution remote sensing image based on random block
CN112070779B (en) * 2020-08-04 2022-03-15 武汉大学 Remote sensing image road segmentation method based on convolutional neural network weak supervised learning
CN111931779A (en) * 2020-08-10 2020-11-13 韶鼎人工智能科技有限公司 Image information extraction and generation method based on condition predictable parameters
CN112084923B (en) * 2020-09-01 2023-12-22 西安电子科技大学 Remote sensing image semantic segmentation method, storage medium and computing device
CN111951292B (en) * 2020-09-01 2022-11-08 吉林大学 Object surface reflection attribute extraction method, device, equipment and storage medium
CN112419322A (en) * 2020-10-16 2021-02-26 北京工业大学 Temporal bone external semicircular canal segmentation method based on 3D multi-scale multi-pooling feature fusion network
CN112508031B (en) * 2020-12-22 2022-09-02 北京航空航天大学 Unsupervised remote sensing image semantic segmentation method and model from virtual to reality
CN112861829B (en) * 2021-04-13 2023-06-30 山东大学 Water body extraction method and system based on deep convolutional neural network
CN113592885B (en) * 2021-06-29 2024-03-12 中南大学 SegNet-RS network-based large obstacle contour segmentation method
CN113706695B (en) * 2021-09-01 2023-06-23 杭州柳叶刀机器人有限公司 System and method for deep learning 3D femoral head modeling and storage medium
CN115205300B (en) * 2022-09-19 2022-12-09 华东交通大学 Fundus blood vessel image segmentation method and system based on cavity convolution and semantic fusion

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7096272B1 (en) * 2001-11-20 2006-08-22 Cisco Technology, Inc. Methods and apparatus for pooling and depooling the transmission of stream data
CN106372577A (en) * 2016-08-23 2017-02-01 北京航空航天大学 Deep learning-based traffic sign automatic identifying and marking method
CN106650690A (en) * 2016-12-30 2017-05-10 东华大学 Night vision image scene identification method based on deep convolution-deconvolution neural network
CN106682730A (en) * 2017-01-10 2017-05-17 西安电子科技大学 Network performance assessment method based on VGG16 image deconvolution
CN106981080A (en) * 2017-02-24 2017-07-25 东华大学 Night unmanned vehicle scene depth method of estimation based on infrared image and radar data
CN108062753A (en) * 2017-12-29 2018-05-22 重庆理工大学 The adaptive brain tumor semantic segmentation method in unsupervised domain based on depth confrontation study
AU2018101336A4 (en) * 2018-09-12 2018-10-11 Hu, Yuan Miss Building extraction application based on machine learning in Urban-Suburban-Integration Area
CN108830913A (en) * 2018-05-25 2018-11-16 大连理工大学 Semantic level line original text painting methods based on User Colors guidance
CN109145769A (en) * 2018-08-01 2019-01-04 辽宁工业大学 The target detection network design method of blending image segmentation feature
CN109190626A (en) * 2018-07-27 2019-01-11 国家新闻出版广电总局广播科学研究院 A kind of semantic segmentation method of the multipath Fusion Features based on deep learning
CN109614973A (en) * 2018-11-22 2019-04-12 华南农业大学 Rice seedling and Weeds at seedling image, semantic dividing method, system, equipment and medium
CN109636905A (en) * 2018-12-07 2019-04-16 东北大学 Environment semanteme based on depth convolutional neural networks builds drawing method
CN109636802A (en) * 2019-01-18 2019-04-16 天津工业大学 Pulmonary parenchyma based on depth convolutional neural networks is through CT image partition method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10198624B2 (en) * 2016-02-18 2019-02-05 Pinscreen, Inc. Segmentation-guided real-time facial performance capture

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7096272B1 (en) * 2001-11-20 2006-08-22 Cisco Technology, Inc. Methods and apparatus for pooling and depooling the transmission of stream data
CN106372577A (en) * 2016-08-23 2017-02-01 北京航空航天大学 Deep learning-based traffic sign automatic identifying and marking method
CN106650690A (en) * 2016-12-30 2017-05-10 东华大学 Night vision image scene identification method based on deep convolution-deconvolution neural network
CN106682730A (en) * 2017-01-10 2017-05-17 西安电子科技大学 Network performance assessment method based on VGG16 image deconvolution
CN106981080A (en) * 2017-02-24 2017-07-25 东华大学 Night unmanned vehicle scene depth method of estimation based on infrared image and radar data
CN108062753A (en) * 2017-12-29 2018-05-22 重庆理工大学 The adaptive brain tumor semantic segmentation method in unsupervised domain based on depth confrontation study
CN108830913A (en) * 2018-05-25 2018-11-16 大连理工大学 Semantic level line original text painting methods based on User Colors guidance
CN109190626A (en) * 2018-07-27 2019-01-11 国家新闻出版广电总局广播科学研究院 A kind of semantic segmentation method of the multipath Fusion Features based on deep learning
CN109145769A (en) * 2018-08-01 2019-01-04 辽宁工业大学 The target detection network design method of blending image segmentation feature
AU2018101336A4 (en) * 2018-09-12 2018-10-11 Hu, Yuan Miss Building extraction application based on machine learning in Urban-Suburban-Integration Area
CN109614973A (en) * 2018-11-22 2019-04-12 华南农业大学 Rice seedling and Weeds at seedling image, semantic dividing method, system, equipment and medium
CN109636905A (en) * 2018-12-07 2019-04-16 东北大学 Environment semanteme based on depth convolutional neural networks builds drawing method
CN109636802A (en) * 2019-01-18 2019-04-16 天津工业大学 Pulmonary parenchyma based on depth convolutional neural networks is through CT image partition method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
STFCN: Spatio-Temporal Fully Convolutional Neural Network for Semantic Segmentation of Street Scenes;Mohsen Fayyaz等;《semantic segmentation vgg16 hole convolution》;20170331;第493-507页 *
基于GAN的城市环境图像语义分割;侯津京;《中国优秀硕士学位论文全文数据库(信息科技辑)》;20190115(第01期);第I138-4036页 *

Also Published As

Publication number Publication date
CN110059772A (en) 2019-07-26

Similar Documents

Publication Publication Date Title
CN110059772B (en) Remote sensing image semantic segmentation method based on multi-scale decoding network
CN111259905B (en) Feature fusion remote sensing image semantic segmentation method based on downsampling
CN114943963B (en) Remote sensing image cloud and cloud shadow segmentation method based on double-branch fusion network
CN113888744A (en) Image semantic segmentation method based on Transformer visual upsampling module
CN110751111B (en) Road extraction method and system based on high-order spatial information global automatic perception
CN113486897A (en) Semantic segmentation method for convolution attention mechanism up-sampling decoding
CN111062395B (en) Real-time video semantic segmentation method
CN112258526A (en) CT (computed tomography) kidney region cascade segmentation method based on dual attention mechanism
CN116051549B (en) Method, system, medium and equipment for dividing defects of solar cell
CN111898461B (en) Time sequence behavior segment generation method
CN111401379A (en) Deep L abv3plus-IRCNet image semantic segmentation algorithm based on coding and decoding structure
CN116797787B (en) Remote sensing image semantic segmentation method based on cross-modal fusion and graph neural network
CN114022770A (en) Mountain crack detection method based on improved self-attention mechanism and transfer learning
CN113269054A (en) Aerial video analysis method based on space-time 2D convolutional neural network
CN115272940B (en) Neural network-based intelligent rock debris particle metering method and system
CN116229106A (en) Video significance prediction method based on double-U structure
CN116469100A (en) Dual-band image semantic segmentation method based on Transformer
Pham Semantic road segmentation using deep learning
CN116205962A (en) Monocular depth estimation method and system based on complete context information
CN112766099B (en) Hyperspectral image classification method for extracting context information from local to global
Zhou et al. RISTRA: Recursive Image Super-resolution Transformer with Relativistic Assessment
US20240062347A1 (en) Multi-scale fusion defogging method based on stacked hourglass network
CN115995002B (en) Network construction method and urban scene real-time semantic segmentation method
CN116310916A (en) Semantic segmentation method and system for high-resolution remote sensing city image
CN115941966A (en) Video compression method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant