CN110826597A - Remote sensing image classification method based on integrated depth Fisher vector - Google Patents

Remote sensing image classification method based on integrated depth Fisher vector Download PDF

Info

Publication number
CN110826597A
CN110826597A CN201910960279.0A CN201910960279A CN110826597A CN 110826597 A CN110826597 A CN 110826597A CN 201910960279 A CN201910960279 A CN 201910960279A CN 110826597 A CN110826597 A CN 110826597A
Authority
CN
China
Prior art keywords
remote sensing
fisher
sensing image
feature
method based
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201910960279.0A
Other languages
Chinese (zh)
Inventor
苏卫华
李博扬
张世月
赵欣然
李秉宣
黄如强
蹇锐
刘洋
谢鹏发
李世国
卫家诚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin (binhai) Intelligence Military-Civil Integration Innovation Center
National Defense Technology Innovation Institute PLA Academy of Military Science
Original Assignee
Tianjin (binhai) Intelligence Military-Civil Integration Innovation Center
National Defense Technology Innovation Institute PLA Academy of Military Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin (binhai) Intelligence Military-Civil Integration Innovation Center, National Defense Technology Innovation Institute PLA Academy of Military Science filed Critical Tianjin (binhai) Intelligence Military-Civil Integration Innovation Center
Priority to CN201910960279.0A priority Critical patent/CN110826597A/en
Publication of CN110826597A publication Critical patent/CN110826597A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217Validation; Performance evaluation; Active pattern learning techniques

Abstract

The invention discloses a remote sensing image classification method based on integrated depth Fisher vectors, which can solve the problems of long classification time and low classification precision of the current remote sensing image classification algorithm. The method comprises the following steps of (1) feature extraction; (2) fisher feature encoding; (3) feature concatenation and classification. The invention has the beneficial effects that: the algorithm provided by the invention is subjected to precision evaluation from a quantitative point of view, and compared with the existing algorithm, the algorithm achieves excellent classification effect, and the accuracy results of 98.81% and 95.21% are respectively obtained on UCM and RSSCN 7. From the aspect of classification precision, the algorithm provided by the invention can accurately realize remote sensing image classification.

Description

Remote sensing image classification method based on integrated depth Fisher vector
Technical Field
The invention belongs to an image classification method, and particularly relates to a remote sensing image classification method based on an integrated depth Fisher vector.
Background
With the continuous development of high-resolution remote sensing image acquisition technology, a large number of remote sensing satellite images with high spatial resolution are easier to obtain, and classification and identification based on high-precision remote sensing images are widely applied to the military and civil dual-purpose fields of land resource management, city planning, dangerous environment reconnaissance and the like, so that huge practical value is exerted.
Currently, generalized solutions to such problems mainly include three categories:
the first type is a classification scheme based on manual features, semantic descriptions based on the feature construction principle are generated by adopting manually designed manual features such as HOG, LBP and the like, and then the semantic descriptions are put into a pre-trained classifier to obtain a classification result.
The second type is an end-to-end-based classification scheme, which abandons the idea of artificially selecting a feature descriptor in the first type method, hands the work of optimizing feature learning to network self-learning, and autonomously completes the task of optimizing network iteration through a back propagation method.
The third category is a classification scheme based on deep learning features, a deep learning model trained by million-level images has the generalized cognitive ability on everything, a pre-trained deep learning model is used as a feature extractor, higher-level semantic features are extracted, and then the pre-trained deep learning model is placed into a pre-trained classifier for classification.
However, the prior art has at least the following disadvantages and shortcomings after being researched:
the first category of methods has the disadvantages: the high-precision remote sensing terrain classification task has the difficulties of large intra-class difference, small inter-class difference, variable scale, variable geometry, variable scene and the like, some terrain images have strong visual deceptiveness, and the classification scheme adopting manual features faces the problems of low semantic level and difficulty in realizing high-precision classification.
The second category of methods has the disadvantages: the existing remote sensing terrain data set has small data storage, the existing mainstream benchmarking data set only has thousands of data, and for the classification problem of small samples, the adoption of an end-to-end classification scheme is easy to cause overfitting of a network, so that the generalization capability of the network is poor, and the classification precision is low.
The third category of methods has the disadvantages: although the existing scheme based on the pre-training deep learning model is greatly improved compared with the former two schemes, the problems of long operation time, insufficient post-processing method and the like still exist, and the classification precision still needs to be further improved.
Therefore, in order to further improve the effect of terrain classification of the high-precision remote sensing image, high-level semantic information needs to be further mined, multi-scale image information is integrated, global and local semantic information is integrated, and a diversified preprocessing/postprocessing method is integrated on the basis of a deep learning feature extraction scheme, so that the semantic recognition capability of a network on the high-precision remote sensing image features is further improved.
Disclosure of Invention
The invention aims to provide a remote sensing image classification method based on integrated depth Fisher vectors, which can solve the problems of long classification time and low classification precision of the current remote sensing image classification algorithm.
The technical scheme of the invention is as follows: a remote sensing image classification method based on integrated depth Fisher vector comprises the following steps,
(1) extracting characteristics;
(2) fisher feature encoding;
(3) feature concatenation and classification.
And the step (1) comprises the step of inputting the preprocessed image features into a deep learning model obtained through ImageNet pre-training, and obtaining the highly-differentiated global semantic features and local semantic features in the multi-scale image.
The multi-scale image in the step (1) comprises a first class of scales which are the same 224 × 224 scales as the default scales of the pre-trained deep learning model ResNet-50, and under the condition of the scales, a first non-convolutional layer in the network is extracted as a global description:
Figure BDA0002228686860000035
the multi-scale image in the step (1) comprises a second class of scales (128 × 128, 256 × 256, 512 × 512) in sequence, under the condition of the class of multi-scales, a middle layer of the multi-scale image in the network is screened to serve as an optimal layer combined with subsequent Fisher coding to generate an optimal localized description, and through a layer-by-layer coding experiment, the 37 th layer output in ResNet-50 is adopted as a deep convolution feature to be coded:
Figure BDA0002228686860000036
in the step (1), after the second-class scale is subjected to feature extraction, L2 regularization is carried out, namely, feature preprocessing is carried out on the feature L to obtain the multi-scale localization description after the correlation is removed,
Figure BDA0002228686860000039
and the correlation between the local depth convolution characteristics L is reduced, and the same variance between the L characteristics is ensured.
And (3) performing Fisher feature coding in the step (2), further optimizing the local semantic features obtained by the feature extraction in the last step to generate deep Fisher features, and improving the description capability of the high-precision remote sensing terrain image.
The Fisher coding layer is used for describing the multi-scale localization of the input
Figure BDA0002228686860000037
Recoding is carried out, local description optimization of the remote sensing image is completed, a depth Fisher vector DFF of the terrain remote sensing image is output, and the Fisher coding layer uses a Gaussian mixture model to construct a word codebook
Constructing a word codebook dictionary by using a Gaussian mixture model to locally describe the input single scale
Figure BDA00022286868600000310
Coding expression is carried out, and Gaussian mean and variance information of a feature space is extracted:
Figure BDA0002228686860000031
Figure BDA0002228686860000032
wherein T is the number of local characteristic points on the terrain remote sensing image; f. oftIs a t-th local feature;the mean difference between the local features and the Gaussian mixture model;
Figure BDA0002228686860000034
for variance differences between local features and Gaussian mixture models, { wnnnα represents the mixed weight, mean and diagonal covariance of each Gaussian distribution in the word codebook B1, respectivelyt(n) assigning weights to the flexibilities, characterizing the height of the t-th quasi-local features relative to the n-thThe weight values of the s-hybrid model,
Figure BDA0002228686860000041
wherein, N (f)t;μnn) Is ftThe value in the nth gaussian distribution, the coding result is:
DFF:
Figure BDA0002228686860000042
the step (2) is described for multi-scale localizationAnd (5) performing the coding flow on the residual scales in the step (A), and finally connecting the DFFs of all scales in series to obtain a multi-scale Fisher vector I.
And (3) fusing the global semantic features and the depth Fisher vectors in a series connection mode to obtain new feature vectors, inputting the new feature vectors into a linear classifier, and finishing the classification task of the high-precision remote sensing terrain.
Specifically, the step (3) is to concatenate the encoding results in a cascade manner, and then obtain a final representation of the image through L2 regularization: integrated deep fischer
ADFF=[I,H]
Preferably, a linear support vector mechanism establishes a classification layer, the specific implementation is LIBSVM, punishment parameters of the LIBSVM are obtained by adopting ten-fold cross validation, and the classification layer outputs semantic labels to finish terrain classification.
The invention has the beneficial effects that: from a quantitative perspective, the algorithm presented herein was evaluated for accuracy, as shown in table 1 and table 2, and it can be seen that the algorithm achieved excellent classification compared to the above algorithm, and achieved 98.81% and 95.21% accuracy results on UCM and RSSCN7, respectively. In conclusion, the algorithm provided by the invention can accurately realize remote sensing image classification from the aspect of classification precision.
Drawings
FIG. 1 is a flow chart of an embodiment of a remote sensing image classification method based on an integrated depth Fisher vector provided by the invention;
FIG. 2 is an evaluation data set selected from the UCM data set employed in the present invention;
FIG. 3 is an evaluation data set selected from the RSSCN7 data set employed by the present invention.
Detailed Description
The invention is described in further detail below with reference to the figures and the embodiments.
The remote sensing image classification method based on the integrated depth Fisher vector is used for quickly and accurately generating a high-precision classification result and mainly comprises the following three steps:
(1) feature extraction
And inputting the preprocessed image features into a deep learning model obtained through ImageNet pre-training, and obtaining the high-distinguishability global semantic features and the local semantic features in the multi-scale image.
The deep convolution feature extraction in the embodiment of the invention is performed under two types of scales, and the first type of scale is 224 multiplied by 224 scales which are the same as the default scale of the pre-trained deep learning model ResNet-50. Under this scale, the invention extracts its first non-convolutional layer H in the above network as a global description:
Figure BDA0002228686860000051
h is a vector with a characteristic dimension of 1 × M, d1, d2, …, dK, …, dK represent the kth value of H, with a dimension of 1 × 1.
The second category of scales is (128 × 128, 256 × 256, 512 × 512) in turn. Under the multi-scale condition, the invention is to screen a certain middle layer in the network as an optimal layer combined with subsequent Fisher coding to generate an optimal localized description. Through a layer-by-layer coding experiment, the invention adopts the 37 th layer output in ResNet-50 as the depth convolution characteristic L to be coded:
Figure BDA0002228686860000052
l is a vector with characteristic dimensions E × K × N. Wherein DN represents the vector with the characteristic numerical value of the Nth scale as DN and the dimension of E multiplied by K.
The multi-scale deep convolution features extracted through the pre-training deep learning model are generally highly coupled and have strong correlation, so that the subsequent codebook clustering is challenged. According to the invention, L2 regularization (Normalization) is implemented to carry out feature preprocessing on the feature L to obtain the multi-scale localization description after the correlation is removed.
Figure BDA0002228686860000053
L is normalized by L2, and each numerical value is consistent with the meaning of L in the previous step. L is a vector with characteristic dimensions E × K × N. Wherein CN represents a vector with an nth scale feature value of CN and dimension of E × K.
Therefore, the correlation between the local depth convolution characteristics L is reduced, and the same variance among the characteristics L is ensured.
(2) Fisher feature encoding
Through Fisher feature coding, the local semantic features obtained by the last step of feature extraction are further optimized to generate deep Fisher features, and the description capability of the high-precision remote sensing terrain image is improved.
Multi-scale localization description of input by Fisher coding layer
Figure BDA0002228686860000067
And recoding is carried out, local description optimization of the remote sensing image is completed, and a depth Fisher vector DFF of the terrain remote sensing image is output.
The Fisher encoding layer constructs a word codebook B1 using a Gaussian Mixture Model (GMM)
The word codebook B1 is a vector whose feature dimensions are T × K. The value of the kth codebook is dK, and the dimension is a vector of T × 1.
GMM codebook describes the attribution condition and focusing degree of new local features after deep convolution feature extraction, the effect is optimal when the size of the codebook is determined to be 16 according to experiments, and the GMM codebook dictionary is utilized to locally describe the input single scale
Figure BDA0002228686860000066
Carrying out coding expression, and extracting Gaussian mean (1st) and variance (2nd) information of a feature space:
Figure BDA0002228686860000061
Figure BDA0002228686860000062
wherein T is the number of local characteristic points on the terrain remote sensing image; f. oftIs a t-th local feature;
Figure BDA0002228686860000063
the mean difference between the local features and the Gaussian mixture model;
Figure BDA0002228686860000064
is the variance difference between the local features and the gaussian mixture model. { wnnnα represents the mixed weight, mean and diagonal covariance of each Gaussian distribution in the word codebook B1, respectivelytAnd (n) flexibly distributing weight, representing the weight value of the t-th quasi-local feature relative to the n-th Gaussian mixture model.
Figure BDA0002228686860000071
Wherein, N (f)t;μnn) Is ftThe value in the nth gaussian distribution. The encoding result is:
DFF:
Figure BDA0002228686860000072
subsequently, for multiscale localization description
Figure BDA0002228686860000073
And (5) carrying out similar coding processes on the residual scales, and finally connecting the DFFs of all scales in series to obtain a multi-scale Fisher vector I.
(3) Feature concatenation and classification
And fusing the global semantic features and the depth Fisher vectors in a series connection mode to obtain new feature vectors, inputting the new feature vectors into a linear classifier, completing a classification task of the high-precision remote sensing terrain, and ending the algorithm.
The encoding results are connected in series by adopting a cascading mode, and then the final expression of the image is obtained through the regularization processing of L2: deep Fisher Features (ADFF) are integrated.
ADFF=[I,H]
The embodiment of the invention preferably selects a linear Support Vector Machine (SVM) to construct a classification layer, particularly realizes the adoption of LIBSVM, punishment parameters of the LIBSVM are obtained by adopting ten-fold cross validation, and the classification layer outputs semantic labels to finish terrain classification. The specific operation of this part is well known to those skilled in the art, and the detailed description of the embodiment of the present invention is omitted here. In conclusion, the invention adopts an integration scheme combining a deep learning method and an unsupervised feature coding method, combines the high-level semantic advantage of a deep learning model and the advantage of high resistance to change (size change resistance and scene change resistance) of unsupervised feature coding, fully captures the high-distinguishability semantic information of the remote sensing image, and improves the terrain classification performance of the remote sensing image.
Example (b):
as shown in fig. 1, the image of an arbitrary size is first resized to generate a scale 1: 224 × 224 and scale 2: two kinds of scales (128 × 128, 256 × 256, 512 × 512)
(1) Feature extraction:
for scale 1, its first non-convolutional layer in the above network is extracted as a global description:
Figure BDA0002228686860000081
for the scale 2, the output of the layer 37 in ResNet-50 is adopted as the depth convolution characteristic to be coded:
Figure BDA0002228686860000082
and performing feature preprocessing on the features L by implementing L2 regularization (Normalization) to obtain the decorrelated multi-scale localization description. To this end, step (1) is completed
(2) Fisher feature encoding
Multiscale localized description of input using Fisher coding layer
Figure BDA0002228686860000083
Carrying out unsupervised feature coding on each scale specification, respectively generating corresponding deep integration Fisher vector DFFs, connecting the DFFs of all scales in series to obtain a multi-scale Fisher vector I, and finishing the step (2)
(3) Feature concatenation and classification
The encoding results are connected in series by adopting a cascading mode, and then the final expression of the image is obtained through the regularization processing of L2: deep Fisher Features (ADFF) are integrated.
ADFF=[I,H]
Finally, the embodiment of the invention preferably selects a linear Support Vector Machine (SVM) to construct a classification layer, particularly realizes that LIBSVM is adopted, punishment parameters are obtained by adopting ten-fold cross validation, and the classification layer outputs semantic labels to finish terrain classification. So far, all the steps of the algorithm are completed.
The invention uses the following data sets as test evaluations:
UCM data set
The UCM (UC MercedLand-Use) dataset is the most popular test dataset in the field of high-precision remote sensing image classification, and is established by computer vision laboratories of California university in 2010. The data set covers 21 common land use categories, each sub-category consisting of 100 images of the same size, with a resolution of 256x256, all three channels of RGB. Figure 2 is selected from the data set.
RSSCN7 dataset
RSSCN7 is a data set published in 2015 that contains 7 common remote sensing topographic images, each category of images containing 400 images, and is challenging due to the diversity of image scales, with 4 different image scales for each category of 400 images. Figure 3 is selected from the data set.
As shown in table 1 and table 2 below, the remote sensing image classification method based on the integrated depth fisher vector achieves excellent image classification results.
Table 1: UCM remote sensing image terrain classification performance comparison
Figure BDA0002228686860000091
Table 2: RSSCN7 remote sensing image terrain classification performance comparison
Figure BDA0002228686860000092
Figure BDA0002228686860000101
From a quantitative perspective, the algorithm presented herein was evaluated for accuracy, as shown in table 1 and table 2, and it can be seen that the algorithm achieved excellent classification compared to the above algorithm, and achieved 98.81% and 95.21% accuracy results on UCM and RSSCN7, respectively. In conclusion, the algorithm provided by the invention can accurately realize remote sensing image classification from the aspect of classification precision.
In addition to the above embodiments, the present invention has other embodiments, and any technical solutions formed by equivalent replacement or equivalent transformation fall within the protection scope of the present invention.

Claims (10)

1. A remote sensing image classification method based on integrated depth Fisher vectors is characterized in that: which comprises the following steps of,
(1) extracting characteristics;
(2) fisher feature encoding;
(3) feature concatenation and classification.
2. The remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 1, wherein: and the step (1) comprises the step of inputting the preprocessed image features into a deep learning model obtained through ImageNet pre-training, and obtaining the highly-differentiated global semantic features and local semantic features in the multi-scale image.
3. The remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 2, wherein: the multi-scale image in the step (1) comprises a first class of scales which are the same 224 × 224 scales as the default scales of the pre-trained deep learning model ResNet-50, and under the condition of the scales, a first non-convolutional layer in the network is extracted as a global description:
4. the remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 2, wherein: the multi-scale image in the step (1) comprises a second class of scales (128 × 128, 256 × 256, 512 × 512) in sequence, under the condition of the class of multi-scales, a middle layer of the multi-scale image in the network is screened to serve as an optimal layer combined with subsequent Fisher coding to generate an optimal localized description, and through a layer-by-layer coding experiment, the 37 th layer output in ResNet-50 is adopted as a deep convolution feature to be coded:
Figure FDA0002228686850000012
5. the remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 4, wherein: in the step (1), after the second-class scale is subjected to feature extraction, L2 regularization is carried out, namely, feature preprocessing is carried out on the feature L to obtain the multi-scale localization description after the correlation is removed,
Figure FDA0002228686850000013
and the correlation between the local depth convolution characteristics L is reduced, and the same variance between the L characteristics is ensured.
6. The remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 1, wherein: and (3) performing Fisher feature coding in the step (2), further optimizing the local semantic features obtained by the feature extraction in the last step to generate deep Fisher features, and improving the description capability of the high-precision remote sensing terrain image.
7. The remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 6, wherein: the Fisher coding layer is used for describing the multi-scale localization of the input
Figure FDA0002228686850000021
Recoding is carried out, local description optimization of the remote sensing image is completed, a depth Fisher vector DFF of the terrain remote sensing image is output, and the Fisher coding layer uses a Gaussian mixture model to construct a word codebook B1
Figure FDA0002228686850000022
Constructing a word codebook dictionary by using a Gaussian mixture model to locally describe the input single scale
Figure FDA0002228686850000023
Coding expression is carried out, and Gaussian mean and variance information of a feature space is extracted:
Figure FDA0002228686850000024
wherein T is the number of local characteristic points on the terrain remote sensing image; f. oftIs a t-th local feature;
Figure FDA0002228686850000026
the mean difference between the local features and the Gaussian mixture model;
Figure FDA0002228686850000027
for variance differences between local features and Gaussian mixture models, { wnnnα represents the mixed weight, mean and diagonal covariance of each Gaussian distribution in the word codebook B1, respectivelyt(n) flexibly assigning weights representing the weight values of the t-th quasi-local features relative to the n-th Gaussian mixture model,
Figure FDA0002228686850000028
wherein, N (f)t;μnn) Is ftThe value in the nth gaussian distribution, the coding result is:
DFF:
Figure FDA0002228686850000029
8. the remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 7, wherein: the step (2) is described for multi-scale localization
Figure FDA00022286868500000210
And (5) performing the coding flow on the residual scales in the step (A), and finally connecting the DFFs of all scales in series to obtain a multi-scale Fisher vector I.
9. The remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 8, wherein: and (3) fusing the global semantic features and the depth Fisher vectors in a series connection mode to obtain new feature vectors, inputting the new feature vectors into a linear classifier, and finishing the classification task of the high-precision remote sensing terrain.
10. The remote sensing image classification method based on the integrated depth fisher vector as claimed in claim 9, wherein: specifically, the step (3) is that the encoding results are connected in series by adopting a cascading mode, then the final expression of the image is obtained by L2 regularization processing, the depth Fisher feature ADFF is integrated,
ADFF=[I,H]
preferably, a linear support vector mechanism establishes a classification layer, the specific implementation is LIBSVM, punishment parameters of the LIBSVM are obtained by adopting ten-fold cross validation, and the classification layer outputs semantic labels to finish terrain classification.
CN201910960279.0A 2019-10-10 2019-10-10 Remote sensing image classification method based on integrated depth Fisher vector Withdrawn CN110826597A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910960279.0A CN110826597A (en) 2019-10-10 2019-10-10 Remote sensing image classification method based on integrated depth Fisher vector

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910960279.0A CN110826597A (en) 2019-10-10 2019-10-10 Remote sensing image classification method based on integrated depth Fisher vector

Publications (1)

Publication Number Publication Date
CN110826597A true CN110826597A (en) 2020-02-21

Family

ID=69549109

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910960279.0A Withdrawn CN110826597A (en) 2019-10-10 2019-10-10 Remote sensing image classification method based on integrated depth Fisher vector

Country Status (1)

Country Link
CN (1) CN110826597A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105956610A (en) * 2016-04-22 2016-09-21 中国人民解放军军事医学科学院卫生装备研究所 Remote sensing image landform classification method based on multi-layer coding structure
CN108108751A (en) * 2017-12-08 2018-06-01 浙江师范大学 A kind of scene recognition method based on convolution multiple features and depth random forest

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105956610A (en) * 2016-04-22 2016-09-21 中国人民解放军军事医学科学院卫生装备研究所 Remote sensing image landform classification method based on multi-layer coding structure
CN108108751A (en) * 2017-12-08 2018-06-01 浙江师范大学 A kind of scene recognition method based on convolution multiple features and depth random forest

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BOYANG LI ET AL: "Aggregated Deep Fisher Feature for VHR Remote Sensing Scene Classification", 《IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING》 *

Similar Documents

Publication Publication Date Title
Abdollahi et al. Building footprint extraction from high resolution aerial images using generative adversarial network (GAN) architecture
Giang et al. U-Net convolutional networks for mining land cover classification based on high-resolution UAV imagery
CN109829430B (en) Cross-modal pedestrian re-identification method and system based on heterogeneous hierarchical attention mechanism
CN107122375B (en) Image subject identification method based on image features
CN110555458A (en) Multi-band image feature level fusion method for generating countermeasure network based on attention mechanism
CN110837836A (en) Semi-supervised semantic segmentation method based on maximized confidence
CN106570521B (en) Multilingual scene character recognition method and recognition system
CN110533024B (en) Double-quadratic pooling fine-grained image classification method based on multi-scale ROI (region of interest) features
CN107807914A (en) Recognition methods, object classification method and the data handling system of Sentiment orientation
CN105005789B (en) A kind of remote sensing images terrain classification method of view-based access control model vocabulary
CN109409240A (en) A kind of SegNet remote sensing images semantic segmentation method of combination random walk
CN111339935B (en) Optical remote sensing picture classification method based on interpretable CNN image classification model
CN110674685B (en) Human body analysis segmentation model and method based on edge information enhancement
CN114332544B (en) Image block scoring-based fine-grained image classification method and device
Nguyen et al. Satellite image classification using convolutional learning
CN109033321B (en) Image and natural language feature extraction and keyword-based language indication image segmentation method
CN106777402A (en) A kind of image retrieval text method based on sparse neural network
CN107016371A (en) UAV Landing Geomorphological Classification method based on improved depth confidence network
CN112215847A (en) Method for automatically segmenting overlapped chromosomes based on counterstudy multi-scale features
Jandial et al. Trace: Transform aggregate and compose visiolinguistic representations for image search with text feedback
CN116912708A (en) Remote sensing image building extraction method based on deep learning
CN116796810A (en) Deep neural network model compression method and device based on knowledge distillation
Choi et al. Comparative analysis of generalized intersection over union
Kim et al. Generating pedestrian training dataset using DCGAN
CN117033609A (en) Text visual question-answering method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20200221

WW01 Invention patent application withdrawn after publication